WO2017210242A1 - Insulin-nuclear hormone conjugates - Google Patents

Insulin-nuclear hormone conjugates Download PDF

Info

Publication number
WO2017210242A1
WO2017210242A1 PCT/US2017/035102 US2017035102W WO2017210242A1 WO 2017210242 A1 WO2017210242 A1 WO 2017210242A1 US 2017035102 W US2017035102 W US 2017035102W WO 2017210242 A1 WO2017210242 A1 WO 2017210242A1
Authority
WO
WIPO (PCT)
Prior art keywords
alkyl
group
seq
chain
acid
Prior art date
Application number
PCT/US2017/035102
Other languages
French (fr)
Inventor
Richard D. Dimarchi
Bin Yang
Original Assignee
Indiana University Research And Technology Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Indiana University Research And Technology Corporation filed Critical Indiana University Research And Technology Corporation
Publication of WO2017210242A1 publication Critical patent/WO2017210242A1/en

Links

Classifications

    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K47/00Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
    • A61K47/50Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates
    • A61K47/51Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent
    • A61K47/54Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an organic compound
    • A61K47/554Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an organic compound the modifying agent being a steroid plant sterol, glycyrrhetic acid, enoxolone or bile acid
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K47/00Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
    • A61K47/50Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates
    • A61K47/51Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent
    • A61K47/54Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an organic compound
    • A61K47/55Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an organic compound the modifying agent being also a pharmacologically or therapeutically active agent, i.e. the entire conjugate being a codrug, i.e. a dimer, oligomer or polymer of pharmacologically or therapeutically active compounds
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K47/00Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
    • A61K47/50Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates
    • A61K47/51Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent
    • A61K47/56Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an organic macromolecular compound, e.g. an oligomeric, polymeric or dendrimeric molecule
    • A61K47/59Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an organic macromolecular compound, e.g. an oligomeric, polymeric or dendrimeric molecule obtained otherwise than by reactions only involving carbon-to-carbon unsaturated bonds, e.g. polyureas or polyurethanes
    • A61K47/60Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an organic macromolecular compound, e.g. an oligomeric, polymeric or dendrimeric molecule obtained otherwise than by reactions only involving carbon-to-carbon unsaturated bonds, e.g. polyureas or polyurethanes the organic macromolecular compound being a polyoxyalkylene oligomer, polymer or dendrimer, e.g. PEG, PPG, PEO or polyglycerol
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/575Hormones
    • C07K14/62Insulins
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K9/00Medicinal preparations characterised by special physical form
    • A61K9/0012Galenical forms characterised by the site of application
    • A61K9/0019Injectable compositions; Intramuscular, intravenous, arterial, subcutaneous administration; Compositions to be administered through the skin in an invasive manner

Definitions

  • Insulin is a proven therapy for the treatment of juvenile-onset diabetes and later stage adult-onset diabetes.
  • the peptide is biosynthesized as a larger linear precursor of low potency (approximately 2% to 9% of native insulin), named proinsulin.
  • Proinsulin is proteolytically converted to insulin by the selective removal of a 35-residue connecting peptide (C peptide).
  • C peptide 35-residue connecting peptide
  • the resultant heteroduplex formed by disulfide links between the insulin "A chain” (SEQ ID NO: 1) and "B chain” (SEQ ID NO: 2) chain, representing a total of 51 amino acids, has high potency for the insulin receptor (nM range).
  • Native insulin has approximately one hundredfold selective affinity for the insulin receptor relative to the related insulin-like growth factor 1 receptor, but demonstrates little selectively for the two different insulin receptor isoforms, named A & B.
  • the insulin-like growth factors 1 and 2 are single chain liner peptide hormones that are highly homologous in their A and B chain sequences, sharing approximately fifty percent homology with native insulin.
  • the IGF A and B chains are linked by a "C-peptide", wherein the C-peptides of the two IGFs differ in size and amino acid sequence, the first being twelve and the second being eight amino acids in length.
  • Human IGF-1 is a 70 aa basic peptide having the protein sequence shown in SEQ ID NO: 3, and has a 43% homology with proinsulin (Rinderknecht et al. (1978) J. Biol. Chem. 253:2769-2776).
  • Human IGF-2 is a 67 amino acid basic peptide having the protein sequence shown in SEQ ID NO: 4. The IGFs demonstrate considerably less activity at the insulin B receptor isoform than the A-receptor isoform.
  • IGF-1 based insulin peptides analogs (wherein the native Gln-Phe dipeptide of the B -chain is replaced by Tyr-Leu) that display high activity at the insulin receptor (see PCT/US2009/068713, the disclosure of which is incorporated herein).
  • IGF YL analog peptides are more readily synthesized than insulin and enable the development of co-agonist analogs for insulin and IGF- 1 receptors, and selective insulin receptor specific analogs.
  • these insulin analogs can also be formulated as single chain insulin agonists in accordance with the present disclosure.
  • Single chain insulin analogs comprising the insulin A and B chains have been previously prepared (see EP 1,193,272 and US 2007/0129284).
  • single chain high potency insulin agonists can also be prepared by insertion of the IGF-1 C-peptide, or analogs thereof, as a connecting peptide linking the insulin B and A peptides.
  • the selective mutation of individual amino acids in the C-peptide sequence yields peptides that are highly selective for insulin relative to IGF- 1 receptor.
  • Nuclear hormone receptor proteins form a class of ligand activated proteins that, when bound to specific sequences of DNA, serve as on-off switches for transcription within the cell nucleus. These switches control the development and differentiation of skin, bone and behavioral centers in the brain, as well as the continual regulation of reproductive tissues.
  • Nuclear hormone receptor ligands such as steroids, sterols, retinoids, thyroid hormones, and vitamin D function to activate nuclear hormone receptors.
  • the interaction of the hormone and receptor triggers a conformational change in the receptor, which results in the up-regulation of gene expression.
  • the level of cellular signal transduction activated by the interaction of a ligand and a nuclear hormone receptor is determined by the number of ligands and receptors available for binding, and by the binding affinity between the ligand and the receptor.
  • Thyroid hormones powerfully influence systemic metabolism through multiple pathways, with profound effects on energy expenditure, fat oxidation, and cholesterol metabolism.
  • adverse side effects of thyroid hormone treatment include increased heart rate, cardiac hypertrophy, muscle wasting, and reduced bone density, terminating its clinical use.
  • Discovery of thyromimetics capable of separating lipid metabolism benefits from adverse cardiovascular effects has remained a desire for patients, physicians and the pharmaceutical industry.
  • Peroxisome proliferator-activated receptors are nuclear hormone receptors. PPAR receptors activate transcription by binding to elements of DNA sequences, known as peroxisome proliferator response elements (PPRE), in the form of a heterodimer with retinoid X receptors (known as RXRs).
  • PPRE peroxisome proliferator response elements
  • RXRs retinoid X receptors
  • PPAR- alpha Three sub-types of human PPAR have been identified and described: PPAR- alpha, PPAR-gamma and PPAR-delta (or NUCI).
  • PPAR- alpha is mainly expressed in the liver, while PPAR-delta is ubiquitous.
  • PPAR-gamma is involved in regulating the differentiation of adipocytes, where it is highly expressed. It also has a key role in systemic lipid homeostasis.
  • PPARs A number of compounds that modulate the activity of PPARs have been identified including thiazolidinediones, which have been employed in the treatment of diabetes and metabolic disease. Activation of PPAR gamma is known to reduce blood glucose levels without increasing insulin secretion. Discovery of PPAR agonists capable of separating lipid metabolism benefits from adverse effects has remained a desire for patients, physicians and the pharmaceutical industry
  • conjugates are formed between an insulin peptide and a nuclear hormone receptor ligand, wherein the conjugate has agonist activity at both the insulin receptor and the corresponding nuclear hormone receptor. More particularly, the
  • conjugation of a nuclear hormone receptor ligand is anticipated to produce a beneficial modification of the insulin peptide activity while moderating adverse effects associated with nuclear hormone agonists.
  • conjugates of the present disclosure can be represented by the following formula:
  • the insulin peptide component of the conjugate can be native insulin or any known insulin analog that has activity at the insulin receptor including for example any insulin peptide disclosed in published international applications W096/34882, WO 2010/080607, WO 2010/080609, WO 2011/159882, WO/2011/159895 and US Patent No. 6,630,348, the disclosures of which are incorporated herein by reference.
  • the NHR ligand (Y) is wholly or partly non-peptidic and acts at a nuclear hormone receptor with an activity in accordance with any of the teachings set forth herein.
  • the NHR ligand is an agonist that, in its unbound state, has an EC50 or IC50 of about 1 mM or less, or 100 ⁇ or less, or 10 ⁇ or less, or 1 ⁇ or less.
  • the NHR ligand has a molecular weight of up to about 5000 daltons, or up to about 2000 daltons, or up to about 1000 daltons, or up to about 500 daltons.
  • the NHR ligand may act at any of the nuclear hormone receptors described herein or have any of the structures described herein.
  • the NHR ligand component of the conjugate can be a ligand that activates the thyroid hormone receptor or activates the peroxisome proliferator- activated receptors (PPAR).
  • PPAR peroxisome proliferator- activated receptors
  • the conjugates of the present disclosure are anticipated to lessen the amount of insulin that is needed to control hyperglycemia while also simultaneously lowering body weight and triglycerides and/or cholesterol levels. More particularly, the chemical addition of nuclear hormone activity to insulin agonist peptides provides supplemental pharmacology that is envisioned to broaden insulin action such that it could lower triglycerides, cholesterol and body weight in a means that is currently not possible with insulin therapy alone.
  • Thyroid hormone is recognized for its ability to lower body weight and
  • insulin-thyroid hormone conjugates will provide new dimensions to insulin action and lessen the amount of insulin needed to control blood glucose leading to less hypoglycemia and less body weight gain.
  • insulin can be conjugated to a PPAR agonist to improve insulin sensitivity which should lower exogenous insulin requirements leading to less hypoglycemia and less body weight gain.
  • the nuclear hormone receptor ligand is linked to the C-terminus of the insulin B chain or at the N-terminus of the insulin A or B chain, either directly via an ester or amide bond, or indirectly through a spacer.
  • the nuclear hormone receptor ligand is covalently linked to the insulin peptide through the side chain of an amino acid at a position selected from the group consisting of Al, B l, B28 and B29, or the N-terminal alpha amine of the A or B chain, or at the side chain of an amino acid at any position of a linking moiety that links the A chain and B chain of a single chain insulin analog.
  • the insulin peptide of the conjugate is a two chain insulin analog comprising an A chain and B chain linked to one another via intermolecular disulfide bonds.
  • the conjugate comprises a two chain insulin analog wherein a first and second nuclear hormone receptor ligand are covalently linked to the insulin peptide at a position selected from the side chain of the B28 or B29 amino acid and the N-terminal alpha amine of the A or B chain.
  • the NHR ligand-insulin conjugate comprises a hydrophilic moiety linked to the N-terminal alpha amine of the B chain or to the side chain of an amino acid at a position selected from the group consisting of A9, A14 and A15 of the A chain or positions B l, B2, B IO, B22, B28 or B29 of the B chain or to a side chain of an amino acid of the linking moiety in a single chain insulin analog.
  • the NHR ligand is linked to the amino acid side chain of a lysine present at position B28 or B29 and optionally a hydrophilic moiety is linked to the amino terminus of the B chain, optionally through the N-terminal alpha amine of the B chain.
  • the hydrophilic moiety is a polyethylene glycol chain.
  • a polyethylene glycol chain is covalently bound to the side chain of an amino acid of the linking moiety of the insulin peptide component, when the insulin peptide is a single chain insulin analog.
  • the insulin peptide is a single chain insulin wherein linking moiety joining the B and A chains comprises an amino acid sequence of no more than 17 amino acids in length and comprising the sequence
  • GYGSSSX 57 X 58 (SEQ ID NO: 21), GAGSSSRR (SEQ ID NO: 22) or
  • G YGS S S X57X58 APQT (SEQ ID NO: 69) wherein X 57 and X 58 are independently arginine, lysine or ornithine and the amino acid designated by Xs 7 or Xs 8 optionally further comprises a hydrophilic moiety linked to the side chain of the amino acid at that position.
  • the hydrophilic moiety is a polyethylene glycol chain.
  • Acylation or alkylation can increase the half-life of the NHR ligand-insulin conjugate in circulation. Acylation or alkylation can advantageously delay the onset of action and/or extend the duration of action at the insulin receptors.
  • the insulin peptide of the conjugates may be acylated or alkylated at the same amino acid position where a hydrophilic moiety is linked (including, for example at position 8 of the linking moiety), or at a different amino acid position.
  • compositions comprising the NHR ligand-insulin conjugates and a pharmaceutically acceptable carrier.
  • a pharmaceutical composition comprising any of the NHR ligand-insulin conjugates disclosed herein preferably at a purity level of at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, and a pharmaceutically acceptable diluent, carrier or excipient.
  • compositions may contain a conjugate as disclosed herein at a concentration of at least 0.5 mg/ml, 1 mg/ml, 2 mg/ml, 3 mg/ml, 4 mg/ml, 5 mg/ml, 6 mg/ml, 7 mg/ml, 8 mg/ml, 9 mg/ml, 10 mg/ml, 11 mg/ml, 12 mg/ml, 13 mg/ml, 14 mg/ml, 15 mg/ml, 16 mg/ml, 17 mg/ml, 18 mg/ml, 19 mg/ml, 20 mg/ml, 21 mg/ml, 22 mg/ml, 23 mg/ml, 24 mg/ml, 25 mg/ml or higher.
  • the pharmaceutical compositions comprise aqueous solutions that are sterilized and optionally stored within various package containers.
  • the pharmaceutical compositions comprise a lyophilized powder.
  • the pharmaceutical compositions can be further packaged as part of a kit that includes a disposable device for administering the composition to a patient.
  • the containers or kits may be labeled for storage at ambient room temperature or at refrigerated temperature.
  • an improved method of regulating blood glucose levels in insulin dependent patients comprises the steps of administering to a patient a NHR ligand-insulin conjugate as disclosed herein in an amount therapeutically effective for the control of diabetes.
  • a method of reducing weight or preventing weight gain is provided wherein the method comprises administering a NHR ligand-insulin conjugate as disclosed herein to a patient in need of such therapy.
  • Fig. 1. is a schematic overview of the two step synthetic strategy for preparing human insulin. Details of the procedure are provided in Example 1.
  • Fig. 2 is a graph comparing insulin receptor specific binding of synthetic human insulin relative to purified native insulin.
  • the synthetic insulin was produced by the approach detailed in Fig. 1 where the A 7 -B 7 bond is the first disulfide formed. As indicated by the data presented in the graph, the two molecules have similar binding activities.
  • Fig. 3 shows the structure of an insulin-dexamethasone and two insulin-thyroxine (T3) conjugates, wherein native insulin is modified by linkage of the dexamethasone/T3 to the side chain of the native Lys at position B29. Also provided is the activity of the three conjugates and native insulin at the insulin IR-B receptor using the in vitro assay of Example 3.
  • Fig. 4 is a graph presenting data for an in vivo study on blood glucose levels in mice administered the insulin-nuclear hormone ligand conjugates CIU-15, CIU-16 and CIU-17 relative to control and native insulin.
  • Fig. 5 shows the structure of an insulin-Tesaglitazar conjugate, wherein the native insulin heteroduplex (comprising the insulin A chain of SEQ ID NO: 1 and the B chain of SEQ ID NO: 2) is linked to Tesaglitazar via a Benzyl-PEG2-GGG-K linker.
  • the native insulin heteroduplex comprising the insulin A chain of SEQ ID NO: 1 and the B chain of SEQ ID NO: 2
  • Tesaglitazar via a Benzyl-PEG2-GGG-K linker.
  • Fig. 6 shows the structure of an insulin- Aleglitazar conjugate, wherein the native insulin heteroduplex (comprising the insulin A chain of SEQ ID NO: 1 and the B chain of SEQ ID NO: 2) is linked to Tesaglitazar via an NHCO linker to the B29 Lys side chain (CIU-164), to the A chain N-terminal amine and the B29 Lys side chain (CIU-165) or the B29 Lys side chain of a B l pegylated insulin analog (CIU-166).
  • the native insulin heteroduplex comprising the insulin A chain of SEQ ID NO: 1 and the B chain of SEQ ID NO: 2
  • Tesaglitazar via an NHCO linker to the B29 Lys side chain (CIU-164), to the A chain N-terminal amine and the B29 Lys side chain (CIU-165) or the B29 Lys side chain of a B l pegylated insulin analog (CIU-166).
  • Figs. 7A-7D shows the activity of an insulin- Aleglitazar conjugate (CIU-166) and an insulin-T3 conjugate (CIU-167), wherein the insulin component is native insulin (comprising the insulin A chain of SEQ ID NO: 1 and the B chain of SEQ ID NO: 2) pegylated with a 20K peg at the B l position.
  • Fig. 7A shows the activity of CIU-166 and CIU-167 the insulin IR-B receptor using an in vitro assay.
  • Fig. 7B is a graph presenting data for an in vivo study on blood glucose levels in mice administered CIU-166 and CIU-167.
  • Fig. 7C is a graph presenting data for an in vivo study on body weight in mice administered CIU-166 and CIU- 167;
  • Fig. 7D is a graph presenting data for glucose tolerance test in mice administered CIU- 166 and CIU-167.
  • amino acid encompasses any molecule containing both amino and carboxyl functional groups, wherein the amino and carboxylate groups are attached to the same carbon (the alpha carbon).
  • the alpha carbon optionally may have one or two further organic substituents.
  • designation of an amino acid without specifying its stereochemistry is intended to encompass either the L or D form of the amino acid, or a racemic mixture.
  • hydroxyl acid refers to amino acids that have been modified to replace the alpha carbon amino group with a hydroxyl group.
  • non-coded amino acid encompasses any amino acid that is not an L-isomer of any of the following 20 amino acids: Ala, Cys, Asp, Glu, Phe, Gly, His, He, Lys, Leu, Met, Asn, Pro, Gin, Arg, Ser, Thr, Val, Trp, Tyr.
  • a general reference to a peptide is intended to encompass peptides that have modified amino and carboxy termini.
  • an amino acid sequence designating the standard amino acids is intended to encompass standard amino acids at the N- and C- terminus as well as a corresponding hydroxyl acid at the N-terminus and/or a corresponding C-terminal amino acid modified to comprise an amide group in place of the terminal carboxylic acid.
  • an "acylated" amino acid is an amino acid comprising an acyl group which is non-native to a naturally-occurring amino acid, regardless by the means by which it is produced.
  • exemplary methods of producing acylated amino acids and acylated peptides are known in the art and include acylating an amino acid before inclusion in the peptide or peptide synthesis followed by chemical acylation of the peptide.
  • the acyl group causes the peptide to have one or more of (i) a prolonged half-life in circulation, (ii) a delayed onset of action, (iii) an extended duration of action, (iv) an improved resistance to proteases, such as DPP-IV, and (v) increased potency at the IGF and/or insulin peptide receptors.
  • an "alkylated” amino acid is an amino acid comprising an alkyl group which is non-native to a naturally-occurring amino acid, regardless of the means by which it is produced.
  • exemplary methods of producing alkylated amino acids and alkylated peptides are known in the art and including alkylating an amino acid before inclusion in the peptide or peptide synthesis followed by chemical alkylation of the peptide.
  • alkylation of peptides will achieve similar, if not the same, effects as acylation of the peptides, e.g., a prolonged half-life in circulation, a delayed onset of action, an extended duration of action, an improved resistance to proteases, such as DPP-IV, and increased potency at the IGF and/or insulin receptors.
  • the term "pharmaceutically acceptable carrier” includes any of the standard pharmaceutical carriers, such as a phosphate buffered saline solution, water, emulsions such as an oil/water or water/oil emulsion, and various types of wetting agents.
  • the term also encompasses any of the agents approved by a regulatory agency of the US Federal government or listed in the US Pharmacopeia for use in animals, including humans.
  • pharmaceutically acceptable salt refers to salts of compounds that retain the biological activity of the parent compound, and which are not biologically or otherwise undesirable. Many of the compounds disclosed herein are capable of forming acid and/or base salts by virtue of the presence of amino and/or carboxyl groups or groups similar thereto.
  • hydrophilic moiety refers to any compound that is readily water-soluble or readily absorbs water, and which are tolerated in vivo by mammalian species without toxic effects (i.e. are biocompatible).
  • hydrophilic moieties include polyethylene glycol (PEG), polylactic acid, polyglycolic acid, a polylactic- polyglycolic acid copolymer, polyvinyl alcohol, polyvinylpyrrolidone, polymethoxazoline, polyethyloxazoline, polyhydroxyethyl methacrylate, polyhydroxypropyl methacrylamide, polymethacrylamide, polydimethylacrylamide, and derivatised celluloses such as hydroxymethylcellulose or hydroxyethylcellulose and co-polymers thereof, as well as natural polymers including, for example, albumin, heparin and dextran.
  • treating includes prophylaxis of the specific disorder or condition, or alleviation of the symptoms associated with a specific disorder or condition and/or preventing or eliminating said symptoms.
  • treating diabetes will refer in general to maintaining glucose blood levels near normal levels and may include increasing or decreasing blood glucose levels depending on a given situation.
  • an "effective" amount or a “therapeutically effective amount” of an insulin analog refers to a nontoxic but sufficient amount of an insulin analog to provide the desired effect.
  • one desired effect would be the prevention or treatment of hyperglycemia.
  • the amount that is "effective” will vary from subject to subject, depending on the age and general condition of the individual, mode of administration, and the like. Thus, it is not always possible to specify an exact “effective amount.” However, an appropriate “effective” amount in any individual case may be determined by one of ordinary skill in the art using routine experimentation.
  • parenteral means not through the alimentary canal but by some other route such as intranasal, inhalation, subcutaneous, intramuscular, intraspinal, or intravenous.
  • references to a particular amino acid position in an insulin analog by letter and number refer to the amino acid at that position of either the A chain (e.g. position A5) or the B chain (e.g. position B5) in the respective native human insulin A chain (SEQ ID NO: 1) or B chain (SEQ ID NO: 2), or the corresponding amino acid position in any analogs thereof.
  • a reference herein to "position B28" absent any further elaboration would mean the corresponding position B27 of the B chain of an insulin analog in which the first amino acid of SEQ ID NO: 2 has been deleted.
  • amino acids added to the N-terminus of the native B chain are numbered starting with BO, followed by numbers of increasing negative value (e.g., B-l, B-2...) as amino acids are added to the N-terminus.
  • any reference to an amino acid position in the linking moiety of a single chain analog is made in reference to the native C chain of IGF 1 (SEQ ID NO: 17).
  • position 9 of the native C chain (or the "position C9") has an alanine residue.
  • the term “native insulin peptide” is intended to designate the 51 amino acid heteroduplex comprising the A chain of SEQ ID NO: 1 and the B chain of SEQ ID NO: 2, as well as single-chain insulin analogs that comprise SEQ ID NOS: 1 and 2.
  • insulin peptide as used herein, absent further descriptive language is intended to encompass the 51 amino acid heteroduplex comprising the A chain of SEQ ID NO: 1 and the B chain of SEQ ID NO: 2, as well as single-chain insulin analogs thereof (including for example those disclosed in published international application W096/34882 and US Patent No.
  • modified analogs include modification of the amino acid at position A 19, B 16 or B25 to a 4-amino phenylalanine or one or more amino acid substitutions at positions selected from A5, A8, A9, A10, A12, A14, A15, A17, A18, A21, B l, B2, B3, B4, B5, B9, B 10, B 13, B 14, B 17, B20, B21, B22, B23, B26, B27, B28, B29 and B30 or deletions of any or all of positions B l-4 and B26-30.
  • Insulin peptides as defined herein can also be analogs derived from a naturally occurring insulin by insertion or substitution of a non-peptide moiety, e.g. a retroinverso fragment, or incorporation of non- peptide bonds such as an azapeptide bond (CO substituted by NH) or pseudo-peptide bond (e.g. NH substituted with C3 ⁇ 4) or an ester bond (e.g., a depsipeptide, wherein one or more of the amide (-CONHR-) bonds are replaced by ester (COOR) bonds).
  • a non-peptide moiety e.g. a retroinverso fragment
  • non- peptide bonds such as an azapeptide bond (CO substituted by NH) or pseudo-peptide bond (e.g. NH substituted with C3 ⁇ 4) or an ester bond (e.g., a depsipeptide, wherein one or more of the amide (-CONHR-) bonds are replaced by ester (COOR
  • single-chain insulin analog encompasses a group of structurally-related proteins wherein insulin or IGF A and B chains, or analogs or derivatives thereof, are covalently linked to one another to form a linear polypeptide chain.
  • the single-chain insulin analog comprises the covalent linkage of the carboxy terminus of the B chain to the amino terminus of the A chain via a linking moiety.
  • insulin A chain absent further descriptive language is intended to encompass the 21 amino acid sequence of SEQ ID NO: 1 as well as functional analogs and derivatives thereof, including insulin analogs known to those skilled in the art, including modification of the sequence of SEQ ID NO: 1 by one or more amino acid insertions, deletions or substitutions at positions selected from A4, A5, A8, A9, A10, A12, A14, A15, A17, A18, A21.
  • insulin B chain absent further descriptive language is intended to encompass the 30 amino acid sequence of SEQ ID NO: 2, as well as modified functional analogs of the native B chain, including one or more amino acid insertions, deletions or substitutions at positions selected from B l, B2, B3, B4, B5, B9, B 10, B 13, B 14, B 17, B20, B21, B22, B23, B25, B26, B27, B28, B29 and B30 or deletions of any or all of positions B l-4 and B26-30.
  • identity as used herein relates to the similarity between two or more sequences. Identity is measured by dividing the number of identical residues by the total number of residues and multiplying the product by 100 to achieve a percentage. Thus, two copies of exactly the same sequence have 100% identity, whereas two sequences that have amino acid deletions, additions, or substitutions relative to one another have a lower degree of identity.
  • BLAST Basic Local Alignment Search Tool, Altschul et al. (1993) J. Mol. Biol. 215:403-410) are available for determining sequence identity.
  • the term "selectivity" of a molecule for a first receptor relative to a second receptor refers to the following ratio: EC 50 of the molecule at the second receptor divided by the EC 50 of the molecule at the first receptor. For example, a molecule that has an EC 50 of 1 nM at a first receptor and an EC 50 of 100 nM at a second receptor has 100-fold selectivity for the first receptor relative to the second receptor.
  • an amino acid “modification” refers to a substitution of an amino acid, or the derivation of an amino acid by the addition and/or removal of chemical groups to/from the amino acid, and includes substitution with any of the 20 amino acids commonly found in human proteins, as well as atypical or non-naturally occurring amino acids.
  • Atypical amino acids include Sigma-Aldrich (Milwaukee, WI), ChemPep Inc. (Miami, FL), and Genzyme Pharmaceuticals (Cambridge, MA). Atypical amino acids may be purchased from commercial suppliers, synthesized de novo, or chemically modified or derivatized from naturally occurring amino acids.
  • substitution refers to the replacement of one amino acid residue by a different amino acid residue.
  • conservative amino acid substitution is defined herein as exchanges within one of the following five groups: I. Small aliphatic, nonpolar or slightly polar residues:
  • polyethylene glycol chain refers to mixtures of condensation polymers of ethylene oxide and water, in a branched or straight chain, represented by the general formula H(OCH 2 CH 2 ) n OH, wherein n is at least 2.
  • Polyethylene glycol chain or “PEG chain” is used in combination with a numeric suffix to indicate the approximate average molecular weight thereof.
  • PEG-5,000 refers to polyethylene glycol chain having a total molecular weight average of about 5,000 Daltons
  • pegylated and like terms refers to a compound that has been modified from its native state by linking a polyethylene glycol chain to the compound.
  • a “pegylated polypeptide” is a polypeptide that has a PEG chain covalently bound to the polypeptide.
  • Linker is a bond, molecule or group of molecules that binds two separate entities to one another. Linkers may provide for optimal spacing of the two entities or may further supply a labile linkage that allows the two entities to be separated from each other. Labile linkages include photocleavable groups, acid-labile moieties, base-labile moieties and enzyme-cleavable groups.
  • a "dimer” is a complex comprising two subunits covalently bound to one another via a linker.
  • dimer when used absent any qualifying language, encompasses both homodimers and heterodimers.
  • a homodimer comprises two identical subunits, whereas a heterodimer comprises two subunits that differ, although the two subunits are substantially similar to one another.
  • Ci-C n alkyl wherein n can be from 1 through 6, as used herein, represents a branched or linear alkyl group having from one to the specified number of carbon atoms.
  • Typical Ci-C 6 alkyl groups include, but are not limited to, methyl, ethyl, n-propyl, iso- propyl, butyl, iso-Butyl, sec -butyl, tert-butyl, pentyl, hexyl and the like.
  • C 2 -C n alkenyl wherein n can be from 2 through 6, as used herein, represents an olefinically unsaturated branched or linear group having from 2 to the specified number of carbon atoms and at least one double bond.
  • C 2 -C n alkynyl wherein n can be from 2 to 6, refers to an unsaturated branched or linear group having from 2 to n carbon atoms and at least one triple bond.
  • Examples of such groups include, but are not limited to, 1-propynyl, 2-propynyl, 1-butynyl, 2-butynyl, 1-pentynyl, and the like.
  • aryl refers to a mono- or bicyclic carbocyclic ring system having one or two aromatic rings including, but not limited to, phenyl, naphthyl,
  • aryl ring and the presence of substituents or linking groups are indicated by designating the number of carbons present.
  • substituents or linking groups are indicated by designating the number of carbons present.
  • (Ci-C 3 alkyl)(C6-Cio aryl) refers to a 5 to 10 membered aryl that is attached to a parent moiety via a one to three membered alkyl chain.
  • heteroaryl refers to a mono- or bi- cyclic ring system containing one or two aromatic rings and containing at least one nitrogen, oxygen, or sulfur atom in an aromatic ring.
  • (Ci-C n alkyl)(Cs-C6 heteroaryl) refers to a 5 or 6 membered heteroaryl that is attached to a parent moiety via a one to "n" membered alkyl chain.
  • halo refers to one or more members of the group consisting of fluorine, chlorine, bromine, and iodine.
  • patient without further designation is intended to encompass any warm blooded vertebrate domesticated animal (including for example, but not limited to livestock, horses, cats, dogs and other pets) and humans.
  • isolated means having been removed from its natural environment.
  • the analog is made through recombinant methods and the analog is isolated from the host cell.
  • purified as used herein relates to the isolation of a molecule or compound in a form that is substantially free of contaminants normally associated with the molecule or compound in a native or natural environment and means having been increased in purity as a result of being separated from other components of the original composition.
  • purified polypeptide is used herein to describe a polypeptide which has been separated from other compounds including, but not limited to nucleic acid molecules, lipids and carbohydrates.
  • a “peptidomimetic” refers to a chemical compound having a structure that is different from the general structure of an existing peptide, but that functions in a manner similar to the existing peptide, e.g., by mimicking the biological activity of that peptide.
  • Peptidomimetics typically comprise naturally-occurring amino acids and/or unnatural amino acids, but can also comprise modifications to the peptide backbone.
  • a peptidomimetic may include a sequence of naturally-occurring amino acids with the insertion or substitution of a non-peptide moiety, e.g. a retroinverso fragment, or
  • peptidomimetic may be devoid of any naturally-occurring amino acids.
  • azapeptide bond CO substituted by NH
  • pseudo-peptide bond e.g. NH substituted with CH2
  • ester bond e.g., depsipeptides, wherein one or more of the amide (-CONHR-) bonds are replaced by ester (COOR) bonds.
  • the peptidomimetic may be devoid of any naturally-occurring amino acids.
  • Insulin analogs will be abbreviated as follows:
  • the insulin A and B chains will be designated generically by a capital A for the A chain and a capital B for the B chain.
  • a superscript 0 e.g., A 0 or B°
  • a superscript 1 e.g., A 1 or B 1
  • IGF-1 sequence A chain: SEQ ID NO: 5, B chain SEQ ID NO: 6
  • Modifications that deviate from the native insulin and IGF sequence are indicated in parenthesis following the designation of the A or B chain (e.g., [ ⁇ , ⁇ , ⁇ , ⁇ ) : A 1 (H8,N18,N21)]) with the single letter amino acid abbreviation indicating the substitution and the number indicating the position of the substitution in the respective A or B chain, using native insulin numbering.
  • a colon between the A and B chain indicates a two chain insulin whereas a dash will indicate a covalent bond and thus a single chain analog.
  • a linking moiety will be included between the A and B chains and the designation C 1 refers to the native IGF 1 C peptide, SEQ ID NO: 17.
  • the designation "position C8" in reference to the linking moiety designates an amino acid located at the position corresponding to the eighth amino acid of SEQ ID NO: 17.
  • conjugates of an insulin peptide and a NHR ligand are conjugates of an insulin peptide and a NHR ligand.
  • the NHR ligand is an NHR agonist.
  • agonist/NHR ligand conjugate comprises the structure Q-L-Y, wherein Q is an insulin peptide, Y is a NHR ligand and L is a linking group or a bond, wherein said conjugate has activity at both the corresponding nuclear hormone receptor of Y and an insulin receptor, including for example the insulin subtype B receptor.
  • the NHR ligand is selected from the group consisting of a steroid that exhibits an EC 50 of about 1 ⁇ or less when unconjugated to Q-L, and has a molecular weight of up to about 1000 daltons.
  • the NHR ligand is a ligand that activates the thyroid hormone receptor or a ligand that activates the peroxisome proliferator- activated receptors (PPAR).
  • the NHR ligand is an NHR agonist. In one embodiment the
  • NHR agonist has activity at a Type I NHR when bound to Q-L. In one embodiment the NHR agonist has activity at a Type II NHR when bound to Q-L. In one embodiment the NHR ligand is
  • a steroid that exhibits an EC 50 of about 1 ⁇ or less when unconjugated to Q-L, and further has a molecular weight of up to about 1000 daltons;
  • a ligand that activates the thyroid hormone receptor or iii) a ligand that activates the peroxisome proliferator-activated receptors (PPAR).
  • PPAR peroxisome proliferator-activated receptors
  • the NHR ligand of the conjugate is selected from the group consisting of estradiol and derivatives thereof, estrone and derivatives thereof, testosterone and derivatives thereof, and Cortisol and derivatives thereof. In one embodiment the NHR ligand is dexamethasone.
  • Ri5 is C1-C4 alkyl, -CH 2 (pyridazinone), -CH 2 (OH)(phenyl)F, -CH(OH)CH 3 , halo or H; R 2 o is halo, CH 3 or H;
  • R 2 i is halo, CH 3 or H
  • R 22 is H, OH, halo, -CH 2 (OH)(C 6 aryl)F, or d-C 4 alkyl;
  • R 23 is -CH 2 CH(NH 2 )COOH, -OCH 2 COOH, -NHC(0)COOH, -CH 2 COOH,
  • the thyroid hormone receptor agonist has the general structure of Formula I:
  • HO R20, R21, and R22 are independently selected from the group consisting of H, OH, halo and Ci-C 4 alkyl;
  • Ri5 is halo or H.
  • the thyroid hormone receptor agonist is selected from the group consisting of thyroxine T4 (3,5,3',5'-tetra-iodothyronine), and 3,5,3'-triiodo L- thyronine.
  • the NHR ligand is an agonist of a PPAR.
  • the PPAR agonist is selected from the group consisting of Tesaglitazar, Aleglitazar and thiazolidinediones.
  • the PPAR agonist is Tesaglitazar or Aleglitazar.
  • the insulin peptide of the conjugate Q-L-Y is a native insulin peptide or any insulin receptor agonist known to those skilled in the art.
  • the insulin peptide (Q) comprises an A chain and a B chain wherein said A chain comprises a sequence
  • X 4 is glutamic acid or aspartic acid
  • X5 is glutamine or glutamic acid
  • X 8 is histidine, threonine or phenylalanine
  • X9 is serine, arginine, lysine, ornithine or alanine
  • X10 is isoleucine or serine
  • X12 is serine or aspartic acid
  • Xi 4 is tyrosine, arginine, lysine, ornithine or alanine;
  • Xi5 is glutamine, glutamic acid, arginine, alanine, lysine, ornithine or leucine;
  • Xi7 is glutamic acid, aspartic acid, asparagine, lysine, ornithine or glutamine;
  • Xi 8 is methionine, asparagine, glutamine, aspartic acid, glutamic acid or threonine;
  • X21 is selected from the group consisting of alanine, glycine, serine, valine, threonine, isoleucine, leucine, glutamine, glutamic acid, asparagine, aspartic acid, histidine, tryptophan, tyrosine, and methionine;
  • X25 is histidine or threonine
  • X29 is selected from the group consisting of alanine, glycine and serine;
  • X30 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
  • X33 is selected from the group consisting of aspartic acid and glutamic acid;
  • X 34 is selected from the group consisting of alanine and threonine;
  • X 4 i is selected from the group consisting of glutamic acid, aspartic acid or asparagine;
  • X 4 2 is selected from the group consisting of alanine, ornithine, lysine and arginine;
  • X 4 5 is tyrosine or phenylalanine;
  • R 62 is selected from the group consisting of AYRPSE (SEQ ID NO: 14), FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptide glycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine, a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine, glutamine, glutamic acid and an N-terminal amine; and
  • R5 3 is COOH or CONH 2 .
  • the insulin peptide is a two chain insulin analog.
  • the insulin peptide is a single chain insulin analog wherein the carboxy terminus of the B chain is linked to the amino terminus of the A chain via a peptide linker. Any of the previous disclosed single chain insulin analogs having activity at the insulin receptor and known to those skilled in the art are encompassed by the present disclosure.
  • the insulin peptide of the conjugate is a two chain insulin wherein the A and B chains are linked by interchain disulfide bonds, wherein the A chain comprises the sequence GIVEQCCX 8 X 9 ICSLYQLENYCX 2 i-R5 3 (SEQ ID NO: 73) and the B chain comprises a sequence R 62 -X 25 LCGAX 3 oLVDALYLVCGDX 42 GFY (SEQ ID NO: 75), wherein
  • X 8 is histidine or threonine
  • X9 is serine, lysine, or alanine
  • X21 is alanine, glycine or asparagine
  • X25 is histidine or threonine
  • X 3 o is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
  • X 42 is selected from the group consisting of alanine, ornithine and arginine; and Rs 3 is COOH or CONH 2 ;
  • R 62 is selected from the group consisting of FVNQ (SEQ ID NO: 12), a tripeptide valine-asparagine-glutamine, a dipeptide asparagine-glutamine, glutamine, and an N- terminal amine; and
  • R5 3 is COOH or CONH 2 .
  • the A chain comprises the sequence GIYEQCCX 8 X 9 ICSLYQLENYCX 2 i-R5 3 (SEQ ID NO: 73) and the B chain comprises the B chain sequence comprises the sequence FVKQX 25 LCGSHLVEALYLVCGERGFF-R 63 (SEQ ID NO: 147), or FVNQX 25 LCGSHLVEALYLVCGERGFF-R 63 (SEQ ID NO: 148), wherein
  • X 8 is histidine or threonine
  • X9 is serine, lysine, or alanine
  • X 21 is alanine, glycine or asparagine
  • X 25 is selected from the group consisting of histidine and threonine.
  • R 6 is selected from the group consisting of YTX 28 KT (SEQ ID NO: 149), YTKPT (SEQ ID NO: 150), YTX 28 K (SEQ ID NO: 152), YTKP (SEQ ID NO: 151), YTPK (SEQ ID NO: 70), YTX 28 , YT, Y and a bond.
  • GIVEQCCX 8 SICSLYQLENYCX2i-R5 3 (SEQ ID NO: 153) or
  • GIVEQCCTSICSLYQLENYCN-R 53 (SEQ ID NO: 1) and the B chain comprises the sequence FVKQX 25 LCGSHLVEALYLVCGERGFFYTEKT (SEQ ID NO: 154),
  • X 8 is histidine or threonine
  • X 21 is alanine, glycine or asparagine; X 25 is selected from the group consisting of histidine and threonine and Rs 3 is COOH or CONH 2 .
  • the A chain comprises a sequence GIVEQCCTSICSLYQLENYCN-R 53 (SEQ ID NO: 1) and said B chain comprises a sequence F VNQHLC GS HLVE ALYLVCGERGFFYTPKT (SEQ ID NO: 2) wherein R 53 is COOH or CONH 2 .
  • the insulin peptide is a single chain insulin analog.
  • the peptide linker joining the B and A chains is selected from the group consisting of SSSSKAPPPSLPSPSRLPGPSDTPILPQR (SEQ ID NO: 158),
  • SSSSRAPPPSLPSPSRLPGPSDTPILPQK (SEQ ID NO: 159), GAGSSSX 57 X 58 (SEQ ID NO: 76), GYGSSSX 57 X 58 (SEQ ID NO: 21) and GYGSSSX 57 X 58 APQT; (SEQ ID NO: 77), wherein X 57 and X 58 are independently arginine, lysine or ornithine. In one embodiment both X 57 and X 58 are independently arginines.
  • the peptide linking moiety joining the insulin A and B chains to form a single chain insulin analog is a peptide sequence consisting of GYGSSSRR (SEQ ID NO: 18) GAGSSSRR (SEQ ID NO: 22) or
  • the linker (L in the formula Q-L-Y) is a linking group or a bond that covalently links the insulin peptide to the NHR ligand.
  • the NHR ligand is linked to the side chain of an amino acid at position B28 or B29 of the insulin peptide.
  • the amino acid at position B28 or B29 of the insulin peptide is lysine and the NHR ligand is linked to the side chain of the lysine.
  • the NHR ligand is linked to the insulin peptide via the N-terminal alpha amine of the insulin A or B chain.
  • the NHR ligand is linked to the insulin peptide via an amid bond form between and amino group of the insulin peptide and a carboxy group of the NHR ligand, optionally through a spacer moiety.
  • the linker (L in the formula Q-L-Y) is a linking group wherein L is stable in vivo, hydrolyzable in vivo, or metastable in vivo.
  • L comprises an ether moiety, or an amide moiety, an ester moiety, an acid-labile moiety, a reduction-labile moiety , an enzyme-labile moiety, a hydrazone moiety, a disulfide moiety, or a cathepsin-cleavable moiety.
  • the NHR ligand of the invention (Y) is partly or wholly non-peptidic and is hydrophobic or lipophilic.
  • the NHR ligand has a molecular weight that is about 5000 daltons or less, or about 4000 daltons or less, or about 3000 daltons or less, or about 2000 daltons or less, or about 1750 daltons or less, or about 1500 daltons or less, or about 1250 daltons or less, or about 1000 daltons or less, or about 750 daltons or less, or about 500 daltons or less, or about 250 daltons or less.
  • the structure of Y can be in accordance with any of the teachings disclosed herein.
  • Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Y that is capable of reacting with Q or
  • Y comprises a tetracyclic skeleton having three 6-membered rings joined to one 5-membered ring or a variation thereof (e.g. a Y that acts at the vitamin D receptor)
  • the carbon atoms of the skeleton are referred to by position number, as shown below:
  • a modification having a ketone at position-6 refers to the following structure:
  • the NHR ligand (Y) acts on a Type I nuclear hormone receptor.
  • Y can have any structure that permits or promotes agonist activity upon binding of the ligand to a Type I nuclear hormone receptor, while in other embodiments Y is an antagonist of the Type I nuclear hormone receptor.
  • Y comprises a structure as shown in Formula A:
  • R 1 and R 2 when present, are independently moieties that permit or promote agonist or antagonist activity upon binding of the compound of Formula A to the Type I nuclear hormone receptor; R 3 and R 4 are independently moieties that permit or promote agonist or antagonist activity upon binding of the compound of Formula A to the Type I nuclear hormone receptor; and each dashed line represents an optional double bond.
  • Formula A may further comprise one or more substituents at one or more of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 11, 12, 14, 15, 16, 17, 18, and 19.
  • Contemplated optional substituents include, but are not limited to, OH, N3 ⁇ 4, ketone, and Q-Cis alkyl groups.
  • Y comprises a structure of Formula A wherein R 1 is present and is hydrogen, d-C 7 alkyl; (C 0 _C 3 alkyl)C(0)Ci-C 7 alkyl, (C 0 _C 3 alkyl)C(0)aryl, or S0 3 H;
  • R is present and is hydrogen, halo, OH, or Ci-C 7 alkyl
  • R is hydrogen, halo, OH, or Ci-C 7 alkyl
  • R 4 is hydrogen, (Co-C 8 alkyl)halo, Ci_C 8 alkyl, C 2 -C 8 alkenyl, C 2 -i 8 alkynyl, heteroalkyl, (C 0 _C 8 alkyl)aryl, (C 0 _C 8 alkyl)heteroaryl, (C 0 _C 8 alkyl)OCi_C 8 alkyl, (C 0 _C 8 alkyl)OC 2 -C 8 alkenyl, (C 0 -C 8 alkyl)OC 2 -C 8 alkynyl, (C 0 -C 8 alkyl)OH, (C 0 -C 8 alkyl)SH, (C 0 - C 8 alkyl)NR 24 Ci_C 8 alkyl, (C 0 -C 8 alkyl)NR 24 C 2 -C 8 alkenyl, (C 0 -C 8 alkyl)NR 24 C 2 -C 8 alkynyl
  • R 24 is hydrogen or Ci_C 7 alkyl.
  • R 1 is hydrogen, propionate, acetate, benzoate, or sulfate
  • R 2 is hydrogen or methyl
  • R 3 is hydrogen or methyl
  • R 4 is acetate, cypionate, hemisucciniate, enanthate, or propionate.
  • Y comprises a structure of Formula A
  • Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula A that is capable of reacting with Q or L.
  • L e.g. when L is a linking group
  • Q e.g. when L is a bond
  • Formula A is conjugated to L or Q at any of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 of Formula A.
  • Formula A is conjugated to L or Q at position 1, 3, 6, 7, 12, 10, 13, 16, 17, or 19 of Formula A.
  • Y acts at an estrogen receptor (e.g. ERa, ERP). In some embodiments, Y permits or promotes agonist activity at the estrogen receptor, while in other embodiments Y is an antagonist of ER. In exemplary embodiments, Y can have a structure of Formula B:
  • R 1 , R 5 and R 6 are moieties that permit or promote agonist or antagonist activity upon binding of the compound of Formula B to the estrogen receptor.
  • Formula B further comprises one or more substitutents at one or more of positions 1, 2, 4, 6, 7, 8, 9, 11, 12, 14, 15, and 16 (e.g. a ketone at position-6).
  • Y comprises a structure of Formula B, wherein
  • R 1 is hydrogen, C1-C7 alkyl; (C 0 -C 3 alkyl)C(0)Ci-C 7 alkyl, (C 0 -C 3 alkyl)C(0)aryl, or S0 3 H;
  • R 5 is hydrogen, (Co-C 8 alkyl)halo, Ci_C 8 alkyl, C 2 -C 8 alkenyl, C 2 -i 8 alkynyl, heteroalkyl, (C 0 -C 8 alkyl)aryl, (C 0 -C 8 alkyl)heteroaryl, (C 0 -C 8 alkyl)OCi_C 8 alkyl, (C 0 -C 8 alkyl)OC 2 -C 8 alkenyl, (C 0 -C 8 alkyl)OC 2 -C 8 alkynyl, (C 0 -C 8 alkyl)OH, (C 0 -C 8 alkyl)SH, (C 0 - C 8 alkyl)NR 24 Ci_C 8 alkyl, (C 0 _C 8 alkyl)NR 24 C 2 _C 8 alkenyl, (C 0 _C 8 alkyl)NR 24 C 2 _C 8 alkynyl
  • R 6 is hydrogen, Ci_C 8 alkyl, C 2 -C 8 alkenyl, C 2 -C 8 alkynyl, heteroalkyl, (Co-C 8 alkyl)aryl, (C 0 _C 8 alkyl)heteroaryl, (C 0 _C 8 alkyl)C(0)Ci_C 8 alkyl, (C 0 _C 8 alkyl)C(0)C 2 _C 8 alkenyl, (C 0 -C 8 alkyl)C(0)C 2 -C 8 alkynyl, (C 0 -C 8 alkyl)C(0)H, (C 0 -C 8 alkyl)C(0)aryl, (C 0 -C 8 alkyl)C(0)heteroaryl, (C 0 -C 8 alkyl)C(0)OCi_C 8 alkyl, (C 0 -C 8 alkyl)C(0)OC 2 -C 8 alkenyl, (C 0 - C 8
  • R 24 is hydrogen or Ci_C 7 alkyl.
  • R 1 is hydrogen, propionate, acetate, benzoate, or sulfate
  • R 5 is hydrogen, ethynyl, hydroxyl
  • R 6 is acetate, cypionate, hemisucciniate, enanthate, or propionate.
  • Nonlimiting examples of the compound of Formula B include 17P-estradiol, modified forms of estradiol such as ⁇ -estradiol 17-acetate, ⁇ -estradiol 17-cypionate, ⁇ - estradiol 17-enanthate, ⁇ -estradiol 17-valerate, ⁇ -estradiol 3,17-diacetate, ⁇ -estradiol 3,17- dipropionate, ⁇ -estradiol 3 -benzoate, ⁇ -estradiol 3 -benzoate 17-n-butyrate, ⁇ -estradiol 3- glycidyl ether, ⁇ -estradiol 3-methyl ether, ⁇ -estradiol 6-one, ⁇ -estradiol 3-glycidyl, ⁇ - estradiol 6-one 6-(0-carboxymethyloxime), 16-epiestriol, 17-epiestriol, 2-methoxy estradiol, 4-methoxy estradiol, estradiol 17
  • Y comprises a structure of Formula B
  • Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula B that is capable of reacting with Q or L.
  • L e.g. when L is a linking group
  • Q e.g. when L is a bond
  • Formula B is conjugated to L or Q at any of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 of Formula B.
  • Formula B is conjugated to L or Q at position 3 or 17 of Formula B.
  • Y acts at an estrogen receptor but is not encompassed by Formula B.
  • ligands that act at an estrogen receptor that are not encompassed by Formula B are shown below:
  • Y acts at a glucocorticoid receptor (GR).
  • GR glucocorticoid receptor
  • Y comprises any structure that permits or promotes agonist activity at the GR, while in other embodiments Y is an antagonist of GR.
  • Y comprises a structure of Formula C:
  • R , R', R°, R and R 1U are each independently moieties that permit or promote agonist or antagonist activity upon the binding of the compound of Formula C to the GR; and each dash represents an optional double bond.
  • Formula C further comprises one or more substituents at one or more of positions 1, 2, 4, 5, 6, 7, 8, 9, 11, 12, 14, and 15 (e.g. hydroxyl or ketone at position- 11).
  • Y comprises a structure of Formula C wherein
  • R is hydrogen, halo, OH, or C 1 -C 7 alkyl
  • R is hydrogen, halo, OH, or C 1 -C 7 alkyl
  • R 6 is hydrogen, Ci_C 8 alkyl, C 2 -C 8 alkenyl, C 2 -C 8 alkynyl, heteroalkyl, (Co-C 8 alkyl)aryl, (C 0 _C 8 alkyl)heteroaryl, (C 0 _C 8 alkyl)C(0)Ci_C 8 alkyl, (C 0 _C 8 alkyl)C(0)C 2 _C 8 alkenyl, (C 0 -C 8 alkyl)C(0)C 2 -C 8 alkynyl, (C 0 -C 8 alkyl)C(0)H, (C 0 -C 8 alkyl)C(0)aryl, (C 0 -C 8 alkyl)C(0)heteroaryl, (C 0 -C 8 alkyl)C(0)OCi_C 8 alkyl, (C 0 -C 8 alkyl)C(0)OC 2 -C 8 alkenyl, (C 0 - C 8
  • R is hydrogen, Ci_C 8 alkyl, C 2 -C 8 alkenyl, C 2 -C 8 alkynyl, heteroalkyl, (Co-C 8 alkyl)aryl, (C 0 _C 8 alkyl)heteroaryl, (C 0 alkyl)C(0)Ci_C 8 alkyl, (C 0 alkyl)C(0)C 2 _C 8 alkenyl, (Co alkyl)C(0)C 2 -C 8 alkynyl, (C 0 )C(O)aryl, (C 0 )C(O)heteroaryl, (C 0 )C(O)OCi_C 8 alkyl, (C 0 alkyl)C(0)OC 2 -C 8 alkenyl, (C 0 alkyl)C(0)OC 2 -C 8 alkenyl, (C 0 alkyl)C(0)OC 2 -C 8 alkynyl, or (C 0 alkyl)C(0)OH
  • R is hydrogen or Ci_C 7 alkyl
  • R 9 is hydrogen or Ci_C 7 alkyl
  • R 10 is hydrogen or OH
  • R 24 is hydrogen or Ci_C 7 alkyl.
  • R 2 is hydrogen or methyl
  • R 3 is hydrogen, fluoro, chloro, or methyl
  • R 6 is hydrogen or C(O) C1-C7 alkyl
  • R 7 is hydrogen, C(0)CH 3 , or C(0)CH 2 CH 3
  • R 8 is hydrogen or methyl
  • R 9 is hydrogen or methyl
  • R 10 is hydroxyl.
  • Nonlimiting examples of structures of Formula C include:
  • Y comprises a structure of Formula C
  • Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula C that is capable of reacting with Q or L.
  • L e.g. when L is a linking group
  • Q e.g. when L is a bond
  • Formula C is conjugated to L or Q at any of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, or 23 of Formula C.
  • Formula C is conjugated to L or Q at position 3, 10, 16 or 17 of Formula C.
  • Y acts at a mineralcorticoid receptor (MR).
  • MR mineralcorticoid receptor
  • Y comprises any structure that permits or promotes agonist activity at the MR, while in other embodiments Y is an antagonist of MR.
  • Y comprises a structure of Formula D:
  • Formula D further comprises one or more substituents at one or more of positions 1, 2, 4, 5, 6, 7, 8, 11, 12, 14, 15, 16, and 17.
  • Y comprises a structure of Formula D wherein
  • R is hydrogen, halo, OH, or C 1 -C7 alkyl
  • R is hydrogen, halo, OH, or C 1 -C7 alkyl
  • R is hydrogen, Ci_C 8 alkyl, C 2 -C8 alkenyl, C 2 -C8 alkynyl, heteroalkyl, (Co-Cs alkyl)aryl, (C 0 _C 8 alkyl)heteroaryl, (C 0 alkyl)C(0)Ci_C 8 alkyl, (C 0 alkyl)C(0)C 2 -C 8 alkenyl, (Co alkyl)C(0)C 2 _C 8 alkynyl, (C 0 )C(O)aryl, (C 0 )C(O)heteroaryl, (C 0 )C(O)OCi_C 8 alkyl, (C 0 alkyl)C(0)OC 2 -C 8 alkenyl, (C 0 alkyl)C(0)OC 2 -C 8 alkenyl, (C 0 alkyl)C(0)OC 2 -C 8 alkynyl, or (C 0 alkyl)C(0)OH
  • R 10 is hydrogen or OH
  • R 24 is hydrogen or Ci_C 7 alkyl.
  • R 2 is hydrogen or methyl
  • R 3 is hydrogen, fluoro, chloro, or methyl
  • R 7 is hydrogen, C(0)CH 3 , or C(0)CH 2 CH 3
  • R 10 is hydroxyl
  • Nonlimiting examples of compounds of Formula D include:
  • Aldosterone Deoxycorticosterone acetate Deoxycorticosterone acetate and derivatives thereof.
  • Y comprises a structure of Formula D
  • Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula D that is capable of reacting with Q or L.
  • L e.g. when L is a linking group
  • Q e.g. when L is a bond
  • Formula D is conjugated to L or Q at any of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, or 24 of Formula D.
  • Formula D is conjugated to L or Q at position 3, 10, 13, or 17 of Formula D.
  • Y acts at a progesterone receptor (PR).
  • PR progesterone receptor
  • Y comprises any structure that permits or promotes agonist activity at the PR, while in other embodiments Y is an antagonist of PR.
  • Y comprises a structure of Formula E:
  • Formula E further comprises one or more substituents at one or more of positions 1, 2, 4, 5, 6, 7, 8, 11, 12, 14, 15, 16, and 17 (e.g. a methyl group at position 6).
  • Y comprises a structure of Formula E wherein
  • R is hydrogen, halo, OH, or C 1 -C 7 alkyl
  • R is hydrogen, halo, OH, or C 1 -C 7 alkyl;
  • R 4 is hydrogen, (Co-C 8 alkyl)halo, Ci_C 8 alkyl, C 2 -C 8 alkenyl, C 2 -i 8 alkynyl, heteroalkyl, (C 0 _C 8 alkyl)aryl, (C 0 _C 8 alkyl)heteroaryl, (C 0 _C 8 alkyl)OCi_C 8 alkyl, (C 0 _C 8 alkyl)OC 2 -C 8 alkenyl, (C 0 -C 8 alkyl)OC 2 -C 8 alkynyl, (C 0 -C 8 alkyl)OH, (C 0 -C 8 alkyl)SH, (C 0 - C 8 alkyl)NR 24 Ci_C 8 alkyl, (C 0 -C 8 alkyl)NR 24 C 2 -C 8 alkenyl, (
  • R is hydrogen, Ci_C 8 alkyl, C 2 -C 8 alkenyl, C 2 -C 8 alkynyl, heteroalkyl, (Co-C 8 alkyl)aryl, (C 0 _C 8 alkyl)heteroaryl, (C 0 alkyl)C(0)Ci_C 8 alkyl, (C 0 alkyl)C(0)C 2 _C 8 alkenyl, (Co alkyl)C(0)C 2 _C 8 alkynyl, (C 0 )C(O)aryl, (C 0 )C(O)heteroaryl, (C 0 )C(O)OCi_C 8 alkyl, (C 0 alkyl)C(0)OC 2 -C 8 alkenyl, (C 0 alkyl)C(0)OC 2 -C 8 alkynyl, or (C 0 alkyl)C(0)OH; and
  • R 24 is hydrogen or C 1-C7 alkyl.
  • R 2 is hydrogen or methyl
  • R 3 is hydrogen or methyl
  • R 4 is (Ci alkyl)C(0)Ci_C 4 alkyl, acetate, cypionate, hemisucciniate, enanthate, or propionate
  • R is hydrogen, C(0)CH 3 , or C(0)CH 2 CH 3 .
  • Nonlimiting examples of compounds of Formula E include:
  • Progesterone 19-nor-progesterone Medroxyprogesterone and derivatives thereof.
  • Y comprises a structure of Formula E
  • Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula E that is capable of reacting with Q or L.
  • L e.g. when L is a linking group
  • Q e.g. when L is a bond
  • Formula E is conjugated to L or Q at any of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, or 24 of Formula E.
  • Formula E is conjugated to L or Q through position 3 or 17 of Formula E.
  • Y acts at a progesterone receptor but is not encompassed by Formula E.
  • Y can comprise the below structure and analogs thereof:
  • Y acts at an androgen receptor (AR).
  • AR androgen receptor
  • Y comprises any structure that permits or promotes agonist activity at the AR, while in other embodiments Y is an antagonist of AR.
  • Y comprises a structure of Formula F:
  • R 1 when present, R 2 , R 3 and R 6 are each independently a moiety that permits or promotes agonist or antagonist activity upon binding of the compound of Formula F to the AR; and each dashed line represents an optional double bond, with the proviso that no more than one of the optional carbon-carbon double bond is present at position 5.
  • Formula F further comprises one or more substituents at one or more of positions 1, 2, 4, 5, 6, 7, 8, 11, 12, 14, 15, 16, and 17.
  • Y comprises a structure of Formula F wherein
  • R 1 is hydrogen, C 1 -C7 alkyl; (C 0 -C 3 alkyl)C(0)Ci-C 7 alkyl, (C 0 _C 3 alkyl)C(0)aryl, or
  • R is hydrogen, halo, OH, or Ci-C 7 alkyl
  • R is hydrogen, halo, OH, or Ci-C 7 alkyl
  • R 6 is hydrogen, Ci_C 8 alkyl, C 2 -C 8 alkenyl, C 2 -C 8 alkynyl, heteroalkyl, (Co-C 8 alkyl)aryl, (C 0 -C 8 alkyl)heteroaryl, (C 0 -C 8 alkyl)C(0)Ci_C 8 alkyl, (C 0 -C 8 alkyl)C(0)C 2 -C 8 alkenyl, (C 0 -C 8 alkyl)C(0)C 2 -C 8 alkynyl, (C 0 -C 8 alkyl)C(0)H, (C 0 -C 8 alkyl)C(0)aryl, (C 0 -C 8 alkyl)C(0)heteroaryl, (C 0 _C 8 alkyl)C(0)OCi_C 8 alkyl, (C 0 _C 8 alkyl)C(0)OC 2 _C 8 alkenyl, (C 0 _ C 8
  • R 24 is hydrogen or Ci_C 7 alkyl.
  • R 1 is hydrogen or absent;
  • R2 is hydrogen or methyl;
  • R 3 is hydrogen or methyl; and
  • R 6 is H or absent.
  • Nonlimiting examples of compounds of Formula F include:
  • Y comprises a structure of Formula F
  • Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula F that is capable of reacting with Q or L.
  • L e.g. when L is a linking group
  • Q e.g. when L is a bond
  • Formula F is conjugated to L or Q at any of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, or 22 of Formula F.
  • Formula F is conjugated to L or Q at position 3 or 17 of Formula F.
  • the binding of the NHR ligand to the Type I nuclear hormone receptor results in agonist activity (or antagonist activity) in some but not all cells or tissues expressing the Type I nuclear hormone receptor.
  • the NHR ligand (Y) acts on a Type II nuclear hormone receptor.
  • Y can have any structure that permits or promotes agonist activity upon binding of the ligand to a Type II nuclear hormone receptor, while in other embodiments Y is an antagonist of the Type II nuclear hormone receptor.
  • Y exhibits agonist (or antagonist) activity at a thyroid hormone receptor (TR), retinoic acid receptor (RAR), peroxisome proliferator activated receptor (PPAR), Liver X Receptor (LXR), farnesoid X receptor (FXR), vitamin D receptor (VDR), and/or pregnane X receptor (PXR).
  • TR thyroid hormone receptor
  • RAR retinoic acid receptor
  • PPAR peroxisome proliferator activated receptor
  • LXR Liver X Receptor
  • FXR farnesoid X receptor
  • VDR vitamin D receptor
  • PXR pregnane X receptor
  • Y acts at a thyroid hormone receptor (e.g. TRa, TRP).
  • TRa thyroid hormone receptor
  • TRP thyroid hormone receptor
  • Y comprises any structure that permits or promotes agonist activity at the TR, while in other embodiments Y is an antagonist of TR.
  • a thyroid hormone receptor agonist is provided having the general structure of
  • Ri5 is Ci-C 4 alkyl, -CH 2 (pyridazinone), -CH 2 (OH)(phenyl)F, -CH(OH)CH 3 , halo or
  • R 20 is halo, CH 3 or H
  • R 21 is halo, CH 3 or H
  • R22 is H, OH, halo, -CH 2 (OH)(C 6 aryl)F, or C1-C4 alkyl;
  • R 23 is -CH 2 CH(NH 2 )COOH, -OCH 2 COOH, -NHC(0)COOH, -CH 2 COOH, -NHC(0)CH 2 COOH, -CH 2 CH 2 COOH, or -OCH 2 P0 3 2" .
  • Ri 5 is C 1 -C4 alkyl, -CH(OH)CH 3 , 1 or H
  • R 20 is I, Br, CH 3 or H
  • R 21 is I, Br, CH 3 or H
  • R22 is H, OH, I, or C1-C4 alkyl
  • R 23 is -CH 2 CH(NH 2 )COOH, -OCH 2 COOH, -NHC(0)COOH, -CH 2 COOH, -NHC(0)CH 2 COOH, -CH 2 CH 2 COOH, or -OCH 2 P0 3 2" .
  • R 23 is -CH 2 CH(NH 2 )COOH, -OCH 2 COOH, -NHC(0)COOH, -CH 2 COOH, -NHC(0)CH 2 COOH, -CH 2 CH 2 COOH, or -OCH 2 P0 3 2" .
  • R 23 is
  • Ri 5 is isopropyl, -CH(OH)CH 3 , 1 or H
  • R 20 is I, Br, CI, or CH 3 ;
  • R 21 is I, Br, CI, or CH 3 ;
  • R 22 is H
  • R 23 is -OCH 2 COOH, -CH 2 COOH, -NHC(0)CH 2 COOH, or -CH 2 CH 2 COOH.
  • the thyroid hormone receptor agonist is a compound of the general structure of Formula I:
  • R20, R21, and R22 are independently selected from the group consisting of H, OH, halo and Ci-C 4 alkyl;
  • Ri 5 is halo or H.
  • R 2 o and R 2 i are each CH 3
  • R15 is H
  • R 22 are independently selected from the group consisting of H, OH, halo and Ci-C 4 alkyl.
  • R 2 o, R21 and R 22 are each halo and R15 is H or halo.
  • R20, R21 and R22 are each I
  • R15 is H or I.
  • Y is selected from the group consisting of thyroxine T4 (3,5,3',5'-tetraiodothyronine) and 3,5,3'- triiodo L- thyronine.
  • the thyroid hormone receptor ligand (Y) of the Q-L-Y conjugates is an indole derivative of thyroxine, including for example, compounds disclosed in U.S. Pat. No. 6,794,406 and US published application no. US 2009/0233979, the disclosures of which are incorporated herein.
  • the indole derivative of thyroxine comprises a compound of the general structure of Formula II:
  • Ri 3 is H or Ci-C 4 alkyl
  • Ri 4 is Ci-Cg alkyl
  • Ri 5 is H or Ci-C 4 alkyl
  • Ri 6 and Ri 7 are independently halo or Ci-C 4 alkyl.
  • the thyroid receptor ligand (Y) of the Q-L-Y conjugates is an indole derivative of thyroxine as disclosed in W097/21993 (U. Cal SF), WO99/00353 (KaroBio), GB98/284425 (KaroBio), and U.S. Provisional Application 60/183,223, the disclosures of which are incorporated by reference herein.
  • the thyroid receptor ligand comprises the general structure of Formula III:
  • X is oxygen, sulfur, carbonyl, methylene, or NH;
  • Ri is halogen, trifluoromethyl, or Ci-C 6 alkyl or C3-C7 cycloalkyl;
  • R 2 and R 3 are the same or different and are hydrogen, halogen, Ci-C 6 alkyl or C 3 -C 7 cycloalkyl, with the proviso that at least one of R 2 and R 3 being other than hydrogen;
  • R 4 is hydrogen or Ci-C 4 alkyl
  • R5 is hydrogen or Ci-C 4 alkyl
  • R 6 is carboxylic acid, or ester thereof
  • R 7 is hydrogen, or an alkanoyl or aroyl group.
  • Nonlimiting examples of Y include the following compounds:
  • Thyroxine (T 4 ) Triiodothyroxine (T 3 ) , and derivatives thereof.
  • Y comprises a structure that permits or promotes agonist or antagonist activity at a TR
  • Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Y that is capable of reacting with Q or L.
  • L e.g. when L is a linking group
  • Q e.g. when L is a bond
  • Y is conjugated to L or Q through any position of Y.
  • Y is conjugated to L or Q through the carboxylic acid or amine moieties, as indicated below:
  • Y acts at a retinoic acid receptor (e.g. RARa, RARp, RARy).
  • Y comprises any structure that permits or promotes agonist activity at the RAR, while in other embodiments Y is an antagonist of RAR.
  • Y comprises a structure of Formula G:
  • R 11 is a moiety that permits or promotes agonist or antagonist activity upon the binding of the compound of Formula G to a RAR, and - ⁇ w represents either E or Z stereochemistry.
  • Y comprises a structure of Formula G wherein R 11 is
  • Y comprises a structure of Formula G wherein R 11 is a carboxylic acid derivative (e.g. acyl chloride, anhydride, and ester).
  • R 11 is a carboxylic acid derivative (e.g. acyl chloride, anhydride, and ester).
  • Nonlimiting examples of the compound of Formula G include:
  • Retinal 1 1 -cis-retinoic acid and derivatives thereof.
  • Y comprises a structure of Formula G
  • Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula G that is capable of reacting with Q or L.
  • L e.g. when L is a linking group
  • Q e.g. when L is a bond
  • Y is conjugated to L or Q through any position of Y.
  • Formula G is conjugated to L or Q at R 11 .
  • Y acts at a peroxisome proliferator activated receptor (e.g. PPARa, PPARp/ ⁇ , PPARy). In some embodiments, Y acts at PPARy. In some
  • Y comprises any structure that permits or promotes agonist activity at the PPAR, while in other embodiments Y is an antagonist of PPAR.
  • Y is a saturated or unsaturated, halogenated or nonhalogenated free fatty acid (FFA) as described by Formula H:
  • n is 0-26 and each R 12 , when present, is independently a moiety that permits or promotes agonist or antagonist activity upon binding of the compound of Formula H to a
  • Y comprises a structure of Formula H, wherein n is 0-26 and each R 12 , when present, is independently hydrogen, C1-C7 alkyl, or halogen.
  • Formula B is saturated such as, for example, formic acid, acetic acid, n-caproic acid, heptanoic acid, caprylic acid, nonanoic acid, capric acid, undecanoic acid, lauric acid, tridecanoic acid, myristic acid, pentadeconoic acid, palmitic acid, heptadecanoic acid, stearic acid, nonadecanoic acid, arachidic acid, heneicosanoic acid, behenic acid, tricosanoic acid, perfluorononanoic acid (see below), perfluorooctanoic acid (see below), and derivatives thereof.
  • Formula H is unsaturated with either cis or trans
  • stereochemistry such as, for example, mead acid, myristoleic acid, palmitoleic acid, sapienic acid, oleic acid, linoleic acid, a-linolenic acid, elaidic acid, petroselinic acid, arachidonic acid, dihydroxyeicosatetraenoic acid (DiHETE), octadecynoic acid, eicosatriynoic acid, eicosadienoic acid, eicosatrienoic acid, eicosapentaenoic acid, erucic acid, dihomolinolenic acid, docosatrienoic acid, docosapentaenoic acid, docosahexaenoic acid, adrenic acid, and derivatives thereof, including for example:
  • Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula H that is capable of reacting with Q or L.
  • L e.g. when L is a linking group
  • Q e.g. when L is a bond
  • Formula H is conjugated to L or Q at any position on Formula H.
  • Formula H is conjugated to L or Q through the terminal carboxylic acid moiety.
  • Y is an eiconsanoid.
  • Y is a prostaglandin or a leukotriene.
  • Y is a prostaglandin having a structure as described by Formulae J1-J6:
  • each R is independently a moiety that permits or promotes agonist or antagonist activity upon the binding of the compound of Formula J to a PPAR (e.g. PGJ2 as shown below):
  • each R 13 is independently C 7 -C 8 alkyl, C 7 -C 8 alkenyl, C 7 -C 8 alkynyl, or heteroalkyl.
  • Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of the eicosanoid that is capable of reacting with Q or L.
  • L e.g. when L is a linking group
  • Q e.g. when L is a bond
  • Y is conjugated to L or Q through any position of Y.
  • the eicosanoid is conjugated to L or Q through a terminal carboxylic acid moiety or through a pendant alcohol moiety.
  • Y is a leukotriene having a structure as described by Formula K or a derivatized form of Formula K:
  • each R is independently a moiety that permits or promotes agonist or antagonist activity upon the binding of the compound of Formula K to a PPAR (e.g. leukotriene B4 as shown below):
  • each R is independently C 3 -C 13 alkyl, C 3 -C 13 alkenyl, C 3 -C 13 alkynyl, or heteroalkyl.
  • Y comprises a structure of Formula K
  • Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula K that is capable of reacting with Q or L.
  • L e.g. when L is a linking group
  • Q e.g. when L is a bond
  • Formula K is conjugated to L or Q at any position on Formula K.
  • Formula K is conjugated to L or Q through the terminal carboxylic acid moiety or through a pendant alcohol moiety.
  • Y is a thiazolidinedione comprising a structure as described by Formula L:
  • Nonlimiting examples of the compound of Formula L include:
  • Troghtazone ⁇ and derivatives thereof are provided.
  • Y comprises a structure of Formula L
  • Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula L that is capable of reacting with Q or L.
  • L e.g. when L is a linking group
  • Q e.g. when L is a bond
  • Formula L is conjugated to L or Q at any position on Formula L, such as, for example, a pendant alcohol moiety, or through an aromatic substituent.
  • Y comprises Tesaglitzar or Aleglitazar
  • Y is conjugated L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position that is capable of reacting with Q or L.
  • Tesaglitzar or Aleglitazar is conjugated to L or Q through the carboxylic acid moiety of the compound.
  • Y acts at a RAR-related orphan receptor (e.g. RORa, RORp, RORy).
  • Y comprises any structure that permits or promotes agonist activity at the ROR, while in other embodiments Y is an antagonist of ROR.
  • Nonlimiting examples of Y include:
  • CGP 52608 All-trans-retinoic acid , and derivatives thereof.
  • Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Y that is capable of reacting with Q or L.
  • L e.g. when L is a linking group
  • Q e.g. when L is a bond
  • Y is conjugated to L or Q through any position of Y, such as, for example, any of the positions previously described herein.
  • Y acts at a liver X receptor (LXRa, LXRP).
  • LXRa liver X receptor
  • Y comprises any structure that permits or promotes agonist activity at the LXR, while in other embodiments Y is an antagonist of LXR.
  • Y is an oxysterol (i.e. oxygenated derivative of cholesterol).
  • Nonlimiting examples of Y in these embodiments include 22(R)-hydroxycholesterol (see below), 24(S)-hydroxycholesterol (see below), 27-hydroxycholesterol, cholestenoic acid, and derivatives thereof.
  • Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Y that is capable of reacting with Q or L.
  • L e.g. when L is a linking group
  • Q e.g. when L is a bond
  • Y is conjugated to L or Q at any of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or 26 of Formula F.
  • Formula F is conjugated to L or Q at position 3 or 17 of Formula F.
  • Y acts at the farnesoid X receptor (FXR).
  • FXR farnesoid X receptor
  • Y comprises any structure that permits or promotes agonist activity at the FXR, while in other embodiments Y is an antagonist of FXR. In some of these
  • Y is a bile acid.
  • Y has a structure of Formula M:
  • R 15 , R 16 , and R 17 are independently moieties that permit or promote agonist or antagonist activity upon binding of the compound of Formula M to a FXR.
  • each of R 15 and R 16 are independently hydrogen, (Co-C 8 alkyl)halo, Ci_Ci 8 alkyl, C 2 -Ci 8 alkenyl, C 2 _Ci 8 alkynyl, heteroalkyl, or (C 0 -C 8 alkyl)OH; and R 17 is OH, (C 0 -C 8 alkyl)NH(Ci-C 4
  • each of R 15 and R 16 are independently hydrogen or OH; and R 17 is OH, NH(C C 2 alkyl)S0 3 H, or NH(C C 2 alkyl)COOH.
  • Nonlimiting examples of the compound of Formula M include:
  • Y comprises a structure of Formula M
  • Y is conjugated to L
  • Formula M is conjugated to L or Q at any of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 of Formula M.
  • Formula M is conjugated to L or Q at position 3, 7, 12 or 17 of Formula M.
  • Y comprises any structure that permits or promotes agonist activity at the VDR, while in other embodiments Y is an antagonist of VDR.
  • Y has a structure of Formula N:
  • each of R 18 , R 1 , R 20 , R 21 , R 22 , and R 23 are moieties that permit or promote agonist or antagonist activity upon binding of the compound of Formula N to the VDR such as, for example, any of the vitamin D compounds found in Bouillon et al., Endocrine Reviews,
  • Y comprises a structure of Formula N
  • R 18 and R 19 are each independently hydrogen, (Co-C 8 alkyl)halo, (Co-C 8
  • each of R 21 and R 22 are independently Ci-C 4 alkyl
  • R 23 is C 4 _Cis alkyl, C 4 _Ci 8 alkenyl, C 4 _Ci 8 alkynyl, heteroalkyl, (C 4 _Ci 8 alkyl)aryl, (C 4 _Ci8 alkyl)heteroaryl, (C 0 _C 8 alkyl)OCi_Ci 8 alkyl, (C 0 _C 8 alkenyl)OCi_Ci 8 alkyl, (C 0 _C 8 alkynyl)OCi_Ci 8 alkyl, (C 0 _C 8 alkyl)OC 2 _Ci 8 alkenyl, (C 0 _C 8 alkyl)OC 2 _Ci 8 alkynyl, (C 6 _Ci 8 alkyl)OH, (C 6 -Ci 8 alkyl)SH, (C 6 -Ci 8 alkenyl)OH, (C 6 -Ci 8 alkynyl)OH, (
  • R 24 is hydrogen or Ci_Ci 8 alkyl.
  • Nonlimiting examples of the compound of Formula N include:
  • Y comprises a structure of Formula N
  • Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula N that is capable of reacting with Q or L.
  • L e.g. when L is a linking group
  • Q e.g. when L is a bond
  • Formula N is conjugated to L or Q at any of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or 26 of Formula N.
  • Formula N is conjugated to L or Q at position 1, 3, 19, or 25 of Formula N.
  • Y acts at the pregnane X receptor (PXR).
  • PXR pregnane X receptor
  • Y comprises any structure that permits or promotes agonist activity at the PXR, while in other embodiments Y is an antagonist of PXR.
  • Y is a steroid, antibiotic, antimycotic, bile acid, hyperforin, or a herbal compound.
  • Y is compound that is able to induce CYP3A4, such as dexamethasone and rifampicin.
  • Y comprises a structure that acts at the PXR
  • Y is conjugated to L (e.g. when L is a linking group) or Q (e.g.
  • Y is conjugated to L or Q at any of positions on Y. Modification of the NHR Ligand (Y)
  • the NHR ligand is derivatized or otherwise chemically modified to comprise a reactive moiety that is capable of reacting with the insulin peptide (Q) or the linking group (L).
  • Y is derivatized at any position of Y that is capable of reacting with Q or L. The position of derivatization on Y is apparent to one skilled in the art and depends on the type of NHR ligand used and the activity that is desired.
  • Y has a structure comprising a tetracyclic skeleton having three 6-membered rings joined to one 5-membered ring or a variation thereof
  • Y can be derivatized at any of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25.
  • Other positions of derivatization can be as previously described herein.
  • estradiol can be derivatized with succinic acid, succinic anhydride, benzoic acid, ethyl 2-bromoacetate, or iodoacetic acid to form the below derivatives of estradiol that are capable of conjugating to Q or L.
  • any of the aforementioned NHR ligands can be derivatized by methods known in the art. Additionally, certain derivatized ligands are commercially available and can be purchased from chemical companies such as Sigma-Aldrich.
  • Y is selected from the group consisting of estradiol and derivatives thereof, estrone and derivatives thereof, testosterone and derivatives thereof, and Cortisol and derivatives thereof. In one embodiment Y is dexamethasone.
  • Y is selected from the group consisting of thyroxine T4 (3,5,3',5'-tetra- iodo thyronine), 3,5,3 '-triiodo L-thyronine, Tesaglitazar, Aleglitazar and thiazolidinediones.
  • Y is selected from the group consisting of thyroxine T4 (3,5,3',5'-tetra- iodo thyronine), and 3,5,3 '-triiodo L-thyronine.
  • Y is selected from the group consisting of Tesaglitazar and Aleglitazar. Structure of the Insulin peptide
  • the insulin peptide of the presently disclosed conjugates is native insulin, comprising the A chain of SEQ ID NO: 1 and the B chain of SEQ ID NO: 2, or an analog of native insulin, including for example a single-chain insulin analog comprising SEQ ID NOS: 1 and 2.
  • analogs of insulin encompass polypeptides comprising an A chain and a B chain wherein the insulin analogs differ from native insulin by one or more amino acid substitutions at positions selected from A5, A8, A9, A10, A12, A14, A15, A17, A18, A21, B l, B2, B3, B4, B5, B9, B IO, B 13, B 14, B 17, B20, B21, B22, B23, B26, B27, B28, B29 and B30 or deletions of any or all of positions B l-4 and B26-30.
  • amino acid residue at position B28 is substituted with Asp, Lys, Leu, Val, or Ala, and the amino acyl residue at position B29 is Lys or Pro;
  • an insulin analog comprising an Asp substituted at position B28 or a Lys substituted at position 28 and a proline substituted at position B29. Additional insulin analogs are disclosed in Chance, et al., U.S. Pat. No. 5,514,646; Chance, et al., U.S. patent application Ser. No. 08/255,297; Brems, et al., Protein Engineering, 5:527-533 (1992); Brange, et al., EPO Publication No. 214,826 (published Mar. 18, 1987); and Brange, et al., Current Opinion in Structural Biology, 1:934-940 (1991). The disclosures of which are expressly incorporated herein by reference.
  • Insulin analogs may also have replacements of the amidated amino acids with acidic forms.
  • Asn may be replaced with Asp or Glu.
  • Gin may be replaced with Asp or Glu.
  • Asn(A18), Asn(A21), or Asp(B3), or any combination of those residues may be replaced by Asp or Glu.
  • Gln(A15) or Gln(B4), or both, may be replaced by either Asp or Glu.
  • single chain insulin agonists comprising a B chain and an A chain of human insulin, or analogs or derivative thereof, wherein the carboxy terminus of the B chain is linked to the amino terminus of the A chain via a linking moiety.
  • the A chain is an amino acid sequence selected from the group consisting of GIVEQCCTSICSLYQLENYCN (SEQ ID NO: 1),
  • GIVDECCFRSCDLRRLEMYCA (SEQ ID NO: 5) or GIVEECCFRSCDLALLETYCA (SEQ ID NO: 7) and the B chain comprises the sequence
  • a YRPS ETLC GGELVDTLYLVC GDRGF YFS RP A (SEQ ID NO: 8), or a carboxy shortened sequence thereof having one to five amino acids corresponding to B26, B27, B28, B29 and B30 deleted, and analogs of those sequences wherein each sequence is modified to comprise one to five amino acid substitutions at positions corresponding to native insulin positions selected from A5, A8, A9, A10, A14, A15, A17, A18, A21, B l, B2, B3, B4, B5, B9, B IO, B 13, B 14, B20, B22, B23, B26, B27, B28, B29 and B30.
  • the amino acid substitutions are conservative amino acid substitutions.
  • Suitable amino acid substitutions at these positions that do not adversely impact insulin's desired activities are known to those skilled in the art, as demonstrated, for example, in Mayer, et al., Insulin Structure and Function, Biopolymers. 2007;88(5):687-713, the disclosure of which is incorporated herein by reference.
  • Additional amino acid sequences can be added to the amino terminus of the B chain or to the carboxy terminus of the A chain of the single chain insulin agonists of the present invention.
  • a series of negatively charged amino acids can be added to the amino terminus of the B chain, including for example a peptide of 1 to 12, 1 to 10, 1 to 8 or 1 to 6 amino acids in length and comprising one or more negatively charged amino acids including for example glutamic acid and aspartic acid.
  • the B chain amino terminal extension comprises 1 to 6 charged amino acids.
  • the B chain amino terminal extension comprises the sequence GX61X62X63X64X65K (SEQ ID NO: 26) or X61X62X63X64X65RK (SEQ ID NO: 27), wherein X 61 , X 62 , X 6 3 ⁇ 4 and X 65 are independently glutamic acid or aspartic acid.
  • the B chain comprises the sequence GEEEEEKGPEHLCGAHLVDALYLVCGDX 42 GFY (SEQ ID NO: 28), wherein X 42 is selected from the group consisting of alanine lysine, ornithine and arginine.
  • High potency NHR ligand-insulin conjugates can also be prepared based on using a modified IGF I and IGF II sequence described in published International application no. WO 2010/080607, the disclosure of which is expressly incorporated herein by reference, as the insulin peptide component. More particularly, analogs of IGF I and IGF II that comprise a substitution of a tyrosine leucine dipeptide for the native IGF amino acids at positions corresponding to B 16 and B 17 of native insulin have a tenfold increase in potency at the insulin receptor.
  • the insulin peptide for use in the present disclosure comprises a B chain sequence of R 62 -
  • X 4 is glutamic acid or aspartic acid
  • X5 is glutamine or glutamic acid
  • X 8 is histidine, threonine or phenylalanine
  • X9 is serine, arginine, lysine, ornithine or alanine
  • X10 is isoleucine or serine
  • X12 is serine or aspartic acid
  • Xi 4 is tyrosine, arginine, lysine, ornithine or alanine;
  • Xi 5 is glutamine, glutamic acid, arginine, alanine, lysine, ornithine or leucine;
  • Xi 7 is glutamine, glutamic acid, arginine, aspartic acid, lysine or ornithine;
  • Xi 8 is methionine, asparagine, glutamine, aspartic acid, glutamic acid or threonine;
  • X19 is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;
  • X21 is selected from the group consisting of alanine, glycine, serine, valine, threonine, isoleucine, leucine, glutamine, glutamic acid, asparagine, aspartic acid, histidine, tryptophan, tyrosine, and methionine;
  • X25 is histidine or threonine
  • X29 is selected from the group consisting of alanine, glycine and serine;
  • X 30 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
  • X 33 is selected from the group consisting of aspartic acid, glutamine and glutamic acid;
  • X 34 is selected from the group consisting of alanine and threonine;
  • X 4 i is selected from the group consisting of glutamic acid, aspartic acid or asparagine;
  • X 42 is selected from the group consisting of alanine, lysine, ornithine and arginine;
  • X 4 5 is tyrosine, histidine, asparagine or phenylalanine;
  • R 62 is selected from the group consisting of AYRPSE (SEQ ID NO: 14), FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptide glycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine, a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine, glutamine, glutamic acid and a bond; and Rs 3 is COOH or CONH 2 .
  • the A chain and the B chain are linked to one another by interchain disulfide bonds, including those that form between the A and B chains of native insulin.
  • the A and B chains are linked together as a linear single chain- insulin peptide.
  • the conjugates comprise an insulin peptide wherein the A chain comprises a sequence of GIVEQCCXiSICSLYQLENX 2 CX 3 (SEQ ID NO: 30) and said B chain sequence comprises a sequence of X 4 LCGX 5 X 6 LVEALYLVCGERGFF (SEQ ID NO: 31), wherein
  • Xi is selected from the group consisting of threonine and histidine;
  • X 2 is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;
  • X 3 is selected from the group consisting of asparagine and glycine
  • X 4 is selected from the group consisting of histidine and threonine
  • X5 is selected from the group consisting of alanine, glycine and serine;
  • X 6 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid.
  • an insulin analog is provided wherein the A chain of the insulin peptide comprises the sequence GIVEQCCX 8 X9ICSLYQLENYCX 21 - R 5 (SEQ ID NO: 73) or GIVEQCCX 8 SICSLYQLXi 7 NYCX 2 i (SEQ ID NO: 32) and the B chain comprising the sequence R 62 -X 25 LCGX 2 9X 3 oLVX 33 X 34 LYLVCGX 4 iX 42 GFX 45 YT-Zi- Bi (SEQ ID NO: 142), wherein
  • X 8 is selected from the group consisting of threonine and histidine;
  • X9 is serine, lysine, or alanine
  • X 17 is glutamine or glutamic acid
  • X 21 is asparagine or glycine
  • X25 is histidine or threonine
  • X29 is selected from the group consisting of alanine, glycine and serine;
  • X30 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
  • X33 is selected from the group consisting of aspartic acid and glutamic acid;
  • X 34 is selected from the group consisting of alanine and threonine
  • X 4 i is selected from the group consisting of glutamic acid, aspartic acid or asparagine;
  • X 4 2 is selected from the group consisting of alanine, ornithine, lysine and arginine;
  • X 4 5 is tyrosine or phenylalanine;
  • R 6 2 is selected from the group consisting of FVNQ (SEQ ID NO: 12), a tripeptide valine-asparagine-glutamine, a dipeptide asparagine-glutamine, glutamine and an N-terminal amine
  • Zi is a dipeptide selected from the group consisting of aspartate-lysine, lysine- proline, and proline-lysine;
  • Bi is selected from the group consisting of threonine, alanine or a threonine-arginine- arginine tripeptide.
  • an insulin analog wherein the A chain of the insulin peptide comprises the sequence GIVEQCCX8SICSLYQLX 1 7NX 19 CX 21 (SEQ ID NO: 32) and the B chain comprising the sequence
  • X 8 is selected from the group consisting of threonine and histidine;
  • Xi7 is glutamic acid or glutamine
  • X19 is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;
  • X 21 is asparagine or glycine
  • X25 is selected from the group consisting of histidine and threonine
  • X 29 is selected from the group consisting of alanine, glycine and serine;
  • X 3 o is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid.
  • the B chain comprises the sequence X 22 VNQX 25 LCGX 29 X 3 oLVEALYLVCGERGFFYT-Zi-B 1 (SEQ ID NO: 34) wherein
  • X 22 is selected from the group consisting of phenylalanine and desamino- phenylalanine;
  • X25 is selected from the group consisting of histidine and threonine;
  • X29 is selected from the group consisting of alanine, glycine and serine;
  • X30 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
  • Zi is a dipeptide selected from the group consisting of aspartate-lysine, lysine- proline, and proline-lysine;
  • Bi is selected from the group consisting of threonine, alanine or a threonine-arginine- arginine tripeptide.
  • X 8 is histidine or phenylalanine
  • X9 and X 14 are independently selected from arginine, lysine, ornithine or alanine;
  • Xi5 is arginine, lysine, ornithine or leucine
  • X 17 is glutamic acid or glutamine
  • Xi 8 is methionine, asparagine or threonine
  • X19 is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;
  • X21 is alanine, glycine or asparagine
  • X23 is asparagine or glycine
  • X25 is selected from the group consisting of histidine and threonine
  • X29 is selected from the group consisting of alanine, glycine and serine;
  • X30 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
  • X33 is selected from the group consisting of aspartic acid and glutamic acid;
  • X3 4 is selected from the group consisting of alanine and threonine
  • X 4 2 is selected from the group consisting of alanine, lysine, ornithine and arginine;
  • X 4 5 is tyrosine
  • R53 is COOH or CONH 2 .
  • X 8 is histidine
  • X9 and X 14 are independently selected from arginine, lysine, ornithine or alanine; Xi5 is arginine, lysine, ornithine or leucine;
  • X 17 is glutamic acid, aspartic acid, asparagine, lysine, ornithine or glutamine;
  • Xi8 is methionine, asparagine or threonine
  • X19 is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;
  • X 2 i is alanine, glycine or asparagine
  • X 2 3 is asparagine or glycine
  • X 2 5 is selected from the group consisting of histidine and threonine;
  • X 2 9 is selected from the group consisting of alanine, glycine and serine;
  • X30 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
  • X33 is selected from the group consisting of aspartic acid and glutamic acid;
  • X3 4 is selected from the group consisting of alanine and threonine
  • X 42 is selected from the group consisting of alanine, lysine, ornithine and arginine;
  • X 4 5 is tyrosine or phenylalanine
  • R 53 is COOH or CONH 2 .
  • the A chain comprises the sequence GIVDECCHX 9 SCDLXi 4 Xi5LXi7MXi 9 CX 2 i-R53 (SEQ ID NO: 37), and the B chain comprises the sequence X 25 LCGAX 3 oLVDALYLVCGDX 42 GFX 4 5 (SEQ ID NO: 38) wherein
  • X9, X 14 and Xi 5 are independently ornithine, lysine or arginine;
  • Xi 7 is glutamic acid or glutamine
  • X19 is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;
  • X 2 i is alanine, glycine or asparagine
  • X 2 5 is selected from the group consisting of histidine and threonine;
  • X30 is selected from the group consisting of histidine, aspartic acid and glutamic acid;
  • X 42 is selected from the group consisting of alanine, lysine, ornithine and arginine;
  • X 4 5 is tyrosine or phenylalanine and
  • R 53 is COOH or CONH 2 .
  • the B chain is selected from the group consisting of HLCGAELVDALYLVCGDX 42 GFY (SEQ ID NO: 39),
  • GPEHLCGAELVDALYLVCGDX 42 GFY (SEQ ID NO: 40), GPEHLCGAELVDALYLVCGDX 42 GFYFNPKT (SEQ ID NO: 41) and GPEHLCGAELVDALYLVCGDX 42 GFYFNKPT (SEQ ID NO: 42), wherein X 42 is selected from the group consisting of ornithine, lysine and arginine.
  • the A chain comprises the sequence GIVDECCHX 9 SCDLXi 4 Xi 5 LQMYCN-R 5 3 (SEQ ID NO: 43), wherein X 9 , X 14 and X15 are independently ornithine, lysine or arginine.
  • GIVDECCX8RSCDLYQLENXi 9 CN-R53 (SEQ ID NO: 44) and the B chain comprises the sequence R 62 -X 25 LCGSHLVDALYLVCGDX 42 GFX 4 5 (SEQ ID NO: 45)
  • X 8 is threonine, histidine or phenylalanine
  • X 19 is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;
  • X 2 5 is histidine or threonine
  • X 42 is alanine, ornithine or arginine
  • X 4 5 is tyrosine histidine, asparagine or phenylalanine
  • R 62 is selected from the group consisting of AYRPSE (SEQ ID NO: 14), FVNQ
  • SEQ ID NO: 12 PGPE (SEQ ID NO: 11), a tripeptide glycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine, a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine, glutamine, glutamic acid and a bond; and R53 is COOH or CONH 2 .
  • R53 is COOH or CONH 2 .
  • X 19 is Tyr.
  • GIVDECCHRSCDLRRLEMXi 9 CX 2 i-R 5 3 (SEQ ID NO: 47); and the B chain comprises the sequence FVNQHLCGS HLVE ALYLVCGERGFFYTPKT (SEQ ID NO: 2), or
  • X 19 is tyrosine, 4-methoxy phenylalanine or 4-amino-phenylalanine
  • X 2 i is alanine, glycine or asparagine, optionally wherein X 19 is tyrosine, and X 21 is alanine or asparagine.
  • the A chain comprises the sequence
  • GIVEQCCHSICSLYQLENYCX 2 i-R 5 3 (SEQ ID NO: 160) and the B chain comprises the sequence FVKQX 25 LCGSHLVEALYLVCGERGFF-R63 (SEQ ID NO: 147), or
  • FVNQX 25 LCGSHLVEALYLVCGERGFF-R 6 3 (SEQ ID NO: 148), wherein X21 is alanine, glycine or asparagine;
  • X25 is selected from the group consisting of histidine and threonine
  • X 28 is proline, aspartic acid or glutamic acid
  • R 63 is selected from the group consisting of YTX 28 KT (SEQ ID NO: 149), YTKPT (SEQ ID NO: 150), YTX 28 K (SEQ ID NO: 152), YTKP (SEQ ID NO: 151), YTPK (SEQ ID NO: 70), YTX 2S , YT, Y and a bond.
  • the B chain comprises the sequence
  • X 25 is selected from the group consisting of histidine and threonine.
  • linking moieties can be used to link human insulin A and B chains, or analogs or derivatives thereof, wherein the carboxy terminus of the B25 amino acid of the B chain is directly linked to a first end of a linking moiety, wherein the second end of the linking moiety is directly linked to the amino terminus of the Al amino acid of the A chain via the intervening linking moiety.
  • the insulin peptide is a single chain insulin agonist that comprises the general structure B-LM-A wherein B represents an insulin B chain, A represents an insulin A chain, and LM represents a linking moiety linking the carboxy terminus of the B chain to the amino terminus of the A chain.
  • Suitable linking moieties for joining the B chain to the A chain are disclosed herein under the header Linking Moieties for Single Chain-Insulin Analogs and the respective subheaders "Peptide linkers".
  • the linking moiety comprises a linking peptide, and more particularly, in one embodiment the peptide represents an analog of the IGF-1 C peptide.
  • Additional exemplary peptide linkers include but are not limited to the sequence XsiX 52 GSSSX 57 X 58 (SEQ ID NO: 49) or X 51 X 52 GSSSX 57 X 58 APQT (SEQ ID NO: 50) wherein X 51 is selected from the group consisting of glycine, alanine, valine, leucine, isoleucine and proline, X52 is alanine, valine, leucine, isoleucine or proline and X57 or Xs 8 are independently arginine, lysine, cysteine, homocysteine, acetyl-phenylalanine or ornithine, optionally with a hydrophilic moiety linked to the side chain of the amino acid at position 7 or 8 of the linking moiety (i.e., at the X57 or Xs 8 position).
  • amino acid positions of the linking moiety are designated based on the corresponding position in the native C chain of IGF 1 (SEQ ID NO: 17).
  • the peptide linking moiety comprises a 29 contiguous amino acid sequence having greater than 70%, 80%, 90% sequence identity to
  • linking moiety is a non-peptide linker comprising a relatively short bifunctional non-peptide polymer linker that approximates the length of an 8-16 amino acid sequence.
  • non-peptide linker has the structure: wherein m is an integer ranging from
  • the linking moiety is linked directly to the B25 amino acid of the B chain.
  • the non-peptide linking moiety is a polyethylene glycol linker of approximately 4 to 20, 8 to 18, 8 to 16, 8 to 14, 8 to 12, 10 to 14, 10 to 12 or 11 to 13 monomers.
  • an NHR ligand-insulin conjugate comprising an insulin peptide having the structure: IB-LM-IA, wherein IB comprises the sequence R 6 2- X 25 LCGX 2 9X 3 oLVX 33 X 34 LYLVCGX 4 iX 42 GFX 4 5 (SEQ ID NO: 20), LM is a linking moiety as disclosed herein that covalently links IB to IA, and IA comprises the sequence
  • X 4 is glutamic acid or aspartic acid
  • X5 is glutamine or glutamic acid
  • X 8 is histidine or phenylalanine
  • X9 and X 14 are independently selected from arginine, lysine, ornithine or alanine;
  • X10 is isoleucine or serine
  • X12 is serine or aspartic acid
  • X 14 is tyrosine, arginine, lysine, ornithine or alanine;
  • Xi 5 is arginine, lysine, ornithine or leucine
  • X 17 is glutamic acid or glutamine
  • X 18 is methionine, asparagine or threonine
  • X19 is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;
  • X21 is alanine, glycine or asparagine
  • X25 is selected from the group consisting of histidine and threonine
  • X29 is selected from the group consisting of alanine, glycine and serine;
  • X30 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
  • X33 is selected from the group consisting of aspartic acid and glutamic acid;
  • X 34 is selected from the group consisting of alanine and threonine
  • X 4 i is selected from the group consisting of glutamic acid, aspartic acid or asparagine;
  • X 42 is selected from the group consisting of alanine, lysine, ornithine and arginine;
  • R 62 is selected from the group consisting of AYRPSE (SEQ ID NO: 14), FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptide glycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine, a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine, glutamine, glutamic acid and an N-terminal amine; and
  • R5 3 is COOH or CONH 2 , further wherein the amino acid at the designation X 4 5 is directly bound to the linking moiety, LM (i.e., the designation IB-LM-IA as used herein is intended to represent that the B chain carboxyl terminus and the amino terminus of the A chain are directly linked to the linking moiety LM without any further intervening amino acids).
  • LM i.e., the designation IB-LM-IA as used herein is intended to represent that the B chain carboxyl terminus and the amino terminus of the A chain are directly linked to the linking moiety LM without any further intervening amino acids.
  • the linking moiety comprises an amino acid sequence of no more than 17 amino acids in length.
  • the linking moiety comprises the sequence X51X52GSSSX57X58 (SEQ ID NO: 49) or X51X52GSSSX57X58APQT (SEQ ID NO: 50) wherein X 51 is selected from the group consisting of glycine, alanine, valine, leucine, isoleucine and proline, Xs 2 is alanine, valine, leucine, isoleucine or proline and X57 or X58 are independently arginine, lysine, cysteine, homocysteine, acetyl-phenylalanine or ornithine, optionally with a hydrophilic moiety linked to the side chain of the amino acid at position 7 or 8 of the linking moiety (i.e., at the X57 or X58 position).
  • LM is GAGS S S RR APQT (SEQ ID NO: 23) or GAGSSSRR (SEQ ID NO: 22).
  • the linking moiety comprises a 29 contiguous amino acid sequence, directly linked to the carboxy terminal amino acid of the B chain, wherein said 29 contiguous amino acid sequence has greater than 70%, 80%, 90% sequence identity to
  • linking peptide comprises a total of 29 to 158 or 29 to 58 amino acids and comprises the sequence of SEQ ID NO: 68.
  • linking moiety comprises a 29 contiguous amino acid sequence, directly linked to the carboxy terminal amino acid of the B chain, wherein said 29 contiguous amino acid sequence has greater than 90% sequence identity to
  • linking moiety comprises the sequence SSSSRAPPPSLPSPSRLPGPSDTPILPQK (SEQ ID NO: 51) or SSSSKAPPPSLPSPSRLPGPSDTPILPQR (SEQ ID NO: 52) optionally with one or two amino acid substitutions.
  • a single chain insulin agonist polypeptide comprising a B chain and A chain of human insulin, or analogs or derivative thereof, wherein the last five carboxy amino acids of the native B chain are deleted (i.e., B26-B30), and amino acid B25 is linked to amino acid Al of the A chain via an intervening linking moiety.
  • the linking moiety comprises the structure: wherein m is an integer ranging from 10 to 14 and the linking moiety is linked directly to the B25 amino acid of the B chain.
  • an NHR ligand-insulin conjugate comprising an insulin peptide having the general formula IB-LM-IA wherein IB comprises the sequence GPEHLCGAX 3 oLVDALYLVCGDX 42 GFYFNX 48 X 4 9 (SEQ ID NO: 163);
  • LM comprises the sequence SSSSRAPPPSLPSPSRLPGPSDTPILPQK (SEQ ID NO: 51), SSSSKAPPPSLPSPSRLPGPSDTPILPQR (SEQ ID NO: 52), GYGSSSRR (SEQ ID NO: 18), GAGS S S RRAPQT (SEQ ID NO: 23) or GAGSSSRR (SEQ ID NO: 22); and
  • IA comprises the sequence GIVDECCXsXgSCDLX ⁇ XisLXnXisXigCXii-Rss (SEQ ID NO: 35) wherein
  • X 8 is histidine or phenylalanine
  • X9 is arginine, ornithine or alanine
  • X 14 and Xi 5 are both arginine
  • X 17 is glutamic acid
  • Xis is methionine, asparagine or threonine
  • X19 is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;
  • X21 is alanine or asparagine
  • histidine or threonine is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
  • X 42 is selected from the group consisting of alanine, ornithine and arginine;
  • X 48 is lysine or aspartic acid
  • X 4 9 is proline, ornithine or arginine
  • R53 is COOH
  • the linking moiety is a peptide or peptidomimetic of 6-18, 8-18, 8-17, 8-12, 8-10, 13-17 or 13-15 amino acids (or amino acid analogs or derivatives thereof).
  • the linking moiety is 8 to 17 amino acids in length and comprises the sequence X 51 X 52 GSSSRR (SEQ ID NO: 53) wherein X51 is selected from the group consisting of glycine, alanine, valine, leucine, isoleucine, proline and methionine, and X52 is a non-aromatic amino acid, including for example, alanine.
  • the linking moiety is 8 to 17 amino acids in length and comprises a sequence that differs from X 51 X 52 GSSSRR (SEQ ID NO: 53) by a single amino acid substitution wherein the amino acid substitution is an amino acid that is pegylated at its side chain, further wherein X 51 is selected from the group consisting of glycine, alanine, valine, leucine, isoleucine, proline and methionine, and X52 is a non-aromatic amino acid, including for example, alanine.
  • the linking moiety is a derivative of the IGF 1 C chain sequence (GYGSSSRRAPQT; SEQ ID NO: 17).
  • the derivative is a peptide that differs from SEQ ID NO: 17 by a single amino acid substitution of a lysine, cysteine ornithine, homocysteine, or acetyl-phenylalanine residue, and in a further embodiment the lysine, cysteine ornithine, homocysteine, or acetyl-phenylalanine amino acid is pegylated.
  • the linking moiety is a peptide that differs from SEQ ID NO: 17 by a single lysine substitution.
  • the substitution is made at position 8 of SEQ ID NO: 17.
  • Applicants have discovered that use of the IGF 1 C chain sequence and analogs thereof as a linking moiety will generate a single chain insulin polypeptide that has near wild type insulin activity.
  • use of a IGF 1 C chain sequence analog as the linking moiety wherein position 2 of the IGF 1 C chain sequence is modified, or the carboxy terminal four amino acids are deleted from the IGF 1 C chain sequence, produces a single chain insulin polypeptide that is selective for insulin (i.e., has a higher binding and/or activity at the insulin receptor compared to the IGF-1 receptor).
  • the single chain insulin polypeptide has 5x, lOx, 20x, 30x, 40x, or 50x higher affinity or activity at the insulin receptor relative to the IGF- 1 receptor.
  • the linking moiety is a derivative of the IGF 1 C chain sequence (GYGSSSRRAPQT; SEQ ID NO: 17) and comprises a non-native sequence that differs from GYGSSSRR (SEQ ID NO: 18) or GAGSSSRRAPQT (SEQ ID NO: 23) by 1 to 3 amino acid substitutions, or 1 to 2 amino acid substitutions.
  • at least one of the amino acid substitutions is a lysine or cysteine substitution, and in one embodiment the amino acid substitutions are conservative amino acid substitutions.
  • the linking moiety is a peptide (or peptidomimetic) of 8 to 17 amino acids comprising a non-native amino acid sequence that differs from GYGSSSRR (SEQ ID NO: 18) or GAGSSSRRAPQT (SEQ ID NO: 23) by 1 amino acid substitution, including for example substitution with a lysine or cysteine.
  • the linking moiety comprises the sequence GYGSSSRR (SEQ ID NO: 18) or GAGSSSRRAPQT (SEQ ID NO: 23).
  • the linking moiety comprises the sequence GAGSSSRXsgAPQT (SEQ ID NO: 54), GYGSSSX 57 X 58 APQT (SEQ ID NO: 69), or an amino acid that differs from SEQ ID NO: 54 by a single amino acid substitution, wherein Xs 7 is arginine and Xs 8 is arginine, ornithine or lysine, and in a further embodiment a polyethylene glycol chain is linked to the side chain of the amino acid at position 8 of said linking moiety.
  • the linking moiety comprises the sequence GX 52 GSSSRX 58 APQT (SEQ ID NO: 55), wherein Xs 2 is any non-aromatic amino acid, including for example, alanine, valine, leucine, isoleucine or proline, and Xs 8 represents an amino acid that has a
  • Xs 8 is a pegylated lysine.
  • the linking moiety is an 8 to 17 amino acid sequence comprising the sequence GXs 2 GSSSRR (SEQ ID NO: 56), wherein Xs 2 is any amino acid, a peptidomimetic of SEQ ID NO: 31, or an analog thereof that differs from SEQ ID NO: 31 by a single amino acid substitution at any of positions 1, 3, 4, 5, 6, 7 or 8 of SEQ ID NO: 31, with the proviso that when the linking peptide is longer than 8 amino acids Xs 2 is other than tyrosine.
  • the linking moiety comprises an 8-17 amino acid sequence selected from the group consisting of GYGSSSRR (SEQ ID NO: 18), GAGSSSRR (SEQ ID NO: 22), GAGSSSRRA (SEQ ID NO: 57), GAGSSSRRAP (SEQ ID NO: 58), GAGSSSRRAPQ (SEQ ID NO: 59), GAGS S S RRAPQT (SEQ ID NO: 23), PYGSSSRR (SEQ ID NO: 61), PAGSSSRR (SEQ ID NO: 62), PAGSSSRRA (SEQ ID NO: 63), PAGSSSRRAP (SEQ ID NO: 64), PAGSSSRRAPQ (SEQ ID NO: 65),
  • the linking moiety comprises an amino acid sequence that differs from GYGSSSRR (SEQ ID NO: 18), GAGSSSRR (SEQ ID NO: 22), GAGSSSRRA (SEQ ID NO: 57), GAGSSSRRAP (SEQ ID NO: 58), GAGSSSRRAPQ (SEQ ID NO: 59), GAGS S S RRAPQT (SEQ ID NO: 23), PYGSSSRR (SEQ ID NO: 61), PAGSSSRR (SEQ ID NO: 62), PAGSSSRRA (SEQ ID NO: 63), PAGSSSRRAP (SEQ ID NO: 64), PAGSSSRRAPQ (SEQ ID NO: 65),
  • P AGS S S RRAPQT (SEQ ID NO: 66) by a single pegylated amino acid including for example a pegylated lysine or pegylated cysteine amino acid substitution.
  • the pegylated amino acid is at position 8 of the linking moiety.
  • CTP C-terminal peptide
  • SSSSKAPPPSLPSPSRLPGPSDTPILPQR SEQ ID NO: 52
  • SSSSKAPPPSLPSPSRLPGPSDTPILPQR SEQ ID NO: 52
  • the CTP peptide can be used to connect the B and A chains of insulin to form a single chain insulin analog while still maintaining high in vitro potency in a manner that the native proinsulin C- peptide can not.
  • a NHR ligand-insulin conjugate is prepared comprising an insulin peptide having the carboxy terminus of the B chain linked to the amino terminus of the A chain via a CTP peptide.
  • an insulin analog is provided as a two-chain construct with the CTP covalently linked to the C-terminus of the B-chain and/or the amino terminus of the B chain.
  • CTP modified insulin analogs In vitro and in vivo characterization reveals the CTP modified insulin analogs to have high potency in the absence of glycosylation, thus providing a mechanism to extend insulin action that is based on glycosylation, a natural approach to longer duration proteins.
  • the linking moiety comprises a peptide having a length of at least 18 amino acids that shares a similar amino acid content.
  • the linking moiety comprises an analog of (SEQ ID NO: 68), wherein said analog differs from (SEQ ID NO: 68) by 1, 2, 3, 4, 5 or 6 amino acid substitutions.
  • the linking peptide comprises a CTP peptide wherein amino acid substitutions are made at one or more positions selected from positions 1, 2, 3, 4, 10, 13, 15, and 21 of (SEQ ID NO: 68).
  • the linking moiety comprises a 29 contiguous amino acid sequence, directly linked to the carboxy terminal amino acid of the B chain, wherein said 29 contiguous amino acid sequence has greater than 60, 80 or 90% sequence identity to SSSSX 50 APPPSLPSPSRLPGPSDTPILPQX 5 i (SEQ ID NO: 68), with the proviso that the sequence does not comprise a 15 amino acid sequence identical to a 15 amino acid sequence contained within SEQ ID NO 53.
  • the linking moiety comprises a 29 contiguous amino acid sequence, directly linked to the carboxy terminal amino acid of the B chain, wherein at least 58% of the amino acids comprising the 29 contiguous amino acid sequence are selected from the group consisting of serine and proline.
  • the linking moiety comprises a 29 contiguous amino acid sequence, directly linked to the carboxy terminal amino acid of the B chain, wherein said 29 contiguous amino acid sequence has greater than 70%, 80%, 90% sequence identity to S S S S X50 APPPS LPS PS RLPGPS DTPILPQX 5 1 (SEQ ID NO: 68), wherein X 50 and X 51 are independently selected from arginine and lysine, with the proviso that the sequence does not comprise a 15 amino acid sequence identical to a 15 amino acid sequence contained within SEQ ID NO 53.
  • the linking moiety comprises a 29 contiguous amino acid sequence, directly linked to the carboxy terminal amino acid of the B chain, wherein said 29 contiguous amino acid sequence is an analog of (SEQ ID NO: 52), wherein said analog differs from (SEQ ID NO: 52) only by 1, 2, 3, 4, 5 or 6 amino acid modification, and in a further embodiment the amino acid modifications are conservative amino acid substitutions.
  • the linking moiety comprises a 29 contiguous amino acid sequence, directly linked to the carboxy terminal amino acid of the B chain, wherein said 29 contiguous amino acid sequence is an analog of (SEQ ID NO: 52), wherein said analog differs from (SEQ ID NO: 52) only by 1, 2 or 3 amino acid substitutions.
  • an insulin analog comprising a CTP peptide having the sequence
  • n is an integer selected from the group consisting of 1, 2, 3 and 4 and X50 and X51 are independently selected from arginine and lysine.
  • CTP peptide comprises a sequence selected from the group consisting of
  • SSSSRAPPPSLPSPSRLPGPSDTPILPQ (SEQ ID NO: 67)
  • the CTP peptide comprises the sequence SSSSRAPPPSLPSPSRLPGPSDTPILPQK (SEQ ID NO: 51).
  • L is a bond.
  • Q and Y are conjugated together by reacting a nucleophilic reactive moiety on Q with and electrophilic reactive moiety on Y.
  • Q and Y are conjugated together by reacting an electrophilic reactive moiety on Q with a nucleophilic moiety on Y.
  • L is an amide bond that forms upon reaction of an amine on Q (e.g. an ⁇ - amine of a lysine residue) with a carboxyl group on Y.
  • Q and or Y are derivatized with a derivatizing agent before conjugation.
  • L is a linking group. In some embodiments, L is a bifunctional linker and comprises only two reactive groups before conjugation to Q and Y. In embodiments where both Q and Y have electrophilic reactive groups, L comprises two of the same or two different nucleophilic groups (e.g. amine, hydroxyl, thiol) before conjugation to Q and Y. In embodiments where both Q and Y have nucleophilic reactive groups, L comprises two of the same or two different electrophilic groups (e.g. carboxyl group, activated form of a carboxyl group, compound with a leaving group) before conjugation to Q and Y. In embodiments where one of Q or Y has a nucleophilic reactive group and the other of Q or Y has an electrophilic reactive group, L comprises one nucleophilic reactive group and one electrophilic group before conjugation to Q and Y.
  • nucleophilic groups e.g. amine, hydroxyl, thiol
  • electrophilic reactive groups e.g. carboxy
  • L can be any molecule with at least two reactive groups (before conjugation to Q and Y) capable of reacting with each of Q and Y. In some embodiments L has only two reactive groups and is bifunctional. L (before conjugation to the peptides) can be represented by Formula VI:
  • W and J are independently nucleophilic or electrophilic reactive groups.
  • W and J are either both nucleophilic groups or both electrophilic groups.
  • one of W or J is a nucleophilic group and the other of W or J is an electrophilic group.
  • L comprises a chain of atoms from 1 to about 60, or 1 to 30 atoms or longer, 2 to 5 atoms, 2 to 10 atoms, 5 to 10 atoms, or 10 to 20 atoms long.
  • the chain atoms are all carbon atoms.
  • the chain atoms in the backbone of the linker are selected from the group consisting of C, O, N, and S. Chain atoms and linkers may be selected according to their expected solubility (hydrophilicity) so as to provide a more soluble conjugate.
  • L provides a functional group that is subject to cleavage by an enzyme or other catalyst or hydrolytic conditions found in the target tissue or organ or cell. In some embodiments, the length of L is long enough to reduce the potential for steric hindrance.
  • the linking group is hydrophilic such as, for example, polyalkylene glycol.
  • the hydrophilic linking group comprises at least two reactive groups (W and J), as described herein and as shown below:
  • the linking group is polyethylene glycol (PEG).
  • PEG polyethylene glycol
  • the PEG in certain embodiments has a molecular weight of about 100 Daltons to about 10,000 Daltons, e.g. about 500 Daltons to about 5000 Daltons.
  • the PEG in some embodiments has a molecular weight of about 10,000 Daltons to about 40,000 Daltons.
  • the hydrophilic linking group comprises either a maleimido or an iodoacetyl group and either a carboxylic acid or an activated carboxylic acid (e.g. NHS ester) as the reactive groups.
  • the maleimido or iodoacetyl group can be coupled to a thiol moiety on Q or Y and the carboxylic acid or activated carboxylic acid can be coupled to an amine on Q or Y with or without the use of a coupling reagent. Any appropriate coupling agent known to one skilled in the art can be used to couple the carboxylic acid with the amine.
  • the linking group is maleimido- PEG(20 kDa)-COOH, iodoacetyl-PEG(20 kDa)-COOH, maleimido-PEG(20 kDa)-NHS, or iodoacetyl-PEG(20 kDa)-NHS.
  • the linking group is comprised of an amino acid, a dipeptide, a tripeptide, or a polypeptide, wherein the amino acid, dipeptide, tripeptide, or polypeptide comprises at least two activating groups, as described herein.
  • the linking group (L) comprises a moiety selected from the group consisting of: amino, ether, thioether, maleimido, disulfide, amide, ester, thioester, alkene, cycloalkene, alkyne, trizoyl, carbamate, carbonate, cathepsin B-cleavable, and hydrazone.
  • the linking group is an amino acid selected from the group Asp, Glu, homoglutamic acid, homocysteic acid, cysteic acid, gamma-glutamic acid.
  • the linking group is a dipeptide selected from the group consisting of: Ala- Ala, ⁇ -Ala- ⁇ -Ala, Leu-Leu, Pro-Pro, ⁇ -aminobutyric acid- ⁇ -aminobutyric acid, and ⁇ -Glu- ⁇ -Glu.
  • L comprises gamma-glutamic acid.
  • an activating agent can be used to form an activated ester of the carboxylic acid.
  • the activated ester of the carboxylic acid can be, for example, N-hydroxysuccinimide (NHS), tosylate (Tos), mesylate, triflate, a carbodiimide, or a hexafluorophosphate.
  • the carbodiimide is 1,3-dicyclohexylcarbodiimide (DCC), 1,1'- carbonyldiimidazole (CDI), l-ethyl-3-(3-dimethylaminopropyl)carbodiimide hydrochloride (EDC), or 1,3-diisopropylcarbodiimide (DICD).
  • DCC 1,3-dicyclohexylcarbodiimide
  • CDI 1,1'- carbonyldiimidazole
  • EDC l-ethyl-3-(3-dimethylaminopropyl)carbodiimide hydrochloride
  • DICD 1,3-diisopropylcarbodiimide
  • hexafluorophosphate is selected from a group consisting of hexafluorophosphate benzotriazol- l-yl-oxy-tris(dimethylamino)phosphonium hexafluorophosphate (BOP), benzotriazol-l-yl- oxytripyrrolidinophosphonium hexafluorophosphate (PyBOP), 2-(lH-7- azabenzotriazol-l-yl)-l,l,3,3-tetramethyl uranium hexafluorophosphate (HATU), and o- benzotriazole-N,N,N',N'-tetramethyl-uronium-hexafluoro-phosphate (HBTU).
  • BOP benzotriazol- l-yl-oxy-tris(dimethylamino)phosphonium hexafluorophosphate
  • PyBOP benzotriazol-l-yl- oxytri
  • Q comprises a nucleophilic reactive group (e.g. the amino group, thiol group, or hydroxyl group of the side chain of lysine, cysteine or serine) that is capable of conjugating to an electrophilic reactive group on Y or L.
  • Q comprises an electrophilic reactive group (e.g. the carboxylate group of the side chain of Asp or Glu) that is capable of conjugating to a nucleophilic reactive group on Y or L.
  • Q is chemically modified to comprise a reactive group that is capable of conjugating directly to Y or to L.
  • Q is modified at the C-terminal to comprise a natural or nonnatural amino acid with a nucleophilic side chain, such as an amino acid represented by Formula I, Formula II, or Formula III, as previously described herein (see Acylation and alkylatiori).
  • the C-terminal amino acid of Q is selected from the group consisting of lysine, ornithine, serine, cysteine, and homocysteine.
  • the C-terminal amino acid of Q can be modified to comprise a lysine residue.
  • Q is modified at the C-terminal amino acid to comprise a natural or nonnatural amino acid with an electrophilic side chain such as, for example, Asp and Glu.
  • an internal amino acid of Q is substituted with a natural or nonnatural amino acid having a nucleophilic side chain, such as an amino acid represented by Formula I, Formula II, or Formula III, as previously described herein (see Acylation and alkylation).
  • the internal amino acid of Q that is substituted is selected from the group consisting of lysine, ornithine, serine, cysteine, and homocysteine.
  • an internal amino acid of Q can be substituted with a lysine residue.
  • an internal amino acid of Q is substituted with a natural or nonnatural amino acid with an electrophilic side chain, such as, for example, Asp and Glu.
  • Y comprises a reactive group that is capable of conjugating directly to Q or to L.
  • Y comprises a nucleophilic reactive group (e.g. amine, thiol, hydroxyl) that is capable of conjugating to an electrophilic reactive group on Q or L.
  • Y comprises electrophilic reactive group (e.g. carboxyl group, activated form of a carboxyl group, compound with a leaving group) that is capable of conjugating to a nucleophilic reactive group on Q or L.
  • L is stable in vivo. In some embodiments, L is stable in blood serum for at least 5 minutes, e.g. less than 25%, 20%, 15%, 10% or 5% of the conjugate is cleaved when incubated in serum for a period of 5 minutes. In other embodiments, L is stable in blood serum for at least 10, or 20, or 25, or 30, or 60, or 90, or 120 minutes, or 3, 4, 5, 6, 7, 8, 9, 10, 12, 15, 18 or 24 hours. In these embodiments, L does not comprise a functional group that is capable of undergoing hydrolysis in vivo. In some exemplary embodiments, L is stable in blood serum for at least about 72 hours. Nonlimiting examples of functional groups that are not capable of undergoing significant hydrolysis in vivo include amides, ethers, and thioethers. For example, the following compound is not capable of undergoing significant hydrolysis in vivo:
  • L is hydrolyzable in vivo.
  • L comprises a functional group that is capable of undergoing hydrolysis in vivo.
  • functional groups that are capable of undergoing hydrolysis in vivo include esters, anhydrides, and thioesters.
  • the following compound is capable of undergoing hydrolysis in vivo because it comprises an ester group:
  • L is labile and undergoes substantial hydrolysis within 3 hours in blood plasma at 37 °C, with complete hydrolysis within 6 hours. In some exemplary embodiments, L is not labile.
  • L is metastable in vivo.
  • L comprises a functional group that is capable of being chemically or enzymatically cleaved in vivo (e.g., an acid-labile, reduction-labile, or enzyme-labile functional group), optionally over a period of time.
  • L can comprise, for example, a hydrazone moiety, a disulfide moiety, or a cathepsin-cleavable moiety.
  • the Q-L-Y conjugate is stable in an extracellular environment, e.g., stable in blood serum for the time periods described above, but labile in the intracellular environment or conditions that mimic the intracellular environment, so that it cleaves upon entry into a cell.
  • L is stable in blood serum for at least about 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 42, or 48 hours, for example, at least about 48, 54, 60, 66, or 72 hours, or about 24-48, 48-72, 24-60, 36-48, 36-72, or 48-72 hours.
  • the insulin peptides disclosed herein are further modified to comprise a hydrophilic moiety covalently linked to the side chain of an amino acid at a position selected from the group consisting of A9, A14 and A15 of the A chain or at the N-terminal alpha amine of the B chain (e.g.
  • this hydrophilic moiety is covalently linked to a Lys, Cys, Orn, homocysteine, or acetyl- phenylalanine residue at any of these positions.
  • the hydrophilic moiety is covalently linked to the side chain of an amino acid of the linking moiety.
  • hydrophilic moieties include polyethylene glycol (PEG), for example, of a molecular weight of about 1,000 Daltons to about 40,000 Daltons, or about 20,000 Daltons to about 40,000 Daltons.
  • Additional suitable hydrophilic moieties include, polypropylene glycol, polyoxyethylated polyols (e.g., POG), polyoxyethylated sorbitol, polyoxyethylated glucose, polyoxyethylated glycerol (POG), polyoxyalkylenes, polyethylene glycol propionaldehyde, copolymers of ethylene glycol/propylene glycol, monomethoxy- polyethylene glycol, mono-(Cl-ClO) alkoxy- or aryloxy-polyethylene glycol,
  • polystyrene resin polyvinyl alcohol
  • PVA polyvinyl pyrrolidone
  • poly- 1, 3-dioxolane poly-l,3,6-trioxane
  • ethylene/maleic anhydride copolymer poly (beta-amino acids) (either homopolymers or random copolymers)
  • poly(n- vinyl pyrrolidone)polyethylene glycol polypropylene glycol homopolymers (PPG) and other polyakylene oxides, polypropylene oxide/ethylene oxide copolymers, colonic acids or other polysaccharide polymers, Ficoll or dextran and mixtures thereof.
  • the hydrophilic moiety e.g., polyethylene glycol chain in accordance with some embodiments has a molecular weight selected from the range of about 500 to about 40,000 Daltons.
  • the hydrophilic moiety, e.g. PEG has a molecular weight selected from the range of about 500 to about 5,000 Daltons, or about 1,000 to about 5,000 Daltons.
  • the hydrophilic moiety, e.g., PEG has a molecular weight of about 10,000 to about 20,000 Daltons.
  • the hydrophilic moiety, e.g., PEG has a molecular weight of about 20,000 to about 40,000 Daltons.
  • PEG has a molecular weight of about 20,000 Daltons.
  • an insulin peptide is provided wherein one or more amino acids of the analog are pegylated, and the combined molecular weight of the covalently linked PEG chains is about 20,000 Daltons.
  • dextrans are used as the hydrophilic moiety.
  • Dextrans are polysaccharide polymers of glucose subunits, predominantly linked by ocl-6 linkages. Dextran is available in many molecular weight ranges, e.g., about 1 kD to about 100 kD, or from about 5, 10, 15 or 20 kD to about 20, 30, 40, 50, 60, 70, 80 or 90 kD.
  • Linear or branched polymers are contemplated.
  • Resulting preparations of conjugates may be essentially monodisperse or polydisperse, and may have about 0.5, 0.7, 1, 1.2, 1.5 or 2 polymer moieties per peptide.
  • the hydrophilic moiety is a polyethylene glycol (PEG) chain, optionally linked to the side chain of an amino acid at a position selected from the group consisting of A9, A14 and A15 of the A chain, positions B l, B2, B 10, B22, B28 or B29 of the B chain, at the N-terminal alpha amine of the B chain, or at any position of the linking moiety of a single chain insulin analog that links the A chain and B chain, including for example at position C8.
  • the single chain insulin analog comprises a peptide linking moiety of 8 to 12 amino acids, wherein one of the amino acids of the linking moiety has a polyethylene chain covalently bound to its side chain.
  • the single chain insulin analog comprises a peptide linking moiety of 8 to 12 amino acids, wherein an amino acid of the linking moiety is pegylated and one or more amino acid at a position selected from the group consisting of A9, A14 and A15 of the A chain, positions B l, B2, B 10, B22, B28 or B29 of the B chain is also pegylated.
  • the total molecular weight of the covalently linked PEG chain(s) is about 20,000 Daltons.
  • a single chain insulin analog comprises a linking moiety of 8 to 12 amino acids, wherein one of the amino acids of the linking moiety has a 20,000 Dalton polyethylene chain covalently bound to its side chain.
  • an insulin analog comprises a peptide linking moiety of 8 to 12 amino acids, wherein one of the amino acids of the linking moiety has a polyethylene chain covalently bound to its side chain and a second PEG chain is linked to the N-terminal alpha amine of the B chain (e.g. at position B 1 for insulin based B chain or position B2 for IGF-1 based B chain) or at the side chain of an amino acid at position B 1, B2 and B29 of the B chain.
  • each PEG chain has a molecular weight of about 10,000 Daltons. In one embodiment when the PEG chain is linked to an 8 to 12 amino acid linking moiety, the PEG chain is linked at position C7 or C8 of the linking moiety and in one embodiment the PEG chain is linked at position C8 of the linking moiety. In one embodiment when two PEG chains are linked to the single chain insulin analog, with one PEG chain linked at position C8 and the second PEG is linked at A9, A14, A15, B l, B2, B 10, B22, B28 or B29.
  • Hydrophilic moieties such as polyethylene glycol can be attached to the NHR ligand- insulin conjugate under any suitable conditions used to react a protein with an activated polymer molecule. Any means known in the art can be used, including via acylation, reductive alkylation, Michael addition, thiol alkylation or other chemoselective
  • conjugation/ligation methods through a reactive group on the PEG moiety (e.g., an aldehyde, amino, ester, thiol, a-haloacetyl, maleimido or hydrazino group) to a reactive group on the target compound (e.g., an aldehyde, amino, ester, thiol, a-haloacetyl, maleimido or hydrazino group).
  • Activating groups which can be used to link the water soluble polymer to one or more proteins include without limitation sulfone, maleimide, sulfhydryl, thiol, triflate, tresylate, azidirine, oxirane and 5-pyridyl. If attached to the peptide by reductive alkylation, the polymer selected should have a single reactive aldehyde so that the degree of
  • the insulin analog is modified to comprise an acyl group.
  • the acyl group can be covalently linked directly to an amino acid of the NHR ligand-insulin conjugate, or indirectly to an amino acid of the NHR ligand-insulin conjugate via a spacer, wherein the spacer is positioned between the amino acid of the NHR ligand-insulin conjugate and the acyl group.
  • the NHR ligand-insulin conjugate may be acylated at the same amino acid position where a hydrophilic moiety is linked, or at a different amino acid position.
  • acylation may occur at any position including any of amino acid of the A or B chains as well as a position within the linking moiety, provided that the activity exhibited by the non-acylated NHR ligand-insulin conjugate is retained upon acylation.
  • Nonlimiting examples include acylation at positions A14 and A15 of the A chain, positions position B l for insulin based B chain or position B2 for IGF-1 based B chain or positions B 10, B22, B28 or B29 of the B chain or at any position of the linking moiety.
  • the insulin analog is modified to comprise an acyl group by direct acylation of an amine, hydroxyl, or thiol of a side chain of an amino acid of the NHR ligand-insulin conjugate.
  • the insulin analog is directly acylated through the side chain amine, hydroxyl, or thiol of an amino acid.
  • acylation is at position B28 or B29 (according to the amino acid numbering of the native insulin A and B chain sequences).
  • an insulin analog can be provided that has been modified by one or more amino acid substitutions in the A or B chain sequence, including for example at positions A14, A15, B l, B2, B IO, B22, B28 or B29 (according to the amino acid numbering of the native insulin A and B chain sequences) or at any position of the linking moiety with an amino acid comprising a side chain amine, hydroxyl, or thiol.
  • the direct acylation of the insulin peptide occurs through the side chain amine, hydroxyl, or thiol of the amino acid at position B28 or B29 (according to the amino acid numbering of the native insulin A and B chain sequences).
  • the acylated insulin analogs comprise a spacer between the peptide and the acyl group.
  • the NHR ligand-insulin conjugate is covalently bound to the spacer, which is covalently bound to the acyl group.
  • the insulin peptide is modified to comprise an acyl group by acylation of an amine, hydroxyl, or thiol of a spacer, which spacer is attached to a side chain of an amino acid at position B28 or B29 (according to the amino acid numbering of the A or B chain of native insulin), or at any position of the spacer moiety.
  • the amino acid of the NHR ligand-insulin conjugate to which the spacer is attached can be any amino acid comprising a moiety which permits linkage to the spacer.
  • an amino acid comprising a side chain -NH 2 , -OH, or -COOH e.g., Lys, Orn, Ser, Asp, or Glu is suitable.
  • the spacer between the NHR ligand-insulin conjugate and the acyl group is an amino acid comprising a side chain amine, hydroxyl, or thiol (or a dipeptide or tripeptide comprising an amino acid comprising a side chain amine, hydroxyl, or thiol).
  • the spacer comprises a hydrophilic bifunctional spacer.
  • the spacer comprises an amino poly(alkyloxy)carboxylate.
  • the spacer can comprise, for example, NH 2 (CH 2 CH 2 0) n (CH 2 ) m COOH, wherein m is any integer from 1 to 6 and n is any integer from 2 to 12, such as, e.g., 8-amino-3,6-dioxaoctanoic acid, which is commercially available from Peptides International, Inc. (Louisville, KY).
  • the hydrophilic bifunctional spacer comprises two or more reactive groups, e.g., an amine, a hydroxyl, a thiol, and a carboxyl group or any combinations thereof.
  • the hydrophilic bifunctional spacer comprises a hydroxyl group and a carboxylate.
  • the hydrophilic bifunctional spacer comprises an amine group and a carboxylate. In other embodiments, the hydrophilic bifunctional spacer comprises a thiol group and a carboxylate. In some embodiments, the spacer between peptide the NHR ligand-insulin conjugate and the acyl group is a hydrophobic bifunctional spacer. Hydrophobic bifunctional spacers are known in the art. See, e.g., Bioconjugate Techniques, G. T. Hermanson (Academic Press, San Diego, CA, 1996), which is incorporated by reference in its entirety.
  • the bifunctional spacer can be a synthetic or naturally occurring amino acid comprising an amino acid backbone that is 3 to 10 atoms in length (e.g., 6-amino hexanoic acid, 5-aminovaleric acid, 7-aminoheptanoic acid, and 8- aminooctanoic acid).
  • the spacer can be a dipeptide or tripeptide spacer having a peptide backbone that is 3 to 10 atoms (e.g., 6 to 10 atoms) in length.
  • Each amino acid of the dipeptide or tripeptide spacer attached to the NHR ligand-insulin conjugate can be independently selected from the group consisting of: naturally-occurring and/or non- naturally occurring amino acids, including, for example, any of the D or L isomers of the naturally-occurring amino acids (Ala, Cys, Asp, Glu, Phe, Gly, His, He, Lys, Leu, Met, Asn, Pro, Arg, Ser, Thr, Val, Trp, Tyr), or any D or L isomers of the non-naturally occurring amino acids selected from the group consisting of: ⁇ -alanine ( ⁇ -Ala), N-a-methyl- alanine (Me-Ala), aminobutyric acid (Abu), a-aminobutyric acid ( ⁇ -Abu), aminohexanoic acid ( ⁇ - Ahx), aminoisobutyric acid (Aib), aminomethylpyrrole carboxylic acid,
  • aminopiperidinecarboxylic acid aminoserine (Ams), aminotetrahydropyran-4-carboxylic acid, arginine N-methoxy-N-methyl amide, ⁇ -aspartic acid ( ⁇ -Asp), azetidine carboxylic acid, 3-(2-benzothiazolyl)alanine, a-tert-butylglycine, 2-amino-5-ureido-n-valeric acid (citrulline, Cit), ⁇ -Cyclohexylalanine (Cha), acetamidomethyl-cysteine, diaminobutanoic acid (Dab), diaminopropionic acid (Dpr), dihydroxyphenylalanine (DOPA),
  • DMT A dimethylthiazolidine
  • ⁇ -Glutamic acid ⁇ -Glu
  • homoserine Hse
  • hydroxyproline Hyp
  • isoleucine N-methoxy-N-methyl amide methyl-isoleucine (MeHe)
  • isonipecotic acid Isn
  • MeLeu methyl-lysine
  • dimethyl-lysine trimethyl-lysine
  • methanoproline methionine- sulfoxide (Met(O)), methionine-sulfone (Met(02)), norleucine (Nle), methyl-norleucine (Me-Nle), norvaline (Nva), ornithine (Orn), para-aminobenzoic acid (PABA), penicillamine (Pen), methylphenylalanine (MePhe), 4-Chlorophenylalanine (Phe(4-Cl)), 4-fluorophenylalanine (Phe(4-F)), 4-nitrophenylalanine (Phe(4-N02)), 4- cyanophenylalanine ((Phe(4-CN)), phenylglycine (Phg), piperidinylalanine,
  • piperidinylglycine 3,4-dehydroproline, pyrrolidinylalanine, sarcosine (Sar), selenocysteine (Sec), U-Benzyl-phosphoserine, 4-amino-3-hydroxy-6-methylheptanoic acid (Sta), 4-amino- 5-cyclohexyl-3-hydroxypentanoic acid (ACHPA), 4-amino-3-hydroxy-5-phenylpentanoic acid (AHPPA), l,2,3,4,-tetrahydro-isoquinoline-3-carboxylic acid (Tic), tetrahydropyranglycine, thienylalanine (Thi) , U-Benzyl-phosphotyrosine, O- Phosphotyrosine, methoxytyrosine, ethoxytyrosine, O-(bis-dimethylamino-phosphono)- tyrosine, tyrosine sulfate
  • the dipeptide spacer is selected from the group consisting of: Ala-Ala, ⁇ -Ala- ⁇ -Ala, Leu-Leu, Pro-Pro, ⁇ -aminobutyric acid- ⁇ -aminobutyric acid, and ⁇ -Glu- ⁇ -Glu.
  • the peptide the NHR ligand-insulin conjugate can be modified to comprise an acyl group by acylation of a long chain alkane of any size and can comprise any length of carbon chain.
  • the long chain alkane can be linear or branched.
  • the long chain alkane is a C 4 to C30 alkane.
  • the long chain alkane can be any of a C 4 alkane, C 6 alkane, C 8 alkane, C10 alkane, C 12 alkane, C 14 alkane, C 16 alkane, C 18 alkane, C 2 o alkane, C 22 alkane, C 24 alkane, C 3 ⁇ 4 alkane, C 28 alkane, or a C30 alkane.
  • the long chain alkane comprises a C 8 to C 2 o alkane, e.g., a C 14 alkane, C 16 alkane, or a C 18 alkane.
  • an amine, hydroxyl, or thiol group of the NHR ligand-insulin conjugate is acylated with a cholesterol acid.
  • the peptide is linked to the cholesterol acid through an alkylated des-amino Cys spacer, i.e., an alkylated 3- mercaptopropionic acid spacer. Suitable methods of peptide acylation via amines, hydroxyls, and thiols are known in the art.
  • the acyl group of the acylated peptide the NHR ligand-insulin conjugate can be of any size, e.g., any length carbon chain, and can be linear or branched.
  • the acyl group is a C 4 to C30 fatty acid.
  • the acyl group can be any of a C 4 fatty acid, C 6 fatty acid, C 8 fatty acid, C 10 fatty acid, C 12 fatty acid, C 14 fatty acid, C 16 fatty acid, C 18 fatty acid, C 2 o fatty acid, C 22 fatty acid, C 24 fatty acid, C 3 ⁇ 4 fatty acid, C 28 fatty acid, or a C30 fatty acid.
  • the acyl group is a C 8 to C 2 o fatty acid, e.g., a C 14 fatty acid or a C 16 fatty acid.
  • the NHR ligand-insulin conjugate comprises an amino acid with a side chain covalently attached, optionally through a spacer, to an acyl group or an alkyl group, wherein the acyl group or alkyl group is non-native to a naturally-occurring amino acid.
  • the covalently linked acyl or alkyl group has a carboxylate at its free end.
  • the acyl group in some embodiments is a C4 to C30 fatty acyl group, optionally with carboxylate groups at each end.
  • the NHR ligand- insulin conjugate comprises a covalently linked C4 to C30 acyl group optionally with a carboxylate at its free end.
  • the acyl group or alkyl group is covalently attached to the side chain of an amino acid of the insulin peptide at position B28 or B29.
  • the acyl group is a bile acid.
  • the bile acid can be any suitable bile acid, including, but not limited to, cholic acid, chenodeoxycholic acid, deoxycholic acid, lithocholic acid, taurocholic acid, glycocholic acid, and cholesterol acid.
  • the NHR ligand-insulin conjugate is modified to comprise an alkyl group.
  • the alkyl group can be covalently linked directly to an amino acid of the insulin analog, or indirectly to an amino acid of the NHR ligand-insulin conjugate via a spacer, wherein the spacer is positioned between the amino acid of the NHR ligand-insulin conjugate and the alkyl group.
  • the alkyl group can be attached to the NHR ligand-insulin conjugate via an ether, thioether, or amino linkage.
  • the NHR ligand-insulin conjugate may be alkylated at the same amino acid position where a hydrophilic moiety is linked, or at a different amino acid position.
  • Alkylation can be carried out at any position within the NHR ligand-insulin conjugate, including for example in the C-terminal region of the B chain or at a position in the linking moiety, provided that insulin activity is retained.
  • the NHR ligand-insulin conjugate is modified to comprise an alkyl group by direct alkylation of an amine, hydroxyl, or thiol of a side chain of an amino acid of the NHR ligand-insulin conjugate.
  • the NHR ligand-insulin conjugate is directly alkylated through the side chain amine, hydroxyl, or thiol of an amino acid.
  • the direct alkylation of the NHR ligand-insulin conjugate occurs through the side chain amine, hydroxyl, or thiol of the amino acid at position A14, A15, B l (for insulin based B chains), B2 (for IGF-1 based B chains), B IO, B22, B28 or B29 (according to the amino acid numbering of the A and B chain of native insulin).
  • the NHR ligand-insulin conjugate comprises a spacer between the peptide and the alkyl group. In some embodiments, the NHR ligand- insulin conjugate is covalently bound to the spacer, which is covalently bound to the alkyl group.
  • the NHR ligand-insulin conjugate is modified to comprise an alkyl group by alkylation of an amine, hydroxyl, or thiol of a spacer, wherein the spacer is attached to a side chain of an amino acid at position A14, A15, B l (for insulin based B chains), B2 (for IGF-1 based B chains), B IO, B22, B28 or B29 (according to the amino acid numbering of the A and B chains of native insulin).
  • the amino acid of the NHR ligand-insulin conjugate to which the spacer is attached can be any amino acid (e.g., a singly a-substituted amino acid or an ⁇ , ⁇ -disubstituted amino acid) comprising a moiety which permits linkage to the spacer.
  • An amino acid of the NHR ligand-insulin conjugate comprising a side chain -N3 ⁇ 4, -OH, or -COOH e.g., Lys, Orn, Ser, Asp, or Glu
  • the spacer between the peptide the NHR ligand-insulin conjugate and the alkyl group is an amino acid comprising a side chain amine, hydroxyl, or thiol or a dipeptide or tripeptide comprising an amino acid comprising a side chain amine, hydroxyl, or thiol.
  • the spacer amino acid can be any amino acid.
  • the spacer amino acid can be a hydrophobic amino acid, e.g., Gly, Ala, Val, Leu, He, Trp, Met, Phe, Tyr.
  • the spacer amino acid can be an acidic residue, e.g., Asp and Glu.
  • the spacer amino acid can be a hydrophobic amino acid, e.g., Gly, Ala, Val, Leu, He, Trp, Met, Phe, Tyr, 6-amino hexanoic acid, 5-aminovaleric acid, 7-aminoheptanoic acid, 8-aminooctanoic acid.
  • a hydrophobic amino acid e.g., Gly, Ala, Val, Leu, He, Trp, Met, Phe, Tyr, 6-amino hexanoic acid, 5-aminovaleric acid, 7-aminoheptanoic acid, 8-aminooctanoic acid.
  • the spacer amino acid can be an acidic residue, e.g., Asp and Glu, provided that the alkylation occurs on the alpha amine of the acidic residue.
  • the spacer amino acid is an amino acid comprising a side chain amine, e.g., an amino acid of Formula I (e.g., Lys or Orn).
  • the alpha amine and the side chain amine of the spacer amino acid it is possible for both the alpha amine and the side chain amine of the spacer amino acid to be alkylated, such that the peptide is dialkylated.
  • Embodiments of the invention include such dialkylated molecules.
  • the spacer comprises a hydrophilic bifunctional spacer.
  • the spacer comprises an amino poly(alkyloxy)carboxylate.
  • the spacer can comprise, for example, NH 2 (CH 2 CH 2 0) n (CH 2 ) m COOH, wherein m is any integer from 1 to 6 and n is any integer from 2 to 12, such as, e.g., 8-amino-3,6- dioxaoctanoic acid, which is commercially available from Peptides International, Inc.
  • the spacer between peptide the NHR ligand-insulin conjugate and the alkyl group is a hydrophilic bifunctional spacer.
  • the hydrophilic bifunctional spacer comprises two or more reactive groups, e.g., an amine, a hydroxyl, a thiol, and a carboxyl group or any combinations thereof.
  • the hydrophilic bifunctional spacer comprises a hydroxyl group and a carboxylate. In other embodiments, the hydrophilic bifunctional spacer comprises an amine group and a carboxylate. In other embodiments, the hydrophilic bifunctional spacer comprises a thiol group and a carboxylate.
  • the spacer e.g., amino acid, dipeptide, tripeptide, hydrophilic bifunctional spacer, or hydrophobic bifunctional spacer
  • the spacer is 3 to 10 atoms (e.g., 6 to 10 atoms, (e.g., 6, 7, 8, 9, or 10 atoms)) in length.
  • the spacer is about 3 to 10 atoms (e.g., 6 to 10 atoms) in length and the alkyl is a C 12 to C 18 alkyl group, e.g., C 14 alkyl group, C 16 alkyl group, such that the total length of the spacer and alkyl group is 14 to 28 atoms, e.g., about 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, or 28 atoms.
  • the alkyl is a C 12 to C 18 alkyl group, e.g., C 14 alkyl group, C 16 alkyl group, such that the total length of the spacer and alkyl group is 14 to 28 atoms, e.g., about 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, or 28 atoms.
  • the length of the spacer and alkyl is 17 to 28 (e.g., 19 to 26, 19 to 21) atoms.
  • the bifunctional spacer is a synthetic or non- naturally occurring amino acid comprising an amino acid backbone that is 3 to 10 atoms in length (e.g., 6-amino hexanoic acid, 5-aminovaleric acid, 7-aminoheptanoic acid, and 8- aminooctanoic acid).
  • the spacer can be a dipeptide or tripeptide spacer having a peptide backbone that is 3 to 10 atoms (e.g., 6 to 10 atoms) in length.
  • the dipeptide or tripeptide spacer attached to the NHR ligand-insulin conjugate can be composed of naturally-occurring and/or non-naturally occurring amino acids, including, for example, any of the amino acids taught herein.
  • the spacer comprises an overall negative charge, e.g., comprises one or two negatively charged amino acids.
  • the dipeptide spacer is selected from the group consisting of: Ala- Ala, ⁇ -Ala- ⁇ -Ala, Leu-Leu, Pro-Pro, ⁇ -aminobutyric acid- ⁇ -aminobutyric acid, and ⁇ -Glu- ⁇ -Glu.
  • the dipeptide spacer is ⁇ -Glu- ⁇ -Glu.
  • Suitable methods of peptide alkylation via amines, hydroxyls, and thiols are known in the art.
  • a Williamson ether synthesis can be used to form an ether linkage between the insulin peptide and the alkyl group.
  • a nucleophilic substitution reaction of the peptide with an alkyl halide can result in any of an ether, thioether, or amino linkage.
  • the alkyl group of the alkylated peptide the NHR ligand-insulin conjugate can be of any size, e.g., any length carbon chain, and can be linear or branched.
  • the alkyl group is a C 4 to C 3 o alkyl.
  • the alkyl group can be any of a C 4 alkyl, C 6 alkyl, C 8 alkyl, Cio alkyl, C i2 alkyl, C u alkyl, C l6 alkyl, C i8 alkyl, C 20 alkyl,
  • the alkyl group is a C 8 to C 2 o alkyl, e.g., a C 14 alkyl or a C 16 alkyl.
  • the alkyl group comprises a steroid moiety of a bile acid, e.g., cholic acid, chenodeoxycholic acid, deoxycholic acid, lithocholic acid, taurocholic acid, glycocholic acid, and cholesterol acid.
  • a bile acid e.g., cholic acid, chenodeoxycholic acid, deoxycholic acid, lithocholic acid, taurocholic acid, glycocholic acid, and cholesterol acid.
  • the long chain alkane When a long chain alkane is alkylated by the NHR ligand-insulin conjugate or the spacer, the long chain alkane may be of any size and can comprise any length of carbon chain.
  • the long chain alkane can be linear or branched.
  • the long chain alkane is a C 4 to C 3 o alkane.
  • the long chain alkane can be any of a C 4 alkane, C 6 alkane, C 8 alkane, Cio alkane, C 12 alkane, C 14 alkane, C 16 alkane, C 18 alkane, C 2 o alkane,
  • the long chain alkane comprises a C 8 to C 2 o alkane, e.g., a C 14 alkane, C 16 alkane, or a C 18 alkane.
  • alkylation can occur between the insulin analog and a cholesterol moiety.
  • the hydroxyl group of cholesterol can displace a leaving group on the long chain alkane to form a cholesterol-insulin peptide product.
  • the insulin peptide of the conjugates disclosed herein are further modified to comprise a self cleaving dipeptide element.
  • the dipeptide element comprises the structure U-J, wherein U is an amino acid or a hydroxyl acid and J is an N-alkylated amino acid.
  • one or more dipeptide elements are linked to the NHR ligand-insulin conjugate through an amide bond formed through one or more amino groups selected from the N-terminal amino group of the A or B chain of the insulin component, or the side chain amino group of an amino acid present in the conjugate.
  • one or more dipeptide elements are linked to the NHR ligand-insulin conjugate at an amino group selected from the N-terminal amino group of the conjugate, or the side chain amino group of an aromatic amine of a 4-amino-phenylalanine residue present at a position corresponding to position A19, B 16 or B25 of native insulin, or a side chain of an amino acid of the linking moiety of a single chain insulin analog.
  • Ri , R 2, R 4 and R 8 are independently selected from the group consisting of H, Ci-Ci 8 alkyl, C 2 -Ci 8 alkenyl, (Ci-Cis alkyl)OH, (Ci-Cis alkyl)SH, (C 2 -C 3 alkyl)SCH 3 , (d-C 4 alkyl)CONH 2 , (d-C 4 alkyl)COOH, (d-C 4 alkyl)NH 2 , (Ci-C 4 alkyl)NHC(NH 2 + )NH 2 , (C 0 -C 4 alkyl)(C 3 -C 6 cycloalkyl), (C 0 -C 4 alkyl)(C 2 -C 5 heterocyclic), (C 0 -C 4 alkyl)(C 6 -Ci 0 aryl)R 7 , (Ci-C 4 alkyl)(C 3 -C 9 heteroaryl), and C1-C12 alkyl(W)C
  • R 3 is selected from the group consisting of Ci-Ci 8 alkyl, (Ci-Ci 8 alkyl)OH, (Ci-Ci 8 alkyl)NH 2 , (Ci-Cis alkyl)SH, (C 0 -C 4 alkyl)(C 3 -C 6 )cycloalkyl, (C 0 -C 4 alkyl)(C 2 -C 5 heterocyclic), (C 0 -C 4 alkyl)(C 6 -Cio aryl)R 7 , and (C 1 -C 4 alkyl)(C 3 -C9 heteroaryl) or R 4 and R 3 together with the atoms to which they are attached form a pyrrolidine ring;
  • R 5 is NHR 6 or OH
  • R 6 is H, Ci-C 8 alkyl
  • R 7 is selected from the group consisting of H and OH.
  • the prodrug element is linked to the N-terminal amine of the NHR ligand-insulin conjugate and R 4 and R 3 together with the atoms to which they are attached form a pyrrolidine ring, then at least one of Ri and R 2 are other than H.
  • a complex is provided comprising the general structure U-J-(Q- L-Y), wherein Q-L-Y comprises any of the structures as described elsewhere in this disclosure and U-J is a dipeptide that is linked via an amide bond to an amine of the Q-L-Y conjugate.
  • U-J is linked to amine present on the insulin peptide.
  • U-J is linked to the N-terminal alpha amine of the A or B chain of the insulin peptide of the conjugate.
  • a complex of the structure U-J-(Q-L-Y) is provided, wherein Q- L-Y comprises any of the structures as described elsewhere in this disclosure and wherein U is an amino acid or a hydroxy acid;
  • U-J is an N-alkylated amino acid linked to Q through an amide bond between a carboxyl moiety of B and an amine of Q.
  • U-J comprises the structure:
  • R 1 , R2 , R 4 and R 8 are independently selected from the group consisting of H, CI -CI 8 alkyl, C2-C18 alkenyl, (CI -CI 8 alkyl)OH, (CI -CI 8 alkyl)SH, (C2-C3 alkyl)SCH 3 , (C1-C4 alkyl)CONH 2 , (C1-C4 alkyl)COOH, (C1-C4 alkyl)NH 2 , (C1-C4
  • R 1 and R 2 together with the atoms to which they are attached form a C3-C12 cycloalkyl or aryl;
  • R 3 is selected from the group consisting of C 1 -C 18 alkyl, (C 1 -C 18 alkyl)OH,
  • R 5 is NHR 6 or OH;
  • R is H, Ci-Cg alkyl;
  • R is selected from the group consisting of H and OH
  • U-J comprises the structure:
  • Ri and R 8 are independently H or Ci-C 8 alkyl
  • R 2 and R 4 are independently selected from the group consisting of H, Ci-C 8 alkyl, (C 1 -C4 alkyl)OH, (C 1 -C4 alkyl)SH, (C 2 -C 3 alkyl)SCH 3 , (C 1 -C4 alkyl)CONH 2 , (C 1 -C4 alkyl)COOH, (C 1 -C4 alkyl)NH 2 , and (d-C 4 alkyl)(C 6 aryl)R 7 ;
  • R 3 is Ci-C 6 alkyl
  • R 5 is NH 2 ;
  • R 7 is selected from the group consisting of hydrogen, and OH.
  • U-J comprises the structure:
  • R 2 is H, C1-C4 alkyl, (CH 2 alkyl)OH, (C1-C4 alkyl)NH 2 , or (CH 2 )(C 6 aryl)R 7 ;
  • R 3 is Ci-C 6 alkyl;
  • R 4 is H, C 1 -C4 alkyl, or (CH 2 )(C 6 aryl)R 7 ;
  • R 5 is NH 2 ;
  • R 8 is hydrogen
  • R 7 is H or OH.
  • U-J comprises the structure:
  • Ri is H or Ci-C 4 alkyl
  • R 2 is H, Ci-C 4 alkyl, or (C1-C4 alkyl)NH 2 ;
  • R 3 is Ci-C 6 alkyl
  • R 4 is H, or C 1 -C 4 alkyl
  • R 5 is NH 2 ;
  • R 8 is hydrogen.
  • Ri is Ci-C 4 alkyl
  • R 2 is Ci-C 4 alkyl
  • R 3 is C C 6 alkyl
  • R 4 and R 8 are each H and R5 is NH 2 .
  • R 2 is Ci-C 4 alkyl, or (C 1 -C4 alkyl)NH 2 alkyl
  • R 3 is Ci-C 6 alkyl
  • R ls R 4 and R 8 are each H and R 5 is NH 2 .
  • compositions comprising the NHR ligand-insulin conjugates disclosed herein can be formulated and administered to patients using standard
  • the present disclosure also encompasses pharmaceutical compositions comprising one or more of the NHR ligand-insulin conjugates disclosed herein or a pharmaceutically acceptable salt thereof, in combination with a pharmaceutically acceptable carrier.
  • the pharmaceutical composition comprises a lmg/ml concentration of the NHR ligand-insulin conjugate at a pH of about 4.0 to about 7.0 in a phosphate buffer system.
  • the pharmaceutical compositions may comprise the NHR ligand- insulin conjugate as the sole pharmaceutically active component, or the NHR ligand-insulin conjugate peptide can be combined with one or more additional active agents.
  • NHR ligand-insulin conjugate peptides include all pharmaceutically acceptable salts thereof.
  • the kit is provided with a device for administering the NHR ligand-insulin conjugate to a patient.
  • the kit may further include a variety of containers, e.g., vials, tubes, bottles, and the like.
  • the kits will also include instructions for use.
  • the device of the kit is an aerosol dispensing device, wherein the composition is prepackaged within the aerosol device.
  • the kit comprises a syringe and a needle, and in one embodiment the NHR ligand-insulin conjugate composition is prepackaged within the syringe.
  • the compounds of this invention may be prepared by standard synthetic methods, recombinant DNA techniques, or any other methods of preparing peptides and fusion proteins. Although certain non-natural amino acids cannot be expressed by standard recombinant DNA techniques, techniques for their preparation are known in the art.
  • an insulin peptide/NHR ligand conjugate wherein the conjugate comprises the structure Q-L-Y;
  • Q is an insulin peptide
  • Y is a NHR ligand
  • L is a linking group or a bond
  • conjugate has activity at both a nuclear hormone receptor and an insulin receptor, wherein said NHR ligand is
  • Q-L and further has a molecular weight of up to about 1000 daltons;
  • a ligand that activates the peroxisome proliferator-activated receptors wherein the nuclear hormone receptor ligand is covalently linked to the insulin peptide through the side chain of a lysine residue located at a position selected from the group consisting of B28 and B29, or the N-terminal alpha amine of the A or B chain, optionally the NHR ligand is linked to the carboxy terminus of the insulin peptide B chain.
  • PPAR peroxisome proliferator-activated receptors
  • the NHR ligand is a ligand that activates the thyroid hormone receptor or activates the peroxisome proliferator-activated receptors, wherein the NHR ligand is linked via the side chain of a lysine residue located at the C-terminus of the insulin B chain.
  • Y is selected from the group consisting of estradiol and derivatives thereof, estrone and derivatives thereof, testosterone and derivatives thereof, and Cortisol and derivatives thereof.
  • Ri 5 is Ci-C 4 alkyl, -CH 2 (pyridazinone), -CH 2 (OH)(phenyl)F, -CH(OH)CH 3 , halo or H; R 2 o is halo, CH 3 or H;
  • R 2 i is halo, CH 3 or H
  • R 22 is H, OH, halo, -CH 2 (OH)(C 6 aryl)F, or C1-C4 alkyl;
  • R 23 is -CH 2 CH(NH 2 )COOH, -OCH 2 COOH, -NHC(0)COOH, -CH 2 COOH,
  • Ri 5 is C 1 -C4 alkyl, -CH(OH)CH 3 , 1 or H
  • R 20 is I, Br, CH 3 or H
  • R 2 i is I, Br, CH 3 or H
  • R 22 is H, OH, I, or C 1 -C4 alkyl
  • R 23 is -CH 2 CH(NH 2 )COOH, -OCH 2 COOH, -NHC(0)COOH, -CH 2 COOH,
  • Ri 5 is C 1 -C4 alkyl, l or H;
  • R 20 is I, Br, CH 3 or H
  • R 2 i is I, Br, CH 3 or H
  • R 22 is H, OH, I, or Ci-C 4 alkyl; and R 23 is -CH 2 CH(NH 2 )COOH, -OCH 2 COOH, -NHC(0)COOH, -CH 2 COOH,
  • R20, R21, and R 22 are independently selected from the group consisting of H, OH, halo and Ci-C 4 alkyl;
  • Ri5 is halo or H.
  • Y is selected from the group consisting of 3,5,3',5'-tetra-iodothyronine, and 3,5,3'-triiodo L-thyronine.
  • Y is selected from the group consisting of Tesaglitazar, Aleglitazar and thiazolidinediones, optionally wherein Y is Tesaglitazar or Aleglitazar, optionally Y is Tesaglitazar linked to the N-terminal amine of the insulin B chain.
  • the conjugate of any one of embodiments 1-9 is provided wherein said insulin peptide (Q) comprises an A chain and a B chain wherein said A chain comprises a sequence
  • GIVX 4 X5CCX8X 9 XioCXi2LXi 4 Xi5LXi7Xi8YCX2i-R5 3 (SEQ ID NO: 19)
  • said B chain comprises a sequence R 6 2-X 25 LCGX 2 9X 3 oLVX 33 X 34 LYLVCGX 4 iX 42 GFX 4 5 (SEQ ID NO: 20), wherein
  • X 4 is glutamic acid or aspartic acid
  • X5 is glutamine or glutamic acid
  • X 8 is histidine, threonine or phenylalanine
  • X9 is serine, arginine, lysine, ornithine or alanine
  • X10 is isoleucine or serine
  • X12 is serine or aspartic acid
  • X 14 is tyrosine, arginine, lysine, ornithine or alanine;
  • Xi5 is glutamine, glutamic acid, arginine, alanine, lysine, ornithine or leucine
  • X 17 is glutamic acid, aspartic acid, asparagine, lysine, ornithine or glutamine
  • X 18 is methionine, asparagine, glutamine, aspartic acid, glutamic acid or threonine;
  • X21 is selected from the group consisting of alanine, glycine, serine, valine, threonine, isoleucine, leucine, glutamine, glutamic acid, asparagine, aspartic acid, histidine, tryptophan, tyrosine, and methionine;
  • X25 is histidine or threonine
  • X29 is selected from the group consisting of alanine, glycine and serine;
  • X 30 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
  • X 33 is selected from the group consisting of aspartic acid and glutamic acid;
  • X 34 is selected from the group consisting of alanine and threonine
  • X 4 i is selected from the group consisting of glutamic acid, aspartic acid or asparagine;
  • X 4 2 is selected from the group consisting of alanine, ornithine, lysine and arginine;
  • X 4 5 is tyrosine or phenylalanine;
  • R 62 is selected from the group consisting of AYRPSE (SEQ ID NO: 14), FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptide glycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine, a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine, glutamine, glutamic acid and an N-terminal amine; and
  • R 53 is COOH or CONH 2 .
  • the conjugate of any one of embodiments 1-10 is provided wherein said insulin peptide comprises an A chain and a B chain and said A chain comprises the sequence GIVEQCCX 8 X 9 ICSLYQLENYCX 2 i-R53 (SEQ ID NO: 73) said B chain comprises the sequence R 6 2-X 25 LCGX 2 9X 3 oLVX 33 X 34 LYLVCGX 4 iX 42 GFX 4 5 (SEQ ID NO: 20), wherein
  • X 8 is histidine or threonine
  • X9 is serine, lysine, or alanine
  • X21 is alanine, glycine or asparagine
  • X25 is histidine or threonine
  • X29 is selected from the group consisting of alanine, glycine and serine;
  • X 3 o is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
  • X 33 is selected from the group consisting of aspartic acid and glutamic acid;
  • X 34 is selected from the group consisting of alanine and threonine;
  • X 4 i is selected from the group consisting of glutamic acid, aspartic acid or asparagine;
  • X 42 is selected from the group consisting of alanine, ornithine, lysine and arginine;
  • X 4 5 is tyrosine or phenylalanine;
  • P 62 is selected from the group consisting of FVNQ (SEQ ID NO: 12), a tripeptide valine-asparagine-glutamine, a dipeptide asparagine-glutamine, glutamine and an N-terminal amine; and
  • R 53 is COOH or CONH 2 .
  • the conjugate of any one of embodiments 1-11 is provided wherein said insulin peptide comprises an A chain and a B chain wherein said A chain comprises a sequence GIVDECCX8X9SCDLRRLEMX19CX21-R53 (SEQ ID NO: 74) and said B chain comprises a sequence R 62 -X 25 LCGAX 3 oLVDALYLVCGDX 42 GFY (SEQ ID NO: 75), wherein
  • X 8 is phenylalanine or histidine
  • X9 is arginine, ornithine or alanine
  • X 1 9 is tyrosine, 4-methoxy-phenylalanine or 4-amino-phenylalanine;
  • X 21 is alanine or asparagine
  • X 25 is histidine or threonine
  • X 3 o is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
  • X 42 is selected from the group consisting of alanine ornithine and arginine; and Rs 3 is COOH or CONH 2 ;
  • R 62 is selected from the group consisting of AYRPSE (SEQ ID NO: 14), FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptide glycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine, a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine, glutamine, glutamic acid and an N-terminal amine; and
  • R 5 is COOH or CONH 2 .
  • the conjugate of embodiments 12 is provided wherein the B chain sequence comprises the sequence
  • X 25 is selected from the group consisting of histidine and threonine; and R 63 is selected from the group consisting of YTX 28 KT (SEQ ID NO: 149), YTKPT (SEQ ID NO: 150), YTX 28 K (SEQ ID NO: 152), YTKP (SEQ ID NO: 151), YTPK (SEQ ID NO: 70), YTX 28 , YT, Y and a bond, wherein X 28 is proline, aspartic acid or glutamic acid.
  • the conjugate of any one of embodiments 1-13 wherein the insulin peptide comprises an A chain and a B chain wherein the A chain comprises the sequence GIVEQCCX 8 X 9 ICSLYQLENYCX 2 i-R 53 (SEQ ID NO: 73), and the B chain sequence comprises the sequence
  • X 8 is histidine or threonine
  • X9 is serine, lysine, or alanine
  • X 21 is alanine, glycine or asparagine
  • X 25 is selected from the group consisting of histidine and threonine.
  • X 2 9 is selected from the group consisting of alanine, glycine and serine;
  • X 3 o is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
  • X 33 is selected from the group consisting of aspartic acid and glutamic acid;
  • X 34 is selected from the group consisting of alanine and threonine
  • X 4 i is selected from the group consisting of glutamic acid, aspartic acid or asparagine;
  • X 42 is selected from the group consisting of alanine, ornithine, lysine and arginine;
  • X 4 5 is tyrosine or phenylalanine;
  • Zi is a dipeptide selected from the group consisting of aspartate-lysine, lysine- proline, and proline-lysine;
  • Bi is selected from the group consisting of threonine, alanine or a threonine-arginine- arginine tripeptide.
  • said A chain comprises a sequence GIVEQCCTSICSLYQLENYCN-R53 (SEQ ID NO: 1) and said B chain comprises a sequence
  • FVNQHLC GS HLVE AL YLVC GERGFFYTPKT (SEQ ID NO: 2), wherein R53 is COOH or CONH 2 .
  • Y is selected from the group consisting of 3,5,3',5'-tetra-iodothyronine, and 3,5,3'-triiodo L-thyronine and Y is linked via the side chain of the lysine residue present at B28 or B29 of said B chain.
  • the conjugate of embodiment 17 is provided wherein Y is 3,5,3'-triiodo L-thyronine and Y is linked via the side chain of the lysine residue present at B29 of said B chain.
  • the conjugate of any one of embodiments 1-18 is provided wherein the insulin peptide is a single chain insulin and the peptide linking moiety joining the B and A chains is selected from the group consisting of
  • SSSSRAPPPSLPSPSRLPGPSDTPILPQK (SEQ ID NO: 51), GAGSSSX 57 X 58 (SEQ ID NO: 76), GYGSSSX 57 X 58 (SEQ ID NO: 21) and G YGS S S X57X58 APQT ; (SEQ ID NO: 77), wherein
  • X 57 and X58 are independently arginine, lysine or ornithine.
  • the conjugate of embodiment 19 is provided wherein the peptide linking moiety is selected from the group consisting of GYGSSSRR (SEQ ID NO: 18) and GAGSSSRR (SEQ ID NO: 22).
  • the conjugate of embodiment 22 is provided wherein L comprises an ether moiety, or an amide moiety, an ester moiety, an acid-labile moiety, a reduction-labile moiety , an enzyme-labile moiety, a hydrazone moiety, a disulfide moiety, or a cathepsin-cleavable moiety.
  • the conjugate of any one of embodiments 1-23 wherein Y-L is covalently linked to the insulin peptide (Q) through a position independently selected from the side chain of an amino acid at B28 or B29 of the B chain, the N-terminal alpha amine of the B chain, the N-terminal alpha amine of the A chain and at the side chain of an amino acid at any position of a linking moiety that links the A chain and B chain of a single chain insulin peptide.
  • the conjugate of embodiment 24 is provided wherein Y-L is covalently linked to the insulin peptide (Q) through the side chain of an amino acid at B28 or B29 of the B chain.
  • the conjugate of any one of embodiments 1-25 is provided wherein the conjugate is derivatized by linking the structure U-J to the conjugate, wherein
  • U is an amino acid or a hydroxy acid
  • J is an N-alkylated amino acid, sad structure being linked to said conjugate through an amide bond between a carboxyl moiety of J and an amine of the conjugate, wherein the chemical cleavage half- life (ti /2 ) of U-J from the conjugate is at least about 1 hour to about 1 week in PBS under physiological conditions.
  • the conjugate of any one of embodiments 1-26 is provided wherein a hydrophilic moiety is covalently linked at one or more positions corresponding to A14, A15, BO, B l, B IO, B22, B28, B29.
  • the conjugate of embodiment 27 is provided wherein the hydrophilic moiety is a polyethylene glycol.
  • the conjugate of any one of embodiments 1-28 is provided further comprising an acyl group or alkyl group covalently linked to an amino acid side chain of said conjugate.
  • the conjugate of embodiment 29 is provided wherein said acyl group or alkyl group is covalently linked to one or more positions selected from A14, A15, B0, B l, B 10, B22, B28, B29 of the insulin peptide.
  • the conjugate of any one of embodiments 1-30 is provided as a pharmaceutical composition comprising a conjugate of any one of the preceding embodiments, or pharmaceutically acceptable salt thereof, and a pharmaceutically acceptable carrier.
  • the conjugate of any one of embodiments 1-31 is provided for treating diabetes or reducing weight gain or enhncing weight loss.
  • the use of the conjugate of the composition of embodiment 31, or pharmaceutically acceptable salt thereof, is provided for the treatment of diabetes.
  • any combination of i), ii) and iii) is provided wherein the method comprises the step of administering a conjugate of any one of embodiments 1-30 to a patient in need thereof.
  • Insulin A & B chains were synthesized on 4-methylbenzhyryl amine (MB HA) resin or 4-Hydroxymethyl-phenylacetamidomethyl (PAM) resin using Boc chemistry.
  • the peptides were cleaved from the resin using HF/p-cresol 95:5 for 1 hour at 0°C. Following HF removal and ether precipitation, peptides were dissolved into 50% aqueous acetic acid and lyophilized. Alternatively, peptides were synthesized using Fmoc chemistry.
  • the peptides were cleaved from the resin using Trifluoroacetic acid (TFA)/ Triisopropylsilane (TIS)/ H 2 0 (95:2.5:2.5), for 2 hour at room temperature.
  • TSA Trifluoroacetic acid
  • TIS Triisopropylsilane
  • H 2 0 95:2.5:2.5
  • the peptide was precipitated through the addition of an excessive amount of diethyl ether and the pellet solubilized in aqueous acidic buffer.
  • the quality of peptides were monitored by RP-HPLC and confirmed by Mass Spectrometry (ESI or MALDI).
  • Insulin A chains were synthesized with a single free cysteine at amino acid 7 and all other cysteines protected as acetamidomethyl A-(SH) 7 (Acm) 6 ' U ' 20 .
  • Insulin B chains were synthesized with a single free cysteine at position 7 and the other cysteine protected as acetamidomethyl B-(SH) 7 (Acm) 19.
  • the crude peptides were purified by conventional RP- HPLC.
  • the synthesized A and B chains were linked to one another through their native disulfide bond linkage in accordance with the general procedure outlined in Fig. 1.
  • the respective B chain was activated to the Cys -Npys analog through dissolution in DMF or DMSO and reacted with 2,2' -Dithiobis (5-nitropyridine) (Npys) at a 1: 1 molar ratio, at room temperature.
  • the activation was monitored by RP-HPLC and the product was confirmed by ESI-MS.
  • the first B7-A7 disulfide bond was formed by dissolution of the respective A-
  • Insulin peptides comprising a modified amino acid can also be synthesized in vivo using a system that allows for incorporation of non-coded amino acids into proteins, including for example, the system taught in US Patent Nos. 7,045,337 and 7,083,970.
  • Insulin or an insulin analog
  • mPEG20k-Aldyhyde or an insulin analog
  • NaB3 ⁇ 4CN in a molar ratio of 1:2:30, were dissolved in acetic acid buffer at a pH of 4.1-4.4.
  • the reaction solution was composed of 0.1 N NaCl, 0.2 N acetic acid and 0.1 N Na 2 C0 3 .
  • the insulin peptide concentration was approximately 0.5 mg/ml.
  • the reaction occurs over six hours at room temperature. The degree of reaction was monitored by RP-HPLC and the yield of the reaction was approximately 50%.
  • reaction mixture was diluted 2-5 fold with 0.1% TFA and applied to a
  • Insulin or an insulin analog
  • mPEG20k-NHS were dissolved in 0.1 N Bicine buffer (pH 8.0) at a molar ratio of 1: 1.
  • the insulin peptide concentration was approximately 0.5 mg/ml. Reaction progress was monitored by HPLC. The yield of the reaction is approximately 90% after 2 hours at room temperature.
  • reaction mixture was diluted 2-5 fold and loaded to RP-HPLC.
  • HPLC condition C4 column; flow rate 10 ml/min; A buffer 10% ACN and 0.1% TFA in water; B buffer 0.1% TFA in ACN; A linear gradient B% from 0-40% (0-80 min); PEG- insulin or analogues was collected at approximately 35% B. .
  • the desired compounds were verified by MALDI-TOF, following chemical modification through sulftolysis or trypsin degradation. Reductive Aminated Pegylation of Acetyl Group on the Aromatic Ring Of The
  • Insulin (or an insulin analogue), mPEG20k-Hydrazide, and NaBH 3 CN in a molar ratio of 1:2:20 were dissolved in acetic acid buffer (pH of 4.1 to 4.4).
  • the reaction solution was composed of 0.1 N NaCl, 0.2 N acetic acid and 0.1 N Na 2 C0 3 .
  • Insulin or insulin analogue concentration was approximately 0.5 mg/ml. at room temperature for 24h.
  • the reaction process was monitored by HPLC. The conversion of the reaction was
  • reaction mixture was diluted 2-5 fold and loaded to RP-HPLC.
  • HPLC condition C4 column; flow rate 10 ml/min; A buffer 10% ACN and 0.1% TFA in water; B buffer 0.1% TFA in ACN; A linear gradient B% from 0-40% (0-80 min); PEG- insulin, or the PEG-insulin analogue was collected at approximately 35%B. .
  • the desired compounds were verified by MALDI-TOF, following chemical modification through sulftolysis or trypsin degradation.
  • each peptide for the insulin or IGF-1 receptor was measured in a competition binding assay utilizing scintillation proximity technology.
  • Serial 3-fold dilutions of the peptides were made in Tris-Cl buffer (0.05 M Tris-HCl, pH 7.5, 0.15 M NaCl, 0.1% w/v bovine serum albumin) and mixed in 96 well plates (Corning Inc., Acton, MA) with 0.05 nM (3-[125I]-iodotyrosyl) A TyrA14 insulin or (3-[125I]-iodotyrosyl) IGF-1 (Amersham Biosciences, Piscataway, NJ).
  • Binding (Bound-NSB / Total bound-NSB) x 100. IC50 values were determined by using Origin software (OriginLab, Northampton, MA).
  • receptor transfected HEK293 cells were plated in 96 well tissue culture plates (Costar #3596, Cambridge, MA) and cultured in Dulbecco's modified Eagle medium (DMEM) supplemented with 100 IU/ml penicillin, 100 g/ml streptomycin, 10 mM HEPES and 0.25% bovine growth serum (HyClone SH30541, Logan, UT) for 16-20 hrs at 37 °C, 5% CO 2 and 90% humidity.
  • DMEM Dulbecco's modified Eagle medium
  • the plate was then washed three times and filled with horseradish peroxidase-conjugated antibody against phospho tyro sine (Upstate biotechnology #16-105, Temecula, CA) reconstituted in PBS with 2% bovine serum albumin per manufacturer's recommendation. After 3 hrs incubation at room temperature the plate was washed 4 times and 0.1 ml of TMB single solution substrate (Invitrogen, #00-2023, Carlbad, CA) was added to each well. Color development was stopped 5 min later by adding 0.05 ml 1 N HC1. Absorbance at 450 nm was measured on Titertek Multiscan MCC340 (ThermoFisher, Pittsburgh, PA). Absorbance vs. peptide concentration dose response curves were plotted and EC 50 values were determined by using Origin software (OriginLab, Northampton, MA).
  • insulin-IGF-I minigene comprising a native insulin B and A chain linked via the IGF-I C chain ( ⁇ °- ⁇ - ⁇ °) was cloned into expression vector pGAPZa A (purchased from Invitrogen) under GAP promoter (promoter of the glyceraldehyde-3-phosphate
  • GPDH dehydrogenase
  • the minigenes including B°C 1 A°, eleven alanine mutants, and other select derivatives were transformed into yeast Pichia pastoris by electroporation. Positive transformants were selected on minimal methanol plates and a genomic preparation of each Pichia isolate was performed and integration of the constructs into the yeast genome was confirmed by PCR. An 833 base pair PCR product was visualized on an agarose DNA gel. The insulin analogs were produced by fermentation of a corresponding yeast line. The yeast cells were pelleted by centrifugation at 5 K for 20 minutes in 500 ml Beckman centrifuge tubes and the media was kept for subsequent protein purification.
  • Lyophilized samples were re-suspended in 0.025M NH 3 HCO 3 pH 8 and purified on a Luna C18 column (10 ⁇ particle size, 300A° pore size). Protein was eluted from the column using a linear gradient of 20-60% aqueous ACN. MALDI-MS positive fractions were pooled and transferred to a disposable scintillation vial for subsequent lyophilization. Lyophilized samples were then resuspended in 20% aqueous ACN containing 0.1% TFA, and purified on a Luna C18 column (10 ⁇ particle size, 300A° pore size).
  • the protein was eluted from the column using a linear gradient of 18-54% aqueous ACN with 0.1% TFA. Protein elution was monitored at an absorbance 280 nm. MALDI-TOF MS positive fractions were analyzed via a C8 analytical column to insure purity.
  • the B°-C 1_ A 0 analog demonstrated potency that was equally effective at both insulin receptor isoforms and the IGF-1 receptor. Mutation of the tyrosine at position 2 to alanine or the shortening of the C-peptide to eight amino acids through deletion of C9-12 provided a selective enhancement in the specificity of insulin action by significant reduction in the IGF- 1 receptor activity. See also the data provided in Tables 5A and 5B: Table 5A
  • Position 2 and 3 in the C-peptide are most sensitive to modification at the IGF-1 receptor with the insulin receptor proving to be relatively immune to modification. All of the analogs maintained single unit nanomolar activity with certain specific analogs proving to be slightly enhanced in potency (low single unit nanomolar). The most insulin selective analogs were those that we missing the last four residues of the C-peptide, had an alanine mutation at position two of the C-peptide, or a combination of the two changes.
  • mice Single administration of six- to eight-week-old male C57BL/6 mice were maintained at 23'C, constant humidity, and a 12-h light— dark cycle.
  • the acute in vivo effects of select peptides were evaluated by subcutaneously injecting peptides solubilized in physiologically buffered saline or a vehicle control to normal or diabetic mice. Blood glucose was measured at various time points through the course of a 24hr period following administration of the peptides. Each group of mice contained 8 animals per group. The average body weight was 25 g. Mice were made diabetic by administration of strep tozotocin.
  • mice Repeat daily subcutaneous administration of the peptides or vehicle control was administered to the mice for periods of five days to two weeks.
  • the obese mice were given a diabetogenic diet Mice had free access to water and were fed ad libitum with a high fat diet (HFD) comprising 58% of calories from fat (D 12331; Research Diets, New Brunswick, NJ) and each group of mice contained 8 animals per group.
  • HFD high fat diet
  • the average body weight was -50 g 20 and the mice were— 6 months old males.
  • Body weight and food intake were measured on the days that peptide or vehicle control was administered to the mice. Fasting blood glucose levels were measured repeatedly.

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Medicinal Chemistry (AREA)
  • Veterinary Medicine (AREA)
  • Epidemiology (AREA)
  • Animal Behavior & Ethology (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Public Health (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Endocrinology (AREA)
  • Toxicology (AREA)
  • Diabetes (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Biochemistry (AREA)
  • Biophysics (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Botany (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
  • Peptides Or Proteins (AREA)

Abstract

Disclosed herein are insulin peptides conjugated to a nuclear hormone receptor ligand (NHR ligand) wherein the insulin/NHR ligand conjugate has agonist activity at both the insulin receptor and the corresponding nuclear hormone receptor.

Description

INSULIN-NUCLEAR HORMONE CONJUGATES
CROSS REFERENCE TO RELATED APPLICATIONS
This application claims priority to U.S. Provisional Patent Application Nos.
62/344,674, filed on June 2, 2016, the disclosure of which is hereby expressly incorporated by reference in its entirety.
INCORPORATION BY REFERENCE OF MATERIAL SUBMITTED ELECTRONICALLY
Incorporated by reference in its entirety is a computer-readable nucleotide/amino acid sequence listing submitted concurrently herewith and identified as follows: 90 kilobytes acii (text) file named "265655seqlist_ST25.txt," created on May 30, 2017.
BACKGROUND
Insulin is a proven therapy for the treatment of juvenile-onset diabetes and later stage adult-onset diabetes. The peptide is biosynthesized as a larger linear precursor of low potency (approximately 2% to 9% of native insulin), named proinsulin. Proinsulin is proteolytically converted to insulin by the selective removal of a 35-residue connecting peptide (C peptide). The resultant heteroduplex formed by disulfide links between the insulin "A chain" (SEQ ID NO: 1) and "B chain" (SEQ ID NO: 2) chain, representing a total of 51 amino acids, has high potency for the insulin receptor (nM range). Native insulin has approximately one hundredfold selective affinity for the insulin receptor relative to the related insulin-like growth factor 1 receptor, but demonstrates little selectively for the two different insulin receptor isoforms, named A & B.
The insulin-like growth factors 1 and 2 are single chain liner peptide hormones that are highly homologous in their A and B chain sequences, sharing approximately fifty percent homology with native insulin. The IGF A and B chains are linked by a "C-peptide", wherein the C-peptides of the two IGFs differ in size and amino acid sequence, the first being twelve and the second being eight amino acids in length. Human IGF-1 is a 70 aa basic peptide having the protein sequence shown in SEQ ID NO: 3, and has a 43% homology with proinsulin (Rinderknecht et al. (1978) J. Biol. Chem. 253:2769-2776). Human IGF-2 is a 67 amino acid basic peptide having the protein sequence shown in SEQ ID NO: 4. The IGFs demonstrate considerably less activity at the insulin B receptor isoform than the A-receptor isoform.
Applicants have previously identified IGF-1 based insulin peptides analogs, (wherein the native Gln-Phe dipeptide of the B -chain is replaced by Tyr-Leu) that display high activity at the insulin receptor (see PCT/US2009/068713, the disclosure of which is incorporated herein). Such analogs (referred to herein as IGF YL analog peptides) are more readily synthesized than insulin and enable the development of co-agonist analogs for insulin and IGF- 1 receptors, and selective insulin receptor specific analogs. Furthermore, these insulin analogs can also be formulated as single chain insulin agonists in accordance with the present disclosure.
Single chain insulin analogs comprising the insulin A and B chains have been previously prepared (see EP 1,193,272 and US 2007/0129284). However, single chain high potency insulin agonists can also be prepared by insertion of the IGF-1 C-peptide, or analogs thereof, as a connecting peptide linking the insulin B and A peptides. The selective mutation of individual amino acids in the C-peptide sequence yields peptides that are highly selective for insulin relative to IGF- 1 receptor.
Nuclear hormone receptor proteins form a class of ligand activated proteins that, when bound to specific sequences of DNA, serve as on-off switches for transcription within the cell nucleus. These switches control the development and differentiation of skin, bone and behavioral centers in the brain, as well as the continual regulation of reproductive tissues.
Nuclear hormone receptor ligands such as steroids, sterols, retinoids, thyroid hormones, and vitamin D function to activate nuclear hormone receptors. The interaction of the hormone and receptor triggers a conformational change in the receptor, which results in the up-regulation of gene expression. The level of cellular signal transduction activated by the interaction of a ligand and a nuclear hormone receptor is determined by the number of ligands and receptors available for binding, and by the binding affinity between the ligand and the receptor.
Thyroid hormones powerfully influence systemic metabolism through multiple pathways, with profound effects on energy expenditure, fat oxidation, and cholesterol metabolism. Clinical reports revealed sixty years ago that administration of thyroid extracts reduced circulating cholesterol and reversed obesity. However, adverse side effects of thyroid hormone treatment include increased heart rate, cardiac hypertrophy, muscle wasting, and reduced bone density, terminating its clinical use. Discovery of thyromimetics capable of separating lipid metabolism benefits from adverse cardiovascular effects has remained a desire for patients, physicians and the pharmaceutical industry.
Peroxisome proliferator-activated receptors (PPAR) are nuclear hormone receptors. PPAR receptors activate transcription by binding to elements of DNA sequences, known as peroxisome proliferator response elements (PPRE), in the form of a heterodimer with retinoid X receptors (known as RXRs). Three sub-types of human PPAR have been identified and described: PPAR- alpha, PPAR-gamma and PPAR-delta (or NUCI). PPAR- alpha is mainly expressed in the liver, while PPAR-delta is ubiquitous. PPAR-gamma is involved in regulating the differentiation of adipocytes, where it is highly expressed. It also has a key role in systemic lipid homeostasis. A number of compounds that modulate the activity of PPARs have been identified including thiazolidinediones, which have been employed in the treatment of diabetes and metabolic disease. Activation of PPAR gamma is known to reduce blood glucose levels without increasing insulin secretion. Discovery of PPAR agonists capable of separating lipid metabolism benefits from adverse effects has remained a desire for patients, physicians and the pharmaceutical industry
As disclosed herein conjugates are formed between an insulin peptide and a nuclear hormone receptor ligand, wherein the conjugate has agonist activity at both the insulin receptor and the corresponding nuclear hormone receptor. More particularly, the
conjugation of a nuclear hormone receptor ligand is anticipated to produce a beneficial modification of the insulin peptide activity while moderating adverse effects associated with nuclear hormone agonists.
SUMMARY
An insulin agonist/ nuclear hormone receptor ligand conjugate is provided wherein the conjugate has agonist activity at both the insulin receptor and the corresponding nuclear hormone receptor (NHR). These conjugates with plural activities are useful for the treatment of a variety of diseases, including the treatment of diabetes. In accordance with one embodiment the conjugates of the present disclosure can be represented by the following formula:
Q-L-Y
wherein Q is an insulin peptide, Y is a NHR ligand, and L is a linking group or a bond. The insulin peptide component of the conjugate can be native insulin or any known insulin analog that has activity at the insulin receptor including for example any insulin peptide disclosed in published international applications W096/34882, WO 2010/080607, WO 2010/080609, WO 2011/159882, WO/2011/159895 and US Patent No. 6,630,348, the disclosures of which are incorporated herein by reference.
The NHR ligand (Y) is wholly or partly non-peptidic and acts at a nuclear hormone receptor with an activity in accordance with any of the teachings set forth herein. In some embodiments the NHR ligand is an agonist that, in its unbound state, has an EC50 or IC50 of about 1 mM or less, or 100 μΜ or less, or 10 μΜ or less, or 1 μΜ or less. In some embodiments, the NHR ligand has a molecular weight of up to about 5000 daltons, or up to about 2000 daltons, or up to about 1000 daltons, or up to about 500 daltons. The NHR ligand may act at any of the nuclear hormone receptors described herein or have any of the structures described herein. In accordance with one embodiment the NHR ligand component of the conjugate can be a ligand that activates the thyroid hormone receptor or activates the peroxisome proliferator- activated receptors (PPAR).
The conjugates of the present disclosure are anticipated to lessen the amount of insulin that is needed to control hyperglycemia while also simultaneously lowering body weight and triglycerides and/or cholesterol levels. More particularly, the chemical addition of nuclear hormone activity to insulin agonist peptides provides supplemental pharmacology that is envisioned to broaden insulin action such that it could lower triglycerides, cholesterol and body weight in a means that is currently not possible with insulin therapy alone.
Thyroid hormone is recognized for its ability to lower body weight and
independently lower cholesterol through action at the liver. In accordance with one embodiment, applicants anticipate that the insulin-thyroid hormone conjugates will provide new dimensions to insulin action and lessen the amount of insulin needed to control blood glucose leading to less hypoglycemia and less body weight gain. In one embodiment applicants anticipate that insulin can be conjugated to a PPAR agonist to improve insulin sensitivity which should lower exogenous insulin requirements leading to less hypoglycemia and less body weight gain.
In accordance with one embodiment the nuclear hormone receptor ligand is linked to the C-terminus of the insulin B chain or at the N-terminus of the insulin A or B chain, either directly via an ester or amide bond, or indirectly through a spacer. In one embodiment the nuclear hormone receptor ligand is covalently linked to the insulin peptide through the side chain of an amino acid at a position selected from the group consisting of Al, B l, B28 and B29, or the N-terminal alpha amine of the A or B chain, or at the side chain of an amino acid at any position of a linking moiety that links the A chain and B chain of a single chain insulin analog.
In one embodiment the insulin peptide of the conjugate is a two chain insulin analog comprising an A chain and B chain linked to one another via intermolecular disulfide bonds. In a further embodiment the conjugate comprises a two chain insulin analog wherein a first and second nuclear hormone receptor ligand are covalently linked to the insulin peptide at a position selected from the side chain of the B28 or B29 amino acid and the N-terminal alpha amine of the A or B chain.
In one embodiment the NHR ligand-insulin conjugate comprises a hydrophilic moiety linked to the N-terminal alpha amine of the B chain or to the side chain of an amino acid at a position selected from the group consisting of A9, A14 and A15 of the A chain or positions B l, B2, B IO, B22, B28 or B29 of the B chain or to a side chain of an amino acid of the linking moiety in a single chain insulin analog. In one embodiment the NHR ligand is linked to the amino acid side chain of a lysine present at position B28 or B29 and optionally a hydrophilic moiety is linked to the amino terminus of the B chain, optionally through the N-terminal alpha amine of the B chain. In one embodiment the hydrophilic moiety is a polyethylene glycol chain.
In one embodiment a polyethylene glycol chain is covalently bound to the side chain of an amino acid of the linking moiety of the insulin peptide component, when the insulin peptide is a single chain insulin analog. In one embodiment the insulin peptide is a single chain insulin wherein linking moiety joining the B and A chains comprises an amino acid sequence of no more than 17 amino acids in length and comprising the sequence
GYGSSSX57X58 (SEQ ID NO: 21), GAGSSSRR (SEQ ID NO: 22) or
G YGS S S X57X58 APQT ; (SEQ ID NO: 69) wherein X57 and X58 are independently arginine, lysine or ornithine and the amino acid designated by Xs7 or Xs8 optionally further comprises a hydrophilic moiety linked to the side chain of the amino acid at that position. In one embodiment the hydrophilic moiety is a polyethylene glycol chain.
Acylation or alkylation can increase the half-life of the NHR ligand-insulin conjugate in circulation. Acylation or alkylation can advantageously delay the onset of action and/or extend the duration of action at the insulin receptors. The insulin peptide of the conjugates may be acylated or alkylated at the same amino acid position where a hydrophilic moiety is linked (including, for example at position 8 of the linking moiety), or at a different amino acid position.
Also encompassed by the present disclosure are pharmaceutical compositions comprising the NHR ligand-insulin conjugates and a pharmaceutically acceptable carrier. In accordance with one embodiment a pharmaceutical composition is provided comprising any of the NHR ligand-insulin conjugates disclosed herein preferably at a purity level of at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%, and a pharmaceutically acceptable diluent, carrier or excipient. Such compositions may contain a conjugate as disclosed herein at a concentration of at least 0.5 mg/ml, 1 mg/ml, 2 mg/ml, 3 mg/ml, 4 mg/ml, 5 mg/ml, 6 mg/ml, 7 mg/ml, 8 mg/ml, 9 mg/ml, 10 mg/ml, 11 mg/ml, 12 mg/ml, 13 mg/ml, 14 mg/ml, 15 mg/ml, 16 mg/ml, 17 mg/ml, 18 mg/ml, 19 mg/ml, 20 mg/ml, 21 mg/ml, 22 mg/ml, 23 mg/ml, 24 mg/ml, 25 mg/ml or higher. In one embodiment the pharmaceutical compositions comprise aqueous solutions that are sterilized and optionally stored within various package containers. In other embodiments the pharmaceutical compositions comprise a lyophilized powder. The pharmaceutical compositions can be further packaged as part of a kit that includes a disposable device for administering the composition to a patient. The containers or kits may be labeled for storage at ambient room temperature or at refrigerated temperature.
In accordance with one embodiment an improved method of regulating blood glucose levels in insulin dependent patients is provided. The method comprises the steps of administering to a patient a NHR ligand-insulin conjugate as disclosed herein in an amount therapeutically effective for the control of diabetes. In accordance with one embodiment a method of reducing weight or preventing weight gain is provided wherein the method comprises administering a NHR ligand-insulin conjugate as disclosed herein to a patient in need of such therapy.
BRIEF DESCRIPTION OF THE DRAWINGS
Fig. 1. is a schematic overview of the two step synthetic strategy for preparing human insulin. Details of the procedure are provided in Example 1.
Fig. 2 is a graph comparing insulin receptor specific binding of synthetic human insulin relative to purified native insulin. The synthetic insulin was produced by the approach detailed in Fig. 1 where the A 7 -B 7 bond is the first disulfide formed. As indicated by the data presented in the graph, the two molecules have similar binding activities. Fig. 3 shows the structure of an insulin-dexamethasone and two insulin-thyroxine (T3) conjugates, wherein native insulin is modified by linkage of the dexamethasone/T3 to the side chain of the native Lys at position B29. Also provided is the activity of the three conjugates and native insulin at the insulin IR-B receptor using the in vitro assay of Example 3.
Fig. 4 is a graph presenting data for an in vivo study on blood glucose levels in mice administered the insulin-nuclear hormone ligand conjugates CIU-15, CIU-16 and CIU-17 relative to control and native insulin.
Fig. 5 shows the structure of an insulin-Tesaglitazar conjugate, wherein the native insulin heteroduplex (comprising the insulin A chain of SEQ ID NO: 1 and the B chain of SEQ ID NO: 2) is linked to Tesaglitazar via a Benzyl-PEG2-GGG-K linker.
Fig. 6 shows the structure of an insulin- Aleglitazar conjugate, wherein the native insulin heteroduplex (comprising the insulin A chain of SEQ ID NO: 1 and the B chain of SEQ ID NO: 2) is linked to Tesaglitazar via an NHCO linker to the B29 Lys side chain (CIU-164), to the A chain N-terminal amine and the B29 Lys side chain (CIU-165) or the B29 Lys side chain of a B l pegylated insulin analog (CIU-166).
Figs. 7A-7D shows the activity of an insulin- Aleglitazar conjugate (CIU-166) and an insulin-T3 conjugate (CIU-167), wherein the insulin component is native insulin (comprising the insulin A chain of SEQ ID NO: 1 and the B chain of SEQ ID NO: 2) pegylated with a 20K peg at the B l position. Fig. 7A shows the activity of CIU-166 and CIU-167 the insulin IR-B receptor using an in vitro assay. Fig. 7B is a graph presenting data for an in vivo study on blood glucose levels in mice administered CIU-166 and CIU-167. Fig. 7C is a graph presenting data for an in vivo study on body weight in mice administered CIU-166 and CIU- 167; Fig. 7D is a graph presenting data for glucose tolerance test in mice administered CIU- 166 and CIU-167.
DETAILED DESCRIPTION DEFINITIONS
In describing and claiming the invention, the following terminology will be used in accordance with the definitions set forth below.
The term "about" as used herein means greater or lesser than the value or range of values stated by 10 percent, but is not intended to designate any value or range of values to only this broader definition. Each value or range of values preceded by the term "about" is also intended to encompass the embodiment of the stated absolute value or range of values.
As used herein the term "amino acid" encompasses any molecule containing both amino and carboxyl functional groups, wherein the amino and carboxylate groups are attached to the same carbon (the alpha carbon). The alpha carbon optionally may have one or two further organic substituents. For the purposes of the present disclosure designation of an amino acid without specifying its stereochemistry is intended to encompass either the L or D form of the amino acid, or a racemic mixture.
As used herein the term "hydroxyl acid" refers to amino acids that have been modified to replace the alpha carbon amino group with a hydroxyl group.
As used herein the term "non-coded amino acid" encompasses any amino acid that is not an L-isomer of any of the following 20 amino acids: Ala, Cys, Asp, Glu, Phe, Gly, His, He, Lys, Leu, Met, Asn, Pro, Gin, Arg, Ser, Thr, Val, Trp, Tyr.
As used herein a general reference to a peptide is intended to encompass peptides that have modified amino and carboxy termini. For example, an amino acid sequence designating the standard amino acids is intended to encompass standard amino acids at the N- and C- terminus as well as a corresponding hydroxyl acid at the N-terminus and/or a corresponding C-terminal amino acid modified to comprise an amide group in place of the terminal carboxylic acid.
As used herein an "acylated" amino acid is an amino acid comprising an acyl group which is non-native to a naturally-occurring amino acid, regardless by the means by which it is produced. Exemplary methods of producing acylated amino acids and acylated peptides are known in the art and include acylating an amino acid before inclusion in the peptide or peptide synthesis followed by chemical acylation of the peptide. In some embodiments, the acyl group causes the peptide to have one or more of (i) a prolonged half-life in circulation, (ii) a delayed onset of action, (iii) an extended duration of action, (iv) an improved resistance to proteases, such as DPP-IV, and (v) increased potency at the IGF and/or insulin peptide receptors.
As used herein, an "alkylated" amino acid is an amino acid comprising an alkyl group which is non-native to a naturally-occurring amino acid, regardless of the means by which it is produced. Exemplary methods of producing alkylated amino acids and alkylated peptides are known in the art and including alkylating an amino acid before inclusion in the peptide or peptide synthesis followed by chemical alkylation of the peptide. Without being held to any particular theory, it is believed that alkylation of peptides will achieve similar, if not the same, effects as acylation of the peptides, e.g., a prolonged half-life in circulation, a delayed onset of action, an extended duration of action, an improved resistance to proteases, such as DPP-IV, and increased potency at the IGF and/or insulin receptors.
As used herein, the term "pharmaceutically acceptable carrier" includes any of the standard pharmaceutical carriers, such as a phosphate buffered saline solution, water, emulsions such as an oil/water or water/oil emulsion, and various types of wetting agents. The term also encompasses any of the agents approved by a regulatory agency of the US Federal government or listed in the US Pharmacopeia for use in animals, including humans.
As used herein the term "pharmaceutically acceptable salt" refers to salts of compounds that retain the biological activity of the parent compound, and which are not biologically or otherwise undesirable. Many of the compounds disclosed herein are capable of forming acid and/or base salts by virtue of the presence of amino and/or carboxyl groups or groups similar thereto.
As used herein, the term "hydrophilic moiety" refers to any compound that is readily water-soluble or readily absorbs water, and which are tolerated in vivo by mammalian species without toxic effects (i.e. are biocompatible). Examples of hydrophilic moieties include polyethylene glycol (PEG), polylactic acid, polyglycolic acid, a polylactic- polyglycolic acid copolymer, polyvinyl alcohol, polyvinylpyrrolidone, polymethoxazoline, polyethyloxazoline, polyhydroxyethyl methacrylate, polyhydroxypropyl methacrylamide, polymethacrylamide, polydimethylacrylamide, and derivatised celluloses such as hydroxymethylcellulose or hydroxyethylcellulose and co-polymers thereof, as well as natural polymers including, for example, albumin, heparin and dextran.
As used herein, the term "treating" includes prophylaxis of the specific disorder or condition, or alleviation of the symptoms associated with a specific disorder or condition and/or preventing or eliminating said symptoms. For example, as used herein the term "treating diabetes" will refer in general to maintaining glucose blood levels near normal levels and may include increasing or decreasing blood glucose levels depending on a given situation.
As used herein an "effective" amount or a "therapeutically effective amount" of an insulin analog refers to a nontoxic but sufficient amount of an insulin analog to provide the desired effect. For example one desired effect would be the prevention or treatment of hyperglycemia. The amount that is "effective" will vary from subject to subject, depending on the age and general condition of the individual, mode of administration, and the like. Thus, it is not always possible to specify an exact "effective amount." However, an appropriate "effective" amount in any individual case may be determined by one of ordinary skill in the art using routine experimentation.
The term, "parenteral" means not through the alimentary canal but by some other route such as intranasal, inhalation, subcutaneous, intramuscular, intraspinal, or intravenous.
Throughout the application, all references to a particular amino acid position in an insulin analog by letter and number (e.g. position A5) refer to the amino acid at that position of either the A chain (e.g. position A5) or the B chain (e.g. position B5) in the respective native human insulin A chain (SEQ ID NO: 1) or B chain (SEQ ID NO: 2), or the corresponding amino acid position in any analogs thereof. For example, a reference herein to "position B28" absent any further elaboration would mean the corresponding position B27 of the B chain of an insulin analog in which the first amino acid of SEQ ID NO: 2 has been deleted. Similarly, amino acids added to the N-terminus of the native B chain are numbered starting with BO, followed by numbers of increasing negative value (e.g., B-l, B-2...) as amino acids are added to the N-terminus. Alternatively, any reference to an amino acid position in the linking moiety of a single chain analog, is made in reference to the native C chain of IGF 1 (SEQ ID NO: 17). For example, position 9 of the native C chain (or the "position C9") has an alanine residue.
As used herein the term "native insulin peptide" is intended to designate the 51 amino acid heteroduplex comprising the A chain of SEQ ID NO: 1 and the B chain of SEQ ID NO: 2, as well as single-chain insulin analogs that comprise SEQ ID NOS: 1 and 2. The term "insulin peptide" as used herein, absent further descriptive language is intended to encompass the 51 amino acid heteroduplex comprising the A chain of SEQ ID NO: 1 and the B chain of SEQ ID NO: 2, as well as single-chain insulin analogs thereof (including for example those disclosed in published international application W096/34882 and US Patent No. 6,630,348, the disclosures of which are incorporated herein by reference), including heteroduplexes and single-chain analogs that comprise modified analogs of the native A chain and/or B chain and derivatives thereof. Such modified analogs include modification of the amino acid at position A 19, B 16 or B25 to a 4-amino phenylalanine or one or more amino acid substitutions at positions selected from A5, A8, A9, A10, A12, A14, A15, A17, A18, A21, B l, B2, B3, B4, B5, B9, B 10, B 13, B 14, B 17, B20, B21, B22, B23, B26, B27, B28, B29 and B30 or deletions of any or all of positions B l-4 and B26-30. Insulin peptides as defined herein can also be analogs derived from a naturally occurring insulin by insertion or substitution of a non-peptide moiety, e.g. a retroinverso fragment, or incorporation of non- peptide bonds such as an azapeptide bond (CO substituted by NH) or pseudo-peptide bond (e.g. NH substituted with C¾) or an ester bond (e.g., a depsipeptide, wherein one or more of the amide (-CONHR-) bonds are replaced by ester (COOR) bonds).
As used herein, the term "single-chain insulin analog" encompasses a group of structurally-related proteins wherein insulin or IGF A and B chains, or analogs or derivatives thereof, are covalently linked to one another to form a linear polypeptide chain. As disclosed herein the single-chain insulin analog comprises the covalent linkage of the carboxy terminus of the B chain to the amino terminus of the A chain via a linking moiety.
As used herein the term "insulin A chain", absent further descriptive language is intended to encompass the 21 amino acid sequence of SEQ ID NO: 1 as well as functional analogs and derivatives thereof, including insulin analogs known to those skilled in the art, including modification of the sequence of SEQ ID NO: 1 by one or more amino acid insertions, deletions or substitutions at positions selected from A4, A5, A8, A9, A10, A12, A14, A15, A17, A18, A21.
As used herein the term "insulin B chain", absent further descriptive language is intended to encompass the 30 amino acid sequence of SEQ ID NO: 2, as well as modified functional analogs of the native B chain, including one or more amino acid insertions, deletions or substitutions at positions selected from B l, B2, B3, B4, B5, B9, B 10, B 13, B 14, B 17, B20, B21, B22, B23, B25, B26, B27, B28, B29 and B30 or deletions of any or all of positions B l-4 and B26-30.
The term "identity" as used herein relates to the similarity between two or more sequences. Identity is measured by dividing the number of identical residues by the total number of residues and multiplying the product by 100 to achieve a percentage. Thus, two copies of exactly the same sequence have 100% identity, whereas two sequences that have amino acid deletions, additions, or substitutions relative to one another have a lower degree of identity. Those skilled in the art will recognize that several computer programs, such as those that employ algorithms such as BLAST (Basic Local Alignment Search Tool, Altschul et al. (1993) J. Mol. Biol. 215:403-410) are available for determining sequence identity.
As used herein, the term "selectivity" of a molecule for a first receptor relative to a second receptor refers to the following ratio: EC50 of the molecule at the second receptor divided by the EC50 of the molecule at the first receptor. For example, a molecule that has an EC50 of 1 nM at a first receptor and an EC50 of 100 nM at a second receptor has 100-fold selectivity for the first receptor relative to the second receptor.
As used herein an amino acid "modification" refers to a substitution of an amino acid, or the derivation of an amino acid by the addition and/or removal of chemical groups to/from the amino acid, and includes substitution with any of the 20 amino acids commonly found in human proteins, as well as atypical or non-naturally occurring amino acids.
Commercial sources of atypical amino acids include Sigma-Aldrich (Milwaukee, WI), ChemPep Inc. (Miami, FL), and Genzyme Pharmaceuticals (Cambridge, MA). Atypical amino acids may be purchased from commercial suppliers, synthesized de novo, or chemically modified or derivatized from naturally occurring amino acids.
As used herein an amino acid "substitution" refers to the replacement of one amino acid residue by a different amino acid residue.
As used herein, the term "conservative amino acid substitution" is defined herein as exchanges within one of the following five groups: I. Small aliphatic, nonpolar or slightly polar residues:
Ala, Ser, Thr, Pro, Gly;
II. Polar, negatively charged residues and their amides:
Asp, Asn, Glu, Gin, cysteic acid and homocysteic acid;
III. Polar, positively charged residues:
His, Arg, Lys; Ornithine (Orn)
IV. Large, aliphatic, nonpolar residues:
Met, Leu, He, Val, Cys, Norleucine (Nle), homocysteine
V. Large, aromatic residues:
Phe, Tyr, Trp, acetyl phenylalanine
As used herein the general term "polyethylene glycol chain" or "PEG chain", refers to mixtures of condensation polymers of ethylene oxide and water, in a branched or straight chain, represented by the general formula H(OCH2CH2)nOH, wherein n is at least 2.
"Polyethylene glycol chain" or "PEG chain" is used in combination with a numeric suffix to indicate the approximate average molecular weight thereof. For example, PEG-5,000 refers to polyethylene glycol chain having a total molecular weight average of about 5,000 Daltons As used herein the term "pegylated" and like terms refers to a compound that has been modified from its native state by linking a polyethylene glycol chain to the compound. A "pegylated polypeptide" is a polypeptide that has a PEG chain covalently bound to the polypeptide.
As used herein a "linker" is a bond, molecule or group of molecules that binds two separate entities to one another. Linkers may provide for optimal spacing of the two entities or may further supply a labile linkage that allows the two entities to be separated from each other. Labile linkages include photocleavable groups, acid-labile moieties, base-labile moieties and enzyme-cleavable groups.
As used herein a "dimer" is a complex comprising two subunits covalently bound to one another via a linker. The term dimer, when used absent any qualifying language, encompasses both homodimers and heterodimers. A homodimer comprises two identical subunits, whereas a heterodimer comprises two subunits that differ, although the two subunits are substantially similar to one another.
The term "Ci-Cn alkyl" wherein n can be from 1 through 6, as used herein, represents a branched or linear alkyl group having from one to the specified number of carbon atoms. Typical Ci-C6 alkyl groups include, but are not limited to, methyl, ethyl, n-propyl, iso- propyl, butyl, iso-Butyl, sec -butyl, tert-butyl, pentyl, hexyl and the like.
The terms "C2-Cn alkenyl" wherein n can be from 2 through 6, as used herein, represents an olefinically unsaturated branched or linear group having from 2 to the specified number of carbon atoms and at least one double bond. Examples of such groups include, but are not limited to, 1-propenyl, 2-propenyl (-CH2-CH=CH2), 1,3-butadienyl, (- CH=CHCH=CH2), 1-butenyl (-CH=CHCH2CH3), hexenyl, pentenyl, and the like.
The term "C2-Cn alkynyl" wherein n can be from 2 to 6, refers to an unsaturated branched or linear group having from 2 to n carbon atoms and at least one triple bond.
Examples of such groups include, but are not limited to, 1-propynyl, 2-propynyl, 1-butynyl, 2-butynyl, 1-pentynyl, and the like.
As used herein the term "aryl" refers to a mono- or bicyclic carbocyclic ring system having one or two aromatic rings including, but not limited to, phenyl, naphthyl,
tetrahydronaphthyl, indanyl, indenyl, and the like. The size of the aryl ring and the presence of substituents or linking groups are indicated by designating the number of carbons present. For example, the term "(Ci-C3 alkyl)(C6-Cio aryl)" refers to a 5 to 10 membered aryl that is attached to a parent moiety via a one to three membered alkyl chain. The term "heteroaryl" as used herein refers to a mono- or bi- cyclic ring system containing one or two aromatic rings and containing at least one nitrogen, oxygen, or sulfur atom in an aromatic ring. The size of the heteroaryl ring and the presence of substituents or linking groups are indicated by designating the number of carbons present. For example, the term "(Ci-Cn alkyl)(Cs-C6 heteroaryl)" refers to a 5 or 6 membered heteroaryl that is attached to a parent moiety via a one to "n" membered alkyl chain.
As used herein, the term "halo" refers to one or more members of the group consisting of fluorine, chlorine, bromine, and iodine.
As used herein the term "patient" without further designation is intended to encompass any warm blooded vertebrate domesticated animal (including for example, but not limited to livestock, horses, cats, dogs and other pets) and humans.
The term "isolated" as used herein means having been removed from its natural environment. In some embodiments, the analog is made through recombinant methods and the analog is isolated from the host cell.
The term "purified," as used herein relates to the isolation of a molecule or compound in a form that is substantially free of contaminants normally associated with the molecule or compound in a native or natural environment and means having been increased in purity as a result of being separated from other components of the original composition. The term "purified polypeptide" is used herein to describe a polypeptide which has been separated from other compounds including, but not limited to nucleic acid molecules, lipids and carbohydrates.
A "peptidomimetic" refers to a chemical compound having a structure that is different from the general structure of an existing peptide, but that functions in a manner similar to the existing peptide, e.g., by mimicking the biological activity of that peptide. Peptidomimetics typically comprise naturally-occurring amino acids and/or unnatural amino acids, but can also comprise modifications to the peptide backbone. For example a peptidomimetic may include a sequence of naturally-occurring amino acids with the insertion or substitution of a non-peptide moiety, e.g. a retroinverso fragment, or
incorporation of non-peptide bonds such as an azapeptide bond (CO substituted by NH) or pseudo-peptide bond (e.g. NH substituted with CH2), or an ester bond (e.g., depsipeptides, wherein one or more of the amide (-CONHR-) bonds are replaced by ester (COOR) bonds). Alternatively the peptidomimetic may be devoid of any naturally-occurring amino acids. ABBREVIATIONS:
Insulin analogs will be abbreviated as follows:
The insulin A and B chains will be designated generically by a capital A for the A chain and a capital B for the B chain. When present, a superscript 0 (e.g., A0 or B°) will designate the base sequence is an insulin sequence (A chain: SEQ ID NO: 1, B chain SEQ ID NO: 2) and a superscript 1 (e.g., A1 or B1) will designate the base sequence is an IGF-1 sequence (A chain: SEQ ID NO: 5, B chain SEQ ID NO: 6). Modifications that deviate from the native insulin and IGF sequence are indicated in parenthesis following the designation of the A or B chain (e.g., [Β^Ή,ΗΙΟ,ΥΙό,ίΠ) : A1(H8,N18,N21)]) with the single letter amino acid abbreviation indicating the substitution and the number indicating the position of the substitution in the respective A or B chain, using native insulin numbering. A colon between the A and B chain indicates a two chain insulin whereas a dash will indicate a covalent bond and thus a single chain analog. In single chain analogs a linking moiety will be included between the A and B chains and the designation C1 refers to the native IGF 1 C peptide, SEQ ID NO: 17. The designation "position C8" in reference to the linking moiety designates an amino acid located at the position corresponding to the eighth amino acid of SEQ ID NO: 17.
EMBODIMENTS
Disclosed herein are conjugates of an insulin peptide and a NHR ligand. In one embodiment the NHR ligand is an NHR agonist. In one embodiment the insulin
agonist/NHR ligand conjugate comprises the structure Q-L-Y, wherein Q is an insulin peptide, Y is a NHR ligand and L is a linking group or a bond, wherein said conjugate has activity at both the corresponding nuclear hormone receptor of Y and an insulin receptor, including for example the insulin subtype B receptor. In one embodiment the NHR ligand is selected from the group consisting of a steroid that exhibits an EC50 of about 1 μΜ or less when unconjugated to Q-L, and has a molecular weight of up to about 1000 daltons. In one embodiment the NHR ligand is a ligand that activates the thyroid hormone receptor or a ligand that activates the peroxisome proliferator- activated receptors (PPAR).
In some embodiments the NHR ligand is an NHR agonist. In one embodiment the
NHR agonist has activity at a Type I NHR when bound to Q-L. In one embodiment the NHR agonist has activity at a Type II NHR when bound to Q-L. In one embodiment the NHR ligand is
i) a steroid that exhibits an EC50 of about 1 μΜ or less when unconjugated to Q-L, and further has a molecular weight of up to about 1000 daltons; or
ii) a ligand that activates the thyroid hormone receptor; or iii) a ligand that activates the peroxisome proliferator-activated receptors (PPAR).
In one embodiment the NHR ligand of the conjugate is selected from the group consisting of estradiol and derivatives thereof, estrone and derivatives thereof, testosterone and derivatives thereof, and Cortisol and derivatives thereof. In one embodiment the NHR ligand is dexamethasone.
In accordance with one embodiment the NHR ligand is a thyroid hormone receptor agonist having the general structure
Figure imgf000018_0001
wherein
Ri5 is C1-C4 alkyl, -CH2(pyridazinone), -CH2(OH)(phenyl)F, -CH(OH)CH3, halo or H; R2o is halo, CH3 or H;
R2i is halo, CH3 or H;
R22 is H, OH, halo, -CH2(OH)(C6 aryl)F, or d-C4 alkyl; and
R23 is -CH2CH(NH2)COOH, -OCH2COOH, -NHC(0)COOH, -CH2COOH,
-NHC(0)CH2COOH, -CH2CH2COOH, or -OCH2P03 2". In one embodiment the thyroid hormone receptor agonist has the general structure of Formula I:
HO
Figure imgf000018_0002
R20, R21, and R22 are independently selected from the group consisting of H, OH, halo and Ci-C4 alkyl; and
Ri5 is halo or H. In one embodiment the thyroid hormone receptor agonist is selected from the group consisting of thyroxine T4 (3,5,3',5'-tetra-iodothyronine), and 3,5,3'-triiodo L- thyronine.
In one embodiment the NHR ligand is an agonist of a PPAR. In one embodiment the PPAR agonist is selected from the group consisting of Tesaglitazar, Aleglitazar and thiazolidinediones. In one embodiment the PPAR agonist is Tesaglitazar or Aleglitazar.
In one embodiment the insulin peptide of the conjugate Q-L-Y is a native insulin peptide or any insulin receptor agonist known to those skilled in the art. In one embodiment the insulin peptide (Q) comprises an A chain and a B chain wherein said A chain comprises a sequence
GIVX4X5CCX8X9XioCXi2LXi4Xi5LXi7Xi8YCX2i-R53 (SEQ ID NO: 19), and said B chain comprises a sequence R62-X25LCGX29X3oLVX33X34LYLVCGX4iX42GFX45 (SEQ ID NO: 20), wherein
X4 is glutamic acid or aspartic acid;
X5 is glutamine or glutamic acid
X8 is histidine, threonine or phenylalanine;
X9 is serine, arginine, lysine, ornithine or alanine;
X10 is isoleucine or serine;
X12 is serine or aspartic acid;
Xi4 is tyrosine, arginine, lysine, ornithine or alanine;
Xi5 is glutamine, glutamic acid, arginine, alanine, lysine, ornithine or leucine;
Xi7 is glutamic acid, aspartic acid, asparagine, lysine, ornithine or glutamine;
Xi8 is methionine, asparagine, glutamine, aspartic acid, glutamic acid or threonine;
X21 is selected from the group consisting of alanine, glycine, serine, valine, threonine, isoleucine, leucine, glutamine, glutamic acid, asparagine, aspartic acid, histidine, tryptophan, tyrosine, and methionine;
X25 is histidine or threonine;
X29 is selected from the group consisting of alanine, glycine and serine;
X30 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
X33 is selected from the group consisting of aspartic acid and glutamic acid; X34 is selected from the group consisting of alanine and threonine;
X4i is selected from the group consisting of glutamic acid, aspartic acid or asparagine;
X42 is selected from the group consisting of alanine, ornithine, lysine and arginine; X45 is tyrosine or phenylalanine;
R62 is selected from the group consisting of AYRPSE (SEQ ID NO: 14), FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptide glycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine, a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine, glutamine, glutamic acid and an N-terminal amine; and
R53 is COOH or CONH2. In one embodiment the insulin peptide is a two chain insulin analog. In another embodiment the insulin peptide is a single chain insulin analog wherein the carboxy terminus of the B chain is linked to the amino terminus of the A chain via a peptide linker. Any of the previous disclosed single chain insulin analogs having activity at the insulin receptor and known to those skilled in the art are encompassed by the present disclosure.
In one embodiment the insulin peptide of the conjugate is a two chain insulin wherein the A and B chains are linked by interchain disulfide bonds, wherein the A chain comprises the sequence GIVEQCCX8X9ICSLYQLENYCX2i-R53 (SEQ ID NO: 73) and the B chain comprises a sequence R62-X25LCGAX3oLVDALYLVCGDX42GFY (SEQ ID NO: 75), wherein
X8 is histidine or threonine;
X9 is serine, lysine, or alanine;
X21 is alanine, glycine or asparagine;
X25 is histidine or threonine;
X3o is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
X42 is selected from the group consisting of alanine, ornithine and arginine; and Rs3 is COOH or CONH2;
R62 is selected from the group consisting of FVNQ (SEQ ID NO: 12), a tripeptide valine-asparagine-glutamine, a dipeptide asparagine-glutamine, glutamine, and an N- terminal amine; and
R53 is COOH or CONH2. In one embodiment the A chain comprises the sequence GIYEQCCX8X9ICSLYQLENYCX2i-R53 (SEQ ID NO: 73) and the B chain comprises the B chain sequence comprises the sequence FVKQX25LCGSHLVEALYLVCGERGFF-R63 (SEQ ID NO: 147), or FVNQX25LCGSHLVEALYLVCGERGFF-R63 (SEQ ID NO: 148), wherein
X8 is histidine or threonine;
X9 is serine, lysine, or alanine;
X21 is alanine, glycine or asparagine;
X25 is selected from the group consisting of histidine and threonine; and
R6 is selected from the group consisting of YTX28KT (SEQ ID NO: 149), YTKPT (SEQ ID NO: 150), YTX28K (SEQ ID NO: 152), YTKP (SEQ ID NO: 151), YTPK (SEQ ID NO: 70), YTX28, YT, Y and a bond.
In one embodiment the A chain comprises the sequence
GIVEQCCX8SICSLYQLENYCX2i-R53 (SEQ ID NO: 153) or
GIVEQCCTSICSLYQLENYCN-R53 (SEQ ID NO: 1) and the B chain comprises the sequence FVKQX25LCGSHLVEALYLVCGERGFFYTEKT (SEQ ID NO: 154),
FVNQX25LCGSHLVEALYLVCGERGFFYTDKT (SEQ ID NO: 155),
FVNQX25LCGS HLVE ALYLVC GERGFF YTKPT (SEQ ID NO: 156) or
FVNQX25LCGS HLVE ALYLVC GERGFF YTPKT (SEQ ID NO: 157) wherein
X8 is histidine or threonine;
X21 is alanine, glycine or asparagine; X25 is selected from the group consisting of histidine and threonine and Rs3 is COOH or CONH2. In one embodiment the A chain comprises a sequence GIVEQCCTSICSLYQLENYCN-R53 (SEQ ID NO: 1) and said B chain comprises a sequence F VNQHLC GS HLVE ALYLVCGERGFFYTPKT (SEQ ID NO: 2) wherein R53 is COOH or CONH2.
In one embodiment the insulin peptide is a single chain insulin analog. In one embodiment the peptide linker joining the B and A chains is selected from the group consisting of SSSSKAPPPSLPSPSRLPGPSDTPILPQR (SEQ ID NO: 158),
SSSSRAPPPSLPSPSRLPGPSDTPILPQK (SEQ ID NO: 159), GAGSSSX57X58 (SEQ ID NO: 76), GYGSSSX57X58 (SEQ ID NO: 21) and GYGSSSX57X58APQT; (SEQ ID NO: 77), wherein X57 and X58 are independently arginine, lysine or ornithine. In one embodiment both X57 and X58 are independently arginines. In one embodiment the peptide linking moiety joining the insulin A and B chains to form a single chain insulin analog is a peptide sequence consisting of GYGSSSRR (SEQ ID NO: 18) GAGSSSRR (SEQ ID NO: 22) or
GAGS S S RR APQT (SEQ ID NO: 23). In accordance with one embodiment, the linker (L in the formula Q-L-Y) is a linking group or a bond that covalently links the insulin peptide to the NHR ligand. In one embodiment the NHR ligand is linked to the side chain of an amino acid at position B28 or B29 of the insulin peptide. In one embodiment the amino acid at position B28 or B29 of the insulin peptide is lysine and the NHR ligand is linked to the side chain of the lysine. In one embodiment the NHR ligand is linked to the insulin peptide via the N-terminal alpha amine of the insulin A or B chain. In one embodiment the NHR ligand is linked to the insulin peptide via an amid bond form between and amino group of the insulin peptide and a carboxy group of the NHR ligand, optionally through a spacer moiety.
In one embodiment the linker (L in the formula Q-L-Y) is a linking group wherein L is stable in vivo, hydrolyzable in vivo, or metastable in vivo. In one embodiment L comprises an ether moiety, or an amide moiety, an ester moiety, an acid-labile moiety, a reduction-labile moiety , an enzyme-labile moiety, a hydrazone moiety, a disulfide moiety, or a cathepsin-cleavable moiety.
Structure of the NHR Ligand (Y)
The NHR ligand of the invention (Y) is partly or wholly non-peptidic and is hydrophobic or lipophilic. In some embodiments, the NHR ligand has a molecular weight that is about 5000 daltons or less, or about 4000 daltons or less, or about 3000 daltons or less, or about 2000 daltons or less, or about 1750 daltons or less, or about 1500 daltons or less, or about 1250 daltons or less, or about 1000 daltons or less, or about 750 daltons or less, or about 500 daltons or less, or about 250 daltons or less. The structure of Y can be in accordance with any of the teachings disclosed herein.
In the embodiments described herein, Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Y that is capable of reacting with Q or
L. One skilled in the art could readily determine the position and means of conjugation in view of general knowledge and the disclosure provided herein.
In any of the embodiments described herein wherein Y comprises a tetracyclic skeleton having three 6-membered rings joined to one 5-membered ring or a variation thereof (e.g. a Y that acts at the vitamin D receptor), the carbon atoms of the skeleton are referred to by position number, as shown below:
Figure imgf000023_0001
For example, a modification having a ketone at position-6 refers to the following structure:
Figure imgf000023_0002
NHR Ligand that Acts on a Type I Nuclear Hormone Receptor
In some embodiments of the invention, the NHR ligand (Y) acts on a Type I nuclear hormone receptor. In some embodiments, Y can have any structure that permits or promotes agonist activity upon binding of the ligand to a Type I nuclear hormone receptor, while in other embodiments Y is an antagonist of the Type I nuclear hormone receptor.
In exemplary embodiments, Y comprises a structure as shown in Formula A:
Figure imgf000023_0003
Formula A
wherein R 1 and R 2 , when present, are independently moieties that permit or promote agonist or antagonist activity upon binding of the compound of Formula A to the Type I nuclear hormone receptor; R3 and R4 are independently moieties that permit or promote agonist or antagonist activity upon binding of the compound of Formula A to the Type I nuclear hormone receptor; and each dashed line represents an optional double bond. Formula A may further comprise one or more substituents at one or more of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 11, 12, 14, 15, 16, 17, 18, and 19. Contemplated optional substituents include, but are not limited to, OH, N¾, ketone, and Q-Cis alkyl groups.
In some embodiments, Y comprises a structure of Formula A wherein R1 is present and is hydrogen, d-C7 alkyl; (C0_C3 alkyl)C(0)Ci-C7 alkyl, (C0_C3 alkyl)C(0)aryl, or S03H;
R is present and is hydrogen, halo, OH, or Ci-C7 alkyl;
R is hydrogen, halo, OH, or Ci-C7 alkyl;
R4 is hydrogen, (Co-C8 alkyl)halo, Ci_C8 alkyl, C2-C8 alkenyl, C2-i8 alkynyl, heteroalkyl, (C0_C8 alkyl)aryl, (C0_C8 alkyl)heteroaryl, (C0_C8 alkyl)OCi_C8 alkyl, (C0_C8 alkyl)OC2-C8 alkenyl, (C0-C8 alkyl)OC2-C8 alkynyl, (C0-C8 alkyl)OH, (C0-C8 alkyl)SH, (C0- C8 alkyl)NR24Ci_C8 alkyl, (C0-C8 alkyl)NR24C2-C8 alkenyl, (C0-C8 alkyl)NR24C2-C8 alkynyl, (Co-C8 alkyl)NR24H2, (C0_C8 alkyl)C(0)Ci_C8 alkyl, (C0_C8 alkyl)C(0)C2_C8 alkenyl, (C0_C8 alkyl)C(0)C2_C8 alkynyl, (C0_C8 alkyl)C(0)H, (C0_C8 alkyl)C(0)aryl, (C0_C8
alkyl)C(0)heteroaryl, (C0-C8 alkyl)C(0)OCi_C8 alkyl, (C0-C8 alkyl)C(0)OC2-C8 alkenyl, (C0- C8 alkyl)C(0)OC2-C8 alkynyl, (C0-C8 alkyl)C(0)OH, (C0-C8 alkyl)C(0)0 aryl, (C0-C8 alkyl)C(0)0 heteroaryl, (C0_C8 alkyl)OC(0)Ci_C8 alkyl, (C0_C8 alkyl)OC(0)C2_C8 alkenyl, (Co-C8 alkyl)OC(0)C2_Ci8 alkynyl, (C0_C8 alkyl)C(0)NR24Ci_C8 alkyl, (C0_C8
alkyl)C(0)NR24C2-C8 alkenyl, (C0-C8 alkyl)C(0)NR24C2-C8 alkynyl, (C0-C8
alkyl)C(0)NR24H2, (C0-C8 alkyl)C(0)NR24aryl, (C0-C8 alkyl)C(0)NR24heteroaryl, (C0-C8 alkyl)NR24C(0)Ci_C8 alkyl, (C0_C8 alkyl)NR24C(0)C2_C8 alkenyl, or (C0_C8
alkyl)NR24C(0)C2_C8 alkynyl, (C0_C8 alkyl)NR24C(0)OH, (C0_C8 alkyl)OC(0)OCi_C8 alkyl, (Co-C8 alkyl)OC(0)OC2-C8 alkenyl, (C0-C8 alkyl)OC(0)OC2-C8 alkynyl, (C0-C8
alkyl)OC(0)OH, (C0-C8 alkyl)OC(0)NR24Ci_C8 alkyl, (C0-C8 alkyl)OC(0)NR24C2-C8 alkenyl, (C0_C8 alkyl)OC(0)NR24C2_C8 alkynyl, (C0_C8 alkyl)OC(0)NR24H2, (C0_C8 alkyl)NR24(0)OCi_C8 alkyl, (C0_C8 alkyl)NR24(0)OC2_C8 alkenyl, (C0_C8 alkyl)NR24(0)OC2_ C8 alkynyl, or (C0-C8 alkyl)NR24(0)OH; and,
R 24 is hydrogen or Ci_C7 alkyl.
In some embodiments, R 1 is hydrogen, propionate, acetate, benzoate, or sulfate; R 2 is hydrogen or methyl; R3 is hydrogen or methyl; and R4 is acetate, cypionate, hemisucciniate, enanthate, or propionate.
In embodiments wherein Y comprises a structure of Formula A, Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula A that is capable of reacting with Q or L. One skilled in the art could readily determine the position of conjugation on Formula A and means of conjugation of Formula A to Q or L in view of general knowledge and the disclosure provided herein. In some embodiments, Formula A is conjugated to L or Q at any of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 of Formula A. In some embodiments, Formula A is conjugated to L or Q at position 1, 3, 6, 7, 12, 10, 13, 16, 17, or 19 of Formula A.
In some embodiments, Y acts at an estrogen receptor (e.g. ERa, ERP). In some embodiments, Y permits or promotes agonist activity at the estrogen receptor, while in other embodiments Y is an antagonist of ER. In exemplary embodiments, Y can have a structure of Formula B:
Figure imgf000025_0001
Formula B
wherein R1, R5 and R6 are moieties that permit or promote agonist or antagonist activity upon binding of the compound of Formula B to the estrogen receptor. In some
embodiments, Formula B further comprises one or more substitutents at one or more of positions 1, 2, 4, 6, 7, 8, 9, 11, 12, 14, 15, and 16 (e.g. a ketone at position-6).
In some embodiments when Y comprises a structure of Formula B, wherein
R1 is hydrogen, C1-C7 alkyl; (C0-C3 alkyl)C(0)Ci-C7 alkyl, (C0-C3 alkyl)C(0)aryl, or S03H;
R5 is hydrogen, (Co-C8 alkyl)halo, Ci_C8 alkyl, C2-C8 alkenyl, C2-i8 alkynyl, heteroalkyl, (C0-C8 alkyl)aryl, (C0-C8 alkyl)heteroaryl, (C0-C8 alkyl)OCi_C8 alkyl, (C0-C8 alkyl)OC2-C8 alkenyl, (C0-C8 alkyl)OC2-C8 alkynyl, (C0-C8 alkyl)OH, (C0-C8 alkyl)SH, (C0- C8 alkyl)NR24Ci_C8 alkyl, (C0_C8 alkyl)NR24C2_C8 alkenyl, (C0_C8 alkyl)NR24C2_C8 alkynyl, (Co-C8 alkyl)NR24H2, (C0_C8 alkyl)C(0)Ci_C8 alkyl, (C0_C8 alkyl)C(0)C2_C8 alkenyl, (C0_C8 alkyl)C(0)C2-C8 alkynyl, (C0-C8 alkyl)C(0)H, (C0-C8 alkyl)C(0)aryl, (C0-C8
alkyl)C(0)heteroaryl, (C0-C8 alkyl)C(0)OCi_C8 alkyl, (C0-C8 alkyl)C(0)OC2-C8 alkenyl, (C0- C8 alkyl)C(0)OC2_C8 alkynyl, (C0_C8 alkyl)C(0)OH, (C0_C8 alkyl)C(0)0 aryl, (C0_C8 alkyl)C(0)0 heteroaryl, (C0_C8 alkyl)OC(0)Ci_C8 alkyl, (C0_C8 alkyl)OC(0)C2_C8 alkenyl, (Co-C8 alkyl)OC(0)C2-Ci8 alkynyl, (C0-C8 alkyl)C(0)NR24Ci_C8 alkyl, (C0-C8
alkyl)C(0)NR24C2-C8 alkenyl, (C0-C8 alkyl)C(0)NR24C2-C8 alkynyl, (C0-C8
alkyl)C(0)NR24H2, (C0_C8 alkyl)C(0)NR24aryl, (C0_C8 alkyl)C(0)NR24heteroaryl, (C0_C8 alkyl)NR24C(0)Ci_C8 alkyl, (C0_C8 alkyl)NR24C(0)C2_C8 alkenyl, or (C0_C8
alkyl)NR24C(0)C2-C8 alkynyl, (C0-C8 alkyl)NR24C(0)OH, (C0-C8 alkyl)OC(0)OCi_C8 alkyl, (Co-C8 alkyl)OC(0)OC2-C8 alkenyl, (C0-C8 alkyl)OC(0)OC2-C8 alkynyl, (C0-C8 alkyl)OC(0)OH, (C0_C8 alkyl)OC(0)NR 4Ci_C8 alkyl, (C0_C8 alkyl)OC(0)NR 4C2_C8 alkenyl, (C0_C8 alkyl)OC(0)NR24C2_C8 alkynyl, (C0_C8 alkyl)OC(0)NR24H2, (C0_C8 alkyl)NR24(0)OCi_C8 alkyl, (C0-C8 alkyl)NR24(0)OC2-C8 alkenyl, (C0-C8 alkyl)NR24(0)OC2- C8 alkynyl, or (C0-C8 alkyl)NR24(0)OH;
R6 is hydrogen, Ci_C8 alkyl, C2-C8 alkenyl, C2-C8 alkynyl, heteroalkyl, (Co-C8 alkyl)aryl, (C0_C8 alkyl)heteroaryl, (C0_C8 alkyl)C(0)Ci_C8 alkyl, (C0_C8 alkyl)C(0)C2_C8 alkenyl, (C0-C8 alkyl)C(0)C2-C8 alkynyl, (C0-C8 alkyl)C(0)H, (C0-C8 alkyl)C(0)aryl, (C0-C8 alkyl)C(0)heteroaryl, (C0-C8 alkyl)C(0)OCi_C8 alkyl, (C0-C8 alkyl)C(0)OC2-C8 alkenyl, (C0- C8 alkyl)C(0)OC2_C8 alkynyl, (C0_C8 alkyl)C(0)OH, C0_C8 alkyl)C(0)0 aryl, (C0_C8 alkyl)C(0)0 heteroaryl, (C0_C8 alkyl)C(0)NR24Ci_C8 alkyl, (C0_C8 alkyl)C(0)NR24C2_C8 alkenyl, (C0-C8 alkyl)C(0)NR24C2-C8 alkynyl, (C0-C8 alkyl)C(0)NR24H2, (C0-C8
alkyl)C(0)NR24aryl, or (C0-C8 alkyl)C(0)NR24heteroaryl; and
R 24 is hydrogen or Ci_C7 alkyl.
For example, R1 is hydrogen, propionate, acetate, benzoate, or sulfate; R5 is hydrogen, ethynyl, hydroxyl; and R6 is acetate, cypionate, hemisucciniate, enanthate, or propionate.
Nonlimiting examples of the compound of Formula B include 17P-estradiol, modified forms of estradiol such as β-estradiol 17-acetate, β-estradiol 17-cypionate, β- estradiol 17-enanthate, β-estradiol 17-valerate, β-estradiol 3,17-diacetate, β-estradiol 3,17- dipropionate, β-estradiol 3 -benzoate, β-estradiol 3 -benzoate 17-n-butyrate, β-estradiol 3- glycidyl ether, β-estradiol 3-methyl ether, β-estradiol 6-one, β-estradiol 3-glycidyl, β- estradiol 6-one 6-(0-carboxymethyloxime), 16-epiestriol, 17-epiestriol, 2-methoxy estradiol, 4-methoxy estradiol, estradiol 17-phenylpropionate, and 17β-estradiol 2-methyl ether, 17a- ethynylestradiol, megestrol acetate, estriol, and derivatives thereof. In some embodiments, carbon 17 has a ketone substituent and R5 and R6 are absent (e.g. estrone). Some of the aforementioned compounds of Formula B are shown below:
Figure imgf000027_0001
Figure imgf000027_0002
Ethylyl estradiol
In embodiments wherein Y comprises a structure of Formula B, Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula B that is capable of reacting with Q or L. One skilled in the art could readily determine the position of conjugation on Formula B and means of conjugation of Formula B to Q or L in view of general knowledge and the disclosure provided herein. In some embodiments, Formula B is conjugated to L or Q at any of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 of Formula B. In some embodiments, Formula B is conjugated to L or Q at position 3 or 17 of Formula B.
In other embodiments, Y acts at an estrogen receptor but is not encompassed by Formula B. Nonlimiting examples of ligands that act at an estrogen receptor that are not encompassed by Formula B are shown below:
Figure imgf000027_0003
In some embodiments, Y acts at a glucocorticoid receptor (GR). In some
embodiments, Y comprises any structure that permits or promotes agonist activity at the GR, while in other embodiments Y is an antagonist of GR. In exemplary embodiments, Y comprises a structure of Formula C:
Figure imgf000028_0001
Formula C
wherein R 2", R 3J, 6 7 8 9 10
R , R', R°, R and R1U are each independently moieties that permit or promote agonist or antagonist activity upon the binding of the compound of Formula C to the GR; and each dash represents an optional double bond. In some embodiments, Formula C further comprises one or more substituents at one or more of positions 1, 2, 4, 5, 6, 7, 8, 9, 11, 12, 14, and 15 (e.g. hydroxyl or ketone at position- 11).
In some embodiments, Y comprises a structure of Formula C wherein
R is hydrogen, halo, OH, or C1-C7 alkyl;
R is hydrogen, halo, OH, or C1-C7 alkyl;
R6 is hydrogen, Ci_C8 alkyl, C2-C8 alkenyl, C2-C8 alkynyl, heteroalkyl, (Co-C8 alkyl)aryl, (C0_C8 alkyl)heteroaryl, (C0_C8 alkyl)C(0)Ci_C8 alkyl, (C0_C8 alkyl)C(0)C2_C8 alkenyl, (C0-C8 alkyl)C(0)C2-C8 alkynyl, (C0-C8 alkyl)C(0)H, (C0-C8 alkyl)C(0)aryl, (C0-C8 alkyl)C(0)heteroaryl, (C0-C8 alkyl)C(0)OCi_C8 alkyl, (C0-C8 alkyl)C(0)OC2-C8 alkenyl, (C0- C8 alkyl)C(0)OC2_C8 alkynyl, (C0_C8 alkyl)C(0)OH, C0_C8 alkyl)C(0)0 aryl, (C0_C8 alkyl)C(0)0 heteroaryl, (C0_C8 alkyl)C(0)NR24Ci_C8 alkyl, (C0_C8 alkyl)C(0)NR24C2_C8 alkenyl, (C0-C8 alkyl)C(0)NR24C2-C8 alkynyl, (C0-C8 alkyl)C(0)NR24H2, (C0-C8
alkyl)C(0)NR24aryl, or (C0-C8 alkyl)C(0)NR24heteroaryl;
R is hydrogen, Ci_C8 alkyl, C2-C8 alkenyl, C2-C8 alkynyl, heteroalkyl, (Co-C8 alkyl)aryl, (C0_C8 alkyl)heteroaryl, (C0 alkyl)C(0)Ci_C8 alkyl, (C0 alkyl)C(0)C2_C8 alkenyl, (Co alkyl)C(0)C2-C8 alkynyl, (C0)C(O)aryl, (C0)C(O)heteroaryl, (C0)C(O)OCi_C8 alkyl, (C0 alkyl)C(0)OC2-C8 alkenyl, (C0 alkyl)C(0)OC2-C8 alkynyl, or (C0 alkyl)C(0)OH;
R is hydrogen or Ci_C7 alkyl;
R9 is hydrogen or Ci_C7 alkyl;
R10 is hydrogen or OH; and
R 24 is hydrogen or Ci_C7 alkyl. For example, R2 is hydrogen or methyl; R3 is hydrogen, fluoro, chloro, or methyl; R6 is hydrogen or C(O) C1-C7 alkyl; R7 is hydrogen, C(0)CH3, or C(0)CH2CH3; R8 is hydrogen or methyl; R9 is hydrogen or methyl; and R10 is hydroxyl.
Nonlimiting examples of structures of Formula C include:
Figure imgf000029_0001
Beclometasone
Cortisol Cortisone acetate
Figure imgf000029_0002
Prednisone Prednisolone Methylprednisolone
Betamethasone Triamcinolone Dexamethasone and derivatives thereof.
In embodiments wherein Y comprises a structure of Formula C, Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula C that is capable of reacting with Q or L. One skilled in the art could readily determine the position of conjugation on Formula C and means of conjugation of Formula C to Q or L in view of general knowledge and the disclosure provided herein. In some embodiments, Formula C is conjugated to L or Q at any of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, or 23 of Formula C. In some embodiments, Formula C is conjugated to L or Q at position 3, 10, 16 or 17 of Formula C.
In some embodiments, Y acts at a mineralcorticoid receptor (MR). In some embodiments, Y comprises any structure that permits or promotes agonist activity at the MR, while in other embodiments Y is an antagonist of MR. In exemplary embodiments, Y comprises a structure of Formula D:
Figure imgf000030_0001
Formula D
wherein R 2 , R 3 ,R 7 and R 10 are each independently a moiety that permits or promotes agonist or antagonist activity upon binding of the compound of Formula D to the MR; and the dashed line indicates an optional double bond. In some embodiments, Formula D further comprises one or more substituents at one or more of positions 1, 2, 4, 5, 6, 7, 8, 11, 12, 14, 15, 16, and 17.
In some embodiments, Y comprises a structure of Formula D wherein
R is hydrogen, halo, OH, or C1-C7 alkyl;
R is hydrogen, halo, OH, or C1-C7 alkyl;
R is hydrogen, Ci_C8 alkyl, C2-C8 alkenyl, C2-C8 alkynyl, heteroalkyl, (Co-Cs alkyl)aryl, (C0_C8 alkyl)heteroaryl, (C0 alkyl)C(0)Ci_C8 alkyl, (C0 alkyl)C(0)C2-C8 alkenyl, (Co alkyl)C(0)C2_C8 alkynyl, (C0)C(O)aryl, (C0)C(O)heteroaryl, (C0)C(O)OCi_C8 alkyl, (C0 alkyl)C(0)OC2-C8 alkenyl, (C0 alkyl)C(0)OC2-C8 alkynyl, or (C0 alkyl)C(0)OH;
R10 is hydrogen or OH; and
R 24 is hydrogen or Ci_C7 alkyl.
For example, R 2 is hydrogen or methyl; R 3 is hydrogen, fluoro, chloro, or methyl; R 7 is hydrogen, C(0)CH3, or C(0)CH2CH3; and R10 is hydroxyl.
Nonlimiting examples of compounds of Formula D include:
Figure imgf000031_0001
Aldosterone Deoxycorticosterone acetate Deoxycorticosterone acetate and derivatives thereof.
In embodiments wherein Y comprises a structure of Formula D, Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula D that is capable of reacting with Q or L. One skilled in the art could readily determine the position of conjugation on Formula D and means of conjugation of Formula D to Q or L in view of general knowledge and the disclosure provided herein. In some embodiments, Formula D is conjugated to L or Q at any of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, or 24 of Formula D. In some embodiments, Formula D is conjugated to L or Q at position 3, 10, 13, or 17 of Formula D.
In some embodiments, Y acts at a progesterone receptor (PR). In some
embodiments, Y comprises any structure that permits or promotes agonist activity at the PR, while in other embodiments Y is an antagonist of PR. In exemplary embodiments, Y comprises a structure of Formula E:
Figure imgf000031_0002
Formula E
wherein R 2 , R 3 , R 4 , and R 7 are each independently moieties that permit or promote agonist or antagonist activity upon binding of the compound of Formula E to the PR; and the dashed line indicates an optional double bond. In some embodiments, Formula E further comprises one or more substituents at one or more of positions 1, 2, 4, 5, 6, 7, 8, 11, 12, 14, 15, 16, and 17 (e.g. a methyl group at position 6).
In some embodiments, Y comprises a structure of Formula E wherein
R is hydrogen, halo, OH, or C1-C7 alkyl;
R is hydrogen, halo, OH, or C1-C7 alkyl; R4 is hydrogen, (Co-C8 alkyl)halo, Ci_C8 alkyl, C2-C8 alkenyl, C2-i8 alkynyl, heteroalkyl, (C0_C8 alkyl)aryl, (C0_C8 alkyl)heteroaryl, (C0_C8 alkyl)OCi_C8 alkyl, (C0_C8 alkyl)OC2-C8 alkenyl, (C0-C8 alkyl)OC2-C8 alkynyl, (C0-C8 alkyl)OH, (C0-C8 alkyl)SH, (C0- C8 alkyl)NR24Ci_C8 alkyl, (C0-C8 alkyl)NR24C2-C8 alkenyl, (C0-C8 alkyl)NR24C2-C8 alkynyl, (Co-C8 alkyl)NR24H2, (C0_C8 alkyl)C(0)Ci_C8 alkyl, (C0_C8 alkyl)C(0)C2_C8 alkenyl, (C0_C8 alkyl)C(0)C2_C8 alkynyl, (C0_C8 alkyl)C(0)H, (C0_C8 alkyl)C(0)aryl, (C0_C8
alkyl)C(0)heteroaryl, (C0-C8 alkyl)C(0)OCi_C8 alkyl, (C0-C8 alkyl)C(0)OC2-C8 alkenyl, (C0- C8 alkyl)C(0)OC2-C8 alkynyl, (C0-C8 alkyl)C(0)OH, (C0-C8 alkyl)C(0)0 aryl, (C0-C8 alkyl)C(0)0 heteroaryl, (C0_C8 alkyl)OC(0)Ci_C8 alkyl, (C0_C8 alkyl)OC(0)C2_C8 alkenyl, (Co-C8 alkyl)OC(0)C2_Ci8 alkynyl, (C0_C8 alkyl)C(0)NR24Ci_C8 alkyl, (C0_C8
alkyl)C(0)NR24C2-C8 alkenyl, (C0-C8 alkyl)C(0)NR24C2-C8 alkynyl, (C0-C8
alkyl)C(0)NR24H2, (C0-C8 alkyl)C(0)NR24aryl, (C0-C8 alkyl)C(0)NR24heteroaryl, (C0-C8 alkyl)NR24C(0)Ci_C8 alkyl, (C0_C8 alkyl)NR24C(0)C2_C8 alkenyl, or (C0_C8
alkyl)NR24C(0)C2_C8 alkynyl, (C0_C8 alkyl)NR24C(0)OH, (C0_C8 alkyl)OC(0)OCi_C8 alkyl, (Co-C8 alkyl)OC(0)OC2-C8 alkenyl, (C0-C8 alkyl)OC(0)OC2-C8 alkynyl, (C0-C8
alkyl)OC(0)OH, (C0-C8 alkyl)OC(0)NR24Ci_C8 alkyl, (C0-C8 alkyl)OC(0)NR24C2-C8 alkenyl, (C0_C8 alkyl)OC(0)NR24C2_C8 alkynyl, (C0_C8 alkyl)OC(0)NR24H2, (C0_C8 alkyl)NR24(0)OCi_C8 alkyl, (C0_C8 alkyl)NR24(0)OC2_C8 alkenyl, (C0_C8 alkyl)NR24(0)OC2_ C8 alkynyl, or (C0-C8 alkyl)NR24(0)OH;
R is hydrogen, Ci_C8 alkyl, C2-C8 alkenyl, C2-C8 alkynyl, heteroalkyl, (Co-C8 alkyl)aryl, (C0_C8 alkyl)heteroaryl, (C0 alkyl)C(0)Ci_C8 alkyl, (C0 alkyl)C(0)C2_C8 alkenyl, (Co alkyl)C(0)C2_C8 alkynyl, (C0)C(O)aryl, (C0)C(O)heteroaryl, (C0)C(O)OCi_C8 alkyl, (C0 alkyl)C(0)OC2-C8 alkenyl, (C0 alkyl)C(0)OC2-C8 alkynyl, or (C0 alkyl)C(0)OH; and
R 24 is hydrogen or C 1-C7 alkyl.
For example, R 2 is hydrogen or methyl; R 3 is hydrogen or methyl; R 4 is (Ci alkyl)C(0)Ci_C4 alkyl, acetate, cypionate, hemisucciniate, enanthate, or propionate; and R is hydrogen, C(0)CH3, or C(0)CH2CH3.
Nonlimiting examples of compounds of Formula E include:
Figure imgf000033_0001
Progesterone 19-nor-progesterone Medroxyprogesterone , and derivatives thereof.
In embodiments wherein Y comprises a structure of Formula E, Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula E that is capable of reacting with Q or L. One skilled in the art could readily determine the position of conjugation on Formula E and means of conjugation of Formula E to Q or L in view of general knowledge and the disclosure provided herein. In some embodiments, Formula E is conjugated to L or Q at any of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, or 24 of Formula E. In some embodiments, Formula E is conjugated to L or Q through position 3 or 17 of Formula E.
In other embodiments, Y acts at a progesterone receptor but is not encompassed by Formula E. For example, Y can comprise the below structure and analogs thereof:
Figure imgf000033_0002
Norethindrone
In some embodiments, Y acts at an androgen receptor (AR). In some embodiments, Y comprises any structure that permits or promotes agonist activity at the AR, while in other embodiments Y is an antagonist of AR. In exemplary embodiments, Y comprises a structure of Formula F:
Figure imgf000033_0003
Formula F
wherein R1, when present, R2, R3 and R6 are each independently a moiety that permits or promotes agonist or antagonist activity upon binding of the compound of Formula F to the AR; and each dashed line represents an optional double bond, with the proviso that no more than one of the optional carbon-carbon double bond is present at position 5. In some embodiments, Formula F further comprises one or more substituents at one or more of positions 1, 2, 4, 5, 6, 7, 8, 11, 12, 14, 15, 16, and 17.
In some embodiments, Y comprises a structure of Formula F wherein
R1 is hydrogen, C1-C7 alkyl; (C0-C3 alkyl)C(0)Ci-C7 alkyl, (C0_C3 alkyl)C(0)aryl, or
S03H;
R is hydrogen, halo, OH, or Ci-C7 alkyl;
R is hydrogen, halo, OH, or Ci-C7 alkyl;
R6 is hydrogen, Ci_C8 alkyl, C2-C8 alkenyl, C2-C8 alkynyl, heteroalkyl, (Co-C8 alkyl)aryl, (C0-C8 alkyl)heteroaryl, (C0-C8 alkyl)C(0)Ci_C8 alkyl, (C0-C8 alkyl)C(0)C2-C8 alkenyl, (C0-C8 alkyl)C(0)C2-C8 alkynyl, (C0-C8 alkyl)C(0)H, (C0-C8 alkyl)C(0)aryl, (C0-C8 alkyl)C(0)heteroaryl, (C0_C8 alkyl)C(0)OCi_C8 alkyl, (C0_C8 alkyl)C(0)OC2_C8 alkenyl, (C0_ C8 alkyl)C(0)OC2_C8 alkynyl, (C0_C8 alkyl)C(0)OH, C0_C8 alkyl)C(0)0 aryl, (C0_C8 alkyl)C(0)0 heteroaryl, (C0-C8 alkyl)C(0)NR24Ci_C8 alkyl, (C0-C8 alkyl)C(0)NR24C2-C8 alkenyl, (C0-C8 alkyl)C(0)NR24C2-C8 alkynyl, (C0-C8 alkyl)C(0)NR24H2, (C0-C8
alkyl)C(0)NR24aryl, or (C0-C8 alkyl)C(0)NR24heteroaryl; and
R 24 is hydrogen or Ci_C7 alkyl.
For example, R 1 is hydrogen or absent; R2 is hydrogen or methyl; R 3 is hydrogen or methyl; and R6 is H or absent.
Nonlimiting examples of compounds of Formula F include:
Figure imgf000034_0001
Testosterone Dehydroepiandrosterone Androstenedione
Figure imgf000034_0002
5-Androstenediol Androsterone Dihydrotestosterone and derivatives thereof. In embodiments wherein Y comprises a structure of Formula F, Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula F that is capable of reacting with Q or L. One skilled in the art could readily determine the position of conjugation on Formula F and means of conjugation of Formula F to Q or L in view of general knowledge and the disclosure provided herein. In some embodiments, Formula F is conjugated to L or Q at any of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, or 22 of Formula F. In some embodiments, Formula F is conjugated to L or Q at position 3 or 17 of Formula F.
In some embodiments, the binding of the NHR ligand to the Type I nuclear hormone receptor results in agonist activity (or antagonist activity) in some but not all cells or tissues expressing the Type I nuclear hormone receptor.
NHR Ligand that Acts on a Type II Nuclear Hormone Receptor
In some embodiments of the invention, the NHR ligand (Y) acts on a Type II nuclear hormone receptor. In some embodiments, Y can have any structure that permits or promotes agonist activity upon binding of the ligand to a Type II nuclear hormone receptor, while in other embodiments Y is an antagonist of the Type II nuclear hormone receptor. In exemplary embodiments, Y exhibits agonist (or antagonist) activity at a thyroid hormone receptor (TR), retinoic acid receptor (RAR), peroxisome proliferator activated receptor (PPAR), Liver X Receptor (LXR), farnesoid X receptor (FXR), vitamin D receptor (VDR), and/or pregnane X receptor (PXR).
In some embodiments, Y acts at a thyroid hormone receptor (e.g. TRa, TRP). In some embodiments, Y comprises any structure that permits or promotes agonist activity at the TR, while in other embodiments Y is an antagonist of TR. In one embodiment a thyroid hormone receptor agonist is provided having the general structure of
Figure imgf000035_0001
wherein
Ri5 is Ci-C4 alkyl, -CH2(pyridazinone), -CH2(OH)(phenyl)F, -CH(OH)CH3, halo or
H; R20 is halo, CH3 or H;
R21 is halo, CH3 or H;
R22 is H, OH, halo, -CH2(OH)(C6 aryl)F, or C1-C4 alkyl; and
R23 is -CH2CH(NH2)COOH, -OCH2COOH, -NHC(0)COOH, -CH2COOH, -NHC(0)CH2COOH, -CH2CH2COOH, or -OCH2P03 2".
In accordance with one embodiment the thyroid hormone receptor agonist is a compound of the general structure
Figure imgf000036_0001
wherein
Ri5 is C1-C4 alkyl, -CH(OH)CH3, 1 or H
R20 is I, Br, CH3 or H;
R21 is I, Br, CH3 or H;
R22 is H, OH, I, or C1-C4 alkyl; and
R23 is -CH2CH(NH2)COOH, -OCH2COOH, -NHC(0)COOH, -CH2COOH, -NHC(0)CH2COOH, -CH2CH2COOH, or -OCH2P03 2". In one embodiment R23 is
CH2CH(NH2)COOH.
In accordance with one embodiment the thyroid hormone receptor agonist is a compound of the general structure
Figure imgf000036_0002
wherein
Ri5 is isopropyl, -CH(OH)CH3, 1 or H
R20 is I, Br, CI, or CH3;
R21 is I, Br, CI, or CH3;
R22 is H; and
R23 is -OCH2COOH, -CH2COOH, -NHC(0)CH2COOH, or -CH2CH2COOH. In accordance with one embodiment the thyroid hormone receptor agonist is a compound of the general structure of Formula I:
Figure imgf000037_0001
R20, R21, and R22 are independently selected from the group consisting of H, OH, halo and Ci-C4 alkyl; and
Ri5 is halo or H. In one embodiment R2o and R2i are each CH3, R15 is H and R22 are independently selected from the group consisting of H, OH, halo and Ci-C4 alkyl. In one embodiment R2o, R21 and R22 are each halo and R15 is H or halo. In a further embodiment R20, R21 and R22 are each I, and R15 is H or I. In accordance with one embodiment Y is selected from the group consisting of thyroxine T4 (3,5,3',5'-tetraiodothyronine) and 3,5,3'- triiodo L- thyronine.
In one embodiment, the thyroid hormone receptor ligand (Y) of the Q-L-Y conjugates, is an indole derivative of thyroxine, including for example, compounds disclosed in U.S. Pat. No. 6,794,406 and US published application no. US 2009/0233979, the disclosures of which are incorporated herein. In one embodiment the indole derivative of thyroxine comprises a compound of the general structure of Formula II:
Figure imgf000037_0002
wherein
Ri3 is H or Ci-C4 alkyl;
Ri4 is Ci-Cg alkyl;
Ri5 is H or Ci-C4 alkyl; and
Ri6 and Ri7 are independently halo or Ci-C4 alkyl.
In one embodiment, the thyroid receptor ligand (Y) of the Q-L-Y conjugates, is an indole derivative of thyroxine as disclosed in W097/21993 (U. Cal SF), WO99/00353 (KaroBio), GB98/284425 (KaroBio), and U.S. Provisional Application 60/183,223, the disclosures of which are incorporated by reference herein. In one embodiment the thyroid receptor ligand comprises the general structure of Formula III:
Figure imgf000038_0001
wherein X is oxygen, sulfur, carbonyl, methylene, or NH;
Y is (CH2)n, where n is an integer from 1 to 5, or C=C;
Ri is halogen, trifluoromethyl, or Ci-C6 alkyl or C3-C7 cycloalkyl;
R2 and R3 are the same or different and are hydrogen, halogen, Ci-C6 alkyl or C3-C7 cycloalkyl, with the proviso that at least one of R2 and R3 being other than hydrogen;
R4 is hydrogen or Ci-C4 alkyl;
R5 is hydrogen or Ci-C4 alkyl;
R6 is carboxylic acid, or ester thereof;
R7 is hydrogen, or an alkanoyl or aroyl group.
Nonlimiting examples of Y include the following compounds:
Figure imgf000038_0002
Thyroxine (T4) ^ Triiodothyroxine (T3) , and derivatives thereof.
In embodiments wherein Y comprises a structure that permits or promotes agonist or antagonist activity at a TR, Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Y that is capable of reacting with Q or L. One skilled in the art could readily determine the position of conjugation on Y and means of conjugation of Y to Q or L in view of general knowledge and the disclosure provided herein. In some embodiments, Y is conjugated to L or Q through any position of Y. In some embodiments, Y is conjugated to L or Q through the carboxylic acid or amine moieties, as indicated below:
Figure imgf000039_0001
In some embodiments, Y acts at a retinoic acid receptor (e.g. RARa, RARp, RARy). In some embodiments, Y comprises any structure that permits or promotes agonist activity at the RAR, while in other embodiments Y is an antagonist of RAR. In exemplary
embodiments, Y comprises a structure of Formula G:
Figure imgf000039_0002
Formula G
wherein R11 is a moiety that permits or promotes agonist or antagonist activity upon the binding of the compound of Formula G to a RAR, and -~w represents either E or Z stereochemistry.
In some embodiments, Y comprises a structure of Formula G wherein R11 is
C(0)OH, CH2OH, or C(0)H. In some embodiments, Y comprises a structure of Formula G wherein R11 is a carboxylic acid derivative (e.g. acyl chloride, anhydride, and ester).
Nonlimiting examples of the compound of Formula G include:
Figure imgf000039_0003
All-trans-retinoic acid Retinol
Figure imgf000039_0004
Retinal 1 1 -cis-retinoic acid , and derivatives thereof.
In embodiments wherein Y comprises a structure of Formula G, Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula G that is capable of reacting with Q or L. One skilled in the art could readily determine the position of conjugation on Y and means of conjugation of Y to Q or L in view of general knowledge and the disclosure provided herein. In some embodiments, Y is conjugated to L or Q through any position of Y. In some embodiments, Formula G is conjugated to L or Q at R11.
In some embodiments, Y acts at a peroxisome proliferator activated receptor (e.g. PPARa, PPARp/δ, PPARy). In some embodiments, Y acts at PPARy. In some
embodiments, Y comprises any structure that permits or promotes agonist activity at the PPAR, while in other embodiments Y is an antagonist of PPAR. In some embodiments, Y is a saturated or unsaturated, halogenated or nonhalogenated free fatty acid (FFA) as described by Formula H:
Figure imgf000040_0001
Formula H
wherein n is 0-26 and each R 12 , when present, is independently a moiety that permits or promotes agonist or antagonist activity upon binding of the compound of Formula H to a
PPAR.
In some embodiments, Y comprises a structure of Formula H, wherein n is 0-26 and each R 12 , when present, is independently hydrogen, C1-C7 alkyl, or halogen. In some embodiments Formula B is saturated such as, for example, formic acid, acetic acid, n-caproic acid, heptanoic acid, caprylic acid, nonanoic acid, capric acid, undecanoic acid, lauric acid, tridecanoic acid, myristic acid, pentadeconoic acid, palmitic acid, heptadecanoic acid, stearic acid, nonadecanoic acid, arachidic acid, heneicosanoic acid, behenic acid, tricosanoic acid, perfluorononanoic acid (see below), perfluorooctanoic acid (see below), and derivatives thereof. In some embodiments Formula H is unsaturated with either cis or trans
stereochemistry such as, for example, mead acid, myristoleic acid, palmitoleic acid, sapienic acid, oleic acid, linoleic acid, a-linolenic acid, elaidic acid, petroselinic acid, arachidonic acid, dihydroxyeicosatetraenoic acid (DiHETE), octadecynoic acid, eicosatriynoic acid, eicosadienoic acid, eicosatrienoic acid, eicosapentaenoic acid, erucic acid, dihomolinolenic acid, docosatrienoic acid, docosapentaenoic acid, docosahexaenoic acid, adrenic acid, and derivatives thereof, including for example:
Figure imgf000040_0002
Perfluorononanoic acid Perfluorooctanoic acid In embodiments wherein Y comprises a structure of Formula H, Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula H that is capable of reacting with Q or L. One skilled in the art could readily determine the position of conjugation on Formula H and means of conjugation of Formula H to Q or L in view of general knowledge and the disclosure provided herein. In some embodiments, Formula H is conjugated to L or Q at any position on Formula H. In some embodiments, Formula H is conjugated to L or Q through the terminal carboxylic acid moiety.
In some of these embodiments, Y is an eiconsanoid. In specific embodiments, Y is a prostaglandin or a leukotriene. In some exemplary embodiments, Y is a prostaglandin having a structure as described by Formulae J1-J6:
Figure imgf000041_0002
wherein each R is independently a moiety that permits or promotes agonist or antagonist activity upon the binding of the compound of Formula J to a PPAR (e.g. PGJ2 as shown below):
H
Figure imgf000041_0001
In some embodiments when Y comprises a structure of any one of Formulae J1-J6, each R 13 is independently C7-C8 alkyl, C7-C8 alkenyl, C7-C8 alkynyl, or heteroalkyl.
In embodiments wherein Y is an eicosanoid, Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of the eicosanoid that is capable of reacting with Q or L. One skilled in the art could readily determine the position of conjugation on Y and means of conjugation of Y to Q or L in view of general knowledge and the disclosure provided herein. In some embodiments, Y is conjugated to L or Q through any position of Y. In some embodiments, the eicosanoid is conjugated to L or Q through a terminal carboxylic acid moiety or through a pendant alcohol moiety.
In some exemplary embodiments, Y is a leukotriene having a structure as described by Formula K or a derivatized form of Formula K:
Figure imgf000042_0001
Formula K
wherein each R is independently a moiety that permits or promotes agonist or antagonist activity upon the binding of the compound of Formula K to a PPAR (e.g. leukotriene B4 as shown below):
Figure imgf000042_0002
In some embodiments when Y comprises a structure of Formula K, each R is independently C3-C13 alkyl, C3-C13 alkenyl, C3-C13 alkynyl, or heteroalkyl.
In embodiments wherein Y comprises a structure of Formula K, Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula K that is capable of reacting with Q or L. One skilled in the art could readily determine the position of conjugation on Formula K and means of conjugation of Formula K to Q or L in view of general knowledge and the disclosure provided herein. In some embodiments, Formula K is conjugated to L or Q at any position on Formula K. In some embodiments, Formula K is conjugated to L or Q through the terminal carboxylic acid moiety or through a pendant alcohol moiety.
In some exemplary embodiments, Y is a thiazolidinedione comprising a structure as described by Formula L:
Figure imgf000042_0003
Formula L.
Nonlimiting examples of the compound of Formula L include:
Figure imgf000043_0001
Rosiglitazone Pioglitazone
Figure imgf000043_0002
Troghtazone ^ and derivatives thereof.
In embodiments wherein Y comprises a structure of Formula L, Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula L that is capable of reacting with Q or L. One skilled in the art could readily determine the position of conjugation on Formula L and means of conjugation of Formula L to Q or L in view of general knowledge and the disclosure provided herein. In some embodiments, Formula L is conjugated to L or Q at any position on Formula L, such as, for example, a pendant alcohol moiety, or through an aromatic substituent.
In one embodiment wherein Y is Tesaglitzar or Aleglitazar:
Figure imgf000043_0003
Figure imgf000043_0004
Aleglitazar.
In embodiments wherein Y comprises Tesaglitzar or Aleglitazar, Y is conjugated L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position that is capable of reacting with Q or L. In one embodiment, Tesaglitzar or Aleglitazar is conjugated to L or Q through the carboxylic acid moiety of the compound. In some embodiments, Y acts at a RAR-related orphan receptor (e.g. RORa, RORp, RORy). In some embodiments, Y comprises any structure that permits or promotes agonist activity at the ROR, while in other embodiments Y is an antagonist of ROR.
Nonlimiting examples of Y include:
Figure imgf000044_0001
Cholesterol Melatonin
Figure imgf000044_0002
CGP 52608 s All-trans-retinoic acid , and derivatives thereof.
In embodiments wherein Y acts at a ROR, Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Y that is capable of reacting with Q or L. One skilled in the art could readily determine the position of conjugation on Y and means of conjugation of Y to Q or L in view of general knowledge and the disclosure provided herein. In some embodiments, Y is conjugated to L or Q through any position of Y, such as, for example, any of the positions previously described herein.
In some embodiments, Y acts at a liver X receptor (LXRa, LXRP). In some embodiments, Y comprises any structure that permits or promotes agonist activity at the LXR, while in other embodiments Y is an antagonist of LXR. In exemplary embodiments, Y is an oxysterol (i.e. oxygenated derivative of cholesterol). Nonlimiting examples of Y in these embodiments include 22(R)-hydroxycholesterol (see below), 24(S)-hydroxycholesterol (see below), 27-hydroxycholesterol, cholestenoic acid, and derivatives thereof.
Figure imgf000044_0003
22(R)-Hydroxycholesterol 24(S)-Hydroxycholesterol In embodiments wherein Y acts at a LXR, Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Y that is capable of reacting with Q or L. One skilled in the art could readily determine the position of conjugation on Y and means of conjugation of Y to Q or L in view of general knowledge and the disclosure provided herein. In some embodiments, Y is conjugated to L or Q at any of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or 26 of Formula F. In some embodiments, Formula F is conjugated to L or Q at position 3 or 17 of Formula F.
In some embodiments, Y acts at the farnesoid X receptor (FXR). In some embodiments, Y comprises any structure that permits or promotes agonist activity at the FXR, while in other embodiments Y is an antagonist of FXR. In some of these
embodiments, Y is a bile acid. In exemplary embodiments, Y has a structure of Formula M:
Figure imgf000045_0001
Formula M
wherein each of R15, R16, and R17 are independently moieties that permit or promote agonist or antagonist activity upon binding of the compound of Formula M to a FXR.
In some embodiments when Y comprises a structure of Formula M, each of R15 and R16 are independently hydrogen, (Co-C8 alkyl)halo, Ci_Ci8 alkyl, C2-Ci8 alkenyl, C2_Ci8 alkynyl, heteroalkyl, or (C0-C8 alkyl)OH; and R17 is OH, (C0-C8 alkyl)NH(Ci-C4
alkyl)S03H, or (C0-C8 alkyl)NH(Ci-C4 alkyl)COOH.
In some embodiments when Y comprises a structure of Formula M, each of R15 and R16 are independently hydrogen or OH; and R17 is OH, NH(C C2 alkyl)S03H, or NH(C C2 alkyl)COOH.
Nonlimiting examples of the compound of Formula M include:
Figure imgf000046_0001
Tauroc olic acid ^ Glycocholic acid and derivatives thereof.
In embodiments wherein Y comprises a structure of Formula M, Y is conjugated to L
(e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula M that is capable of reacting with Q or L. One skilled in the art could readily determine the position of conjugation on Formula M and means of conjugation of Formula M to Q or L in view of general knowledge and the disclosure provided herein. In some embodiments, Formula M is conjugated to L or Q at any of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 of Formula M. In some embodiments, Formula M is conjugated to L or Q at position 3, 7, 12 or 17 of Formula M.
In some embodiments, Y acts at the vitamin D receptor (VDR). In some
embodiments, Y comprises any structure that permits or promotes agonist activity at the VDR, while in other embodiments Y is an antagonist of VDR. In exemplary embodiments, Y has a structure of Formula N:
Figure imgf000047_0001
Formula N
wherein each of R 18 , R 1 , R 20 , R 21 , R 22 , and R 23 are moieties that permit or promote agonist or antagonist activity upon binding of the compound of Formula N to the VDR such as, for example, any of the vitamin D compounds found in Bouillon et al., Endocrine Reviews,
16(2):200-257 (1995).
In some embodiments wherein Y comprises a structure of Formula N,
R18 and R19 are each independently hydrogen, (Co-C8 alkyl)halo, (Co-C8
alkyl)heteroaryl, or (C0-C8 alkyl)OH;
both of R 20 are hydrogen or both of R 20 are taken together to form =CH2;
each of R 21 and R 22 are independently Ci-C4 alkyl; and
R 23 is C4_Cis alkyl, C4_Ci8 alkenyl, C4_Ci8 alkynyl, heteroalkyl, (C4_Ci8 alkyl)aryl, (C4_Ci8 alkyl)heteroaryl, (C0_C8 alkyl)OCi_Ci8 alkyl, (C0_C8 alkenyl)OCi_Ci8 alkyl, (C0_C8 alkynyl)OCi_Ci8 alkyl, (C0_C8 alkyl)OC2_Ci8 alkenyl, (C0_C8 alkyl)OC2_Ci8 alkynyl, (C6_Ci8 alkyl)OH, (C6-Ci8 alkyl)SH, (C6-Ci8 alkenyl)OH, (C6-Ci8 alkynyl)OH, (C0-C8 alkyl)NR24Ci_ Ci8 alkyl, (C0-C8 alkenyl)NR24Ci_Ci8 alkyl, (C0-C8 alkynyl)NR24Ci_Ci8 alkyl, (C0-C8 alkyl)NR24C2_Ci8 alkenyl, (C0_C8 alkyl)NR24C2_Ci8 alkynyl, (C0_C8 alkyl)C(0)Ci_Ci8 alkyl, (Co-C8 alkyl)C(0)C2_Ci8 alkenyl, (C0_C8 alkyl)C(0)C2_Ci8 alkynyl, (C0_C8 alkyl)C(0)H, (C0_ C8 alkyl)C(0)aryl, (C0-C8 alkyl)C(0)heteroaryl, (C0-C8 alkyl)C(0)OCi_Ci8 alkyl, (C0-C8 alkyl)C(0)OC2_Ci8 alkenyl, (C0-C8 alkyl)C(0)OC2_Ci8 alkynyl, (C0-C8 alkyl)C(0)OH, (C0-C8 alkyl)C(0)0 aryl, (C0_C8 alkyl)C(0)0 heteroaryl, (C0_C8 alkyl)OC(0)Ci_Ci8 alkyl, (C0_C8 alkyl)OC(0)C2_Ci8 alkenyl, (C0_C8 alkyl)OC(0)C2_Ci8 alkynyl, (C0_C8 alkyl)C(0)NR24Ci_ Ci8 alkyl, (C0-C8 alkyl)C(0)NR24C2_Ci8 alkenyl, (C0-C8 alkyl)C(0)NR24C2_Ci8 alkynyl, (C0- C8 alkyl)C(0)NR24H2, (C0-C8 alkyl)C(0)NR24aryl, (C0-C8 alkyl)C(0)NR24heteroaryl, (C0-C8 alkyl)NR24C(0)Ci_Ci8 alkyl, (C0_C8 alkyl)NR24C(0)C2_C 8 alkenyl, or (C0_C8
alkyl)NR24C(0)C2_Ci8 alkynyl, (C0_C8 alkyl)NR24C(0)OH, (C0_C8 alkyl)OC(0)OCi_C 18 alkyl, (Co-C8 alkyl)OC(0)OC2_C 18 alkenyl, (C0-C8 alkyl)OC(0)OC2_C 18 alkynyl, (C0-C8 alkyl)OC(0)OH, (C0_C8 alkyl)0C(0)NR 4Ci_C 18 alkyl, (C0_C8 alkyl)OC(0)NR 4C2_C 18 alkenyl, (C0_C8 alkyl)OC(0)NR24C2_C 18 alkynyl, (C0_C8 alkyl)OC(0)NR24H2, (C0_C8 alkyl)NR24(0)OCi_C 18 alkyl, (C0-C8 alkyl)NR24(0)OC2_C 18 alkenyl, (C0-C8
alkyl)NR24(0)OC2_C 18 alkynyl, or (C0-C8 alkyl)NR24(0)OH; and
R 24 is hydrogen or Ci_Ci8 alkyl.
Nonlimiting examples of the compound of Formula N include:
Figure imgf000048_0001
Calcitriol , 25-Hydroxyvitamin D3 , and derivatives thereof.
In embodiments wherein Y comprises a structure of Formula N, Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Formula N that is capable of reacting with Q or L. One skilled in the art could readily determine the position of conjugation on Formula N and means of conjugation of Formula N to Q or L in view of general knowledge and the disclosure provided herein. In some embodiments, Formula N is conjugated to L or Q at any of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, or 26 of Formula N. In some embodiments, Formula N is conjugated to L or Q at position 1, 3, 19, or 25 of Formula N.
In some embodiments, Y acts at the pregnane X receptor (PXR). In some embodiments, Y comprises any structure that permits or promotes agonist activity at the PXR, while in other embodiments Y is an antagonist of PXR. In some embodiments, Y is a steroid, antibiotic, antimycotic, bile acid, hyperforin, or a herbal compound. In exemplary embodiments, Y is compound that is able to induce CYP3A4, such as dexamethasone and rifampicin. In embodiments wherein Y comprises a structure that acts at the PXR, Y is conjugated to L (e.g. when L is a linking group) or Q (e.g. when L is a bond) at any position of Y that is capable of reacting with Q or L. One skilled in the art could readily determine the position of conjugation on Y and means of conjugation of Y to Q or L in view of general knowledge and the disclosure provided herein. In some embodiments, Y is conjugated to L or Q at any of positions on Y. Modification of the NHR Ligand (Y)
In some embodiments, the NHR ligand is derivatized or otherwise chemically modified to comprise a reactive moiety that is capable of reacting with the insulin peptide (Q) or the linking group (L). In the embodiments described herein, Y is derivatized at any position of Y that is capable of reacting with Q or L. The position of derivatization on Y is apparent to one skilled in the art and depends on the type of NHR ligand used and the activity that is desired. For example, in embodiments wherein Y has a structure comprising a tetracyclic skeleton having three 6-membered rings joined to one 5-membered ring or a variation thereof, Y can be derivatized at any of positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25. Other positions of derivatization can be as previously described herein.
The NHR ligand can be derivatized using any agent known to one skilled in the art or described herein. For example, estradiol can be derivatized with succinic acid, succinic anhydride, benzoic acid, ethyl 2-bromoacetate, or iodoacetic acid to form the below derivatives of estradiol that are capable of conjugating to Q or L.
Figure imgf000049_0001
Similarly, any of the aforementioned NHR ligands can be derivatized by methods known in the art. Additionally, certain derivatized ligands are commercially available and can be purchased from chemical companies such as Sigma-Aldrich. In accordance with one embodiment Y is selected from the group consisting of estradiol and derivatives thereof, estrone and derivatives thereof, testosterone and derivatives thereof, and Cortisol and derivatives thereof. In one embodiment Y is dexamethasone. In one embodiment Y is selected from the group consisting of thyroxine T4 (3,5,3',5'-tetra- iodo thyronine), 3,5,3 '-triiodo L-thyronine, Tesaglitazar, Aleglitazar and thiazolidinediones. In one embodiment Y is selected from the group consisting of thyroxine T4 (3,5,3',5'-tetra- iodo thyronine), and 3,5,3 '-triiodo L-thyronine. In one embodiment Y is selected from the group consisting of Tesaglitazar and Aleglitazar. Structure of the Insulin peptide
In some embodiments, the insulin peptide of the presently disclosed conjugates is native insulin, comprising the A chain of SEQ ID NO: 1 and the B chain of SEQ ID NO: 2, or an analog of native insulin, including for example a single-chain insulin analog comprising SEQ ID NOS: 1 and 2. In accordance with the present disclosure analogs of insulin encompass polypeptides comprising an A chain and a B chain wherein the insulin analogs differ from native insulin by one or more amino acid substitutions at positions selected from A5, A8, A9, A10, A12, A14, A15, A17, A18, A21, B l, B2, B3, B4, B5, B9, B IO, B 13, B 14, B 17, B20, B21, B22, B23, B26, B27, B28, B29 and B30 or deletions of any or all of positions B l-4 and B26-30.
In one embodiment the insulin peptide is an insulin analog wherein:
(a) the amino acid residue at position B28 is substituted with Asp, Lys, Leu, Val, or Ala, and the amino acyl residue at position B29 is Lys or Pro;
(b) the amino acid residues at any of positions B27, B28, B29, and B30 are deleted or substituted with a nonnative amino acid. In one embodiment an insulin analog is provided comprising an Asp substituted at position B28 or a Lys substituted at position 28 and a proline substituted at position B29. Additional insulin analogs are disclosed in Chance, et al., U.S. Pat. No. 5,514,646; Chance, et al., U.S. patent application Ser. No. 08/255,297; Brems, et al., Protein Engineering, 5:527-533 (1992); Brange, et al., EPO Publication No. 214,826 (published Mar. 18, 1987); and Brange, et al., Current Opinion in Structural Biology, 1:934-940 (1991). The disclosures of which are expressly incorporated herein by reference.
Insulin analogs may also have replacements of the amidated amino acids with acidic forms. For example, Asn may be replaced with Asp or Glu. Likewise, Gin may be replaced with Asp or Glu. In particular, Asn(A18), Asn(A21), or Asp(B3), or any combination of those residues, may be replaced by Asp or Glu. Also, Gln(A15) or Gln(B4), or both, may be replaced by either Asp or Glu.
As disclosed herein single chain insulin agonists are provided comprising a B chain and an A chain of human insulin, or analogs or derivative thereof, wherein the carboxy terminus of the B chain is linked to the amino terminus of the A chain via a linking moiety. In one embodiment the A chain is an amino acid sequence selected from the group consisting of GIVEQCCTSICSLYQLENYCN (SEQ ID NO: 1),
GIVDECCFRSCDLRRLEMYCA (SEQ ID NO: 5) or GIVEECCFRSCDLALLETYCA (SEQ ID NO: 7) and the B chain comprises the sequence
FVNQHLC GS HLVE AL YLVC GERGFFYTPKT (SEQ ID NO: 2),
GPETLCGAELVDALYLVCGDRGFYFNKPT (SEQ ID NO: 6) or
A YRPS ETLC GGELVDTLYLVC GDRGF YFS RP A (SEQ ID NO: 8), or a carboxy shortened sequence thereof having one to five amino acids corresponding to B26, B27, B28, B29 and B30 deleted, and analogs of those sequences wherein each sequence is modified to comprise one to five amino acid substitutions at positions corresponding to native insulin positions selected from A5, A8, A9, A10, A14, A15, A17, A18, A21, B l, B2, B3, B4, B5, B9, B IO, B 13, B 14, B20, B22, B23, B26, B27, B28, B29 and B30. In one embodiment the amino acid substitutions are conservative amino acid substitutions. Suitable amino acid substitutions at these positions that do not adversely impact insulin's desired activities are known to those skilled in the art, as demonstrated, for example, in Mayer, et al., Insulin Structure and Function, Biopolymers. 2007;88(5):687-713, the disclosure of which is incorporated herein by reference.
Additional amino acid sequences can be added to the amino terminus of the B chain or to the carboxy terminus of the A chain of the single chain insulin agonists of the present invention. For example, a series of negatively charged amino acids can be added to the amino terminus of the B chain, including for example a peptide of 1 to 12, 1 to 10, 1 to 8 or 1 to 6 amino acids in length and comprising one or more negatively charged amino acids including for example glutamic acid and aspartic acid. In one embodiment the B chain amino terminal extension comprises 1 to 6 charged amino acids. In one embodiment the B chain amino terminal extension comprises the sequence GX61X62X63X64X65K (SEQ ID NO: 26) or X61X62X63X64X65RK (SEQ ID NO: 27), wherein X61, X62, X63 Χό4 and X65 are independently glutamic acid or aspartic acid. In one embodiment the B chain comprises the sequence GEEEEEKGPEHLCGAHLVDALYLVCGDX42GFY (SEQ ID NO: 28), wherein X42 is selected from the group consisting of alanine lysine, ornithine and arginine.
High potency NHR ligand-insulin conjugates can also be prepared based on using a modified IGF I and IGF II sequence described in published International application no. WO 2010/080607, the disclosure of which is expressly incorporated herein by reference, as the insulin peptide component. More particularly, analogs of IGF I and IGF II that comprise a substitution of a tyrosine leucine dipeptide for the native IGF amino acids at positions corresponding to B 16 and B 17 of native insulin have a tenfold increase in potency at the insulin receptor.
In accordance with one embodiment the insulin peptide for use in the present disclosure comprises a B chain sequence of R62-
X25LCGX29X3oLVX33X34LYLVCGX4iX42GFX45 (SEQ ID NO: 20) and an A chain sequence of GIVX4X5CCX8X9XioCXi2LXi4Xi5LXi7Xi8Xi9CX2i-R53 (SEQ ID NO: 29) wherein
X4 is glutamic acid or aspartic acid;
X5 is glutamine or glutamic acid
X8 is histidine, threonine or phenylalanine;
X9 is serine, arginine, lysine, ornithine or alanine;
X10 is isoleucine or serine;
X12 is serine or aspartic acid
Xi4 is tyrosine, arginine, lysine, ornithine or alanine;
Xi5 is glutamine, glutamic acid, arginine, alanine, lysine, ornithine or leucine;
Xi7 is glutamine, glutamic acid, arginine, aspartic acid, lysine or ornithine;
Xi8 is methionine, asparagine, glutamine, aspartic acid, glutamic acid or threonine;
X19 is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;
X21 is selected from the group consisting of alanine, glycine, serine, valine, threonine, isoleucine, leucine, glutamine, glutamic acid, asparagine, aspartic acid, histidine, tryptophan, tyrosine, and methionine;
X25 is histidine or threonine;
X29 is selected from the group consisting of alanine, glycine and serine;
X30 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
X33 is selected from the group consisting of aspartic acid, glutamine and glutamic acid; X34 is selected from the group consisting of alanine and threonine;
X4i is selected from the group consisting of glutamic acid, aspartic acid or asparagine;
X42 is selected from the group consisting of alanine, lysine, ornithine and arginine; X45 is tyrosine, histidine, asparagine or phenylalanine;
R62 is selected from the group consisting of AYRPSE (SEQ ID NO: 14), FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptide glycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine, a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine, glutamine, glutamic acid and a bond; and Rs3 is COOH or CONH2. In one embodiment the A chain and the B chain are linked to one another by interchain disulfide bonds, including those that form between the A and B chains of native insulin. In an alternative embodiment the A and B chains are linked together as a linear single chain- insulin peptide.
In one embodiment the conjugates comprise an insulin peptide wherein the A chain comprises a sequence of GIVEQCCXiSICSLYQLENX2CX3 (SEQ ID NO: 30) and said B chain sequence comprises a sequence of X4LCGX5X6LVEALYLVCGERGFF (SEQ ID NO: 31), wherein
Xi is selected from the group consisting of threonine and histidine;
X2 is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;
X3 is selected from the group consisting of asparagine and glycine;
X4 is selected from the group consisting of histidine and threonine;
X5 is selected from the group consisting of alanine, glycine and serine;
X6 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid.
In accordance with one embodiment an insulin analog is provided wherein the A chain of the insulin peptide comprises the sequence GIVEQCCX8X9ICSLYQLENYCX21- R5 (SEQ ID NO: 73) or GIVEQCCX8SICSLYQLXi7NYCX2i (SEQ ID NO: 32) and the B chain comprising the sequence R62-X25LCGX29X3oLVX33X34LYLVCGX4iX42GFX45YT-Zi- Bi (SEQ ID NO: 142), wherein
X8 is selected from the group consisting of threonine and histidine;
X9 is serine, lysine, or alanine;
X17 is glutamine or glutamic acid;
X21 is asparagine or glycine; X25 is histidine or threonine;
X29 is selected from the group consisting of alanine, glycine and serine;
X30 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
X33 is selected from the group consisting of aspartic acid and glutamic acid;
X34 is selected from the group consisting of alanine and threonine;
X4i is selected from the group consisting of glutamic acid, aspartic acid or asparagine;
X42 is selected from the group consisting of alanine, ornithine, lysine and arginine; X45 is tyrosine or phenylalanine;
R62 is selected from the group consisting of FVNQ (SEQ ID NO: 12), a tripeptide valine-asparagine-glutamine, a dipeptide asparagine-glutamine, glutamine and an N-terminal amine
Zi is a dipeptide selected from the group consisting of aspartate-lysine, lysine- proline, and proline-lysine; and
Bi is selected from the group consisting of threonine, alanine or a threonine-arginine- arginine tripeptide.
In accordance with one embodiment an insulin analog is provided wherein the A chain of the insulin peptide comprises the sequence GIVEQCCX8SICSLYQLX17NX19CX21 (SEQ ID NO: 32) and the B chain comprising the sequence
X25LCGX29X3oLVEALYLVCGERGFF (SEQ ID NO: 33) wherein
X8 is selected from the group consisting of threonine and histidine;
Xi7 is glutamic acid or glutamine;
X19 is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;
X21 is asparagine or glycine;
X25 is selected from the group consisting of histidine and threonine;
X29 is selected from the group consisting of alanine, glycine and serine;
X3o is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid. In a further embodiment the B chain comprises the sequence X22VNQX25LCGX29X3oLVEALYLVCGERGFFYT-Zi-B 1 (SEQ ID NO: 34) wherein
X22 is selected from the group consisting of phenylalanine and desamino- phenylalanine; X25 is selected from the group consisting of histidine and threonine;
X29 is selected from the group consisting of alanine, glycine and serine;
X30 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
Zi is a dipeptide selected from the group consisting of aspartate-lysine, lysine- proline, and proline-lysine; and
Bi is selected from the group consisting of threonine, alanine or a threonine-arginine- arginine tripeptide.
In accordance with some embodiments the A chain comprises the sequence
GrVEQCCX8SICSLYQLX17NXi9CX23 (SEQ ID NO: 32) or
GIVDECCX8X9SCDLXi4Xi5LXi7Xi8 X19CX21-R53 (SEQ ID NO: 35), and the B chain comprises the sequence X25LCGX29X3oLVX33X34LYLVCGDX42GFX45 (SEQ ID NO: 36) wherein
X8 is histidine or phenylalanine;
X9 and X14 are independently selected from arginine, lysine, ornithine or alanine;
Xi5 is arginine, lysine, ornithine or leucine;
X17 is glutamic acid or glutamine;
Xi8 is methionine, asparagine or threonine;
X19 is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;
X21 is alanine, glycine or asparagine;
X23 is asparagine or glycine;
X25 is selected from the group consisting of histidine and threonine;
X29 is selected from the group consisting of alanine, glycine and serine;
X30 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
X33 is selected from the group consisting of aspartic acid and glutamic acid;
X34 is selected from the group consisting of alanine and threonine;
X42 is selected from the group consisting of alanine, lysine, ornithine and arginine;
X45 is tyrosine; and
R53 is COOH or CONH2.
In a further embodiment the A chain comprises the sequence
GIVDECCX8X9SCDLXi4Xi5LXi7Xi8 X19CX21-R53 (SEQ ID NO: 35), and the B chain comprises the sequence X25LCGX29X3oLVX33X34LYLVCGDX42GFX45 (SEQ ID NO: 36) wherein
X8 is histidine;
X9 and X14 are independently selected from arginine, lysine, ornithine or alanine; Xi5 is arginine, lysine, ornithine or leucine;
X17 is glutamic acid, aspartic acid, asparagine, lysine, ornithine or glutamine;
Xi8 is methionine, asparagine or threonine;
X19 is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;
X2i is alanine, glycine or asparagine;
X23 is asparagine or glycine;
X25 is selected from the group consisting of histidine and threonine;
X29 is selected from the group consisting of alanine, glycine and serine;
X30 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
X33 is selected from the group consisting of aspartic acid and glutamic acid;
X34 is selected from the group consisting of alanine and threonine;
X42 is selected from the group consisting of alanine, lysine, ornithine and arginine;
X45 is tyrosine or phenylalanine and
R53 is COOH or CONH2. In a further embodiment the A chain comprises the sequence GIVDECCHX9SCDLXi4Xi5LXi7MXi9CX2i-R53 (SEQ ID NO: 37), and the B chain comprises the sequence X25LCGAX3oLVDALYLVCGDX42GFX45 (SEQ ID NO: 38) wherein
X9, X14 and Xi5 are independently ornithine, lysine or arginine;
Xi7 is glutamic acid or glutamine;
X19 is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;
X2i is alanine, glycine or asparagine;
X25 is selected from the group consisting of histidine and threonine;
X30 is selected from the group consisting of histidine, aspartic acid and glutamic acid; X42 is selected from the group consisting of alanine, lysine, ornithine and arginine; X45 is tyrosine or phenylalanine and
R53 is COOH or CONH2. In one embodiment the B chain is selected from the group consisting of HLCGAELVDALYLVCGDX42GFY (SEQ ID NO: 39),
GPEHLCGAELVDALYLVCGDX42GFY (SEQ ID NO: 40), GPEHLCGAELVDALYLVCGDX42GFYFNPKT (SEQ ID NO: 41) and GPEHLCGAELVDALYLVCGDX42GFYFNKPT (SEQ ID NO: 42), wherein X42 is selected from the group consisting of ornithine, lysine and arginine. In a further embodiment the A chain comprises the sequence GIVDECCHX9SCDLXi4Xi5LQMYCN-R53 (SEQ ID NO: 43), wherein X9, X14 and X15 are independently ornithine, lysine or arginine.
In another embodiment the A chain comprises the sequence
GIVDECCX8RSCDLYQLENXi9CN-R53 (SEQ ID NO: 44) and the B chain comprises the sequence R62-X25LCGSHLVDALYLVCGDX42GFX45 (SEQ ID NO: 45)
wherein
X8 is threonine, histidine or phenylalanine;
X19 is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;
X25 is histidine or threonine;
X42 is alanine, ornithine or arginine;
X45 is tyrosine histidine, asparagine or phenylalanine;
R62 is selected from the group consisting of AYRPSE (SEQ ID NO: 14), FVNQ
(SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptide glycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine, a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine, glutamine, glutamic acid and a bond; and R53 is COOH or CONH2. and
R53 is COOH or CONH2. In a further embodiment X19 is Tyr.
In another embodiment the A chain comprises the sequence
GIVEQCCHSICSLYQLENXi9CX2i-R53 (SEQ ID NO: 46) or
GIVDECCHRSCDLRRLEMXi9CX2i-R53 (SEQ ID NO: 47); and the B chain comprises the sequence FVNQHLCGS HLVE ALYLVCGERGFFYTPKT (SEQ ID NO: 2), or
GPETLCGAELVDALYLVCGDRGFYFNPKT (SEQ ID NO: 48)
wherein
X19 is tyrosine, 4-methoxy phenylalanine or 4-amino-phenylalanine; and
X2i is alanine, glycine or asparagine, optionally wherein X19 is tyrosine, and X21 is alanine or asparagine.
In another embodiment, the A chain comprises the sequence
GIVEQCCHSICSLYQLENYCX2i-R53 (SEQ ID NO: 160) and the B chain comprises the sequence FVKQX25LCGSHLVEALYLVCGERGFF-R63 (SEQ ID NO: 147), or
FVNQX25LCGSHLVEALYLVCGERGFF-R63 (SEQ ID NO: 148), wherein X21 is alanine, glycine or asparagine; and
X25 is selected from the group consisting of histidine and threonine;
X28 is proline, aspartic acid or glutamic acid; and
R63 is selected from the group consisting of YTX28KT (SEQ ID NO: 149), YTKPT (SEQ ID NO: 150), YTX28K (SEQ ID NO: 152), YTKP (SEQ ID NO: 151), YTPK (SEQ ID NO: 70), YTX2S, YT, Y and a bond. In one embodiment the B chain comprises the sequence
F VKQX25LCGS HLVE ALYLVC GERGFF YTEKT (SEQ ID NO: 162),
FVNQX25LCGSHLVEALYLVCGERGFFYTDKT (SEQ ID NO: 164),
FVNQX25LCGS HLVE ALYLVC GERGFF YTKPT (SEQ ID NO: 165) or
FVNQX25LCGS HLVE ALYLVC GERGFF YTPKT (SEQ ID NO: 161) wherein
X25 is selected from the group consisting of histidine and threonine.
Single Chain Insulin Peptide Agonists
As disclosed herein linking moieties can be used to link human insulin A and B chains, or analogs or derivatives thereof, wherein the carboxy terminus of the B25 amino acid of the B chain is directly linked to a first end of a linking moiety, wherein the second end of the linking moiety is directly linked to the amino terminus of the Al amino acid of the A chain via the intervening linking moiety.
In accordance with one embodiment the insulin peptide is a single chain insulin agonist that comprises the general structure B-LM-A wherein B represents an insulin B chain, A represents an insulin A chain, and LM represents a linking moiety linking the carboxy terminus of the B chain to the amino terminus of the A chain. Suitable linking moieties for joining the B chain to the A chain are disclosed herein under the header Linking Moieties for Single Chain-Insulin Analogs and the respective subheaders "Peptide linkers". In one embodiment the linking moiety comprises a linking peptide, and more particularly, in one embodiment the peptide represents an analog of the IGF-1 C peptide. Additional exemplary peptide linkers include but are not limited to the sequence XsiX52GSSSX57X58 (SEQ ID NO: 49) or X51X52GSSSX57X58APQT (SEQ ID NO: 50) wherein X51 is selected from the group consisting of glycine, alanine, valine, leucine, isoleucine and proline, X52 is alanine, valine, leucine, isoleucine or proline and X57 or Xs8 are independently arginine, lysine, cysteine, homocysteine, acetyl-phenylalanine or ornithine, optionally with a hydrophilic moiety linked to the side chain of the amino acid at position 7 or 8 of the linking moiety (i.e., at the X57 or Xs8 position). Amino acid positions of the linking moiety are designated based on the corresponding position in the native C chain of IGF 1 (SEQ ID NO: 17). In another embodiment the peptide linking moiety comprises a 29 contiguous amino acid sequence having greater than 70%, 80%, 90% sequence identity to
S S S S X50 APPPS LPS PS RLPGPS DTPILPQX51 (SEQ ID NO: 68), wherein X50 and X51 are independently selected from arginine and lysine. In one embodiment the linking moiety is a non-peptide linker comprising a relatively short bifunctional non-peptide polymer linker that approximates the length of an 8-16 amino acid sequence. In one embodiment the non- peptide linker has the structure:
Figure imgf000059_0001
wherein m is an integer ranging from
10 to 14 and the linking moiety is linked directly to the B25 amino acid of the B chain. In accordance with one embodiment the non-peptide linking moiety is a polyethylene glycol linker of approximately 4 to 20, 8 to 18, 8 to 16, 8 to 14, 8 to 12, 10 to 14, 10 to 12 or 11 to 13 monomers.
In one embodiment an NHR ligand-insulin conjugate is provided that comprises an insulin peptide having the structure: IB-LM-IA, wherein IB comprises the sequence R62- X25LCGX29X3oLVX33X34LYLVCGX4iX42GFX45 (SEQ ID NO: 20), LM is a linking moiety as disclosed herein that covalently links IB to IA, and IA comprises the sequence
GIVX4X5CCX8X9XioCXi2LXi4Xi5LXi7Xi8Xi9CX2i-R53 (SEQ ID NO: 29), wherein
X4 is glutamic acid or aspartic acid;
X5 is glutamine or glutamic acid;
X8 is histidine or phenylalanine;
X9 and X14 are independently selected from arginine, lysine, ornithine or alanine;
X10 is isoleucine or serine;
X12 is serine or aspartic acid;
X14 is tyrosine, arginine, lysine, ornithine or alanine;
Xi5 is arginine, lysine, ornithine or leucine;
X17 is glutamic acid or glutamine;
X18 is methionine, asparagine or threonine;
X19 is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;
X21 is alanine, glycine or asparagine;
X25 is selected from the group consisting of histidine and threonine;
X29 is selected from the group consisting of alanine, glycine and serine; X30 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
X33 is selected from the group consisting of aspartic acid and glutamic acid;
X34 is selected from the group consisting of alanine and threonine;
X4i is selected from the group consisting of glutamic acid, aspartic acid or asparagine;
X42 is selected from the group consisting of alanine, lysine, ornithine and arginine;
R62 is selected from the group consisting of AYRPSE (SEQ ID NO: 14), FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptide glycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine, a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine, glutamine, glutamic acid and an N-terminal amine; and
R53 is COOH or CONH2, further wherein the amino acid at the designation X45 is directly bound to the linking moiety, LM (i.e., the designation IB-LM-IA as used herein is intended to represent that the B chain carboxyl terminus and the amino terminus of the A chain are directly linked to the linking moiety LM without any further intervening amino acids).
In one embodiment the linking moiety (LM) comprises an amino acid sequence of no more than 17 amino acids in length. In one embodiment the linking moiety comprises the sequence X51X52GSSSX57X58 (SEQ ID NO: 49) or X51X52GSSSX57X58APQT (SEQ ID NO: 50) wherein X51 is selected from the group consisting of glycine, alanine, valine, leucine, isoleucine and proline, Xs2 is alanine, valine, leucine, isoleucine or proline and X57 or X58 are independently arginine, lysine, cysteine, homocysteine, acetyl-phenylalanine or ornithine, optionally with a hydrophilic moiety linked to the side chain of the amino acid at position 7 or 8 of the linking moiety (i.e., at the X57 or X58 position). Amino acid positions of the linking moiety are designated based on the corresponding position in the native C chain of IGF 1 (SEQ ID NO: 17). In one embodiment LM is GAGS S S RR APQT (SEQ ID NO: 23) or GAGSSSRR (SEQ ID NO: 22).
In another embodiment the linking moiety comprises a 29 contiguous amino acid sequence, directly linked to the carboxy terminal amino acid of the B chain, wherein said 29 contiguous amino acid sequence has greater than 70%, 80%, 90% sequence identity to
S S S S X50 APPPS LPS PS RLPGPS DTPILPQX51 (SEQ ID NO: 68), wherein X50 and X51 are independently selected from arginine and lysine. In one embodiment the linking peptide comprises a total of 29 to 158 or 29 to 58 amino acids and comprises the sequence of SEQ ID NO: 68. In another embodiment the linking moiety comprises a 29 contiguous amino acid sequence, directly linked to the carboxy terminal amino acid of the B chain, wherein said 29 contiguous amino acid sequence has greater than 90% sequence identity to
S S S S X50 APPPS LPS PS RLPGPS DTPILPQX51 (SEQ ID NO: 68), wherein X50 and X51 are independently selected from arginine and lysine. In one embodiment the linking moiety comprises the sequence SSSSRAPPPSLPSPSRLPGPSDTPILPQK (SEQ ID NO: 51) or SSSSKAPPPSLPSPSRLPGPSDTPILPQR (SEQ ID NO: 52) optionally with one or two amino acid substitutions.
In accordance with one embodiment a single chain insulin agonist polypeptide is provided comprising a B chain and A chain of human insulin, or analogs or derivative thereof, wherein the last five carboxy amino acids of the native B chain are deleted (i.e., B26-B30), and amino acid B25 is linked to amino acid Al of the A chain via an intervening linking moiety. In one embodiment the linking moiety comprises the structure:
Figure imgf000061_0001
wherein m is an integer ranging from 10 to 14 and the linking moiety is linked directly to the B25 amino acid of the B chain.
In one embodiment an NHR ligand-insulin conjugate is provided comprising an insulin peptide having the general formula IB-LM-IA wherein IB comprises the sequence GPEHLCGAX3oLVDALYLVCGDX42GFYFNX48X49 (SEQ ID NO: 163);
LM comprises the sequence SSSSRAPPPSLPSPSRLPGPSDTPILPQK (SEQ ID NO: 51), SSSSKAPPPSLPSPSRLPGPSDTPILPQR (SEQ ID NO: 52), GYGSSSRR (SEQ ID NO: 18), GAGS S S RRAPQT (SEQ ID NO: 23) or GAGSSSRR (SEQ ID NO: 22); and
IA comprises the sequence GIVDECCXsXgSCDLX^XisLXnXisXigCXii-Rss (SEQ ID NO: 35) wherein
X8 is histidine or phenylalanine;
X9 is arginine, ornithine or alanine;
X14 and Xi5 are both arginine;
X17 is glutamic acid;
Xis is methionine, asparagine or threonine
X19 is tyrosine, 4-methoxy-phenylalanine or 4-amino phenylalanine;
X21 is alanine or asparagine;
histidine or threonine; X30 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
X42 is selected from the group consisting of alanine, ornithine and arginine;
X48 is lysine or aspartic acid;
X49 is proline, ornithine or arginine; and
R53 is COOH.
Linking Moieties for Single Chain Insulin Analogs
Peptide linkers
In accordance with one embodiment the linking moiety is a peptide or peptidomimetic of 6-18, 8-18, 8-17, 8-12, 8-10, 13-17 or 13-15 amino acids (or amino acid analogs or derivatives thereof). In one embodiment the linking moiety is 8 to 17 amino acids in length and comprises the sequence X51X52GSSSRR (SEQ ID NO: 53) wherein X51 is selected from the group consisting of glycine, alanine, valine, leucine, isoleucine, proline and methionine, and X52 is a non-aromatic amino acid, including for example, alanine. In one embodiment the linking moiety is 8 to 17 amino acids in length and comprises a sequence that differs from X51X52GSSSRR (SEQ ID NO: 53) by a single amino acid substitution wherein the amino acid substitution is an amino acid that is pegylated at its side chain, further wherein X51 is selected from the group consisting of glycine, alanine, valine, leucine, isoleucine, proline and methionine, and X52 is a non-aromatic amino acid, including for example, alanine.
In accordance with one embodiment the linking moiety is a derivative of the IGF 1 C chain sequence (GYGSSSRRAPQT; SEQ ID NO: 17). In one embodiment the derivative is a peptide that differs from SEQ ID NO: 17 by a single amino acid substitution of a lysine, cysteine ornithine, homocysteine, or acetyl-phenylalanine residue, and in a further embodiment the lysine, cysteine ornithine, homocysteine, or acetyl-phenylalanine amino acid is pegylated. In one further embodiment the linking moiety is a peptide that differs from SEQ ID NO: 17 by a single lysine substitution. In one specific embodiment the substitution is made at position 8 of SEQ ID NO: 17. Applicants have discovered that use of the IGF 1 C chain sequence and analogs thereof as a linking moiety will generate a single chain insulin polypeptide that has near wild type insulin activity. Furthermore, use of a IGF 1 C chain sequence analog as the linking moiety, wherein position 2 of the IGF 1 C chain sequence is modified, or the carboxy terminal four amino acids are deleted from the IGF 1 C chain sequence, produces a single chain insulin polypeptide that is selective for insulin (i.e., has a higher binding and/or activity at the insulin receptor compared to the IGF-1 receptor). In one embodiment the single chain insulin polypeptide has 5x, lOx, 20x, 30x, 40x, or 50x higher affinity or activity at the insulin receptor relative to the IGF- 1 receptor.
In accordance with one embodiment the linking moiety is a derivative of the IGF 1 C chain sequence (GYGSSSRRAPQT; SEQ ID NO: 17) and comprises a non-native sequence that differs from GYGSSSRR (SEQ ID NO: 18) or GAGSSSRRAPQT (SEQ ID NO: 23) by 1 to 3 amino acid substitutions, or 1 to 2 amino acid substitutions. In one embodiment at least one of the amino acid substitutions is a lysine or cysteine substitution, and in one embodiment the amino acid substitutions are conservative amino acid substitutions. In one embodiment the linking moiety is a peptide (or peptidomimetic) of 8 to 17 amino acids comprising a non-native amino acid sequence that differs from GYGSSSRR (SEQ ID NO: 18) or GAGSSSRRAPQT (SEQ ID NO: 23) by 1 amino acid substitution, including for example substitution with a lysine or cysteine. In one embodiment the linking moiety comprises the sequence GYGSSSRR (SEQ ID NO: 18) or GAGSSSRRAPQT (SEQ ID NO: 23). In one embodiment the linking moiety comprises the sequence GAGSSSRXsgAPQT (SEQ ID NO: 54), GYGSSSX57X58APQT (SEQ ID NO: 69), or an amino acid that differs from SEQ ID NO: 54 by a single amino acid substitution, wherein Xs7 is arginine and Xs8 is arginine, ornithine or lysine, and in a further embodiment a polyethylene glycol chain is linked to the side chain of the amino acid at position 8 of said linking moiety. In another embodiment the linking moiety comprises the sequence GX52GSSSRX58APQT (SEQ ID NO: 55), wherein Xs2 is any non-aromatic amino acid, including for example, alanine, valine, leucine, isoleucine or proline, and Xs8 represents an amino acid that has a
polyethylene chain covalently linked to its side chain. In one embodiment Xs8 is a pegylated lysine.
In another embodiment, the linking moiety is an 8 to 17 amino acid sequence comprising the sequence GXs2GSSSRR (SEQ ID NO: 56), wherein Xs2 is any amino acid, a peptidomimetic of SEQ ID NO: 31, or an analog thereof that differs from SEQ ID NO: 31 by a single amino acid substitution at any of positions 1, 3, 4, 5, 6, 7 or 8 of SEQ ID NO: 31, with the proviso that when the linking peptide is longer than 8 amino acids Xs2 is other than tyrosine. In accordance with one embodiment the linking moiety comprises an 8-17 amino acid sequence selected from the group consisting of GYGSSSRR (SEQ ID NO: 18), GAGSSSRR (SEQ ID NO: 22), GAGSSSRRA (SEQ ID NO: 57), GAGSSSRRAP (SEQ ID NO: 58), GAGSSSRRAPQ (SEQ ID NO: 59), GAGS S S RRAPQT (SEQ ID NO: 23), PYGSSSRR (SEQ ID NO: 61), PAGSSSRR (SEQ ID NO: 62), PAGSSSRRA (SEQ ID NO: 63), PAGSSSRRAP (SEQ ID NO: 64), PAGSSSRRAPQ (SEQ ID NO: 65),
P AGS S S RRAPQT (SEQ ID NO: 66). In accordance with one embodiment the linking moiety comprises an amino acid sequence that differs from GYGSSSRR (SEQ ID NO: 18), GAGSSSRR (SEQ ID NO: 22), GAGSSSRRA (SEQ ID NO: 57), GAGSSSRRAP (SEQ ID NO: 58), GAGSSSRRAPQ (SEQ ID NO: 59), GAGS S S RRAPQT (SEQ ID NO: 23), PYGSSSRR (SEQ ID NO: 61), PAGSSSRR (SEQ ID NO: 62), PAGSSSRRA (SEQ ID NO: 63), PAGSSSRRAP (SEQ ID NO: 64), PAGSSSRRAPQ (SEQ ID NO: 65),
P AGS S S RRAPQT (SEQ ID NO: 66) by a single pegylated amino acid including for example a pegylated lysine or pegylated cysteine amino acid substitution. In one
embodiment the pegylated amino acid is at position 8 of the linking moiety.
In one embodiment a peptide sequence named C-terminal peptide (CTP:
SSSSKAPPPSLPSPSRLPGPSDTPILPQR; SEQ ID NO: 52), which is prone to O-linked hyperglycosylation when the protein is expressed in a eukaryotic cellular expression system, can be used as a linker peptide. Surprisingly, applicants have discovered that the CTP peptide can be used to connect the B and A chains of insulin to form a single chain insulin analog while still maintaining high in vitro potency in a manner that the native proinsulin C- peptide can not. In one embodiment a NHR ligand-insulin conjugate is prepared comprising an insulin peptide having the carboxy terminus of the B chain linked to the amino terminus of the A chain via a CTP peptide. In another embodiment an insulin analog is provided as a two-chain construct with the CTP covalently linked to the C-terminus of the B-chain and/or the amino terminus of the B chain. In vitro and in vivo characterization reveals the CTP modified insulin analogs to have high potency in the absence of glycosylation, thus providing a mechanism to extend insulin action that is based on glycosylation, a natural approach to longer duration proteins.
Applicants have discovered that the primary sequence of the CTP peptide does not appear to be critical. Accordingly, in one embodiment the linking moiety comprises a peptide having a length of at least 18 amino acids that shares a similar amino acid content. In one embodiment the linking moiety comprises an analog of (SEQ ID NO: 68), wherein said analog differs from (SEQ ID NO: 68) by 1, 2, 3, 4, 5 or 6 amino acid substitutions. In one embodiment the linking peptide comprises a CTP peptide wherein amino acid substitutions are made at one or more positions selected from positions 1, 2, 3, 4, 10, 13, 15, and 21 of (SEQ ID NO: 68). In one embodiment the linking moiety comprises a 29 contiguous amino acid sequence, directly linked to the carboxy terminal amino acid of the B chain, wherein said 29 contiguous amino acid sequence has greater than 60, 80 or 90% sequence identity to SSSSX50APPPSLPSPSRLPGPSDTPILPQX5i (SEQ ID NO: 68), with the proviso that the sequence does not comprise a 15 amino acid sequence identical to a 15 amino acid sequence contained within SEQ ID NO 53. In another embodiment the linking moiety comprises a 29 contiguous amino acid sequence, directly linked to the carboxy terminal amino acid of the B chain, wherein at least 58% of the amino acids comprising the 29 contiguous amino acid sequence are selected from the group consisting of serine and proline.
In another embodiment the linking moiety comprises a 29 contiguous amino acid sequence, directly linked to the carboxy terminal amino acid of the B chain, wherein said 29 contiguous amino acid sequence has greater than 70%, 80%, 90% sequence identity to S S S S X50 APPPS LPS PS RLPGPS DTPILPQX51 (SEQ ID NO: 68), wherein X50 and X51 are independently selected from arginine and lysine, with the proviso that the sequence does not comprise a 15 amino acid sequence identical to a 15 amino acid sequence contained within SEQ ID NO 53. In another embodiment the linking moiety comprises a 29 contiguous amino acid sequence, directly linked to the carboxy terminal amino acid of the B chain, wherein said 29 contiguous amino acid sequence is an analog of (SEQ ID NO: 52), wherein said analog differs from (SEQ ID NO: 52) only by 1, 2, 3, 4, 5 or 6 amino acid modification, and in a further embodiment the amino acid modifications are conservative amino acid substitutions. In another embodiment the linking moiety comprises a 29 contiguous amino acid sequence, directly linked to the carboxy terminal amino acid of the B chain, wherein said 29 contiguous amino acid sequence is an analog of (SEQ ID NO: 52), wherein said analog differs from (SEQ ID NO: 52) only by 1, 2 or 3 amino acid substitutions.
Applicants have also found that multiple copies of the CTP peptide can be used as the linking peptide in single chain analogs and/or linked to the amino terminus of the B chain in single chain or two chain insulin analogs. The multiple copies of the CTP peptide can be identical or can differ in sequence and can be arranged in a head to tail or head to head orientation. In accordance with one embodiment an insulin analog is provided comprising a CTP peptide having the sequence
(SSSSX5oAPPPSLPSPSRLPGPSDTPILPQX5i)n (SEQ ID NO: 68), wherein n is an integer selected from the group consisting of 1, 2, 3 and 4 and X50 and X51 are independently selected from arginine and lysine.
In one embodiment the CTP peptide comprises the sequence
S S S S X50 APPPS LPS PS RLPGPS DTPILPQX51 (SEQ ID NO: 68), wherein X50 and X51 are independently selected from arginine and lysine. In another embodiment the CTP peptide comprises a sequence selected from the group consisting of
SSSSRAPPPSLPSPSRLPGPSDTPILPQK (SEQ ID NO: 51),
SSSSKAPPPSLPSPSRLPGPSDTPILPQR (SEQ ID NO: 52) or
SSSSRAPPPSLPSPSRLPGPSDTPILPQ (SEQ ID NO: 67), and in a further embodiment the CTP peptide comprises the sequence SSSSRAPPPSLPSPSRLPGPSDTPILPQK (SEQ ID NO: 51).
Structure of L
In some embodiments, L is a bond. In these embodiments, Q and Y are conjugated together by reacting a nucleophilic reactive moiety on Q with and electrophilic reactive moiety on Y. In alternative embodiments, Q and Y are conjugated together by reacting an electrophilic reactive moiety on Q with a nucleophilic moiety on Y. In exemplary embodiments, L is an amide bond that forms upon reaction of an amine on Q (e.g. an ε- amine of a lysine residue) with a carboxyl group on Y. In alternative embodiments, Q and or Y are derivatized with a derivatizing agent before conjugation.
In some embodiments, L is a linking group. In some embodiments, L is a bifunctional linker and comprises only two reactive groups before conjugation to Q and Y. In embodiments where both Q and Y have electrophilic reactive groups, L comprises two of the same or two different nucleophilic groups (e.g. amine, hydroxyl, thiol) before conjugation to Q and Y. In embodiments where both Q and Y have nucleophilic reactive groups, L comprises two of the same or two different electrophilic groups (e.g. carboxyl group, activated form of a carboxyl group, compound with a leaving group) before conjugation to Q and Y. In embodiments where one of Q or Y has a nucleophilic reactive group and the other of Q or Y has an electrophilic reactive group, L comprises one nucleophilic reactive group and one electrophilic group before conjugation to Q and Y.
L can be any molecule with at least two reactive groups (before conjugation to Q and Y) capable of reacting with each of Q and Y. In some embodiments L has only two reactive groups and is bifunctional. L (before conjugation to the peptides) can be represented by Formula VI:
W- Linking Group (L) wherein W and J are independently nucleophilic or electrophilic reactive groups. In some embodiments W and J are either both nucleophilic groups or both electrophilic groups. In some embodiments one of W or J is a nucleophilic group and the other of W or J is an electrophilic group.
In some embodiments, L comprises a chain of atoms from 1 to about 60, or 1 to 30 atoms or longer, 2 to 5 atoms, 2 to 10 atoms, 5 to 10 atoms, or 10 to 20 atoms long. In some embodiments, the chain atoms are all carbon atoms. In some embodiments, the chain atoms in the backbone of the linker are selected from the group consisting of C, O, N, and S. Chain atoms and linkers may be selected according to their expected solubility (hydrophilicity) so as to provide a more soluble conjugate. In some embodiments, L provides a functional group that is subject to cleavage by an enzyme or other catalyst or hydrolytic conditions found in the target tissue or organ or cell. In some embodiments, the length of L is long enough to reduce the potential for steric hindrance.
In some embodiments, the linking group is hydrophilic such as, for example, polyalkylene glycol. Before conjugation to the peptides of the composition, the hydrophilic linking group comprises at least two reactive groups (W and J), as described herein and as shown below:
W- Hydrophilic Linking Group
In specific embodiments, the linking group is polyethylene glycol (PEG). The PEG in certain embodiments has a molecular weight of about 100 Daltons to about 10,000 Daltons, e.g. about 500 Daltons to about 5000 Daltons. The PEG in some embodiments has a molecular weight of about 10,000 Daltons to about 40,000 Daltons.
In some embodiments, the hydrophilic linking group comprises either a maleimido or an iodoacetyl group and either a carboxylic acid or an activated carboxylic acid (e.g. NHS ester) as the reactive groups. In these embodiments, the maleimido or iodoacetyl group can be coupled to a thiol moiety on Q or Y and the carboxylic acid or activated carboxylic acid can be coupled to an amine on Q or Y with or without the use of a coupling reagent. Any appropriate coupling agent known to one skilled in the art can be used to couple the carboxylic acid with the amine. In some embodiments, the linking group is maleimido- PEG(20 kDa)-COOH, iodoacetyl-PEG(20 kDa)-COOH, maleimido-PEG(20 kDa)-NHS, or iodoacetyl-PEG(20 kDa)-NHS.
In some embodiments, the linking group is comprised of an amino acid, a dipeptide, a tripeptide, or a polypeptide, wherein the amino acid, dipeptide, tripeptide, or polypeptide comprises at least two activating groups, as described herein. In some embodiments, the linking group (L) comprises a moiety selected from the group consisting of: amino, ether, thioether, maleimido, disulfide, amide, ester, thioester, alkene, cycloalkene, alkyne, trizoyl, carbamate, carbonate, cathepsin B-cleavable, and hydrazone. In some embodiments, the linking group is an amino acid selected from the group Asp, Glu, homoglutamic acid, homocysteic acid, cysteic acid, gamma-glutamic acid. In some embodiments, the linking group is a dipeptide selected from the group consisting of: Ala- Ala, β-Ala- β-Ala, Leu-Leu, Pro-Pro, γ-aminobutyric acid- γ-aminobutyric acid, and γ-Glu- γ-Glu. In one embodiment L comprises gamma-glutamic acid.
In embodiments where Q and Y are conjugated together by reacting a carboxylic acid with an amine, an activating agent can be used to form an activated ester of the carboxylic acid. The activated ester of the carboxylic acid can be, for example, N-hydroxysuccinimide (NHS), tosylate (Tos), mesylate, triflate, a carbodiimide, or a hexafluorophosphate. In some embodiments, the carbodiimide is 1,3-dicyclohexylcarbodiimide (DCC), 1,1'- carbonyldiimidazole (CDI), l-ethyl-3-(3-dimethylaminopropyl)carbodiimide hydrochloride (EDC), or 1,3-diisopropylcarbodiimide (DICD). In some embodiments, the
hexafluorophosphate is selected from a group consisting of hexafluorophosphate benzotriazol- l-yl-oxy-tris(dimethylamino)phosphonium hexafluorophosphate (BOP), benzotriazol-l-yl- oxytripyrrolidinophosphonium hexafluorophosphate (PyBOP), 2-(lH-7- azabenzotriazol-l-yl)-l,l,3,3-tetramethyl uranium hexafluorophosphate (HATU), and o- benzotriazole-N,N,N',N'-tetramethyl-uronium-hexafluoro-phosphate (HBTU).
In some embodiments, Q comprises a nucleophilic reactive group (e.g. the amino group, thiol group, or hydroxyl group of the side chain of lysine, cysteine or serine) that is capable of conjugating to an electrophilic reactive group on Y or L. In some embodiments, Q comprises an electrophilic reactive group (e.g. the carboxylate group of the side chain of Asp or Glu) that is capable of conjugating to a nucleophilic reactive group on Y or L. In some embodiments, Q is chemically modified to comprise a reactive group that is capable of conjugating directly to Y or to L. In some embodiments, Q is modified at the C-terminal to comprise a natural or nonnatural amino acid with a nucleophilic side chain, such as an amino acid represented by Formula I, Formula II, or Formula III, as previously described herein (see Acylation and alkylatiori). In exemplary embodiments, the C-terminal amino acid of Q is selected from the group consisting of lysine, ornithine, serine, cysteine, and homocysteine. For example, the C-terminal amino acid of Q can be modified to comprise a lysine residue. In some embodiments, Q is modified at the C-terminal amino acid to comprise a natural or nonnatural amino acid with an electrophilic side chain such as, for example, Asp and Glu. In some embodiments, an internal amino acid of Q is substituted with a natural or nonnatural amino acid having a nucleophilic side chain, such as an amino acid represented by Formula I, Formula II, or Formula III, as previously described herein (see Acylation and alkylation). In exemplary embodiments, the internal amino acid of Q that is substituted is selected from the group consisting of lysine, ornithine, serine, cysteine, and homocysteine. For example, an internal amino acid of Q can be substituted with a lysine residue. In some embodiments, an internal amino acid of Q is substituted with a natural or nonnatural amino acid with an electrophilic side chain, such as, for example, Asp and Glu.
In some embodiments, Y comprises a reactive group that is capable of conjugating directly to Q or to L. In some embodiments, Y comprises a nucleophilic reactive group (e.g. amine, thiol, hydroxyl) that is capable of conjugating to an electrophilic reactive group on Q or L. In some embodiments, Y comprises electrophilic reactive group (e.g. carboxyl group, activated form of a carboxyl group, compound with a leaving group) that is capable of conjugating to a nucleophilic reactive group on Q or L.
Stability of L in vivo
In some embodiments, L is stable in vivo. In some embodiments, L is stable in blood serum for at least 5 minutes, e.g. less than 25%, 20%, 15%, 10% or 5% of the conjugate is cleaved when incubated in serum for a period of 5 minutes. In other embodiments, L is stable in blood serum for at least 10, or 20, or 25, or 30, or 60, or 90, or 120 minutes, or 3, 4, 5, 6, 7, 8, 9, 10, 12, 15, 18 or 24 hours. In these embodiments, L does not comprise a functional group that is capable of undergoing hydrolysis in vivo. In some exemplary embodiments, L is stable in blood serum for at least about 72 hours. Nonlimiting examples of functional groups that are not capable of undergoing significant hydrolysis in vivo include amides, ethers, and thioethers. For example, the following compound is not capable of undergoing significant hydrolysis in vivo:
Figure imgf000070_0001
In some embodiments, L is hydrolyzable in vivo. In these embodiments, L comprises a functional group that is capable of undergoing hydrolysis in vivo. Nonlimiting examples of functional groups that are capable of undergoing hydrolysis in vivo include esters, anhydrides, and thioesters. For example the following compound is capable of undergoing hydrolysis in vivo because it comprises an ester group:
Figure imgf000070_0002
In some exemplary embodiments L is labile and undergoes substantial hydrolysis within 3 hours in blood plasma at 37 °C, with complete hydrolysis within 6 hours. In some exemplary embodiments, L is not labile.
In some embodiments, L is metastable in vivo. In these embodiments, L comprises a functional group that is capable of being chemically or enzymatically cleaved in vivo (e.g., an acid-labile, reduction-labile, or enzyme-labile functional group), optionally over a period of time. In these embodiments, L can comprise, for example, a hydrazone moiety, a disulfide moiety, or a cathepsin-cleavable moiety. When L is metastable, and without intending to be bound by any particular theory, the Q-L-Y conjugate is stable in an extracellular environment, e.g., stable in blood serum for the time periods described above, but labile in the intracellular environment or conditions that mimic the intracellular environment, so that it cleaves upon entry into a cell. In some embodiments when L is metastable, L is stable in blood serum for at least about 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 42, or 48 hours, for example, at least about 48, 54, 60, 66, or 72 hours, or about 24-48, 48-72, 24-60, 36-48, 36-72, or 48-72 hours.
Pegylation of insulin peptides
Applicants have discovered that covalent linkage of a hydrophilic moiety to the insulin single chain analogs disclosed herein provide analogs having slower onset, extended duration and exhibit a basal profile of activity. In one embodiment, the insulin peptides disclosed herein are further modified to comprise a hydrophilic moiety covalently linked to the side chain of an amino acid at a position selected from the group consisting of A9, A14 and A15 of the A chain or at the N-terminal alpha amine of the B chain (e.g. at position B 1 for insulin based B chain or position B2 for IGF-1 based B chain) or at the side chain of an amino acid at position B 1, B2, B 10, B22, B28 or B29 of the B chain or at any position of the linking moiety that links the A chain and B chain. In exemplary embodiments, this hydrophilic moiety is covalently linked to a Lys, Cys, Orn, homocysteine, or acetyl- phenylalanine residue at any of these positions. In one embodiment the hydrophilic moiety is covalently linked to the side chain of an amino acid of the linking moiety.
Exemplary hydrophilic moieties include polyethylene glycol (PEG), for example, of a molecular weight of about 1,000 Daltons to about 40,000 Daltons, or about 20,000 Daltons to about 40,000 Daltons. Additional suitable hydrophilic moieties include, polypropylene glycol, polyoxyethylated polyols (e.g., POG), polyoxyethylated sorbitol, polyoxyethylated glucose, polyoxyethylated glycerol (POG), polyoxyalkylenes, polyethylene glycol propionaldehyde, copolymers of ethylene glycol/propylene glycol, monomethoxy- polyethylene glycol, mono-(Cl-ClO) alkoxy- or aryloxy-polyethylene glycol,
carboxymethylcellulose, polyacetals, polyvinyl alcohol (PVA), polyvinyl pyrrolidone, poly- 1, 3-dioxolane, poly-l,3,6-trioxane, ethylene/maleic anhydride copolymer, poly (beta-amino acids) (either homopolymers or random copolymers), poly(n- vinyl pyrrolidone)polyethylene glycol, propropylene glycol homopolymers (PPG) and other polyakylene oxides, polypropylene oxide/ethylene oxide copolymers, colonic acids or other polysaccharide polymers, Ficoll or dextran and mixtures thereof.
The hydrophilic moiety, e.g., polyethylene glycol chain in accordance with some embodiments has a molecular weight selected from the range of about 500 to about 40,000 Daltons. In one embodiment the hydrophilic moiety, e.g. PEG, has a molecular weight selected from the range of about 500 to about 5,000 Daltons, or about 1,000 to about 5,000 Daltons. In another embodiment the hydrophilic moiety, e.g., PEG, has a molecular weight of about 10,000 to about 20,000 Daltons. In yet other exemplary embodiment the hydrophilic moiety, e.g., PEG, has a molecular weight of about 20,000 to about 40,000 Daltons. In one embodiment the hydrophilic moiety, e.g. PEG, has a molecular weight of about 20,000 Daltons. In one embodiment an insulin peptide is provided wherein one or more amino acids of the analog are pegylated, and the combined molecular weight of the covalently linked PEG chains is about 20,000 Daltons.
In one embodiment dextrans are used as the hydrophilic moiety. Dextrans are polysaccharide polymers of glucose subunits, predominantly linked by ocl-6 linkages. Dextran is available in many molecular weight ranges, e.g., about 1 kD to about 100 kD, or from about 5, 10, 15 or 20 kD to about 20, 30, 40, 50, 60, 70, 80 or 90 kD.
Linear or branched polymers are contemplated. Resulting preparations of conjugates may be essentially monodisperse or polydisperse, and may have about 0.5, 0.7, 1, 1.2, 1.5 or 2 polymer moieties per peptide.
In one embodiment the hydrophilic moiety is a polyethylene glycol (PEG) chain, optionally linked to the side chain of an amino acid at a position selected from the group consisting of A9, A14 and A15 of the A chain, positions B l, B2, B 10, B22, B28 or B29 of the B chain, at the N-terminal alpha amine of the B chain, or at any position of the linking moiety of a single chain insulin analog that links the A chain and B chain, including for example at position C8. In one embodiment the single chain insulin analog comprises a peptide linking moiety of 8 to 12 amino acids, wherein one of the amino acids of the linking moiety has a polyethylene chain covalently bound to its side chain. In one embodiment the single chain insulin analog comprises a peptide linking moiety of 8 to 12 amino acids, wherein an amino acid of the linking moiety is pegylated and one or more amino acid at a position selected from the group consisting of A9, A14 and A15 of the A chain, positions B l, B2, B 10, B22, B28 or B29 of the B chain is also pegylated. In one embodiment the total molecular weight of the covalently linked PEG chain(s) is about 20,000 Daltons.
In one embodiment a single chain insulin analog comprises a linking moiety of 8 to 12 amino acids, wherein one of the amino acids of the linking moiety has a 20,000 Dalton polyethylene chain covalently bound to its side chain. In another embodiment an insulin analog comprises a peptide linking moiety of 8 to 12 amino acids, wherein one of the amino acids of the linking moiety has a polyethylene chain covalently bound to its side chain and a second PEG chain is linked to the N-terminal alpha amine of the B chain (e.g. at position B 1 for insulin based B chain or position B2 for IGF-1 based B chain) or at the side chain of an amino acid at position B 1, B2 and B29 of the B chain. In one embodiment when two PEG chains are linked to the insulin peptide, each PEG chain has a molecular weight of about 10,000 Daltons. In one embodiment when the PEG chain is linked to an 8 to 12 amino acid linking moiety, the PEG chain is linked at position C7 or C8 of the linking moiety and in one embodiment the PEG chain is linked at position C8 of the linking moiety. In one embodiment when two PEG chains are linked to the single chain insulin analog, with one PEG chain linked at position C8 and the second PEG is linked at A9, A14, A15, B l, B2, B 10, B22, B28 or B29. Hydrophilic moieties such as polyethylene glycol can be attached to the NHR ligand- insulin conjugate under any suitable conditions used to react a protein with an activated polymer molecule. Any means known in the art can be used, including via acylation, reductive alkylation, Michael addition, thiol alkylation or other chemoselective
conjugation/ligation methods through a reactive group on the PEG moiety (e.g., an aldehyde, amino, ester, thiol, a-haloacetyl, maleimido or hydrazino group) to a reactive group on the target compound (e.g., an aldehyde, amino, ester, thiol, a-haloacetyl, maleimido or hydrazino group). Activating groups which can be used to link the water soluble polymer to one or more proteins include without limitation sulfone, maleimide, sulfhydryl, thiol, triflate, tresylate, azidirine, oxirane and 5-pyridyl. If attached to the peptide by reductive alkylation, the polymer selected should have a single reactive aldehyde so that the degree of
polymerization is controlled. See, for example, Kinstler et al., Adv. Drug. Delivery Rev. 54: 477-485 (2002); Roberts et al., Adv. Drug Delivery Rev. 54: 459-476 (2002); and Zalipsky et al., Adv. Drug Delivery Rev. 16: 157-182 (1995).
Acylation
In some embodiments, the insulin analog is modified to comprise an acyl group. The acyl group can be covalently linked directly to an amino acid of the NHR ligand-insulin conjugate, or indirectly to an amino acid of the NHR ligand-insulin conjugate via a spacer, wherein the spacer is positioned between the amino acid of the NHR ligand-insulin conjugate and the acyl group. The NHR ligand-insulin conjugate may be acylated at the same amino acid position where a hydrophilic moiety is linked, or at a different amino acid position. For example, acylation may occur at any position including any of amino acid of the A or B chains as well as a position within the linking moiety, provided that the activity exhibited by the non-acylated NHR ligand-insulin conjugate is retained upon acylation.
Nonlimiting examples include acylation at positions A14 and A15 of the A chain, positions position B l for insulin based B chain or position B2 for IGF-1 based B chain or positions B 10, B22, B28 or B29 of the B chain or at any position of the linking moiety.
In one specific aspect of the invention, the insulin analog is modified to comprise an acyl group by direct acylation of an amine, hydroxyl, or thiol of a side chain of an amino acid of the NHR ligand-insulin conjugate. In some embodiments, the insulin analog is directly acylated through the side chain amine, hydroxyl, or thiol of an amino acid. In some embodiments, acylation is at position B28 or B29 (according to the amino acid numbering of the native insulin A and B chain sequences). In this regard, an insulin analog can be provided that has been modified by one or more amino acid substitutions in the A or B chain sequence, including for example at positions A14, A15, B l, B2, B IO, B22, B28 or B29 (according to the amino acid numbering of the native insulin A and B chain sequences) or at any position of the linking moiety with an amino acid comprising a side chain amine, hydroxyl, or thiol. In some specific embodiments of the invention, the direct acylation of the insulin peptide occurs through the side chain amine, hydroxyl, or thiol of the amino acid at position B28 or B29 (according to the amino acid numbering of the native insulin A and B chain sequences).
In accordance with one embodiment, the acylated insulin analogs comprise a spacer between the peptide and the acyl group. In some embodiments, the NHR ligand-insulin conjugate is covalently bound to the spacer, which is covalently bound to the acyl group. In some exemplary embodiments, the insulin peptide is modified to comprise an acyl group by acylation of an amine, hydroxyl, or thiol of a spacer, which spacer is attached to a side chain of an amino acid at position B28 or B29 (according to the amino acid numbering of the A or B chain of native insulin), or at any position of the spacer moiety. The amino acid of the NHR ligand-insulin conjugate to which the spacer is attached can be any amino acid comprising a moiety which permits linkage to the spacer. For example, an amino acid comprising a side chain -NH2, -OH, or -COOH (e.g., Lys, Orn, Ser, Asp, or Glu) is suitable.
In some embodiments, the spacer between the NHR ligand-insulin conjugate and the acyl group is an amino acid comprising a side chain amine, hydroxyl, or thiol (or a dipeptide or tripeptide comprising an amino acid comprising a side chain amine, hydroxyl, or thiol). In some embodiments, the spacer comprises a hydrophilic bifunctional spacer. In a specific embodiment, the spacer comprises an amino poly(alkyloxy)carboxylate. In this regard, the spacer can comprise, for example, NH2(CH2CH20)n(CH2)mCOOH, wherein m is any integer from 1 to 6 and n is any integer from 2 to 12, such as, e.g., 8-amino-3,6-dioxaoctanoic acid, which is commercially available from Peptides International, Inc. (Louisville, KY). In one embodiment, the hydrophilic bifunctional spacer comprises two or more reactive groups, e.g., an amine, a hydroxyl, a thiol, and a carboxyl group or any combinations thereof. In certain embodiments, the hydrophilic bifunctional spacer comprises a hydroxyl group and a carboxylate. In other embodiments, the hydrophilic bifunctional spacer comprises an amine group and a carboxylate. In other embodiments, the hydrophilic bifunctional spacer comprises a thiol group and a carboxylate. In some embodiments, the spacer between peptide the NHR ligand-insulin conjugate and the acyl group is a hydrophobic bifunctional spacer. Hydrophobic bifunctional spacers are known in the art. See, e.g., Bioconjugate Techniques, G. T. Hermanson (Academic Press, San Diego, CA, 1996), which is incorporated by reference in its entirety. In accordance with certain embodiments the bifunctional spacer can be a synthetic or naturally occurring amino acid comprising an amino acid backbone that is 3 to 10 atoms in length (e.g., 6-amino hexanoic acid, 5-aminovaleric acid, 7-aminoheptanoic acid, and 8- aminooctanoic acid). Alternatively, the spacer can be a dipeptide or tripeptide spacer having a peptide backbone that is 3 to 10 atoms (e.g., 6 to 10 atoms) in length. Each amino acid of the dipeptide or tripeptide spacer attached to the NHR ligand-insulin conjugate can be independently selected from the group consisting of: naturally-occurring and/or non- naturally occurring amino acids, including, for example, any of the D or L isomers of the naturally-occurring amino acids (Ala, Cys, Asp, Glu, Phe, Gly, His, He, Lys, Leu, Met, Asn, Pro, Arg, Ser, Thr, Val, Trp, Tyr), or any D or L isomers of the non-naturally occurring amino acids selected from the group consisting of: β-alanine (β -Ala), N-a-methyl- alanine (Me-Ala), aminobutyric acid (Abu), a-aminobutyric acid (γ-Abu), aminohexanoic acid (ε- Ahx), aminoisobutyric acid (Aib), aminomethylpyrrole carboxylic acid,
aminopiperidinecarboxylic acid, aminoserine (Ams), aminotetrahydropyran-4-carboxylic acid, arginine N-methoxy-N-methyl amide, β-aspartic acid (β-Asp), azetidine carboxylic acid, 3-(2-benzothiazolyl)alanine, a-tert-butylglycine, 2-amino-5-ureido-n-valeric acid (citrulline, Cit), β-Cyclohexylalanine (Cha), acetamidomethyl-cysteine, diaminobutanoic acid (Dab), diaminopropionic acid (Dpr), dihydroxyphenylalanine (DOPA),
dimethylthiazolidine (DMT A), γ-Glutamic acid (γ-Glu), homoserine (Hse), hydroxyproline (Hyp), isoleucine N-methoxy-N-methyl amide, methyl-isoleucine (MeHe), isonipecotic acid (Isn), methyl-leucine (MeLeu), methyl-lysine, dimethyl-lysine, trimethyl-lysine,
methanoproline, methionine- sulfoxide (Met(O)), methionine-sulfone (Met(02)), norleucine (Nle), methyl-norleucine (Me-Nle), norvaline (Nva), ornithine (Orn), para-aminobenzoic acid (PABA), penicillamine (Pen), methylphenylalanine (MePhe), 4-Chlorophenylalanine (Phe(4-Cl)), 4-fluorophenylalanine (Phe(4-F)), 4-nitrophenylalanine (Phe(4-N02)), 4- cyanophenylalanine ((Phe(4-CN)), phenylglycine (Phg), piperidinylalanine,
piperidinylglycine, 3,4-dehydroproline, pyrrolidinylalanine, sarcosine (Sar), selenocysteine (Sec), U-Benzyl-phosphoserine, 4-amino-3-hydroxy-6-methylheptanoic acid (Sta), 4-amino- 5-cyclohexyl-3-hydroxypentanoic acid (ACHPA), 4-amino-3-hydroxy-5-phenylpentanoic acid (AHPPA), l,2,3,4,-tetrahydro-isoquinoline-3-carboxylic acid (Tic), tetrahydropyranglycine, thienylalanine (Thi) , U-Benzyl-phosphotyrosine, O- Phosphotyrosine, methoxytyrosine, ethoxytyrosine, O-(bis-dimethylamino-phosphono)- tyrosine, tyrosine sulfate tetrabutylamine, methyl-valine (MeVal), 1 -amino- 1-cyclohexane carboxylic acid (Acx), aminovaleric acid, beta-cyclopropyl-alanine (Cpa), propargylglycine (Prg), allylglycine (Alg), 2-amino-2-cyclohexyl-propanoic acid (2-Cha), tertbutylglycine (Tbg), vinylglycine (Vg), 1 -amino- 1 -cyclopropane carboxylic acid (Acp), 1-amino-l- cyclopentane carboxylic acid (Acpe), alkylated 3-mercaptopropionic acid, 1-amino-l- cyclobutane carboxylic acid (Acb). In some embodiments the dipeptide spacer is selected from the group consisting of: Ala-Ala, β-Ala- β-Ala, Leu-Leu, Pro-Pro, γ-aminobutyric acid- γ-aminobutyric acid, and γ-Glu- γ-Glu.
The peptide the NHR ligand-insulin conjugate can be modified to comprise an acyl group by acylation of a long chain alkane of any size and can comprise any length of carbon chain. The long chain alkane can be linear or branched. In certain aspects, the long chain alkane is a C4 to C30 alkane. For example, the long chain alkane can be any of a C4 alkane, C6 alkane, C8 alkane, C10 alkane, C12 alkane, C14 alkane, C16 alkane, C18 alkane, C2o alkane, C22 alkane, C24 alkane, C¾ alkane, C28 alkane, or a C30 alkane. In some embodiments, the long chain alkane comprises a C8 to C2o alkane, e.g., a C14 alkane, C16 alkane, or a C18 alkane.
In some embodiments, an amine, hydroxyl, or thiol group of the NHR ligand-insulin conjugate is acylated with a cholesterol acid. In a specific embodiment, the peptide is linked to the cholesterol acid through an alkylated des-amino Cys spacer, i.e., an alkylated 3- mercaptopropionic acid spacer. Suitable methods of peptide acylation via amines, hydroxyls, and thiols are known in the art. See, for example, Miller, Biochem Biophys Res Commun 218: 377-382 (1996); Shimohigashi and Stammer, Int J Pept Protein Res 19: 54-62 (1982); and Previero et al., Biochim Biophys Acta 263: 7-13 (1972) (for methods of acylating through a hydroxyl); and San and Silvius, J Pept Res 66: 169-180 (2005) (for methods of acylating through a thiol); Bioconjugate Chem. "Chemical Modifications of Proteins: History and Applications" pages 1, 2-12 (1990); Hashimoto et al., Pharmacuetical Res. "Synthesis of Palmitoyl Derivatives of Insulin and their Biological Activity" Vol. 6, No: 2 pp.l71-176 (1989).
The acyl group of the acylated peptide the NHR ligand-insulin conjugate can be of any size, e.g., any length carbon chain, and can be linear or branched. In some specific embodiments of the invention, the acyl group is a C4 to C30 fatty acid. For example, the acyl group can be any of a C4 fatty acid, C6 fatty acid, C8 fatty acid, C10 fatty acid, C12 fatty acid, C14 fatty acid, C16 fatty acid, C18 fatty acid, C2o fatty acid, C22 fatty acid, C24 fatty acid, C¾ fatty acid, C28 fatty acid, or a C30 fatty acid. In some embodiments, the acyl group is a C8 to C2o fatty acid, e.g., a C14 fatty acid or a C16 fatty acid.
In certain embodiments, the the NHR ligand-insulin conjugate comprises an amino acid with a side chain covalently attached, optionally through a spacer, to an acyl group or an alkyl group, wherein the acyl group or alkyl group is non-native to a naturally-occurring amino acid. In one embodiment the covalently linked acyl or alkyl group has a carboxylate at its free end. The acyl group in some embodiments is a C4 to C30 fatty acyl group, optionally with carboxylate groups at each end. In one embodiment the the NHR ligand- insulin conjugate comprises a covalently linked C4 to C30 acyl group optionally with a carboxylate at its free end. In specific aspects, the acyl group or alkyl group is covalently attached to the side chain of an amino acid of the insulin peptide at position B28 or B29.
In an alternative embodiment, the acyl group is a bile acid. The bile acid can be any suitable bile acid, including, but not limited to, cholic acid, chenodeoxycholic acid, deoxycholic acid, lithocholic acid, taurocholic acid, glycocholic acid, and cholesterol acid.
Alkylation
In some embodiments, the NHR ligand-insulin conjugate is modified to comprise an alkyl group. The alkyl group can be covalently linked directly to an amino acid of the insulin analog, or indirectly to an amino acid of the NHR ligand-insulin conjugate via a spacer, wherein the spacer is positioned between the amino acid of the NHR ligand-insulin conjugate and the alkyl group. The alkyl group can be attached to the NHR ligand-insulin conjugate via an ether, thioether, or amino linkage. For example, the NHR ligand-insulin conjugate may be alkylated at the same amino acid position where a hydrophilic moiety is linked, or at a different amino acid position.
Alkylation can be carried out at any position within the NHR ligand-insulin conjugate, including for example in the C-terminal region of the B chain or at a position in the linking moiety, provided that insulin activity is retained. In a specific aspect of the invention, the NHR ligand-insulin conjugate is modified to comprise an alkyl group by direct alkylation of an amine, hydroxyl, or thiol of a side chain of an amino acid of the NHR ligand-insulin conjugate. In some embodiments, the NHR ligand-insulin conjugate is directly alkylated through the side chain amine, hydroxyl, or thiol of an amino acid. In some specific embodiments of the invention, the direct alkylation of the NHR ligand-insulin conjugate occurs through the side chain amine, hydroxyl, or thiol of the amino acid at position A14, A15, B l (for insulin based B chains), B2 (for IGF-1 based B chains), B IO, B22, B28 or B29 (according to the amino acid numbering of the A and B chain of native insulin).
In some embodiments of the invention, the NHR ligand-insulin conjugate comprises a spacer between the peptide and the alkyl group. In some embodiments, the NHR ligand- insulin conjugate is covalently bound to the spacer, which is covalently bound to the alkyl group. In some exemplary embodiments, the NHR ligand-insulin conjugate is modified to comprise an alkyl group by alkylation of an amine, hydroxyl, or thiol of a spacer, wherein the spacer is attached to a side chain of an amino acid at position A14, A15, B l (for insulin based B chains), B2 (for IGF-1 based B chains), B IO, B22, B28 or B29 (according to the amino acid numbering of the A and B chains of native insulin). The amino acid of the NHR ligand-insulin conjugate to which the spacer is attached can be any amino acid (e.g., a singly a-substituted amino acid or an α,α-disubstituted amino acid) comprising a moiety which permits linkage to the spacer. An amino acid of the NHR ligand-insulin conjugate comprising a side chain -N¾, -OH, or -COOH (e.g., Lys, Orn, Ser, Asp, or Glu) is suitable. In some embodiments, the spacer between the peptide the NHR ligand-insulin conjugate and the alkyl group is an amino acid comprising a side chain amine, hydroxyl, or thiol or a dipeptide or tripeptide comprising an amino acid comprising a side chain amine, hydroxyl, or thiol.
In the instance in which the alpha amine is alkylated, the spacer amino acid can be any amino acid. For example, the spacer amino acid can be a hydrophobic amino acid, e.g., Gly, Ala, Val, Leu, He, Trp, Met, Phe, Tyr. Alternatively, the spacer amino acid can be an acidic residue, e.g., Asp and Glu. In exemplary embodiments, the spacer amino acid can be a hydrophobic amino acid, e.g., Gly, Ala, Val, Leu, He, Trp, Met, Phe, Tyr, 6-amino hexanoic acid, 5-aminovaleric acid, 7-aminoheptanoic acid, 8-aminooctanoic acid.
Alternatively, the spacer amino acid can be an acidic residue, e.g., Asp and Glu, provided that the alkylation occurs on the alpha amine of the acidic residue. In the instance in which the side chain amine of the spacer amino acid is alkylated, the spacer amino acid is an amino acid comprising a side chain amine, e.g., an amino acid of Formula I (e.g., Lys or Orn). In this instance, it is possible for both the alpha amine and the side chain amine of the spacer amino acid to be alkylated, such that the peptide is dialkylated. Embodiments of the invention include such dialkylated molecules.
In some embodiments, the spacer comprises a hydrophilic bifunctional spacer. In a specific embodiment, the spacer comprises an amino poly(alkyloxy)carboxylate. In this regard, the spacer can comprise, for example, NH2(CH2CH20)n(CH2)mCOOH, wherein m is any integer from 1 to 6 and n is any integer from 2 to 12, such as, e.g., 8-amino-3,6- dioxaoctanoic acid, which is commercially available from Peptides International, Inc.
(Louisville, KY). In some embodiments, the spacer between peptide the NHR ligand-insulin conjugate and the alkyl group is a hydrophilic bifunctional spacer. In certain embodiments, the hydrophilic bifunctional spacer comprises two or more reactive groups, e.g., an amine, a hydroxyl, a thiol, and a carboxyl group or any combinations thereof. In certain
embodiments, the hydrophilic bifunctional spacer comprises a hydroxyl group and a carboxylate. In other embodiments, the hydrophilic bifunctional spacer comprises an amine group and a carboxylate. In other embodiments, the hydrophilic bifunctional spacer comprises a thiol group and a carboxylate.
The spacer (e.g., amino acid, dipeptide, tripeptide, hydrophilic bifunctional spacer, or hydrophobic bifunctional spacer) is 3 to 10 atoms (e.g., 6 to 10 atoms, (e.g., 6, 7, 8, 9, or 10 atoms)) in length. In more specific embodiments, the spacer is about 3 to 10 atoms (e.g., 6 to 10 atoms) in length and the alkyl is a C12 to C18 alkyl group, e.g., C14 alkyl group, C16 alkyl group, such that the total length of the spacer and alkyl group is 14 to 28 atoms, e.g., about 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, or 28 atoms. In some
embodiments the length of the spacer and alkyl is 17 to 28 (e.g., 19 to 26, 19 to 21) atoms.
In accordance with one embodiment the bifunctional spacer is a synthetic or non- naturally occurring amino acid comprising an amino acid backbone that is 3 to 10 atoms in length (e.g., 6-amino hexanoic acid, 5-aminovaleric acid, 7-aminoheptanoic acid, and 8- aminooctanoic acid). Alternatively, the spacer can be a dipeptide or tripeptide spacer having a peptide backbone that is 3 to 10 atoms (e.g., 6 to 10 atoms) in length. The dipeptide or tripeptide spacer attached to the NHR ligand-insulin conjugate can be composed of naturally-occurring and/or non-naturally occurring amino acids, including, for example, any of the amino acids taught herein. In some embodiments the spacer comprises an overall negative charge, e.g., comprises one or two negatively charged amino acids. In some embodiments the dipeptide spacer is selected from the group consisting of: Ala- Ala, β-Ala- β-Ala, Leu-Leu, Pro-Pro, γ-aminobutyric acid- γ-aminobutyric acid, and γ-Glu- γ-Glu. In one embodiment the dipeptide spacer is γ-Glu- γ-Glu.
Suitable methods of peptide alkylation via amines, hydroxyls, and thiols are known in the art. For example, a Williamson ether synthesis can be used to form an ether linkage between the insulin peptide and the alkyl group. Also, a nucleophilic substitution reaction of the peptide with an alkyl halide can result in any of an ether, thioether, or amino linkage.
The alkyl group of the alkylated peptide the NHR ligand-insulin conjugate can be of any size, e.g., any length carbon chain, and can be linear or branched. In some embodiments of the invention, the alkyl group is a C4 to C3o alkyl. For example, the alkyl group can be any of a C4 alkyl, C6 alkyl, C8 alkyl, Cio alkyl, Ci2 alkyl, Cu alkyl, Cl6 alkyl, Ci8 alkyl, C20 alkyl,
C22 alkyl, C24 alkyl, C¾ alkyl, C28 alkyl, or a C3o alkyl. In some embodiments, the alkyl group is a C8 to C2o alkyl, e.g., a C14 alkyl or a C16 alkyl.
In some specific embodiments, the alkyl group comprises a steroid moiety of a bile acid, e.g., cholic acid, chenodeoxycholic acid, deoxycholic acid, lithocholic acid, taurocholic acid, glycocholic acid, and cholesterol acid.
When a long chain alkane is alkylated by the NHR ligand-insulin conjugate or the spacer, the long chain alkane may be of any size and can comprise any length of carbon chain. The long chain alkane can be linear or branched. In certain aspects, the long chain alkane is a C4 to C3o alkane. For example, the long chain alkane can be any of a C4 alkane, C6 alkane, C8 alkane, Cio alkane, C12 alkane, C14 alkane, C16 alkane, C18 alkane, C2o alkane,
C22 alkane, C24 alkane, C¾ alkane, C28 alkane, or a C3o alkane. In some embodiments the long chain alkane comprises a C8 to C2o alkane, e.g., a C14 alkane, C16 alkane, or a C18 alkane.
Also, in some embodiments alkylation can occur between the insulin analog and a cholesterol moiety. For example, the hydroxyl group of cholesterol can displace a leaving group on the long chain alkane to form a cholesterol-insulin peptide product.
Self Cleaving Dipeptide Element
In accordance with one embodiment the insulin peptide of the conjugates disclosed herein are further modified to comprise a self cleaving dipeptide element. In one
embodiment the dipeptide element comprises the structure U-J, wherein U is an amino acid or a hydroxyl acid and J is an N-alkylated amino acid. In one embodiment one or more dipeptide elements are linked to the NHR ligand-insulin conjugate through an amide bond formed through one or more amino groups selected from the N-terminal amino group of the A or B chain of the insulin component, or the side chain amino group of an amino acid present in the conjugate. In accordance with one embodiment one or more dipeptide elements are linked to the NHR ligand-insulin conjugate at an amino group selected from the N-terminal amino group of the conjugate, or the side chain amino group of an aromatic amine of a 4-amino-phenylalanine residue present at a position corresponding to position A19, B 16 or B25 of native insulin, or a side chain of an amino acid of the linking moiety of a single chain insulin analog.
In one embodiment the dipeptide prodrug element comprises the general structure of Formula X:
Figure imgf000081_0001
wherein
Ri, R2, R4 and R8 are independently selected from the group consisting of H, Ci-Ci8 alkyl, C2-Ci8 alkenyl, (Ci-Cis alkyl)OH, (Ci-Cis alkyl)SH, (C2-C3 alkyl)SCH3, (d-C4 alkyl)CONH2, (d-C4 alkyl)COOH, (d-C4 alkyl)NH2, (Ci-C4 alkyl)NHC(NH2 +)NH2, (C0-C4 alkyl)(C3-C6 cycloalkyl), (C0-C4 alkyl)(C2-C5 heterocyclic), (C0-C4 alkyl)(C6-Ci0 aryl)R7, (Ci-C4 alkyl)(C3-C9 heteroaryl), and C1-C12 alkyl(W)Ci-Ci2 alkyl, wherein W is a heteroatom selected from the group consisting of N, S and O, or Ri and R2 together with the atoms to which they are attached form a C3-Ci2 cycloalkyl or aryl; or R4 and R8 together with the atoms to which they are attached form a C3-C6 cycloalkyl;
R3 is selected from the group consisting of Ci-Ci8 alkyl, (Ci-Ci8 alkyl)OH, (Ci-Ci8 alkyl)NH2, (Ci-Cis alkyl)SH, (C0-C4 alkyl)(C3-C6)cycloalkyl, (C0-C4 alkyl)(C2-C5 heterocyclic), (C0-C4 alkyl)(C6-Cio aryl)R7, and (C1-C4 alkyl)(C3-C9 heteroaryl) or R4 and R3 together with the atoms to which they are attached form a pyrrolidine ring;
R5 is NHR6 or OH;
R6 is H, Ci-C8 alkyl; and
R7 is selected from the group consisting of H and OH. In one embodiment when the prodrug element is linked to the N-terminal amine of the NHR ligand-insulin conjugate and R4 and R3 together with the atoms to which they are attached form a pyrrolidine ring, then at least one of Ri and R2 are other than H. In one embodiment a complex is provided comprising the general structure U-J-(Q- L-Y), wherein Q-L-Y comprises any of the structures as described elsewhere in this disclosure and U-J is a dipeptide that is linked via an amide bond to an amine of the Q-L-Y conjugate. In one embodiment U-J is linked to amine present on the insulin peptide. In one embodiment U-J is linked to the N-terminal alpha amine of the A or B chain of the insulin peptide of the conjugate.
In one embodiment, a complex of the structure U-J-(Q-L-Y) is provided, wherein Q- L-Y comprises any of the structures as described elsewhere in this disclosure and wherein U is an amino acid or a hydroxy acid;
J is an N-alkylated amino acid linked to Q through an amide bond between a carboxyl moiety of B and an amine of Q. In one embodiment U-J comprises the structure:
Figure imgf000082_0001
wherein
(a) R 1 , R2 , R 4 and R 8 are independently selected from the group consisting of H, CI -CI 8 alkyl, C2-C18 alkenyl, (CI -CI 8 alkyl)OH, (CI -CI 8 alkyl)SH, (C2-C3 alkyl)SCH3, (C1-C4 alkyl)CONH2, (C1-C4 alkyl)COOH, (C1-C4 alkyl)NH2, (C1-C4
alkyl)NHC(NH2 +)NH2, (C0-C4 alkyl)(C3-C6 cycloalkyl), (C0-C4 alkyl)(C2-C5
heterocyclic), (C0-C4 alkyl)(C6-C10 aryl)R7, (C1-C4 alkyl)(C3-C9 heteroaryl), and C1-C12 alkyl(Wl)Cl-C12 alkyl, wherein Wl is a heteroatom selected from the group consisting of N, S and O, or
(ii) R 1 and R 2 together with the atoms to which they are attached form a C3-C12 cycloalkyl or aryl; or
(iii) R 4 and R 8 together with the atoms to which they are attached form a C3-C6 cycloalkyl;
(b) R3 is selected from the group consisting of C 1 -C 18 alkyl, (C 1 -C 18 alkyl)OH,
(C1-C18 alkyl)NH2, (C1-C18 alkyl)SH, (C0-C4 alkyl)(C3-C6)cycloalkyl, (C0-C4 alkyl)(C2- C5 heterocyclic), (C0-C4 alkyl)(C6-C10 aryl)R7, and (C1-C4 alkyl)(C3-C9 heteroaryl) or R4 and R together with the atoms to which they are attached form a 4, 5 or 6 member heterocyclic ring;
(c) R5 is NHR6 or OH; (d) R is H, Ci-Cg alkyl; and
(e) R is selected from the group consisting of H and OH
wherein the chemical cleavage half-life (ti/2) of U- J from Q or Y is at least about 1 hour to about 1 week in PBS under physiological conditions.
In a further embodiment, U-J comprises the structure:
Figure imgf000083_0001
wherein
Ri and R8 are independently H or Ci-C8 alkyl;
R2 and R4 are independently selected from the group consisting of H, Ci-C8 alkyl, (C1-C4 alkyl)OH, (C1-C4 alkyl)SH, (C2-C3 alkyl)SCH3, (C1-C4 alkyl)CONH2, (C1-C4 alkyl)COOH, (C1-C4 alkyl)NH2, and (d-C4 alkyl)(C6 aryl)R7;
R3 is Ci-C6 alkyl;
R5 is NH2; and
R7 is selected from the group consisting of hydrogen, and OH.
In a further embodiment, U-J comprises the structure:
Figure imgf000083_0002
wherein
Ri is H;
R2 is H, C1-C4 alkyl, (CH2 alkyl)OH, (C1-C4 alkyl)NH2, or (CH2)(C6 aryl)R7; R3 is Ci-C6 alkyl;
R4 is H, C1-C4 alkyl, or (CH2)(C6 aryl)R7;
R5 is NH2;
R8 is hydrogen; and
R7 is H or OH.
In a further embodiment, U-J comprises the structure:
Figure imgf000084_0001
wherein
Ri is H or Ci-C4 alkyl;
R2 is H, Ci-C4 alkyl, or (C1-C4 alkyl)NH2;
R3 is Ci-C6 alkyl;
R4 is H, or C1-C4 alkyl;
R5 is NH2; and
R8 is hydrogen. In a further embodiment, Ri is Ci-C4 alkyl, R2 is Ci-C4 alkyl, R3 is C C6 alkyl; R4 and R8 are each H and R5 is NH2. In a further embodiment, R2 is Ci-C4 alkyl, or (C1-C4 alkyl)NH2 alkyl, R3 is Ci-C6 alkyl; Rls R4 and R8 are each H and R5 is NH2.
Pharmaceutical compositions comprising the NHR ligand-insulin conjugates disclosed herein can be formulated and administered to patients using standard
pharmaceutically acceptable carriers and routes of administration known to those skilled in the art. Accordingly, the present disclosure also encompasses pharmaceutical compositions comprising one or more of the NHR ligand-insulin conjugates disclosed herein or a pharmaceutically acceptable salt thereof, in combination with a pharmaceutically acceptable carrier. In one embodiment the pharmaceutical composition comprises a lmg/ml concentration of the NHR ligand-insulin conjugate at a pH of about 4.0 to about 7.0 in a phosphate buffer system. The pharmaceutical compositions may comprise the NHR ligand- insulin conjugate as the sole pharmaceutically active component, or the NHR ligand-insulin conjugate peptide can be combined with one or more additional active agents.
All therapeutic methods, pharmaceutical compositions, kits and other similar embodiments described herein contemplate that NHR ligand-insulin conjugate peptides include all pharmaceutically acceptable salts thereof.
In one embodiment the kit is provided with a device for administering the NHR ligand-insulin conjugate to a patient. The kit may further include a variety of containers, e.g., vials, tubes, bottles, and the like. Preferably, the kits will also include instructions for use. In accordance with one embodiment the device of the kit is an aerosol dispensing device, wherein the composition is prepackaged within the aerosol device. In another embodiment the kit comprises a syringe and a needle, and in one embodiment the NHR ligand-insulin conjugate composition is prepackaged within the syringe.
The compounds of this invention may be prepared by standard synthetic methods, recombinant DNA techniques, or any other methods of preparing peptides and fusion proteins. Although certain non-natural amino acids cannot be expressed by standard recombinant DNA techniques, techniques for their preparation are known in the art.
Compounds of this invention that encompass non-peptide portions may be synthesized by standard organic chemistry reactions, in addition to standard peptide chemistry reactions when applicable.
Exemplary Embodiments
In accordance with embodiment 1, an insulin peptide/NHR ligand conjugate is provided wherein the conjugate comprises the structure Q-L-Y;
wherein
Q is an insulin peptide;
Y is a NHR ligand; and
L is a linking group or a bond
wherein said conjugate has activity at both a nuclear hormone receptor and an insulin receptor, wherein said NHR ligand is
i) a steroid that exhibits an EC50 of about 1 μΜ or less when unconjugated to
Q-L, and further has a molecular weight of up to about 1000 daltons; or
ii) a ligand that activates the thyroid hormone receptor; or
iii) a ligand that activates the peroxisome proliferator-activated receptors (PPAR), wherein the nuclear hormone receptor ligand is covalently linked to the insulin peptide through the side chain of a lysine residue located at a position selected from the group consisting of B28 and B29, or the N-terminal alpha amine of the A or B chain, optionally the NHR ligand is linked to the carboxy terminus of the insulin peptide B chain. More particularly, in one embodiment the NHR ligand is a ligand that activates the thyroid hormone receptor or activates the peroxisome proliferator-activated receptors, wherein the NHR ligand is linked via the side chain of a lysine residue located at the C-terminus of the insulin B chain.
In accordance with embodiment 2 the conjugate of embodiment 1 is provided wherein Y is selected from the group consisting of estradiol and derivatives thereof, estrone and derivatives thereof, testosterone and derivatives thereof, and Cortisol and derivatives thereof.
In accordance with embodiment 3 the conjugate of embodiment 2 is provided wherein Y is dexamethasone.
In accordance with embodiment 4 the conjugate of any one of embodiments 1-3 is provided wherein Y is a compound having the general structure
Figure imgf000086_0001
wherein
Ri5 is Ci-C4 alkyl, -CH2(pyridazinone), -CH2(OH)(phenyl)F, -CH(OH)CH3, halo or H; R2o is halo, CH3 or H;
R2i is halo, CH3 or H;
R22 is H, OH, halo, -CH2(OH)(C6 aryl)F, or C1-C4 alkyl; and
R23 is -CH2CH(NH2)COOH, -OCH2COOH, -NHC(0)COOH, -CH2COOH,
-NHC(0)CH2COOH, -CH2CH2COOH, or -OCH2P03 2".
In accordance with embodiment 5 the conjugate of embodiments 4 is provided wherein
Ri5 is C1-C4 alkyl, -CH(OH)CH3, 1 or H
R20 is I, Br, CH3 or H;
R2i is I, Br, CH3 or H;
R22 is H, OH, I, or C1-C4 alkyl; and
R23 is -CH2CH(NH2)COOH, -OCH2COOH, -NHC(0)COOH, -CH2COOH,
-NHC(0)CH2COOH, -CH2CH2COOH, or -OCH2P03 2".
In accordance with embodiment 6 the conjugate of of any one of embodiment 4 is provided wherein
Ri5 is C1-C4 alkyl, l or H;
R20 is I, Br, CH3 or H;
R2i is I, Br, CH3 or H;
R22 is H, OH, I, or Ci-C4 alkyl; and R23 is -CH2CH(NH2)COOH, -OCH2COOH, -NHC(0)COOH, -CH2COOH,
-NHC(0)CH2COOH, -CH2CH2COOH, or -OCH2P03 2".
In accordance with embodiment 7 the conjugate of any one of embodiments 1-6 is provided wherein Y is a compound of the general structure of Formula I:
Figure imgf000087_0001
wherein
R20, R21, and R22 are independently selected from the group consisting of H, OH, halo and Ci-C4 alkyl; and
Ri5 is halo or H.
In accordance with embodiment 8 the conjugate of any one of embodiments 1-7 is provided wherein Y is selected from the group consisting of 3,5,3',5'-tetra-iodothyronine, and 3,5,3'-triiodo L-thyronine.
In accordance with embodiment 9 the conjugate of embodiment 1 is provided wherein Y is selected from the group consisting of Tesaglitazar, Aleglitazar and thiazolidinediones, optionally wherein Y is Tesaglitazar or Aleglitazar, optionally Y is Tesaglitazar linked to the N-terminal amine of the insulin B chain.
In accordance with embodiment 10 the conjugate of any one of embodiments 1-9 is provided wherein said insulin peptide (Q) comprises an A chain and a B chain wherein said A chain comprises a sequence
GIVX4X5CCX8X9XioCXi2LXi4Xi5LXi7Xi8YCX2i-R53 (SEQ ID NO: 19), and said B chain comprises a sequence R62-X25LCGX29X3oLVX33X34LYLVCGX4iX42GFX45 (SEQ ID NO: 20), wherein
X4 is glutamic acid or aspartic acid;
X5 is glutamine or glutamic acid
X8 is histidine, threonine or phenylalanine;
X9 is serine, arginine, lysine, ornithine or alanine;
X10 is isoleucine or serine;
X12 is serine or aspartic acid;
X14 is tyrosine, arginine, lysine, ornithine or alanine;
Xi5 is glutamine, glutamic acid, arginine, alanine, lysine, ornithine or leucine; X17 is glutamic acid, aspartic acid, asparagine, lysine, ornithine or glutamine;
X18 is methionine, asparagine, glutamine, aspartic acid, glutamic acid or threonine; X21 is selected from the group consisting of alanine, glycine, serine, valine, threonine, isoleucine, leucine, glutamine, glutamic acid, asparagine, aspartic acid, histidine, tryptophan, tyrosine, and methionine;
X25 is histidine or threonine;
X29 is selected from the group consisting of alanine, glycine and serine;
X30 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
X33 is selected from the group consisting of aspartic acid and glutamic acid;
X34 is selected from the group consisting of alanine and threonine;
X4i is selected from the group consisting of glutamic acid, aspartic acid or asparagine;
X42 is selected from the group consisting of alanine, ornithine, lysine and arginine; X45 is tyrosine or phenylalanine;
R62 is selected from the group consisting of AYRPSE (SEQ ID NO: 14), FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptide glycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine, a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine, glutamine, glutamic acid and an N-terminal amine; and
R53 is COOH or CONH2.
In accordance with embodiment 11 the conjugate of any one of embodiments 1-10 is provided wherein said insulin peptide comprises an A chain and a B chain and said A chain comprises the sequence GIVEQCCX8X9ICSLYQLENYCX2i-R53 (SEQ ID NO: 73) said B chain comprises the sequence R62-X25LCGX29X3oLVX33X34LYLVCGX4iX42GFX45 (SEQ ID NO: 20), wherein
X8 is histidine or threonine;
X9 is serine, lysine, or alanine;
X21 is alanine, glycine or asparagine;
X25 is histidine or threonine;
X29 is selected from the group consisting of alanine, glycine and serine;
X3o is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
X33 is selected from the group consisting of aspartic acid and glutamic acid; X34 is selected from the group consisting of alanine and threonine;
X4i is selected from the group consisting of glutamic acid, aspartic acid or asparagine;
X42 is selected from the group consisting of alanine, ornithine, lysine and arginine; X45 is tyrosine or phenylalanine;
P 62 is selected from the group consisting of FVNQ (SEQ ID NO: 12), a tripeptide valine-asparagine-glutamine, a dipeptide asparagine-glutamine, glutamine and an N-terminal amine; and
R53 is COOH or CONH2.
In accordance with embodiment 12 the conjugate of any one of embodiments 1-11 is provided wherein said insulin peptide comprises an A chain and a B chain wherein said A chain comprises a sequence GIVDECCX8X9SCDLRRLEMX19CX21-R53 (SEQ ID NO: 74) and said B chain comprises a sequence R62-X25LCGAX3oLVDALYLVCGDX42GFY (SEQ ID NO: 75), wherein
X8 is phenylalanine or histidine;
X9 is arginine, ornithine or alanine;
X19 is tyrosine, 4-methoxy-phenylalanine or 4-amino-phenylalanine;
X21 is alanine or asparagine;
X25 is histidine or threonine;
X3o is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
X42 is selected from the group consisting of alanine ornithine and arginine; and Rs3 is COOH or CONH2;
R62 is selected from the group consisting of AYRPSE (SEQ ID NO: 14), FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptide glycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine, a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine, glutamine, glutamic acid and an N-terminal amine; and
R5 is COOH or CONH2.
In accordance with embodiment 13 the conjugate of embodiments 12 is provided wherein the B chain sequence comprises the sequence
FVKQX25LCGSHLVEALYLVCGERGFF-R63 (SEQ ID NO: 147), or
FVNQX25LCGSHLVEALYLVCGERGFF-R63 (SEQ ID NO: 148), wherein
X25 is selected from the group consisting of histidine and threonine; and R63 is selected from the group consisting of YTX28KT (SEQ ID NO: 149), YTKPT (SEQ ID NO: 150), YTX28K (SEQ ID NO: 152), YTKP (SEQ ID NO: 151), YTPK (SEQ ID NO: 70), YTX28, YT, Y and a bond, wherein X28 is proline, aspartic acid or glutamic acid.
In accordance with embodiment 14 the conjugate of any one of embodiments 1-13 is provided wherein the insulin peptide comprises an A chain and a B chain wherein the A chain comprises the sequence GIVEQCCX8X9ICSLYQLENYCX2i-R53 (SEQ ID NO: 73), and the B chain sequence comprises the sequence
FVKQX25LCGS HLVE ALYLVC GERGFF YTEKT (SEQ ID NO: 162),
FVNQX25LCGSHLVEALYLVCGERGFFYTDKT (SEQ ID NO: 164),
FVNQX25LCGS HLVE ALYLVC GERGFF YTKPT (SEQ ID NO: 165) or
FVNQX25LCGS HLVE ALYLVC GERGFF YTPKT (SEQ ID NO: 161) wherein
X8 is histidine or threonine;
X9 is serine, lysine, or alanine;
X21 is alanine, glycine or asparagine;
X25 is selected from the group consisting of histidine and threonine.
In accordance with embodiment 15 the conjugate of embodiment 11 is provided wherein said B chain comprises a sequence R62-
X25LCGX29X3oLVX33X34LYLVCGX4iX42GFX45YT-Zi-Bi (SEQ ID NO: 142), wherein X25 is histidine or threonine;
X29 is selected from the group consisting of alanine, glycine and serine;
X3o is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
X33 is selected from the group consisting of aspartic acid and glutamic acid;
X34 is selected from the group consisting of alanine and threonine;
X4i is selected from the group consisting of glutamic acid, aspartic acid or asparagine;
X42 is selected from the group consisting of alanine, ornithine, lysine and arginine; X45 is tyrosine or phenylalanine;
Zi is a dipeptide selected from the group consisting of aspartate-lysine, lysine- proline, and proline-lysine; and
Bi is selected from the group consisting of threonine, alanine or a threonine-arginine- arginine tripeptide. In accordance with embodiment 16 the conjugate of any one of embodiments 1-15 is provided wherein said A chain comprises a sequence GIVEQCCTSICSLYQLENYCN-R53 (SEQ ID NO: 1) and said B chain comprises a sequence
FVNQHLC GS HLVE AL YLVC GERGFFYTPKT (SEQ ID NO: 2), wherein R53 is COOH or CONH2.
In accordance with embodiment 17 the conjugate of any one of embodiments 1-16 is provided wherein Y is selected from the group consisting of 3,5,3',5'-tetra-iodothyronine, and 3,5,3'-triiodo L-thyronine and Y is linked via the side chain of the lysine residue present at B28 or B29 of said B chain.
In accordance with embodiment 18 the conjugate of embodiment 17 is provided wherein Y is 3,5,3'-triiodo L-thyronine and Y is linked via the side chain of the lysine residue present at B29 of said B chain.
In accordance with embodiment 19 the conjugate of any one of embodiments 1-18 is provided wherein the insulin peptide is a single chain insulin and the peptide linking moiety joining the B and A chains is selected from the group consisting of
SSSSKAPPPSLPSPSRLPGPSDTPILPQR (SEQ ID NO: 52),
SSSSRAPPPSLPSPSRLPGPSDTPILPQK (SEQ ID NO: 51), GAGSSSX57X58 (SEQ ID NO: 76), GYGSSSX57X58 (SEQ ID NO: 21) and G YGS S S X57X58 APQT ; (SEQ ID NO: 77), wherein
X57 and X58 are independently arginine, lysine or ornithine.
In accordance with embodiment 20 the conjugate of embodiment 19 is provided wherein the peptide linking moiety is selected from the group consisting of GYGSSSRR (SEQ ID NO: 18) and GAGSSSRR (SEQ ID NO: 22).
In accordance with embodiment 21 the conjugate of any one of embodiments 1-20 is provided wherein the N-terminus of said B chain is pegylated.
In accordance with embodiment 22 the conjugate of any one of embodiments 1-21 is provided wherein L is stable in vivo, hydrolyzable in vivo, or metastable in vivo.
In accordance with embodiment 23 the conjugate of embodiment 22 is provided wherein L comprises an ether moiety, or an amide moiety, an ester moiety, an acid-labile moiety, a reduction-labile moiety , an enzyme-labile moiety, a hydrazone moiety, a disulfide moiety, or a cathepsin-cleavable moiety.
In accordance with embodiment 24 the conjugate of any one of embodiments 1-23 is provided wherein Y-L is covalently linked to the insulin peptide (Q) through a position independently selected from the side chain of an amino acid at B28 or B29 of the B chain, the N-terminal alpha amine of the B chain, the N-terminal alpha amine of the A chain and at the side chain of an amino acid at any position of a linking moiety that links the A chain and B chain of a single chain insulin peptide.
In accordance with embodiment 25 the conjugate of embodiment 24 is provided wherein Y-L is covalently linked to the insulin peptide (Q) through the side chain of an amino acid at B28 or B29 of the B chain.
In accordance with embodiment 26 the conjugate of any one of embodiments 1-25 is provided wherein the conjugate is derivatized by linking the structure U-J to the conjugate, wherein
U is an amino acid or a hydroxy acid;
J is an N-alkylated amino acid, sad structure being linked to said conjugate through an amide bond between a carboxyl moiety of J and an amine of the conjugate, wherein the chemical cleavage half- life (ti/2) of U-J from the conjugate is at least about 1 hour to about 1 week in PBS under physiological conditions.
In accordance with embodiment 27 the conjugate of any one of embodiments 1-26 is provided wherein a hydrophilic moiety is covalently linked at one or more positions corresponding to A14, A15, BO, B l, B IO, B22, B28, B29.
In accordance with embodiment 28 the conjugate of embodiment 27 is provided wherein the hydrophilic moiety is a polyethylene glycol.
In accordance with embodiment 29 the conjugate of any one of embodiments 1-28 is provided further comprising an acyl group or alkyl group covalently linked to an amino acid side chain of said conjugate.
In accordance with embodiment 30 the conjugate of embodiment 29 is provided wherein said acyl group or alkyl group is covalently linked to one or more positions selected from A14, A15, B0, B l, B 10, B22, B28, B29 of the insulin peptide.
In accordance with embodiment 31 the conjugate of any one of embodiments 1-30 is provided as a pharmaceutical composition comprising a conjugate of any one of the preceding embodiments, or pharmaceutically acceptable salt thereof, and a pharmaceutically acceptable carrier.
In accordance with embodiment 32 the conjugate of any one of embodiments 1-31 is provided for treating diabetes or reducing weight gain or enhncing weight loss. In accordance with embodiment 33 the use of the conjugate of the composition of embodiment 31, or pharmaceutically acceptable salt thereof, is provided for the treatment of diabetes.
In accordance with embodiment 34 a method of treating diabetes and optionally i) lowering body weight;
ii) lowering triglycerides;
ii) lowering cholesterol levels; or
iv) any combination of i), ii) and iii) is provided wherein the method comprises the step of administering a conjugate of any one of embodiments 1-30 to a patient in need thereof.
EXAMPLE 1
Synthesis of Insulin A & B Chains
Insulin A & B chains were synthesized on 4-methylbenzhyryl amine (MB HA) resin or 4-Hydroxymethyl-phenylacetamidomethyl (PAM) resin using Boc chemistry. The peptides were cleaved from the resin using HF/p-cresol 95:5 for 1 hour at 0°C. Following HF removal and ether precipitation, peptides were dissolved into 50% aqueous acetic acid and lyophilized. Alternatively, peptides were synthesized using Fmoc chemistry. The peptides were cleaved from the resin using Trifluoroacetic acid (TFA)/ Triisopropylsilane (TIS)/ H20 (95:2.5:2.5), for 2 hour at room temperature. The peptide was precipitated through the addition of an excessive amount of diethyl ether and the pellet solubilized in aqueous acidic buffer. The quality of peptides were monitored by RP-HPLC and confirmed by Mass Spectrometry (ESI or MALDI).
Insulin A chains were synthesized with a single free cysteine at amino acid 7 and all other cysteines protected as acetamidomethyl A-(SH)7(Acm)6'U'20. Insulin B chains were synthesized with a single free cysteine at position 7 and the other cysteine protected as acetamidomethyl B-(SH) 7 (Acm) 19. The crude peptides were purified by conventional RP- HPLC.
The synthesized A and B chains were linked to one another through their native disulfide bond linkage in accordance with the general procedure outlined in Fig. 1. The respective B chain was activated to the Cys -Npys analog through dissolution in DMF or DMSO and reacted with 2,2' -Dithiobis (5-nitropyridine) (Npys) at a 1: 1 molar ratio, at room temperature. The activation was monitored by RP-HPLC and the product was confirmed by ESI-MS.
The first B7-A7 disulfide bond was formed by dissolution of the respective A-
(SH) 7 (Acm) 6 ' 11 ' 20 and B-(Npys) 7 (Acm) 19 at 1: 1 molar ratio to a total peptide concentration of 10 mg/ml. When the chain combination reaction was complete the mixture was diluted to a concentration of 50% aqueous acetic acid. The last two disulfide bonds were formed simultaneously through the addition of iodine. A 40 fold molar excess of iodine was added to the solution and the mixture was stirred at room temperature for an additional hour. The reaction was terminated by the addition of an aqueous ascorbic acid solution. The mixture was purified by RP-HPLC and the final compound was confirmed by MALDI-MS. As shown in Fig. 2 and the data in Table 1, the synthetic insulin prepared in accordance with this procedure compares well with purified insulin for insulin receptor binding.
Insulin peptides comprising a modified amino acid (such as 4-amino phenylalanine at position A 19) can also be synthesized in vivo using a system that allows for incorporation of non-coded amino acids into proteins, including for example, the system taught in US Patent Nos. 7,045,337 and 7,083,970.
Table 1: Activity of synthesized insulin relative to native insulin
Figure imgf000094_0001
EXAMPLE 2
Pegylation of Amine Groups (N-Terminus and Lysine) by Reductive Alkylation
a. Synthesis
Insulin (or an insulin analog), mPEG20k-Aldyhyde, and NaB¾CN, in a molar ratio of 1:2:30, were dissolved in acetic acid buffer at a pH of 4.1-4.4. The reaction solution was composed of 0.1 N NaCl, 0.2 N acetic acid and 0.1 N Na2C03. The insulin peptide concentration was approximately 0.5 mg/ml. The reaction occurs over six hours at room temperature. The degree of reaction was monitored by RP-HPLC and the yield of the reaction was approximately 50%.
b. Purification
The reaction mixture was diluted 2-5 fold with 0.1% TFA and applied to a
preparative RP-HPLC column. HPLC condition: C4 column; flow rate 10 ml/min; A buffer 10% ACN and 0.1% TFA in water; B buffer 0.1% TFA in ACN; A linear gradient B% from 0-40% (0-80 min); PEG-insulin or analogues was eluted at approximately 35% buffer B. The desired compounds were verified by MALDI-TOF, following chemical modification through sulftolysis or trypsin degradation.
Pegylation of Amine Groups (N-Terminus and Lysine) by N-Hydroxysuccinimide Acylation. a. Synthesis
Insulin (or an insulin analog) along with mPEG20k-NHS were dissolved in 0.1 N Bicine buffer (pH 8.0) at a molar ratio of 1: 1. The insulin peptide concentration was approximately 0.5 mg/ml. Reaction progress was monitored by HPLC. The yield of the reaction is approximately 90% after 2 hours at room temperature.
b. Purification
The reaction mixture was diluted 2-5 fold and loaded to RP-HPLC.
HPLC condition: C4 column; flow rate 10 ml/min; A buffer 10% ACN and 0.1% TFA in water; B buffer 0.1% TFA in ACN; A linear gradient B% from 0-40% (0-80 min); PEG- insulin or analogues was collected at approximately 35% B. . The desired compounds were verified by MALDI-TOF, following chemical modification through sulftolysis or trypsin degradation. Reductive Aminated Pegylation of Acetyl Group on the Aromatic Ring Of The
Phenylalanine
a. Synthesis
Insulin (or an insulin analogue), mPEG20k-Hydrazide, and NaBH3CN in a molar ratio of 1:2:20 were dissolved in acetic acid buffer (pH of 4.1 to 4.4). The reaction solution was composed of 0.1 N NaCl, 0.2 N acetic acid and 0.1 N Na2C03. Insulin or insulin analogue concentration was approximately 0.5 mg/ml. at room temperature for 24h. The reaction process was monitored by HPLC. The conversion of the reaction was
approximately 50%. (calculated by HPLC) b. Purification
The reaction mixture was diluted 2-5 fold and loaded to RP-HPLC.
HPLC condition: C4 column; flow rate 10 ml/min; A buffer 10% ACN and 0.1% TFA in water; B buffer 0.1% TFA in ACN; A linear gradient B% from 0-40% (0-80 min); PEG- insulin, or the PEG-insulin analogue was collected at approximately 35%B. . The desired compounds were verified by MALDI-TOF, following chemical modification through sulftolysis or trypsin degradation.
EXAMPLE 3
Insulin Receptor Binding Assay:
The affinity of each peptide for the insulin or IGF-1 receptor was measured in a competition binding assay utilizing scintillation proximity technology. Serial 3-fold dilutions of the peptides were made in Tris-Cl buffer (0.05 M Tris-HCl, pH 7.5, 0.15 M NaCl, 0.1% w/v bovine serum albumin) and mixed in 96 well plates (Corning Inc., Acton, MA) with 0.05 nM (3-[125I]-iodotyrosyl) A TyrA14 insulin or (3-[125I]-iodotyrosyl) IGF-1 (Amersham Biosciences, Piscataway, NJ). An aliquot of 1-6 micrograms of plasma membrane fragments prepared from cells over-expressing the human insulin or IGF-1 receptors were present in each well and 0.25 mg/well polyethylene imine-treated wheat germ agglutinin type A scintillation proximity assay beads (Amersham Biosciences, Piscataway, NJ) were added. After five minutes of shaking at 800 rpm the plate was incubated for 12h at room temperature and radioactivity was measured with MicroB eta 1450 liquid scintillation counter (Perkin-Elmer, Wellesley, MA). Non-specifically bound (NSB) radioactivity was measured in the wells with a four- fold concentration excess of "cold" native ligand than the highest concentration in test samples. Total bound radioactivity was detected in the wells with no competitor. Percent specific binding was calculated as following: % Specific
Binding = (Bound-NSB / Total bound-NSB) x 100. IC50 values were determined by using Origin software (OriginLab, Northampton, MA).
EXAMPLE 4
Insulin Receptor Phosphorylation Assay:
To measure receptor phosphorylation of insulin or incretin-insulin conjugate, receptor transfected HEK293 cells were plated in 96 well tissue culture plates (Costar #3596, Cambridge, MA) and cultured in Dulbecco's modified Eagle medium (DMEM) supplemented with 100 IU/ml penicillin, 100 g/ml streptomycin, 10 mM HEPES and 0.25% bovine growth serum (HyClone SH30541, Logan, UT) for 16-20 hrs at 37 °C, 5% CO2 and 90% humidity. Serial dilutions of insulin or insulin analogs were prepared in DMEM supplemented with 0.5% bovine serum albumin (Roche Applied Science #100350, Indianapolis, IN) and added to the wells with adhered cells. After 15 min incubation at 37 °C in humidified atmosphere with 5% C02 the cells were fixed with 5% paraformaldehyde for 20 min at room temperature, washed twice with phosphate buffered saline pH 7.4 and blocked with 2% bovine serum albumin in PBS for 1 hr. The plate was then washed three times and filled with horseradish peroxidase-conjugated antibody against phospho tyro sine (Upstate biotechnology #16-105, Temecula, CA) reconstituted in PBS with 2% bovine serum albumin per manufacturer's recommendation. After 3 hrs incubation at room temperature the plate was washed 4 times and 0.1 ml of TMB single solution substrate (Invitrogen, #00-2023, Carlbad, CA) was added to each well. Color development was stopped 5 min later by adding 0.05 ml 1 N HC1. Absorbance at 450 nm was measured on Titertek Multiscan MCC340 (ThermoFisher, Pittsburgh, PA). Absorbance vs. peptide concentration dose response curves were plotted and EC 50 values were determined by using Origin software (OriginLab, Northampton, MA).
EXAMPLE 5
Biosynthesis and Purification of Single Chain Insulin Analogs
An insulin-IGF-I minigene comprising a native insulin B and A chain linked via the IGF-I C chain (Β°-^-Α°) was cloned into expression vector pGAPZa A (purchased from Invitrogen) under GAP promoter (promoter of the glyceraldehyde-3-phosphate
dehydrogenase (GAPDH)) for constitutive expression and purification of recombinant protein in yeast Pichia pastoris. The minigene was fused to an N-terminal peptide encoding Saccharomyces cerevisiae a-mating factor leader signal for secretion of the recombinant protein into the medium. A Kex2 cleavage site between the minigene and the leading a- mating factor sequence was used to cleave the leader sequence for secretion of the minigene with native amino termini. Single-site alanine mutations were introduced into C peptide at positions 1 (G1A), 2 (Y2A), 3 (G3A), 4 (S4A), 5 (S5A), 6 (S6A), 7 (R7A), 8 (R8A), 10 (P10A), 11 (QUA), and 12 (T12A) of the BVA0 minigene.
The minigenes including B°C1A°, eleven alanine mutants, and other select derivatives were transformed into yeast Pichia pastoris by electroporation. Positive transformants were selected on minimal methanol plates and a genomic preparation of each Pichia isolate was performed and integration of the constructs into the yeast genome was confirmed by PCR. An 833 base pair PCR product was visualized on an agarose DNA gel. The insulin analogs were produced by fermentation of a corresponding yeast line. The yeast cells were pelleted by centrifugation at 5 K for 20 minutes in 500 ml Beckman centrifuge tubes and the media was kept for subsequent protein purification.
Growth media supernatants were filtered through 0.2 μιη Millipore filter. Acetonitrile (ACN) was added to the supernatant to a final volume of 20%. The supernatant was purified over a Amberlite XAD7HP resin from Sigma, pre-equilibrated with 20% aqueous ACN. The resin was then rinsed twice with 30 ml of 20% aqueous ACN and contaminants were removed with 30% aqueous ACN containing 0.1% TFA. Partially purified insulin analogs were eluted from the column with 54% aqueous ACN containing 0.1% TFA and
lyophilizied. Lyophilized samples were re-suspended in 0.025M NH3HCO3 pH 8 and purified on a Luna C18 column (10 μιη particle size, 300A° pore size). Protein was eluted from the column using a linear gradient of 20-60% aqueous ACN. MALDI-MS positive fractions were pooled and transferred to a disposable scintillation vial for subsequent lyophilization. Lyophilized samples were then resuspended in 20% aqueous ACN containing 0.1% TFA, and purified on a Luna C18 column (10 μιη particle size, 300A° pore size). The protein was eluted from the column using a linear gradient of 18-54% aqueous ACN with 0.1% TFA. Protein elution was monitored at an absorbance 280 nm. MALDI-TOF MS positive fractions were analyzed via a C8 analytical column to insure purity.
The B°-C1_A0 analog demonstrated potency that was equally effective at both insulin receptor isoforms and the IGF-1 receptor. Mutation of the tyrosine at position 2 to alanine or the shortening of the C-peptide to eight amino acids through deletion of C9-12 provided a selective enhancement in the specificity of insulin action by significant reduction in the IGF- 1 receptor activity. See also the data provided in Tables 5A and 5B: Table 5A
Insulin Binding & Phosphorylation Analysis
(B°C1A°)
Peptide Insulin Binding Insulin Phosphorylation
ICso, nM n EC50, nM n
Insulin 0.54±0.02 4 1.67±0.13 1
IGF-1 18.81 ±1.77 3 29.20±8.41 1
010 (B°C1A°) 2.83±0.52 2 1.93±0.43 1
G1A 1.21 +0.15 1 2.4±0.24 1
Y2A 1.95±0.28 3 1.86±0.42 1
G3A 1.41 ±0.05 2 2.13±0.02 1
S4A 0.84±0.47 2 0.76±0.35 1
S5A 0.93±0.44 ! 2.23±1.27 1
S6A 1.15±0.24 ! 2.33±1.65 2
R7A 6.04±0.82 5.21 ±4.14 1
R8A 0.63±0.09 ! 2.03±0.06 2
P10A 2.86±0.93 ! 2.59±1.2 1
Q11A 1.79±0.47 ! 2.58±0.83 1
T12A 1.2+0.18 1 2.83±1.31 1
Table 5B
IGF-1 Binding & Phosphorylation Analysis
(B C 1A C )
Figure imgf000100_0001
Position 2 and 3 in the C-peptide are most sensitive to modification at the IGF-1 receptor with the insulin receptor proving to be relatively immune to modification. All of the analogs maintained single unit nanomolar activity with certain specific analogs proving to be slightly enhanced in potency (low single unit nanomolar). The most insulin selective analogs were those that we missing the last four residues of the C-peptide, had an alanine mutation at position two of the C-peptide, or a combination of the two changes.
EXAMPLE 6
Insulin Tolerance Tests
Single administration of six- to eight-week-old male C57BL/6 mice were maintained at 23'C, constant humidity, and a 12-h light— dark cycle. The acute in vivo effects of select peptides were evaluated by subcutaneously injecting peptides solubilized in physiologically buffered saline or a vehicle control to normal or diabetic mice. Blood glucose was measured at various time points through the course of a 24hr period following administration of the peptides. Each group of mice contained 8 animals per group. The average body weight was 25 g. Mice were made diabetic by administration of strep tozotocin.
Serial administration
Repeat daily subcutaneous administration of the peptides or vehicle control was administered to the mice for periods of five days to two weeks. The obese mice were given a diabetogenic diet Mice had free access to water and were fed ad libitum with a high fat diet (HFD) comprising 58% of calories from fat (D 12331; Research Diets, New Brunswick, NJ) and each group of mice contained 8 animals per group. The average body weight was -50 g 20 and the mice were— 6 months old males. Body weight and food intake were measured on the days that peptide or vehicle control was administered to the mice. Fasting blood glucose levels were measured repeatedly.

Claims

Claims:
1. An insulin peptide/NHR ligand conjugate comprising the structure
Q L-Y;
wherein
Q is an insulin peptide;
Y is a NHR ligand; and
L is a linking group or a bond
wherein said conjugate has activity at both a nuclear hormone receptor and an insulin receptor, wherein said NHR ligand is
i) a steroid that exhibits an EC50 of about 1 μΜ or less when unconjugated to Q-L, and further has a molecular weight of up to about 1000 daltons; or
ii) a ligand that activates the thyroid hormone receptor; or
iii) a ligand that activates the peroxisome proliferator-activated receptors (PPAR)
wherein the nuclear hormone receptor ligand is covalently linked to the insulin peptide through the side chain of a lysine residue located at a position selected from the group consisting of B28 and B29, or through the N-terminal alpha amine of the A or B chain.
2. The conjugate of claim 1, wherein Y is selected from the group consisting of estradiol and derivatives thereof, estrone and derivatives thereof, testosterone and derivatives thereof, and Cortisol and derivatives thereof.
3. The conjugate of claim 2, wherein Y is dexamethasone.
4. The conjugate of claim 1 wherein Y is a compound having the general structure
Figure imgf000102_0001
wherein
Ri5 is C1-C4 alkyl, -CH2(pyridazinone), -CH2(OH)(phenyl)F, -CH(OH)CH3,
H;
R20 is halo, CH3 or H;
R21 is halo, CH3 or H;
R22 is H, OH, halo, -CH2(OH)(C6 aryl)F, or C1-C4 alkyl; and
R23 is -CH2CH(NH2)COOH, -OCH2COOH, -NHC(0)COOH, -CH2COOH,
-NHC(0)CH2COOH, -CH2CH2COOH, or -OCH2P03 2 .
5. The conjugate of claim 4 wherein
wherein
Ri5 is C1-C4 alkyl, -CH(OH)CH3, 1 or H
R20 is I, Br, CH3 or H;
R21 is I, Br, CH3 or H;
R22 is H, OH, I, or C1-C4 alkyl; and
R23 is -CH2CH(NH2)COOH, -OCH2COOH, -NHC(0)COOH, -CH2COOH,
-NHC(0)CH2COOH, -CH2CH2COOH, or -OCH2P03 2".
6. The conjugate of claim 4 wherein
Ri5 is Ci-C4 alkyl, l or H;
R20 is I, Br, CH3 or H;
R21 is I, Br, CH3 or H;
R22 is H, OH, I, or C1-C4 alkyl; and
R2 is -CH2CH(NH2)COOH, -OCH2COOH, -NHC(0)COOH, -CH2COOH,
-NHC(0)CH2COOH, -CH2CH2COOH, or -OCH2P03 2".
7. The conjugate of claim 1 wherein Y is a compound of the general structure of Formula I:
Figure imgf000103_0001
wherein
R20, R21, and R22 are independently selected from the group consisting of H, OH, halo and Ci-C4 alkyl; and
Ri5 is halo or H.
8. The conjugate of claim 1 wherein Y is selected from the group consisting of 3,5,3',5'-tetra-iodothyronine, and 3,5,3'-triiodo L-thyronine.
9. The conjugate of claim 1 wherein Y is selected from the group consisting of Tesaglitazar, Aleglitazar and thiazolidinediones, optionally wherein Y is Tesaglitazar.
10. The conjugate of any one of claims 1-9 wherein said insulin peptide (Q) comprises an A chain and a B chain wherein said A chain comprises a sequence
GIVX4X5CCX8X9XioCXi2LXi4Xi5LXi7Xi8YCX2i-R53 (SEQ ID NO: 19), and said B chain comprises a sequence R62-X25LCGX29X3oLVX33X34LYLVCGX4iX42GFX45 (SEQ ID NO: 20), wherein
X4 is glutamic acid or aspartic acid;
X5 is glutamine or glutamic acid
X8 is histidine, threonine or phenylalanine;
X9 is serine, arginine, lysine, ornithine or alanine;
X10 is isoleucine or serine;
X12 is serine or aspartic acid;
Xi4 is tyrosine, arginine, lysine, ornithine or alanine;
Xi5 is glutamine, glutamic acid, arginine, alanine, lysine, ornithine or leucine;
Xi7 is glutamic acid, aspartic acid, asparagine, lysine, ornithine or glutamine;
Xi8 is methionine, asparagine, glutamine, aspartic acid, glutamic acid or threonine;
X21 is selected from the group consisting of alanine, glycine, serine, valine, threonine, isoleucine, leucine, glutamine, glutamic acid, asparagine, aspartic acid, histidine, tryptophan, tyrosine, and methionine;
X25 is histidine or threonine;
X29 is selected from the group consisting of alanine, glycine and serine;
X30 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid; X33 is selected from the group consisting of aspartic acid and glutamic acid;
X34 is selected from the group consisting of alanine and threonine;
X4i is selected from the group consisting of glutamic acid, aspartic acid or asparagine;
X42 is selected from the group consisting of alanine, ornithine, lysine and arginine; X45 is tyrosine or phenylalanine;
R62 is selected from the group consisting of AYRPSE (SEQ ID NO: 14), FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptide glycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine, a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine, glutamine, glutamic acid and an N-terminal amine; and
R53 is COOH or CONH2.
11. The conjugate of claim 10 wherein said A chain comprises the sequence GIVEQCCX8X9lCSLYQLENYCX2i-R53 (SEQ ID NO: 73) said B chain comprises the sequence R62-X25LCGX29X3oLVX33X34LYLVCGX4iX42GFX45 (SEQ ID NO: 20), wherein
X8 is histidine or threonine;
X9 is serine, lysine, or alanine;
X21 is alanine, glycine or asparagine;
X25 is histidine or threonine;
X29 is selected from the group consisting of alanine, glycine and serine;
X3o is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
X33 is selected from the group consisting of aspartic acid and glutamic acid;
X34 is selected from the group consisting of alanine and threonine;
X4i is selected from the group consisting of glutamic acid, aspartic acid or asparagine;
X42 is selected from the group consisting of alanine, ornithine, lysine and arginine; X45 is tyrosine or phenylalanine;
R62 is selected from the group consisting of FVNQ (SEQ ID NO: 12), a tripeptide valine-asparagine-glutamine, a dipeptide asparagine-glutamine, glutamine and an N-terminal amine; and
R5 is COOH or CONH2.
12. The conjugate of any one of claims 1-9 wherein said insulin peptide comprises an A chain and a B chain wherein said A chain comprises a sequence
GIVDECCX8X9SCDLRRLEMX19CX21-R53 (SEQ ID NO: 74) and said B chain comprises a sequence Rei-XisLCGAXsoLVDALYLVCGDX^GFY (SEQ ID NO: 75), wherein
X8 is phenylalanine or histidine;
X9 is arginine, ornithine or alanine;
X19 is tyrosine, 4-methoxy-phenylalanine or 4-amino-phenylalanine;
X21 is alanine or asparagine;
X25 is histidine or threonine;
X30 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
X42 is selected from the group consisting of alanine ornithine and arginine; and R53 is COOH or CONH2;
R62 is selected from the group consisting of AYRPSE (SEQ ID NO: 14), FVNQ (SEQ ID NO: 12), PGPE (SEQ ID NO: 11), a tripeptide glycine-proline-glutamic acid, a tripeptide valine-asparagine-glutamine, a dipeptide proline-glutamic acid, a dipeptide asparagine-glutamine, glutamine, glutamic acid and an N-terminal amine; and
R53 is COOH or CONH2.
13. The insulin peptide of claim 12 wherein the B chain sequence comprises the sequence FVKQX25LCGSHLVEALYLVCGERGFF-R63 (SEQ ID NO: 147), or
FVNQX25LCGS HLVE ALYLVC GERGFF-R63 (SEQ ID NO: 148), wherein
X25 is selected from the group consisting of histidine and threonine; and
R63 is selected from the group consisting of YTX28KT (SEQ ID NO: 149), YTKPT (SEQ ID NO: 150), YTX28K (SEQ ID NO: 152), YTKP (SEQ ID NO: 151), YTPK (SEQ ID NO: 70), YTX2S, YT, Y and a bond, wherein X28 is proline, aspartic acid or glutamic acid.
14. The insulin peptide of claim 13 wherein the B chain sequence comprises the sequence FVKQX25LCGSHLVEALYLVCGERGFFYTEKT (SEQ ID NO: 162),
FVNQX25LCGSHLVEALYLVCGERGFFYTDKT (SEQ ID NO: 164),
FVNQX25LCGS HLVE ALYLVC GERGFF YTKPT (SEQ ID NO: 165) or
FVNQX25LCGS HLVE ALYLVC GERGFF YTPKT (SEQ ID NO: 161) wherein
X25 is selected from the group consisting of histidine and threonine.
15. The conjugate of claim 11 wherein said B chain comprises a sequence R62- X25LCGX29X3oLVX33X34LYLVCGX4iX42GFX45YT-Zi-Bi (SEQ ID NO: 142), wherein
X25 is histidine or threonine;
X29 is selected from the group consisting of alanine, glycine and serine;
X30 is selected from the group consisting of histidine, aspartic acid, glutamic acid, homocysteic acid and cysteic acid;
X33 is selected from the group consisting of aspartic acid and glutamic acid;
X34 is selected from the group consisting of alanine and threonine;
X4i is selected from the group consisting of glutamic acid, aspartic acid or asparagine;
X42 is selected from the group consisting of alanine, ornithine, lysine and arginine; X45 is tyrosine or phenylalanine;
Zi is a dipeptide selected from the group consisting of aspartate-lysine, lysine- proline, and proline-lysine; and
Bi is selected from the group consisting of threonine, alanine or a threonine-arginine- arginine tripeptide.
16. The conjugate of claim 10 wherein said A chain comprises a sequence GIVEQCCTSICSLYQLENYCN-R53 (SEQ ID NO: 1) and said B chain comprises a sequence FVNQHLCGSHLVEALYLVCGERGFFYTPKT (SEQ ID NO: 2), wherein R53 is COOH or CONH2.
17. The conjugate of claim 14 or 16 wherein Y is selected from the group consisting of 3,5,3',5'-tetra-iodothyronine, and 3, 5, 3 '-triiodo L-thyronine and Y is linked via the side chain of the lysine residue present at B28 or B29 of said B chain.
18. The conjugate of claim 17 wherein Y is 3,5,3'-triiodo L-thyronine and Y is linked via the side chain of the lysine residue present at B29 of said B chain.
19. The conjugate of any one of claims 10-18 wherein the insulin peptide is a single chain insulin and the peptide linking moiety joining the B and A chains is selected from the group consisting of SSSSKAPPPSLPSPSRLPGPSDTPILPQR (SEQ ID NO: 52), SSSSRAPPPSLPSPSRLPGPSDTPILPQK (SEQ ID NO: 51), GAGSSSX57X58 (SEQ ID NO: 76), GYGSSSX57X58 (SEQ ID NO: 21) and GYGSSSX57X58APQT; (SEQ ID NO: 77), wherein
X57 and X5s are independently arginine, lysine or ornithine.
20. The conjugate of claim 19 wherein the peptide linking moiety is selected from the group consisting of GYGSSSRR (SEQ ID NO: 18) and GAGSSSRR (SEQ ID NO: 22).
21. The conjugate of any one of claims 17-20 wherein the N-terminus of said B chain is pegylated.
22. The conjugate of any one of claims 1-21, wherein L is stable in vivo, hydrolyzable in vivo, or metastable in vivo.
23. The conjugate of claim 21, wherein L comprises an ether moiety, or an amide moiety, an ester moiety, an acid-labile moiety, a reduction-labile moiety , an enzyme- labile moiety, a hydrazone moiety, a disulfide moiety, or a cathepsin-cleavable moiety.
24. The conjugate of any one of claims 1-16 wherein Y-L is covalently linked to the insulin peptide (Q) through a position independently selected from the side chain of an amino acid at B28 or B29 of the B chain, the N-terminal alpha amine of the B chain, the N- terminal alpha amine of the A chain and at the side chain of an amino acid at any position of a linking moiety that links the A chain and B chain of a single chain insulin peptide.
25. The conjugate of claim 24 wherein Y-L is covalently linked to the insulin peptide (Q) through the side chain of an amino acid at B28 or B29 of the B chain.
26. A derivative of the conjugate of any one of the preceding claims further comprising the structure U-J, wherein
U is an amino acid or a hydroxy acid;
J is an N-alkylated amino acid linked to said conjugate through an amide bond between a carboxyl moiety of J and an amine of the conjugate, wherein the chemical cleavage half-life (ti/2) of U-J from the conjugate is at least about 1 hour to about 1 week in PBS under physiological conditions.
27. The conjugate of any one of claims 1-26 wherein a hydrophilic moiety is covalently linked at one or more positions corresponding to A14, A15, BO, B l, B IO, B22, B28, B29.
28. The conjugate of claim 27, wherein the hydrophilic moiety is a polyethylene glycol.
29. The conjugate of any one of the preceding claims, further comprising an acyl group or alkyl group covalently linked to an amino acid side chain of said conjugate.
30. The conjugate of claim 29 wherein said acyl group or alkyl group is covalently linked to one or more positions selected from A14, A15, BO, B l, B IO, B22, B28, B29 of the insulin peptide.
31. A pharmaceutical composition comprising a conjugate of any one of the preceding claims, or pharmaceutically acceptable salt thereof, and a pharmaceutically acceptable carrier.
32. A kit for administering an insulin peptide/incretin conjugate to a patient in need thereof, said kit comprising the pharmaceutical composition of claim 31; and a device for administering said composition to a patient.
33. Use of the conjugate of the composition of claim 31, or pharmaceutically acceptable salt thereof, for the treatment of diabetes.
34. A method of treating diabetes and optionally
i) lowering body weight;
ii) lowering triglycerides;
ii) lowering cholesterol levels; or
iv) any combination of i), ii) and iii);
said method comprising administering a conjugate of any one of embodiments 1-30 to a patient in need thereof.
PCT/US2017/035102 2016-06-02 2017-05-31 Insulin-nuclear hormone conjugates WO2017210242A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201662344674P 2016-06-02 2016-06-02
US62/344,674 2016-06-02

Publications (1)

Publication Number Publication Date
WO2017210242A1 true WO2017210242A1 (en) 2017-12-07

Family

ID=60477878

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2017/035102 WO2017210242A1 (en) 2016-06-02 2017-05-31 Insulin-nuclear hormone conjugates

Country Status (1)

Country Link
WO (1) WO2017210242A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030113305A1 (en) * 1998-06-02 2003-06-19 The University Of Washington Compositions and methods for treating diabetes
US20090304665A1 (en) * 2008-04-28 2009-12-10 Frost Gregory I Super fast-acting insulin compositions
US20150320871A1 (en) * 2010-05-13 2015-11-12 Indiana University Research And Technology Corporation Glucagon Superfamily Peptides Exhibiting Nuclear Hormone Receptor Activity
US20150353621A1 (en) * 2013-01-22 2015-12-10 Case Western Reserve University N-terminal truncated insulin analogues

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030113305A1 (en) * 1998-06-02 2003-06-19 The University Of Washington Compositions and methods for treating diabetes
US20090304665A1 (en) * 2008-04-28 2009-12-10 Frost Gregory I Super fast-acting insulin compositions
US20150320871A1 (en) * 2010-05-13 2015-11-12 Indiana University Research And Technology Corporation Glucagon Superfamily Peptides Exhibiting Nuclear Hormone Receptor Activity
US20150353621A1 (en) * 2013-01-22 2015-12-10 Case Western Reserve University N-terminal truncated insulin analogues

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
FINAN ET AL.: "Targeted estrogen delivery reverses the metabolic syndrome", NAT MED, vol. 18, no. 12, 11 November 2012 (2012-11-11), pages 1847 - 1856, XP002717086 *

Similar Documents

Publication Publication Date Title
US10696726B2 (en) Insulin-incretin conjugates
JP6121323B2 (en) Peptides of the glucagon superfamily exhibiting nuclear hormone receptor activity
CA2802485C (en) Single chain insulin agonists exhibiting high activity at the insulin receptor
JP6324315B2 (en) Glucagon superfamily of peptides exhibiting glucocorticoid receptor activity
TWI489992B (en) Amide based glucagon superfamily peptide prodrugs
CA2713348C (en) Ester-based peptide prodrugs
JP6050746B2 (en) Glucagon superfamily of peptides exhibiting G protein-coupled receptor activity
RU2580317C2 (en) Peptide pro-drugs of amide glucagon superfamily
CA2747195A1 (en) Dipeptide linked medicinal agents
US20130137849A1 (en) Dipeptide linked medicinal agents
US11351268B2 (en) FGF21 C-terminal peptide optimization
US20190290772A1 (en) Glucagon-like peptide-1-t3 conjugates
WO2017210242A1 (en) Insulin-nuclear hormone conjugates

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17807366

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17807366

Country of ref document: EP

Kind code of ref document: A1