WO2023201372A2 - Papb as a bimoiety-dependent thioether installation tool - Google Patents
Papb as a bimoiety-dependent thioether installation tool Download PDFInfo
- Publication number
- WO2023201372A2 WO2023201372A2 PCT/US2023/065825 US2023065825W WO2023201372A2 WO 2023201372 A2 WO2023201372 A2 WO 2023201372A2 US 2023065825 W US2023065825 W US 2023065825W WO 2023201372 A2 WO2023201372 A2 WO 2023201372A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- compound
- thioether
- formula
- structure represented
- amino acid
- Prior art date
Links
- 150000003568 thioethers Chemical class 0.000 title claims abstract description 306
- 238000009434 installation Methods 0.000 title description 6
- 230000001419 dependent effect Effects 0.000 title description 3
- 238000000034 method Methods 0.000 claims abstract description 300
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 250
- 150000001875 compounds Chemical class 0.000 claims abstract description 227
- 230000001225 therapeutic effect Effects 0.000 claims abstract description 14
- -1 C(O)NHOH Inorganic materials 0.000 claims description 124
- 229940024606 amino acid Drugs 0.000 claims description 116
- 235000001014 amino acid Nutrition 0.000 claims description 112
- 150000001413 amino acids Chemical class 0.000 claims description 100
- 229910052739 hydrogen Inorganic materials 0.000 claims description 86
- 241001442654 Percnon planissimum Species 0.000 claims description 85
- 239000001257 hydrogen Substances 0.000 claims description 85
- 125000000539 amino acid group Chemical group 0.000 claims description 83
- 125000000217 alkyl group Chemical group 0.000 claims description 73
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 claims description 66
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 claims description 49
- 108010076818 TEV protease Proteins 0.000 claims description 48
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 claims description 46
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 claims description 46
- 239000003638 chemical reducing agent Substances 0.000 claims description 44
- CKLJMWTZIZZHCS-REOHCLBHSA-N aspartic acid group Chemical group N[C@@H](CC(=O)O)C(=O)O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 claims description 42
- 108091005804 Peptidases Proteins 0.000 claims description 41
- 239000004365 Protease Substances 0.000 claims description 41
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 claims description 40
- 125000004435 hydrogen atom Chemical group [H]* 0.000 claims description 40
- 239000000758 substrate Substances 0.000 claims description 39
- 229910052717 sulfur Inorganic materials 0.000 claims description 37
- 229960004441 tyrosine Drugs 0.000 claims description 36
- 150000008574 D-amino acids Chemical group 0.000 claims description 34
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims description 32
- 125000004429 atom Chemical group 0.000 claims description 31
- 150000001576 beta-amino acids Chemical group 0.000 claims description 31
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 claims description 30
- 229960003767 alanine Drugs 0.000 claims description 29
- 125000000623 heterocyclic group Chemical group 0.000 claims description 28
- 235000004279 alanine Nutrition 0.000 claims description 27
- 229960001153 serine Drugs 0.000 claims description 27
- 229960002433 cysteine Drugs 0.000 claims description 26
- 235000018417 cysteine Nutrition 0.000 claims description 26
- 239000004471 Glycine Substances 0.000 claims description 24
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 claims description 24
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 claims description 23
- 150000003839 salts Chemical class 0.000 claims description 23
- 239000004475 Arginine Substances 0.000 claims description 22
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 claims description 22
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 claims description 22
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 claims description 22
- 229960003121 arginine Drugs 0.000 claims description 22
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 claims description 22
- 229930182817 methionine Natural products 0.000 claims description 22
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 claims description 21
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 claims description 21
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 claims description 21
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 claims description 21
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 claims description 21
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 claims description 21
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 claims description 21
- 239000004472 Lysine Substances 0.000 claims description 21
- ACFIXJIJDZMPPO-NNYOXOHSSA-N NADPH Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](OP(O)(O)=O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 ACFIXJIJDZMPPO-NNYOXOHSSA-N 0.000 claims description 21
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims description 21
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 claims description 21
- 229960001230 asparagine Drugs 0.000 claims description 21
- 235000009582 asparagine Nutrition 0.000 claims description 21
- 229960003136 leucine Drugs 0.000 claims description 21
- 108010057366 Flavodoxin Proteins 0.000 claims description 20
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 claims description 20
- GRWZHXKQBITJKP-UHFFFAOYSA-L dithionite(2-) Chemical compound [O-]S(=O)S([O-])=O GRWZHXKQBITJKP-UHFFFAOYSA-L 0.000 claims description 20
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 claims description 19
- 108010087911 flavodoxin NADPH oxidoreductase Proteins 0.000 claims description 19
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 claims description 19
- 229960005190 phenylalanine Drugs 0.000 claims description 19
- 235000008729 phenylalanine Nutrition 0.000 claims description 19
- MSYNCHLYGJCFFY-UHFFFAOYSA-B 2-hydroxypropane-1,2,3-tricarboxylate;titanium(4+) Chemical compound [Ti+4].[Ti+4].[Ti+4].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O.[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O.[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O.[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O MSYNCHLYGJCFFY-UHFFFAOYSA-B 0.000 claims description 18
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 claims description 18
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 claims description 18
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 claims description 18
- 239000004474 valine Substances 0.000 claims description 18
- 229960004295 valine Drugs 0.000 claims description 18
- 125000004178 (C1-C4) alkyl group Chemical group 0.000 claims description 17
- DEQANNDTNATYII-OULOTJBUSA-N (4r,7s,10s,13r,16s,19r)-10-(4-aminobutyl)-19-[[(2r)-2-amino-3-phenylpropanoyl]amino]-16-benzyl-n-[(2r,3r)-1,3-dihydroxybutan-2-yl]-7-[(1r)-1-hydroxyethyl]-13-(1h-indol-3-ylmethyl)-6,9,12,15,18-pentaoxo-1,2-dithia-5,8,11,14,17-pentazacycloicosane-4-carboxa Chemical compound C([C@@H](N)C(=O)N[C@H]1CSSC[C@H](NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](CC=2C3=CC=CC=C3NC=2)NC(=O)[C@H](CC=2C=CC=CC=2)NC1=O)C(=O)N[C@H](CO)[C@H](O)C)C1=CC=CC=C1 DEQANNDTNATYII-OULOTJBUSA-N 0.000 claims description 16
- 229910006074 SO2NH2 Inorganic materials 0.000 claims description 16
- 229910006069 SO3H Inorganic materials 0.000 claims description 16
- 229910052711 selenium Inorganic materials 0.000 claims description 15
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 14
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 claims description 14
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 claims description 14
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 claims description 14
- 239000004473 Threonine Substances 0.000 claims description 14
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 claims description 14
- 229960000310 isoleucine Drugs 0.000 claims description 14
- 125000000852 azido group Chemical group *N=[N+]=[N-] 0.000 claims description 10
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 claims description 10
- 108010016076 Octreotide Proteins 0.000 claims description 9
- 229960002700 octreotide Drugs 0.000 claims description 9
- 125000006730 (C2-C5) alkynyl group Chemical group 0.000 claims description 8
- 125000002947 alkylene group Chemical group 0.000 claims description 6
- LJRDOKAZOAKLDU-UDXJMMFXSA-N (2s,3s,4r,5r,6r)-5-amino-2-(aminomethyl)-6-[(2r,3s,4r,5s)-5-[(1r,2r,3s,5r,6s)-3,5-diamino-2-[(2s,3r,4r,5s,6r)-3-amino-4,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy-6-hydroxycyclohexyl]oxy-4-hydroxy-2-(hydroxymethyl)oxolan-3-yl]oxyoxane-3,4-diol;sulfuric ac Chemical group OS(O)(=O)=O.N[C@@H]1[C@@H](O)[C@H](O)[C@H](CN)O[C@@H]1O[C@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](N)C[C@@H](N)[C@@H]2O)O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)N)O[C@@H]1CO LJRDOKAZOAKLDU-UDXJMMFXSA-N 0.000 claims description 5
- UHDMAEPGMOIEHH-UHFFFAOYSA-N 2-amino-3-(2h-tetrazol-5-yl)propanoic acid Chemical group OC(=O)C(N)CC=1N=NNN=1 UHDMAEPGMOIEHH-UHFFFAOYSA-N 0.000 claims description 5
- NNIVFXCKHNRSRL-VKHMYHEASA-N Hydroxyglutamic acid Chemical group ON[C@H](C(O)=O)CCC(O)=O NNIVFXCKHNRSRL-VKHMYHEASA-N 0.000 claims description 5
- FFFHZYDWPBMWHY-VKHMYHEASA-N L-homocysteine Chemical group OC(=O)[C@@H](N)CCS FFFHZYDWPBMWHY-VKHMYHEASA-N 0.000 claims description 5
- 108010072543 bremelanotide Proteins 0.000 claims description 5
- FFHBJDQSGDNCIV-MFVUMRCOSA-N bremelanotide Chemical compound C([C@@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=2C3=CC=CC=C3NC=2)C(=O)N[C@@H](CCCCNC(=O)C[C@@H](C(N[C@@H](CC=2NC=NC=2)C(=O)N1)=O)NC(=O)[C@@H](NC(C)=O)CCCC)C(O)=O)C1=CC=CC=C1 FFHBJDQSGDNCIV-MFVUMRCOSA-N 0.000 claims description 5
- 229950000740 bremelanotide Drugs 0.000 claims description 5
- 239000003937 drug carrier Substances 0.000 claims description 4
- 239000008194 pharmaceutical composition Substances 0.000 claims description 3
- 125000001554 selenocysteine group Chemical group [H][Se]C([H])([H])C(N([H])[H])C(=O)O* 0.000 claims description 3
- HDHDTKMUACZDAA-PHNIDTBTSA-N (4r,7s,10s,13r,16s,19r,22r)-22-[[(2s)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-13-benzyl-10-[3-(diaminomethylideneamino)propyl]-16-(1h-imidazol-5-ylmethyl)-7-(1h-indol-3-ylmethyl)-19-methyl-6,9,12,15,18,21-hexaoxo-1,2-dithia-5,8,11,14,17,20 Chemical compound C([C@@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=2C3=CC=CC=C3NC=2)C(=O)N[C@@H](CSSC[C@@H](C(=O)N[C@@H](C(N[C@@H](CC=2N=CNC=2)C(=O)N1)=O)C)NC(=O)[C@H](CCCNC(N)=N)NC(C)=O)C(N)=O)C1=CC=CC=C1 HDHDTKMUACZDAA-PHNIDTBTSA-N 0.000 claims description 2
- PMATZTZNYRCHOR-CGLBZJNRSA-N Cyclosporin A Chemical compound CC[C@@H]1NC(=O)[C@H]([C@H](O)[C@H](C)C\C=C\C)N(C)C(=O)[C@H](C(C)C)N(C)C(=O)[C@H](CC(C)C)N(C)C(=O)[C@H](CC(C)C)N(C)C(=O)[C@@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)N(C)C(=O)[C@H](C(C)C)NC(=O)[C@H](CC(C)C)N(C)C(=O)CN(C)C1=O PMATZTZNYRCHOR-CGLBZJNRSA-N 0.000 claims description 2
- 229930105110 Cyclosporin A Natural products 0.000 claims description 2
- 108010036949 Cyclosporine Proteins 0.000 claims description 2
- 101800000989 Oxytocin Proteins 0.000 claims description 2
- XNOPRXBHLZRZKH-UHFFFAOYSA-N Oxytocin Natural products N1C(=O)C(N)CSSCC(C(=O)N2C(CCC2)C(=O)NC(CC(C)C)C(=O)NCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(CCC(N)=O)NC(=O)C(C(C)CC)NC(=O)C1CC1=CC=C(O)C=C1 XNOPRXBHLZRZKH-UHFFFAOYSA-N 0.000 claims description 2
- 229960001265 ciclosporin Drugs 0.000 claims description 2
- 229930182912 cyclosporin Natural products 0.000 claims description 2
- XNOPRXBHLZRZKH-DSZYJQQASA-N oxytocin Chemical compound C([C@H]1C(=O)N[C@H](C(N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CSSC[C@H](N)C(=O)N1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(N)=O)=O)[C@@H](C)CC)C1=CC=C(O)C=C1 XNOPRXBHLZRZKH-DSZYJQQASA-N 0.000 claims description 2
- 229960001723 oxytocin Drugs 0.000 claims description 2
- 108010029667 pramlintide Proteins 0.000 claims description 2
- 229960003611 pramlintide Drugs 0.000 claims description 2
- NRKVKVQDUCJPIZ-MKAGXXMWSA-N pramlintide acetate Chemical compound C([C@@H](C(=O)NCC(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCCN)[C@@H](C)O)[C@@H](C)O)[C@@H](C)O)C(C)C)C1=CC=CC=C1 NRKVKVQDUCJPIZ-MKAGXXMWSA-N 0.000 claims description 2
- OHRURASPPZQGQM-GCCNXGTGSA-N romidepsin Chemical compound O1C(=O)[C@H](C(C)C)NC(=O)C(=C/C)/NC(=O)[C@H]2CSSCC\C=C\[C@@H]1CC(=O)N[C@H](C(C)C)C(=O)N2 OHRURASPPZQGQM-GCCNXGTGSA-N 0.000 claims description 2
- 229960003452 romidepsin Drugs 0.000 claims description 2
- OHRURASPPZQGQM-UHFFFAOYSA-N romidepsin Natural products O1C(=O)C(C(C)C)NC(=O)C(=CC)NC(=O)C2CSSCCC=CC1CC(=O)NC(C(C)C)C(=O)N2 OHRURASPPZQGQM-UHFFFAOYSA-N 0.000 claims description 2
- 108010091666 romidepsin Proteins 0.000 claims description 2
- 229950001912 setmelanotide Drugs 0.000 claims description 2
- 108700030852 setmelanotide Proteins 0.000 claims description 2
- 150000002431 hydrogen Chemical class 0.000 claims 2
- 102100031951 Oxytocin-neurophysin 1 Human genes 0.000 claims 1
- 125000004397 aminosulfonyl group Chemical group NS(=O)(=O)* 0.000 claims 1
- 125000001476 phosphono group Chemical group [H]OP(*)(=O)O[H] 0.000 claims 1
- 125000000020 sulfo group Chemical group O=S(=O)([*])O[H] 0.000 claims 1
- 230000000670 limiting effect Effects 0.000 abstract description 7
- 238000001819 mass spectrum Methods 0.000 description 83
- 230000015572 biosynthetic process Effects 0.000 description 65
- 238000004132 cross linking Methods 0.000 description 64
- 238000006243 chemical reaction Methods 0.000 description 59
- 239000000047 product Substances 0.000 description 59
- 239000000203 mixture Substances 0.000 description 49
- 125000003118 aryl group Chemical group 0.000 description 47
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 43
- 229910052799 carbon Inorganic materials 0.000 description 43
- 239000012634 fragment Substances 0.000 description 43
- 102000004196 processed proteins & peptides Human genes 0.000 description 43
- 125000004432 carbon atom Chemical group C* 0.000 description 40
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 39
- 102000004190 Enzymes Human genes 0.000 description 35
- 108090000790 Enzymes Proteins 0.000 description 35
- 229940088598 enzyme Drugs 0.000 description 35
- 125000001424 substituent group Chemical group 0.000 description 34
- 125000000753 cycloalkyl group Chemical group 0.000 description 33
- 108090000623 proteins and genes Proteins 0.000 description 33
- 108091005601 modified peptides Proteins 0.000 description 32
- 239000003814 drug Substances 0.000 description 30
- 125000005842 heteroatom Chemical group 0.000 description 30
- 230000004048 modification Effects 0.000 description 30
- 238000012986 modification Methods 0.000 description 30
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 29
- 210000004899 c-terminal region Anatomy 0.000 description 29
- 102000004169 proteins and genes Human genes 0.000 description 29
- 150000003254 radicals Chemical class 0.000 description 29
- 101000728229 Asticcacaulis excentricus (strain ATCC 15261 / DSM 4724 / KCTC 12464 / NCIMB 9791 / VKM B-1370 / CB 48) Astexin-1 Proteins 0.000 description 28
- 101000728234 Asticcacaulis excentricus (strain ATCC 15261 / DSM 4724 / KCTC 12464 / NCIMB 9791 / VKM B-1370 / CB 48) Astexin-2 Proteins 0.000 description 28
- 101000728232 Asticcacaulis excentricus (strain ATCC 15261 / DSM 4724 / KCTC 12464 / NCIMB 9791 / VKM B-1370 / CB 48) Astexin-3 Proteins 0.000 description 28
- 101000761079 Burkholderia thailandensis (strain ATCC 700388 / DSM 13276 / CIP 106301 / E264) Capistruin Proteins 0.000 description 28
- 101001056191 Escherichia coli Microcin J25 Proteins 0.000 description 28
- 101001138028 Rhodococcus jostii Lariatin Proteins 0.000 description 28
- 235000018102 proteins Nutrition 0.000 description 28
- 238000007792 addition Methods 0.000 description 27
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 27
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 26
- 229910052757 nitrogen Inorganic materials 0.000 description 26
- 125000001072 heteroaryl group Chemical group 0.000 description 25
- 239000000126 substance Substances 0.000 description 25
- 125000000392 cycloalkenyl group Chemical group 0.000 description 24
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 24
- 150000007942 carboxylates Chemical class 0.000 description 23
- 238000004885 tandem mass spectrometry Methods 0.000 description 23
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 22
- YMWUJEATGCHHMB-UHFFFAOYSA-N Dichloromethane Chemical compound ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 description 21
- 125000003545 alkoxy group Chemical group 0.000 description 21
- 229910052805 deuterium Inorganic materials 0.000 description 21
- 230000000694 effects Effects 0.000 description 21
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 20
- 125000003342 alkenyl group Chemical group 0.000 description 20
- 210000004027 cell Anatomy 0.000 description 20
- 150000004820 halides Chemical class 0.000 description 20
- HEMHJVSKTPXQMS-UHFFFAOYSA-M sodium hydroxide Inorganic materials [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 20
- XEEYBQQBJWHFJM-UHFFFAOYSA-N Iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 19
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 19
- 125000000304 alkynyl group Chemical group 0.000 description 19
- 238000003776 cleavage reaction Methods 0.000 description 19
- JDNTWHVOXJZDSN-UHFFFAOYSA-N iodoacetic acid Chemical compound OC(=O)CI JDNTWHVOXJZDSN-UHFFFAOYSA-N 0.000 description 19
- 229910052760 oxygen Inorganic materials 0.000 description 19
- YZCKVEUIGOORGS-OUBTZVSYSA-N Deuterium Chemical compound [2H] YZCKVEUIGOORGS-OUBTZVSYSA-N 0.000 description 18
- MEFKEPWMEQBLKI-AIRLBKTGSA-N S-adenosyl-L-methioninate Chemical compound O[C@@H]1[C@H](O)[C@@H](C[S+](CC[C@H](N)C([O-])=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 MEFKEPWMEQBLKI-AIRLBKTGSA-N 0.000 description 18
- 239000011347 resin Substances 0.000 description 18
- 229920005989 resin Polymers 0.000 description 18
- 239000011593 sulfur Substances 0.000 description 18
- 125000000570 L-alpha-aspartyl group Chemical group [H]OC(=O)C([H])([H])[C@]([H])(N([H])[H])C(*)=O 0.000 description 17
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 17
- 239000000872 buffer Substances 0.000 description 17
- 201000010099 disease Diseases 0.000 description 17
- 229960004452 methionine Drugs 0.000 description 17
- 239000001301 oxygen Substances 0.000 description 17
- 230000007017 scission Effects 0.000 description 17
- 239000000243 solution Substances 0.000 description 17
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 17
- 238000004458 analytical method Methods 0.000 description 16
- SLXKOJJOQWFEFD-UHFFFAOYSA-N 6-aminohexanoic acid Chemical compound NCCCCCC(O)=O SLXKOJJOQWFEFD-UHFFFAOYSA-N 0.000 description 15
- 229960001570 ademetionine Drugs 0.000 description 15
- 239000000543 intermediate Substances 0.000 description 15
- 150000003536 tetrazoles Chemical group 0.000 description 15
- 238000010306 acid treatment Methods 0.000 description 14
- 125000001931 aliphatic group Chemical group 0.000 description 14
- 238000003556 assay Methods 0.000 description 14
- 150000003573 thiols Chemical class 0.000 description 14
- 229940079593 drug Drugs 0.000 description 13
- 238000013467 fragmentation Methods 0.000 description 13
- 238000006062 fragmentation reaction Methods 0.000 description 13
- 238000004949 mass spectrometry Methods 0.000 description 13
- 229920006395 saturated elastomer Polymers 0.000 description 13
- DTQVDTLACAAQTR-UHFFFAOYSA-N trifluoroacetic acid Substances OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 13
- FBTSQILOGYXGMD-LURJTMIESA-N 3-nitro-L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C([N+]([O-])=O)=C1 FBTSQILOGYXGMD-LURJTMIESA-N 0.000 description 12
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 12
- 241000723792 Tobacco etch virus Species 0.000 description 12
- 150000001732 carboxylic acid derivatives Chemical class 0.000 description 12
- 239000000463 material Substances 0.000 description 12
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 12
- BZQFBWGGLXLEPQ-REOHCLBHSA-N phosphoserine Chemical compound OC(=O)[C@@H](N)COP(O)(O)=O BZQFBWGGLXLEPQ-REOHCLBHSA-N 0.000 description 12
- 238000000926 separation method Methods 0.000 description 12
- 238000003786 synthesis reaction Methods 0.000 description 12
- HBAQYPYDRFILMT-UHFFFAOYSA-N 8-[3-(1-cyclopropylpyrazol-4-yl)-1H-pyrazolo[4,3-d]pyrimidin-5-yl]-3-methyl-3,8-diazabicyclo[3.2.1]octan-2-one Chemical class C1(CC1)N1N=CC(=C1)C1=NNC2=C1N=C(N=C2)N1C2C(N(CC1CC2)C)=O HBAQYPYDRFILMT-UHFFFAOYSA-N 0.000 description 11
- 108010076504 Protein Sorting Signals Proteins 0.000 description 11
- 229910052736 halogen Inorganic materials 0.000 description 11
- 150000002367 halogens Chemical class 0.000 description 11
- 229940002612 prodrug Drugs 0.000 description 11
- 239000000651 prodrug Substances 0.000 description 11
- 238000001228 spectrum Methods 0.000 description 11
- 238000001360 collision-induced dissociation Methods 0.000 description 10
- 125000004122 cyclic group Chemical group 0.000 description 10
- 150000002148 esters Chemical class 0.000 description 10
- 229960002449 glycine Drugs 0.000 description 10
- 230000003993 interaction Effects 0.000 description 10
- 239000012160 loading buffer Substances 0.000 description 10
- 229960003646 lysine Drugs 0.000 description 10
- 125000001570 methylene group Chemical group [H]C([H])([*:1])[*:2] 0.000 description 10
- 239000012453 solvate Substances 0.000 description 10
- 230000009466 transformation Effects 0.000 description 10
- 230000007306 turnover Effects 0.000 description 10
- SJRJJKPEHAURKC-UHFFFAOYSA-N N-Methylmorpholine Chemical compound CN1CCOCC1 SJRJJKPEHAURKC-UHFFFAOYSA-N 0.000 description 9
- 235000003704 aspartic acid Nutrition 0.000 description 9
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 9
- 239000003795 chemical substances by application Substances 0.000 description 9
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 9
- 235000013922 glutamic acid Nutrition 0.000 description 9
- 239000004220 glutamic acid Substances 0.000 description 9
- 239000003112 inhibitor Substances 0.000 description 9
- JGFZNNIVVJXRND-UHFFFAOYSA-N N,N-Diisopropylethylamine (DIPEA) Chemical compound CCN(C(C)C)C(C)C JGFZNNIVVJXRND-UHFFFAOYSA-N 0.000 description 8
- 229920001090 Polyaminopropyl biguanide Polymers 0.000 description 8
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 8
- 239000000460 chlorine Substances 0.000 description 8
- 238000002474 experimental method Methods 0.000 description 8
- 230000006870 function Effects 0.000 description 8
- 238000004128 high performance liquid chromatography Methods 0.000 description 8
- 229910052742 iron Inorganic materials 0.000 description 8
- 239000007788 liquid Substances 0.000 description 8
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 8
- 238000002360 preparation method Methods 0.000 description 8
- 239000002904 solvent Substances 0.000 description 8
- 238000006467 substitution reaction Methods 0.000 description 8
- 229960004799 tryptophan Drugs 0.000 description 8
- 208000035475 disorder Diseases 0.000 description 7
- 239000002245 particle Substances 0.000 description 7
- 230000008569 process Effects 0.000 description 7
- 238000005406 washing Methods 0.000 description 7
- VFCYASLQFGXHTQ-VIFPVBQESA-N (2s)-2-[benzyl(nitro)amino]-3-hydroxypropanoic acid Chemical compound OC[C@@H](C(O)=O)N([N+]([O-])=O)CC1=CC=CC=C1 VFCYASLQFGXHTQ-VIFPVBQESA-N 0.000 description 6
- YYTDJPUFAVPHQA-VKHMYHEASA-N (2s)-2-amino-3-(2,3,4,5,6-pentafluorophenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=C(F)C(F)=C(F)C(F)=C1F YYTDJPUFAVPHQA-VKHMYHEASA-N 0.000 description 6
- KQZQZOQFUQXPLW-YFKPBYRVSA-N (2s)-2-amino-3-(3,4,5-trihydroxyphenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC(O)=C(O)C(O)=C1 KQZQZOQFUQXPLW-YFKPBYRVSA-N 0.000 description 6
- POGSZHUEECCEAP-ZETCQYMHSA-N (2s)-2-amino-3-(3-amino-4-hydroxyphenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C(N)=C1 POGSZHUEECCEAP-ZETCQYMHSA-N 0.000 description 6
- PEMUHKUIQHFMTH-QMMMGPOBSA-N (2s)-2-amino-3-(4-bromophenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(Br)C=C1 PEMUHKUIQHFMTH-QMMMGPOBSA-N 0.000 description 6
- JSXMFBNJRFXRCX-NSHDSACASA-N (2s)-2-amino-3-(4-prop-2-ynoxyphenyl)propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(OCC#C)C=C1 JSXMFBNJRFXRCX-NSHDSACASA-N 0.000 description 6
- LJHYWUVYIKCPGU-VIFPVBQESA-N (2s)-2-amino-3-[4-(carboxymethyl)phenyl]propanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=C(CC(O)=O)C=C1 LJHYWUVYIKCPGU-VIFPVBQESA-N 0.000 description 6
- NLFOHNAFILVHGM-AWEZNQCLSA-N (2s)-2-amino-3-[4-[(2-nitrophenyl)methoxy]phenyl]propanoic acid Chemical compound C1=CC(C[C@H](N)C(O)=O)=CC=C1OCC1=CC=CC=C1[N+]([O-])=O NLFOHNAFILVHGM-AWEZNQCLSA-N 0.000 description 6
- NEMHIKRLROONTL-QMMMGPOBSA-N (2s)-2-azaniumyl-3-(4-azidophenyl)propanoate Chemical compound OC(=O)[C@@H](N)CC1=CC=C(N=[N+]=[N-])C=C1 NEMHIKRLROONTL-QMMMGPOBSA-N 0.000 description 6
- GTVVZTAFGPQSPC-QMMMGPOBSA-N (2s)-2-azaniumyl-3-(4-nitrophenyl)propanoate Chemical compound OC(=O)[C@@H](N)CC1=CC=C([N+]([O-])=O)C=C1 GTVVZTAFGPQSPC-QMMMGPOBSA-N 0.000 description 6
- ZXSBHXZKWRIEIA-JTQLQIEISA-N (2s)-3-(4-acetylphenyl)-2-azaniumylpropanoate Chemical compound CC(=O)C1=CC=C(C[C@H](N)C(O)=O)C=C1 ZXSBHXZKWRIEIA-JTQLQIEISA-N 0.000 description 6
- IHPYMWDTONKSCO-UHFFFAOYSA-N 2,2'-piperazine-1,4-diylbisethanesulfonic acid Chemical compound OS(=O)(=O)CCN1CCN(CCS(O)(=O)=O)CC1 IHPYMWDTONKSCO-UHFFFAOYSA-N 0.000 description 6
- JZRBSTONIYRNRI-VIFPVBQESA-N 3-methylphenylalanine Chemical compound CC1=CC=CC(C[C@H](N)C(O)=O)=C1 JZRBSTONIYRNRI-VIFPVBQESA-N 0.000 description 6
- IRZQDMYEJPNDEN-UHFFFAOYSA-N 3-phenyl-2-aminobutanoic acid Natural products OC(=O)C(N)C(C)C1=CC=CC=C1 IRZQDMYEJPNDEN-UHFFFAOYSA-N 0.000 description 6
- CMUHFUGDYMFHEI-QMMMGPOBSA-N 4-amino-L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(N)C=C1 CMUHFUGDYMFHEI-QMMMGPOBSA-N 0.000 description 6
- PZNQZSRPDOEBMS-QMMMGPOBSA-N 4-iodo-L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(I)C=C1 PZNQZSRPDOEBMS-QMMMGPOBSA-N 0.000 description 6
- GTVVZTAFGPQSPC-UHFFFAOYSA-N 4-nitrophenylalanine Chemical compound OC(=O)C(N)CC1=CC=C([N+]([O-])=O)C=C1 GTVVZTAFGPQSPC-UHFFFAOYSA-N 0.000 description 6
- CJIJXIFQYOPWTF-UHFFFAOYSA-N 7-hydroxycoumarin Natural products O1C(=O)C=CC2=CC(O)=CC=C21 CJIJXIFQYOPWTF-UHFFFAOYSA-N 0.000 description 6
- QNAYBMKLOCPYGJ-UHFFFAOYSA-N D-alpha-Ala Chemical group CC([NH3+])C([O-])=O QNAYBMKLOCPYGJ-UHFFFAOYSA-N 0.000 description 6
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 6
- WTDRDQBEARUVNC-LURJTMIESA-N L-DOPA Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C(O)=C1 WTDRDQBEARUVNC-LURJTMIESA-N 0.000 description 6
- WTDRDQBEARUVNC-UHFFFAOYSA-N L-Dopa Natural products OC(=O)C(N)CC1=CC=C(O)C(O)=C1 WTDRDQBEARUVNC-UHFFFAOYSA-N 0.000 description 6
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 6
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 6
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 6
- GEYBMYRBIABFTA-VIFPVBQESA-N O-methyl-L-tyrosine Chemical compound COC1=CC=C(C[C@H](N)C(O)=O)C=C1 GEYBMYRBIABFTA-VIFPVBQESA-N 0.000 description 6
- 239000007990 PIPES buffer Substances 0.000 description 6
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 6
- NQRYJNQNLNOLGT-UHFFFAOYSA-N Piperidine Chemical compound C1CCNCC1 NQRYJNQNLNOLGT-UHFFFAOYSA-N 0.000 description 6
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 6
- 102000016812 Radical SAM Human genes 0.000 description 6
- 108050006523 Radical SAM Proteins 0.000 description 6
- 150000001299 aldehydes Chemical class 0.000 description 6
- 150000001540 azides Chemical class 0.000 description 6
- 125000002618 bicyclic heterocycle group Chemical group 0.000 description 6
- JCZLABDVDPYLRZ-AWEZNQCLSA-N biphenylalanine Chemical compound C1=CC(C[C@H](N)C(O)=O)=CC=C1C1=CC=CC=C1 JCZLABDVDPYLRZ-AWEZNQCLSA-N 0.000 description 6
- 239000011203 carbon fibre reinforced carbon Substances 0.000 description 6
- 239000003153 chemical reaction reagent Substances 0.000 description 6
- 229960005091 chloramphenicol Drugs 0.000 description 6
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 6
- 238000013461 design Methods 0.000 description 6
- 229950006137 dexfosfoserine Drugs 0.000 description 6
- 238000006911 enzymatic reaction Methods 0.000 description 6
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 6
- 125000000592 heterocycloalkyl group Chemical group 0.000 description 6
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 6
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 6
- 238000001727 in vivo Methods 0.000 description 6
- 150000002500 ions Chemical class 0.000 description 6
- 230000000155 isotopic effect Effects 0.000 description 6
- 238000004519 manufacturing process Methods 0.000 description 6
- 229930014626 natural product Natural products 0.000 description 6
- 150000002894 organic compounds Chemical class 0.000 description 6
- TVIDEEHSOPHZBR-AWEZNQCLSA-N para-(benzoyl)-phenylalanine Chemical compound C1=CC(C[C@H](N)C(O)=O)=CC=C1C(=O)C1=CC=CC=C1 TVIDEEHSOPHZBR-AWEZNQCLSA-N 0.000 description 6
- 230000001575 pathological effect Effects 0.000 description 6
- 239000008188 pellet Substances 0.000 description 6
- 239000000816 peptidomimetic Substances 0.000 description 6
- DCWXELXMIBXGTH-QMMMGPOBSA-N phosphonotyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(OP(O)(O)=O)C=C1 DCWXELXMIBXGTH-QMMMGPOBSA-N 0.000 description 6
- 229910052698 phosphorus Inorganic materials 0.000 description 6
- 125000006239 protecting group Chemical class 0.000 description 6
- 238000000746 purification Methods 0.000 description 6
- JPZXHKDZASGCLU-LBPRGKRZSA-N β-(2-naphthyl)-alanine Chemical compound C1=CC=CC2=CC(C[C@H](N)C(O)=O)=CC=C21 JPZXHKDZASGCLU-LBPRGKRZSA-N 0.000 description 6
- 101001027074 Azotobacter chroococcum mcd 1 Flavodoxin A Proteins 0.000 description 5
- QNAYBMKLOCPYGJ-UWTATZPHSA-N D-alanine Chemical group C[C@@H](N)C(O)=O QNAYBMKLOCPYGJ-UWTATZPHSA-N 0.000 description 5
- 229930182832 D-phenylalanine Natural products 0.000 description 5
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 5
- JUJWROOIHBZHMG-UHFFFAOYSA-N Pyridine Chemical compound C1=CC=NC=C1 JUJWROOIHBZHMG-UHFFFAOYSA-N 0.000 description 5
- 230000004913 activation Effects 0.000 description 5
- 230000008901 benefit Effects 0.000 description 5
- 125000001797 benzyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])* 0.000 description 5
- 238000006664 bond formation reaction Methods 0.000 description 5
- 150000001721 carbon Chemical group 0.000 description 5
- BVKZGUZCCUSVTD-UHFFFAOYSA-N carbonic acid Chemical compound OC(O)=O BVKZGUZCCUSVTD-UHFFFAOYSA-N 0.000 description 5
- 238000012512 characterization method Methods 0.000 description 5
- 239000013078 crystal Substances 0.000 description 5
- 125000004093 cyano group Chemical group *C#N 0.000 description 5
- 239000002552 dosage form Substances 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 5
- 230000002255 enzymatic effect Effects 0.000 description 5
- 229910052731 fluorine Inorganic materials 0.000 description 5
- 125000000524 functional group Chemical group 0.000 description 5
- 125000001165 hydrophobic group Chemical group 0.000 description 5
- 150000002678 macrocyclic compounds Chemical class 0.000 description 5
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 5
- 239000011574 phosphorus Substances 0.000 description 5
- 230000002265 prevention Effects 0.000 description 5
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 5
- 241000894007 species Species 0.000 description 5
- 125000000547 substituted alkyl group Chemical group 0.000 description 5
- 125000004066 1-hydroxyethyl group Chemical group [H]OC([H])([*])C([H])([H])[H] 0.000 description 4
- 238000009010 Bradford assay Methods 0.000 description 4
- KXDHJXZQYSOELW-UHFFFAOYSA-N Carbamic acid Chemical compound NC(O)=O KXDHJXZQYSOELW-UHFFFAOYSA-N 0.000 description 4
- 125000000296 D-methionine group Chemical group [H]N([H])[C@@]([H])(C(=O)[*])C([H])([H])C([H])([H])SC([H])([H])[H] 0.000 description 4
- 125000001711 D-phenylalanine group Chemical group [H]N([H])[C@@]([H])(C(=O)[*])C([H])([H])C1=C([H])C([H])=C([H])C([H])=C1[H] 0.000 description 4
- 229940124602 FDA-approved drug Drugs 0.000 description 4
- MBMLMWLHJBBADN-UHFFFAOYSA-N Ferrous sulfide Chemical compound [Fe]=S MBMLMWLHJBBADN-UHFFFAOYSA-N 0.000 description 4
- 125000003412 L-alanyl group Chemical group [H]N([H])[C@@](C([H])([H])[H])(C(=O)[*])[H] 0.000 description 4
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical compound CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 4
- 241000124008 Mammalia Species 0.000 description 4
- UFWIBTONFRDIAS-UHFFFAOYSA-N Naphthalene Chemical compound C1=CC=CC2=CC=CC=C21 UFWIBTONFRDIAS-UHFFFAOYSA-N 0.000 description 4
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 4
- 239000002253 acid Substances 0.000 description 4
- 239000000556 agonist Substances 0.000 description 4
- 150000001408 amides Chemical class 0.000 description 4
- 125000003277 amino group Chemical group 0.000 description 4
- 229910052801 chlorine Inorganic materials 0.000 description 4
- 238000004587 chromatography analysis Methods 0.000 description 4
- 230000008878 coupling Effects 0.000 description 4
- 238000010168 coupling process Methods 0.000 description 4
- 238000005859 coupling reaction Methods 0.000 description 4
- 238000002425 crystallisation Methods 0.000 description 4
- 230000008025 crystallization Effects 0.000 description 4
- 229910001873 dinitrogen Inorganic materials 0.000 description 4
- ZUOUZKKEUPVFJK-UHFFFAOYSA-N diphenyl Chemical compound C1=CC=CC=C1C1=CC=CC=C1 ZUOUZKKEUPVFJK-UHFFFAOYSA-N 0.000 description 4
- 229960003692 gamma aminobutyric acid Drugs 0.000 description 4
- 239000000499 gel Substances 0.000 description 4
- 125000005843 halogen group Chemical group 0.000 description 4
- 150000002430 hydrocarbons Chemical group 0.000 description 4
- 238000002347 injection Methods 0.000 description 4
- 239000007924 injection Substances 0.000 description 4
- 230000007246 mechanism Effects 0.000 description 4
- 230000002503 metabolic effect Effects 0.000 description 4
- 125000000449 nitro group Chemical group [O-][N+](*)=O 0.000 description 4
- 108020004707 nucleic acids Proteins 0.000 description 4
- 150000007523 nucleic acids Chemical class 0.000 description 4
- 102000039446 nucleic acids Human genes 0.000 description 4
- 230000003287 optical effect Effects 0.000 description 4
- 229960001639 penicillamine Drugs 0.000 description 4
- 238000010647 peptide synthesis reaction Methods 0.000 description 4
- 239000003755 preservative agent Substances 0.000 description 4
- 125000004076 pyridyl group Chemical group 0.000 description 4
- 238000010791 quenching Methods 0.000 description 4
- 230000000171 quenching effect Effects 0.000 description 4
- 238000001896 rotating frame Overhauser effect spectroscopy Methods 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- 229940124597 therapeutic agent Drugs 0.000 description 4
- 238000000844 transformation Methods 0.000 description 4
- QWXZOFZKSQXPDC-NSHDSACASA-N (2s)-2-(9h-fluoren-9-ylmethoxycarbonylamino)propanoic acid Chemical compound C1=CC=C2C(COC(=O)N[C@@H](C)C(O)=O)C3=CC=CC=C3C2=C1 QWXZOFZKSQXPDC-NSHDSACASA-N 0.000 description 3
- KJUGUADJHNHALS-UHFFFAOYSA-N 1H-tetrazole Chemical compound C=1N=NNN=1 KJUGUADJHNHALS-UHFFFAOYSA-N 0.000 description 3
- OLXZPDWKRNYJJZ-UHFFFAOYSA-N 2'-dA Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(CO)O1 OLXZPDWKRNYJJZ-UHFFFAOYSA-N 0.000 description 3
- WFDIJRYMOXRFFG-UHFFFAOYSA-N Acetic anhydride Chemical compound CC(=O)OC(C)=O WFDIJRYMOXRFFG-UHFFFAOYSA-N 0.000 description 3
- UHOVQNZJYSORNB-UHFFFAOYSA-N Benzene Chemical compound C1=CC=CC=C1 UHOVQNZJYSORNB-UHFFFAOYSA-N 0.000 description 3
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 3
- ZAMOUSCENKQFHK-UHFFFAOYSA-N Chlorine atom Chemical compound [Cl] ZAMOUSCENKQFHK-UHFFFAOYSA-N 0.000 description 3
- FDKWRPBBCBCIGA-UWTATZPHSA-N D-Selenocysteine Natural products [Se]C[C@@H](N)C(O)=O FDKWRPBBCBCIGA-UWTATZPHSA-N 0.000 description 3
- 229930182818 D-methionine Natural products 0.000 description 3
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- PXGOKWXKJXAPGV-UHFFFAOYSA-N Fluorine Chemical compound FF PXGOKWXKJXAPGV-UHFFFAOYSA-N 0.000 description 3
- YLQBMQCUIZJEEH-UHFFFAOYSA-N Furan Chemical compound C=1C=COC=1 YLQBMQCUIZJEEH-UHFFFAOYSA-N 0.000 description 3
- 239000007821 HATU Substances 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- PAYRUJLWNCNPSJ-UHFFFAOYSA-N N-phenyl amine Natural products NC1=CC=CC=C1 PAYRUJLWNCNPSJ-UHFFFAOYSA-N 0.000 description 3
- 229910004749 OS(O)2 Inorganic materials 0.000 description 3
- ZCQWOFVYLHDMMC-UHFFFAOYSA-N Oxazole Chemical compound C1=COC=N1 ZCQWOFVYLHDMMC-UHFFFAOYSA-N 0.000 description 3
- 239000004721 Polyphenylene oxide Substances 0.000 description 3
- KWYUFKZDYYNOTN-UHFFFAOYSA-M Potassium hydroxide Chemical compound [OH-].[K+] KWYUFKZDYYNOTN-UHFFFAOYSA-M 0.000 description 3
- DNIAPMSPPWPWGF-UHFFFAOYSA-N Propylene glycol Chemical compound CC(O)CO DNIAPMSPPWPWGF-UHFFFAOYSA-N 0.000 description 3
- RWRDLPDLKQPQOW-UHFFFAOYSA-N Pyrrolidine Chemical compound C1CCNC1 RWRDLPDLKQPQOW-UHFFFAOYSA-N 0.000 description 3
- WGQKYBSKWIADBV-UHFFFAOYSA-N aminomethyl benzene Natural products NCC1=CC=CC=C1 WGQKYBSKWIADBV-UHFFFAOYSA-N 0.000 description 3
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 3
- 229960000723 ampicillin Drugs 0.000 description 3
- 239000003242 anti bacterial agent Substances 0.000 description 3
- 230000003466 anti-cipated effect Effects 0.000 description 3
- 239000002246 antineoplastic agent Substances 0.000 description 3
- 239000007864 aqueous solution Substances 0.000 description 3
- 229940098773 bovine serum albumin Drugs 0.000 description 3
- 229910052794 bromium Inorganic materials 0.000 description 3
- 150000001735 carboxylic acids Chemical class 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- 230000008602 contraction Effects 0.000 description 3
- 150000001975 deuterium Chemical class 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 230000018109 developmental process Effects 0.000 description 3
- 239000006185 dispersion Substances 0.000 description 3
- 238000006345 epimerization reaction Methods 0.000 description 3
- 239000011737 fluorine Substances 0.000 description 3
- 238000009472 formulation Methods 0.000 description 3
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 3
- 229910052737 gold Inorganic materials 0.000 description 3
- 239000010931 gold Substances 0.000 description 3
- 125000004366 heterocycloalkenyl group Chemical group 0.000 description 3
- 229940088597 hormone Drugs 0.000 description 3
- 239000005556 hormone Substances 0.000 description 3
- 125000002768 hydroxyalkyl group Chemical group 0.000 description 3
- 230000001965 increasing effect Effects 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 125000003453 indazolyl group Chemical group N1N=C(C2=C1C=CC=C2)* 0.000 description 3
- 230000002401 inhibitory effect Effects 0.000 description 3
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 3
- 229930027917 kanamycin Natural products 0.000 description 3
- 229960000318 kanamycin Drugs 0.000 description 3
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 3
- 229930182823 kanamycin A Natural products 0.000 description 3
- 230000035800 maturation Effects 0.000 description 3
- 229910052751 metal Inorganic materials 0.000 description 3
- 239000002184 metal Substances 0.000 description 3
- SBOJXQVPLKSXOG-UHFFFAOYSA-N o-amino-hydroxylamine Chemical compound NON SBOJXQVPLKSXOG-UHFFFAOYSA-N 0.000 description 3
- 230000008520 organization Effects 0.000 description 3
- 229920000570 polyether Polymers 0.000 description 3
- 229920000642 polymer Polymers 0.000 description 3
- 239000002243 precursor Substances 0.000 description 3
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 3
- 125000002577 pseudohalo group Chemical group 0.000 description 3
- 125000003373 pyrazinyl group Chemical group 0.000 description 3
- 125000002943 quinolinyl group Chemical group N1=C(C=CC2=CC=CC=C12)* 0.000 description 3
- 238000011084 recovery Methods 0.000 description 3
- QEVHRUUCFGRFIF-MDEJGZGSSA-N reserpine Chemical compound O([C@H]1[C@@H]([C@H]([C@H]2C[C@@H]3C4=C(C5=CC=C(OC)C=C5N4)CCN3C[C@H]2C1)C(=O)OC)OC)C(=O)C1=CC(OC)=C(OC)C(OC)=C1 QEVHRUUCFGRFIF-MDEJGZGSSA-N 0.000 description 3
- ZKZBPNGNEQAJSX-UHFFFAOYSA-N selenocysteine Natural products [SeH]CC(N)C(O)=O ZKZBPNGNEQAJSX-UHFFFAOYSA-N 0.000 description 3
- 235000016491 selenocysteine Nutrition 0.000 description 3
- 229940055619 selenocysteine Drugs 0.000 description 3
- 238000003756 stirring Methods 0.000 description 3
- 125000005415 substituted alkoxy group Chemical group 0.000 description 3
- 125000005346 substituted cycloalkyl group Chemical group 0.000 description 3
- 235000000346 sugar Nutrition 0.000 description 3
- 150000008163 sugars Chemical class 0.000 description 3
- 125000004434 sulfur atom Chemical group 0.000 description 3
- 239000000725 suspension Substances 0.000 description 3
- 208000024891 symptom Diseases 0.000 description 3
- JOXIMZWYDAKGHI-UHFFFAOYSA-N toluene-4-sulfonic acid Chemical compound CC1=CC=C(S(O)(=O)=O)C=C1 JOXIMZWYDAKGHI-UHFFFAOYSA-N 0.000 description 3
- 238000001195 ultra high performance liquid chromatography Methods 0.000 description 3
- 229940054967 vanquish Drugs 0.000 description 3
- KRKRAOXTGDJWNI-UHFFFAOYSA-N (2R,4S)-4-Methyl glutamic acid Natural products OC(=O)C(C)CC(N)C(O)=O KRKRAOXTGDJWNI-UHFFFAOYSA-N 0.000 description 2
- SWXOGPJRIDTIRL-DOUNNPEJSA-N (4r,7s,10s,13r,16s,19r)-10-(4-aminobutyl)-n-[(2s)-1-amino-3-(1h-indol-3-yl)-1-oxopropan-2-yl]-19-[[(2r)-2-amino-3-phenylpropanoyl]amino]-16-[(4-hydroxyphenyl)methyl]-13-(1h-indol-3-ylmethyl)-6,9,12,15,18-pentaoxo-7-propan-2-yl-1,2-dithia-5,8,11,14,17-pent Chemical compound C([C@H]1C(=O)N[C@H](CC=2C3=CC=CC=C3NC=2)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(N[C@@H](CSSC[C@@H](C(=O)N1)NC(=O)[C@H](N)CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(N)=O)=O)C(C)C)C1=CC=C(O)C=C1 SWXOGPJRIDTIRL-DOUNNPEJSA-N 0.000 description 2
- QCHPKSFMDHPSNR-GSVOUGTGSA-N (R)-3-aminoisobutyric acid Chemical group [NH3+]C[C@@H](C)C([O-])=O QCHPKSFMDHPSNR-GSVOUGTGSA-N 0.000 description 2
- QCHPKSFMDHPSNR-VKHMYHEASA-N (S)-3-aminoisobutyric acid Chemical group [NH3+]C[C@H](C)C([O-])=O QCHPKSFMDHPSNR-VKHMYHEASA-N 0.000 description 2
- 125000003088 (fluoren-9-ylmethoxy)carbonyl group Chemical group 0.000 description 2
- 238000005084 2D-nuclear magnetic resonance Methods 0.000 description 2
- QCHPKSFMDHPSNR-UHFFFAOYSA-N 3-Amino-2-methylpropanoic acid Natural products NCC(C)C(O)=O QCHPKSFMDHPSNR-UHFFFAOYSA-N 0.000 description 2
- 125000003349 3-pyridyl group Chemical group N1=C([H])C([*])=C([H])C([H])=C1[H] 0.000 description 2
- ZCYVEMRRCGMTRW-UHFFFAOYSA-N 7553-56-2 Chemical compound [I] ZCYVEMRRCGMTRW-UHFFFAOYSA-N 0.000 description 2
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical class CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 2
- 229920001817 Agar Polymers 0.000 description 2
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 2
- 229920000856 Amylose Polymers 0.000 description 2
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 2
- 108010062877 Bacteriocins Proteins 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- WKBOTKDWSSQWDR-UHFFFAOYSA-N Bromine atom Chemical compound [Br] WKBOTKDWSSQWDR-UHFFFAOYSA-N 0.000 description 2
- 102000008203 CTLA-4 Antigen Human genes 0.000 description 2
- 108010021064 CTLA-4 Antigen Proteins 0.000 description 2
- 241000282472 Canis lupus familiaris Species 0.000 description 2
- 241000283707 Capra Species 0.000 description 2
- 229920002134 Carboxymethyl cellulose Polymers 0.000 description 2
- 241000700199 Cavia porcellus Species 0.000 description 2
- 108010019670 Chimeric Antigen Receptors Proteins 0.000 description 2
- KRKNYBCHXYNGOX-UHFFFAOYSA-K Citrate Chemical class [O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O KRKNYBCHXYNGOX-UHFFFAOYSA-K 0.000 description 2
- 108010069514 Cyclic Peptides Proteins 0.000 description 2
- 102000001189 Cyclic Peptides Human genes 0.000 description 2
- FFFHZYDWPBMWHY-GSVOUGTGSA-N D-Homocysteine Chemical group OC(=O)[C@H](N)CCS FFFHZYDWPBMWHY-GSVOUGTGSA-N 0.000 description 2
- 125000000030 D-alanine group Chemical group [H]N([H])[C@](C([H])([H])[H])(C(=O)[*])[H] 0.000 description 2
- 125000000028 D-cysteine group Chemical group [H]N([H])[C@@]([H])(C(=O)[*])C(S[H])([H])[H] 0.000 description 2
- KZSNJWFQEVHDMF-SCSAIBSYSA-N D-valine Chemical compound CC(C)[C@@H](N)C(O)=O KZSNJWFQEVHDMF-SCSAIBSYSA-N 0.000 description 2
- 229930182831 D-valine Natural products 0.000 description 2
- 150000008573 D-valines Chemical class 0.000 description 2
- 125000003625 D-valyl group Chemical group N[C@@H](C(=O)*)C(C)C 0.000 description 2
- AOJJSUZBOXZQNB-TZSSRYMLSA-N Doxorubicin Chemical compound O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(=O)CO)[C@H]1C[C@H](N)[C@H](O)[C@H](C)O1 AOJJSUZBOXZQNB-TZSSRYMLSA-N 0.000 description 2
- 229930195710 D‐cysteine Chemical group 0.000 description 2
- 241000282326 Felis catus Species 0.000 description 2
- 101000978418 Homo sapiens Melanocortin receptor 4 Proteins 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 2
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 2
- 102000014150 Interferons Human genes 0.000 description 2
- 108010050904 Interferons Proteins 0.000 description 2
- 102000015696 Interleukins Human genes 0.000 description 2
- 108010063738 Interleukins Proteins 0.000 description 2
- OYIFNHCXNCRBQI-BYPYZUCNSA-N L-2-aminoadipic acid Chemical compound OC(=O)[C@@H](N)CCCC(O)=O OYIFNHCXNCRBQI-BYPYZUCNSA-N 0.000 description 2
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 2
- 102100023724 Melanocortin receptor 4 Human genes 0.000 description 2
- YNAVUWVOSKDBBP-UHFFFAOYSA-N Morpholine Chemical compound C1COCCN1 YNAVUWVOSKDBBP-UHFFFAOYSA-N 0.000 description 2
- 102000016943 Muramidase Human genes 0.000 description 2
- 108010014251 Muramidase Proteins 0.000 description 2
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 2
- 150000001204 N-oxides Chemical class 0.000 description 2
- 102000038427 NEDD8-activating enzyme E1 Human genes 0.000 description 2
- 108091007790 NEDD8-activating enzyme E1 Proteins 0.000 description 2
- 229940122313 Nucleoside reverse transcriptase inhibitor Drugs 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- 241001494479 Pecora Species 0.000 description 2
- GLUUGHFHXGJENI-UHFFFAOYSA-N Piperazine Chemical compound C1CNCCN1 GLUUGHFHXGJENI-UHFFFAOYSA-N 0.000 description 2
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 2
- 102000012338 Poly(ADP-ribose) Polymerases Human genes 0.000 description 2
- 108010061844 Poly(ADP-ribose) Polymerases Proteins 0.000 description 2
- 229920000776 Poly(Adenosine diphosphate-ribose) polymerase Polymers 0.000 description 2
- 239000002202 Polyethylene glycol Substances 0.000 description 2
- 102100040678 Programmed cell death protein 1 Human genes 0.000 description 2
- 101710089372 Programmed cell death protein 1 Proteins 0.000 description 2
- KYQCOXFCLRTKLS-UHFFFAOYSA-N Pyrazine Chemical compound C1=CN=CC=N1 KYQCOXFCLRTKLS-UHFFFAOYSA-N 0.000 description 2
- KAESVJOAVNADME-UHFFFAOYSA-N Pyrrole Chemical compound C=1C=CNC=1 KAESVJOAVNADME-UHFFFAOYSA-N 0.000 description 2
- 102000004879 Racemases and epimerases Human genes 0.000 description 2
- 108090001066 Racemases and epimerases Proteins 0.000 description 2
- 102000004389 Ribonucleoproteins Human genes 0.000 description 2
- 108010081734 Ribonucleoproteins Proteins 0.000 description 2
- 108091028664 Ribonucleotide Proteins 0.000 description 2
- 101000629318 Severe acute respiratory syndrome coronavirus 2 Spike glycoprotein Proteins 0.000 description 2
- 101710086549 Sporulation killing factor Proteins 0.000 description 2
- 102000013530 TOR Serine-Threonine Kinases Human genes 0.000 description 2
- 108010065917 TOR Serine-Threonine Kinases Proteins 0.000 description 2
- WYURNTSHIVDZCO-UHFFFAOYSA-N Tetrahydrofuran Chemical compound C1CCOC1 WYURNTSHIVDZCO-UHFFFAOYSA-N 0.000 description 2
- YTPLMLYBLZKORZ-UHFFFAOYSA-N Thiophene Chemical compound C=1C=CSC=1 YTPLMLYBLZKORZ-UHFFFAOYSA-N 0.000 description 2
- 102000002689 Toll-like receptor Human genes 0.000 description 2
- 108020000411 Toll-like receptor Proteins 0.000 description 2
- 238000010521 absorption reaction Methods 0.000 description 2
- 230000002378 acidificating effect Effects 0.000 description 2
- 150000007513 acids Chemical class 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- 125000002015 acyclic group Chemical group 0.000 description 2
- 239000002671 adjuvant Substances 0.000 description 2
- 239000008272 agar Substances 0.000 description 2
- 230000008484 agonism Effects 0.000 description 2
- 125000005431 alkyl carboxamide group Chemical group 0.000 description 2
- 125000005119 alkyl cycloalkyl group Chemical group 0.000 description 2
- 230000029936 alkylation Effects 0.000 description 2
- 238000005804 alkylation reaction Methods 0.000 description 2
- 125000003275 alpha amino acid group Chemical group 0.000 description 2
- 230000004075 alteration Effects 0.000 description 2
- RDOXTESZEPMUJZ-UHFFFAOYSA-N anisole Chemical compound COC1=CC=CC=C1 RDOXTESZEPMUJZ-UHFFFAOYSA-N 0.000 description 2
- 230000003288 anthiarrhythmic effect Effects 0.000 description 2
- MWPLVEDNUUSJAV-UHFFFAOYSA-N anthracene Chemical compound C1=CC=CC2=CC3=CC=CC=C3C=C21 MWPLVEDNUUSJAV-UHFFFAOYSA-N 0.000 description 2
- 239000001961 anticonvulsive agent Substances 0.000 description 2
- 239000002220 antihypertensive agent Substances 0.000 description 2
- 229940030600 antihypertensive agent Drugs 0.000 description 2
- 229940034982 antineoplastic agent Drugs 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 239000012298 atmosphere Substances 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 125000000499 benzofuranyl group Chemical group O1C(=CC2=C1C=CC=C2)* 0.000 description 2
- WPYMKLBDIGXBTP-UHFFFAOYSA-N benzoic acid Chemical compound OC(=O)C1=CC=CC=C1 WPYMKLBDIGXBTP-UHFFFAOYSA-N 0.000 description 2
- UCMIRNVEIXFBKS-UHFFFAOYSA-N beta-alanine Chemical group NCCC(O)=O UCMIRNVEIXFBKS-UHFFFAOYSA-N 0.000 description 2
- 150000005347 biaryls Chemical group 0.000 description 2
- 125000002619 bicyclic group Chemical group 0.000 description 2
- 230000031018 biological processes and functions Effects 0.000 description 2
- 235000010290 biphenyl Nutrition 0.000 description 2
- 239000004305 biphenyl Substances 0.000 description 2
- GDTBXPJZTBHREO-UHFFFAOYSA-N bromine Substances BrBr GDTBXPJZTBHREO-UHFFFAOYSA-N 0.000 description 2
- 238000011088 calibration curve Methods 0.000 description 2
- 239000012482 calibration solution Substances 0.000 description 2
- CREMABGTGYGIQB-UHFFFAOYSA-N carbon carbon Chemical group C.C CREMABGTGYGIQB-UHFFFAOYSA-N 0.000 description 2
- 239000001768 carboxy methyl cellulose Substances 0.000 description 2
- 235000010948 carboxy methyl cellulose Nutrition 0.000 description 2
- 239000008112 carboxymethyl-cellulose Substances 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- OSASVXMJTNOKOY-UHFFFAOYSA-N chlorobutanol Chemical compound CC(C)(O)C(Cl)(Cl)Cl OSASVXMJTNOKOY-UHFFFAOYSA-N 0.000 description 2
- 238000011210 chromatographic step Methods 0.000 description 2
- 238000012790 confirmation Methods 0.000 description 2
- 238000012937 correction Methods 0.000 description 2
- 229960000956 coumarin Drugs 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000003745 diagnosis Methods 0.000 description 2
- 125000005432 dialkylcarboxamide group Chemical group 0.000 description 2
- VDQVEACBQKUUSU-UHFFFAOYSA-M disodium;sulfanide Chemical compound [Na+].[Na+].[SH-] VDQVEACBQKUUSU-UHFFFAOYSA-M 0.000 description 2
- 238000004090 dissolution Methods 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 238000001035 drying Methods 0.000 description 2
- 238000000192 extended X-ray absorption fine structure spectroscopy Methods 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 238000001030 gas--liquid chromatography Methods 0.000 description 2
- RWSXRVCMGQZWBV-WDSKDSINSA-N glutathione Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@@H](CS)C(=O)NCC(O)=O RWSXRVCMGQZWBV-WDSKDSINSA-N 0.000 description 2
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 2
- 125000001188 haloalkyl group Chemical group 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 125000004404 heteroalkyl group Chemical group 0.000 description 2
- 150000004687 hexahydrates Chemical class 0.000 description 2
- 238000004896 high resolution mass spectrometry Methods 0.000 description 2
- 238000000589 high-performance liquid chromatography-mass spectrometry Methods 0.000 description 2
- 244000005702 human microbiome Species 0.000 description 2
- BHEPBYXIRTUNPN-UHFFFAOYSA-N hydridophosphorus(.) (triplet) Chemical compound [PH] BHEPBYXIRTUNPN-UHFFFAOYSA-N 0.000 description 2
- 125000002883 imidazolyl group Chemical group 0.000 description 2
- 230000001976 improved effect Effects 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 238000011065 in-situ storage Methods 0.000 description 2
- 125000001041 indolyl group Chemical group 0.000 description 2
- 230000005764 inhibitory process Effects 0.000 description 2
- 239000007972 injectable composition Substances 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical class N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 2
- 229940047124 interferons Drugs 0.000 description 2
- 229940047122 interleukins Drugs 0.000 description 2
- 229910052740 iodine Inorganic materials 0.000 description 2
- 239000011630 iodine Substances 0.000 description 2
- RBTARNINKXHZNM-UHFFFAOYSA-K iron trichloride Chemical compound Cl[Fe](Cl)Cl RBTARNINKXHZNM-UHFFFAOYSA-K 0.000 description 2
- 150000002576 ketones Chemical class 0.000 description 2
- 239000008101 lactose Substances 0.000 description 2
- 239000003446 ligand Substances 0.000 description 2
- 238000004811 liquid chromatography Methods 0.000 description 2
- 239000006166 lysate Substances 0.000 description 2
- 239000004325 lysozyme Substances 0.000 description 2
- 235000010335 lysozyme Nutrition 0.000 description 2
- 229960000274 lysozyme Drugs 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 125000000250 methylamino group Chemical group [H]N(*)C([H])([H])[H] 0.000 description 2
- 229940029985 mineral supplement Drugs 0.000 description 2
- 235000020786 mineral supplement Nutrition 0.000 description 2
- 230000000116 mitigating effect Effects 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 238000006011 modification reaction Methods 0.000 description 2
- 239000003607 modifier Substances 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- LGQLOGILCSXPEA-UHFFFAOYSA-L nickel sulfate Chemical compound [Ni+2].[O-]S([O-])(=O)=O LGQLOGILCSXPEA-UHFFFAOYSA-L 0.000 description 2
- 229910000363 nickel(II) sulfate Inorganic materials 0.000 description 2
- 125000006574 non-aromatic ring group Chemical group 0.000 description 2
- 238000000655 nuclear magnetic resonance spectrum Methods 0.000 description 2
- 239000002773 nucleotide Substances 0.000 description 2
- 125000003729 nucleotide group Chemical group 0.000 description 2
- 210000003463 organelle Anatomy 0.000 description 2
- 125000001715 oxadiazolyl group Chemical group 0.000 description 2
- 230000006320 pegylation Effects 0.000 description 2
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 2
- 239000000825 pharmaceutical preparation Substances 0.000 description 2
- 230000000144 pharmacologic effect Effects 0.000 description 2
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- 235000021317 phosphate Nutrition 0.000 description 2
- 125000003386 piperidinyl group Chemical group 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- BASFCYQUMIYNBI-UHFFFAOYSA-N platinum Chemical compound [Pt] BASFCYQUMIYNBI-UHFFFAOYSA-N 0.000 description 2
- 229920001281 polyalkylene Polymers 0.000 description 2
- 229920000728 polyester Polymers 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 229920001184 polypeptide Polymers 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 239000002244 precipitate Substances 0.000 description 2
- 230000002335 preservative effect Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000000425 proton nuclear magnetic resonance spectrum Methods 0.000 description 2
- 125000003226 pyrazolyl group Chemical group 0.000 description 2
- 125000002098 pyridazinyl group Chemical group 0.000 description 2
- UMJSCPRVCHMLSP-UHFFFAOYSA-N pyridine Natural products COC1=CC=CN=C1 UMJSCPRVCHMLSP-UHFFFAOYSA-N 0.000 description 2
- 125000000714 pyrimidinyl group Chemical group 0.000 description 2
- MMXZSJMASHPLLR-UHFFFAOYSA-N pyrroloquinoline quinone Chemical compound C12=C(C(O)=O)C=C(C(O)=O)N=C2C(=O)C(=O)C2=C1NC(C(=O)O)=C2 MMXZSJMASHPLLR-UHFFFAOYSA-N 0.000 description 2
- 239000010453 quartz Substances 0.000 description 2
- 102000005962 receptors Human genes 0.000 description 2
- 108020003175 receptors Proteins 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 239000002336 ribonucleotide Substances 0.000 description 2
- 125000002652 ribonucleotide group Chemical group 0.000 description 2
- 238000007363 ring formation reaction Methods 0.000 description 2
- 239000003419 rna directed dna polymerase inhibitor Substances 0.000 description 2
- 239000000523 sample Substances 0.000 description 2
- 239000004576 sand Substances 0.000 description 2
- 239000000932 sedative agent Substances 0.000 description 2
- 229940125723 sedative agent Drugs 0.000 description 2
- 239000011734 sodium Substances 0.000 description 2
- 229910052979 sodium sulfide Inorganic materials 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 238000004611 spectroscopical analysis Methods 0.000 description 2
- 239000003381 stabilizer Substances 0.000 description 2
- 239000007858 starting material Substances 0.000 description 2
- 239000004094 surface-active agent Substances 0.000 description 2
- 125000000999 tert-butyl group Chemical group [H]C([H])([H])C(*)(C([H])([H])[H])C([H])([H])[H] 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 238000002560 therapeutic procedure Methods 0.000 description 2
- 125000001544 thienyl group Chemical group 0.000 description 2
- 150000007944 thiolates Chemical class 0.000 description 2
- 238000012384 transportation and delivery Methods 0.000 description 2
- ZGYICYBLPGRURT-UHFFFAOYSA-N tri(propan-2-yl)silicon Chemical compound CC(C)[Si](C(C)C)C(C)C ZGYICYBLPGRURT-UHFFFAOYSA-N 0.000 description 2
- 108700029852 vapreotide Proteins 0.000 description 2
- 229960002730 vapreotide Drugs 0.000 description 2
- 239000003981 vehicle Substances 0.000 description 2
- 229940088594 vitamin Drugs 0.000 description 2
- 229930003231 vitamin Natural products 0.000 description 2
- 235000013343 vitamin Nutrition 0.000 description 2
- 239000011782 vitamin Substances 0.000 description 2
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 description 1
- KSDTXRUIZMTBNV-MRXNPFEDSA-N (2r)-2-(9h-fluoren-9-ylmethoxycarbonylamino)butanedioic acid Chemical compound C1=CC=C2C(COC(=O)N[C@H](CC(=O)O)C(O)=O)C3=CC=CC=C3C2=C1 KSDTXRUIZMTBNV-MRXNPFEDSA-N 0.000 description 1
- FDKWRPBBCBCIGA-REOHCLBHSA-N (2r)-2-azaniumyl-3-$l^{1}-selanylpropanoate Chemical compound [Se]C[C@H](N)C(O)=O FDKWRPBBCBCIGA-REOHCLBHSA-N 0.000 description 1
- FMJPFPZKXLRBOJ-KQYNXXCUSA-N (2r,3r,4s,5r)-2-(6-aminopurin-9-yl)-5-methyloxolane-3,4-diol Chemical compound O[C@@H]1[C@H](O)[C@@H]([CH2])O[C@H]1N1C2=NC=NC(N)=C2N=C1 FMJPFPZKXLRBOJ-KQYNXXCUSA-N 0.000 description 1
- FPDYKABXINADKS-LURJTMIESA-N (2s)-2-(methylazaniumyl)hexanoate Chemical group CCCC[C@H](NC)C(O)=O FPDYKABXINADKS-LURJTMIESA-N 0.000 description 1
- LXRUAYBIUSUULX-GBXIJSLDSA-N (2s,3r)-2-amino-3-methylbutanedioic acid Chemical compound OC(=O)[C@H](C)[C@H](N)C(O)=O LXRUAYBIUSUULX-GBXIJSLDSA-N 0.000 description 1
- FHJNAFIJPFGZRI-WUJLRWPWSA-N (2s,3r)-2-amino-3-methylpentanedioic acid Chemical compound OC(=O)C[C@@H](C)[C@H](N)C(O)=O FHJNAFIJPFGZRI-WUJLRWPWSA-N 0.000 description 1
- FHJNAFIJPFGZRI-UCORVYFPSA-N (2s,3s)-2-amino-3-methylpentanedioic acid Chemical compound OC(=O)C[C@H](C)[C@H](N)C(O)=O FHJNAFIJPFGZRI-UCORVYFPSA-N 0.000 description 1
- KRKRAOXTGDJWNI-DMTCNVIQSA-N (2s,4r)-2-amino-4-methylpentanedioic acid Chemical compound OC(=O)[C@H](C)C[C@H](N)C(O)=O KRKRAOXTGDJWNI-DMTCNVIQSA-N 0.000 description 1
- KRKRAOXTGDJWNI-IMJSIDKUSA-N (2s,4s)-2-amino-4-methylpentanedioic acid Chemical compound OC(=O)[C@@H](C)C[C@H](N)C(O)=O KRKRAOXTGDJWNI-IMJSIDKUSA-N 0.000 description 1
- OQEBBZSWEGYTPG-VKHMYHEASA-N (3s)-3-aminobutanoic acid Chemical compound C[C@H](N)CC(O)=O OQEBBZSWEGYTPG-VKHMYHEASA-N 0.000 description 1
- 125000006686 (C1-C24) alkyl group Chemical group 0.000 description 1
- 125000006273 (C1-C3) alkyl group Chemical group 0.000 description 1
- 125000006527 (C1-C5) alkyl group Chemical group 0.000 description 1
- 125000004169 (C1-C6) alkyl group Chemical group 0.000 description 1
- 125000006701 (C1-C7) alkyl group Chemical group 0.000 description 1
- 125000004209 (C1-C8) alkyl group Chemical group 0.000 description 1
- 125000006694 (C2-C10) heterocyclyl group Chemical group 0.000 description 1
- 125000006695 (C2-C11) heterocyclyl group Chemical group 0.000 description 1
- 125000006696 (C2-C18) heterocyclyl group Chemical group 0.000 description 1
- 125000006687 (C2-C3) heterocyclyl group Chemical group 0.000 description 1
- 125000006688 (C2-C4) heterocyclyl group Chemical group 0.000 description 1
- 125000006689 (C2-C5) heterocyclyl group Chemical group 0.000 description 1
- 125000006690 (C2-C6) heterocyclyl group Chemical group 0.000 description 1
- 125000006691 (C2-C7) heterocyclyl group Chemical group 0.000 description 1
- 125000006692 (C2-C8) heterocyclyl group Chemical group 0.000 description 1
- 125000006693 (C2-C9) heterocyclyl group Chemical group 0.000 description 1
- GVJHHUAWPYXKBD-IEOSBIPESA-N (R)-alpha-Tocopherol Natural products OC1=C(C)C(C)=C2O[C@@](CCC[C@H](C)CCC[C@H](C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-IEOSBIPESA-N 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- DHBXNPKRAUYBTH-UHFFFAOYSA-N 1,1-ethanedithiol Chemical compound CC(S)S DHBXNPKRAUYBTH-UHFFFAOYSA-N 0.000 description 1
- UGUHFDPGDQDVGX-UHFFFAOYSA-N 1,2,3-thiadiazole Chemical compound C1=CSN=N1 UGUHFDPGDQDVGX-UHFFFAOYSA-N 0.000 description 1
- JYEUMXHLPRZUAT-UHFFFAOYSA-N 1,2,3-triazine Chemical compound C1=CN=NN=C1 JYEUMXHLPRZUAT-UHFFFAOYSA-N 0.000 description 1
- HTJMXYRLEDBSLT-UHFFFAOYSA-N 1,2,4,5-tetrazine Chemical compound C1=NN=CN=N1 HTJMXYRLEDBSLT-UHFFFAOYSA-N 0.000 description 1
- FYADHXFMURLYQI-UHFFFAOYSA-N 1,2,4-triazine Chemical compound C1=CN=NC=N1 FYADHXFMURLYQI-UHFFFAOYSA-N 0.000 description 1
- UDGKZGLPXCRRAM-UHFFFAOYSA-N 1,2,5-thiadiazole Chemical compound C=1C=NSN=1 UDGKZGLPXCRRAM-UHFFFAOYSA-N 0.000 description 1
- FKASFBLJDCHBNZ-UHFFFAOYSA-N 1,3,4-oxadiazole Chemical compound C1=NN=CO1 FKASFBLJDCHBNZ-UHFFFAOYSA-N 0.000 description 1
- MBIZXFATKUQOOA-UHFFFAOYSA-N 1,3,4-thiadiazole Chemical compound C1=NN=CS1 MBIZXFATKUQOOA-UHFFFAOYSA-N 0.000 description 1
- JIHQDMXYYFUGFV-UHFFFAOYSA-N 1,3,5-triazine Chemical compound C1=NC=NC=N1 JIHQDMXYYFUGFV-UHFFFAOYSA-N 0.000 description 1
- 125000005871 1,3-benzodioxolyl group Chemical group 0.000 description 1
- ZOBPZXTWZATXDG-UHFFFAOYSA-N 1,3-thiazolidine-2,4-dione Chemical compound O=C1CSC(=O)N1 ZOBPZXTWZATXDG-UHFFFAOYSA-N 0.000 description 1
- RYHBNJHYFVUHQT-UHFFFAOYSA-N 1,4-Dioxane Chemical compound C1COCCO1 RYHBNJHYFVUHQT-UHFFFAOYSA-N 0.000 description 1
- JFLSOKIMYBSASW-UHFFFAOYSA-N 1-chloro-2-[chloro(diphenyl)methyl]benzene Chemical compound ClC1=CC=CC=C1C(Cl)(C=1C=CC=CC=1)C1=CC=CC=C1 JFLSOKIMYBSASW-UHFFFAOYSA-N 0.000 description 1
- FPIRBHDGWMWJEP-UHFFFAOYSA-N 1-hydroxy-7-azabenzotriazole Chemical compound C1=CN=C2N(O)N=NC2=C1 FPIRBHDGWMWJEP-UHFFFAOYSA-N 0.000 description 1
- IIZPXYDJLKNOIY-JXPKJXOSSA-N 1-palmitoyl-2-arachidonoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCC\C=C/C\C=C/C\C=C/C\C=C/CCCCC IIZPXYDJLKNOIY-JXPKJXOSSA-N 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- QWENRTYMTSOGBR-UHFFFAOYSA-N 1H-1,2,3-Triazole Chemical compound C=1C=NNN=1 QWENRTYMTSOGBR-UHFFFAOYSA-N 0.000 description 1
- LZIZWPKCGUKIGH-UHFFFAOYSA-N 2-(2h-tetrazol-5-yl)propanoic acid Chemical compound OC(=O)C(C)C=1N=NNN=1 LZIZWPKCGUKIGH-UHFFFAOYSA-N 0.000 description 1
- SCVJRXQHFJXZFZ-KVQBGUIXSA-N 2-amino-9-[(2r,4s,5r)-4-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-3h-purine-6-thione Chemical compound C1=2NC(N)=NC(=S)C=2N=CN1[C@H]1C[C@H](O)[C@@H](CO)O1 SCVJRXQHFJXZFZ-KVQBGUIXSA-N 0.000 description 1
- DJQYYYCQOZMCRC-UHFFFAOYSA-N 2-aminopropane-1,3-dithiol Chemical compound SCC(N)CS DJQYYYCQOZMCRC-UHFFFAOYSA-N 0.000 description 1
- JFOYDXJQOLMUES-UHFFFAOYSA-N 2-methyl-3-(methylazaniumyl)propanoate Chemical compound CNCC(C)C(O)=O JFOYDXJQOLMUES-UHFFFAOYSA-N 0.000 description 1
- QCDWFXQBSFUVSP-UHFFFAOYSA-N 2-phenoxyethanol Chemical compound OCCOC1=CC=CC=C1 QCDWFXQBSFUVSP-UHFFFAOYSA-N 0.000 description 1
- JQRDIZQROJQBDM-UHFFFAOYSA-N 3-amino-2,2-dimethylbutanoic acid Chemical compound CC(N)C(C)(C)C(O)=O JQRDIZQROJQBDM-UHFFFAOYSA-N 0.000 description 1
- DGNHGRLDBKAPEH-UHFFFAOYSA-N 3-azaniumyl-2,2-dimethylpropanoate Chemical group NCC(C)(C)C(O)=O DGNHGRLDBKAPEH-UHFFFAOYSA-N 0.000 description 1
- FZTIWOBQQYPTCJ-UHFFFAOYSA-N 4-[4-(4-carboxyphenyl)phenyl]benzoic acid Chemical compound C1=CC(C(=O)O)=CC=C1C1=CC=C(C=2C=CC(=CC=2)C(O)=O)C=C1 FZTIWOBQQYPTCJ-UHFFFAOYSA-N 0.000 description 1
- PXACTUVBBMDKRW-UHFFFAOYSA-M 4-bromobenzenesulfonate Chemical compound [O-]S(=O)(=O)C1=CC=C(Br)C=C1 PXACTUVBBMDKRW-UHFFFAOYSA-M 0.000 description 1
- FJKROLUGYXJWQN-UHFFFAOYSA-N 4-hydroxybenzoic acid Chemical compound OC(=O)C1=CC=C(O)C=C1 FJKROLUGYXJWQN-UHFFFAOYSA-N 0.000 description 1
- NSPMIYGKQJPBQR-UHFFFAOYSA-N 4H-1,2,4-triazole Chemical compound C=1N=CNN=1 NSPMIYGKQJPBQR-UHFFFAOYSA-N 0.000 description 1
- XGYIMTFOTBMPFP-KQYNXXCUSA-N 5'-deoxyadenosine Chemical compound O[C@@H]1[C@H](O)[C@@H](C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 XGYIMTFOTBMPFP-KQYNXXCUSA-N 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 102100022455 Adrenocorticotropic hormone receptor Human genes 0.000 description 1
- 108010083590 Apoproteins Proteins 0.000 description 1
- 102000006410 Apoproteins Human genes 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 108700016166 Bacillus subtilis sboA Proteins 0.000 description 1
- 239000005711 Benzoic acid Substances 0.000 description 1
- 229940078581 Bone resorption inhibitor Drugs 0.000 description 1
- 102000001805 Bromodomains Human genes 0.000 description 1
- 108050009021 Bromodomains Proteins 0.000 description 1
- 229940127291 Calcium channel antagonist Drugs 0.000 description 1
- OKTJSMMVPCPJKN-NJFSPNSNSA-N Carbon-14 Chemical compound [14C] OKTJSMMVPCPJKN-NJFSPNSNSA-N 0.000 description 1
- BVKZGUZCCUSVTD-UHFFFAOYSA-L Carbonate Chemical class [O-]C([O-])=O BVKZGUZCCUSVTD-UHFFFAOYSA-L 0.000 description 1
- 241000193163 Clostridioides difficile Species 0.000 description 1
- 102220602520 Complement component C6_Y19W_mutation Human genes 0.000 description 1
- 102100026278 Cysteine sulfinic acid decarboxylase Human genes 0.000 description 1
- XUJNEKJLAYXESH-UWTATZPHSA-N D-Cysteine Chemical group SC[C@@H](N)C(O)=O XUJNEKJLAYXESH-UWTATZPHSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- 150000008541 D-alanines Chemical class 0.000 description 1
- COLNVLDHVKWLRT-MRVPVSSYSA-N D-phenylalanine Chemical compound OC(=O)[C@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-MRVPVSSYSA-N 0.000 description 1
- 229930182827 D-tryptophan Natural products 0.000 description 1
- 108020004414 DNA Proteins 0.000 description 1
- 206010011953 Decreased activity Diseases 0.000 description 1
- SNRUBQQJIBEYMU-UHFFFAOYSA-N Dodecane Natural products CCCCCCCCCCCC SNRUBQQJIBEYMU-UHFFFAOYSA-N 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- LVGKNOAMLMIIKO-UHFFFAOYSA-N Elaidinsaeure-aethylester Natural products CCCCCCCCC=CCCCCCCCC(=O)OCC LVGKNOAMLMIIKO-UHFFFAOYSA-N 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 241000283086 Equidae Species 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- PMVSDNDAUGGCCE-TYYBGVCCSA-L Ferrous fumarate Chemical compound [Fe+2].[O-]C(=O)\C=C\C([O-])=O PMVSDNDAUGGCCE-TYYBGVCCSA-L 0.000 description 1
- 241000192125 Firmicutes Species 0.000 description 1
- 208000012671 Gastrointestinal haemorrhages Diseases 0.000 description 1
- 108010010803 Gelatin Proteins 0.000 description 1
- 229930182566 Gentamicin Natural products 0.000 description 1
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 1
- 108010024636 Glutathione Proteins 0.000 description 1
- DHMQDGOQFOQNFH-LGLHGEJLSA-N Glycine-d5 Chemical compound [2H]OC(=O)C([2H])([2H])N([2H])[2H] DHMQDGOQFOQNFH-LGLHGEJLSA-N 0.000 description 1
- AEMRFAOFKBGASW-UHFFFAOYSA-N Glycolic acid Polymers OCC(O)=O AEMRFAOFKBGASW-UHFFFAOYSA-N 0.000 description 1
- 108090000353 Histone deacetylase Proteins 0.000 description 1
- 102000003964 Histone deacetylase Human genes 0.000 description 1
- 101000678419 Homo sapiens Adrenocorticotropic hormone receptor Proteins 0.000 description 1
- 101000855583 Homo sapiens Cysteine sulfinic acid decarboxylase Proteins 0.000 description 1
- 101000978431 Homo sapiens Melanocortin receptor 3 Proteins 0.000 description 1
- 102000002265 Human Growth Hormone Human genes 0.000 description 1
- 108010000521 Human Growth Hormone Proteins 0.000 description 1
- 239000000854 Human Growth Hormone Substances 0.000 description 1
- 229910021578 Iron(III) chloride Inorganic materials 0.000 description 1
- 102000005298 Iron-Sulfur Proteins Human genes 0.000 description 1
- 108010081409 Iron-Sulfur Proteins Proteins 0.000 description 1
- 241000588747 Klebsiella pneumoniae Species 0.000 description 1
- 150000008575 L-amino acids Chemical class 0.000 description 1
- 241000270322 Lepidosauria Species 0.000 description 1
- 108091000076 Lysine 2,3-aminomutase Proteins 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- 102000030612 Melanocortin 5 receptor Human genes 0.000 description 1
- 108010088565 Melanocortin 5 receptor Proteins 0.000 description 1
- 102100023726 Melanocortin receptor 3 Human genes 0.000 description 1
- AFVFQIVMOAPDHO-UHFFFAOYSA-N Methanesulfonic acid Chemical compound CS(O)(=O)=O AFVFQIVMOAPDHO-UHFFFAOYSA-N 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 101100386053 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) cys-3 gene Proteins 0.000 description 1
- 101100426732 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) cys-9 gene Proteins 0.000 description 1
- GRYLNZFGIOXLOG-UHFFFAOYSA-N Nitric acid Chemical compound O[N+]([O-])=O GRYLNZFGIOXLOG-UHFFFAOYSA-N 0.000 description 1
- IGFHQQFPSIBGKE-UHFFFAOYSA-N Nonylphenol Natural products CCCCCCCCCC1=CC=C(O)C=C1 IGFHQQFPSIBGKE-UHFFFAOYSA-N 0.000 description 1
- 108010038807 Oligopeptides Proteins 0.000 description 1
- 102000015636 Oligopeptides Human genes 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- 102400000050 Oxytocin Human genes 0.000 description 1
- 229930012538 Paclitaxel Natural products 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- 229920001774 Perfluoroether Chemical group 0.000 description 1
- PCNDJXKNXGMECE-UHFFFAOYSA-N Phenazine Natural products C1=CC=CC2=NC3=CC=CC=C3N=C21 PCNDJXKNXGMECE-UHFFFAOYSA-N 0.000 description 1
- COLNVLDHVKWLRT-DJYDEVFTSA-N Phenylalanine-d8 Chemical compound [2H]N([2H])[C@]([2H])(C(O)=O)C([2H])([2H])C1=CC=C([2H])C([2H])=C1[2H] COLNVLDHVKWLRT-DJYDEVFTSA-N 0.000 description 1
- RVGRUAULSDPKGF-UHFFFAOYSA-N Poloxamer Chemical compound C1CO1.CC1CO1 RVGRUAULSDPKGF-UHFFFAOYSA-N 0.000 description 1
- 229920002732 Polyanhydride Polymers 0.000 description 1
- 229920000954 Polyglycolide Polymers 0.000 description 1
- 229920001710 Polyorthoester Polymers 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- 101800001357 Potential peptide Proteins 0.000 description 1
- 102400000745 Potential peptide Human genes 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 1
- 229940079156 Proteasome inhibitor Drugs 0.000 description 1
- 108010009736 Protein Hydrolysates Proteins 0.000 description 1
- WTKZEGDFNFYCGP-UHFFFAOYSA-N Pyrazole Chemical compound C=1C=NNC=1 WTKZEGDFNFYCGP-UHFFFAOYSA-N 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- UCKMPCXJQFINFW-UHFFFAOYSA-N Sulphide Chemical compound [S-2] UCKMPCXJQFINFW-UHFFFAOYSA-N 0.000 description 1
- 241000282898 Sus scrofa Species 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- 241000255588 Tephritidae Species 0.000 description 1
- DHXVGJBLRPWPCS-UHFFFAOYSA-N Tetrahydropyran Chemical compound C1CCOCC1 DHXVGJBLRPWPCS-UHFFFAOYSA-N 0.000 description 1
- DPOPAJRDYZGTIR-UHFFFAOYSA-N Tetrazine Chemical compound C1=CN=NN=N1 DPOPAJRDYZGTIR-UHFFFAOYSA-N 0.000 description 1
- FZWLAAWBMGSTSO-UHFFFAOYSA-N Thiazole Chemical compound C1=CSC=N1 FZWLAAWBMGSTSO-UHFFFAOYSA-N 0.000 description 1
- 229940123464 Thiazolidinedione Drugs 0.000 description 1
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 1
- YZCKVEUIGOORGS-NJFSPNSNSA-N Tritium Chemical compound [3H] YZCKVEUIGOORGS-NJFSPNSNSA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 241001464867 [Ruminococcus] gnavus Species 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 239000004480 active ingredient Substances 0.000 description 1
- 239000011149 active material Substances 0.000 description 1
- 239000012042 active reagent Substances 0.000 description 1
- 239000013543 active substance Substances 0.000 description 1
- 125000004423 acyloxy group Chemical group 0.000 description 1
- 239000000674 adrenergic antagonist Substances 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 150000001336 alkenes Chemical class 0.000 description 1
- 125000004183 alkoxy alkyl group Chemical group 0.000 description 1
- 125000003282 alkyl amino group Chemical group 0.000 description 1
- 125000004644 alkyl sulfinyl group Chemical group 0.000 description 1
- 125000004390 alkyl sulfonyl group Chemical group 0.000 description 1
- 229940087168 alpha tocopherol Drugs 0.000 description 1
- HDTRYLNUVZCQOY-LIZSDCNHSA-N alpha,alpha-trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 description 1
- 150000001412 amines Chemical class 0.000 description 1
- 125000004103 aminoalkyl group Chemical group 0.000 description 1
- 239000002269 analeptic agent Substances 0.000 description 1
- 230000000202 analgesic effect Effects 0.000 description 1
- 229940035676 analgesics Drugs 0.000 description 1
- 239000002333 angiotensin II receptor antagonist Substances 0.000 description 1
- 229940125364 angiotensin receptor blocker Drugs 0.000 description 1
- 150000001448 anilines Chemical class 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 230000000578 anorexic effect Effects 0.000 description 1
- 239000005557 antagonist Substances 0.000 description 1
- 239000000730 antalgic agent Substances 0.000 description 1
- 230000001466 anti-adreneric effect Effects 0.000 description 1
- 230000002456 anti-arthritic effect Effects 0.000 description 1
- 230000000844 anti-bacterial effect Effects 0.000 description 1
- 230000001093 anti-cancer Effects 0.000 description 1
- 230000001078 anti-cholinergic effect Effects 0.000 description 1
- 230000003556 anti-epileptic effect Effects 0.000 description 1
- 230000002924 anti-infective effect Effects 0.000 description 1
- 239000002260 anti-inflammatory agent Substances 0.000 description 1
- 229940121363 anti-inflammatory agent Drugs 0.000 description 1
- 230000001022 anti-muscarinic effect Effects 0.000 description 1
- 230000000118 anti-neoplastic effect Effects 0.000 description 1
- 230000001139 anti-pruritic effect Effects 0.000 description 1
- 230000001754 anti-pyretic effect Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 230000002921 anti-spasmodic effect Effects 0.000 description 1
- 239000003416 antiarrhythmic agent Substances 0.000 description 1
- 229940124346 antiarthritic agent Drugs 0.000 description 1
- 239000000924 antiasthmatic agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 229940125681 anticonvulsant agent Drugs 0.000 description 1
- 239000000935 antidepressant agent Substances 0.000 description 1
- 229940005513 antidepressants Drugs 0.000 description 1
- 229960003965 antiepileptics Drugs 0.000 description 1
- 239000002518 antifoaming agent Substances 0.000 description 1
- 229940121375 antifungal agent Drugs 0.000 description 1
- 239000003429 antifungal agent Substances 0.000 description 1
- 239000000739 antihistaminic agent Substances 0.000 description 1
- 229940125715 antihistaminic agent Drugs 0.000 description 1
- 229960005475 antiinfective agent Drugs 0.000 description 1
- 239000002579 antinauseant Substances 0.000 description 1
- 229940045985 antineoplastic platinum compound Drugs 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 235000006708 antioxidants Nutrition 0.000 description 1
- 239000003908 antipruritic agent Substances 0.000 description 1
- 239000000164 antipsychotic agent Substances 0.000 description 1
- 239000002221 antipyretic Substances 0.000 description 1
- 229940125716 antipyretic agent Drugs 0.000 description 1
- 229940124575 antispasmodic agent Drugs 0.000 description 1
- 239000003443 antiviral agent Substances 0.000 description 1
- 239000002249 anxiolytic agent Substances 0.000 description 1
- 230000000949 anxiolytic effect Effects 0.000 description 1
- 229940005530 anxiolytics Drugs 0.000 description 1
- 125000001204 arachidyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 235000010323 ascorbic acid Nutrition 0.000 description 1
- 239000011668 ascorbic acid Substances 0.000 description 1
- 229960005070 ascorbic acid Drugs 0.000 description 1
- 238000011914 asymmetric synthesis Methods 0.000 description 1
- HONIICLYMWZJFZ-UHFFFAOYSA-N azetidine Chemical compound C1CNC1 HONIICLYMWZJFZ-UHFFFAOYSA-N 0.000 description 1
- 125000004069 aziridinyl group Chemical group 0.000 description 1
- SRSXLGNVWSONIS-UHFFFAOYSA-N benzenesulfonic acid Chemical compound OS(=O)(=O)C1=CC=CC=C1 SRSXLGNVWSONIS-UHFFFAOYSA-N 0.000 description 1
- 229940092714 benzenesulfonic acid Drugs 0.000 description 1
- 125000003785 benzimidazolyl group Chemical group N1=C(NC2=C1C=CC=C2)* 0.000 description 1
- 125000005605 benzo group Chemical group 0.000 description 1
- ZYGHJZDHTFUPRJ-UHFFFAOYSA-N benzo-alpha-pyrone Natural products C1=CC=C2OC(=O)C=CC2=C1 ZYGHJZDHTFUPRJ-UHFFFAOYSA-N 0.000 description 1
- 125000005873 benzo[d]thiazolyl group Chemical group 0.000 description 1
- 125000002047 benzodioxolyl group Chemical group O1OC(C2=C1C=CC=C2)* 0.000 description 1
- 235000010233 benzoic acid Nutrition 0.000 description 1
- 125000004196 benzothienyl group Chemical group S1C(=CC2=C1C=CC=C2)* 0.000 description 1
- 150000003939 benzylamines Chemical class 0.000 description 1
- 125000000649 benzylidene group Chemical group [H]C(=[*])C1=C([H])C([H])=C([H])C([H])=C1[H] 0.000 description 1
- 229940125388 beta agonist Drugs 0.000 description 1
- 239000002876 beta blocker Substances 0.000 description 1
- 229940097320 beta blocking agent Drugs 0.000 description 1
- 229940000635 beta-alanine Drugs 0.000 description 1
- 238000005842 biochemical reaction Methods 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 229920002988 biodegradable polymer Polymers 0.000 description 1
- 239000004621 biodegradable polymer Substances 0.000 description 1
- 230000008436 biogenesis Effects 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 229960000074 biopharmaceutical Drugs 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 1
- 230000037396 body weight Effects 0.000 description 1
- 239000002617 bone density conservation agent Substances 0.000 description 1
- 230000008468 bone growth Effects 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 125000006309 butyl amino group Chemical group 0.000 description 1
- 210000004900 c-terminal fragment Anatomy 0.000 description 1
- 239000000480 calcium channel blocker Substances 0.000 description 1
- 125000003917 carbamoyl group Chemical group [H]N([H])C(*)=O 0.000 description 1
- 125000002837 carbocyclic group Chemical group 0.000 description 1
- 150000001722 carbon compounds Chemical class 0.000 description 1
- OKTJSMMVPCPJKN-BJUDXGSMSA-N carbon-11 Chemical compound [11C] OKTJSMMVPCPJKN-BJUDXGSMSA-N 0.000 description 1
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 description 1
- 125000002843 carboxylic acid group Chemical group 0.000 description 1
- 239000003054 catalyst Substances 0.000 description 1
- 230000001364 causal effect Effects 0.000 description 1
- 230000009920 chelation Effects 0.000 description 1
- PBAYDYUZOSNJGU-UHFFFAOYSA-N chelidonic acid Natural products OC(=O)C1=CC(=O)C=C(C(O)=O)O1 PBAYDYUZOSNJGU-UHFFFAOYSA-N 0.000 description 1
- 150000005829 chemical entities Chemical class 0.000 description 1
- 238000001311 chemical methods and process Methods 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 229960004926 chlorobutanol Drugs 0.000 description 1
- 230000000718 cholinopositive effect Effects 0.000 description 1
- 230000001684 chronic effect Effects 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 238000001977 collision-induced dissociation tandem mass spectrometry Methods 0.000 description 1
- 238000013329 compounding Methods 0.000 description 1
- 229940037530 cough and cold preparations Drugs 0.000 description 1
- 235000001671 coumarin Nutrition 0.000 description 1
- 125000001047 cyclobutenyl group Chemical group C1(=CCC1)* 0.000 description 1
- 125000001995 cyclobutyl group Chemical group [H]C1([H])C([H])([H])C([H])(*)C1([H])[H] 0.000 description 1
- 125000003678 cyclohexadienyl group Chemical group C1(=CC=CCC1)* 0.000 description 1
- 125000000596 cyclohexenyl group Chemical group C1(=CCCCC1)* 0.000 description 1
- 125000000113 cyclohexyl group Chemical group [H]C1([H])C([H])([H])C([H])([H])C([H])(*)C([H])([H])C1([H])[H] 0.000 description 1
- 125000000058 cyclopentadienyl group Chemical group C1(=CC=CC1)* 0.000 description 1
- 125000002433 cyclopentenyl group Chemical group C1(=CCCC1)* 0.000 description 1
- 125000001511 cyclopentyl group Chemical group [H]C1([H])C([H])([H])C([H])([H])C([H])(*)C1([H])[H] 0.000 description 1
- 125000000298 cyclopropenyl group Chemical group [H]C1=C([H])C1([H])* 0.000 description 1
- 125000001559 cyclopropyl group Chemical group [H]C1([H])C([H])([H])C1([H])* 0.000 description 1
- 229940127089 cytotoxic agent Drugs 0.000 description 1
- 239000002254 cytotoxic agent Substances 0.000 description 1
- 231100000599 cytotoxic agent Toxicity 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 239000000850 decongestant Substances 0.000 description 1
- 229940124581 decongestants Drugs 0.000 description 1
- 206010061428 decreased appetite Diseases 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 125000002704 decyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 238000001212 derivatisation Methods 0.000 description 1
- 238000011033 desalting Methods 0.000 description 1
- 125000004663 dialkyl amino group Chemical group 0.000 description 1
- 125000005959 diazepanyl group Chemical group 0.000 description 1
- 125000004915 dibutylamino group Chemical group C(CCC)N(CCCC)* 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- 125000001664 diethylamino group Chemical group [H]C([H])([H])C([H])([H])N(*)C([H])([H])C([H])([H])[H] 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 229940079919 digestives enzyme preparation Drugs 0.000 description 1
- 239000012470 diluted sample Substances 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 125000002147 dimethylamino group Chemical group [H]C([H])([H])N(*)C([H])([H])[H] 0.000 description 1
- 230000003292 diminished effect Effects 0.000 description 1
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 1
- 125000004914 dipropylamino group Chemical group C(CC)N(CCC)* 0.000 description 1
- 229940042399 direct acting antivirals protease inhibitors Drugs 0.000 description 1
- 239000002270 dispersing agent Substances 0.000 description 1
- 239000002934 diuretic Substances 0.000 description 1
- 229940030606 diuretics Drugs 0.000 description 1
- 125000003438 dodecyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 229960004679 doxorubicin Drugs 0.000 description 1
- 238000009509 drug development Methods 0.000 description 1
- 230000002526 effect on cardiovascular system Effects 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- ZSWFCLXCOIISFI-UHFFFAOYSA-N endo-cyclopentadiene Natural products C1C=CC=C1 ZSWFCLXCOIISFI-UHFFFAOYSA-N 0.000 description 1
- 150000002085 enols Chemical group 0.000 description 1
- 230000001973 epigenetic effect Effects 0.000 description 1
- 230000032050 esterification Effects 0.000 description 1
- 238000005886 esterification reaction Methods 0.000 description 1
- BEFDCLMNVWHSGT-UHFFFAOYSA-N ethenylcyclopentane Chemical compound C=CC1CCCC1 BEFDCLMNVWHSGT-UHFFFAOYSA-N 0.000 description 1
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 1
- LVGKNOAMLMIIKO-QXMHVHEDSA-N ethyl oleate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OCC LVGKNOAMLMIIKO-QXMHVHEDSA-N 0.000 description 1
- 229940093471 ethyl oleate Drugs 0.000 description 1
- 125000000031 ethylamino group Chemical group [H]C([H])([H])C([H])([H])N([H])[*] 0.000 description 1
- 230000029142 excretion Effects 0.000 description 1
- 238000004880 explosion Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 210000003754 fetus Anatomy 0.000 description 1
- YCKRFDGAMUMZLT-BJUDXGSMSA-N fluorine-18 atom Chemical compound [18F] YCKRFDGAMUMZLT-BJUDXGSMSA-N 0.000 description 1
- 108010008221 formate C-acetyltransferase Proteins 0.000 description 1
- 238000004108 freeze drying Methods 0.000 description 1
- 108700026298 freyrasin Proteins 0.000 description 1
- JKFAIQOWCVVSKC-UHFFFAOYSA-N furazan Chemical compound C=1C=NON=1 JKFAIQOWCVVSKC-UHFFFAOYSA-N 0.000 description 1
- 125000002541 furyl group Chemical group 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 208000030304 gastrointestinal bleeding Diseases 0.000 description 1
- 239000008273 gelatin Substances 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 239000003193 general anesthetic agent Substances 0.000 description 1
- 229940005494 general anesthetics Drugs 0.000 description 1
- 229960002518 gentamicin Drugs 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 229960003180 glutathione Drugs 0.000 description 1
- 244000000058 gram-negative pathogen Species 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 125000004438 haloalkoxy group Chemical group 0.000 description 1
- 125000003187 heptyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 125000004051 hexyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 125000001245 hexylamino group Chemical group [H]N([*])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C(C([H])([H])[H])([H])[H] 0.000 description 1
- 238000010505 homolytic fission reaction Methods 0.000 description 1
- 235000003642 hunger Nutrition 0.000 description 1
- 150000004677 hydrates Chemical class 0.000 description 1
- XMBWDFGMSWQBCA-UHFFFAOYSA-N hydrogen iodide Chemical compound I XMBWDFGMSWQBCA-UHFFFAOYSA-N 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 125000004029 hydroxymethyl group Chemical group [H]OC([H])([H])* 0.000 description 1
- 239000003326 hypnotic agent Substances 0.000 description 1
- 230000000147 hypnotic effect Effects 0.000 description 1
- 125000002632 imidazolidinyl group Chemical group 0.000 description 1
- 125000002636 imidazolinyl group Chemical group 0.000 description 1
- 125000004857 imidazopyridinyl group Chemical group N1C(=NC2=C1C=CC=N2)* 0.000 description 1
- 230000005746 immune checkpoint blockade Effects 0.000 description 1
- 230000002163 immunogen Effects 0.000 description 1
- 229940125721 immunosuppressive agent Drugs 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 238000000126 in silico method Methods 0.000 description 1
- 238000000099 in vitro assay Methods 0.000 description 1
- 229910052738 indium Inorganic materials 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 238000013383 initial experiment Methods 0.000 description 1
- 239000003999 initiator Substances 0.000 description 1
- 229940124524 integrase inhibitor Drugs 0.000 description 1
- 239000002850 integrase inhibitor Substances 0.000 description 1
- 238000001361 intraarterial administration Methods 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- XMBWDFGMSWQBCA-NJFSPNSNSA-N iodane Chemical compound [129IH] XMBWDFGMSWQBCA-NJFSPNSNSA-N 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- UWKQSNNFCGGAFS-XIFFEERXSA-N irinotecan Chemical compound C1=C2C(CC)=C3CN(C(C4=C([C@@](C(=O)OC4)(O)CC)C=4)=O)C=4C3=NC2=CC=C1OC(=O)N(CC1)CCC1N1CCCCC1 UWKQSNNFCGGAFS-XIFFEERXSA-N 0.000 description 1
- 229960004768 irinotecan Drugs 0.000 description 1
- 125000006316 iso-butyl amino group Chemical group [H]N(*)C([H])([H])C([H])(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 125000000959 isobutyl group Chemical group [H]C([H])([H])C([H])(C([H])([H])[H])C([H])([H])* 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 125000001972 isopentyl group Chemical group [H]C([H])([H])C([H])(C([H])([H])[H])C([H])([H])C([H])([H])* 0.000 description 1
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 125000002183 isoquinolinyl group Chemical group C1(=NC=CC2=CC=CC=C12)* 0.000 description 1
- ZLTPDFXIESTBQG-UHFFFAOYSA-N isothiazole Chemical compound C=1C=NSC=1 ZLTPDFXIESTBQG-UHFFFAOYSA-N 0.000 description 1
- 125000004628 isothiazolidinyl group Chemical group S1N(CCC1)* 0.000 description 1
- 125000001786 isothiazolyl group Chemical group 0.000 description 1
- 239000007951 isotonicity adjuster Substances 0.000 description 1
- CTAPFRYPJLPFDF-UHFFFAOYSA-N isoxazole Chemical compound C=1C=NOC=1 CTAPFRYPJLPFDF-UHFFFAOYSA-N 0.000 description 1
- 125000003965 isoxazolidinyl group Chemical group 0.000 description 1
- 125000000842 isoxazolyl group Chemical group 0.000 description 1
- 125000000468 ketone group Chemical group 0.000 description 1
- 229940043355 kinase inhibitor Drugs 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- MJIHNNLFOKEZEW-UHFFFAOYSA-N lansoprazole Chemical compound CC1=C(OCC(F)(F)F)C=CN=C1CS(=O)C1=NC2=CC=CC=C2N1 MJIHNNLFOKEZEW-UHFFFAOYSA-N 0.000 description 1
- 239000000787 lecithin Substances 0.000 description 1
- 235000010445 lecithin Nutrition 0.000 description 1
- 229940067606 lecithin Drugs 0.000 description 1
- 125000002463 lignoceryl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 244000144972 livestock Species 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 239000003589 local anesthetic agent Substances 0.000 description 1
- 229960005015 local anesthetics Drugs 0.000 description 1
- LGAILEFNHXWAJP-BMEPFDOTSA-N macrocycle Chemical group N([C@H]1[C@@H](C)CC)C(=O)C(N=2)=CSC=2CNC(=O)C(=C(O2)C)N=C2[C@H]([C@@H](C)CC)NC(=O)C2=CSC1=N2 LGAILEFNHXWAJP-BMEPFDOTSA-N 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 1
- UZKWTJUDCOPSNM-UHFFFAOYSA-N methoxybenzene Substances CCCCOC=C UZKWTJUDCOPSNM-UHFFFAOYSA-N 0.000 description 1
- 239000004530 micro-emulsion Substances 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 210000001589 microsome Anatomy 0.000 description 1
- 230000009456 molecular mechanism Effects 0.000 description 1
- 125000002950 monocyclic group Chemical group 0.000 description 1
- 125000006682 monohaloalkyl group Chemical group 0.000 description 1
- 125000002757 morpholinyl group Chemical group 0.000 description 1
- 239000000472 muscarinic agonist Substances 0.000 description 1
- 239000003149 muscarinic antagonist Substances 0.000 description 1
- 229940035363 muscle relaxants Drugs 0.000 description 1
- 239000003158 myorelaxant agent Substances 0.000 description 1
- 125000001421 myristyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- SYSQUGFVNFXIIT-UHFFFAOYSA-N n-[4-(1,3-benzoxazol-2-yl)phenyl]-4-nitrobenzenesulfonamide Chemical class C1=CC([N+](=O)[O-])=CC=C1S(=O)(=O)NC1=CC=C(C=2OC3=CC=CC=C3N=2)C=C1 SYSQUGFVNFXIIT-UHFFFAOYSA-N 0.000 description 1
- 125000000740 n-pentyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 125000004123 n-propyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 239000006199 nebulizer Substances 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 125000001971 neopentyl group Chemical group [H]C([*])([H])C(C([H])([H])[H])(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 239000003176 neuroleptic agent Substances 0.000 description 1
- 239000002698 neuron blocking agent Substances 0.000 description 1
- FEMOMIGRRWSMCU-UHFFFAOYSA-N ninhydrin Chemical group C1=CC=C2C(=O)C(O)(O)C(=O)C2=C1 FEMOMIGRRWSMCU-UHFFFAOYSA-N 0.000 description 1
- 229910017604 nitric acid Inorganic materials 0.000 description 1
- 150000002825 nitriles Chemical class 0.000 description 1
- 125000004433 nitrogen atom Chemical group N* 0.000 description 1
- 229940042402 non-nucleoside reverse transcriptase inhibitor Drugs 0.000 description 1
- 229940021182 non-steroidal anti-inflammatory drug Drugs 0.000 description 1
- 150000004690 nonahydrates Chemical class 0.000 description 1
- 239000012457 nonaqueous media Substances 0.000 description 1
- 239000002726 nonnucleoside reverse transcriptase inhibitor Substances 0.000 description 1
- 125000001400 nonyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- SNQQPOLDUKLAAF-UHFFFAOYSA-N nonylphenol Chemical compound CCCCCCCCCC1=CC=CC=C1O SNQQPOLDUKLAAF-UHFFFAOYSA-N 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- OIPZNTLJVJGRCI-UHFFFAOYSA-M octadecanoyloxyaluminum;dihydrate Chemical compound O.O.CCCCCCCCCCCCCCCCCC(=O)O[Al] OIPZNTLJVJGRCI-UHFFFAOYSA-M 0.000 description 1
- 125000002347 octyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 235000008390 olive oil Nutrition 0.000 description 1
- 239000004006 olive oil Substances 0.000 description 1
- 150000002895 organic esters Chemical class 0.000 description 1
- 238000006053 organic reaction Methods 0.000 description 1
- 125000001181 organosilyl group Chemical group [SiH3]* 0.000 description 1
- 230000003204 osmotic effect Effects 0.000 description 1
- WCPAKWJPBJAGKN-UHFFFAOYSA-N oxadiazole Chemical compound C1=CON=N1 WCPAKWJPBJAGKN-UHFFFAOYSA-N 0.000 description 1
- 125000000160 oxazolidinyl group Chemical group 0.000 description 1
- 125000002971 oxazolyl group Chemical group 0.000 description 1
- 125000000466 oxiranyl group Chemical group 0.000 description 1
- 238000010979 pH adjustment Methods 0.000 description 1
- 229960001592 paclitaxel Drugs 0.000 description 1
- 238000002638 palliative care Methods 0.000 description 1
- 125000000913 palmityl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 238000007911 parenteral administration Methods 0.000 description 1
- 125000004894 pentylamino group Chemical group C(CCCC)N* 0.000 description 1
- 230000016446 peptide cross-linking Effects 0.000 description 1
- 125000005010 perfluoroalkyl group Chemical group 0.000 description 1
- 239000000546 pharmaceutical excipient Substances 0.000 description 1
- 229940127557 pharmaceutical product Drugs 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- 229960003742 phenol Drugs 0.000 description 1
- 229960005323 phenoxyethanol Drugs 0.000 description 1
- COLNVLDHVKWLRT-RALIUCGRSA-N phenylalanine-d5 Chemical compound [2H]C1=C([2H])C([2H])=C(CC(N)C(O)=O)C([2H])=C1[2H] COLNVLDHVKWLRT-RALIUCGRSA-N 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical class [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 239000002953 phosphate buffered saline Substances 0.000 description 1
- 150000003013 phosphoric acid derivatives Chemical class 0.000 description 1
- 239000003757 phosphotransferase inhibitor Substances 0.000 description 1
- 230000000704 physical effect Effects 0.000 description 1
- 125000004193 piperazinyl group Chemical group 0.000 description 1
- 229910052697 platinum Inorganic materials 0.000 description 1
- 150000003058 platinum compounds Chemical class 0.000 description 1
- 229920001983 poloxamer Polymers 0.000 description 1
- 229960000502 poloxamer Drugs 0.000 description 1
- 229920001748 polybutylene Polymers 0.000 description 1
- 125000003367 polycyclic group Chemical group 0.000 description 1
- 125000004585 polycyclic heterocycle group Chemical group 0.000 description 1
- 125000006684 polyhaloalkyl group Polymers 0.000 description 1
- 229920005862 polyol Polymers 0.000 description 1
- 150000003077 polyols Chemical class 0.000 description 1
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 1
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 1
- 239000000244 polyoxyethylene sorbitan monooleate Substances 0.000 description 1
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 description 1
- 229920001451 polypropylene glycol Polymers 0.000 description 1
- 229940068977 polysorbate 20 Drugs 0.000 description 1
- 229920000053 polysorbate 80 Polymers 0.000 description 1
- 229940068968 polysorbate 80 Drugs 0.000 description 1
- 239000004810 polytetrafluoroethylene Substances 0.000 description 1
- 229920001343 polytetrafluoroethylene Polymers 0.000 description 1
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 1
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 1
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 159000000001 potassium salts Chemical class 0.000 description 1
- 230000019525 primary metabolic process Effects 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- LJPYJRMMPVFEKR-UHFFFAOYSA-N prop-2-ynylurea Chemical compound NC(=O)NCC#C LJPYJRMMPVFEKR-UHFFFAOYSA-N 0.000 description 1
- 125000006308 propyl amino group Chemical group 0.000 description 1
- 239000003207 proteasome inhibitor Substances 0.000 description 1
- 239000003531 protein hydrolysate Substances 0.000 description 1
- 239000000612 proton pump inhibitor Substances 0.000 description 1
- 229940126409 proton pump inhibitor Drugs 0.000 description 1
- 239000003368 psychostimulant agent Substances 0.000 description 1
- 150000003217 pyrazoles Chemical class 0.000 description 1
- 125000003072 pyrazolidinyl group Chemical group 0.000 description 1
- 125000002755 pyrazolinyl group Chemical group 0.000 description 1
- PBMFSQRYOILNGV-UHFFFAOYSA-N pyridazine Chemical compound C1=CC=NN=C1 PBMFSQRYOILNGV-UHFFFAOYSA-N 0.000 description 1
- 125000000719 pyrrolidinyl group Chemical group 0.000 description 1
- 125000000168 pyrrolyl group Chemical group 0.000 description 1
- 108010060146 pyruvate formate-lyase activating enzyme Proteins 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 125000002294 quinazolinyl group Chemical group N1=C(N=CC2=CC=CC=C12)* 0.000 description 1
- 125000001567 quinoxalinyl group Chemical group N1=C(C=NC2=CC=CC=C12)* 0.000 description 1
- 230000006340 racemization Effects 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 230000008707 rearrangement Effects 0.000 description 1
- 239000012925 reference material Substances 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000027756 respiratory electron transport chain Effects 0.000 description 1
- 230000028617 response to DNA damage stimulus Effects 0.000 description 1
- 102200081810 rs5952410 Human genes 0.000 description 1
- 229910052594 sapphire Inorganic materials 0.000 description 1
- 239000010980 sapphire Substances 0.000 description 1
- 229930195734 saturated hydrocarbon Natural products 0.000 description 1
- 238000009738 saturating Methods 0.000 description 1
- 229910052706 scandium Inorganic materials 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 125000002914 sec-butyl group Chemical group [H]C([H])([H])C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 238000011935 selective methylation Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 230000035946 sexual desire Effects 0.000 description 1
- 229910052710 silicon Inorganic materials 0.000 description 1
- 239000000377 silicon dioxide Substances 0.000 description 1
- 229940126586 small molecule drug Drugs 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- FHHPUSMSKHSNKW-SMOYURAASA-M sodium deoxycholate Chemical compound [Na+].C([C@H]1CC2)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC([O-])=O)C)[C@@]2(C)[C@@H](O)C1 FHHPUSMSKHSNKW-SMOYURAASA-M 0.000 description 1
- 239000008247 solid mixture Substances 0.000 description 1
- 235000010199 sorbic acid Nutrition 0.000 description 1
- 239000004334 sorbic acid Substances 0.000 description 1
- 229940075582 sorbic acid Drugs 0.000 description 1
- 239000007921 spray Substances 0.000 description 1
- 230000037351 starvation Effects 0.000 description 1
- 239000008223 sterile water Substances 0.000 description 1
- 239000003206 sterilizing agent Substances 0.000 description 1
- 239000000021 stimulant Substances 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 125000005017 substituted alkenyl group Chemical group 0.000 description 1
- 125000003107 substituted aryl group Chemical group 0.000 description 1
- NRQODXJXWWUXFE-LYQFAKRDSA-N subtilosin a Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H]1N(C(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)CNC(=O)[C@H](CC=2C3=CC=CC=C3NC=2)NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)CN)[C@@H](C)O)[C@@H](C)CC)C(C)C)CCC1 NRQODXJXWWUXFE-LYQFAKRDSA-N 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 150000003871 sulfonates Chemical class 0.000 description 1
- 150000003457 sulfones Chemical class 0.000 description 1
- 125000000472 sulfonyl group Chemical group *S(*)(=O)=O 0.000 description 1
- 150000003462 sulfoxides Chemical class 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000000153 supplemental effect Effects 0.000 description 1
- 230000003319 supportive effect Effects 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
- RCINICONZNJXQF-MZXODVADSA-N taxol Chemical compound O([C@@H]1[C@@]2(C[C@@H](C(C)=C(C2(C)C)[C@H](C([C@]2(C)[C@@H](O)C[C@H]3OC[C@]3([C@H]21)OC(C)=O)=O)OC(=O)C)OC(=O)[C@H](O)[C@@H](NC(=O)C=1C=CC=CC=1)C=1C=CC=CC=1)O)C(=O)C1=CC=CC=C1 RCINICONZNJXQF-MZXODVADSA-N 0.000 description 1
- 125000003718 tetrahydrofuranyl group Chemical group 0.000 description 1
- 125000001412 tetrahydropyranyl group Chemical group 0.000 description 1
- 125000004632 tetrahydrothiopyranyl group Chemical group S1C(CCCC1)* 0.000 description 1
- 125000003831 tetrazolyl group Chemical group 0.000 description 1
- VLLMWSRANPNYQX-UHFFFAOYSA-N thiadiazole Chemical compound C1=CSN=N1.C1=CSN=N1 VLLMWSRANPNYQX-UHFFFAOYSA-N 0.000 description 1
- 125000001113 thiadiazolyl group Chemical group 0.000 description 1
- 125000001984 thiazolidinyl group Chemical group 0.000 description 1
- 125000000335 thiazolyl group Chemical group 0.000 description 1
- 125000001730 thiiranyl group Chemical group 0.000 description 1
- RTKIYNMVFMVABJ-UHFFFAOYSA-L thimerosal Chemical compound [Na+].CC[Hg]SC1=CC=CC=C1C([O-])=O RTKIYNMVFMVABJ-UHFFFAOYSA-L 0.000 description 1
- 229940033663 thimerosal Drugs 0.000 description 1
- 125000004001 thioalkyl group Chemical group 0.000 description 1
- HNKJADCVZUBCPG-UHFFFAOYSA-N thioanisole Chemical compound CSC1=CC=CC=C1 HNKJADCVZUBCPG-UHFFFAOYSA-N 0.000 description 1
- 229930192474 thiophene Natural products 0.000 description 1
- LXRUAYBIUSUULX-HRFVKAFMSA-N threo-3-methyl-L-aspartic acid Chemical compound OC(=O)[C@@H](C)[C@H](N)C(O)=O LXRUAYBIUSUULX-HRFVKAFMSA-N 0.000 description 1
- SWISGLLTKITYOP-UHFFFAOYSA-N thuricin cd Chemical compound N1C(=O)CNC(=O)C(C(C)CC)NC(=O)CNC(=O)C(CCC(O)=O)NC(=O)C(CO)NC(=O)C(C(C)CC)NC(=O)C(C(C)C)NC(=O)C(NC(=O)C(CO)NC(=O)CNC(=O)C(C(C)CC)NC2=O)CSC1(CO)C(=O)NC(CC(C)C)C(=O)NC(C(C)C)C(=O)NCC(=O)NC(C(NC(C)C(=O)N1)=O)(C(C)O)SCC2NC(=O)CNC(=O)C(C(C)CC)NC(=O)C(C(C)C)NC(=O)C(NC(=O)C(C)NC(=O)C(C)NC(=O)C(CC(N)=O)NC(=O)CN)CSC(C(=O)NC(CC(C)C)C(=O)NCC(O)=O)(C(C)O)NC(=O)C1CC1=CC=CC=C1 SWISGLLTKITYOP-UHFFFAOYSA-N 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 238000003354 tissue distribution assay Methods 0.000 description 1
- AOBORMOPSGHCAX-DGHZZKTQSA-N tocofersolan Chemical compound OCCOC(=O)CCC(=O)OC1=C(C)C(C)=C2O[C@](CCC[C@H](C)CCC[C@H](C)CCCC(C)C)(C)CCC2=C1C AOBORMOPSGHCAX-DGHZZKTQSA-N 0.000 description 1
- 229960000984 tocofersolan Drugs 0.000 description 1
- 238000011200 topical administration Methods 0.000 description 1
- 229910021654 trace metal Inorganic materials 0.000 description 1
- 239000003204 tranquilizing agent Substances 0.000 description 1
- 230000002936 tranquilizing effect Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000006276 transfer reaction Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 150000003852 triazoles Chemical class 0.000 description 1
- 125000001425 triazolyl group Chemical group 0.000 description 1
- YNJBWRMUSHSURL-UHFFFAOYSA-N trichloroacetic acid Chemical compound OC(=O)C(Cl)(Cl)Cl YNJBWRMUSHSURL-UHFFFAOYSA-N 0.000 description 1
- ITMCEJHCFYSIIV-UHFFFAOYSA-M triflate Chemical compound [O-]S(=O)(=O)C(F)(F)F ITMCEJHCFYSIIV-UHFFFAOYSA-M 0.000 description 1
- 125000000876 trifluoromethoxy group Chemical group FC(F)(F)O* 0.000 description 1
- 125000002023 trifluoromethyl group Chemical group FC(F)(F)* 0.000 description 1
- 239000001226 triphosphate Substances 0.000 description 1
- 235000011178 triphosphate Nutrition 0.000 description 1
- 125000002264 triphosphate group Chemical class [H]OP(=O)(O[H])OP(=O)(O[H])OP(=O)(O[H])O* 0.000 description 1
- 229910052722 tritium Inorganic materials 0.000 description 1
- 238000013024 troubleshooting Methods 0.000 description 1
- 229960005486 vaccine Drugs 0.000 description 1
- 229940124549 vasodilator Drugs 0.000 description 1
- 239000003071 vasodilator agent Substances 0.000 description 1
- 235000015112 vegetable and seed oil Nutrition 0.000 description 1
- 239000008158 vegetable oil Substances 0.000 description 1
- 238000011179 visual inspection Methods 0.000 description 1
- 238000007794 visualization technique Methods 0.000 description 1
- 230000037221 weight management Effects 0.000 description 1
- 239000000080 wetting agent Substances 0.000 description 1
- 238000003868 zero point energy Methods 0.000 description 1
- 239000002076 α-tocopherol Substances 0.000 description 1
- 235000004835 α-tocopherol Nutrition 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P21/00—Preparation of peptides or proteins
- C12P21/02—Preparation of peptides or proteins having a known sequence of two or more amino acids, e.g. glutathione
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P11/00—Preparation of sulfur-containing organic compounds
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y504/00—Intramolecular transferases (5.4)
- C12Y504/99—Intramolecular transferases (5.4) transferring other groups (5.4.99)
Definitions
- Peptide-based therapeutics are growing due to their unique structure and ability to be produced via solid phase peptide synthesis (SPPS) or by recombinant DNA.
- SPPS solid phase peptide synthesis
- Many peptide therapeutics contain a disulfide bond in their active form. Disulfide bonds are susceptible to breakage via biological reductants such as glutathione. Additionally, many peptide therapeutics contain bulky or basic amino acid side chains which render them vulnerable to degradation by proteases. These factors contribute to their short serum half-lives.
- RiPPs ribosomally synthesized and post-translationally modified peptides
- rSAM radical S- adenosylmethionine
- RiPP maturases have potential to offer biotechnological applications in peptide alterations such as thioether installation or peptide stapling.
- rSAM enzymes use a radical intermediate to complete chemical transformations involved in natural product biosynthesis as well as primary metabolism. These enzymes contain one or more iron-sulfur [Fe-S] clusters that are essential for function.
- the [4Fe-4S] rSAM (RS) cluster is coordinated by a canonical CxxxCxxC motif in the enzyme.
- the [4Fe-4S] RS cluster one iron coordinates the a-amino and a-carboxylate moieties of SAM.
- the RS cluster When the RS cluster is catalytically active, it transfers an electron to bound SAM. Either chemical or biological reducing systems are useful for product turnover because the RS cluster is catalytically inactive in the +2 state. Homolytic cleavage of SAM forms the reactive 5’-deoxyadenosyl radical (5'- dAdo, FIG. 1).
- 5'-dAdo' acts as a radical initiator by abstracting a hydrogen atom from a specific site on the substrate, thereby forming 5 ’-deoxyadenosine (5’-dAdoH, FIG. 1) and a theoretical RiPP radical intermediate.
- the formed substrate radical is useful for substrate maturation. While only one [4Fe-4S] cluster is needed for reductive SAM cleavage, many rSAM enzymes also employ one or more auxiliary iron-sulf ur clusters (ACs) for substrate turnover (Fig. 4c). These ACs are coordinated to the enzyme by cysteine-rich C- terminal extensions from the RS canonical motif (FIG. 2).
- rSAM maturases with multiple [Fe-S] clusters that form intrapeptide bonds between Ca, CB, or Cy on a specific residue and a cysteine thiol in the peptide substrate. Many of these thioether assembling maturases only form a single thioether in the mature peptide and are relatively slow in substrate turnover.
- the RS cluster in addition to at least one AC cluster is necessary for thioether formation.
- rSAM RiPP maturases also use a critical RiPP Recognition Element (RRE), that is responsible for binding to the leader sequence of the immature peptide (FIG. 2, left).
- RRE critical RiPP Recognition Element
- PapB is a RiPP maturase that catalyzes the insertion of six thioether crosslinks in the PapA polypeptide.
- PapB catalyzes the insertion of links between the Cys thiol and the b- carbon of the Asp, where the residues being linked are in a CX 3 D motif.
- the enzyme can also accept Glu at the modification site, and that PapB introduces the crosslink to the chemically analogous ⁇ -carbon.
- PapB has also been shown to accept a shorter minimal substrate (msPapA), which only has a single pair of crosslinking amino acids in the CX 3 D motif.
- PapB can catalyze both C13 and Cy thioether linkages, and forms six thioether linkages in the wild type PapA.
- PapB contains a RS cluster and two ACs (FIG. 2). Replacing Asp residue(s) to Glu residue(s) in WT-PapA still results in successful crosslinking. Both CB and Cy thioether linkages were confirmed by 2D NMR.
- the invention in one aspect, relates to methods of chemically modifying a peptide sequence to install one or more thioether linkages. Additionally disclosed are compounds formed using methods of chemically modifying a peptide sequence. Also disclosed are methods of chemically modifying a modified PapA sequence, and compounds formed using methods of chemically modifying a modified PapA sequence.
- Also disclosed are methods of chemically modifying a compound to install a thioether linkage the method comprising reacting the compound with PapB, wherein the compound has a structure represented by a formula: wherein o is 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein p is 1 or 2; wherein t is an integer from 0 to 500; wherein v is 1, 2, 3, 4, or 5; wherein A is S or Se; wherein Q 1 is a leader sequence; wherein Q 2 is a cleavable moiety; wherein R 1 is selected from -CO 2 H, -C(O)NHOH, - SO 2 NH 2 , -SO 2 NHC(O)CH 3 , -SO 3 H, -NHC(O)NHSO 2 CH 3 , -P(O)(OH) 2 , and a structure selected from: wherein R 4 is selected from hydrogen and methyl; wherein each occurrence of R 5 and R 5 , when present, is independently a residue of
- Also disclosed are methods of chemically modifying a peptide sequence to install a thioether linkage, the method comprising reacting the peptide sequence with PapB, wherein the peptide sequence comprises X-Y n -Z; wherein X is a penicillamine or an amino acid residue comprising a -SH group or an amino acid residue comprising a -SeH group; wherein Y is a series of amino acid residues where n 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein Z is an aspartic acid residue, a glutamic acid residue, a hydroxy-glutamic acid residue, 2-amino-3- (2H-tetrazol-5-yl)propanoic acid, or a carboxyl-functionalized amino acid residue; and wherein the peptide sequence is not PapA.
- Also disclosed are methods of chemically modifying a modified PapA sequence to install a thioether linkage, the method comprising reacting the modified PapA sequence with PapB; wherein the modified PapA sequence comprises Cys-Y n -Asp, wherein Y is a series of amino acid residues and n 0, 1, 2, 4, 5, 6, or 7.
- thioether compounds produced by a disclosed method.
- compositions comprising an effective amount of a disclosed compound or a pharmaceutically acceptable salt thereof and a pharmaceutically acceptable carrier.
- FIG. 1 is a schematic showing the proposed mechanism for beta-thioether crosslink.
- FIG. 2 is a scheme showing the predicted structure of PapB.
- FIG. 3 is a representative image showing SDS-PAGE analysis of reconstituted and purified PapB on a 12% crosslinked gel.
- FIG. 4A and FIG. 4B show representative crosslinking data of minimal substrate PapA (msPapA) with PapB.
- FIG. 4B shows the sequence of crosslinked PapB showing all of the observed b- and y- ions from tandem mass spectrometry.
- FIG. 5 is a representative plot showing the comparison of activity of PapB processing Y17W msPapA with dithionite or FldA/FPR/NADPH.
- FIG. 6 shows representative mass spectra demonstrating the effect of 2x and 4x enzyme concentration.
- FIG. 7 shows representative mass spectra demonstrating the effect of 2x and 4x peptide concentration.
- FIG. 8A-D show representative data for the Leader-C(X 0 -X 6 )D(Xm) crosslink formation.
- FIG. 8A is a scheme showing the unmodified and modified peptide sequence illustrate the thioether crosslink based on the msPapA modification reported by Precord et al.
- FIG. 8B shows representative mass spectra for CX 0 D-CX 2 D PapB modification.
- FIG. 8C shows representative mass spectra for CX 4 D-CX 6 D PapB modification.
- FIG. 8D are schematics showing that the expected 2 Da loss is seen in each b and y fragment in the tandem mass spectrometry.
- FIG. 9A-B show representative data for the iodoacetic acid treatment for CX 0 D. Specifically, FIG. 9 A shows representative mass spectra data for CX 0 D without PapB. FIG. 9B shows shows representative mass spectra data for CX 0 D with PapB.
- FIG. 10A-B show representative data for the iodoacetic acid treatment for CX 1 D. Specifically, FIG. 10A shows representative mass spectra data for CX 0 D without PapB. FIG. 10B shows shows representative mass spectra data for CX 1 D with PapB.
- FIG. 11A-B show representative data for the iodoacetic acid treatment for CX2D. Specifically, FIG. 11A shows representative mass spectra data for CX 0 D without PapB. FIG. 11B shows shows representative mass spectra data for CX 2 D with PapB. [0035] FIG. 12A-B show representative data for the iodoacetic acid treatment for CX 4 D. Specifically, FIG. 12A shows representative mass spectra data for CX 0 D without PapB. FIG. 12B shows shows representative mass spectra data for CX 4 D with PapB.
- FIG. 13A-B show representative data for the iodoacetic acid treatment for CX5D. Specifically, FIG. 13A shows representative mass spectra data for CX 0 D without PapB. FIG. 13B shows shows representative mass spectra data for CX 5 D with PapB.
- FIG. 14A-B show representative data for the iodoacetic acid treatment for CX 6 D. Specifically, FIG. 14A shows representative mass spectra data for CX 0 D without PapB. FIG. 14B shows shows representative mass spectra data for CX 6 D with PapB.
- FIG. 15A-C show representative data for leader extensions with single, nested, and in-line crosslinks.
- FIG. 15A are peptide schemes showing the apparent crosslink locations that remain consistent after distancing the thioether motifs from the leader peptide.
- FIG. 15B are representative mass spectra showing the isotopic distributions of the peptides; a shift of 2 Da in the case of single thioether motifs or 4 Da with double thioether motifs upon addition of PapB.
- FIG. 13C are schematics showing a representation of the tandem mass spectrometry results.
- FIG. 16A-B show representative data for the iodoacetic acid treatment for Leader- AAACSANDA.
- FIG. 16A shows representative mass spectra data for Leader- AAACSANDA without PapB.
- FIG. 16B shows shows representative mass spectra data for Leader-AAACSANDA with PapB.
- FIG. 17A-B show representative data for the iodoacetic acid treatment for Leader- AAACSANDACSANDA.
- FIG. 17A shows representative mass spectra data for Leader- AAACSANDACSANDA without PapB.
- FIG. 17B shows shows representative mass spectra data for Leader-AAACSANDACSANDA with PapB.
- FIG. 18A-B show representative data for the iodoacetic acid treatment for Leader- AAACSACDAADA.
- FIG. 18A shows representative mass spectra data for Leader- AAACSACDAADA without PapB.
- FIG. 18B shows shows representative mass spectra data for Leader- AAACSACDAAD A with PapB.
- FIG. 19A-B show representative data for the iodoacetic acid treatment for Leader- AAAASACDAADA.
- FIG. 19A shows representative mass spectra data for Leader- AAAASACDAADA without PapB.
- FIG. 19B shows shows representative mass spectra data for Leader- AAAASACDAAD A with PapB.
- FIG. 20A-B show representative data for the iodoacetic acid treatment for Leader- AAACSAADAADA.
- FIG. 20A shows representative mass spectra data for Leader- AAACSAADAADA without PapB.
- FIG. 20B shows shows representative mass spectra data for Leader- AAACSAADAADA with PapB.
- FIG. 21A-C show representative data showing that PapB produces two thioether crosslinks in the AMK-1057 precursor peptide in vitro.
- FIG. 21 A is a scheme showing that the AMK-1057 precursor peptide contains the leader peptide sequence, a TEV protease recognition sequence, and two CX 3 E motifs.
- FIG. 21B shows representative mass spectra demonstrating that upon reaction with PapB in an in vitro assay, two crosslinks form. Additional processing with TEV protease produces the expected dicyclized peptide.
- FIG. 21C is a scheme demonstrating the topology of the bonds as confirmed by tandem mass spectrometry.
- FIG. 22A-C show representative data for PapB crosslinking D C and D D msPapA Peptides.
- FIG. 22A is a scheme showing the thioether crosslink.
- FIG. 22B are representative mass spectra showing formation of the thioether crosslinks.
- FIG. 22C is a scheme demonstrating the topology of the bonds as confirmed by mass spectrometry.
- FIG. 23A-B shows representative data for the iodoacetic acid treatment for Leader- D CSANDA.
- FIG. 23A shows representative mass spectra data for Leader- D CSANDA without PapB.
- FIG. 23B shows shows representative mass spectra data for Leader- D CSANDA with PapB.
- FIG. 24A-B show representative data for the iodoacetic acid treatment for Leader- CSAN D DA.
- FIG. 24A shows representative mass spectra data for Leader-CSAN D DA without PapB.
- FIG. 24B shows shows representative mass spectra data for Leader- CSAN D DA with PapB.
- FIG. 25A-B show representative data for the iodoacetic acid treatment for Leader- D CSAN D DA.
- FIG. 25A shows representative mass spectra data for Leader- D CSAN D DA without PapB.
- FIG. 25B shows shows representative mass spectra data for Leader- D CSAN D DA with PapB.
- FIG. 26A-B show representative data for msPapA “DSANCA” peptides.
- FIG. 26A shows representative mass spectra data for Leader-DSANCA and Leader- D DSANCA with and without PapB.
- FIG. 26B shows representative mass spectra data for Leader-DSAN D CA and Leader- D DSAN D CA with and without PapB.
- FIG. 27A-E show representative data for synthesis of an octreotide analog.
- FIG. 27A is a structure of the FDA-approved therapeutic octreotide.
- FIG. 27B is a schematic description of the designed peptides and the expected sites of modification upon modification with PapB. A TEV cleavage site is included in the second peptide to allow for liberation of the modified peptide sequence by PapB.
- FIG. 27C is representative mass spectra data showing the isotopic envelope of these peptides indicating that a mixed population of processed and unprocessed peptides are present after modification by PapB.
- FIG. 27D is representative mass spectra data showing that the TEV-cleaved peptide isotopic envelope reveals the anticipated 2 Da mass shift.
- FIG. 27E is a scheme showing the anticipated loss of 2 Da in each y fragment after the C and in each b fragment after the C-terminal E as confirmed by tandem mass spectrometry.
- FIG. 28 is a structure of the synthesized thioether-linked octreotide analog.
- FIG. 29 is a scheme providing a brief summary of successful PapB-mediated thioether crosslinks in tested peptide sequences.
- FIG. 30 shows representative data demonstrating that the leader peptide sequence is not required for modification via PapB.
- FIG. 31 shows representative mass spectrometry data for a one-to-one interpeptide crosslink as well as polymerization-like addition of X-mer subunits.
- FIG. 32 shows representative mass spectrometry results for a general assay peptide before and after PapB, demonstrating the presence of interpeptide products.
- FIG. 33 shows representative mass spectra data showing evidence of simple and complex mass envelopes.
- FIG. 34 is a schematic showing the experimental approaches to creating modified insulin analogs using PapB.
- FIG. 35 shows representative mass spectra data for the synthesized insulin analogs.
- FIG. 36 shows representative mass spectra data for crosslinking in peptides containing EneA.
- FIG. 37 shows representative tandem mass spectrometry data for dAdo + D24EneA msPapA adduct.
- FIG. 38 shows representative data, including mass spectrometry and EXAFS, for crosslinking in selenopeptides.
- FIG. 39 shows representative tandem mass spectrometry data for C19U msPapA.
- FIG. 40 shows representative mass spectrometry data demonstrating that aspartic acid may be replaced with glutamic acid, and cysteine may be replaced with homocysteine. Crosslinking is observed.
- FIG. 41 shows representative mass spectrometry data demonstrating that ⁇ -amino acids may be incorporated in the peptide. Crosslinking is observed.
- FIG. 42 shows representative mass spectrometry data demonstrating that no crosslinking was observed when altering the position of the C and D residues.
- FIG. 43 shows representative data demonstrating the effect of components in the reduction system employed.
- FIG. 44 is a schematic summarizing the findings of experiments conducted using prereduced PapB.
- FIG. 45 is a scatterplot showing representative data of %product as a function of time for prereduced PapB experiments.
- FIG. 46 shows representative data, including photodiode array chromatography, UV- Vis, and extracted ion chromatography, for PapB with and without reductant, as well as prereduced PapB.
- FIG. 47 is a concept schematic for a bioreactor setup for peptide modification via PapB.
- FIG. 48A-B show representative data for C-terminal glycine sequence.
- FIG. 48A is a scheme showing the thioether crosslink.
- FIG. 48B are representative mass spectra showing formation of the thioether crosslinks.
- FIG. 49A-B show representative data for deuterium labeled C-terminal glycine analogs.
- FIG. 49A is a scheme showing the thioether crosslink.
- FIG. 49B are representative mass spectra showing formation of the thioether crosslinks.
- FIG. 50A-B show representative data for C-terminal glycine carboxamide sequence.
- FIG. 50A is the structure of the sequence.
- FIG. SOB are representative mass spectra showing lack of formation of the thioether crosslinks.
- FIG. 51A-C show representative data for crosslinking with C-terminal ⁇ -amino acids.
- FIG. 50A is a scheme showing the generic thioether crosslink reaction for C-terminal ⁇ - amino acids.
- FIG. SOB is a scheme showing the thioether crosslink reaction with C-terminal ⁇ -alanine.
- FIG. 51C is the corresponding mass spectra data showing formation of the thioether crosslink.
- FIG. 52A-D show representative data for the crosslinking with various C-terminal ⁇ - amino acids.
- FIG. 52A is a scheme showing the absence of crosslink reaction with C- terminal 2,2-dimethyl-beta-alanine.
- FIG. 52B is a scheme showing the absence of crosslink reaction with C-terminal (R)-3-amino-2-methylpropanoic acid.
- FIG. 52C is a scheme showing the crosslink reaction with C-terminal (S)-3-amino-2-methylpropanoic acid.
- FIG. 52D is the corresponding mass spectra data showing formation of the thioether crosslink.
- FIG. 53 shows representative data for the crosslinking with common C-terminal ⁇ - amino acids.
- FIG. 54A shows a schematic thioether crosslinking with a D-tryptophan ⁇ -amino acid.
- FIG. 54B is the corresponding mass spectra data showing formation of the thioether crosslink
- FIG. 55A-D show representative structures of thioether crosslinking of N-methyl amino acids.
- FIG. 55A shows unsubstituted N-methylated thioether crosslinked product.
- FIG. 55B shows substituted N-methylated thioether crosslinked product.
- FIG. 55C shows a schematic thioether crosslinking with a substituted N-methylated substrate.
- FIG. 55D is the corresponding mass spectra data showing formation of the thioether crosslink.
- FIG. 56A-D show representative data for thioether crosslinking with C-terminal L- alanine or D-alanine.
- FIG. 56A shows a schematic of a C-terminal L-alanine without thioether crosslink product.
- FIG. 58B is the corresponding mass spectra data showing lack of formation of the thioether crosslink.
- FIG. 56C shows a schematic of a C-terminal D- alanine with thioether crosslink product.
- FIG. 56D is the corresponding mass spectra data showing formation of the thioether crosslink.
- FIG. 57A-B show representative data for thioether crosslinking with deuterium labeled C-terminal D-alanine.
- FIG. 57A shows a schematic of a deuterium labeled C- terminal D-alanine with thioether crosslink product.
- FIG. 57B is the corresponding mass spectra data showing formation of the thioether crosslink and loss of the deuterium labeled confirmed by mass shift and loss 3Da.
- FIG. 58A-B show representative data for thioether crosslinking with deuterium labeled C-terminal D-methionine.
- FIG. 58A shows a schematic of a deuterium labeled C- terminal D-methionine with thioether crosslink product.
- FIG. 58B is the corresponding mass spectra data showing formation of the thioether crosslink and loss of the deuterium labeled confirmed by mass shift and loss 3Da.
- FIG. 59A-B show representative data for thioether crosslinking with d 2 -labeled D- valine.
- FIG. 59A shows a structure of a deuterium labeled C-terminal D-valine.
- FIG. 59B is the corresponding mass spectra data showing formation of the thioether crosslink however mass shift is indicative of no loss of deuterium.
- FIG. 60A-B show representative data for thioether crosslinking with d 3 -labeled D- valine.
- FIG. 60A shows a schematic of a deuterium labeled side chain C-terminal D-valine with thioether crosslink product.
- FIG. 60B is the corresponding mass spectra data showing formation of the thioether crosslink and loss of the deuterium labeled confirmed by mass shift and loss 3Da.
- FIG. 61A-D show representative data for thioether crosslinking with deuterium labeled C-terminal D-phenyl alanine.
- FIG. 61A shows a structure of a deuterium labeled C a C-terminal D-phenyl alanine.
- FIG. 61B is the corresponding mass spectra data showing formation of the thioether crosslink however mass shift is indicative of no loss of deuterium.
- FIG. 61C shows a structure of a deuterium labeled aryl C-terminal D-phenyl alanine.
- FIG. 61D is the corresponding mass spectra data showing formation of the thioether crosslink however mass shift is indicative of no loss of deuterium
- FIG. 62A-B show representative data for thioether crosslinking with deuterium labeled d8-C-terminal D-phenylalanine.
- FIG. 62A shows a schematic of a deuterium labeled d8-C-terminal D-methionine with thioether crosslink product.
- FIG. 62B is the corresponding mass spectra data showing formation of the thioether crosslink and loss of the deuterium labeled confirmed by mass shift.
- FIG. 63 shows structures of sactipeptide thioether crosslink of corresponding D- aminoacids
- FIG. 64 shows structures of ranthipeptide thioether crosslink of corresponding D- aminoacids.
- FIG. 65A-B show representative data for 6-membered non-peptidic thioether crosslinking.
- FIG. 65 A shows scheme of Leader-Cys-Gly reaction.
- FIG. 65B is the corresponding mass spectra data showing lack of formation of the thioether crosslink of 6- membered ring.
- FIG. 66A-B show representative data for 7-membered non-peptidic thioether crosslinking.
- FIG. 66A shows scheme of Leader-hCys-Gly reaction.
- FIG. 66B is the corresponding mass spectra data showing formation of the thioether crosslink of 7-membered ring.
- FIG. 67A-B show representative data for 7-membered non-peptidic thioether crosslinking.
- FIG. 67 A shows scheme of Leader-Cys- ⁇ Ala reaction.
- FIG. 67B is the corresponding mass spectra data showing formation of the thioether crosslink of 7-membered ring.
- FIG. 68A-B show representative data for 8-membered non-peptidic thioether crosslinking.
- FIG. 68A shows scheme of Leader-hCys- ⁇ Ala reaction.
- FIG. 68B is the corresponding mass spectra data showing formation of the thioether crosslink of 8-membered ring.
- FIG. 69A-B show representative data for 8-membered non-peptidic thioether crosslinking.
- FIG. 69 A shows scheme of Leader-Cys-GABA reaction.
- FIG. 69B is the corresponding mass spectra data showing formation of the thioether crosslink of 8-membered ring.
- FIG. 70A-B show representative data for 9-membered non-peptidic thioether crosslinking.
- FIG. 70 A shows scheme of Leader-hCys-GABA reaction.
- FIG. 70B is the corresponding mass spectra data showing formation of the thioether crosslink of 9-membered ring.
- FIG. 71A-B show representative data for 16-membered non-peptidic thioether crosslinking.
- FIG. 71 A shows scheme of Leader-hCys-NH-PEG 3 -CO 2 H reaction.
- FIG. 71 A is the corresponding mass spectra data showing formation of the thioether crosslink of 16- membered ring.
- FIG. 72A-B show representative data for 20-membered non-peptidic thioether crosslinking.
- FIG. 72A shows scheme of Leader-hCys-NH-PEG 4 -CO 2 H reaction.
- FIG. 72B is the corresponding mass spectra data showing formation of the thioether crosslink of 20- membered ring.
- FIG. 73A-B show representative data for unusual non-peptidic thioether crosslinking.
- FIG. 73A shows scheme of Leader-Cys-Ser-Ala-Asn-2-(2-aminophenyl)acetic acid reaction.
- FIG. 73B is the corresponding mass spectra data showing formation of the thioether crosslink of 17-membered ring.
- FIG. 74A-B show representative data for unusual non-peptidic thioether crosslinking.
- FIG. 74A shows scheme of Leader-Cys-Ser-Ala-Asn-2-(2-(aminomethyl)phenyl)acetic acid reaction.
- FIG. 74B is the corresponding mass spectra data showing formation of the thioether crosslink of 18-membered ring.
- FIG. 75A-B show representative data for coumarin thioether crosslinking.
- FIG. 75A shows scheme of Leader-Cys-coumarin reaction.
- FIG. 75B is the corresponding mass spectra data showing formation of the thioether crosslink of 12-membered ring.
- FIG. 76A-C show representative data for the synthesis thioether peptidomimetic.
- FIG. 76A is a structure of Setmalanotide, an FDA approved drug.
- FIG. 76B shows a schematic thioether crosslinking with a modified peptide structure (e.g., an analog of Setmalanotide).
- FIG. 76C is the corresponding mass spectra data showing formation of the thioether crosslink.
- FIG. 77A-D show representative data for the synthesis thioether peptidomimetic.
- FIG. 77A is a structure of a Novartis orally available peptide.
- FIG. 77B is a structure of the designed peptides (an analog of the therapeutic peptide from FIG. 77A) and the expected product upon modification with PapB.
- FIG. 77C shows a schematic thioether crosslinking with a modified peptide structure.
- FIG. 77D is the corresponding mass spectra data showing formation of the thioether crosslink.
- FIG. 78A-D show representative therapeutic cyclic peptides that can be mimicked by a thioether crosslink peptide.
- FIG. 78A show the structure of a representative cyclic peptide, bremelanotide.
- FIG. 78B shows a representative structure of the thioether crosslinked product, an analog of bremelanotide, which contains the amino acid sequence norleucine, cysteine, D-phenylalanine, arginine, tryptophan, and epsilon-amino hexanoic acid (ACP).
- FIG. 78C shows a representative scheme of the Leader-XCDFRWZ XXX reaction.
- FIG. 78D is the corresponding mass spectra data showing formation of the thioether crosslink of therapeutic analog.
- FIG. 79A-E show representative data illustrating that PapB forms crosslinks in thiol- and carboxylate-containing extended sidechains.
- FIG. 79B shows a 2 Da shift in the MS for the carboxylate-containing residue as Asp.
- FIG. 79C shows a 2 Da shift in the MS for the carboxylate-containing residue as Glu.
- FIG. 79D shows a 2 Da shift in the MS for the carboxylate-containing residue as homoGlu.
- FIG. 79E shows the MS for the liberated macrocyclized peptide core from the leader sequence following cleavage of the TEV protease recognition sequence with TEV protease.
- FIG. 80 shows a representative proton NMR spectrum of the linear G(hC)SAN(hE)A peptide.
- FIG. 81 shows a representative proton NMR spectrum of the cyclized G(hC)SAN(hE)A peptide.
- FIG. 82 shows a representative ROESY spectrum of the linear G(hC)SAN(hE)A peptide.
- FIG. 83 shows a representative ROESY spectrum of the cyclized G(hC)SAN(hE)A peptide.
- FIG. 84A-C show representative data pertaining to a carboxylate isostere (tetrazole moiety) crosslinked by PapB.
- FIG. 84A shows a schematic of the linear and cyclized peptide illustrating the putative crosslink location.
- FIG. 84B shows MS results illustrating a clear 2 Da loss between an assay without PapB (darker gray) and with the addition of PapB (lighter gray).
- FIG. 84C shows the expected tandem mass spectrometry with no fragmentation between Cys and T4Az.
- FIG. 85 shows representative fragmentation of reacted D23T4Az msPapA variant.
- FIG. 86 shows representative fragments of a tetrazole loss in the D23T4Az msPapA variant
- Ranges can be expressed herein as from “about” one particular value, and/or to “about” another particular value. When such a range is expressed, another aspect includes from the one particular value and/or to the other particular value. Similarly, when values are expressed as approximations, by use of the antecedent “about,” it will be understood that the particular value forms another aspect. It will be further understood that the endpoints of each of the ranges are significant both in relation to the other endpoint, and independently of the other endpoint. It is also understood that there are a number of values disclosed herein, and that each value is also herein disclosed as “about” that particular value in addition to the value itself. For example, if the value “10” is disclosed, then “about 10” is also disclosed. It is also understood that each unit between two particular units are also disclosed. For example, if 10 and 15 are disclosed, then 11, 12, 13, and 14 are also disclosed.
- the terms “about” and “at or about” mean that the amount or value in question can be the value designated some other value approximately or about the same. It is generally understood, as used herein, that it is the nominal value indicated ⁇ 10% variation unless otherwise indicated or inferred. The term is intended to convey that similar values promote equivalent results or effects recited in the claims. That is, it is understood that amounts, sizes, formulations, parameters, and other quantities and characteristics are not and need not be exact, but can be approximate and/or larger or smaller, as desired, reflecting tolerances, conversion factors, rounding off, measurement error and the like, and other factors known to those of skill in the art.
- an amount, size, formulation, parameter or other quantity or characteristic is “about” or “approximate” whether or not expressly stated to be such. It is understood that where “about” is used before a quantitative value, the parameter also includes the specific quantitative value itself, unless specifically stated otherwise.
- references in the specification and concluding claims to parts by weight of a particular element or component in a composition denotes the weight relationship between the element or component and any other elements or components in the composition or article for which a part by weight is expressed.
- X and Y are present at a weight ratio of 2:5, and are present in such ratio regardless of whether additional components are contained in the compound.
- a weight percent (wt. %) of a component is based on the total weight of the formulation or composition in which the component is included.
- IC 50 is intended to refer to the concentration of a substance (e.g., a compound or a drug) that is required for 50% inhibition of a biological process, or component of a process, including a protein, subunit, organelle, ribonucleoprotein, etc.
- a substance e.g., a compound or a drug
- an IC 50 can refer to the concentration of a substance that is required for 50% inhibition in vivo, as further defined elsewhere herein.
- IC 50 refers to the half-maximal (50%) inhibitory concentration (IC) of a substance.
- EC 50 is intended to refer to the concentration of a substance (e.g., a compound or a drug) that is required for 50% agonism of a biological process, or component of a process, including a protein, subunit, organelle, ribonucleoprotein, etc.
- an EC 50 can refer to the concentration of a substance that is required for 50% agonism in vivo, as further defined elsewhere herein.
- EC 50 refers to the concentration of agonist that provokes a response hallway between the baseline and maximum response.
- the term “optional” or “optionally” means that the subsequently described event or circumstance can or cannot occur, and that the description includes instances where said event or circumstance occurs and instances where it does not.
- the term “subject” can be a vertebrate, such as a mammal, a fish, a bird, a reptile, or an amphibian.
- the subject of the herein disclosed methods can be a human, non-human primate, horse, pig, rabbit, dog, sheep, goat, cow, cat, guinea pig or rodent.
- the term does not denote a particular age or sex. Thus, adult and newborn subjects, as well as fetuses, whether male or female, are intended to be covered.
- the subject is a mammal.
- a patient refers to a subject afflicted with a disease, disorder, or condition.
- patient includes human and veterinary subjects.
- treatment refers to the medical management of a patient with the intent to cure, ameliorate, stabilize, or prevent a disease, pathological condition, or disorder.
- This term includes active treatment, that is, treatment directed specifically toward the improvement of a disease, pathological condition, or disorder, and also includes causal treatment, that is, treatment directed toward removal of the cause of the associated disease, pathological condition, or disorder.
- this term includes palliative treatment, that is, treatment designed for the relief of symptoms rather than the curing of the disease, pathological condition, or disorder; preventative treatment, that is, treatment directed to minimizing or partially or completely inhibiting the development of the associated disease, pathological condition, or disorder; and supportive treatment, that is, treatment employed to supplement another specific therapy directed toward the improvement of the associated disease, pathological condition, or disorder.
- the term covers any treatment of a subject, including a mammal (e.g., a human), and includes: (i) preventing the disease from occurring in a subject that can be predisposed to the disease but has not yet been diagnosed as having it; (ii) inhibiting the disease, i.e., arresting its development; or (iii) relieving the disease, i.e., causing regression of the disease.
- the subject is a mammal such as a primate, and, in a further aspect, the subject is a human.
- subject also includes domesticated animals (e.g., cats, dogs, etc.), livestock (e.g., cattle, horses, pigs, sheep, goats, etc.), and laboratory animals (e.g., mouse, rabbit, rat, guinea pig, fruit fly, etc.).
- domesticated animals e.g., cats, dogs, etc.
- livestock e.g., cattle, horses, pigs, sheep, goats, etc.
- laboratory animals e.g., mouse, rabbit, rat, guinea pig, fruit fly, etc.
- prevent refers to precluding, averting, obviating, forestalling, stopping, or hindering something from happening, especially by advance action. It is understood that where reduce, inhibit or prevent are used herein, unless specifically indicated otherwise, the use of the other two words is also expressly disclosed.
- diagnosis means having been subjected to a physical examination by a person of skill, for example, a physician, and found to have a condition that can be diagnosed or treated by the compounds, compositions, or methods disclosed herein.
- administering refers to any method of providing a pharmaceutical preparation to a subject. Such methods are well known to those skilled in the art and include, but are not limited to, oral administration, transdermal administration, administration by inhalation, nasal administration, topical administration, intravaginal administration, ophthalmic administration, intraaural administration, intracerebral administration, rectal administration, sublingual administration, buccal administration, and parenteral administration, including injectable such as intravenous administration, intra-arterial administration, intramuscular administration, and subcutaneous administration. Administration can be continuous or intermittent.
- a preparation can be administered therapeutically; that is, administered to treat an existing disease or condition.
- a preparation can be administered prophylactically; that is, administered for prevention of a disease or condition.
- the terms “effective amount” and “amount effective” refer to an amount that is sufficient to achieve the desired result or to have an effect on an undesired condition.
- a “therapeutically effective amount” refers to an amount that is sufficient to achieve the desired therapeutic result or to have an effect on undesired symptoms, but is generally insufficient to cause adverse side effects.
- the specific therapeutically effective dose level for any particular patient will depend upon a variety of factors including the condition being treated and the severity of the condition; the specific composition employed; the age, body weight, general health, sex and diet of the patient; the time of administration; the route of administration; the rate of excretion of the specific compound employed; the duration of the treatment; drugs used in combination or coincidental with the specific compound employed and like factors well known in the medical arts. For example, it is well within the skill of the art to start doses of a compound at levels lower than those required to achieve the desired therapeutic effect and to gradually increase the dosage until the desired effect is achieved. If desired, the effective daily dose can be divided into multiple doses for purposes of administration.
- compositions can contain such amounts or submultiples thereof to make up the daily dose.
- the dosage can be adjusted by the individual physician in the event of any contraindications. Dosage can vary, and can be administered in one or more dose administrations daily, for one or several days. Guidance can be found in the literature for appropriate dosages for given classes of pharmaceutical products.
- a preparation can be administered in a “prophylactically effective amount”; that is, an amount effective for prevention of a disease or condition.
- dosage form means a pharmacologically active material in a medium, carrier, vehicle, or device suitable for administration to a subject.
- a dosage forms can comprise inventive a disclosed compound, a product of a disclosed method of making, or a salt, solvate, or polymorph thereof, in combination with a pharmaceutically acceptable excipient, such as a preservative, buffer, saline, or phosphate buffered saline.
- Dosage forms can be made using conventional pharmaceutical manufacturing and compounding techniques.
- Dosage forms can comprise inorganic or organic buffers (e.g., sodium or potassium salts of phosphate, carbonate, acetate, or citrate) and pH adjustment agents (e.g., hydrochloric acid, sodium or potassium hydroxide, salts of citrate or acetate, amino acids and their salts) antioxidants (e.g., ascorbic acid, alpha-tocopherol), surfactants (e.g., polysorbate 20, polysorbate 80, polyoxyethylene 9-10 nonyl phenol, sodium desoxycholate), solution and/or cryo/lyo stabilizers (e.g., sucrose, lactose, mannitol, trehalose), osmotic adjustment agents (e.g., salts or sugars), antibacterial agents (e.g., benzoic acid, phenol, gentamicin), antifoaming agents (e.g., polydimethylsilozone), preservatives (e.g., thimerosal, 2-
- kit means a collection of at least two components constituting the kit. Together, the components constitute a functional unit for a given purpose. Individual member components may be physically packaged together or separately. For example, a kit comprising an instruction for using the kit may or may not physically include the instruction with other individual member components. Instead, the instruction can be supplied as a separate member component, either in a paper form or an electronic form which may be supplied on computer readable memory device or downloaded from an internet website, or as recorded presentation. [00132] As used herein, “instruction(s)” means documents describing relevant materials or methodologies pertaining to a kit.
- kits may include any combination of the following: background information, list of components and their availability information (purchase information, etc.), brief or detailed protocols for using the kit, trouble- shooting, references, technical support, and any other related documents.
- Instructions can be supplied with the kit or as a separate member component, either as a paper form or an electronic form which may be supplied on computer readable memory device or downloaded from an internet website, or as recorded presentation. Instructions can comprise one or multiple documents, and are meant to include future updates.
- therapeutic agent include any synthetic or naturally occurring biologically active compound or composition of matter which, when administered to an organism (human or nonhuman animal), induces a desired pharmacologic, immunogenic, and/or physiologic effect by local and/or systemic action.
- the term therefore encompasses those compounds or chemicals traditionally regarded as drugs, vaccines, and biopharmaceuticals including molecules such as proteins, peptides, hormones, nucleic acids, gene constructs and the like.
- therapeutic agents include, without limitation, medicaments; vitamins; mineral supplements; substances used for the treatment, prevention, diagnosis, cure or mitigation of a disease or illness; substances that affect the structure or function of the body, or pro-drugs, which become biologically active or more active after they have been placed in a physiological environment.
- the term “therapeutic agent” includes compounds or compositions for use in all of the major therapeutic areas including, but not limited to, adjuvants; anti-infectives such as antibiotics and antiviral agents; anti-cancer and anti-neoplastic agents such as kinase inhibitors, poly ADP ribose polymerase (PARP) inhibitors and other DNA damage response modifiers, epigenetic agents such as bromodomain and extra-terminal (BET) inhibitors, histone deacetylase (HD Ac) inhibitors, iron chelotors and other ribonucleotides reductase inhibitors, proteasome inhibitors and Nedd8-activating enzyme (NAE) inhibitors, mammalian target of rapamycin (mTOR) inhibitors, traditional cytotoxic agents such as paclitaxel, dox, irinotecan, and platinum compounds, immune checkpoint blockade agents such as cytotoxic T lymphocyte antigen-4 (CTLA-4) monoclonal antibody (mAB), programme
- the agent may be a biologically active agent used in medical, including veterinary, applications and in agriculture, such as with plants, as well as other areas.
- therapeutic agent also includes without limitation, medicaments; vitamins; mineral supplements; substances used for the treatment, prevention, diagnosis, cure or mitigation of disease or illness; or substances which affect the structure or function of the body; or pro-drugs, which become biologically active or more active after they have been placed in a predetermined physiological environment.
- pharmaceutically acceptable describes a material that is not biologically or otherwise undesirable, i.e., without causing an unacceptable level of undesirable biological effects or interacting in a deleterious manner.
- sactipeptide refers to a sulfur-to-alpha carbon thioether cross-linked peptide belonging to the ribosomally synthesized post-translationally modified peptide (RiPP) superfamily. As illustrated by the structure below, a sactipeptide contains an intramolecular thioether bond that crosslinks the sulfur atom of a cysteine residue to the ⁇ -carbon of an acceptor amino acid.
- ranthipeptide refers to a radical non-a thioether- containing peptide, which, similar to sactipeptides above, is also a member of the RiPP superfamily.
- a ranthipeptide can contain an intramolecular thioether bond that crosslinks the sulfur atom of a cysteine residue to any carbon other than the a-carbon of an acceptor amino acid.
- ranthipeptide residues containing an ⁇ - or ⁇ -carbon are shown below.
- the term “derivative” refers to a compound having a structure derived from the structure of a parent compound (e.g., a compound disclosed herein) and whose structure is sufficiently similar to those disclosed herein and based upon that similarity, would be expected by one skilled in the art to exhibit the same or similar activities and utilities as the claimed compounds, or to induce, as a precursor, the same or similar activities and utilities as the claimed compounds.
- exemplary derivatives include salts, esters, amides, salts of esters or amides, and N-oxides of a parent compound.
- the term “pharmaceutically acceptable carrier” refers to sterile aqueous or nonaqueous solutions, dispersions, suspensions or emulsions, as well as sterile powders for reconstitution into sterile injectable solutions or dispersions just prior to use.
- suitable aqueous and nonaqueous carriers, diluents, solvents or vehicles include water, ethanol, polyols (such as glycerol, propylene glycol, polyethylene glycol and the like), carboxymethylcellulose and suitable mixtures thereof, vegetable oils (such as olive oil) and injectable organic esters such as ethyl oleate.
- Proper fluidity can be maintained, for example, by the use of coating materials such as lecithin, by the maintenance of the required particle size in the case of dispersions and by the use of surfactants.
- These compositions can also contain adjuvants such as preservatives, wetting agents, emulsifying agents and dispersing agents.
- Prevention of the action of microorganisms can be ensured by the inclusion of various antibacterial and antifungal agents such as paraben, chlorobutanol, phenol, sorbic acid and the like. It can also be desirable to include isotonic agents such as sugars, sodium chloride and the like.
- Prolonged absorption of the injectable pharmaceutical form can be brought about by the inclusion of agents, such as aluminum monostearate and gelatin, which delay absorption.
- Injectable depot forms are made by forming microencapsule matrices of the drug in biodegradable polymers such as polylactide-polyglycolide, poly(orthoesters) and poly(anhydrides). Depending upon the ratio of drug to polymer and the nature of the particular polymer employed, the rate of drug release can be controlled. Depot injectable formulations are also prepared by entrapping the drug in liposomes or microemulsions which are compatible with body tissues.
- the injectable formulations can be sterilized, for example, by filtration through a bacterial-retaining filter or by incorporating sterilizing agents in the form of sterile solid compositions which can be dissolved or dispersed in sterile water or other sterile injectable media just prior to use.
- Suitable inert carriers can include sugars such as lactose. Desirably, at least 95% by weight of the particles of the active ingredient have an effective particle size in the range of 0.01 to 10 micrometers.
- the term “substituted” is contemplated to include all permissible substituents of organic compounds.
- the permissible substituents include acyclic and cyclic, branched and unbranched, carbocyclic and heterocyclic, and aromatic and nonaromatic substituents of organic compounds.
- Illustrative substituents include, for example, those described below.
- the permissible substituents can be one or more and the same or different for appropriate organic compounds.
- the heteroatoms, such as nitrogen can have hydrogen substituents and/or any permissible substituents of organic compounds described herein which satisfy the valences of the heteroatoms.
- substitution or “substituted with” include the implicit proviso that such substitution is in accordance with permitted valence of the substituted atom and the substituent, and that the substitution results in a stable compound, e.g., a compound that does not spontaneously undergo transformation such as by rearrangement, cyclization, elimination, etc. It is also contemplated that, in certain aspects, unless expressly indicated to the contrary, individual substituents can be further optionally substituted (i.e., further substituted or unsubstituted).
- a 1 ,” “A 2 ,” “A 3 ,” and “A 4 ” are used herein as generic symbols to represent various specific substituents. These symbols can be any substituent, not limited to those disclosed herein, and when they are defined to be certain substituents in one instance, they can, in another instance, be defined as some other substituents.
- aliphatic or “aliphatic group,” as used herein, denotes a hydrocarbon moiety that may be straight-chain (i.e., unbranched), branched, or cyclic (including fused, bridging, and spirofused polycyclic) and may be completely saturated or may contain one or more units of unsaturation, but which is not aromatic. Unless otherwise specified, aliphatic groups contain 1-20 carbon atoms. Aliphatic groups include, but are not limited to, linear or branched, alkyl, alkenyl, and alkynyl groups, and hybrids thereof such as (cycloalkyl)alkyl, (cycloalkenyl)alkyl or (cycloalkyl)alkenyl.
- alkyl as used herein is a branched or unbranched saturated hydrocarbon group of 1 to 24 carbon atoms, such as methyl, ethyl, n-propyl, isopropyl, n- butyl, isobutyl, s-butyl, t-butyl, n-pentyl, isopentyl, s-pentyl, neopentyl, hexyl, heptyl, octyl, nonyl, decyl, dodecyl, tetradecyl, hexadecyl, eicosyl, tetracosyl, and the like.
- the alkyl group can be cyclic or acyclic.
- the alkyl group can be branched or unbranched.
- the alkyl group can also be substituted or unsubstituted.
- the alkyl group can be substituted with one or more groups including, but not limited to, alkyl, cycloalkyl, alkoxy, amino, ether, halide, hydroxy, nitro, silyl, sulfo-oxo, or thiol, as described herein.
- a “lower alkyl” group is an alkyl group containing from one to six (e.g., from one to four) carbon atoms.
- alkyl group can also be a Cl alkyl, C1-C2 alkyl, C1-C3 alkyl, C1-C4 alkyl, C1-C5 alkyl, C1-C6 alkyl, C1-C7 alkyl, C1-C8 alkyl, C1-C9 alkyl, Cl -CIO alkyl, and the like up to and including a C1-C24 alkyl.
- alkyl is generally used to refer to both unsubstituted alkyl groups and substituted alkyl groups; however, substituted alkyl groups are also specifically referred to herein by identifying the specific substituent(s) on the alkyl group.
- halogenated alkyl or “haloalkyl” specifically refers to an alkyl group that is substituted with one or more halide, e.g., fluorine, chlorine, bromine, or iodine.
- halogenated alkyl specifically refers to an alkyl group that is substituted with one or more halide, e.g., fluorine, chlorine, bromine, or iodine.
- monohaloalkyl specifically refers to an alkyl group that is substituted with a single halide, e.g. fluorine, chlorine, bromine, or iodine.
- polyhaloalkyl specifically refers to an alkyl group that is independently substituted with two or more halides, i.e.
- alkoxyalkyl specifically refers to an alkyl group that is substituted with one or more alkoxy groups, as described below.
- aminoalkyl specifically refers to an alkyl group that is substituted with one or more amino groups.
- hydroxyalkyl specifically refers to an alkyl group that is substituted with one or more hydroxy groups.
- cycloalkyl refers to both unsubstituted and substituted cycloalkyl moieties
- the substituted moieties can, in addition, be specifically identified herein; for example, a particular substituted cycloalkyl can be referred to as, e.g., an “alkylcycloalkyl.”
- a substituted alkoxy can be specifically referred to as, e.g., a “halogenated alkoxy”
- a particular substituted alkenyl can be, e.g., an “alkenylalcohol,” and the like.
- the practice of using a general term, such as “cycloalkyl,” and a specific term, such as “alkylcycloalkyl,” is not meant to imply that the general term does not also include the specific term.
- cycloalkyl as used herein is a non-aromatic carbon-based ring composed of at least three carbon atoms.
- cycloalkyl groups include, but are not limited to, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, norbomyl, and the like.
- heterocycloalkyl is a type of cycloalkyl group as defined above, and is included within the meaning of the term “cycloalkyl,” where at least one of the carbon atoms of the ring is replaced with a heteroatom such as, but not limited to, nitrogen, oxygen, sulfur, or phosphorus.
- the cycloalkyl group and heterocycloalkyl group can be substituted or unsubstituted.
- the cycloalkyl group and heterocycloalkyl group can be substituted with one or more groups including, but not limited to, alkyl, cycloalkyl, alkoxy, amino, ether, halide, hydroxy, nitro, silyl, sulfo-oxo, or thiol as described herein.
- polyalkylene group as used herein is a group having two or more CH 2 groups linked to one another.
- the polyalkylene group can be represented by the formula -(CH 2 ) a - , where “a” is an integer of from 2 to 500.
- Alkoxy also includes polymers of alkoxy groups as just described; that is, an alkoxy can be a polyether such as — OA 1 — OA 2 or — OA 1 — (OA 2 ) a — OA 3 , where “a” is an integer of from 1 to 200 and A 1 , A 2 , and A 3 are alkyl and/or cycloalkyl groups.
- alkenyl as used herein is a hydrocarbon group of from 2 to 24 carbon atoms with a structural formula containing at least one carbon-carbon double bond.
- the alkenyl group can be substituted with one or more groups including, but not limited to, alkyl, cycloalkyl, alkoxy, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, azide, nitro, silyl, sulfo-oxo, or thiol, as described herein.
- groups including, but not limited to, alkyl, cycloalkyl, alkoxy, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, azide, nitro, silyl, sulfo-oxo, or thiol, as described here
- Examples of cycloalkenyl groups include, but are not limited to, cyclopropenyl, cyclobutenyl, cyclopentenyl, cyclopentadienyl, cyclohexenyl, cyclohexadienyl, norbomenyl, and the like.
- heterocycloalkenyl is a type of cycloalkenyl group as defined above, and is included within the meaning of the term “cycloalkenyl,” where at least one of the carbon atoms of the ring is replaced with a heteroatom such as, but not limited to, nitrogen, oxygen, sulfur, or phosphorus.
- the cycloalkenyl group and heterocycloalkenyl group can be substituted or unsubstituted.
- the cycloalkenyl group and heterocycloalkenyl group can be substituted with one or more groups including, but not limited to, alkyl, cycloalkyl, alkoxy, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, azide, nitro, silyl, sulfo-oxo, or thiol as described herein.
- alkynyl as used herein is a hydrocarbon group of 2 to 24 carbon atoms with a structural formula containing at least one carbon-carbon triple bond.
- the alkynyl group can be unsubstituted or substituted with one or more groups including, but not limited to, alkyl, cycloalkyl, alkoxy, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, azide, nitro, silyl, sulfo-oxo, or thiol, as described herein.
- groups including, but not limited to, alkyl, cycloalkyl, alkoxy, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, azide, nitro, silyl, sulfo-oxo, or
- cycloalkynyl as used herein is a non-aromatic carbon-based ring composed of at least seven carbon atoms and containing at least one carbon-carbon triple bound.
- cycloalkynyl groups include, but are not limited to, cycloheptynyl, cyclooctynyl, cyclononynyl, and the like.
- heterocycloalkynyl is a type of cycloalkenyl group as defined above, and is included within the meaning of the term “cycloalkynyl,” where at least one of the carbon atoms of the ring is replaced with a heteroatom such as, but not limited to, nitrogen, oxygen, sulfur, or phosphorus.
- the cycloalkynyl group and heterocycloalkynyl group can be substituted or unsubstituted.
- the cycloalkynyl group and heterocycloalkynyl group can be substituted with one or more groups including, but not limited to, alkyl, cycloalkyl, alkoxy, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, azide, nitro, silyl, sulfo-oxo, or thiol as described herein.
- aromatic group refers to a ring structure having cyclic clouds of delocalized ⁇ electrons above and below the plane of the molecule, where the ⁇ clouds contain (4n+2) ⁇ electrons.
- aromaticity is found in Morrison and Boyd, Organic Chemistry, (5th Ed., 1987), Chapter 13, entitled “Aromaticity,” pages 477-497, incorporated herein by reference.
- aromatic group is inclusive of both aryl and heteroaryl groups.
- aryl as used herein is a group that contains any carbon-based aromatic group including, but not limited to, benzene, naphthalene, phenyl, biphenyl, anthracene, and the like.
- the aryl group can be substituted or unsubstituted.
- the aryl group can be substituted with one or more groups including, but not limited to, alkyl, cycloalkyl, alkoxy, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, heteroaryl, aldehyde, — NH 2 , carboxylic acid, ester, ether, halide, hydroxy, ketone, azide, nitro, silyl, sulfo-oxo, or thiol as described herein.
- biasryl is a specific type of aryl group and is included in the definition of “aryl.”
- the aryl group can be a single ring structure or comprise multiple ring structures that are either fused ring structures or attached via one or more bridging groups such as a carbon-carbon bond.
- biaryl can be two aryl groups that are bound together via a fused ring structure, as in naphthalene, or are attached via one or more carbon-carbon bonds, as in biphenyl.
- amine or “amino” as used herein are represented by the formula — NA 1 A 2 , where A 1 and A 2 can be, independently, hydrogen or alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group as described herein. A specific example of amino is — NH 2 .
- alkylamino as used herein is represented by the formula — NH(- alkyl) where alkyl is a described herein.
- Representative examples include, but are not limited to, methylamino group, ethylamino group, propylamino group, isopropylamino group, butylamino group, isobutylamino group, (sec-butyl)amino group, (tert-butyl)amino group, pentylamino group, isopentylamino group, (tert-pentyl)amino group, hexylamino group, and the like.
- dialkylamino as used herein is represented by the formula — N(- alkyl) 2 where alkyl is a described herein.
- Representative examples include, but are not limited to, dimethylamino group, diethylamino group, dipropylamino group, diisopropylamino group, dibutylamino group, diisobutylamino group, di(sec-butyl)amino group, di(tert-butyl)amino group, dipentylamino group, diisopentylamino group, di(tert- pentyl)amino group, dihexylamino group, N-ethyl-N-methylamino group, N-methyl-N- propylamino group, N-ethyl-N-propylamino group and the like.
- esters as used herein is represented by the formula — (OC(O)A 1 or — C(O)OA 1 , where A 1 can be alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group as described herein.
- polyester as used herein is represented by the formula — (A 1 O(O)C-A 2 -C(O)O) a — or (A 1 O(O)C-A 2 -OC(O)) a , where A 1 and A 2 can be, independently, an alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group described herein and “a” is an integer from 1 to 500. “Polyester” is as the term used to describe a group that is produced by the reaction between a compound having at least two carboxylic acid groups with a compound having at least two hydroxyl groups.
- ether as used herein is represented by the formula A 1 OA 2 , where A 1 and A 2 can be, independently, an alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group described herein.
- polyether as used herein is represented by the formula — (A 1 O-A 2 O) a — , where A 1 and A 2 can be, independently, an alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group described herein and “a” is an integer of from 1 to 500.
- Examples of polyether groups include polyethylene oxide, polypropylene oxide, and polybutylene oxide.
- halo halogen
- halide halogen
- pseudohalide pseudohalogen
- pseudohalo pseudohalogen
- pseudohalo pseudohalo
- functional groups include, by way of example, cyano, thiocyanato, azido, trifluoromethyl, trifluoromethoxy, perfluoroalkyl, and perfluoroalkoxy groups.
- heteroalkyl refers to an alkyl group containing at least one heteroatom. Suitable heteroatoms include, but are not limited to, O, N, Si, P and S, wherein the nitrogen, phosphorous and sulfur atoms are optionally oxidized, and the nitrogen heteroatom is optionally quatemized. Heteroalkyls can be substituted as defined above for alkyl groups.
- heteroaryl refers to an aromatic group that has at least one heteroatom incorporated within the ring of the aromatic group.
- heteroatoms include, but are not limited to, nitrogen, oxygen, sulfur, and phosphorus, where N-oxides, sulfin- oxides, and dioxides are permissible heteroatom substitutions.
- the heteroaryl group can be substituted or unsubstituted.
- the heteroaryl group can be substituted with one or more groups including, but not limited to, alkyl, cycloalkyl, alkoxy, amino, ether, halide, hydroxy, nitro, silyl, sulfo-oxo, or thiol as described herein.
- Heteroaryl groups can be monocyclic, or alternatively fused ring systems. Heteroaryl groups include, but are not limited to, furyl, imidazolyl, pyrimidinyl, tetrazolyl, thienyl, pyridinyl, pyrrolyl, N- methylpyrrolyl, quinolinyl, isoquinolinyl, pyrazolyl, triazolyl, thiazolyl, oxazolyl, isoxazolyl, oxadiazolyl, thiadiazolyl, isothiazolyl, pyridazinyl, pyrazinyl, benzofuranyl, benzodioxolyl, benzothiophenyl, indolyl, indazolyl, benzimidazolyl, imidazopyridinyl, pyrazolopyridinyl, and pyrazolopyrimidinyl.
- heteroaryl groups include, but are not limited to, pyridinyl, pyridazinyl, pyrimidinyl, pyrazinyl, thiophenyl, pyrazolyl, imidazolyl, benzo[d]oxazolyl, benzo[d]thiazolyl, quinolinyl, quinazolinyl, indazolyl, imidazo[ 1 ,2-b]pyridazinyl, imidazo[l ,2-a]pyrazinyl, benzo[c] [ 1 ,2,5]thiadiazolyl, benzo[c][l,2,5]oxadiazolyl, and pyrido[2,3-b]pyrazinyl.
- heterocycle or “heterocyclyl,” as used herein can be used interchangeably and refer to single and multi-cyclic aromatic or non-aromatic ring systems in which at least one of the ring members is other than carbon.
- Heterocycle includes pyridine, pyrimidine, furan, thiophene, pyrrole, isoxazole, isothiazole, pyrazole, oxazole, thiazole, imidazole, oxazole, including, 1,2,3- oxadiazole, 1,2,5-oxadiazole and 1,3,4-oxadiazole, thiadiazole, including, 1,2,3-thiadiazole, 1,2,5-thiadiazole, and 1 , 3, 4-thiadiazole, triazole, including, 1,2,3-triazole, 1,3,4-triazole, tetrazole, including 1,2,3,4-tetrazole and 1,2,4,5-tetrazole, pyridazine, pyrazine, triazine, including
- heterocyclyl group can also be a C2 heterocyclyl, C2-C3 heterocyclyl, C2- C4 heterocyclyl, C2-C5 heterocyclyl, C2-C6 heterocyclyl, C2-C7 heterocyclyl, C2-C8 heterocyclyl, C2-C9 heterocyclyl, C2-C10 heterocyclyl, C2-C11 heterocyclyl, and the like up to and including a C2-C18 heterocyclyl.
- a C2 heterocyclyl comprises a group which has two carbon atoms and at least one heteroatom, including, but not limited to, aziridinyl, diazetidinyl, dihydrodiazetyl, oxiranyl, thiiranyl, and the like.
- a C5 heterocyclyl comprises a group which has five carbon atoms and at least one heteroatom, including, but not limited to, piperidinyl, tetrahydropyranyl, tetrahydrothiopyranyl, diazepanyl, pyridinyl, and the like. It is understood that a heterocyclyl group may be bound either through a heteroatom in the ring, where chemically possible, or one of carbons comprising the heterocyclyl ring.
- bicyclic heterocycle or “bicyclic heterocyclyl,” as used herein refers to a ring system in which at least one of the ring members is other than carbon.
- Bicyclic heterocyclyl encompasses ring systems wherein an aromatic ring is fused with another aromatic ring, or wherein an aromatic ring is fused with a non-aromatic ring.
- Bicyclic heterocyclyl encompasses ring systems wherein a benzene ring is fused to a 5- or a 6- membered ring containing 1 , 2 or 3 ring heteroatoms or wherein a pyridine ring is fused to a 5- or a 6-membered ring containing 1, 2 or 3 ring heteroatoms.
- Bicyclic heterocyclic groups include, but are not limited to, indolyl, indazolyl, pyrazolo[l,5-a]pyridinyl, benzofuranyl, quinolinyl, quinoxalinyl, 1,3-benzodioxolyl, 2,3-dihydro-l,4-benzodioxinyl, 3,4-dihydro-2H- chromenyl, 1H-pyrazolo[4,3-c]pyridin-3-yl; lH-pyrrolo[3,2-b]pyridin-3-yl; and 1H- pyrazolo[3,2-b]pyridin-3-yl.
- heterocycloalkyl refers to an aliphatic, partially unsaturated or fully saturated, 3- to 14-membered ring system, including single rings of 3 to 8 atoms and bi- and tricyclic ring systems.
- the heterocycloalkyl ring-systems include one to four heteroatoms independently selected from oxygen, nitrogen, and sulfur, wherein a nitrogen and sulfur heteroatom optionally can be oxidized and a nitrogen heteroatom optionally can be substituted.
- heterocycloalkyl groups include, but are not limited to, pyrrolidinyl, pyrazolinyl, pyrazolidinyl, imidazolinyl, imidazolidinyl, piperidinyl, piperazinyl, oxazolidinyl, isoxazolidinyl, morpholinyl, thiazolidinyl, isothiazolidinyl, and tetrahydrofuryl.
- hydroxyl or “hydroxyl” as used herein is represented by the formula — OH.
- ketone as used herein is represented by the formula A 1 C(O)A 2 , where A 1 and A 2 can be, independently, an alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group as described herein.
- nitro as used herein is represented by the formula NO 2 .
- nitrile or “cyano” as used herein is represented by the formula CN.
- sil as used herein is represented by the formula — SiA 1 A 2 A 3 , where A 1 , A 2 , and A 3 can be, independently, hydrogen or an alkyl, cycloalkyl, alkoxy, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group as described herein.
- sulfo-oxo is represented by the formulas — S(O)A 1 , — S(O) 2 A 1 , — OS(O) 2 A 1 , or — OS(O) 2 OA 1 , where A 1 can be hydrogen or an alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group as described herein.
- sulfonyl is used herein to refer to the sulfo-oxo group represented by the formula — S(O) 2 A 1 , where A 1 can be hydrogen or an alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group as described herein.
- sulfone as used herein is represented by the formula A 1 S(O) 2 A 2 , where A 1 and A 2 can be, independently, an alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group as described herein.
- sulfoxide as used herein is represented by the formula A 1 S(O)A 2 , where A 1 and A 2 can be, independently, an alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group as described herein.
- thiol as used herein is represented by the formula — SH.
- R 1 ,” “R 2 ,” “R 3 ,” “R n ,” where n is an integer, as used herein can, independently, possess one or more of the groups listed above.
- R 1 is a straight chain alkyl group
- one of the hydrogen atoms of the alkyl group can optionally be substituted with a hydroxyl group, an alkoxy group, an alkyl group, a halide, and the like.
- a first group can be incorporated within second group or, alternatively, the first group can be pendant (i.e., attached) to the second group.
- an alkyl group comprising an amino group the amino group can be incorporated within the backbone of the alkyl group.
- the amino group can be attached to the backbone of the alkyl group.
- the nature of the group(s) that is (are) selected will determine if the first group is embedded or attached to the second group.
- compounds of the invention may contain “optionally substituted” moieties.
- substituted whether preceded by the term “optionally” or not, means that one or more hydrogen of the designated moiety are replaced with a suitable substituent.
- an “optionally substituted” group may have a suitable substituent at each substitutable position of the group, and when more than one position in any given structure may be substituted with more than one substituent selected from a specified group, the substituent may be either the same or different at every position.
- Combinations of substituents envisioned by this invention are preferably those that result in the formation of stable or chemically feasible compounds.
- individual substituents can be further optionally substituted (i.e., further substituted or unsubstituted).
- stable refers to compounds that are not substantially altered when subjected to conditions to allow for their production, detection, and, in certain aspects, their recovery, purification, and use for one or more of the purposes disclosed herein.
- Suitable monovalent substituents on R° are independently halogen, -(CH 2 ) 0-2 R ⁇ , -(haloR ⁇ ), -(CH 2 ) 0-2 OH, -(CH 2 ) 0-2 OR ⁇ , -(CH 2 ) 0-
- Suitable divalent substituents that are bound to vicinal substitutable carbons of an “optionally substituted” group include: -O(CR*2) 2-3 O-, wherein each independent occurrence of R* is selected from hydrogen, C 1-6 aliphatic which may be substituted as defined below, or an unsubstituted 5-6-membered saturated, partially unsaturated, or aryl ring having 0 4 heteroatoms independently selected from nitrogen, oxygen, or sulfur.
- Suitable substituents on the aliphatic group of R* include halogen, - R ⁇ , -(haloR ⁇ ), -OH, -OR ⁇ , -O(haloR ⁇ ), -CN, -C(O)OH, -C(O)OR ⁇ , -NH 2 , -NHR ⁇ , -NR ⁇ 2 , or -NO 2 , wherein each R ⁇ is unsubstituted or where preceded by “halo” is substituted only with one or more halogens, and is independently C 1-4 aliphatic, -CH 2 Ph, -O(CH 2 ) 0-1 Ph, or a 5-6-membered saturated, partially unsaturated, or aryl ring having 0-4 heteroatoms independently selected from nitrogen, oxygen, or sulfur.
- Suitable substituents on a substitutable nitrogen of an “optionally substituted” group include is independently hydrogen, C 1-6 aliphatic which may be substituted as defined below, unsubstituted -OPh, or an unsubstituted 5-6-membered saturated, partially unsaturated, or aryl ring having 0 4 heteroatoms independently selected from nitrogen, oxygen, or sulfur, or, notwithstanding the definition above, two independent occurrences of taken together with their intervening atom(s) form an unsubstituted 3-12-membered saturated, partially unsaturated, or aryl mono- or bicyclic ring having 0-4 heteroatoms independently selected from nitrogen, oxygen, or sulfur.
- Suitable substituents on the aliphatic group of are independently halogen, - R ⁇ , -(haloR ⁇ ), -OH, -OR ⁇ , -O(haloR ⁇ ), -CN, -C(O)OH, -C(O)OR ⁇ , -NH 2 , -NHR ⁇ , -NR ⁇ 2 , or -NO 2 , wherein each R ⁇ is unsubstituted or where preceded by “halo” is substituted only with one or more halogens, and is independently Ci 4 aliphatic, -CH 2 Ph, -O(CH 2 ) 0-1 Ph, or a 5-6-membered saturated, partially unsaturated, or aryl ring having 0-4 heteroatoms independently selected from nitrogen, oxygen, or sulfur.
- leaving group refers to an atom (or a group of atoms) with electron withdrawing ability that can be displaced as a stable species, taking with it the bonding electrons.
- suitable leaving groups include halides and sulfonate esters, including, but not limited to, triflate, mesylate, tosylate, and brosylate.
- hydrolysable group and “hydrolysable moiety” refer to a functional group capable of undergoing hydrolysis, e.g., under basic or acidic conditions.
- hydrolysable residues include, without limitation, acid halides, activated carboxylic acids, and various protecting groups known in the art (see, for example, “Protective Groups in Organic Synthesis,” T. W. Greene, P. G. M. Wuts, Wiley-Interscience, 1999).
- organic residue defines a carbon-containing residue, i.e., a residue comprising at least one carbon atom, and includes but is not limited to the carbon-containing groups, residues, or radicals defined hereinabove.
- Organic residues can contain various heteroatoms, or be bonded to another molecule through a heteroatom, including oxygen, nitrogen, sulfur, phosphorus, or the like. Examples of organic residues include but are not limited alkyl or substituted alkyls, alkoxy or substituted alkoxy, mono or di-substituted amino, amide groups, etc.
- Organic residues can preferably comprise 1 to 18 carbon atoms, 1 to 15, carbon atoms, 1 to 12 carbon atoms, 1 to 8 carbon atoms, 1 to 6 carbon atoms, or 1 to 4 carbon atoms.
- an organic residue can comprise 2 to 18 carbon atoms, 2 to 15, carbon atoms, 2 to 12 carbon atoms, 2 to 8 carbon atoms, 2 to 4 carbon atoms, or 2 to 4 carbon atoms.
- a very close synonym of the term “residue” is the term “radical,” which as used in the specification and concluding claims, refers to a fragment, group, or substructure of a molecule described herein, regardless of how the molecule is prepared.
- a 2,4-thiazolidinedione radical in a particular compound has the structure: regardless of whether thiazolidinedione is used to prepare the compound.
- the radical for example an alkyl
- the number of atoms in a given radical is not critical to the present invention unless it is indicated to the contrary elsewhere herein.
- Organic radicals contain one or more carbon atoms.
- An organic radical can have, for example, 1-26 carbon atoms, 1-18 carbon atoms, 1-12 carbon atoms, 1-8 carbon atoms, 1-6 carbon atoms, or 1-4 carbon atoms.
- an organic radical can have 2-26 carbon atoms, 2-18 carbon atoms, 2-12 carbon atoms, 2-8 carbon atoms, 2-6 carbon atoms, or 2-4 carbon atoms.
- Organic radicals often have hydrogen bound to at least some of the carbon atoms of the organic radical.
- an organic radical that comprises no inorganic atoms is a 5, 6, 7, 8-tetrahydro-2- naphthyl radical.
- an organic radical can contain 1-10 inorganic heteroatoms bound thereto or therein, including halogens, oxygen, sulfur, nitrogen, phosphoms, and the like.
- organic radicals include but are not limited to an alkyl, substituted alkyl, cycloalkyl, substituted cycloalkyl, mono-substituted amino, di- substituted amino, acyloxy, cyano, carboxy, carboalkoxy, alkylcarboxamide, substituted alkylcarboxamide, dialkylcarboxamide, substituted dialkylcarboxamide, alkylsulfonyl, alkylsulfinyl, thioalkyl, thiohaloalkyl, alkoxy, substituted alkoxy, haloalkyl, haloalkoxy, aryl, substituted aryl, heteroaryl, heterocyclic, or substituted heterocyclic radicals, wherein the terms are defined elsewhere herein.
- organic radicals that include heteroatoms include alkoxy radicals, trifluoromethoxy radicals, acetoxy radicals, dimethylamino radicals and the like.
- a formula with chemical bonds shown only as solid lines and not as wedges or dashed lines contemplates each possible isomer, e.g., each enantiomer and diastereomer, and a mixture of isomers, such as a racemic or scalemic mixture.
- Compounds described herein can contain one or more asymmetric centers and, thus, potentially give rise to diastereomers and optical isomers.
- the present invention includes all such possible diastereomers as well as their racemic mixtures, their substantially pure resolved enantiomers, all possible geometric isomers, and pharmaceutically acceptable salts thereof. Mixtures of stereoisomers, as well as isolated specific stereoisomers, are also included.
- the products of such procedures can be a mixture of stereoisomers.
- a specific stereoisomer can also be referred to as an enantiomer, and a mixture of such isomers is often called an enantiomeric mixture.
- a 50:50 mixture of enantiomers is referred to as a racemic mixture.
- Many of the compounds described herein can have one or more chiral centers and therefore can exist in different enantiomeric forms. If desired, a chiral carbon can be designated with an asterisk (*). When bonds to the chiral carbon are depicted as straight lines in the disclosed formulas, it is understood that both the (R) and (S) configurations of the chiral carbon, and hence both enantiomers and mixtures thereof, are embraced within the formula.
- one of the bonds to the chiral carbon can be depicted as a wedge (bonds to atoms above the plane) and the other can be depicted as a series or wedge of short parallel lines is (bonds to atoms below the plane).
- the Cahn-Ingold-Prelog system can be used to assign the (R) or (S) configuration to a chiral carbon.
- the enantiomers can be resolved by methods known to those skilled in the art, such as formation of diastereoisomeric salts which may be separated, for example, by crystallization (see, CRC Handbook of Optical Resolutions via Diastereomeric Salt Formation by David Kozma (CRC Press, 2001)); formation of diastereoisomeric derivatives or complexes which may be separated, for example, by crystallization, gas-liquid or liquid chromatography; selective reaction of one enantiomer with an enantiomer-specific reagent, for example enzymatic esterification; or gas-liquid or liquid chromatography in a chiral environment, for example on a chiral support for example silica with a bound chiral ligand or in the presence of a chiral solvent.
- a further step can liberate the desired enantiomeric form.
- specific enantiomers can be synthesized by asymmetric synthesis using optically active reagents, substrates, catalysts or solvents, or by converting one enantiomer into the other by asymmetric transformation.
- Designation of a specific absolute configuration at a chiral carbon in a disclosed compound is understood to mean that the designated enantiomeric form of the compounds can be provided in enantiomeric excess (e.e.).
- Enantiomeric excess is the presence of a particular enantiomer at greater than 50%, for example, greater than 60%, greater than 70%, greater than 75%, greater than 80%, greater than 85%, greater than 90%, greater than 95%, greater than 98%, or greater than 99%.
- the designated enantiomer is substantially free from the other enantiomer.
- the “R : forms of the compounds can be substantially free from the “S” forms of the compounds and are, thus, in enantiomeric excess of the “S” forms.
- “S” forms of the compounds can be substantially free of “R” forms of the compounds and are, thus, in enantiomeric excess of the “R” forms.
- a disclosed compound When a disclosed compound has two or more chiral carbons, it can have more than two optical isomers and can exist in diastereoisomeric forms. For example, when there are two chiral carbons, the compound can have up to four optical isomers and two pairs of enantiomers ((S,S)/(R,R) and (R,S)/(S,R)).
- the pairs of enantiomers e.g., (S,S)/(R,R)
- the stereoisomers that are not mirror-images e.g., (S,S) and (R,S) are diastereomers.
- diastereoisomeric pairs can be separated by methods known to those skilled in the art, for example chromatography or crystallization and the individual enantiomers within each pair may be separated as described above. Unless otherwise specifically excluded, a disclosed compound includes each diastereoisomer of such compounds and mixtures thereof.
- the compounds according to this disclosure may form prodrugs at hydroxyl or amino functionalities using alkoxy, amino acids, etc., groups as the prodrug forming moieties.
- the hydroxymethyl position may form mono-, di-, or triphosphates and again these phosphates can form prodrugs.
- Preparations of such prodrug derivatives are discussed in various literature sources (examples are: Alexander et al., J. Med. Chem. 1988, 31, 318; Aligas-Martin et al., PCT WO 2000/041531, p. 30).
- the nitrogen function converted in preparing these derivatives is one (or more) of the nitrogen atoms of a compound of the disclosure.
- “Derivatives” of the compounds disclosed herein are pharmaceutically acceptable salts, prodrugs, deuterated forms, radio-actively labeled forms, isomers, solvates and combinations thereof.
- the “combinations” mentioned in this context refer to derivatives falling within at least two of the groups: pharmaceutically acceptable salts, prodrugs, deuterated forms, radio-actively labeled forms, isomers, and solvates.
- Examples of radio- actively labeled forms include compounds labeled with tritium, phosphorous-32, iodine- 129, carbon-11, fluorine- 18, and the like.
- Compounds described herein comprise atoms in both their natural isotopic abundance and in non-natural abundance.
- the disclosed compounds can be isotopically- labeled or isotopically-substituted compounds identical to those described, but for the fact that one or more atoms are replaced by an atom having an atomic mass or mass number different from the atomic mass or mass number typically found in nature.
- isotopes that can be incorporated into compounds of the invention include isotopes of hydrogen, carbon, nitrogen, oxygen, phosphorous, fluorine and chlorine, such as 2 H, 3 H, 13 C, 14 C, 15 N, 18 O, 17 O, 35 S, 18 F and 36 Cl, respectively.
- Compounds further comprise prodrugs thereof, and pharmaceutically acceptable salts of said compounds or of said prodrugs which contain the aforementioned isotopes and/or other isotopes of other atoms are within the scope of this invention.
- Certain isotopically-labeled compounds of the present invention for example those into which radioactive isotopes such as 3 H and 14 C are incorporated, are useful in drug and/or substrate tissue distribution assays. Tritiated, i.e., 3 H, and carbon-14, i.e., 14 C, isotopes are particularly preferred for their ease of preparation and detectability.
- isotopically labeled compounds of the present invention and prodrugs thereof can generally be prepared by carrying out the procedures below, by substituting a readily available isotopically labeled reagent for a non- isotopically labeled reagent.
- the compounds described in the invention can be present as a solvate.
- the solvent used to prepare the solvate is an aqueous solution, and the solvate is then often referred to as a hydrate.
- the compounds can be present as a hydrate, which can be obtained, for example, by crystallization from a solvent or from aqueous solution.
- one, two, three or any arbitrary number of solvent or water molecules can combine with the compounds according to the invention to form solvates and hydrates.
- the invention includes all such possible solvates.
- co-crystal means a physical association of two or more molecules which owe their stability through non-covalent interaction.
- One or more components of this molecular complex provide a stable framework in the crystalline lattice.
- the guest molecules are incorporated in the crystalline lattice as anhydrates or solvates, see e.g. “Crystal Engineering of the Composition of Pharmaceutical Phases. Do Pharmaceutical Co-crystals Represent a New Path to Improved Medicines?” Almarasson, O., et. al., The Royal Society of Chemistry, 1889-1896, 2004.
- Examples of co-crystals include p- toluenesulfonic acid and benzenesulfonic acid.
- certain compounds described herein can be present as an equilibrium of tautomers. For example, ketones with an a-hydrogen can exist in an equilibrium of the keto form and the enol form.
- amides with an N-hydrogen can exist in an equilibrium of the amide form and the imidic acid form.
- pyrazoles can exist in two tautomeric forms, N 1 -unsubstituted, 3-A 3 and N 1 -unsubstituted, 5-A 3 as shown below.
- the invention includes all such possible tautomers.
- polymorphic forms or modifications It is known that chemical substances form solids, which are present in different states of order which are termed polymorphic forms or modifications.
- the different modifications of a polymorphic substance can differ greatly in their physical properties.
- the compounds according to the invention can be present in different polymorphic forms, with it being possible for particular modifications to be metastable. Unless stated to the contrary, the invention includes all such possible polymorphic forms.
- a structure of a compound can be represented by a formula: which is understood to be equivalent to a formula: wherein n is typically an integer. That is, R" is understood to represent five independent substituents, R n(a) , R n(b) , R n(c) , R n(d) , R n(e) .
- independent substituents it is meant that each R substituent can be independently defined. For example, if in one instance R n(a) is halogen, then R n(b) is not necessarily halogen in that instance.
- Certain materials, compounds, compositions, and components disclosed herein can be obtained commercially or readily synthesized using techniques generally known to those of skill in the art.
- the starting materials and reagents used in preparing the disclosed compounds and compositions are either available from commercial suppliers such as Aldrich Chemical Co., (Milwaukee, Wis.), Acros Organics (Morris Plains, N.J.), Strem Chemicals (Newburyport, MA), Fisher Scientific (Pittsburgh, Pa.), or Sigma (St.
- compositions of the invention Disclosed are the components to be used to prepare the compositions of the invention as well as the compositions themselves to be used within the methods disclosed herein. These and other materials are disclosed herein, and it is understood that when combinations, subsets, interactions, groups, etc. of these materials are disclosed that while specific reference of each various individual and collective combinations and permutation of these compounds cannot be explicitly disclosed, each is specifically contemplated and described herein. For example, if a particular compound is disclosed and discussed and a number of modifications that can be made to a number of molecules including the compounds are discussed, specifically contemplated is each and every combination and permutation of the compound and the modifications that are possible unless specifically indicated to the contrary.
- o is 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein p is 1 or 2; wherein t is an integer from 0 to 500; wherein v is 1, 2, 3, 4, or 5; wherein A is S or Se; wherein R 1 is selected from -CO 2 H, - C(O)NHOH, -SO 2 NH 2 , -SO 2 NHC(O)CH 3 , -SO 3 H, -NHC(O)NHSO 2 CH 3 , -P(O)(OH) 2 , and a structure selected from:
- R 4 is selected from hydrogen and methyl; wherein each occurrence of R 5 and R 5 , when present, is independently a residue of a side chain of amino acid; wherein each occurrence of R 6 and R 6 ’, when present, is independently selected from hydrogen and methyl, or wherein R 6 or R 6 ’ is covalently bonded to R 5 or R 5 ’, respectively, and, together with the intermediate atoms, comprise an unsubstituted 5-membered heterocycle; wherein each of R 7a and R 7b , when present, is independently selected from hydrogen and C1-C4 alkyl; and wherein R 8 is selected from hydrogen and methyl, provided that the compound is not PapA.
- R 4 is selected from hydrogen and methyl; wherein each occurrence of R 5 and R 5 , when present, is independently a residue of a side chain of amino acid; wherein each occurrence of R 6 and R 6 ’, when present, is independently selected from hydrogen and methyl, or wherein R 6 or R 6 ’ is covalently bonded to R 5 or R 5 ’, respectively, and, together with the intermediate atoms, comprise an unsubstituted 5-membered heterocycle; wherein each of R 7a and R 7b , when present, is independently selected from hydrogen and C1-C4 alkyl; and wherein R 8 is selected from hydrogen and methyl, provided that the compound is not PapA.
- R 2 is a residue of a side chain of amino acid, provided that the amino acid is not isoleucine or threonine; wherein each of R 3a and R 3b , when present, is independently selected from C2-C5 alkynyl, C1-C5 azido, and a residue of a side chain of an amino acid; wherein R 4 is selected from hydrogen and methyl; wherein each occurrence of R 5 and R 5 , when present, is independently a residue of a side chain of amino acid; wherein each occurrence of R 6 and R 6 ’, when present, is independently selected from hydrogen and methyl, or wherein R 6 or R 6 is covalently bonded to R 5 or R 5 , respectively, and, together with the intermediate atoms, comprise an unsubstituted 5-membered heterocycle; wherein each of R 7a and R 7b , when present, is independently selected from hydrogen and C1-C4 alkyl, provided that the compound is not PapA.
- o is independently 0, 1, 2, 3, 4, 5, 6, or 7.
- t 0.
- v is 1 or 2.
- R 1 is -CO 2 H or a structure:
- R 1 is -CO 2 H.
- the cleavable moiety is -CO 2 -(C4-C8 alkylene)-OC(O)-.
- the cleavable moiety is a protease recognition sequence.
- the protease recognition sequence is TEV recognition sequence.
- the compound comprises one or more D-amino acid residues. In a further aspect, the compound comprises one or more ⁇ -amino acid residues. In a still further aspect, the compound comprises one or more N-methylated amino acid residues.
- PapB installs a single thioether linkage in the compound.
- PapB installs two or more thioether linkages in the compound.
- the compound has a structure represented by a formula:
- the compound has a structure represented by a formula:
- n is 0. In a further aspect, m is 1.
- n is 0. In a further aspect, n is 1.
- o is 0, 1, 2, 3, 4, 5, 6, or 7. In a further aspect, o is 1, 2, 3,
- o 1, 2, 3, or 4.
- p is 1. In a further aspect, p is 2.
- A is S. In a further aspect, A is Se.
- L is C2-C4 alkyl. In a further aspect, L is -(C1-C4 alkyl)(OCH 2 CH 2 ) q . In a still further aspect, L is a structure selected from:
- the cleavable moiety is a protease recognition sequence.
- the protease recognition sequence is a TEV protease recognition sequence.
- the TEV protease recognition sequence is EXLYZQ (SEQ ID NO: 1), in which X is any amino acid and Z is any amino acid that contains a hydrophobic residue.
- the TEV protease recognition sequence is ENLYFQ (SEQ ID NO: 1).
- the leader sequence is LKQINVIAGVKEPIRAYG (SEQ ID NO: 2) or LKQINVIAGVKPIRAYG (SEQ ID NO: 3).
- the leader sequence is LKQINVIAGVKEPIRAYG (SEQ ID NO: 2).
- R 1 is selected from -CO 2 H and a structure:
- R 1 is CO 2 H.
- R 2 is a residue of a side chain of an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
- R 2 is a residue of a side chain of an amino acid selected from alanine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, and glycine.
- one of R 3a and R 3b when present, is hydrogen, and one of R 3a and R 3b , when present, is a residue of a side chain of an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
- an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
- R 4 is hydrogen. In a further aspect, R 4 is methyl.
- each occurrence of R 5 when present, is independently a residue of a side chain of an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
- each occurrence of R 6 when present, is hydrogen. In a further aspect, each occurrence of R 6 , when present, is methyl. [00240] In various aspects, each of R 7a and R 7b , when present, is hydrogen. In a further aspect, each of R 7a and R 7b , when present, is methyl.
- the compound has a structure represented by a formula:
- the compound has a structure represented by a formula:
- the compound has a structure represented by a formula:
- the compound has a structure represented by a formula:
- the compound has a structure represented by a formula:
- the compound has a structure represented by a formula:
- the compound has a structure represented by a formula:
- the compound has a structure represented by a formula:
- o is 1, 2, 3, 4, 5, 6, 7, 8, or 9.
- one of R 3a and R 3b when present, is hydrogen, and one of R 3a and R 3b , when present, is a residue of a side chain of an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
- an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
- the compound has a structure represented by a formula:
- the compound has a structure represented by a formula: [00252] In various aspects, the compound has a structure represented by a formula:
- the compound has a structure represented by a formula:
- o is 1, 2, 3, 4, 5, 6, 7, 8, or 9.
- the compound has a structure represented by a formula: wherein r is 2, 3, or 4.
- the compound has a structure represented by a formula: wherein s is 1 or 2.
- the compound has a structure represented by a formula: c. THIOETHER COMPOUNDS
- thioether compounds produced by a disclosed method.
- the method produces a thioether compound having a structure represented by a formula: wherein v’ is 0, 1, 2, or 3.
- the method further comprises addition of a reducing agent.
- the method further comprises addition of a protease.
- the method produces a thioether compound having a structure represented by a formula: wherein v’ is 0, 1, 2, or 3.
- the thioether compound is selected from:
- the method produces a thioether compound having a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula: [00266] In various aspects, the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula selected from:
- the thioether compound has a structure represented by a formula selected from:
- the thioether compound has a structure represented by a formula selected from:
- the thioether compound has a structure represented by a formula selected from:
- the thioether compound is a sactipeptide.
- the sactipeptide has a structure represented by a formula selected from:
- the thioether compound is a ranthipeptide.
- the ranthipeptide has a structure represented by a formula selected from:
- the method produces a thioether compound having a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula selected from:
- the thioether compound has a structure represented by a formula selected from:
- the thioether compound has a structure represented by a formula selected from:
- the thioether compound has a structure represented by a formula selected from:
- the thioether compound is a sactipeptide.
- the sactipeptide has a structure represented by a formula selected from:
- the thioether compound is a ranthipeptide.
- the ranthipeptide has a structure represented by a formula selected from:
- the thioether compound is selected from:
- the thioether compound is selected from:
- thioether compounds prepared by a disclosed method, wherein the thioether compound is an analog of a peptide therapeutic.
- exemplary peptide therapeutics include, but are not limited to, octreotide, setmalanotide, romidepsin, bremelanotide, pramlintide, oxytocin, setmelanotide, or cyclosporin.
- R 2 is a residue of a side chain of amino acid, provided that the amino acid is not isoleucine or threonine; wherein each of R 3a and R 3b , when present, is independently selected from C2-C5 alkynyl, C1-C5 azido, and a residue of a side chain of an amino acid; wherein R 4 is selected from hydrogen and methyl; wherein each occurrence of R 5 and R 5 , when present, is independently a residue of a side chain of amino acid; wherein each occurrence of R 6 and R 6 , when present, is independently selected from hydrogen and methyl, or wherein R 6 or R 6 is covalently bonded to R 5 or R 5 , respectively, and, together with the intermediate atoms, comprise an unsubstituted 5-membered heterocycle; wherein each of R 7a and R 7b , when present, is independently selected from hydrogen and C1-C4 alkyl, provided that the compound is not PapA.
- o is independently 0, 1, 2, 3, 4, 5, 6, or 7.
- t is 0.
- v is 1 or 2.
- R 1 is -CO 2 H or a structure:
- R 1 is -CO 2 H.
- the cleavable moiety is a chemically cleavable moiety.
- Exemplary chemical cleavable moieties include, but are not limited to, -CO 2 -(C4-C8 alkylene)-OC(O)-.
- the cleavable moiety is an enzymatically cleavable moiety such as, for example, a protease recognition sequence.
- the protease recognition sequence is TEV recognition sequence.
- the compound comprises one or more D-amino acid residues. In a further aspect, the compound comprises one or more ⁇ -amino acid residues. In a still further aspect, the compound comprises one or more N-methylated amino acid residues.
- PapB installs a single thioether linkage in the compound.
- PapB installs two or more thioether linkages in the compound.
- the compound has a structure represented by a formula:
- the compound has a structure represented by a formula:
- the method produces a thioether compound having a structure represented by a formula: wherein v’ is 0, 1, 2, or 3.
- the method further comprises addition of a reducing agent.
- the method further comprises addition of a protease.
- the method produces a thioether compound having a structure represented by a formula: wherein v’ is 0, 1, 2, or 3.
- the thioether compound is selected from:
- n is 0. In a further aspect, m is 1.
- n is 0. In a further aspect, n is 1.
- o is 0, 1, 2, 3, 4, 5, 6, or 7. In a further aspect, o is 1, 2, 3,
- o is 1, 2, 3, or 4.
- p is 1. In a further aspect, p is 2.
- A is S. In a further aspect, A is Se.
- L is C2-C4 alkyl. In a further aspect, L is -(C1-C4 alkyl)(OCH 2 CH 2 ) q . In a still further aspect, L is a structure selected from:
- the cleavable moiety is a protease recognition sequence.
- the protease recognition sequence is a TEV protease recognition sequence.
- the TEV protease recognition sequence is EXLYZQ (SEQ ID NO: 1), in which X is any amino acid and Z is any amino acid that contains a hydrophobic residue.
- the TEV protease recognition sequence is ENLYFQ (SEQ ID NO: 1).
- the leader sequence is LKQINVIAGVKEPIRAYG (SEQ ID NO: 2) or LKQINVIAGVKPIRAYG (SEQ ID NO: 3).
- the leader sequence is LKQINVIAGVKEPIRAYG (SEQ ID NO: 2).
- R 1 is selected from -CO 2 H and a structure:
- R 1 is -CO 2 H.
- R 2 is a residue of a side chain of an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
- R 2 is a residue of a side chain of an amino acid selected from alanine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, and glycine.
- one of R 3a and R 3b when present, is hydrogen, and one of R 3a and R 3b , when present, is a residue of a side chain of an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
- an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
- R 4 is hydrogen. In a further aspect, R 4 is methyl.
- each occurrence of R 5 when present, is independently a residue of a side chain of an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
- each occurrence of R 6 when present, is hydrogen. In a further aspect, each occurrence of R 6 , when present, is methyl.
- each of R 7a and R 7b when present, is hydrogen. In a further aspect, each of R 7a and R 7b , when present, is methyl.
- the compound has a structure represented by a formula:
- the compound has a structure represented by a formula:
- the compound has a structure represented by a formula:
- the compound has a structure represented by a formula:
- the compound has a structure represented by a formula:
- the compound has a structure represented by a formula:
- the compound has a structure represented by a formula:
- the compound has a structure represented by a formula:
- o is 1, 2, 3, 4, 5, 6, 7, 8, or 9.
- one of R 3a and R 3b when present, is hydrogen, and one of R 3a and R 3b , when present, is a residue of a side chain of an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
- an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
- the compound has a structure represented by a formula: [00343] In various aspects, the compound has a structure represented by a formula:
- the compound has a structure represented by a formula:
- the compound has a structure represented by a formula:
- o is 1, 2, 3, 4, 5, 6, 7, 8, or 9.
- the compound has a structure represented by a formula: wherein r is 2, 3, or 4.
- the compound has a structure represented by a formula: wherein s is 1 or 2. [00348] In various aspects, the compound has a structure represented by a formula:
- PapB installs a single thioether linkage in the compound. In a further aspect, PapB installs two or more thioether linkages in the compound.
- the method produces a thioether compound having a structure represented by a formula:
- the thioether compound has a structure represented by a formula: [00352] In various aspects, the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula selected from:
- the thioether compound has a structure represented by a formula selected from:
- the thioether compound has a structure represented by a formula selected from:
- the thioether compound has a structure represented by a formula selected from:
- the thioether compound is a sactipeptide.
- the sactipeptide has a structure represented by a formula selected from:
- the thioether compound is a ranthipeptide.
- the ranthipeptide has a structure represented by a formula selected from: [00366]
- the method further comprises addition of a reducing agent.
- the reducing agent comprises dithionite, flavodoxin, flavodoxin reductase, titanium citrate, reduced nicotinamide adenine dinucleotide phosphate, or any combination thereof.
- the method further comprises addition of a protease.
- the protease is TEV protease.
- the method produces a thioether compound having a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula:
- the thioether compound has a structure represented by a formula selected from:
- the thioether compound has a structure represented by a formula selected from:
- the thioether compound has a structure represented by a formula selected from:
- the thioether compound has a structure represented by a formula selected from:
- the thioether compound is a sactipeptide.
- the sactipeptide has a structure represented by a formula selected from:
- the thioether compound is a ranthipeptide.
- the ranthipeptide has a structure represented by a formula selected from:
- the thioether compound is selected from:
- the thioether compound is selected from:
- the peptide sequence comprises octreotide or vapreotide. In a yet further aspect, the peptide sequence comprises octreotide. In a yet further aspect, the peptide sequence comprises vapreotide.
- the peptide sequence comprises D FCF D WKTET (SEQ ID NO: 3), wherein the first and fourth positions are D-amino acids. [00391] In a further aspect, the peptide sequence comprises FCFAKTETA.
- the peptide sequence further comprises a leader sequence of LKQINVIAGVKEPIRAYG (SEQ ID NO: 2) or LKQINVIAGVKPIRAYG (SEQ ID NO: 3). In a further aspect, the peptide sequence further comprises a leader sequence of LKQINVIAGVKEPIRAYG (SEQ ID NO: 3).
- the peptide sequence further comprises a TEV protease recognition sequence.
- the TEV protease recognition sequence is EXLYZQ (SEQ ID NO: 1), in which X is any amino acid and Z is any amino acid that contains a hydrophobic residue.
- the TEV protease recognition sequence is ENLYFQ (SEQ ID NO: 1).
- the peptide sequence comprises one or more D-amino acid residues.
- the peptide sequence comprises one or more ⁇ -amino acid residues.
- the peptide sequence comprises one or more N-methylated amino acids.
- the modified PapA sequence comprises minimal substrate PapA.
- the modified PapA sequence is LKQINVIAGVKEPIRAYGCDSNNAANA (SEQ ID NO: 6), LKQINVIAGVKEPIRAYGCSDNNAAA (SEQ ID NO: 7), LKQINVIAGVKEPIRAYGCSNDAAA (SEQ ID NO: 8), LKQINVIAGVKEPIRAYGCSAANDA (SEQ ID NO: 9), LKQINVIAGVKEPIRAYGCSAAANDA (SEQ ID NO: 10), or LKQINVIAGVKEPIRAYGCSAAAANDA (SEQ ID NO: 11).
- the modified PapA sequence is LKQINVIAGVKEPIRAYGCDSNNAANA (SEQ ID NO: 6). In a still further aspect, the modified PapA sequence is LKQINVIAGVKEPIRAYGCSDNNAAA (SEQ ID NO: 7). In a still further aspect, the modified PapA sequence is LKQINVIAGVKEPIRAYGCSNDAA A (SEQ ID NO: 8). In a still further aspect, the modified PapA sequence is LKQINVIAGVKEPIRAYGCSAANDA (SEQ ID NO: 9). In a still further aspect, the modified PapA sequence is LKQINVIAGVKEPIRAYGCSAAANDA (SEQ ID NO: 10).
- the modified PapA sequence is LKQINVIAGVKEPIRAYGCSAAAANDA (SEQ ID NO: 11).
- the modified PapA sequence is LKQINVIAGVKEPIRAYGAAACSANDA (SEQ ID NO: 12), LKQINVIAGVKEPIRAYGAAACSANDACSANDA (SEQ ID NO: 13), LKQINVIAGVKEPIRAYGAAACSACDAADA (SEQ ID NO: 14), LKQINVIAGVKEPIRAYGAAAASACDAADA (SEQ ID NO: 15), or LKQINVIAGVKEPIRAYGAAACSAADAAADA (SEQ ID NO: 16).
- the modified PapA sequence is LKQINVIAGVKEPIRAYGAAACSANDA (SEQ ID NO: 12). In a still further aspect, the modified PapA sequence is LKQINVIAGVKEPIRAYGAAACSANDACSANDA (SEQ ID NO: 13). In a still further aspect, the modified PapA sequence is LKQINVIAGVKEPIRAYGAAACSACDAADA (SEQ ID NO: 14). In a still further aspect, the modified PapA sequence is LKQINVIAGVKEPIRAYGAAAASACDAADA (SEQ ID NO: 15).
- the modified PapA sequence is LKQINVIAGVKEPIRAYGAAACSAADAAADA (SEQ ID NO: 16). [00401] In a further aspect, the modified PapA sequence comprises one or more D- amino acid residues.
- the modified PapA sequence comprises one or more ⁇ - amino acid residues.
- the modified PapA sequence comprises one or more N- methylated amino acid residues.
- X is a penicillamine or an amino acid residue comprising a -SH group or an amino acid residue comprising a -SeH group.
- X is a penicillamine.
- X is an amino acid residue comprising a -SH group.
- X is cysteine, homocysteine, D-cysteine, or D-homocysteine.
- X is homocysteine.
- X is D-cysteine.
- X is D-homocysteine.
- X is cysteine.
- X is an amino acid residue comprising a -SeH group.
- X is selenocysteine or homoselenocysteine.
- X is selenocysteine.
- X is homoselenocysteine.
- amino acid residues include, but are not limited to, natural amino acid residues, unnatural amino acid residues, D-amino acid residues, ⁇ -amino acid residues, and N-methylated amino acid residues.
- Natural amino acid residues may include glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan, serine, cysteine, threonine, tyrosine, asparagine, glutamine, aspartic acid, glutamic acid, arginine, histidine, and lysine.
- Unnatural amino acid residues may include, but are not limited to, p- ethylthiocarbonyl-L-phenylalanine, p-(3-oxobutanoyl)-L-phenylalanine, 1 ,5-dansyl-alanine, 7-amino-coumarin amino acid, 7-hydroxy-coumarin amino acid, nitrobenzyl-serine, O-(2- nitrobenzyl)-L-tyrosine, p-carboxymethyl-L-phenylalanine, p-cyano-L-phenylalanine, m- cyano-L-phenylalanine, biphenylalanine, 3-amino-L-tyrosine, bipyridyl alanine, p-(2-amino- 1 -hydroxyethyl)-L-phenylalanine, p-isopropylthiocarbonyl-L-phenylalanine, 3-nitro-
- Y n comprises one or more D-amino acids.
- Y n comprises one or more ⁇ -amino acids.
- Y n comprises one or more N-methylated amino acids.
- n is 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9. In a further aspect, n is 0, 1,
- n is 0, 1, 2, 3, 4, 5, 6, 7, or 8.
- n is 0, 1, 2, 3, 4, 5, 6, or 7.
- n is 0, 1, 2, 3, 4, 5, or 6.
- n is 0, 1, 2, 3, 4, or 5.
- n is 0, 1, 2, 3, or 4.
- n is 0, 1, 2, or 3.
- n is 0, 1, or 2.
- n is 0 or 1.
- n is 0.
- n is 1.
- n is 2.
- n is 3.
- n is 4.
- n is 5.
- n is 6.
- n is 7.
- n is 8.
- n is 9. c. Z GROUPS
- Z is an aspartic acid residue, a glutamic acid residue, a hydroxy-glutamic acid residue, 2-amino-3-(2H-tetrazol-5-yl)propanoic acid, or a carboxyl- functionalized amino acid residue.
- Z is aspartic acid or glutamic acid. In a yet further aspect, Z is aspartic acid. In a yet further aspect, Z is glutamic acid.
- Z is a hydroxy-glutamic acid residue.
- Z is 2-amino-3-(2H-tetrazol-5-yl)propanoic acid.
- Z is a a carboxyl-functionalized amino acid residue.
- carboxyl-functionalized amino acid residues include, but are not limited to, (2S,3S)-2-amino-3-methylsuccinic acid, (2S,3R)-2-amino-3 -methylsuccinic acid, (2S,3S)-2- amino-3-methylpentanedioic acid, (2S,3R)-2-amino-3-methylpentanedioic acid, (2S,4S)-2- amino-4-methylpentanedioic acid, (2S,4R)-2-amino-4-methylpentanedioic acid, and homoglutamic acid.
- (2S,3S)-2-amino-3-methylsuccinic acid (2S,3R)-2-amino-3 -methylsuccinic acid
- (2S,3S)-2- amino-3-methylpentanedioic acid (2S,3R)-2-amino-3-methylpentanedioic acid
- amino acid residues include, but are not limited to, natural amino acid residues, unnatural amino acid residues, D-amino acid residues, ⁇ -amino acid residues, and N-methylated amino acid residues.
- Natural amino acid residues may include glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan, serine, cysteine, threonine, tyrosine, asparagine, glutamine, aspartic acid, glutamic acid, arginine, histidine, and lysine.
- Unnatural amino acid residues may include, but are not limited to, p- ethylthiocarbonyl-L-phenylalanine, p-(3-oxobutanoyl)-L-phenylalanine, 1 ,5-dansyl-alanine, 7-amino-coumarin amino acid, 7-hydroxy-coumarin amino acid, nitrobenzyl-serine, O-(2- nitrobenzyl)-L-tyrosine, p-carboxymethyl-L-phenylalanine, p-cyano-L-phenylalanine, m- cyano-L-phenylalanine, biphenylalanine, 3-amino-L-tyrosine, bipyridyl alanine, p-(2-amino-
- Y a comprises one or more D-amino acids.
- Y a comprises one or more ⁇ -amino acids.
- Y a comprises one or more N-methylated amino acids.
- a is 0, 1, 2, 3, 4, 5, 6, or 7. In a further aspect, a is 0, 1, 2, 3,
- a is 0, 1, 2, 3, 4, or 5. In a still further aspect, a is 0, 1, 2, 3, or 4. In a still further aspect, a is 0, 1, 2, or 3. In a still further aspect, a is 0, 1, or 2. In a yet further aspect, a is 0 or 1. In a yet further aspect, a is 0. In a yet further aspect, a is 1. In a yet further aspect, a is 2. In a yet further aspect, a is 3. In a yet further aspect, a is 4. In a yet further aspect, a is 5. In a yet further aspect, a is 6. In a yet further aspect, a is 7. e. Y B GROUPS
- amino acid residues include, but are not limited to, natural amino acid residues, unnatural amino acid residues, D-amino acid residues, ⁇ -amino acid residues, and N-methylated amino acid residues.
- Natural amino acid residues may include glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan, serine, cysteine, threonine, tyrosine, asparagine, glutamine, aspartic acid, glutamic acid, arginine, histidine, and lysine.
- Unnatural amino acid residues may include, but are not limited to, p- ethylthiocarbonyl-L-phenylalanine, p-(3-oxobutanoyl)-L-phenylalanine, 1 ,5-dansyl-alanine, 7-amino-coumarin amino acid, 7-hydroxy-coumarin amino acid, nitrobenzyl-serine, O-(2- nitrobenzyl)-L-tyrosine, p-carboxymethyl-L-phenylalanine, p-cyano-L-phenylalanine, m- cyano-L-phenylalanine, biphenylalanine, 3-amino-L-tyrosine, bipyridyl alanine, p-(2-amino-
- Yb comprises one or more D-amino acids.
- Yb comprises one or more ⁇ -amino acids.
- Yb comprises one or more N-methylated amino acids.
- b is 0, 1, 2, 3, 4, 5, 6, or 7. In a further aspect, b is 0, 1, 2,
- b is 0, 1, 2, 3, 4, or 5. In a still further aspect, b is 0, 1, 2, 3, or 4. In a still further aspect, b is 0, 1, 2, or 3. In a still further aspect, b is 0, 1, or 2. In a yet further aspect, b is 0 or 1. In a yet further aspect, b is 0. In a yet further aspect, b is 1. In a yet further aspect, b is 2. In a yet further aspect, b is 3. In a yet further aspect, b is 4. In a yet further aspect, b is 5. In a yet further aspect, b is 6. In a yet further aspect, b is 7. f. Y x GROUPS
- amino acid residues include, but are not limited to, natural amino acid residues, unnatural amino acid residues, D-amino acid residues, ⁇ -amino acid residues, and N-methylated amino acid residues.
- Natural amino acid residues may include glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan, serine, cysteine, threonine, tyrosine, asparagine, glutamine, aspartic acid, glutamic acid, arginine, histidine, and lysine.
- Unnatural amino acid residues may include, but are not limited to, p- ethylthiocarbonyl-L-phenylalanine, p-(3-oxobutanoyl)-L-phenylalanine, 1 ,5-dansyl-alanine, 7-amino-coumarin amino acid, 7-hydroxy-coumarin amino acid, nitrobenzyl-serine, O-(2- nitrobenzyl)-L-tyrosine, p-carboxymethyl-L-phenylalanine, p-cyano-L-phenylalanine, m- cyano-L-phenylalanine, biphenylalanine, 3-amino-L-tyrosine, bipyridyl alanine, p-(2-amino-
- Y x comprises one or more D-amino acids.
- Y x comprises one or more ⁇ -amino acids.
- Y x comprises one or more N-methylated amino acids.
- x is 0, 1, 2, 3, 4, 5, or 6. In a still further aspect, x is 0, 1, 2,
- x is 0, 1, 2, 3, or 4. In a still further aspect, x is 0, 1, 2, or 3. In a still further aspect, x is 0, 1, or 2. In a yet further aspect, x is 0 or 1. In a yet further aspect, x is 0. In a yet further aspect, x is 1. In a yet further aspect, x is 2. In a yet further aspect, x is 3. In a yet further aspect, x is 4. In a yet further aspect, x is 5. In a yet further aspect, x is 6. g. Y Y GROUPS
- amino acid residues include, but are not limited to, natural amino acid residues, unnatural amino acid residues, D-amino acid residues, ⁇ -amino acid residues, and N-methylated amino acid residues.
- Natural amino acid residues may include glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan, serine, cysteine, threonine, tyrosine, asparagine, glutamine, aspartic acid, glutamic acid, arginine, histidine, and lysine.
- Unnatural amino acid residues may include, but are not limited to, p- ethylthiocarbonyl-L-phenylalanine, p-(3-oxobutanoyl)-L-phenylalanine, 1 ,5-dansyl-alanine, 7-amino-coumarin amino acid, 7-hydroxy-coumarin amino acid, nitrobenzyl-serine, O-(2- nitrobenzyl)-L-tyrosine, p-carboxymethyl-L-phenylalanine, p-cyano-L-phenylalanine, m- cyano-L-phenylalanine, biphenylalanine, 3-amino-L-tyrosine, bipyridyl alanine, p-(2-amino-
- Y y comprises one or more D-amino acids.
- Y y comprises one or more ⁇ -amino acids.
- Y y comprises one or more N-methylated amino acids.
- y is 0, 1, 2, 3, 4, 5, 6, 7, or 8. In a still further aspect, y is 0,
- y is 0, 1, 2, 3, 4, 5, 6, or 7.
- y is 0, 1, 2, 3, 4, 5, or 6.
- y is 0, 1, 2, 3, 4, or 5.
- y is 0, 1, 2, 3, or 4.
- y is 0, 1, 2, or 3.
- y is 0, 1, or 2.
- y is 0 or 1.
- y is 0.
- y is 1.
- y is 2.
- y is 3.
- y is 4.
- y is 5.
- y is 6.
- y is 7.
- y is 8. h. Y z GROUPS
- amino acid residues include, but are not limited to, natural amino acid residues, unnatural amino acid residues, D-amino acid residues, ⁇ -amino acid residues, and N-methylated amino acid residues.
- Natural amino acid residues may include glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan, serine, cysteine, threonine, tyrosine, asparagine, glutamine, aspartic acid, glutamic acid, arginine, histidine, and lysine.
- Unnatural amino acid residues may include, but are not limited to, p- ethylthiocarbonyl-L-phenylalanine, p-(3-oxobutanoyl)-L-phenylalanine, 1 ,5-dansyl-alanine, 7-amino-coumarin amino acid, 7-hydroxy-coumarin amino acid, nitrobenzyl-serine, O-(2- nitrobenzyl)-L-tyrosine, p-carboxymethyl-L-phenylalanine, p-cyano-L-phenylalanine, m- cyano-L-phenylalanine, biphenylalanine, 3-amino-L-tyrosine, bipyridyl alanine, p-(2-amino- 1 -hydroxyethyl)-L-phenylalanine, p-isopropylthiocarbonyl-L-phenylalanine, 3-nitro-
- Y z comprises one or more D-amino acids.
- Y z comprises one or more ⁇ -amino acids.
- Y z comprises one or more N-methylated amino acids.
- z is 0, 1, 2, 3, 4, 5, 6, or 7. In a further aspect, z is 0, 1, 2, 3,
- z is 0, 1, 2, 3, 4, or 5. In a still further aspect, z is 0, 1, 2, 3, or 4. In a still further aspect, z is 0, 1, 2, or 3. In a still further aspect, z is 0, 1, or 2. In a yet further aspect, z is 0 or 1. In a yet further aspect, z is 0. In a yet further aspect, z is 1. In a yet further aspect, z is 2. In a yet further aspect, z is 3. In a yet further aspect, z is 4. In a yet further aspect, z is 5. In a yet further aspect, z is 6. In a yet further aspect, z is 7.
- each disclosed derivative can be optionally further substituted. It is also contemplated that any one or more derivative can be optionally omitted from the invention. It is understood that a disclosed compound can be provided by the disclosed methods.
- the invention relates to product compounds having a structure selected from:
- the compound is:
- the compound is:
- the compounds of this invention can be prepared by employing reactions as shown in the following schemes, in addition to other standard manipulations that are known in the literature, exemplified in the experimental sections or clear to one skilled in the art. For clarity, examples having a single substituent are shown where multiple substituents are allowed under the definitions disclosed herein.
- Preferred methods include, but are not limited to, those described below.
- the invention relates to methods of chemically modifying a peptide sequence to install a thioether linkage, the method comprising reacting the peptide substrate with PapB.
- the peptide sequence further comprises a leader sequence of LKQINVIAGVKEPIRAYG (SEQ ID NO: 2) or LKQINVIAGVKPIRAYG (SEQ ID NO: 3).
- the leader sequence facilitates recognition of the full peptide sequence by PapB. However, the leader sequence is not required.
- the method further comprises addition of a protease.
- the peptide sequence comprises a protease recognition sequence.
- a protease in conjunction with a peptide sequence comprising a protease recognition sequence allows for cleavage of a desired product from the leader sequence.
- the protease is a TEV protease.
- TEV protease in conjunction with a peptide sequence comprising a TEV protease recognition sequence allows for cleavage of a desired product from the leader sequence.
- the peptide sequence comprises a TEV protease recognition sequence.
- the TEV protease recognition sequence is EXLYZQ (SEQ ID NO: 1), in which X is any amino acid and Z is any amino acid that contains a hydrophobic residue.
- the TEV protease recognition sequence is ENLYFQ (SEQ ID NO: 1).
- the method further comprises addition of a reducing agent.
- reducing agents include, but are not limited to, dithionite, flavodoxin, flavodoxin reductase, titanium citrate, reduced nicotinamide adenine dinucleotide phosphate, and Hantzsch esters.
- the reducing agent comprises comprises dithionite, flavodoxin, flavodoxin reductase, titanium citrate, reduced nicotinamide adenine dinucleotide phosphate, or any combination thereof.
- the reducing agent comprises dithionite, flavodoxin, flavodoxin reductase, and titanium citrate.
- the reducing agent comprises dithionite, flavodoxin, and flavodoxin reductase. In a still further aspect, the reducing agent comprises dithionite and flavodoxin. In a still further aspect, the reducing agent comprises reduced nicotinamide adenine dinucleotide phosphate, titanium citrate, flavodoxin reductase, and flavodoxin. In a still further aspect, the reducing agent comprises reduced nicotinamide adenine dinucleotide phosphate, titanium citrate, and flavodoxin reductase.
- the reducing agent comprises reduced nicotinamide adenine dinucleotide phosphate and titanium citrate.
- the reducing agent comprises dithionite.
- the reducing agent comprises flavodoxin.
- the reducing agent comprises flavodoxin reductase.
- the reducing agent comprises titanium citrate.
- the reducing agent comprises reduced nicotinamide adenine dinucleotide phosphate.
- PapB installs two or more thioether linkages in the peptide sequence.
- thioether linkages are installed between the first cysteine residue and the first aspartic acid residue and between the second cysteine residue and the second aspartic acid residue, yielding nested crosslinks.
- thioether linkages are installed between the first cysteine residue and the first aspartic acid residue and between the second cysteine residue and the second aspartic acid residue, yielding in-line crosslinks.
- the invention relates to methods of chemically modifying a modified PapA sequence to install a thioether linkage, the method comprising reacting the modified PapA sequence with PapB.
- the modified PapA sequence comprises minimal substrate
- the modified PapA sequence further comprises a leader sequence of LKQINVIAGVKEPIRAYG (SEQ ID NO: 2) or LKQ INVIA GVKPIRAYG (SEQ ID NO: 3).
- the leader sequence facilitates recognition of the full peptide sequence by PapB.
- the leader sequence is not required.
- the method further comprises addition of a protease.
- the modified PapA sequence comprises a protease recognition sequence.
- a protease in conjunction with a modified PapA sequence comprising a protease recognition sequence allows for cleavage of a desired product from the leader sequence.
- the protease is a TEV protease.
- TEV protease in conjunction with a modified PapA sequence comprising a TEV protease recognition sequence allows for cleavage of a desired product from the leader sequence.
- the modified PapA sequence comprises a TEV protease recognition sequence.
- the TEV protease recognition sequence is EXLYZQ (SEQ ID NO: 1), in which X is any amino acid and Z is any amino acid that contains a hydrophobic residue.
- the TEV protease recognition sequence is ENLYFQ (SEQ ID NO: 1).
- the method further comprises addition of a reducing agent.
- reducing agents include, but are not limited to, dithionite, flavodoxin, flavodoxin reductase, titanium citrate, reduced nicotinamide adenine dinucleotide phosphate, and Hantzsch esters.
- the reducing agent comprises comprises dithionite, flavodoxin, flavodoxin reductase, titanium citrate, reduced nicotinamide adenine dinucleotide phosphate, or any combination thereof.
- the reducing agent comprises dithionite, flavodoxin, flavodoxin reductase, and titanium citrate.
- the reducing agent comprises dithionite, flavodoxin, and flavodoxin reductase. In a still further aspect, the reducing agent comprises dithionite and flavodoxin. In a still further aspect, the reducing agent comprises reduced nicotinamide adenine dinucleotide phosphate, titanium citrate, flavodoxin reductase, and flavodoxin. In a still further aspect, the reducing agent comprises reduced nicotinamide adenine dinucleotide phosphate, titanium citrate, and flavodoxin reductase.
- the reducing agent comprises reduced nicotinamide adenine dinucleotide phosphate and titanium citrate.
- the reducing agent comprises dithionite.
- the reducing agent comprises flavodoxin.
- the reducing agent comprises flavodoxin reductase.
- the reducing agent comprises titanium citrate.
- the reducing agent comprises reduced nicotinamide adenine dinucleotide phosphate.
- PapB installs two or more thioether linkages in the peptide sequence.
- thioether linkages are installed between the first cysteine residue and the first aspartic acid residue and between the second cysteine residue and the second aspartic acid residue, yielding nested crosslinks.
- thioether linkages are installed between the first cysteine residue and the first aspartic acid residue and between the second cysteine residue and the second aspartic acid residue, yielding in-line crosslinks.
- the plasmid pPH151 contains the suf operon that encodes for sufABCDE proteins that aid in sulfur liberation, act as a Fe-S scaffold, and donate Fe-S clusters to apo proteins.
- the suf operon is frequently included with radical SAM enzymes as it assists in assembling iron-sulfur clusters in heterologously expressed proteins.
- the transformation mixture was suspended in SOC recovery media and shaken at 200 rpm for 1 h at 37 °C.
- the mixture was plated on agar Lennox broth (LB) plates containing 34 ⁇ g/mL chloramphenicol and 34 ⁇ g/mL kanamycin and placed in an oven set to 37 °C for 16 h.
- LB gar Lennox broth
- An overnight culture (0.15 L) of LB containing 34 ⁇ g/mL chloramphenicol and 34 ⁇ g/mL kanamycin was inoculated with a single colony from the plate. Twelve aliquots (12 mL each) of overnight culture were used to inoculate twelve 2.8 L Fembach flasks containing 1 L each of LB supplemented with 34 ⁇ g/mL chloramphenicol and 34 ⁇ g/mL kanamycin. The cultures were grown at 37 °C and 180 rpm to an OD600nm of -0.35, at which point 0.1 mM iron(III) chloride (0.1 mM) and L-cysteine hydrochloride monohydrate (0.1 mM) were added.
- PapB was purified inside of a Coy Laboratories anaerobic chamber maintained with a 98% N 2 /2% H 2 atmosphere.
- Cell paste (15 g) was resuspended in a metal beaker with 0.1 L of 0.05 M KPi (pH 7.4) buffer containing 0.5 M KC1, 0.05 M imidazole, 20% glycerol (v/v) 0.1 mg/mL lysozyme, 10 ⁇ g/mL DNAse and 2 cOmplete EDTA-free Protease Inhibitor Cocktail tablets (Fisher Scientific NC0939481).
- the suspension was stirred for 30 min on ice after which the cells were lysed with a Branson digital sonifier operated at 50% amplitude for a total of 17 min (25 s on/35 s off) while stirring on ice.
- the resulting liquid was centrifuged at 18,442 x g for 45 min at 4 °C.
- Three 5 mL HisTrap HP columns (GE healthcare) charged with nickel sulfate were serially connected and equilibrated with loading buffer containing 0.05 M KPi (pH 7.4), 0.5 M KC1, 20% glycerol (v/v) and 0.05 M imidazole.
- the clarified lysate was loaded onto the columns at 3 mL/min.
- the pooled fractions were further purified by an amylose resin (NEB E8022S) column equilibrated in loading buffer containing 0.05 M KPi (pH 7.4), 0.5 M KC1, and 0.05 M imidazole.
- the column was washed with 0.15 L of loading buffer and eluted with loading buffer containing 10 mM maltose.
- the resulting dark brown fractions were pooled and solid dithiothreitol (DTT) powder was added to combined fractions to a final concentration of 2 mM.
- An aliquot (1 mL) of 90 ⁇ M TEV protease was added, and the mixture was stirred for 14 h at room temperature.
- the cleaved MBP was removed from PapB through three serially connected 5 mL HisTraps equilibrated in loading buffer. The flowthrough from this column contained cleaved PapB.
- the resulting PapB protein was desalted into buffer containing 0.05 M PIPES ⁇ NaOH (pH 7.4), 0.3 M NaCl, 2 mM DTT and 20% glycerol (v/v).
- the concentration of PapB was determined by the Bradford method using bovine serum albumin (BSA) as a standard.
- PapB was reconstituted by mixing 12 molar equivalents of 0.1 M FeCh hexahydrate and NaiS nonahydrate as follows.
- the reconstituted PapB was further purified by a Cytiva XK26 (1000mm) S- 300 column equilibrated with buffer containing 0.05 M PIPES’NaOH (pH 7.4), 0.3 M KC1, 2 mM DTT and 10% glycerol (v/v).
- the protein was eluted isocratically at 2.7 mL/min, and fractions containing PapB were identified by dark brown color and visual inspection of a Coomassie-stained SDS-PAGE gel. The pooled fractions were concentrated to ⁇ 0.5 mL. Aliquots were flash-frozen in liquid N2 and stored at -80 °C.
- PapB was quantified by Bradford assay with BSA as a standard. A typical yield from the purification outlined above is 16.5 mg pure protein for 15 g wet cell paste.
- the correction factor for the Bradford assays was determined by direct amino acid analysis on three independent preparations of the protein. Amino acid analysis was carried out by the Molecular Structure Facility at the University of California-Davis as follows. A 0.1 mL aliquot of concentrated PapB was desalted into solution containing 10 mM NaOH using an Illustra NICK column (GE Healthcare). The protein samples were hydrolyzed in a solution containing 6 M HC1 and 1% phenol at 110 °C in a vacuum and resuspended in a norleucine solution as an internal standard.
- PapB samples were analyzed by Hitachi 8800 amino acid analyzer that was calibrated with amino acid standards for protein hydrolysate on the Na-based Hitachi 8800 (Sigma, A-9906). These standards were verified by the National Institute of Standards and Technology (NIST) standard reference material 2389a.
- PapB samples were sent through a Concise ion-exchange column (AminoSep Beckman Style Na+, part # AAA-99-6312) with a secondary ninhydrin reaction for detection using Pickering Na buffers.
- the correction factor for the Bradford assays was determined to be 0.60 based on results from the three independent purifications and this factor was used in all subsequent protein concentration determinations to correct the values obtained from the less cumbersome Bradford determinations
- the iron content of reconstituted PapB was determined through inductively coupled plasma-mass spectrometer (ICP-MS) on the same three separate enzyme preparations. This was done at the Center for Water, Ecosystems and climate Science in the Department of Geology and Geophysics at the University of Utah as follows. PapB preparations were diluted to a concentration of 2-5 ⁇ M with 10% trace metal grade nitric acid before submission. The iron concentration was performed with a triple quadrupole inductively coupled plasma-mass spectrometer (ICP-MS, Agilent 8900, Santa Clara, CA). A 10 nm In/mL was added as an internal standard.
- ICP-MS inductively coupled plasma-mass spectrometer
- An external calibration curve was prepared from 1000 mg/L single elemental standard (Inorganic Ventures, Christiansburg, VA). Fe Concentrations in six calibration solutions were 0, 8.3, 20.7, 66.2, 165.5 and 331.1 ng Fe/mL; all solutions contained 10 ng In/mL.
- the blanks, calibration solutions and diluted samples were run by ICP-MS using a double-pass quartz spray chamber, PTFE nebulizer and dual- syringe introduction system (Teledyne, AVX72000), platinum cones, and sapphire injector in a quartz platinum-shielded torch. In and Fe were detected at masses of 115 and 56, with a flow of 8 mL He/min in a collision cell.
- SGI 200008 pRARE chemically competent cells were transformed with pNB512.
- the transformation was suspended in SOC recovery media and shaken at 200 rpm for 45 min at 37 °C.
- the mixture was plated on agar Lennox broth (LB) plates containing 34 ug/mL chloramphenicol and 100 mg/mL ampicillin and placed in an oven set to 37 °C for 16 h.
- An overnight culture (0.15 L) of LB containing 34 mg/mL chloramphenicol and 34 mg/mL ampicillin was inoculated with a single colony from the plate.
- Cell paste (15 g) was resuspended in a metal beaker with 0.1 L of 0.05 M KPi (pH 7.4) buffer containing 0.5M KC1, 0.05 M imidazole, 100 mg/mL lysozyme, 10 mg/mL PMSF, and 20% (v/v) glycerol. The suspension was stirred for 2 h at 4 °C. The cells were lysed with a Branson digital sonifier operated at 50% amplitude for a total of 15 min (10 s on/20 s off) while stirring on ice. The resulting liquid was centrifuged at 18,442 xg for 50 min at 4 °C.
- PapA peptides were synthesized on either a PS3 peptide synthesizer (Protein Technologies Inc.) or a Prelude peptide synthesizer (Protein Technologies Inc.). Compared to the previously reported msPapA peptide (Van der Donk, W. A.; Bindman, N. A. Nat. Prod.: Discourse, Delivery, and Design, John Wiley & Sons: Oxford, 2014; pp 197-218), the N- terminal methionine was removed in all peptide syntheses. The syntheses used standard Fmoc procedures from the manufacturer and were carried out on a 0.025 mmol scale. All natural Fmoc-amino acids were purchased from Protein Technologies Inc.
- N-Alpha-Fmoc-S- trityl-D-cysteine and Fmoc-D-aspartic acid a-tert-butyl ester were purchased from Chem Impex (04314).
- 150 mg of 2-chlorotrityl chloride resin 100-200 mesh (ChemPep) was loaded with 9.3 mg of Fmoc-Ala-OH ( ⁇ 0.2 mmol/g resin). The resin was washed three times with 5 mL DMF and three times with 5 mL dichloromethane (DCM).
- the 9.3 mg of Fmoc-Ala-OH was dissolved in 1 mL of 1:1 dichloromethane (DCM):N,N- dimethylformaide (DMF) with 0.15 mmol diisopropylethylamine (DIPEA). This solution was added to the resin and gently shaken for 1 h. The Fmoc-Ala/DIPEA solution was then removed, and the resin was washed three times with 5 mL of DCM. The uncapped sites on the resin were capped by washing the resin with 20 mL of 17:2:1 DCM:methanol:DIPEA. The resin was then washed three times with 5 mL of DCM and three times with 5 mL of
- the peptides were deprotected and cleaved from the resin by adding 5 mL of cleavage solution (87.5% (v/v) TFA, 5% (v/v) thioanisole, 3% (v/v) ethane dithiol, 2.5% (v/v) triisopropylsilane, and 2% (v/v) anisole) followed by stirring for 2 h at room temperature.
- the cleavage reaction was filtered into 30 mL of ice-cold diethyl ether to precipitate the peptide.
- the solution was poured over a Buchner funnel filter and vacuumed to collect the peptide precipitate.
- the peptide dried on the vacuum for 15 min before being washed with 80 mL of ice-cold diethyl ether. After drying for an additional hour, the peptide was resuspended in 20 mL of water and sonicated for 15 min to aid in the dissolution of the peptide. The solution was then flash frozen in liquid nitrogen and lyophilized.
- the peptides were purified using high-performance liquid chromatography (HPLC) with a Phenomenex Jupiter C18 preparative column (21.2 mm x 250 mm, 5 ⁇ m particle size, 300 ⁇ pore size) with buffer A as 0.1% trifluoroacetic acid (TFA, HPLC grade) in nanopure water and buffer B as 0.1% TFA (HPLC Grade) in acetonitrile (ACN, HPLC grade).
- the separation was carried out at a flow rate of 5mL/min with a linear gradient of buffer A from 88 to 60% over 65 min.
- the LC-MS program for peptide fraction identification was set up as follows: buffer A was LC-MS Optima water (Fisher)/0.1% (v/v) LC-MS Optima TFA (Fisher) and buffer B was LC-MS Optima acetonitrile (Fisher)/0.1% (v/v) LC-MS Optima TFA (Fisher).
- the 12 min separation consisted of washing the column with 100% A for 3 min, followed by a linear gradient to 100% B from 3 to 6 min, followed by washing the column with 100% B from 6 to 9 min, and finally reequilibration in 100% A from 9 to 12 min.
- MS detectors operated in positive ion mode and the FT analyzer settings are as follows: 70,000 resolution for the Q-Exactive and 100,000 resolution for the LTQ OrbiTrap, 1 microscan, and 200 ms maximum injection time. MS data analysis used Xcalibur software (Thermo Fisher). f. ENZYMATIC REACTIONS OF MSPAPA PEPTIDES WITH PAPB
- Assays were conducted in a Coy Laboratories anaerobic chamber with 98% N 2 /2% H 2 atmosphere at room temperature. All reactions contained 0.05 M PIPES ⁇ NaOH (pH 7.4), 2mM DTT, 2.4 mM SAM (enzymatically synthesized and purified as previously described (deGruyter, J. N., et al. Biochem. 2017, 56 (30), 3863-3873)), ⁇ 100-400 ⁇ M msPapA variants (concentration determined by peptide dry weight or by spectroscopic analysis in the case of Y19W), and 430nm-10 ⁇ M PapB.
- Either dithionite (dT) or flavodoxin (FldA), flavodoxin reductase (FPR) and NADPH were used to reduce PapB.
- the total concentration was 2mM dT.
- the mixtures contained 25 ⁇ M FldA, 2 ⁇ M FPR and 2 mM NADPH.
- the total volume of the reactions ranged from 0.1 mL for initial screenings to 0.5 mL for MS/MS Collision Induced Dissociation (CID) fragmentation experiments described below. Control reactions in the absence of dT, SAM and PapB were also conducted.
- TEV protease was added directly to the full PapB assay after initial incubation in a 1:1 volume ratio.
- the TEV-assay combination incubated for 4 h before quenching by the addition of 10% of the reaction volume of 30% (w/v) TCA.
- the samples were then centrifuged at 16,000 xg for 10 min in a microcentrifuge to pellet the precipitated PapB and TEV protease.
- the assays were analyzed using either a Vanquish UHPLC with a diode-array detector connected to a Q-Exactive mass spectrometer or an Ultimate 3000 HPLC with a diode-array detector connected to a LTQ OrbiTrap XL mass spectrometer. Each was operated in positive ion mode, the FT analyzer was set to 100,000 resolution, 1 microscan, and 200 ms maximum injection time. Xcalibur software was used to analyze data.
- Enzymatic reactions were conducted on a 0.5 mL scale as described above to obtain sufficient material. After quenching the reaction with TCA and centrifugation to remove precipitated protein, the reaction mixtures were desalted using C18 ZipTips (Millipore) following the manufacturer’s protocols. The analyzer was first tuned to the mass of each msPapA peptide. The 3 + charge state corresponding to each msPapA peptide was isolated in the CID cell using an isolation width of 1.7-2.4 m/z (depending on complete or incomplete peptide turnover), 0.1 ms activation time, a resolution of 70,000, and fragmented using a Normalized Collision Energy (NCE) of 25. The fragmentation analysis used mMass software.
- NCE Normalized Collision Energy
- PapB was obtained to homogeneity using His 6 affinity chromatography for the initial separation, followed by TEV cleavage and amylose chromatography to remove the MBP, reconstitution with Fe/S. Gel filtration was used to remove higher molecular weight complexes (FIG. 3). Since previous sequence analysis and ferrozine assays indicate that PapB likely has three [4Fe-4S] clusters — a 12-fold molar excess of iron and sulfide were added to the maturase for reconstitution. Amino acid and ICP-MS analysis of protein from multiple independent purifications show that the purified protein obtained by this procedure contains 13.5 ⁇ 0.3 mol of iron per mol of PapB.
- FIG. 14 This shows that PapB has introduced a thioether crosslink in each peptide.
- MS/MS data is consistent with the formation of thioether crosslinks in non-a positions.
- sactipeptide sulfur-to-a carbon thioether crosslinked peptides
- MS/MS spectra generally produce fragments at each residue location but contain a 2 Da loss at the acceptor (non-Cys) residue (Rea, M. C., et al. Proc. Natl. Acad.
- FIG. 15B shows, all the peptides that were examined were efficiently crosslinked by the enzyme.
- FIG. 15C demonstrates that the crosslink occurs within the CX 3 D sequence, even if an alternate D residue is available downstream.
- the naturally occurring PapA peptide is processed by PapB to introduce six ranthionine linkages, which are either in line with the Cys and Asp residues within a CX 3 D motif being crosslinked, or nested with the C residue occurring within one CX 3 D motif crosslinking with an Asp residue located C-terminal to it (Precord, T. W., et al. ACS Chem. Biol. 2019, 14 (9), 1981-89). As FIG. 15 shows, both nested and in-line variants of the peptide were able to be crosslinked by simply repositioning the CX 3 D element within the peptide.
- Tandem mass spectrometry reveals a similar pattern in the b and y fragments; a mass loss of 2 Da is seen in each b fragment after D and in each y fragment after C.
- the fragmentation data for all identifiable peaks are shown in Tables 10-14, a stable macrocycle is seen in each peptide. In all these cases, treatment with IAA resulted in no carboxymethylation of the modified peptide (FIG. 16-20).
- PapB has been used recently to prepare peptide products that are capable of binding single protein targets, such as the SARS-CoV-2 spike receptor binding domain (King, A.M., et al. Nat. Commun. 2021, 12, 6343).
- the peptide in that design contained a leader sequence, which through a TEV protease recognition sequence is connected to a minimal substrate containing two CX 3 E motifs.
- the initial report on PapB had demonstrated that both Asp and Glu are crosslinked by the enzyme (Precord, T. W., et al. ACS Chem. Biol. 2019, 14 (9), 1981-89).
- Tandem mass spectrometry shows a fragmentation pattern that is indicative of two thioether events occurring; one between Cys3 and Glu7, and the other between Cys9 and
- PapB can process msPapA when Cys and Asp are replaced with their dextrorotatory enantiomers was explored (FIG. 22A).
- leader- D CSANDA peptide With the leader- D CSANDA peptide, full conversion to the crosslinked peptide is seen, as evidenced by the loss of 2 Da (FIG. 22B).
- leader-CSAN D DA peptide With the leader-CSAN D DA peptide, significant substrate turnover is observed as well, but the conversion is not complete (FIG. 22B).
- the leader- D CSAN D DA is processed inefficiently under these conditions, though some product is clearly observed in the MS.
- the carboxymethylated species does not show any evidence of a 2 Da loss, which gives evidence that the Cys thiol is participating in the newly installed bond in these unnatural peptides.
- Octreotide is an 8-mer peptide with the sequence D FCF D WKTCT, with D- amino acids at the first and fourth positions.
- the two C residues form a disulfide-linked macrocycle.
- WT-PapA contains positively charged, nonpolar, polar uncharged, and bulky side-chain residues between the six donor and acceptor residue motifs (Precord, T. W., et al. ACS Chem. Biol. 2019, 14 (9), 1981-89).
- the second design contained only the C7E replacement, but to facilitate removal of the leader peptide, an ENLYFQ sequence was incorporated between the leader and the peptide to provide a convenient site for TEV cleavage.
- the incubation of either of the designed octreotide analogs with PapB leads to formation of a new product. In each case, the product is 2 Da lighter than the starting material, consistent with the formation of a crosslink (FIG. 27B).
- An intrapeptide disulfide can be eliminated as the source of this loss because the peptide only contains one Cys residue. It is noted that the reaction is ⁇ 75% complete with this analog, as assessed from the isotopic envelope.
- TEV protease to release the modified peptide to show the feasibility of the use of this method to generate a novel octreotide analog.
- the TEV protease can accommodate other amino acids at the P1 ’ position (Kapust,
- FIG. 29 provides a brief summary of successful PapB-mediated thioether crosslinks in tested peptide sequences.
- leader peptide sequence is not required for modification via PapB.
- PapB “leaderless” sequences that contain non-proteinogenic amino acids still demonstrated thioether linkages via mass spectrometry.
- FIG. 31 -FIG. 35 mass spectrometry results reveal evidence that interpeptide crosslinking can also be achieved with PapB.
- FIG. 31 shows mass spectrometry data for a one-to-one interpeptide crosslink as well as polymerization-like addition of X-mer peptide subunits.
- FIG. 32 shows results for a general assay peptide before and after PapB, demonstrating the presence of interpeptide products.
- FIG. 33 shows mass spectrometry results showing evidence of simple and complex mass envelopes.
- thioether insulin analogs were synthesized. The results showing the crosslinked products can be found in FIG. 34 and FIG. 35.
- FIG. 47 A concept schematic for a bioreactor setup for peptide modification via PapB is shown in FIG. 47.
- leader-CX 3 CH 2 )C(O)NH 2 did not demonstrate thioether crosslinking (FIG. 50A-B).
- the cleavage reaction was filtered into 30 mL of ice-cold diethyl ether to precipitate the peptide.
- the solution was poured over a Buchner funnel filter and vacuumed to collect the peptide precipitate.
- the peptide dried on the vacuum for 15 min before being washed with 80 mL of ice-cold diethyl ether. After drying for an additional hour, the peptide was resuspended in 20 mL of water and sonified for 15 min to aid in the dissolution of the peptide.
- the solution was then flash frozen in liquid nitrogen and lyophilized.
- the peptide was resuspended in 0.05 M PIPES’NaOH (pH 7.4), 2 mM DTT, 300 mM KC1, and 15% glycerol buffer solution.
- the peptide was then assayed with PapB using the following parameters: 6.1 uM PapB, 100 uM msPapA C- terminal Gly carboxamide, 2 mM DTT, 2.1 mM SAM, and 15% glycerol in 100 uL total volume.
- the negative control (no PapB) and overnight complete assay (+ PapB) were quenched by adding 11 uL of 30% (w/v) TCA to the mixture.
- the quenched assays were spun at 16,000 xg for 10 m to pellet any precipitated enzyme or PIPES.
- the assays were analyzed using a Vanquish UHPLC with a diode-array detector connected to a Q-Exactive mass spectrometer operated in positive ion mode, the FT analyzer was set to 70,000 resolution, 1 microscan, and 200 ms maximum injection time. Xcalibur software was used to analyze data.
- leader-CX 3 ( ⁇ -amino acids) for recognition and crosslinking was explored (FIG. 51A).
- the sequence of leader-CX 3 (L-3 -aminobutyric acid) was probed and treatment with PapB resulted in loss of 2Da indicating the thioether crosslinked product (FIG. 51B-C)
- leader-CX 3 (3-amino-2,2- dimethylbutanoicacid)
- leader-CX 3 ((R)-3-amino-2-methylpropanoic acid)
- FIG. 52B The sequence of leader-CX 3 ((S)-3-amino-2-methylpropanoic acid)
- leader-CX 3 ( ⁇ -amino acids) was further explored utilizing analogs representative of natural amino acids (FIG. 53).
- a representative analog with B- tryptophan (FIG. 54A) did show evidence of a 2Da loss (FIG. 54B), which gives indication that the thioether crosslinked product was formed.
- leader-CX 3 ( ⁇ -amino acids) was further explored by probing the reaction with N-methylated ⁇ -amino acids (FIG. 55A-B).
- the reaction showed evidence of a 2Da loss (FIG. 55D), which gives indication that the thioether crosslinked product was formed.
- Phenyl alanine d8 was incorporated into leader-CSAD D F sequence (FIG. 62A) and incubated with PapB. The product showed evidence of a 3 Da loss (FIG. 62B). This is indicative of successful thioether crosslinked product to the methylene of the phenylalanine side chain.
- FIG. 68A Leader-hCys- ⁇ Ala
- FIG. 68B For the representative mass spectra illustrating a 2 Da loss
- FIG. 69A Leader- Cys-gamma amino butyric acid
- FIG. 69B for the representative mass spectra illustrating a 2 Da loss
- FIG. 70A Leader-hCys-gamma amino butyric acid
- FIG. 73A demonstrates the structures of a peptide chain containing a substituted aniline in the peptide backbone and the reacted thioether macrocycle.
- FIG. 73B shows the mass spectra of the unreacted (top, without PapB) and reacted (bottom, with PapB) aniline-containing peptide chain.
- FIG. 74A demonstrates the structures of a peptide chain containing a substituted benzylamine in the peptide and the reacted thioether macrocycle.
- FIG. 73A demonstrates the structures of a peptide chain containing a substituted benzylamine in the peptide and the reacted thioether macrocycle.
- FIG. 75A show the structures of a modified courmarin-containing peptide.
- the coumarin-like moiety is the most C- terminal aspect of the peptide.
- FIG. 75B demonstrates the mass spectra of the coumarin- containing peptide both unreacted (top, without PapB) and reacted (bottom, with PapB).
- Setmalanotide an MC4R agonist
- FIG. 76A A similar peptidomimetic analog based on the PapB reaction to form thioether crosslinked products was envisioned (FIG. 76B).
- the reaction showed evidence of a 2Da loss (FIG. 76D), which gives indication that the thioether crosslinked product was formed.
- the envisioned thioether crosslinked analog is shown in FIG. 78B.
- the PapB transformation is shown in FIG. 78C.
- the mass spectra showing the reaction both in the absence (top) and presence (bottom) of PapB is shown in FIG. 78D.
- the proposed thioether analog is shown in FIG. 78B.
- PapB can tolerate extended sidechains of the acidic residue, as both CX 3 D and CX 3 E sequences are crosslinked.
- PapB forms crosslinks in msPapA with a homocysteine (hCys) substitution at position 19 and an Asp in position 23, C19hCys and D23E, and C19hCys and homoglutamate (hGlu) at position 23 (FIG. 79A).
- MS analysis confirmed that PapB catalyzes formation of a crosslink in peptides containing hCys at position 19 and Asp (FIG. 79B), hCys at position 19 and Glu (FIG.
- the ID NMR spectrum of the peptide prior to the treatment with PapB reveals a resonance at 2.41 ppm, which is composed of a doublet of triplets integrating to two protons (FIG. 80). This feature can reasonably be assigned to H ⁇ of hGlu.
- this resonance is absent and a new triplet at 3.34 ppm integrating to a single hydrogen is observed. This new resonance is consistent with thioether installation at the position alpha to the carboxylate in the hGlu sidechain.
- the modified and unmodified peptides were subjected to ROESY analysis to establish through-space correlations of protons.
- the resonance at 2.41 ppm is coupled to resonances at 1.66, 1.75, and 1.88 ppm, corresponding to through-space coupling of H ⁇ in the hGlu sidechain to Hy and H ⁇ of hGlu, respectively (see FIG. 82).
- the new resonance at 3.25 ppm is coupled to resonances at 1.8- 1.9 ppm (see FIG. 83).
- Tetrazoles improve the bioavailability of drugs, increase their lipophilicity, and reduce side-effects when compared to carboxylate-containing compounds. This is due to the metabolic stability of tetrazole moieties — metabolic transformations of carboxylic acids are driven in part by microsomes of the liver, many of which are evaded by using a tetrazole isostere.
- the tetrazole pharmacophore has been used in a variety of drug classes, including nonsteroidal anti-inflammatory drugs, angiotensin receptor blockers, and proton pump inhibitors.
- Radical SAM a novel protein superfamily linking unresolved steps in familiar biosynthetic pathways with radical mechanisms: functional characterization using new analysis and information visualization methods. Nucleic Acids Res. 2001, 29 (5), 1097-1106.
- SufBCD complex is a new type of Fe-S scaffold with a flavin redox cofactor. J. Biol. Chem. 2010, 285 (30), 23331-23341.
- SAM S-Adenosylmethionine
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Health & Medical Sciences (AREA)
- General Chemical & Material Sciences (AREA)
- Microbiology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Peptides Or Proteins (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
The present disclosure is concerned with methods of chemically modifying a peptide sequence to install a thioether linkage, the method comprising reacting the peptide sequence with PapB. Also disclosed are compounds produced by such methods that may be useful in, for example, peptide therapeutic uses. This abstract is intended as a scanning tool for purposes of searching in the particular art and is not intended to be limiting of the present invention.
Description
PAPB AS A BIMOIETY-DEPENDENT THIOETHER INSTALLATION TOOL
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This Application claims the benefit of U.S. Provisional Application No. 63/446,589, filed on February 17, 2023, U.S. Provisional Application No. 63/393,174, filed on July 28, 2022, U.S. Provisional Application No. 63/337,029, filed on April 29, 2022, and U.S.
Provisional Application No. 63/331 ,393, filed on April 15, 2022, the contents of which are incorporated herein by reference in their entireties.
STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH
[0002] This invention was made with government support under Grant No. GM126956 awarded by the National Institutes of Health. The government has certain rights in the invention.
REFERENCE TO SEQUENCE LISTING
[0003] The Sequence Listing submitted April 14, 2023 as a xml file named “21101.0436Pl.xml,” created on April 14, 2023, and having a size of 16,384 bytes is hereby incorporated by reference pursuant to 37 C.F.R. § 1.52(e)(5).
BACKGROUND
[0004] Peptide-based therapeutics are growing due to their unique structure and ability to be produced via solid phase peptide synthesis (SPPS) or by recombinant DNA. Many peptide therapeutics contain a disulfide bond in their active form. Disulfide bonds are susceptible to breakage via biological reductants such as glutathione. Additionally, many peptide therapeutics contain bulky or basic amino acid side chains which render them vulnerable to degradation by proteases. These factors contribute to their short serum half-lives. Strategies such as L-to-D amino acid swaps, derivatization of the N- and C-termini, N-to-C-terminal cyclization, the introduction of non-proteinogenic amino acids, and metal chelation have both increased peptide half-lives and diversified therapeutic targets. The extent of these modifications is limited to the chemical space afforded by organic synthesis and SPPS.
[0005] Nature can access vast chemical space through enzymatic reactions. Natural products are incredibly diverse in their structures which allow for their wide range of biological and
chemical activities. Recent advances in bioinformatic filtering algorithms have uncovered previously unannotated small open reading frames (sORFs). sORFs often colocalize with maturases which further process the peptide after translation. These ribosomally synthesized and post-translationally modified peptides (RiPPs) vary significantly in peptide length, structure, and biological function. RiPP maturases include members of the radical S- adenosylmethionine (rSAM) superfamily. This superfamily has been implicated in a variety of RiPP modifications, including C-C, C-N, C-O and C-S bond formation at unactivated carbons via radical mechanisms. These molecular mechanisms are of substantial interest because they afford access to unique semi-synthetic chemical spaces for production of bioinspired peptide therapeutics. RiPP maturases have potential to offer biotechnological applications in peptide alterations such as thioether installation or peptide stapling. rSAM enzymes use a radical intermediate to complete chemical transformations involved in natural product biosynthesis as well as primary metabolism. These enzymes contain one or more iron-sulfur [Fe-S] clusters that are essential for function. The [4Fe-4S] rSAM (RS) cluster is coordinated by a canonical CxxxCxxC motif in the enzyme. In the [4Fe-4S] RS cluster, one iron coordinates the a-amino and a-carboxylate moieties of SAM. When the RS cluster is catalytically active, it transfers an electron to bound SAM. Either chemical or biological reducing systems are useful for product turnover because the RS cluster is catalytically inactive in the +2 state. Homolytic cleavage of SAM forms the reactive 5’-deoxyadenosyl radical (5'- dAdo, FIG. 1). 5'-dAdo' acts as a radical initiator by abstracting a hydrogen atom from a specific site on the substrate, thereby forming 5 ’-deoxyadenosine (5’-dAdoH, FIG. 1) and a theoretical RiPP radical intermediate. The formed substrate radical is useful for substrate maturation. While only one [4Fe-4S] cluster is needed for reductive SAM cleavage, many rSAM enzymes also employ one or more auxiliary iron-sulf ur clusters (ACs) for substrate turnover (Fig. 4c). These ACs are coordinated to the enzyme by cysteine-rich C- terminal extensions from the RS canonical motif (FIG. 2). Recent studies have characterized rSAM maturases with multiple [Fe-S] clusters that form intrapeptide bonds between Ca, CB, or Cy on a specific residue and a cysteine thiol in the peptide substrate. Many of these thioether assembling maturases only form a single thioether in the mature peptide and are relatively slow in substrate turnover. The RS cluster in addition to at least one AC cluster is necessary for thioether formation. rSAM RiPP maturases also use a critical RiPP Recognition Element (RRE), that is responsible for binding to the leader sequence of the immature peptide (FIG. 2, left).
[0006] PapB is a RiPP maturase that catalyzes the insertion of six thioether crosslinks in the PapA polypeptide. PapB catalyzes the insertion of links between the Cys thiol and the b- carbon of the Asp, where the residues being linked are in a CX3D motif. Prior studies have shown that the enzyme can also accept Glu at the modification site, and that PapB introduces the crosslink to the chemically analogous γ-carbon. In addition, PapB has also been shown to accept a shorter minimal substrate (msPapA), which only has a single pair of crosslinking amino acids in the CX3D motif. PapB can catalyze both C13 and Cy thioether linkages, and forms six thioether linkages in the wild type PapA. PapB contains a RS cluster and two ACs (FIG. 2). Replacing Asp residue(s) to Glu residue(s) in WT-PapA still results in successful crosslinking. Both CB and Cy thioether linkages were confirmed by 2D NMR.
[0007] Despite the emergence of various techniques in peptide-based therapeutics, there remains a need in the art for enzymatic systems for rapid and highly specific modification of a broad range of peptide substances to obtain natural products that are unattainable by traditional synthetic chemistry methods. These needs and others are addressed herein.
SUMMARY
[0008] In accordance with the purpose(s) of the invention, as embodied and broadly described herein, the invention, in one aspect, relates to methods of chemically modifying a peptide sequence to install one or more thioether linkages. Additionally disclosed are compounds formed using methods of chemically modifying a peptide sequence. Also disclosed are methods of chemically modifying a modified PapA sequence, and compounds formed using methods of chemically modifying a modified PapA sequence.
[0009] Disclosed are methods of chemically modifying a compound to install a thioether linkage, the method comprising reacting the compound with PapB, wherein the compound has a structure represented by a formula:
wherein o is 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein p is 1 or 2; wherein t is an integer from 0 to 500; wherein v is 1, 2, 3, 4, or 5; wherein A is S or Se; wherein R1 is selected from -CO2H, -
C(O)NHOH, -SO2NH2, -SO2NHC(O)CH3, -SO3H, -NHC(O)NHSO2CH3, -P(O)(OH)2, and a structure selected from:
wherein R4 is selected from hydrogen and methyl; wherein each occurrence of R5 and R5 , when present, is independently a residue of a side chain of amino acid; wherein each occurrence of R6 and R6’, when present, is independently selected from hydrogen and methyl, or wherein R6 or R6’ is covalently bonded to R5 or R5’, respectively, and, together with the intermediate atoms, comprise an unsubstituted 5-membered heterocycle; wherein each of R7a and R7b, when present, is independently selected from hydrogen and C1-C4 alkyl; and wherein R8 is selected from hydrogen and methyl, provided that the compound is not PapA. [0010] Also disclosed are methods of chemically modifying a compound to install a thioether linkage, the method comprising reacting the compound with PapB, wherein the compound has a structure represented by a formula:
wherein o is 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein p is 1 or 2; wherein t is an integer from 0 to 500; wherein v is 1, 2, 3, 4, or 5; wherein A is S or Se; wherein Q1 is a leader sequence; wherein Q2 is a cleavable moiety; wherein R1 is selected from -CO2H, -C(O)NHOH, -
SO2NH2, -SO2NHC(O)CH3, -SO3H, -NHC(O)NHSO2CH3, -P(O)(OH)2, and a structure selected from:
wherein R4 is selected from hydrogen and methyl; wherein each occurrence of R5 and R5 , when present, is independently a residue of a side chain of amino acid; wherein each occurrence of R6 and R6’, when present, is independently selected from hydrogen and methyl, or wherein R6 or R6’ is covalently bonded to R5 or R5’, respectively, and, together with the intermediate atoms, comprise an unsubstituted 5-membered heterocycle; wherein each of R7a and R7b, when present, is independently selected from hydrogen and C1-C4 alkyl; and wherein R8 is selected from hydrogen and methyl, provided that the compound is not PapA. [0011] Also disclosed are methods of chemically modifying a compound to install a thioether linkage, the method comprising reacting the compound with PapB, wherein the compound has a structure represented by a formula:
wherein m is 0, 1, 2, 3, or 4; wherein n is 0 or 1 ; wherein each of o and o’ is independently 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein p is 1 or 2; wherein A is S or Se; wherein L, when present, is selected from C2-C4 alkyl, -(C1-C4 alkyl)(OCH2CH2)q, and a structure selected from:
wherein q is 1, 2, 3, or 4; wherein R1 is selected from -CO2H, -C(O)NHOH, -SO2NH2, - SO2NHC(O)CH3, -SO3H, -NHC(O)NHSO2CH3, -P(O)(OH)2, and a structure selected from:
wherein R2 is a residue of a side chain of amino acid, provided that the amino acid is not isoleucine or threonine; wherein each of R3aand R3b, when present, is independently selected from C2-C5 alkynyl, C1-C5 azido, and a residue of a side chain of an amino acid; wherein R4 is selected from hydrogen and methyl; wherein each occurrence of R5 and R5 , when present, is independently a residue of a side chain of amino acid; wherein each occurrence of R6 and R6 , when present, is independently selected from hydrogen and methyl, or wherein R6 or R6 is covalently bonded to R5 or R5’, respectively, and, together with the intermediate atoms, comprise an unsubstituted 5-membered heterocycle; wherein each of R7a and R7b, when present, is independently selected from hydrogen and C1-C4 alkyl, provided that the compound is not PapA.
[0012] Also disclosed are methods of chemically modifying a compound to install a thioether linkage, the method comprising reacting the compound with PapB, wherein the compound has a structure represented by a formula:
wherein m is 0, 1, 2, 3, or 4; wherein n is 0 or 1 ; wherein each of o and o’ is independently 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein p is 1 or 2; wherein A is S or Se; wherein L, when present, is selected from C2-C4 alkyl, -(C1-C4 alkyl)(OCH2CH2)q, and a structure selected from:
wherein q is 1, 2, 3, or 4; wherein Q1 is a leader sequence; wherein Q2 is a cleavable moiety; wherein R1 is selected from -CO2H, -C(O)NHOH, -SO2NH2, -SO2NHC(O)CH3, -SO3H, - NHC(O)NHSO2CH3, -P(O)(OH)2, and a structure selected from:
wherein R2 is a residue of a side chain of amino acid, provided that the amino acid is not isoleucine or threonine; wherein each of R3aand R3b, when present, is independently selected from C2-C5 alkynyl, C1-C5 azido, and a residue of a side chain of an amino acid; wherein R4 is selected from hydrogen and methyl; wherein each occurrence of R5 and R5' , when present, is independently a residue of a side chain of amino acid; wherein each occurrence of R6 and
R6’, when present, is independently selected from hydrogen and methyl, or wherein R6 or R6 is covalently bonded to R5 or R5’, respectively, and, together with the intermediate atoms, comprise an unsubstituted 5-membered heterocycle; wherein each of R7a and R7b, when present, is independently selected from hydrogen and C1-C4 alkyl, provided that the compound is not PapA.
[0013] Also disclosed are methods of chemically modifying a peptide sequence to install a thioether linkage, the method comprising reacting the peptide sequence with PapB, wherein the peptide sequence comprises X-Yn-Z, wherein n is 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein X is an amino acid residue comprising a -SH or -SeH group; wherein each occurrence of Y, when present, is independently an amino acid residue; and wherein Z is an amino acid residue that is carboxyl-functionalized or tetrazolyl-functionalized, provided that the peptide sequence is not PapA.
[0014] Also disclosed are methods of chemically modifying a peptide sequence to install a thioether linkage, the method comprising reacting the peptide sequence with PapB, wherein the peptide sequence comprises X-Yn-Z; wherein X is a penicillamine or an amino acid residue comprising a -SH group or an amino acid residue comprising a -SeH group; wherein Y is a series of amino acid residues where n = 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein Z is an aspartic acid residue, a glutamic acid residue, a hydroxy-glutamic acid residue, 2-amino-3- (2H-tetrazol-5-yl)propanoic acid, or a carboxyl-functionalized amino acid residue; and wherein the peptide sequence is not PapA.
[0015] Also disclosed are methods of chemically modifying a modified PapA sequence to install a thioether linkage, the method comprising reacting the modified PapA sequence with PapB; wherein the modified PapA sequence comprises Cys-Yn-Asp, wherein Y is a series of amino acid residues and n=0, 1, 2, 4, 5, 6, or 7.
[0016] Also disclosed are thioether compounds produced by a disclosed method.
[0017] Also disclosed are methods of chemically modifying a modified PapA sequence to install a thioether linkage, the method comprising reacting the modified PapA sequence with PapB, wherein the modified PapA sequence comprises Cys-Yn-Asp, wherein Y is a series of amino acid residues, and wherein n is 0, 1, 2, 4, 5, 6, or 7.
[0018] Also disclosed are compounds produced by a disclosed method.
[0021] Also disclosed are pharmaceutical compositions comprising an effective amount of a disclosed compound or a pharmaceutically acceptable salt thereof and a pharmaceutically acceptable carrier.
[0022] While aspects of the present invention can be described and claimed in a particular statutory class, such as the system statutory class, this is for convenience only and one of skill in the art will understand that each aspect of the present invention can be described and claimed in any statutory class. Unless otherwise expressly stated, it is in no way intended that any method or aspect set forth herein be construed as requiring that its steps be performed in a specific order. Accordingly, where a method claim does not specifically state in the claims or descriptions that the steps are to be limited to a specific order, it is no way intended that an order be inferred, in any respect. This holds for any possible non-express basis for interpretation, including matters of logic with respect to arrangement of steps or operational flow, plain meaning derived from grammatical organization or punctuation, or the number or type of aspects described in the specification.
BRIEF DESCRIPTION OF THE DRAWINGS
[0023] The above and other objects, features and advantages of the present invention will become more apparent to those of ordinary skill in the art by describing in detail exemplary embodiments thereof with reference to the accompanying drawings.
[0024] FIG. 1 is a schematic showing the proposed mechanism for beta-thioether crosslink.
[0025] FIG. 2 is a scheme showing the predicted structure of PapB.
[0026] FIG. 3 is a representative image showing SDS-PAGE analysis of reconstituted and purified PapB on a 12% crosslinked gel.
[0027] FIG. 4A and FIG. 4B show representative crosslinking data of minimal substrate PapA (msPapA) with PapB. Specifically, FIG. 4 A shows representative TIC of msPapA chromatographed on C 18 HPLC column (top left spectra). The peptide elutes at 8.1 min. A representative mass spectrum corresponding to the peak eluting at 8.1 min is shown at the bottom. The z = 3 charge state was chosen for most peptide mass envelope comparisons. A representative mass spectrum comparison of the z=3 charge state envelopes of unreacted and reacted msPapA ± PapB is shown at the top right. FIG. 4B shows the sequence of crosslinked PapB showing all of the observed b- and y- ions from tandem mass spectrometry. [0028] FIG. 5 is a representative plot showing the comparison of activity of PapB processing Y17W msPapA with dithionite or FldA/FPR/NADPH.
[0029] FIG. 6 shows representative mass spectra demonstrating the effect of 2x and 4x enzyme concentration.
[0030] FIG. 7 shows representative mass spectra demonstrating the effect of 2x and 4x peptide concentration.
[0031] FIG. 8A-D show representative data for the Leader-C(X0-X6)D(Xm) crosslink formation. Specifically, FIG. 8A is a scheme showing the unmodified and modified peptide sequence illustrate the thioether crosslink based on the msPapA modification reported by Precord et al. FIG. 8B shows representative mass spectra for CX0D-CX2D PapB modification. FIG. 8C shows representative mass spectra for CX4D-CX6D PapB modification. FIG. 8D are schematics showing that the expected 2 Da loss is seen in each b and y fragment in the tandem mass spectrometry.
[0032] FIG. 9A-B show representative data for the iodoacetic acid treatment for CX0D. Specifically, FIG. 9 A shows representative mass spectra data for CX0D without PapB. FIG. 9B shows shows representative mass spectra data for CX0D with PapB.
[0033] FIG. 10A-B show representative data for the iodoacetic acid treatment for CX1D. Specifically, FIG. 10A shows representative mass spectra data for CX0D without PapB. FIG. 10B shows shows representative mass spectra data for CX1D with PapB.
[0034] FIG. 11A-B show representative data for the iodoacetic acid treatment for CX2D. Specifically, FIG. 11A shows representative mass spectra data for CX0D without PapB. FIG. 11B shows shows representative mass spectra data for CX2D with PapB.
[0035] FIG. 12A-B show representative data for the iodoacetic acid treatment for CX4D. Specifically, FIG. 12A shows representative mass spectra data for CX0D without PapB. FIG. 12B shows shows representative mass spectra data for CX4D with PapB.
[0036] FIG. 13A-B show representative data for the iodoacetic acid treatment for CX5D. Specifically, FIG. 13A shows representative mass spectra data for CX0D without PapB. FIG. 13B shows shows representative mass spectra data for CX5D with PapB.
[0037] FIG. 14A-B show representative data for the iodoacetic acid treatment for CX6D. Specifically, FIG. 14A shows representative mass spectra data for CX0D without PapB. FIG. 14B shows shows representative mass spectra data for CX6D with PapB.
[0038] FIG. 15A-C show representative data for leader extensions with single, nested, and in-line crosslinks. Specifically, FIG. 15A are peptide schemes showing the apparent crosslink locations that remain consistent after distancing the thioether motifs from the leader peptide. FIG. 15B are representative mass spectra showing the isotopic distributions of the peptides; a shift of 2 Da in the case of single thioether motifs or 4 Da with double thioether motifs upon addition of PapB. FIG. 13C are schematics showing a representation of the tandem mass spectrometry results.
[0039] FIG. 16A-B show representative data for the iodoacetic acid treatment for Leader- AAACSANDA. FIG. 16A shows representative mass spectra data for Leader- AAACSANDA without PapB. FIG. 16B shows shows representative mass spectra data for Leader-AAACSANDA with PapB.
[0040] FIG. 17A-B show representative data for the iodoacetic acid treatment for Leader- AAACSANDACSANDA. FIG. 17A shows representative mass spectra data for Leader- AAACSANDACSANDA without PapB. FIG. 17B shows shows representative mass spectra data for Leader-AAACSANDACSANDA with PapB.
[0041] FIG. 18A-B show representative data for the iodoacetic acid treatment for Leader- AAACSACDAADA. FIG. 18A shows representative mass spectra data for Leader- AAACSACDAADA without PapB. FIG. 18B shows shows representative mass spectra data for Leader- AAACSACDAAD A with PapB.
[0042] FIG. 19A-B show representative data for the iodoacetic acid treatment for Leader- AAAASACDAADA. FIG. 19A shows representative mass spectra data for Leader- AAAASACDAADA without PapB. FIG. 19B shows shows representative mass spectra data for Leader- AAAASACDAAD A with PapB.
[0043] FIG. 20A-B show representative data for the iodoacetic acid treatment for Leader- AAACSAADAADA. FIG. 20A shows representative mass spectra data for Leader-
AAACSAADAADA without PapB. FIG. 20B shows shows representative mass spectra data for Leader- AAACSAADAADA with PapB.
[0044] FIG. 21A-C show representative data showing that PapB produces two thioether crosslinks in the AMK-1057 precursor peptide in vitro. FIG. 21 A is a scheme showing that the AMK-1057 precursor peptide contains the leader peptide sequence, a TEV protease recognition sequence, and two CX3E motifs. FIG. 21B shows representative mass spectra demonstrating that upon reaction with PapB in an in vitro assay, two crosslinks form. Additional processing with TEV protease produces the expected dicyclized peptide. FIG. 21C is a scheme demonstrating the topology of the bonds as confirmed by tandem mass spectrometry.
[0045] FIG. 22A-C show representative data for PapB crosslinking DC and DD msPapA Peptides. FIG. 22A is a scheme showing the thioether crosslink. FIG. 22B are representative mass spectra showing formation of the thioether crosslinks. FIG. 22C is a scheme demonstrating the topology of the bonds as confirmed by mass spectrometry.
[0046] FIG. 23A-B shows representative data for the iodoacetic acid treatment for Leader- DCSANDA. FIG. 23A shows representative mass spectra data for Leader- DCSANDA without PapB. FIG. 23B shows shows representative mass spectra data for Leader- DCSANDA with PapB.
[0047] FIG. 24A-B show representative data for the iodoacetic acid treatment for Leader- CSANDDA. FIG. 24A shows representative mass spectra data for Leader-CSANDDA without PapB. FIG. 24B shows shows representative mass spectra data for Leader- CSANDDA with PapB.
[0048] FIG. 25A-B show representative data for the iodoacetic acid treatment for Leader- DCSANDDA. FIG. 25A shows representative mass spectra data for Leader- DCSANDDA without PapB. FIG. 25B shows shows representative mass spectra data for Leader- DCSANDDA with PapB.
[0049] FIG. 26A-B show representative data for msPapA “DSANCA” peptides. FIG. 26A shows representative mass spectra data for Leader-DSANCA and Leader-DDSANCA with and without PapB. FIG. 26B shows representative mass spectra data for Leader-DSANDCA and Leader-DDSANDCA with and without PapB.
[0050] FIG. 27A-E show representative data for synthesis of an octreotide analog. FIG. 27A is a structure of the FDA-approved therapeutic octreotide. FIG. 27B is a schematic description of the designed peptides and the expected sites of modification upon modification with PapB. A TEV cleavage site is included in the second peptide to allow for liberation of
the modified peptide sequence by PapB. FIG. 27C is representative mass spectra data showing the isotopic envelope of these peptides indicating that a mixed population of processed and unprocessed peptides are present after modification by PapB. FIG. 27D is representative mass spectra data showing that the TEV-cleaved peptide isotopic envelope reveals the anticipated 2 Da mass shift. FIG. 27E is a scheme showing the anticipated loss of 2 Da in each y fragment after the C and in each b fragment after the C-terminal E as confirmed by tandem mass spectrometry.
[0051] FIG. 28 is a structure of the synthesized thioether-linked octreotide analog.
[0052] FIG. 29 is a scheme providing a brief summary of successful PapB-mediated thioether crosslinks in tested peptide sequences.
[0053] FIG. 30 shows representative data demonstrating that the leader peptide sequence is not required for modification via PapB.
[0054] FIG. 31 shows representative mass spectrometry data for a one-to-one interpeptide crosslink as well as polymerization-like addition of X-mer subunits.
[0055] FIG. 32 shows representative mass spectrometry results for a general assay peptide before and after PapB, demonstrating the presence of interpeptide products.
[0056] FIG. 33 shows representative mass spectra data showing evidence of simple and complex mass envelopes.
[0057] FIG. 34 is a schematic showing the experimental approaches to creating modified insulin analogs using PapB.
[0058] FIG. 35 shows representative mass spectra data for the synthesized insulin analogs. [0059] FIG. 36 shows representative mass spectra data for crosslinking in peptides containing EneA.
[0060] FIG. 37 shows representative tandem mass spectrometry data for dAdo + D24EneA msPapA adduct.
[0061] FIG. 38 shows representative data, including mass spectrometry and EXAFS, for crosslinking in selenopeptides.
[0062] FIG. 39 shows representative tandem mass spectrometry data for C19U msPapA.
[0063] FIG. 40 shows representative mass spectrometry data demonstrating that aspartic acid may be replaced with glutamic acid, and cysteine may be replaced with homocysteine. Crosslinking is observed.
[0064] FIG. 41 shows representative mass spectrometry data demonstrating that β-amino acids may be incorporated in the peptide. Crosslinking is observed.
[0065] FIG. 42 shows representative mass spectrometry data demonstrating that no crosslinking was observed when altering the position of the C and D residues.
[0066] FIG. 43 shows representative data demonstrating the effect of components in the reduction system employed.
[0067] FIG. 44 is a schematic summarizing the findings of experiments conducted using prereduced PapB.
[0068] FIG. 45 is a scatterplot showing representative data of %product as a function of time for prereduced PapB experiments.
[0069] FIG. 46 shows representative data, including photodiode array chromatography, UV- Vis, and extracted ion chromatography, for PapB with and without reductant, as well as prereduced PapB.
[0070] FIG. 47 is a concept schematic for a bioreactor setup for peptide modification via PapB.
[0071] FIG. 48A-B show representative data for C-terminal glycine sequence. FIG. 48A is a scheme showing the thioether crosslink. FIG. 48B are representative mass spectra showing formation of the thioether crosslinks.
[0072] FIG. 49A-B show representative data for deuterium labeled C-terminal glycine analogs. FIG. 49A is a scheme showing the thioether crosslink. FIG. 49B are representative mass spectra showing formation of the thioether crosslinks.
[0073] FIG. 50A-B show representative data for C-terminal glycine carboxamide sequence. FIG. 50A is the structure of the sequence. FIG. SOB are representative mass spectra showing lack of formation of the thioether crosslinks.
[0074] FIG. 51A-C show representative data for crosslinking with C-terminal β-amino acids. FIG. 50A is a scheme showing the generic thioether crosslink reaction for C-terminal β- amino acids. FIG. SOB is a scheme showing the thioether crosslink reaction with C-terminal β-alanine. FIG. 51C is the corresponding mass spectra data showing formation of the thioether crosslink.
[0075] FIG. 52A-D show representative data for the crosslinking with various C-terminal β- amino acids. FIG. 52A is a scheme showing the absence of crosslink reaction with C- terminal 2,2-dimethyl-beta-alanine. FIG. 52B is a scheme showing the absence of crosslink reaction with C-terminal (R)-3-amino-2-methylpropanoic acid. FIG. 52C is a scheme showing the crosslink reaction with C-terminal (S)-3-amino-2-methylpropanoic acid. FIG. 52D is the corresponding mass spectra data showing formation of the thioether crosslink.
[0076] FIG. 53 shows representative data for the crosslinking with common C-terminal β- amino acids.
[0077] FIG. 54A shows a schematic thioether crosslinking with a D-tryptophan β-amino acid. FIG. 54B is the corresponding mass spectra data showing formation of the thioether crosslink
[0078] FIG. 55A-D show representative structures of thioether crosslinking of N-methyl amino acids. FIG. 55A shows unsubstituted N-methylated thioether crosslinked product. FIG. 55B shows substituted N-methylated thioether crosslinked product. FIG. 55C shows a schematic thioether crosslinking with a substituted N-methylated substrate. FIG. 55D is the corresponding mass spectra data showing formation of the thioether crosslink.
[0079] FIG. 56A-D show representative data for thioether crosslinking with C-terminal L- alanine or D-alanine. FIG. 56A shows a schematic of a C-terminal L-alanine without thioether crosslink product. FIG. 58B is the corresponding mass spectra data showing lack of formation of the thioether crosslink. FIG. 56C shows a schematic of a C-terminal D- alanine with thioether crosslink product. FIG. 56D is the corresponding mass spectra data showing formation of the thioether crosslink.
[0080] FIG. 57A-B show representative data for thioether crosslinking with deuterium labeled C-terminal D-alanine. FIG. 57A shows a schematic of a deuterium labeled C- terminal D-alanine with thioether crosslink product. FIG. 57B is the corresponding mass spectra data showing formation of the thioether crosslink and loss of the deuterium labeled confirmed by mass shift and loss 3Da.
[0081] FIG. 58A-B show representative data for thioether crosslinking with deuterium labeled C-terminal D-methionine. FIG. 58A shows a schematic of a deuterium labeled C- terminal D-methionine with thioether crosslink product. FIG. 58B is the corresponding mass spectra data showing formation of the thioether crosslink and loss of the deuterium labeled confirmed by mass shift and loss 3Da.
[0082] FIG. 59A-B show representative data for thioether crosslinking with d2-labeled D- valine. FIG. 59A shows a structure of a deuterium labeled C-terminal D-valine. FIG. 59B is the corresponding mass spectra data showing formation of the thioether crosslink however mass shift is indicative of no loss of deuterium.
[0083] FIG. 60A-B show representative data for thioether crosslinking with d3-labeled D- valine. FIG. 60A shows a schematic of a deuterium labeled side chain C-terminal D-valine with thioether crosslink product. FIG. 60B is the corresponding mass spectra data showing
formation of the thioether crosslink and loss of the deuterium labeled confirmed by mass shift and loss 3Da.
[0084] FIG. 61A-D show representative data for thioether crosslinking with deuterium labeled C-terminal D-phenyl alanine. FIG. 61A shows a structure of a deuterium labeled Ca C-terminal D-phenyl alanine. FIG. 61B is the corresponding mass spectra data showing formation of the thioether crosslink however mass shift is indicative of no loss of deuterium.
FIG. 61C shows a structure of a deuterium labeled aryl C-terminal D-phenyl alanine. FIG. 61D is the corresponding mass spectra data showing formation of the thioether crosslink however mass shift is indicative of no loss of deuterium
[0085] FIG. 62A-B show representative data for thioether crosslinking with deuterium labeled d8-C-terminal D-phenylalanine. FIG. 62A shows a schematic of a deuterium labeled d8-C-terminal D-methionine with thioether crosslink product. FIG. 62B is the corresponding mass spectra data showing formation of the thioether crosslink and loss of the deuterium labeled confirmed by mass shift.
[0086] FIG. 63 shows structures of sactipeptide thioether crosslink of corresponding D- aminoacids
[0087] FIG. 64 shows structures of ranthipeptide thioether crosslink of corresponding D- aminoacids.
[0088] FIG. 65A-B show representative data for 6-membered non-peptidic thioether crosslinking. FIG. 65 A shows scheme of Leader-Cys-Gly reaction. FIG. 65B is the corresponding mass spectra data showing lack of formation of the thioether crosslink of 6- membered ring.
[0089] FIG. 66A-B show representative data for 7-membered non-peptidic thioether crosslinking. FIG. 66A shows scheme of Leader-hCys-Gly reaction. FIG. 66B is the corresponding mass spectra data showing formation of the thioether crosslink of 7-membered ring.
[0090] FIG. 67A-B show representative data for 7-membered non-peptidic thioether crosslinking. FIG. 67 A shows scheme of Leader-Cys-βAla reaction. FIG. 67B is the corresponding mass spectra data showing formation of the thioether crosslink of 7-membered ring.
[0091] FIG. 68A-B show representative data for 8-membered non-peptidic thioether crosslinking. FIG. 68A shows scheme of Leader-hCys-βAla reaction. FIG. 68B is the
corresponding mass spectra data showing formation of the thioether crosslink of 8-membered ring.
[0092] FIG. 69A-B show representative data for 8-membered non-peptidic thioether crosslinking. FIG. 69 A shows scheme of Leader-Cys-GABA reaction. FIG. 69B is the corresponding mass spectra data showing formation of the thioether crosslink of 8-membered ring.
[0093] FIG. 70A-B show representative data for 9-membered non-peptidic thioether crosslinking. FIG. 70 A shows scheme of Leader-hCys-GABA reaction. FIG. 70B is the corresponding mass spectra data showing formation of the thioether crosslink of 9-membered ring.
[0094] FIG. 71A-B show representative data for 16-membered non-peptidic thioether crosslinking. FIG. 71 A shows scheme of Leader-hCys-NH-PEG3-CO2H reaction. FIG. 71 A is the corresponding mass spectra data showing formation of the thioether crosslink of 16- membered ring.
[0095] FIG. 72A-B show representative data for 20-membered non-peptidic thioether crosslinking. FIG. 72A shows scheme of Leader-hCys-NH-PEG4-CO2H reaction. FIG. 72B is the corresponding mass spectra data showing formation of the thioether crosslink of 20- membered ring.
[0096] FIG. 73A-B show representative data for unusual non-peptidic thioether crosslinking. FIG. 73A shows scheme of Leader-Cys-Ser-Ala-Asn-2-(2-aminophenyl)acetic acid reaction. FIG. 73B is the corresponding mass spectra data showing formation of the thioether crosslink of 17-membered ring.
[0097] FIG. 74A-B show representative data for unusual non-peptidic thioether crosslinking. FIG. 74A shows scheme of Leader-Cys-Ser-Ala-Asn-2-(2-(aminomethyl)phenyl)acetic acid reaction. FIG. 74B is the corresponding mass spectra data showing formation of the thioether crosslink of 18-membered ring.
[0098] FIG. 75A-B show representative data for coumarin thioether crosslinking. FIG. 75A shows scheme of Leader-Cys-coumarin reaction. FIG. 75B is the corresponding mass spectra data showing formation of the thioether crosslink of 12-membered ring.
[0099] FIG. 76A-C show representative data for the synthesis thioether peptidomimetic. FIG. 76A is a structure of Setmalanotide, an FDA approved drug. FIG. 76B shows a schematic thioether crosslinking with a modified peptide structure (e.g., an analog of Setmalanotide). FIG. 76C is the corresponding mass spectra data showing formation of the thioether crosslink.
[00100] FIG. 77A-D show representative data for the synthesis thioether peptidomimetic. FIG. 77A is a structure of a Novartis orally available peptide. FIG. 77B is a structure of the designed peptides (an analog of the therapeutic peptide from FIG. 77A) and the expected product upon modification with PapB. FIG. 77C shows a schematic thioether crosslinking with a modified peptide structure. FIG. 77D is the corresponding mass spectra data showing formation of the thioether crosslink.
[00101] FIG. 78A-D show representative therapeutic cyclic peptides that can be mimicked by a thioether crosslink peptide. FIG. 78A show the structure of a representative cyclic peptide, bremelanotide. FIG. 78B shows a representative structure of the thioether crosslinked product, an analog of bremelanotide, which contains the amino acid sequence norleucine, cysteine, D-phenylalanine, arginine, tryptophan, and epsilon-amino hexanoic acid (ACP). FIG. 78C shows a representative scheme of the Leader-XCDFRWZ XXX reaction. FIG. 78D is the corresponding mass spectra data showing formation of the thioether crosslink of therapeutic analog.
[00102] FIG. 79A-E show representative data illustrating that PapB forms crosslinks in thiol- and carboxylate-containing extended sidechains. Specifically, FIG. 79A shows a generalized linear scenario of C19hCys msPapA in which n = CH2 (Asp), (CH2)2 (Glu), or (CH2)3 (hGlu). FIG. 79B shows a 2 Da shift in the MS for the carboxylate-containing residue as Asp. FIG. 79C shows a 2 Da shift in the MS for the carboxylate-containing residue as Glu. FIG. 79D shows a 2 Da shift in the MS for the carboxylate-containing residue as homoGlu. FIG. 79E shows the MS for the liberated macrocyclized peptide core from the leader sequence following cleavage of the TEV protease recognition sequence with TEV protease.
[00103] FIG. 80 shows a representative proton NMR spectrum of the linear G(hC)SAN(hE)A peptide.
[00104] FIG. 81 shows a representative proton NMR spectrum of the cyclized G(hC)SAN(hE)A peptide.
[00105] FIG. 82 shows a representative ROESY spectrum of the linear G(hC)SAN(hE)A peptide.
[00106] FIG. 83 shows a representative ROESY spectrum of the cyclized G(hC)SAN(hE)A peptide.
[00107] FIG. 84A-C show representative data pertaining to a carboxylate isostere (tetrazole moiety) crosslinked by PapB. Specifically, FIG. 84A shows a schematic of the linear and cyclized peptide illustrating the putative crosslink location. FIG. 84B shows MS
results illustrating a clear 2 Da loss between an assay without PapB (darker gray) and with the addition of PapB (lighter gray). FIG. 84C shows the expected tandem mass spectrometry with no fragmentation between Cys and T4Az.
[00108] FIG. 85 shows representative fragmentation of reacted D23T4Az msPapA variant.
[00109] FIG. 86 shows representative fragments of a tetrazole loss in the D23T4Az msPapA variant
[00110] Additional advantages of the invention will be set forth in part in the description which follows, and in part will be obvious from the description, or can be learned by practice of the invention. The advantages of the invention will be realized and attained by means of the elements and combinations particularly pointed out in the appended claims. It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention, as claimed.
DETAILED DESCRIPTION
[00111] The present invention can be understood more readily by reference to the following detailed description of the invention and the Examples included therein.
[00112] Before the present compounds, compositions, articles, systems, devices, and/or methods are disclosed and described, it is to be understood that they are not limited to specific synthetic methods unless otherwise specified, or to particular reagents unless otherwise specified, as such may, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular aspects only and is not intended to be limiting. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, example methods and materials are now described.
[00113] While aspects of the present invention can be described and claimed in a particular statutory class, such as the system statutory class, this is for convenience only and one of skill in the art will understand that each aspect of the present invention can be described and claimed in any statutory class. Unless otherwise expressly stated, it is in no way intended that any method or aspect set forth herein be construed as requiring that its steps be performed in a specific order. Accordingly, where a method claim does not specifically state in the claims or descriptions that the steps are to be limited to a specific order, it is no way intended that an order be inferred, in any respect. This holds for any
possible non-express basis for interpretation, including matters of logic with respect to arrangement of steps or operational flow, plain meaning derived from grammatical organization or punctuation, or the number or type of aspects described in the specification. [00114] Throughout this application, various publications are referenced. The disclosures of these publications in their entireties are hereby incorporated by reference into this application in order to more fully describe the state of the art to which this pertains. The references disclosed are also individually and specifically incorporated by reference herein for the material contained in them that is discussed in the sentence in which the reference is relied upon. Nothing herein is to be construed as an admission that the present invention is not entitled to antedate such publication by virtue of prior invention. Further, the dates of publication provided herein may be different from the actual publication dates, which can require independent confirmation.
A. DEFINITIONS
[00115] As used in the specification and the appended claims, the singular forms “a,” “an” and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “a functional group,” “an alkyl,” or “a residue” includes mixtures of two or more such functional groups, alkyls, or residues, and the like.
[00116] As used in the specification and in the claims, the term “comprising” can include the aspects “consisting of’ and “consisting essentially of.”
[00117] Ranges can be expressed herein as from “about” one particular value, and/or to “about” another particular value. When such a range is expressed, another aspect includes from the one particular value and/or to the other particular value. Similarly, when values are expressed as approximations, by use of the antecedent “about,” it will be understood that the particular value forms another aspect. It will be further understood that the endpoints of each of the ranges are significant both in relation to the other endpoint, and independently of the other endpoint. It is also understood that there are a number of values disclosed herein, and that each value is also herein disclosed as “about” that particular value in addition to the value itself. For example, if the value “10” is disclosed, then “about 10” is also disclosed. It is also understood that each unit between two particular units are also disclosed. For example, if 10 and 15 are disclosed, then 11, 12, 13, and 14 are also disclosed.
[00118] As used herein, the terms “about” and “at or about” mean that the amount or value in question can be the value designated some other value approximately or about the same. It is generally understood, as used herein, that it is the nominal value indicated ±10%
variation unless otherwise indicated or inferred. The term is intended to convey that similar values promote equivalent results or effects recited in the claims. That is, it is understood that amounts, sizes, formulations, parameters, and other quantities and characteristics are not and need not be exact, but can be approximate and/or larger or smaller, as desired, reflecting tolerances, conversion factors, rounding off, measurement error and the like, and other factors known to those of skill in the art. In general, an amount, size, formulation, parameter or other quantity or characteristic is “about” or “approximate” whether or not expressly stated to be such. It is understood that where “about” is used before a quantitative value, the parameter also includes the specific quantitative value itself, unless specifically stated otherwise.
[00119] References in the specification and concluding claims to parts by weight of a particular element or component in a composition denotes the weight relationship between the element or component and any other elements or components in the composition or article for which a part by weight is expressed. Thus, in a compound containing 2 parts by weight of component X and 5 parts by weight component Y, X and Y are present at a weight ratio of 2:5, and are present in such ratio regardless of whether additional components are contained in the compound.
[00120] A weight percent (wt. %) of a component, unless specifically stated to the contrary, is based on the total weight of the formulation or composition in which the component is included.
[00121] As used herein, “IC50” is intended to refer to the concentration of a substance (e.g., a compound or a drug) that is required for 50% inhibition of a biological process, or component of a process, including a protein, subunit, organelle, ribonucleoprotein, etc. In one aspect, an IC50 can refer to the concentration of a substance that is required for 50% inhibition in vivo, as further defined elsewhere herein. In a further aspect, IC50 refers to the half-maximal (50%) inhibitory concentration (IC) of a substance.
[00122] As used herein, “ EC50” is intended to refer to the concentration of a substance (e.g., a compound or a drug) that is required for 50% agonism of a biological process, or component of a process, including a protein, subunit, organelle, ribonucleoprotein, etc. In one aspect, an EC50 can refer to the concentration of a substance that is required for 50% agonism in vivo, as further defined elsewhere herein. In a further aspect, EC50 refers to the concentration of agonist that provokes a response hallway between the baseline and maximum response.
[00123] As used herein, the term “optional” or “optionally” means that the subsequently described event or circumstance can or cannot occur, and that the description includes instances where said event or circumstance occurs and instances where it does not.
[00124] As used herein, the term “subject” can be a vertebrate, such as a mammal, a fish, a bird, a reptile, or an amphibian. Thus, the subject of the herein disclosed methods can be a human, non-human primate, horse, pig, rabbit, dog, sheep, goat, cow, cat, guinea pig or rodent. The term does not denote a particular age or sex. Thus, adult and newborn subjects, as well as fetuses, whether male or female, are intended to be covered. In one aspect, the subject is a mammal. A patient refers to a subject afflicted with a disease, disorder, or condition. The term “patient” includes human and veterinary subjects.
[00125] As used herein, the term “treatment” refers to the medical management of a patient with the intent to cure, ameliorate, stabilize, or prevent a disease, pathological condition, or disorder. This term includes active treatment, that is, treatment directed specifically toward the improvement of a disease, pathological condition, or disorder, and also includes causal treatment, that is, treatment directed toward removal of the cause of the associated disease, pathological condition, or disorder. In addition, this term includes palliative treatment, that is, treatment designed for the relief of symptoms rather than the curing of the disease, pathological condition, or disorder; preventative treatment, that is, treatment directed to minimizing or partially or completely inhibiting the development of the associated disease, pathological condition, or disorder; and supportive treatment, that is, treatment employed to supplement another specific therapy directed toward the improvement of the associated disease, pathological condition, or disorder. In various aspects, the term covers any treatment of a subject, including a mammal (e.g., a human), and includes: (i) preventing the disease from occurring in a subject that can be predisposed to the disease but has not yet been diagnosed as having it; (ii) inhibiting the disease, i.e., arresting its development; or (iii) relieving the disease, i.e., causing regression of the disease. In one aspect, the subject is a mammal such as a primate, and, in a further aspect, the subject is a human. The term “subject” also includes domesticated animals (e.g., cats, dogs, etc.), livestock (e.g., cattle, horses, pigs, sheep, goats, etc.), and laboratory animals (e.g., mouse, rabbit, rat, guinea pig, fruit fly, etc.).
[00126] As used herein, the term “prevent” or “preventing” refers to precluding, averting, obviating, forestalling, stopping, or hindering something from happening, especially by advance action. It is understood that where reduce, inhibit or prevent are used herein,
unless specifically indicated otherwise, the use of the other two words is also expressly disclosed.
[00127] As used herein, the term “diagnosed” means having been subjected to a physical examination by a person of skill, for example, a physician, and found to have a condition that can be diagnosed or treated by the compounds, compositions, or methods disclosed herein.
[00128] As used herein, the terms “administering” and “administration” refer to any method of providing a pharmaceutical preparation to a subject. Such methods are well known to those skilled in the art and include, but are not limited to, oral administration, transdermal administration, administration by inhalation, nasal administration, topical administration, intravaginal administration, ophthalmic administration, intraaural administration, intracerebral administration, rectal administration, sublingual administration, buccal administration, and parenteral administration, including injectable such as intravenous administration, intra-arterial administration, intramuscular administration, and subcutaneous administration. Administration can be continuous or intermittent. In various aspects, a preparation can be administered therapeutically; that is, administered to treat an existing disease or condition. In further various aspects, a preparation can be administered prophylactically; that is, administered for prevention of a disease or condition.
[00129] As used herein, the terms “effective amount” and “amount effective” refer to an amount that is sufficient to achieve the desired result or to have an effect on an undesired condition. For example, a “therapeutically effective amount” refers to an amount that is sufficient to achieve the desired therapeutic result or to have an effect on undesired symptoms, but is generally insufficient to cause adverse side effects. The specific therapeutically effective dose level for any particular patient will depend upon a variety of factors including the condition being treated and the severity of the condition; the specific composition employed; the age, body weight, general health, sex and diet of the patient; the time of administration; the route of administration; the rate of excretion of the specific compound employed; the duration of the treatment; drugs used in combination or coincidental with the specific compound employed and like factors well known in the medical arts. For example, it is well within the skill of the art to start doses of a compound at levels lower than those required to achieve the desired therapeutic effect and to gradually increase the dosage until the desired effect is achieved. If desired, the effective daily dose can be divided into multiple doses for purposes of administration. Consequently, single dose compositions can contain such amounts or submultiples thereof to make up the daily dose.
The dosage can be adjusted by the individual physician in the event of any contraindications. Dosage can vary, and can be administered in one or more dose administrations daily, for one or several days. Guidance can be found in the literature for appropriate dosages for given classes of pharmaceutical products. In further various aspects, a preparation can be administered in a “prophylactically effective amount”; that is, an amount effective for prevention of a disease or condition.
[00130] As used herein, “dosage form” means a pharmacologically active material in a medium, carrier, vehicle, or device suitable for administration to a subject. A dosage forms can comprise inventive a disclosed compound, a product of a disclosed method of making, or a salt, solvate, or polymorph thereof, in combination with a pharmaceutically acceptable excipient, such as a preservative, buffer, saline, or phosphate buffered saline. Dosage forms can be made using conventional pharmaceutical manufacturing and compounding techniques. Dosage forms can comprise inorganic or organic buffers (e.g., sodium or potassium salts of phosphate, carbonate, acetate, or citrate) and pH adjustment agents (e.g., hydrochloric acid, sodium or potassium hydroxide, salts of citrate or acetate, amino acids and their salts) antioxidants (e.g., ascorbic acid, alpha-tocopherol), surfactants (e.g., polysorbate 20, polysorbate 80, polyoxyethylene 9-10 nonyl phenol, sodium desoxycholate), solution and/or cryo/lyo stabilizers (e.g., sucrose, lactose, mannitol, trehalose), osmotic adjustment agents (e.g., salts or sugars), antibacterial agents (e.g., benzoic acid, phenol, gentamicin), antifoaming agents (e.g., polydimethylsilozone), preservatives (e.g., thimerosal, 2- phenoxyethanol, EDTA), polymeric stabilizers and viscosity-adjustment agents (e.g., polyvinylpyrrolidone, poloxamer 488, carboxymethylcellulose) and co-solvents (e.g., glycerol, polyethylene glycol, ethanol). A dosage form formulated for injectable use can have a disclosed compound, a product of a disclosed method of making, or a salt, solvate, or polymorph thereof, suspended in sterile saline solution for injection together with a preservative.
[00131] As used herein, “kit” means a collection of at least two components constituting the kit. Together, the components constitute a functional unit for a given purpose. Individual member components may be physically packaged together or separately. For example, a kit comprising an instruction for using the kit may or may not physically include the instruction with other individual member components. Instead, the instruction can be supplied as a separate member component, either in a paper form or an electronic form which may be supplied on computer readable memory device or downloaded from an internet website, or as recorded presentation.
[00132] As used herein, “instruction(s)” means documents describing relevant materials or methodologies pertaining to a kit. These materials may include any combination of the following: background information, list of components and their availability information (purchase information, etc.), brief or detailed protocols for using the kit, trouble- shooting, references, technical support, and any other related documents. Instructions can be supplied with the kit or as a separate member component, either as a paper form or an electronic form which may be supplied on computer readable memory device or downloaded from an internet website, or as recorded presentation. Instructions can comprise one or multiple documents, and are meant to include future updates.
[00133] As used herein, the terms “therapeutic agent” include any synthetic or naturally occurring biologically active compound or composition of matter which, when administered to an organism (human or nonhuman animal), induces a desired pharmacologic, immunogenic, and/or physiologic effect by local and/or systemic action. The term therefore encompasses those compounds or chemicals traditionally regarded as drugs, vaccines, and biopharmaceuticals including molecules such as proteins, peptides, hormones, nucleic acids, gene constructs and the like. Examples of therapeutic agents are described in well-known literature references such as the Merck Index (14th edition), the Physicians' Desk Reference (64th edition), and The Pharmacological Basis of Therapeutics (12th edition), and they include, without limitation, medicaments; vitamins; mineral supplements; substances used for the treatment, prevention, diagnosis, cure or mitigation of a disease or illness; substances that affect the structure or function of the body, or pro-drugs, which become biologically active or more active after they have been placed in a physiological environment. For example, the term “therapeutic agent” includes compounds or compositions for use in all of the major therapeutic areas including, but not limited to, adjuvants; anti-infectives such as antibiotics and antiviral agents; anti-cancer and anti-neoplastic agents such as kinase inhibitors, poly ADP ribose polymerase (PARP) inhibitors and other DNA damage response modifiers, epigenetic agents such as bromodomain and extra-terminal (BET) inhibitors, histone deacetylase (HD Ac) inhibitors, iron chelotors and other ribonucleotides reductase inhibitors, proteasome inhibitors and Nedd8-activating enzyme (NAE) inhibitors, mammalian target of rapamycin (mTOR) inhibitors, traditional cytotoxic agents such as paclitaxel, dox, irinotecan, and platinum compounds, immune checkpoint blockade agents such as cytotoxic T lymphocyte antigen-4 (CTLA-4) monoclonal antibody (mAB), programmed cell death protein 1 (PD-l)/programmed cell death-ligand 1 (PD-L1) mAB, cluster of differentiation 47 (CD47) mAB, toll-like receptor (TLR) agonists and other immune modifiers, cell
therapeutics such as chimeric antigen receptor T-cell (CAR-T)/chimeric antigen receptor natural killer (CAR-NK) cells, and proteins such as interferons (IFNs), interleukins (ILs), and mAbs; anti-ALS agents such as entry inhibitors, fusion inhibitors, non-nucleoside reverse transcriptase inhibitors (NNRTIs), nucleoside reverse transcriptase inhibitors (NRTIs), nucleotide reverse transcriptase inhibitors, NCP7 inhibitors, protease inhibitors, and integrase inhibitors; analgesics and analgesic combinations, anorexics, anti-inflammatory agents, anti- epileptics, local and general anesthetics, hypnotics, sedatives, antipsychotic agents, neuroleptic agents, antidepressants, anxiolytics, antagonists, neuron blocking agents, anticholinergic and cholinomimetic agents, antimuscarinic and muscarinic agents, antiadrenergics, antiarrhythmics, antihypertensive agents, hormones, and nutrients, antiarthritics, antiasthmatic agents, anticonvulsants, antihistamines, antinauseants, antineoplastics, antipruritics, antipyretics; antispasmodics, cardiovascular preparations (including calcium channel blockers, beta-blockers, beta-agonists and antiarrythmics), antihypertensives, diuretics, vasodilators; central nervous system stimulants; cough and cold preparations; decongestants; diagnostics; hormones; bone growth stimulants and bone resorption inhibitors; immunosuppressives; muscle relaxants; psychostimulants; sedatives; tranquilizers; proteins, peptides, and fragments thereof (whether naturally occurring, chemically synthesized or recombinantly produced); and nucleic acid molecules (polymeric forms of two or more nucleotides, either ribonucleotides (RNA) or deoxyribonucleotides (DNA) including both double- and single-stranded molecules, gene constructs, expression vectors, antisense molecules and the like), small molecules (e.g., doxorubicin) and other biologically active macromolecules such as, for example, proteins and enzymes. The agent may be a biologically active agent used in medical, including veterinary, applications and in agriculture, such as with plants, as well as other areas. The term “therapeutic agent” also includes without limitation, medicaments; vitamins; mineral supplements; substances used for the treatment, prevention, diagnosis, cure or mitigation of disease or illness; or substances which affect the structure or function of the body; or pro-drugs, which become biologically active or more active after they have been placed in a predetermined physiological environment.
[00134] The term “pharmaceutically acceptable” describes a material that is not biologically or otherwise undesirable, i.e., without causing an unacceptable level of undesirable biological effects or interacting in a deleterious manner.
[00135] As used herein, the term “sactipeptide” refers to a sulfur-to-alpha carbon thioether cross-linked peptide belonging to the ribosomally synthesized post-translationally
modified peptide (RiPP) superfamily. As illustrated by the structure below, a sactipeptide contains an intramolecular thioether bond that crosslinks the sulfur atom of a cysteine residue to the α-carbon of an acceptor amino acid.
[00136] As used herein, the term “ranthipeptide” refers to a radical non-a thioether- containing peptide, which, similar to sactipeptides above, is also a member of the RiPP superfamily. For example, as illustrated below, a ranthipeptide can contain an intramolecular thioether bond that crosslinks the sulfur atom of a cysteine residue to any carbon other than the a-carbon of an acceptor amino acid.
[00138] As used herein, the term “derivative” refers to a compound having a structure derived from the structure of a parent compound (e.g., a compound disclosed herein) and whose structure is sufficiently similar to those disclosed herein and based upon that similarity, would be expected by one skilled in the art to exhibit the same or similar activities and utilities as the claimed compounds, or to induce, as a precursor, the same or similar
activities and utilities as the claimed compounds. Exemplary derivatives include salts, esters, amides, salts of esters or amides, and N-oxides of a parent compound.
[00139] As used herein, the term “pharmaceutically acceptable carrier” refers to sterile aqueous or nonaqueous solutions, dispersions, suspensions or emulsions, as well as sterile powders for reconstitution into sterile injectable solutions or dispersions just prior to use. Examples of suitable aqueous and nonaqueous carriers, diluents, solvents or vehicles include water, ethanol, polyols (such as glycerol, propylene glycol, polyethylene glycol and the like), carboxymethylcellulose and suitable mixtures thereof, vegetable oils (such as olive oil) and injectable organic esters such as ethyl oleate. Proper fluidity can be maintained, for example, by the use of coating materials such as lecithin, by the maintenance of the required particle size in the case of dispersions and by the use of surfactants. These compositions can also contain adjuvants such as preservatives, wetting agents, emulsifying agents and dispersing agents. Prevention of the action of microorganisms can be ensured by the inclusion of various antibacterial and antifungal agents such as paraben, chlorobutanol, phenol, sorbic acid and the like. It can also be desirable to include isotonic agents such as sugars, sodium chloride and the like. Prolonged absorption of the injectable pharmaceutical form can be brought about by the inclusion of agents, such as aluminum monostearate and gelatin, which delay absorption. Injectable depot forms are made by forming microencapsule matrices of the drug in biodegradable polymers such as polylactide-polyglycolide, poly(orthoesters) and poly(anhydrides). Depending upon the ratio of drug to polymer and the nature of the particular polymer employed, the rate of drug release can be controlled. Depot injectable formulations are also prepared by entrapping the drug in liposomes or microemulsions which are compatible with body tissues. The injectable formulations can be sterilized, for example, by filtration through a bacterial-retaining filter or by incorporating sterilizing agents in the form of sterile solid compositions which can be dissolved or dispersed in sterile water or other sterile injectable media just prior to use. Suitable inert carriers can include sugars such as lactose. Desirably, at least 95% by weight of the particles of the active ingredient have an effective particle size in the range of 0.01 to 10 micrometers.
[00140] As used herein, the term “substituted” is contemplated to include all permissible substituents of organic compounds. In a broad aspect, the permissible substituents include acyclic and cyclic, branched and unbranched, carbocyclic and heterocyclic, and aromatic and nonaromatic substituents of organic compounds. Illustrative substituents include, for example, those described below. The permissible substituents can be one or more and the same or different for appropriate organic compounds. For purposes of
this disclosure, the heteroatoms, such as nitrogen, can have hydrogen substituents and/or any permissible substituents of organic compounds described herein which satisfy the valences of the heteroatoms. This disclosure is not intended to be limited in any manner by the permissible substituents of organic compounds. Also, the terms “substitution” or “substituted with” include the implicit proviso that such substitution is in accordance with permitted valence of the substituted atom and the substituent, and that the substitution results in a stable compound, e.g., a compound that does not spontaneously undergo transformation such as by rearrangement, cyclization, elimination, etc. It is also contemplated that, in certain aspects, unless expressly indicated to the contrary, individual substituents can be further optionally substituted (i.e., further substituted or unsubstituted).
[00141] In defining various terms, “A1,” “A2,” “A3,” and “A4” are used herein as generic symbols to represent various specific substituents. These symbols can be any substituent, not limited to those disclosed herein, and when they are defined to be certain substituents in one instance, they can, in another instance, be defined as some other substituents.
[00142] The term “aliphatic” or “aliphatic group,” as used herein, denotes a hydrocarbon moiety that may be straight-chain (i.e., unbranched), branched, or cyclic (including fused, bridging, and spirofused polycyclic) and may be completely saturated or may contain one or more units of unsaturation, but which is not aromatic. Unless otherwise specified, aliphatic groups contain 1-20 carbon atoms. Aliphatic groups include, but are not limited to, linear or branched, alkyl, alkenyl, and alkynyl groups, and hybrids thereof such as (cycloalkyl)alkyl, (cycloalkenyl)alkyl or (cycloalkyl)alkenyl.
[00143] The term “alkyl” as used herein is a branched or unbranched saturated hydrocarbon group of 1 to 24 carbon atoms, such as methyl, ethyl, n-propyl, isopropyl, n- butyl, isobutyl, s-butyl, t-butyl, n-pentyl, isopentyl, s-pentyl, neopentyl, hexyl, heptyl, octyl, nonyl, decyl, dodecyl, tetradecyl, hexadecyl, eicosyl, tetracosyl, and the like. The alkyl group can be cyclic or acyclic. The alkyl group can be branched or unbranched. The alkyl group can also be substituted or unsubstituted. For example, the alkyl group can be substituted with one or more groups including, but not limited to, alkyl, cycloalkyl, alkoxy, amino, ether, halide, hydroxy, nitro, silyl, sulfo-oxo, or thiol, as described herein. A “lower alkyl” group is an alkyl group containing from one to six (e.g., from one to four) carbon atoms. The term alkyl group can also be a Cl alkyl, C1-C2 alkyl, C1-C3 alkyl, C1-C4 alkyl, C1-C5 alkyl, C1-C6 alkyl, C1-C7 alkyl, C1-C8 alkyl, C1-C9 alkyl, Cl -CIO alkyl, and the like up to and including a C1-C24 alkyl.
[00144] Throughout the specification “alkyl” is generally used to refer to both unsubstituted alkyl groups and substituted alkyl groups; however, substituted alkyl groups are also specifically referred to herein by identifying the specific substituent(s) on the alkyl group. For example, the term “halogenated alkyl” or “haloalkyl” specifically refers to an alkyl group that is substituted with one or more halide, e.g., fluorine, chlorine, bromine, or iodine. Alternatively, the term “monohaloalkyl” specifically refers to an alkyl group that is substituted with a single halide, e.g. fluorine, chlorine, bromine, or iodine. The term “polyhaloalkyl” specifically refers to an alkyl group that is independently substituted with two or more halides, i.e. each halide substituent need not be the same halide as another halide substituent, nor do the multiple instances of a halide substituent need to be on the same carbon. The term “alkoxyalkyl” specifically refers to an alkyl group that is substituted with one or more alkoxy groups, as described below. The term “aminoalkyl” specifically refers to an alkyl group that is substituted with one or more amino groups. The term “hydroxyalkyl” specifically refers to an alkyl group that is substituted with one or more hydroxy groups. When “alkyl” is used in one instance and a specific term such as “hydroxyalkyl” is used in another, it is not meant to imply that the term “alkyl” does not also refer to specific terms such as “hydroxyalkyl” and the like.
[00145] This practice is also used for other groups described herein. That is, while a term such as “cycloalkyl” refers to both unsubstituted and substituted cycloalkyl moieties, the substituted moieties can, in addition, be specifically identified herein; for example, a particular substituted cycloalkyl can be referred to as, e.g., an “alkylcycloalkyl.” Similarly, a substituted alkoxy can be specifically referred to as, e.g., a “halogenated alkoxy,” a particular substituted alkenyl can be, e.g., an “alkenylalcohol,” and the like. Again, the practice of using a general term, such as “cycloalkyl,” and a specific term, such as “alkylcycloalkyl,” is not meant to imply that the general term does not also include the specific term.
[00146] The term “cycloalkyl” as used herein is a non-aromatic carbon-based ring composed of at least three carbon atoms. Examples of cycloalkyl groups include, but are not limited to, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, norbomyl, and the like. The term “heterocycloalkyl” is a type of cycloalkyl group as defined above, and is included within the meaning of the term “cycloalkyl,” where at least one of the carbon atoms of the ring is replaced with a heteroatom such as, but not limited to, nitrogen, oxygen, sulfur, or phosphorus. The cycloalkyl group and heterocycloalkyl group can be substituted or unsubstituted. The cycloalkyl group and heterocycloalkyl group can be substituted with one
or more groups including, but not limited to, alkyl, cycloalkyl, alkoxy, amino, ether, halide, hydroxy, nitro, silyl, sulfo-oxo, or thiol as described herein.
[00147] The term “polyalkylene group” as used herein is a group having two or more CH2 groups linked to one another. The polyalkylene group can be represented by the formula -(CH2)a- , where “a” is an integer of from 2 to 500.
[00148] The terms “alkoxy” and “alkoxyl” as used herein to refer to an alkyl or cycloalkyl group bonded through an ether linkage; that is, an “alkoxy” group can be defined as — OA1 where A1 is alkyl or cycloalkyl as defined above. “Alkoxy” also includes polymers of alkoxy groups as just described; that is, an alkoxy can be a polyether such as — OA1 — OA2 or — OA1 — (OA2)a — OA3, where “a” is an integer of from 1 to 200 and A1, A2, and A3 are alkyl and/or cycloalkyl groups. [00149] The term “alkenyl” as used herein is a hydrocarbon group of from 2 to 24 carbon atoms with a structural formula containing at least one carbon-carbon double bond. Asymmetric structures such as (A1A2)C=C(A3A4) are intended to include both the E and Z isomers. This can be presumed in structural formulae herein wherein an asymmetric alkene is present, or it can be explicitly indicated by the bond symbol C=C. The alkenyl group can be substituted with one or more groups including, but not limited to, alkyl, cycloalkyl, alkoxy, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, azide, nitro, silyl, sulfo-oxo, or thiol, as described herein.
[00150] The term “cycloalkenyl” as used herein is a non-aromatic carbon-based ring composed of at least three carbon atoms and containing at least one carbon-carbon double bound, i.e., C=C. Examples of cycloalkenyl groups include, but are not limited to, cyclopropenyl, cyclobutenyl, cyclopentenyl, cyclopentadienyl, cyclohexenyl, cyclohexadienyl, norbomenyl, and the like. The term “heterocycloalkenyl” is a type of cycloalkenyl group as defined above, and is included within the meaning of the term “cycloalkenyl,” where at least one of the carbon atoms of the ring is replaced with a heteroatom such as, but not limited to, nitrogen, oxygen, sulfur, or phosphorus. The cycloalkenyl group and heterocycloalkenyl group can be substituted or unsubstituted. The cycloalkenyl group and heterocycloalkenyl group can be substituted with one or more groups including, but not limited to, alkyl, cycloalkyl, alkoxy, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, azide, nitro, silyl, sulfo-oxo, or thiol as described herein.
[00151] The term “alkynyl” as used herein is a hydrocarbon group of 2 to 24 carbon atoms with a structural formula containing at least one carbon-carbon triple bond. The alkynyl group can be unsubstituted or substituted with one or more groups including, but not limited to, alkyl, cycloalkyl, alkoxy, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, azide, nitro, silyl, sulfo-oxo, or thiol, as described herein.
[00152] The term “cycloalkynyl” as used herein is a non-aromatic carbon-based ring composed of at least seven carbon atoms and containing at least one carbon-carbon triple bound. Examples of cycloalkynyl groups include, but are not limited to, cycloheptynyl, cyclooctynyl, cyclononynyl, and the like. The term “heterocycloalkynyl” is a type of cycloalkenyl group as defined above, and is included within the meaning of the term “cycloalkynyl,” where at least one of the carbon atoms of the ring is replaced with a heteroatom such as, but not limited to, nitrogen, oxygen, sulfur, or phosphorus. The cycloalkynyl group and heterocycloalkynyl group can be substituted or unsubstituted. The cycloalkynyl group and heterocycloalkynyl group can be substituted with one or more groups including, but not limited to, alkyl, cycloalkyl, alkoxy, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, azide, nitro, silyl, sulfo-oxo, or thiol as described herein.
[00153] The term “aromatic group” as used herein refers to a ring structure having cyclic clouds of delocalized π electrons above and below the plane of the molecule, where the π clouds contain (4n+2) π electrons. A further discussion of aromaticity is found in Morrison and Boyd, Organic Chemistry, (5th Ed., 1987), Chapter 13, entitled “Aromaticity,” pages 477-497, incorporated herein by reference. The term “aromatic group” is inclusive of both aryl and heteroaryl groups.
[00154] The term “aryl” as used herein is a group that contains any carbon-based aromatic group including, but not limited to, benzene, naphthalene, phenyl, biphenyl, anthracene, and the like. The aryl group can be substituted or unsubstituted. The aryl group can be substituted with one or more groups including, but not limited to, alkyl, cycloalkyl, alkoxy, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, heteroaryl, aldehyde, — NH2, carboxylic acid, ester, ether, halide, hydroxy, ketone, azide, nitro, silyl, sulfo-oxo, or thiol as described herein. The term “biaryl” is a specific type of aryl group and is included in the definition of “aryl.” In addition, the aryl group can be a single ring structure or comprise multiple ring structures that are either fused ring structures or attached via one or more bridging groups such as a carbon-carbon bond. For example, biaryl can be two aryl groups
that are bound together via a fused ring structure, as in naphthalene, or are attached via one or more carbon-carbon bonds, as in biphenyl.
[00155] The term “aldehyde” as used herein is represented by the formula — C(O)H. Throughout this specification “C(O)” is a short hand notation for a carbonyl group, i.e., C=O. [00156] The terms “amine” or “amino” as used herein are represented by the formula — NA1 A2, where A1 and A2 can be, independently, hydrogen or alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group as described herein. A specific example of amino is — NH2.
[00157] The term “alkylamino” as used herein is represented by the formula — NH(- alkyl) where alkyl is a described herein. Representative examples include, but are not limited to, methylamino group, ethylamino group, propylamino group, isopropylamino group, butylamino group, isobutylamino group, (sec-butyl)amino group, (tert-butyl)amino group, pentylamino group, isopentylamino group, (tert-pentyl)amino group, hexylamino group, and the like.
[00158] The term “dialkylamino” as used herein is represented by the formula — N(- alkyl)2 where alkyl is a described herein. Representative examples include, but are not limited to, dimethylamino group, diethylamino group, dipropylamino group, diisopropylamino group, dibutylamino group, diisobutylamino group, di(sec-butyl)amino group, di(tert-butyl)amino group, dipentylamino group, diisopentylamino group, di(tert- pentyl)amino group, dihexylamino group, N-ethyl-N-methylamino group, N-methyl-N- propylamino group, N-ethyl-N-propylamino group and the like.
[00159] The term “carboxylic acid” as used herein is represented by the formula —
C(O)OH.
[00160] The term “ester” as used herein is represented by the formula — (OC(O)A1 or — C(O)OA1, where A1 can be alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group as described herein. The term “polyester” as used herein is represented by the formula — (A1O(O)C-A2-C(O)O)a — or (A1O(O)C-A2-OC(O))a , where A1 and A2 can be, independently, an alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group described herein and “a” is an integer from 1 to 500. “Polyester” is as the term used to describe a group that is produced by the reaction between a compound having at least two carboxylic acid groups with a compound having at least two hydroxyl groups.
[00161] The term “ether” as used herein is represented by the formula A1OA2, where A1 and A2 can be, independently, an alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl,
cycloalkynyl, aryl, or heteroaryl group described herein. The term “polyether” as used herein is represented by the formula — (A1O-A2O)a — , where A1 and A2 can be, independently, an alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group described herein and “a” is an integer of from 1 to 500. Examples of polyether groups include polyethylene oxide, polypropylene oxide, and polybutylene oxide.
[00162] The terms “halo,” “halogen,” or “halide” as used herein can be used interchangeably and refer to F, Cl, Br, or I.
[00163] The terms “pseudohalide,” “pseudohalogen,” or “pseudohalo” as used herein can be used interchangeably and refer to functional groups that behave substantially similar to halides. Such functional groups include, by way of example, cyano, thiocyanato, azido, trifluoromethyl, trifluoromethoxy, perfluoroalkyl, and perfluoroalkoxy groups.
[00164] The term “heteroalkyl,” as used herein, refers to an alkyl group containing at least one heteroatom. Suitable heteroatoms include, but are not limited to, O, N, Si, P and S, wherein the nitrogen, phosphorous and sulfur atoms are optionally oxidized, and the nitrogen heteroatom is optionally quatemized. Heteroalkyls can be substituted as defined above for alkyl groups.
[00165] The term “heteroaryl,” as used herein, refers to an aromatic group that has at least one heteroatom incorporated within the ring of the aromatic group. Examples of heteroatoms include, but are not limited to, nitrogen, oxygen, sulfur, and phosphorus, where N-oxides, sulfin- oxides, and dioxides are permissible heteroatom substitutions. The heteroaryl group can be substituted or unsubstituted. The heteroaryl group can be substituted with one or more groups including, but not limited to, alkyl, cycloalkyl, alkoxy, amino, ether, halide, hydroxy, nitro, silyl, sulfo-oxo, or thiol as described herein. Heteroaryl groups can be monocyclic, or alternatively fused ring systems. Heteroaryl groups include, but are not limited to, furyl, imidazolyl, pyrimidinyl, tetrazolyl, thienyl, pyridinyl, pyrrolyl, N- methylpyrrolyl, quinolinyl, isoquinolinyl, pyrazolyl, triazolyl, thiazolyl, oxazolyl, isoxazolyl, oxadiazolyl, thiadiazolyl, isothiazolyl, pyridazinyl, pyrazinyl, benzofuranyl, benzodioxolyl, benzothiophenyl, indolyl, indazolyl, benzimidazolyl, imidazopyridinyl, pyrazolopyridinyl, and pyrazolopyrimidinyl. Further not limiting examples of heteroaryl groups include, but are not limited to, pyridinyl, pyridazinyl, pyrimidinyl, pyrazinyl, thiophenyl, pyrazolyl, imidazolyl, benzo[d]oxazolyl, benzo[d]thiazolyl, quinolinyl, quinazolinyl, indazolyl, imidazo[ 1 ,2-b]pyridazinyl, imidazo[l ,2-a]pyrazinyl, benzo[c] [ 1 ,2,5]thiadiazolyl, benzo[c][l,2,5]oxadiazolyl, and pyrido[2,3-b]pyrazinyl.
[00166] The terms “heterocycle” or “heterocyclyl,” as used herein can be used interchangeably and refer to single and multi-cyclic aromatic or non-aromatic ring systems in which at least one of the ring members is other than carbon. Thus, the term is inclusive of, but not limited to, “heterocycloalkyl”, “heteroaryl”, “bicyclic heterocycle” and “polycyclic heterocycle.” Heterocycle includes pyridine, pyrimidine, furan, thiophene, pyrrole, isoxazole, isothiazole, pyrazole, oxazole, thiazole, imidazole, oxazole, including, 1,2,3- oxadiazole, 1,2,5-oxadiazole and 1,3,4-oxadiazole, thiadiazole, including, 1,2,3-thiadiazole, 1,2,5-thiadiazole, and 1 , 3, 4-thiadiazole, triazole, including, 1,2,3-triazole, 1,3,4-triazole, tetrazole, including 1,2,3,4-tetrazole and 1,2,4,5-tetrazole, pyridazine, pyrazine, triazine, including 1,2,4-triazine and 1,3,5-triazine, tetrazine, including 1,2,4,5-tetrazine, pyrrolidine, piperidine, piperazine, morpholine, azetidine, tetrahydropyran, tetrahydrofuran, dioxane, and the like. The term heterocyclyl group can also be a C2 heterocyclyl, C2-C3 heterocyclyl, C2- C4 heterocyclyl, C2-C5 heterocyclyl, C2-C6 heterocyclyl, C2-C7 heterocyclyl, C2-C8 heterocyclyl, C2-C9 heterocyclyl, C2-C10 heterocyclyl, C2-C11 heterocyclyl, and the like up to and including a C2-C18 heterocyclyl. For example, a C2 heterocyclyl comprises a group which has two carbon atoms and at least one heteroatom, including, but not limited to, aziridinyl, diazetidinyl, dihydrodiazetyl, oxiranyl, thiiranyl, and the like. Alternatively, for example, a C5 heterocyclyl comprises a group which has five carbon atoms and at least one heteroatom, including, but not limited to, piperidinyl, tetrahydropyranyl, tetrahydrothiopyranyl, diazepanyl, pyridinyl, and the like. It is understood that a heterocyclyl group may be bound either through a heteroatom in the ring, where chemically possible, or one of carbons comprising the heterocyclyl ring.
[00167] The term “bicyclic heterocycle” or “bicyclic heterocyclyl,” as used herein refers to a ring system in which at least one of the ring members is other than carbon. Bicyclic heterocyclyl encompasses ring systems wherein an aromatic ring is fused with another aromatic ring, or wherein an aromatic ring is fused with a non-aromatic ring. Bicyclic heterocyclyl encompasses ring systems wherein a benzene ring is fused to a 5- or a 6- membered ring containing 1 , 2 or 3 ring heteroatoms or wherein a pyridine ring is fused to a 5- or a 6-membered ring containing 1, 2 or 3 ring heteroatoms. Bicyclic heterocyclic groups include, but are not limited to, indolyl, indazolyl, pyrazolo[l,5-a]pyridinyl, benzofuranyl, quinolinyl, quinoxalinyl, 1,3-benzodioxolyl, 2,3-dihydro-l,4-benzodioxinyl, 3,4-dihydro-2H- chromenyl, 1H-pyrazolo[4,3-c]pyridin-3-yl; lH-pyrrolo[3,2-b]pyridin-3-yl; and 1H- pyrazolo[3,2-b]pyridin-3-yl.
[00168] The term “heterocycloalkyl” as used herein refers to an aliphatic, partially unsaturated or fully saturated, 3- to 14-membered ring system, including single rings of 3 to 8 atoms and bi- and tricyclic ring systems. The heterocycloalkyl ring-systems include one to four heteroatoms independently selected from oxygen, nitrogen, and sulfur, wherein a nitrogen and sulfur heteroatom optionally can be oxidized and a nitrogen heteroatom optionally can be substituted. Representative heterocycloalkyl groups include, but are not limited to, pyrrolidinyl, pyrazolinyl, pyrazolidinyl, imidazolinyl, imidazolidinyl, piperidinyl, piperazinyl, oxazolidinyl, isoxazolidinyl, morpholinyl, thiazolidinyl, isothiazolidinyl, and tetrahydrofuryl.
[00169] The term “hydroxyl” or “hydroxyl” as used herein is represented by the formula — OH.
[00170] The term “ketone” as used herein is represented by the formula A1C(O)A2, where A1 and A2 can be, independently, an alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group as described herein.
[00171] The term “azide” or “azido” as used herein is represented by the formula —
N3.
[00172] The term “nitro” as used herein is represented by the formula NO2. [00173] The term “nitrile” or “cyano” as used herein is represented by the formula CN.
[00174] The term “silyl” as used herein is represented by the formula — SiA1A2A3, where A1, A2, and A3 can be, independently, hydrogen or an alkyl, cycloalkyl, alkoxy, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group as described herein. [00175] The term “sulfo-oxo” as used herein is represented by the formulas — S(O)A1, — S(O)2A1, — OS(O)2A1, or — OS(O)2OA1, where A1 can be hydrogen or an alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group as described herein. Throughout this specification “S(O)” is a short hand notation for S=O. The term “sulfonyl” is used herein to refer to the sulfo-oxo group represented by the formula — S(O)2A1, where A1 can be hydrogen or an alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group as described herein. The term “sulfone” as used herein is represented by the formula A1S(O)2A2, where A1 and A2 can be, independently, an alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group as described herein. The term “sulfoxide” as used herein is represented by the formula A1S(O)A2, where A1 and A2 can be, independently, an alkyl, cycloalkyl, alkenyl, cycloalkenyl, alkynyl, cycloalkynyl, aryl, or heteroaryl group as described herein.
[00176] The term “thiol” as used herein is represented by the formula — SH.
[00177] “R1,” “R2,” “R3,” “Rn,” where n is an integer, as used herein can, independently, possess one or more of the groups listed above. For example, if R1 is a straight chain alkyl group, one of the hydrogen atoms of the alkyl group can optionally be substituted with a hydroxyl group, an alkoxy group, an alkyl group, a halide, and the like. Depending upon the groups that are selected, a first group can be incorporated within second group or, alternatively, the first group can be pendant (i.e., attached) to the second group. For example, with the phrase “an alkyl group comprising an amino group,” the amino group can be incorporated within the backbone of the alkyl group. Alternatively, the amino group can be attached to the backbone of the alkyl group. The nature of the group(s) that is (are) selected will determine if the first group is embedded or attached to the second group.
[00178] As described herein, compounds of the invention may contain “optionally substituted” moieties. In general, the term “substituted,” whether preceded by the term “optionally” or not, means that one or more hydrogen of the designated moiety are replaced with a suitable substituent. Unless otherwise indicated, an “optionally substituted” group may have a suitable substituent at each substitutable position of the group, and when more than one position in any given structure may be substituted with more than one substituent selected from a specified group, the substituent may be either the same or different at every position. Combinations of substituents envisioned by this invention are preferably those that result in the formation of stable or chemically feasible compounds. In is also contemplated that, in certain aspects, unless expressly indicated to the contrary, individual substituents can be further optionally substituted (i.e., further substituted or unsubstituted).
[00179] The term “stable,” as used herein, refers to compounds that are not substantially altered when subjected to conditions to allow for their production, detection, and, in certain aspects, their recovery, purification, and use for one or more of the purposes disclosed herein.
[00180] Suitable monovalent substituents on a substitutable carbon atom of an
“optionally substituted” group are independently halogen; -(CH2)0-4Rº; -(CH2)0-4ORº; - O(CH2)0-4R°, -O-(CH2)0-4C(O)OR°; -(CH2)0-4CH(ORº)2; -(CH2)0-4SR°; -(CH2)0-4Ph, which may be substituted with R°; (CH2)0-4 O(CH2)0-1Ph which may be substituted with R°; - CH=CHPh, which may be substituted with R°; -(CH2)0-4O(CH2)0-1-pyridyl which may be substituted with R°; -NO2; -CN; -N3; -(CH2)0-4N(Rº)2; -(CH2)0-4N(Rº)C(O)Rº; -
N(R°)C(S)R°; -(CH2)0-4N(R°)C(O)NR°2; -N(Rº)C(S)NR°2; -(CH2)0-4N(Rº)C(O)OR°;
N(R°)N(R°)C(O)R°; -N(R°)N(R°)C(O)NR°2; -N(R°)N(R°)C(O)OR°; -(CH2)0-4C(O)R°; - C(S)R°; -(CH2)0-4C(O)ORº; -(CH2)0-4C(O)SR°; -(CH2)0-4C(O)OSiR°3; -(CH2)0-4OC(O)Rº;
OC(O)(CH2)0-4SR , SC(S)SR°; (CH2)0-4SC(O)R°; (CH2)0-4C(O)NR°2; C(S)NRº 2; -
C(S)SR°; -(CH2)0-4OC(O)NR°2; -C(O)N(OR°)R°; -C(O)C(O)R°; -C(O)CH2C(O)Rº; -
C(NOR°)R°; -(CH2)0-4SSR°; -(CH2)0-4S(O)2R°; -(CH2)0-4S(O)2OR°; -(CH2)0-4OS(O)2R°;
S(O)2NR°2; -(CH2)0-4S(O)R°; -N(Rº)S(O)2NR°2; -N(Rº)S(O)2R°; -N(OR°)R°; - C(NH)NR°2; -P(O)2Rº; -P(O)Rº 2; -OP(O)R°2; -OP(O)(OR°)2; SiR°3; -(C1-4 straight or branched alkylene)O-N(R°)2; or -(C1-4 straight or branched alkylene)C(O)O-N(R°)2, wherein each R° may be substituted as defined below and is independently hydrogen, C1- 6 aliphatic, -CH2Ph, -O(CH2)0-1Ph, -CH2-(5-6 membered heteroaryl ring), or a 5-6- membered saturated, partially unsaturated, or aryl ring having 0—4 heteroatoms independently selected from nitrogen, oxygen, or sulfur, or, notwithstanding the definition above, two independent occurrences of R°, taken together with their intervening atom(s), form a 3-12- membered saturated, partially unsaturated, or aryl mono- or bicyclic ring having 0-4 heteroatoms independently selected from nitrogen, oxygen, or sulfur, which may be substituted as defined below.
[00181] Suitable monovalent substituents on R° (or the ring formed by taking two independent occurrences of R° together with their intervening atoms), are independently halogen, -(CH2)0-2R●, -(haloR●), -(CH2)0-2OH, -(CH2)0-2OR●, -(CH2)0-
2CH(OR●)2; -O(haloR●), -CN, -N3, -(CH2)0-2C(O)R●, -(CH2)0-2C(O)OH, -(CH2)0- 2C(O)OR●, -(CH2)0-2SR●, -(CH2)0-2SH, -(CH2)0-2NH2, -(CH2)0-2NHR●, -(CH2)0-2NR● 2, - NO2, -SiR● 3, -OSiR● 3, -C(O)SR●, -(C1-4 straight or branched alkylene)C(O)OR●, or -SSR● wherein each R● is unsubstituted or where preceded by “halo” is substituted only with one or more halogens, and is independently selected from C1-4 aliphatic, -CH2Ph, -O(CH2)0-1Ph, or a 5-6-membered saturated, partially unsaturated, or aryl ring having 0-4 heteroatoms independently selected from nitrogen, oxygen, or sulfur. Suitable divalent substituents on a saturated carbon atom of R° include =O and =S.
[00182] Suitable divalent substituents on a saturated carbon atom of an “optionally substituted” group include the following: =0, =S, =NNR*2, =NNHC(O)R*, =NNHC(O)OR*, =NNHS(O)2R*, =NR*, =NOR*, -O(C(R*2))2-3O-, or -S(C(R*2))2-3S-, wherein each independent occurrence of R* is selected from hydrogen, C1-6 aliphatic which may be substituted as defined below, or an unsubstituted 5-6-membered saturated, partially unsaturated, or aryl ring having 0-4 heteroatoms independently selected from nitrogen,
oxygen, or sulfur. Suitable divalent substituents that are bound to vicinal substitutable carbons of an “optionally substituted” group include: -O(CR*2)2-3O-, wherein each independent occurrence of R* is selected from hydrogen, C1-6 aliphatic which may be substituted as defined below, or an unsubstituted 5-6-membered saturated, partially unsaturated, or aryl ring having 0 4 heteroatoms independently selected from nitrogen, oxygen, or sulfur.
[00183] Suitable substituents on the aliphatic group of R* include halogen, - R●, -(haloR●), -OH, -OR●, -O(haloR●), -CN, -C(O)OH, -C(O)OR●, -NH2, -NHR●, -NR● 2, or -NO2, wherein each R● is unsubstituted or where preceded by “halo” is substituted only with one or more halogens, and is independently C1-4 aliphatic, -CH2Ph, -O(CH2)0-1Ph, or a 5-6-membered saturated, partially unsaturated, or aryl ring having 0-4 heteroatoms independently selected from nitrogen, oxygen, or sulfur.
[00184] Suitable substituents on a substitutable nitrogen of an “optionally substituted” group include
is
independently hydrogen, C1-6 aliphatic which may be substituted as defined below, unsubstituted -OPh, or an unsubstituted 5-6-membered saturated, partially unsaturated, or aryl ring having 0 4 heteroatoms independently selected from nitrogen, oxygen, or sulfur, or, notwithstanding the definition above, two independent occurrences of taken together with
their intervening atom(s) form an unsubstituted 3-12-membered saturated, partially unsaturated, or aryl mono- or bicyclic ring having 0-4 heteroatoms independently selected from nitrogen, oxygen, or sulfur.
[00185] Suitable substituents on the aliphatic group of
are independently halogen, - R●, -(haloR●), -OH, -OR●, -O(haloR●), -CN, -C(O)OH, -C(O)OR●, -NH2, -NHR●, -NR● 2, or -NO2, wherein each R● is unsubstituted or where preceded by “halo” is substituted only with one or more halogens, and is independently Ci 4 aliphatic, -CH2Ph, -O(CH2)0-1Ph, or a 5-6-membered saturated, partially unsaturated, or aryl ring having 0-4 heteroatoms independently selected from nitrogen, oxygen, or sulfur.
[00186] The term “leaving group” refers to an atom (or a group of atoms) with electron withdrawing ability that can be displaced as a stable species, taking with it the bonding electrons. Examples of suitable leaving groups include halides and sulfonate esters, including, but not limited to, triflate, mesylate, tosylate, and brosylate.
[00187] The terms “hydrolysable group” and “hydrolysable moiety” refer to a functional group capable of undergoing hydrolysis, e.g., under basic or acidic conditions.
Examples of hydrolysable residues include, without limitation, acid halides, activated carboxylic acids, and various protecting groups known in the art (see, for example, “Protective Groups in Organic Synthesis,” T. W. Greene, P. G. M. Wuts, Wiley-Interscience, 1999).
[00188] The term “organic residue” defines a carbon-containing residue, i.e., a residue comprising at least one carbon atom, and includes but is not limited to the carbon-containing groups, residues, or radicals defined hereinabove. Organic residues can contain various heteroatoms, or be bonded to another molecule through a heteroatom, including oxygen, nitrogen, sulfur, phosphorus, or the like. Examples of organic residues include but are not limited alkyl or substituted alkyls, alkoxy or substituted alkoxy, mono or di-substituted amino, amide groups, etc. Organic residues can preferably comprise 1 to 18 carbon atoms, 1 to 15, carbon atoms, 1 to 12 carbon atoms, 1 to 8 carbon atoms, 1 to 6 carbon atoms, or 1 to 4 carbon atoms. In a further aspect, an organic residue can comprise 2 to 18 carbon atoms, 2 to 15, carbon atoms, 2 to 12 carbon atoms, 2 to 8 carbon atoms, 2 to 4 carbon atoms, or 2 to 4 carbon atoms.
[00189] A very close synonym of the term “residue” is the term “radical,” which as used in the specification and concluding claims, refers to a fragment, group, or substructure of a molecule described herein, regardless of how the molecule is prepared. For example, a 2,4-thiazolidinedione radical in a particular compound has the structure:
regardless of whether thiazolidinedione is used to prepare the compound. In some embodiments the radical (for example an alkyl) can be further modified (i.e., substituted alkyl) by having bonded thereto one or more “substituent radicals.” The number of atoms in a given radical is not critical to the present invention unless it is indicated to the contrary elsewhere herein.
[00190] “Organic radicals,” as the term is defined and used herein, contain one or more carbon atoms. An organic radical can have, for example, 1-26 carbon atoms, 1-18 carbon atoms, 1-12 carbon atoms, 1-8 carbon atoms, 1-6 carbon atoms, or 1-4 carbon atoms. In a further aspect, an organic radical can have 2-26 carbon atoms, 2-18 carbon atoms, 2-12 carbon atoms, 2-8 carbon atoms, 2-6 carbon atoms, or 2-4 carbon atoms. Organic radicals often have hydrogen bound to at least some of the carbon atoms of the organic radical. One
example, of an organic radical that comprises no inorganic atoms is a 5, 6, 7, 8-tetrahydro-2- naphthyl radical. In some embodiments, an organic radical can contain 1-10 inorganic heteroatoms bound thereto or therein, including halogens, oxygen, sulfur, nitrogen, phosphoms, and the like. Examples of organic radicals include but are not limited to an alkyl, substituted alkyl, cycloalkyl, substituted cycloalkyl, mono-substituted amino, di- substituted amino, acyloxy, cyano, carboxy, carboalkoxy, alkylcarboxamide, substituted alkylcarboxamide, dialkylcarboxamide, substituted dialkylcarboxamide, alkylsulfonyl, alkylsulfinyl, thioalkyl, thiohaloalkyl, alkoxy, substituted alkoxy, haloalkyl, haloalkoxy, aryl, substituted aryl, heteroaryl, heterocyclic, or substituted heterocyclic radicals, wherein the terms are defined elsewhere herein. A few non-limiting examples of organic radicals that include heteroatoms include alkoxy radicals, trifluoromethoxy radicals, acetoxy radicals, dimethylamino radicals and the like.
[00191] Compounds described herein can contain one or more double bonds and, thus, potentially give rise to cis/trans (E/Z) isomers, as well as other conformational isomers. Unless stated to the contrary, the invention includes all such possible isomers, as well as mixtures of such isomers.
[00192] Unless stated to the contrary, a formula with chemical bonds shown only as solid lines and not as wedges or dashed lines contemplates each possible isomer, e.g., each enantiomer and diastereomer, and a mixture of isomers, such as a racemic or scalemic mixture. Compounds described herein can contain one or more asymmetric centers and, thus, potentially give rise to diastereomers and optical isomers. Unless stated to the contrary, the present invention includes all such possible diastereomers as well as their racemic mixtures, their substantially pure resolved enantiomers, all possible geometric isomers, and pharmaceutically acceptable salts thereof. Mixtures of stereoisomers, as well as isolated specific stereoisomers, are also included. During the course of the synthetic procedures used to prepare such compounds, or in using racemization or epimerization procedures known to those skilled in the art, the products of such procedures can be a mixture of stereoisomers.
[00193] Many organic compounds exist in optically active forms having the ability to rotate the plane of plane-polarized light. In describing an optically active compound, the prefixes D and L or R and S are used to denote the absolute configuration of the molecule about its chiral center(s). The prefixes d and 1 or (+) and (-) are employed to designate the sign of rotation of plane-polarized light by the compound, with (-) or meaning that the compound is levorotatory. A compound prefixed with (+) or d is dextrorotatory. For a given chemical structure, these compounds, called stereoisomers, are identical except that they are
non-superimposable mirror images of one another. A specific stereoisomer can also be referred to as an enantiomer, and a mixture of such isomers is often called an enantiomeric mixture. A 50:50 mixture of enantiomers is referred to as a racemic mixture. Many of the compounds described herein can have one or more chiral centers and therefore can exist in different enantiomeric forms. If desired, a chiral carbon can be designated with an asterisk (*). When bonds to the chiral carbon are depicted as straight lines in the disclosed formulas, it is understood that both the (R) and (S) configurations of the chiral carbon, and hence both enantiomers and mixtures thereof, are embraced within the formula. As is used in the art, when it is desired to specify the absolute configuration about a chiral carbon, one of the bonds to the chiral carbon can be depicted as a wedge (bonds to atoms above the plane) and the other can be depicted as a series or wedge of short parallel lines is (bonds to atoms below the plane). The Cahn-Ingold-Prelog system can be used to assign the (R) or (S) configuration to a chiral carbon.
[00194] When the disclosed compounds contain one chiral center, the compounds exist in two enantiomeric forms. Unless specifically stated to the contrary, a disclosed compound includes both enantiomers and mixtures of enantiomers, such as the specific 50:50 mixture referred to as a racemic mixture. The enantiomers can be resolved by methods known to those skilled in the art, such as formation of diastereoisomeric salts which may be separated, for example, by crystallization (see, CRC Handbook of Optical Resolutions via Diastereomeric Salt Formation by David Kozma (CRC Press, 2001)); formation of diastereoisomeric derivatives or complexes which may be separated, for example, by crystallization, gas-liquid or liquid chromatography; selective reaction of one enantiomer with an enantiomer-specific reagent, for example enzymatic esterification; or gas-liquid or liquid chromatography in a chiral environment, for example on a chiral support for example silica with a bound chiral ligand or in the presence of a chiral solvent. It will be appreciated that where the desired enantiomer is converted into another chemical entity by one of the separation procedures described above, a further step can liberate the desired enantiomeric form. Alternatively, specific enantiomers can be synthesized by asymmetric synthesis using optically active reagents, substrates, catalysts or solvents, or by converting one enantiomer into the other by asymmetric transformation.
[00195] Designation of a specific absolute configuration at a chiral carbon in a disclosed compound is understood to mean that the designated enantiomeric form of the compounds can be provided in enantiomeric excess (e.e.). Enantiomeric excess, as used herein, is the presence of a particular enantiomer at greater than 50%, for example, greater
than 60%, greater than 70%, greater than 75%, greater than 80%, greater than 85%, greater than 90%, greater than 95%, greater than 98%, or greater than 99%. In one aspect, the designated enantiomer is substantially free from the other enantiomer. For example, the “R: forms of the compounds can be substantially free from the “S” forms of the compounds and are, thus, in enantiomeric excess of the “S” forms. Conversely, “S” forms of the compounds can be substantially free of “R” forms of the compounds and are, thus, in enantiomeric excess of the “R” forms.
[00196] When a disclosed compound has two or more chiral carbons, it can have more than two optical isomers and can exist in diastereoisomeric forms. For example, when there are two chiral carbons, the compound can have up to four optical isomers and two pairs of enantiomers ((S,S)/(R,R) and (R,S)/(S,R)). The pairs of enantiomers (e.g., (S,S)/(R,R)) are mirror image stereoisomers of one another. The stereoisomers that are not mirror-images (e.g., (S,S) and (R,S)) are diastereomers. The diastereoisomeric pairs can be separated by methods known to those skilled in the art, for example chromatography or crystallization and the individual enantiomers within each pair may be separated as described above. Unless otherwise specifically excluded, a disclosed compound includes each diastereoisomer of such compounds and mixtures thereof.
[00197] The compounds according to this disclosure may form prodrugs at hydroxyl or amino functionalities using alkoxy, amino acids, etc., groups as the prodrug forming moieties. For instance, the hydroxymethyl position may form mono-, di-, or triphosphates and again these phosphates can form prodrugs. Preparations of such prodrug derivatives are discussed in various literature sources (examples are: Alexander et al., J. Med. Chem. 1988, 31, 318; Aligas-Martin et al., PCT WO 2000/041531, p. 30). The nitrogen function converted in preparing these derivatives is one (or more) of the nitrogen atoms of a compound of the disclosure.
[00198] “Derivatives” of the compounds disclosed herein are pharmaceutically acceptable salts, prodrugs, deuterated forms, radio-actively labeled forms, isomers, solvates and combinations thereof. The “combinations” mentioned in this context refer to derivatives falling within at least two of the groups: pharmaceutically acceptable salts, prodrugs, deuterated forms, radio-actively labeled forms, isomers, and solvates. Examples of radio- actively labeled forms include compounds labeled with tritium, phosphorous-32, iodine- 129, carbon-11, fluorine- 18, and the like.
[00199] Compounds described herein comprise atoms in both their natural isotopic abundance and in non-natural abundance. The disclosed compounds can be isotopically-
labeled or isotopically-substituted compounds identical to those described, but for the fact that one or more atoms are replaced by an atom having an atomic mass or mass number different from the atomic mass or mass number typically found in nature. Examples of isotopes that can be incorporated into compounds of the invention include isotopes of hydrogen, carbon, nitrogen, oxygen, phosphorous, fluorine and chlorine, such as 2 H, 3 H, 13 C, 14 C, 15 N, 18 O, 17 O, 35 S, 18 F and 36 Cl, respectively. Compounds further comprise prodrugs thereof, and pharmaceutically acceptable salts of said compounds or of said prodrugs which contain the aforementioned isotopes and/or other isotopes of other atoms are within the scope of this invention. Certain isotopically-labeled compounds of the present invention, for example those into which radioactive isotopes such as 3 H and 14 C are incorporated, are useful in drug and/or substrate tissue distribution assays. Tritiated, i.e., 3 H, and carbon-14, i.e., 14 C, isotopes are particularly preferred for their ease of preparation and detectability. Further, substitution with heavier isotopes such as deuterium, i.e., 2 H, can afford certain therapeutic advantages resulting from greater metabolic stability, for example increased in vivo half-life or reduced dosage requirements and, hence, may be preferred in some circumstances. Isotopically labeled compounds of the present invention and prodrugs thereof can generally be prepared by carrying out the procedures below, by substituting a readily available isotopically labeled reagent for a non- isotopically labeled reagent.
[00200] The compounds described in the invention can be present as a solvate. In some cases, the solvent used to prepare the solvate is an aqueous solution, and the solvate is then often referred to as a hydrate. The compounds can be present as a hydrate, which can be obtained, for example, by crystallization from a solvent or from aqueous solution. In this connection, one, two, three or any arbitrary number of solvent or water molecules can combine with the compounds according to the invention to form solvates and hydrates. Unless stated to the contrary, the invention includes all such possible solvates.
[00201] The term “co-crystal” means a physical association of two or more molecules which owe their stability through non-covalent interaction. One or more components of this molecular complex provide a stable framework in the crystalline lattice. In certain instances, the guest molecules are incorporated in the crystalline lattice as anhydrates or solvates, see e.g. “Crystal Engineering of the Composition of Pharmaceutical Phases. Do Pharmaceutical Co-crystals Represent a New Path to Improved Medicines?” Almarasson, O., et. al., The Royal Society of Chemistry, 1889-1896, 2004. Examples of co-crystals include p- toluenesulfonic acid and benzenesulfonic acid.
[00202] It is also appreciated that certain compounds described herein can be present as an equilibrium of tautomers. For example, ketones with an a-hydrogen can exist in an equilibrium of the keto form and the enol form.
[00203] Likewise, amides with an N-hydrogen can exist in an equilibrium of the amide form and the imidic acid form. As another example, pyrazoles can exist in two tautomeric forms, N1 -unsubstituted, 3-A3 and N1 -unsubstituted, 5-A3 as shown below.
Unless stated to the contrary, the invention includes all such possible tautomers.
[00204] It is known that chemical substances form solids, which are present in different states of order which are termed polymorphic forms or modifications. The different modifications of a polymorphic substance can differ greatly in their physical properties. The compounds according to the invention can be present in different polymorphic forms, with it being possible for particular modifications to be metastable. Unless stated to the contrary, the invention includes all such possible polymorphic forms.
[00205] In some aspects, a structure of a compound can be represented by a formula:
which is understood to be equivalent to a formula:
wherein n is typically an integer. That is, R" is understood to represent five independent substituents, Rn(a), Rn(b), Rn(c), Rn(d), Rn(e). By “independent substituents,” it is meant that each R substituent can be independently defined. For example, if in one instance Rn(a) is halogen, then Rn(b) is not necessarily halogen in that instance.
[00206] Certain materials, compounds, compositions, and components disclosed herein can be obtained commercially or readily synthesized using techniques generally known to those of skill in the art. For example, the starting materials and reagents used in preparing the disclosed compounds and compositions are either available from commercial suppliers such as Aldrich Chemical Co., (Milwaukee, Wis.), Acros Organics (Morris Plains, N.J.), Strem Chemicals (Newburyport, MA), Fisher Scientific (Pittsburgh, Pa.), or Sigma (St. Louis, Mo.) or are prepared by methods known to those skilled in the art following procedures set forth in references such as Fieser and Fieser’s Reagents for Organic Synthesis, Volumes 1-17 (John Wiley and Sons, 1991); Rodd’s Chemistry of Carbon Compounds, Volumes 1-5 and supplemental volumes (Elsevier Science Publishers, 1989); Organic Reactions, Volumes 1-40 (John Wiley and Sons, 1991); March’s Advanced Organic Chemistry, (John Wiley and Sons, 4th Edition); and Larock’s Comprehensive Organic Transformations (VCH Publishers Inc., 1989).
[00207] Unless otherwise expressly stated, it is in no way intended that any method set forth herein be construed as requiring that its steps be performed in a specific order. Accordingly, where a method claim does not actually recite an order to be followed by its steps or it is not otherwise specifically stated in the claims or descriptions that the steps are to be limited to a specific order, it is no way intended that an order be inferred, in any respect. This holds for any possible non-express basis for interpretation, including: matters of logic with respect to arrangement of steps or operational flow; plain meaning derived from grammatical organization or punctuation; and the number or type of embodiments described in the specification.
[00208] Disclosed are the components to be used to prepare the compositions of the invention as well as the compositions themselves to be used within the methods disclosed herein. These and other materials are disclosed herein, and it is understood that when combinations, subsets, interactions, groups, etc. of these materials are disclosed that while specific reference of each various individual and collective combinations and permutation of these compounds cannot be explicitly disclosed, each is specifically contemplated and described herein. For example, if a particular compound is disclosed and discussed and a number of modifications that can be made to a number of molecules including the compounds are discussed, specifically contemplated is each and every combination and permutation of the compound and the modifications that are possible unless specifically indicated to the contrary. Thus, if a class of molecules A, B, and C are disclosed as well as a class of molecules D, E, and F and an example of a combination molecule, A-D is disclosed,
then even if each is not individually recited each is individually and collectively contemplated meaning combinations, A-E, A-F, B-D, B-E, B-F, C-D, C-E, and C-F are considered disclosed. Likewise, any subset or combination of these is also disclosed. Thus, for example, the sub-group of A-E, B-F, and C-E would be considered disclosed. This concept applies to all aspects of this application including, but not limited to, steps in methods of making and using the compositions of the invention. Thus, if there are a variety of additional steps that can be performed it is understood that each of these additional steps can be performed with any specific embodiment or combination of embodiments of the methods of the invention.
[00209] It is understood that the compounds and compositions disclosed herein have certain functions. Disclosed herein are certain structural requirements for performing the disclosed functions, and it is understood that there are a variety of structures that can perform the same function that are related to the disclosed structures, and that these structures will typically achieve the same result.
B. COMPOUNDS
[00210] In one aspect, disclosed are compounds having a structure represented by a formula:
wherein o is 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein p is 1 or 2; wherein t is an integer from 0 to 500; wherein v is 1, 2, 3, 4, or 5; wherein A is S or Se; wherein R1 is selected from -CO2H, - C(O)NHOH, -SO2NH2, -SO2NHC(O)CH3, -SO3H, -NHC(O)NHSO2CH3, -P(O)(OH)2, and a structure selected from:
wherein R4 is selected from hydrogen and methyl; wherein each occurrence of R5 and R5 , when present, is independently a residue of a side chain of amino acid; wherein each occurrence of R6 and R6’, when present, is independently selected from hydrogen and methyl, or wherein R6 or R6’ is covalently bonded to R5 or R5’, respectively, and, together with the intermediate atoms, comprise an unsubstituted 5-membered heterocycle; wherein each of R7a and R7b, when present, is independently selected from hydrogen and C1-C4 alkyl; and wherein R8 is selected from hydrogen and methyl, provided that the compound is not PapA. [00211] Also disclosed are compounds having a structure represented by a formula:
wherein o is 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein p is 1 or 2; wherein t is an integer from 0 to 500; wherein v is 1, 2, 3, 4, or 5; wherein A is S or Se; wherein Q1 is a leader sequence; wherein Q2 is a cleavable moiety; wherein R1 is selected from -CO2H, -C(O)NHOH, - SO2NH2, -SO2NHC(O)CH3, -SO3H, -NHC(O)NHSO2CH3, -P(O)(OH)2, and a structure selected from:
wherein R4 is selected from hydrogen and methyl; wherein each occurrence of R5 and R5 , when present, is independently a residue of a side chain of amino acid; wherein each occurrence of R6 and R6’, when present, is independently selected from hydrogen and methyl, or wherein R6 or R6’ is covalently bonded to R5 or R5’, respectively, and, together with the intermediate atoms, comprise an unsubstituted 5-membered heterocycle; wherein each of R7a and R7b, when present, is independently selected from hydrogen and C1-C4 alkyl; and wherein R8 is selected from hydrogen and methyl, provided that the compound is not PapA. [00212] Also disclosed are compounds having a structure represented by a formula:
wherein m is 0, 1, 2, 3, or 4; wherein n is 0 or 1 ; wherein each of o and o’ is independently 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein p is 1 or 2; wherein A is S or Se; wherein L, when present, is selected from C2-C4 alkyl, -(C1-C4 alkyl)(OCH2CH2)q, and a structure selected from:
wherein q is 1, 2, 3, or 4; wherein R1 is selected from -CO2H, -C(O)NHOH, -SO2NH2, - SO2NHC(O)CH3, -SO3H, -NHC(O)NHSO2CH3, -P(O)(OH)2, and a structure selected from:
wherein R2 is a residue of a side chain of amino acid, provided that the amino acid is not isoleucine or threonine; wherein each of R3aand R3b, when present, is independently selected from C2-C5 alkynyl, C1-C5 azido, and a residue of a side chain of an amino acid; wherein R4 is selected from hydrogen and methyl; wherein each occurrence of R5 and R5 , when present, is independently a residue of a side chain of amino acid; wherein each occurrence of R6 and R6’, when present, is independently selected from hydrogen and methyl, or wherein R6 or R6 is covalently bonded to R5 or R5 , respectively, and, together with the intermediate atoms, comprise an unsubstituted 5-membered heterocycle; wherein each of R7a and R7b, when present, is independently selected from hydrogen and C1-C4 alkyl, provided that the compound is not PapA.
[00213] Also disclosed are compounds having a structure represented by a formula:
wherein m is 0, 1, 2, 3, or 4; wherein n is 0 or 1 ; wherein each of o and o’ is independently 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein p is 1 or 2; wherein A is S or Se; wherein L, when present, is selected from C2-C4 alkyl, -(C1-C4 alkyl)(OCH2CH2)q, and a structure selected from:
wherein q is 1, 2, 3, or 4; wherein Q1 is a leader sequence; wherein Q2 is a cleavable moiety; wherein R1 is selected from -CO2H, -C(O)NHOH, -SO2NH2, -SO2NHC(O)CH3, -SO3H, - NHC(O)NHSO2CH3, -P(O)(OH)2, and a structure selected from:
wherein R2 is a residue of a side chain of amino acid, provided that the amino acid is not isoleucine or threonine; wherein each of R3aand R3b, when present, is independently selected from C2-C5 alkynyl, C1-C5 azido, and a residue of a side chain of an amino acid; wherein R4 is selected from hydrogen and methyl; wherein each occurrence of R5 and R5 , when present, is independently a residue of a side chain of amino acid; wherein each occurrence of R6 and R6 , when present, is independently selected from hydrogen and methyl, or wherein R6 or R6 is covalently bonded to R5 or R5’, respectively, and, together with the intermediate atoms, comprise an unsubstituted 5-membered heterocycle; wherein each of R7a and R7b, when present, is independently selected from hydrogen and C1-C4 alkyl, provided that the compound is not PapA.
[00214] In various aspects, o is independently 0, 1, 2, 3, 4, 5, 6, or 7.
[00215] In various aspects, t is 0.
[00216] In various aspects, v is 1 or 2.
[00218] In various aspects, R1 is -CO2H.
[00219] In various aspects, the cleavable moiety is -CO2-(C4-C8 alkylene)-OC(O)-. In a further aspect, the cleavable moiety is -CO2CH2CH=CHCH2OC(O)-.
[00220] In various aspects, the cleavable moiety is a protease recognition sequence. In a further aspect, the protease recognition sequence is TEV recognition sequence.
[00221] In various aspects, the compound comprises one or more D-amino acid residues. In a further aspect, the compound comprises one or more β-amino acid residues. In a still further aspect, the compound comprises one or more N-methylated amino acid residues.
[00222] In various aspects, PapB installs a single thioether linkage in the compound.
In a further aspect, PapB installs two or more thioether linkages in the compound.
[00225] In various aspects, m is 0. In a further aspect, m is 1.
[00226] In various aspects, n is 0. In a further aspect, n is 1.
[00227] In various aspects, o is 0, 1, 2, 3, 4, 5, 6, or 7. In a further aspect, o is 1, 2, 3,
4, 5, 6, 7, 8, or 9. In a still further aspect, o is 1, 2, 3, or 4.
[00228] In various aspects, p is 1. In a further aspect, p is 2.
[00229] In various aspects, A is S. In a further aspect, A is Se.
[00230] In various aspects, L is C2-C4 alkyl. In a further aspect, L is -(C1-C4 alkyl)(OCH2CH2)q. In a still further aspect, L is a structure selected from:
[00231] In various aspects, the cleavable moiety is a protease recognition sequence. In a further aspect, the protease recognition sequence is a TEV protease recognition sequence.
In a still further aspect, the TEV protease recognition sequence is EXLYZQ (SEQ ID NO: 1), in which X is any amino acid and Z is any amino acid that contains a hydrophobic residue. In yet a further aspect, the TEV protease recognition sequence is ENLYFQ (SEQ ID NO: 1). [00232] In various aspects, the leader sequence is LKQINVIAGVKEPIRAYG (SEQ ID NO: 2) or LKQINVIAGVKPIRAYG (SEQ ID NO: 3). In a further aspect, the leader sequence is LKQINVIAGVKEPIRAYG (SEQ ID NO: 2).
[00234] In various aspects, R1 is CO2H.
[00235] In various aspects, R2 is a residue of a side chain of an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan. In a further aspect, R2 is a residue of a side chain of an amino acid selected from alanine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, and glycine.
[00236] In various aspects, one of R3a and R3b, when present, is hydrogen, and one of R3aand R3b, when present, is a residue of a side chain of an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
[00237] In various aspects, R4 is hydrogen. In a further aspect, R4 is methyl.
[00238] In various aspects, each occurrence of R5, when present, is independently a residue of a side chain of an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
[00239] In various aspects, each occurrence of R6, when present, is hydrogen. In a further aspect, each occurrence of R6, when present, is methyl.
[00240] In various aspects, each of R7a and R7b, when present, is hydrogen. In a further aspect, each of R7a and R7b, when present, is methyl.
In a further aspect, o is 1, 2, 3, 4, 5, 6, 7, 8, or 9.
[00249] In various aspects, one of R3a and R3b, when present, is hydrogen, and one of R3a and R3b, when present, is a residue of a side chain of an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
[00251] In various aspects, the compound has a structure represented by a formula:
[00252] In various aspects, the compound has a structure represented by a formula:
In a further aspect, o is 1, 2, 3, 4, 5, 6, 7, 8, or 9.
[00254] In various aspects, the compound has a structure represented by a formula:
wherein r is 2, 3, or 4.
[00255] In various aspects, the compound has a structure represented by a formula:
wherein s is 1 or 2.
[00256] In various aspects, the compound has a structure represented by a formula:
c. THIOETHER COMPOUNDS
[00257] In one aspect, disclosed are thioether compounds produced by a disclosed method. Thus, in various aspects, the method produces a thioether compound having a structure represented by a formula:
wherein v’ is 0, 1, 2, or 3.
[00258] In various aspects, the method further comprises addition of a reducing agent. In a further aspect, the method further comprises addition of a protease.
[00259] In various aspects, the method produces a thioether compound having a structure represented by a formula:
wherein v’ is 0, 1, 2, or 3.
[00260] In various aspects, the thioether compound is selected from:
[00261] In various aspects, the method produces a thioether compound having a structure represented by a formula:
[00265] In various aspects, the thioether compound has a structure represented by a formula:
[00266] In various aspects, the thioether compound has a structure represented by a formula:
[00271] In various aspects, the thioether compound has a structure represented by a formula selected from:
[00272] In various aspects, the thioether compound has a structure represented by a formula selected from:
[00273] In various aspects, the thioether compound has a structure represented by a formula selected from:
[00274] In various aspects, the thioether compound has a structure represented by a formula selected from:
[00275] In various aspects, the thioether compound is a sactipeptide. In a further aspect, the sactipeptide has a structure represented by a formula selected from:
[00276] In various aspects, the thioether compound is a ranthipeptide. In a further aspect, the ranthipeptide has a structure represented by a formula selected from:
[00277] In various aspects, the method produces a thioether compound having a structure represented by a formula:
[00287] In various aspects, the thioether compound has a structure represented by a formula selected from:
[00288] In various aspects, the thioether compound has a structure represented by a formula selected from:
[00289] In various aspects, the thioether compound has a structure represented by a formula selected from:
[00290] In various aspects, the thioether compound has a structure represented by a formula selected from:
[00291] In various aspects, the thioether compound is a sactipeptide. In a further aspect, the sactipeptide has a structure represented by a formula selected from:
[00292] In various aspects, the thioether compound is a ranthipeptide. In a further aspect, the ranthipeptide has a structure represented by a formula selected from:
D. ANALOGS OF PEPTIDE THERAPEUTICS
[00295] In one aspect, disclosed are thioether compounds prepared by a disclosed method, wherein the thioether compound is an analog of a peptide therapeutic. Exemplary peptide therapeutics include, but are not limited to, octreotide, setmalanotide, romidepsin, bremelanotide, pramlintide, oxytocin, setmelanotide, or cyclosporin.
[00297] Also disclosed are compounds selected from:
[00298] In one aspect, disclosed are methods of chemically modifying a compound to install a thioether linkage, the method comprising reacting the compound with PapB, wherein the compound has a structure represented by a formula:
wherein o is 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein p is 1 or 2; wherein t is an integer from 0 to 500; wherein v is 1, 2, 3, 4, or 5; wherein A is S or Se; wherein R1 is selected from -CO2H, - C(O)NHOH, -SO2NH2, -SO2NHC(O)CH3, -SO3H, -NHC(O)NHSO2CH3, -P(O)(OH)2, and a structure selected from:
wherein R4 is selected from hydrogen and methyl; wherein each occurrence of R5 and R5’, when present, is independently a residue of a side chain of amino acid; wherein each occurrence of R6 and R6’, when present, is independently selected from hydrogen and methyl, or wherein R6 or R6’ is covalently bonded to R5 or R5’, respectively, and, together with the intermediate atoms, comprise an unsubstituted 5-membered heterocycle; wherein each of R7a and R7b, when present, is independently selected from hydrogen and C1-C4 alkyl; and wherein R8 is selected from hydrogen and methyl, provided that the compound is not PapA.
[00299] Also disclosed are methods of chemically modifying a compound to install a thioether linkage, the method comprising reacting the compound with PapB, wherein the compound has a structure represented by a formula:
wherein o is 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein p is 1 or 2; wherein t is an integer from 0 to 500; wherein v is 1, 2, 3, 4, or 5; wherein A is S or Se; wherein Q1 is a leader sequence; wherein Q2 is a cleavable moiety; wherein R1 is selected from -CO2H, -C(O)NHOH, - SO2NH2, -SO2NHC(O)CH3, -SO3H, -NHC(O)NHSO2CH3, -P(O)(OH)2, and a structure selected from:
wherein R4 is selected from hydrogen and methyl; wherein each occurrence of R5 and R5 , when present, is independently a residue of a side chain of amino acid; wherein each occurrence of R6 and R6’, when present, is independently selected from hydrogen and methyl, or wherein R6 or R6’ is covalently bonded to R5 or R5’, respectively, and, together with the intermediate atoms, comprise an unsubstituted 5-membered heterocycle; wherein each of R7a and R7b, when present, is independently selected from hydrogen and C1-C4 alkyl; and wherein R8 is selected from hydrogen and methyl, provided that the compound is not PapA.
[00300] Also disclosed are methods of chemically modifying a compound to install a thioether linkage, the method comprising reacting the compound with PapB, wherein the compound has a structure represented by a formula:
wherein m is 0, 1, 2, 3, or 4; wherein n is 0 or 1 ; wherein each of o and o’ is independently 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein p is 1 or 2; wherein A is S or Sc; wherein L, when present, is selected from C2-C4 alkyl, -(C1-C4 alkyl)(OCH2CH2)q, and a structure selected from:
wherein q is 1, 2, 3, or 4; wherein R1 is selected from -CO2H, -C(O)NHOH, -SO2NH2, - SO2NHC(O)CH3, -SO3H, -NHC(O)NHSO2CH3, -P(O)(OH)2, and a structure selected from:
wherein R2 is a residue of a side chain of amino acid, provided that the amino acid is not isoleucine or threonine; wherein each of R3aand R3b, when present, is independently selected from C2-C5 alkynyl, C1-C5 azido, and a residue of a side chain of an amino acid; wherein R4
is selected from hydrogen and methyl; wherein each occurrence of R5 and R5 , when present, is independently a residue of a side chain of amino acid; wherein each occurrence of R6 and R6 , when present, is independently selected from hydrogen and methyl, or wherein R6 or R6 is covalently bonded to R5 or R5 , respectively, and, together with the intermediate atoms, comprise an unsubstituted 5-membered heterocycle; wherein each of R7a and R7b, when present, is independently selected from hydrogen and C1-C4 alkyl, provided that the compound is not PapA.
[00301] Also disclosed are methods of chemically modifying a compound to install a thioether linkage, the method comprising reacting the compound with PapB, wherein the compound has a structure represented by a formula:
wherein m is 0, 1, 2, 3, or 4; wherein n is 0 or 1 ; wherein each of o and o’ is independently 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein p is 1 or 2; wherein A is S or Se; wherein L, when present, is selected from C2-C4 alkyl, -(C1-C4 alkyl)(OCH2CH2)q, and a structure selected from:
wherein q is 1, 2, 3, or 4; wherein Q1 is a leader sequence; wherein Q2 is a cleavable moiety; wherein R1 is selected from -CO2H, -C(O)NHOH, -SO2NH2, -SO2NHC(O)CH3, -SO3H, - NHC(O)NHSO2CH3, -P(O)(OH)2, and a structure selected from:
wherein R2 is a residue of a side chain of amino acid, provided that the amino acid is not isoleucine or threonine; wherein each of R3aand R3b, when present, is independently selected from C2-C5 alkynyl, C1-C5 azido, and a residue of a side chain of an amino acid; wherein R4 is selected from hydrogen and methyl; wherein each occurrence of R5 and R5 , when present, is independently a residue of a side chain of amino acid; wherein each occurrence of R6 and R6 , when present, is independently selected from hydrogen and methyl, or wherein R6 or R6 is covalently bonded to R5 or R5 , respectively, and, together with the intermediate atoms, comprise an unsubstituted 5-membered heterocycle; wherein each of R7a and R7b, when present, is independently selected from hydrogen and C1-C4 alkyl, provided that the compound is not PapA.
[00302] In various aspects, o is independently 0, 1, 2, 3, 4, 5, 6, or 7.
[00303] In various aspects, t is 0.
[00304] In various aspects, v is 1 or 2.
[00306] In various aspects, R1 is -CO2H.
[00307] In various aspects, the cleavable moiety is a chemically cleavable moiety.
Exemplary chemical cleavable moieties include, but are not limited to, -CO2-(C4-C8 alkylene)-OC(O)-. In a further aspect, the chemically cleavable moiety is - CO2CH2CH=CHCH2OC(O)-.
[00308] In various aspects, the cleavable moiety is an enzymatically cleavable moiety such as, for example, a protease recognition sequence. In a further aspect, the protease recognition sequence is TEV recognition sequence.
[00309] In various aspects, the compound comprises one or more D-amino acid residues. In a further aspect, the compound comprises one or more β-amino acid residues. In
a still further aspect, the compound comprises one or more N-methylated amino acid residues.
[00310] In various aspects, PapB installs a single thioether linkage in the compound.
In a further aspect, PapB installs two or more thioether linkages in the compound.
[00313] In various aspects, the method produces a thioether compound having a structure represented by a formula:
wherein v’ is 0, 1, 2, or 3.
[00314] In various aspects, the method further comprises addition of a reducing agent. In a further aspect, the method further comprises addition of a protease.
[00315] In various aspects, the method produces a thioether compound having a structure represented by a formula:
wherein v’ is 0, 1, 2, or 3.
[00316] In various aspects, the thioether compound is selected from:
[00317] In various aspects, m is 0. In a further aspect, m is 1.
[00318] In various aspects, n is 0. In a further aspect, n is 1.
[00319] In various aspects, o is 0, 1, 2, 3, 4, 5, 6, or 7. In a further aspect, o is 1, 2, 3,
4, 5, 6, 7, 8, or 9. In a still further aspect, o is 1, 2, 3, or 4.
[00320] In various aspects, p is 1. In a further aspect, p is 2.
[00321] In various aspects, A is S. In a further aspect, A is Se.
[00322] In various aspects, L is C2-C4 alkyl. In a further aspect, L is -(C1-C4 alkyl)(OCH2CH2)q. In a still further aspect, L is a structure selected from:
[00323] In various aspects, the cleavable moiety is a protease recognition sequence. In a further aspect, the protease recognition sequence is a TEV protease recognition sequence.
In a still further aspect, the TEV protease recognition sequence is EXLYZQ (SEQ ID NO: 1), in which X is any amino acid and Z is any amino acid that contains a hydrophobic residue. In yet a further aspect, the TEV protease recognition sequence is ENLYFQ (SEQ ID NO: 1). [00324] In various aspects, the leader sequence is LKQINVIAGVKEPIRAYG (SEQ ID NO: 2) or LKQINVIAGVKPIRAYG (SEQ ID NO: 3). In a further aspect, the leader sequence is LKQINVIAGVKEPIRAYG (SEQ ID NO: 2).
[00326] In various aspects, R1 is -CO2H.
[00327] In various aspects, R2 is a residue of a side chain of an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan. In a further aspect, R2 is a residue of a side chain of an amino acid selected from alanine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, and glycine.
[00328] In various aspects, one of R3a and R3b, when present, is hydrogen, and one of R3aand R3b, when present, is a residue of a side chain of an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
[00329] In various aspects, R4 is hydrogen. In a further aspect, R4 is methyl.
[00330] In various aspects, each occurrence of R5, when present, is independently a residue of a side chain of an amino acid selected from alanine, valine, leucine, serine,
cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
[00331] In various aspects, each occurrence of R6, when present, is hydrogen. In a further aspect, each occurrence of R6, when present, is methyl.
[00332] In various aspects, each of R7a and R7b, when present, is hydrogen. In a further aspect, each of R7a and R7b, when present, is methyl.
In a further aspect, o is 1, 2, 3, 4, 5, 6, 7, 8, or 9.
[00341] In various aspects, one of R3a and R3b, when present, is hydrogen, and one of R3aand R3b, when present, is a residue of a side chain of an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
[00342] In various aspects, the compound has a structure represented by a formula:
[00343] In various aspects, the compound has a structure represented by a formula:
In a further aspect, o is 1, 2, 3, 4, 5, 6, 7, 8, or 9.
[00346] In various aspects, the compound has a structure represented by a formula:
wherein r is 2, 3, or 4.
[00347] In various aspects, the compound has a structure represented by a formula:
wherein s is 1 or 2.
[00348] In various aspects, the compound has a structure represented by a formula:
[00349] In various aspects, PapB installs a single thioether linkage in the compound. In a further aspect, PapB installs two or more thioether linkages in the compound.
[00350] In various aspects, the method produces a thioether compound having a structure represented by a formula:
[00351] In various aspects, the thioether compound has a structure represented by a formula:
[00352] In various aspects, the thioether compound has a structure represented by a formula:
[00360] In various aspects, the thioether compound has a structure represented by a formula selected from:
[00361] In various aspects, the thioether compound has a structure represented by a formula selected from:
[00362] In various aspects, the thioether compound has a structure represented by a formula selected from:
[00363] In various aspects, the thioether compound has a structure represented by a formula selected from:
[00364] In various aspects, the thioether compound is a sactipeptide. In a further aspect, the sactipeptide has a structure represented by a formula selected from:
[00365] In various aspects, the thioether compound is a ranthipeptide. In a further aspect, the ranthipeptide has a structure represented by a formula selected from:
[00366] In various aspects, the method further comprises addition of a reducing agent. In a further aspect, the reducing agent comprises dithionite, flavodoxin, flavodoxin reductase, titanium citrate, reduced nicotinamide adenine dinucleotide phosphate, or any combination thereof.
[00367] In various aspects, the method further comprises addition of a protease. In a further aspect, the protease is TEV protease.
[00368] In various aspects, the method produces a thioether compound having a structure represented by a formula:
[00375] In various aspects, the thioether compound has a structure represented by a formula:
[00378] In various aspects, the thioether compound has a structure represented by a formula selected from:
[00379] In various aspects, the thioether compound has a structure represented by a formula selected from:
[00380] In various aspects, the thioether compound has a structure represented by a formula selected from:
[00381] In various aspects, the thioether compound has a structure represented by a formula selected from:
[00382] In various aspects, the thioether compound is a sactipeptide. In a further aspect, the sactipeptide has a structure represented by a formula selected from:
[00383] In various aspects, the thioether compound is a ranthipeptide. In a further aspect, the ranthipeptide has a structure represented by a formula selected from:
F. PEPTIDES
1. PEPTIDE SUBSTRATES
[00386] In one aspect, the invention relates to chemically modifying a peptide sequence, wherein the peptide sequence comprises X-Yn-Z; wherein X is a penicillamine or an amino acid residue comprising a -SH group or an amino acid residue comprising a -SeH group; wherein Y is a series of amino acid residues where n = 0, 1 , 2, 3, 4, 5, 6, 7, 8, or 9; wherein Z is an aspartic acid residue, a glutamic acid residue, a hydroxy-glutamic acid residue, 2-amino-3-(2H-tetrazol-5-yl)propanoic acid, or a carboxyl-functionalized amino acid residue; and wherein the peptide sequence is not PapA.
[00387] In a further aspect, the peptide sequence comprises the sequence C-Ya-C-D- Yb-D; wherein C is a cysteine residue, D is an aspartic acid residue, Y is a series of amino acid residues, a = 1, 2, 3, 4, 5, 6, or 7, and b= 0, 1, 2, 3, 4, 5, 6, or 7.
[00388] In a further aspect, the peptide sequence comprises the sequence C-Yx-D-Yy- C-Yz-D; wherein C is a cysteine residue, D is an aspartic acid residue, Y is a series of amino acid residues, x = 0, 1, 2, 3, 4, 5, 6, or 7, y = 1, 2, 3, 4, 5, 6, 7, or 8, and z = 0, 1, 2, 3, 4, 5, 6, or 7.
[00389] In a further aspect, the peptide sequence comprises octreotide or vapreotide. In a yet further aspect, the peptide sequence comprises octreotide. In a yet further aspect, the peptide sequence comprises vapreotide.
[00390] In a further aspect, the peptide sequence comprises DFCFDWKTET (SEQ ID NO: 3), wherein the first and fourth positions are D-amino acids.
[00391] In a further aspect, the peptide sequence comprises FCFAKTETA.
[00392] In various aspects, the peptide sequence further comprises a leader sequence of LKQINVIAGVKEPIRAYG (SEQ ID NO: 2) or LKQINVIAGVKPIRAYG (SEQ ID NO: 3). In a further aspect, the peptide sequence further comprises a leader sequence of LKQINVIAGVKEPIRAYG (SEQ ID NO: 3).
[00393] In various aspects, the peptide sequence further comprises a TEV protease recognition sequence. In a further aspect, the TEV protease recognition sequence is EXLYZQ (SEQ ID NO: 1), in which X is any amino acid and Z is any amino acid that contains a hydrophobic residue. In yet a further aspect, the TEV protease recognition sequence is ENLYFQ (SEQ ID NO: 1).
[00394] In a further aspect, the peptide sequence comprises one or more D-amino acid residues.
[00395] In a further aspect, the peptide sequence comprises one or more β-amino acid residues.
[00396] In a further aspect, the peptide sequence comprises one or more N-methylated amino acids.
[00397] In one aspect, the peptide sequence is a modified PapA sequence, wherein wherein the modified PapA sequence comprises Cys-Yn-Asp, wherein Y is a series of amino acid residues and n=0, 1, 2, 4, 5, 6, or 7.
[00398] In a further aspect, the modified PapA sequence comprises minimal substrate PapA.
[00399] In a further aspect, the modified PapA sequence is LKQINVIAGVKEPIRAYGCDSNNAANA (SEQ ID NO: 6), LKQINVIAGVKEPIRAYGCSDNNAAA (SEQ ID NO: 7), LKQINVIAGVKEPIRAYGCSNDAAA (SEQ ID NO: 8), LKQINVIAGVKEPIRAYGCSAANDA (SEQ ID NO: 9), LKQINVIAGVKEPIRAYGCSAAANDA (SEQ ID NO: 10), or LKQINVIAGVKEPIRAYGCSAAAANDA (SEQ ID NO: 11). In a still further aspect, the modified PapA sequence is LKQINVIAGVKEPIRAYGCDSNNAANA (SEQ ID NO: 6). In a still further aspect, the modified PapA sequence is LKQINVIAGVKEPIRAYGCSDNNAAA (SEQ ID NO: 7). In a still further aspect, the modified PapA sequence is LKQINVIAGVKEPIRAYGCSNDAA A (SEQ ID NO: 8). In a still further aspect, the modified PapA sequence is LKQINVIAGVKEPIRAYGCSAANDA (SEQ ID NO: 9). In a still further aspect, the modified PapA sequence is
LKQINVIAGVKEPIRAYGCSAAANDA (SEQ ID NO: 10). In a still further aspect, the modified PapA sequence is LKQINVIAGVKEPIRAYGCSAAAANDA (SEQ ID NO: 11). [00400] In a further aspect, the modified PapA sequence is LKQINVIAGVKEPIRAYGAAACSANDA (SEQ ID NO: 12), LKQINVIAGVKEPIRAYGAAACSANDACSANDA (SEQ ID NO: 13), LKQINVIAGVKEPIRAYGAAACSACDAADA (SEQ ID NO: 14), LKQINVIAGVKEPIRAYGAAAASACDAADA (SEQ ID NO: 15), or LKQINVIAGVKEPIRAYGAAACSAADAAADA (SEQ ID NO: 16). In a still further aspect, the modified PapA sequence is LKQINVIAGVKEPIRAYGAAACSANDA (SEQ ID NO: 12). In a still further aspect, the modified PapA sequence is LKQINVIAGVKEPIRAYGAAACSANDACSANDA (SEQ ID NO: 13). In a still further aspect, the modified PapA sequence is LKQINVIAGVKEPIRAYGAAACSACDAADA (SEQ ID NO: 14). In a still further aspect, the modified PapA sequence is LKQINVIAGVKEPIRAYGAAAASACDAADA (SEQ ID NO: 15). In a still further aspect, the modified PapA sequence is LKQINVIAGVKEPIRAYGAAACSAADAAADA (SEQ ID NO: 16). [00401] In a further aspect, the modified PapA sequence comprises one or more D- amino acid residues.
[00402] In a further aspect, the modified PapA sequence comprises one or more β- amino acid residues.
[00403] In a further aspect, the modified PapA sequence comprises one or more N- methylated amino acid residues. a. X GROUPS
[00404] In various aspects, X is a penicillamine or an amino acid residue comprising a -SH group or an amino acid residue comprising a -SeH group.
[00405] In a further aspect, X is a penicillamine.
[00406] In a further aspect, X is an amino acid residue comprising a -SH group. In a still further aspect, X is cysteine, homocysteine, D-cysteine, or D-homocysteine. In a still further aspect, X is homocysteine. In a still further aspect, X is D-cysteine. In a still further aspect, X is D-homocysteine. In a yet further aspect, X is cysteine.
[00407] In a further aspect, X is an amino acid residue comprising a -SeH group. In a still further aspect, X is selenocysteine or homoselenocysteine. In a still further aspect, X is selenocysteine. In a yet further aspect, X is homoselenocysteine.
b. YN GROUPS
[00408] In various aspects, Yn is a series of amino acid residues where n = 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9.
[00409] Examples of amino acid residues include, but are not limited to, natural amino acid residues, unnatural amino acid residues, D-amino acid residues, β-amino acid residues, and N-methylated amino acid residues.
[00410] Natural amino acid residues may include glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan, serine, cysteine, threonine, tyrosine, asparagine, glutamine, aspartic acid, glutamic acid, arginine, histidine, and lysine. [00411] Unnatural amino acid residues may include, but are not limited to, p- ethylthiocarbonyl-L-phenylalanine, p-(3-oxobutanoyl)-L-phenylalanine, 1 ,5-dansyl-alanine, 7-amino-coumarin amino acid, 7-hydroxy-coumarin amino acid, nitrobenzyl-serine, O-(2- nitrobenzyl)-L-tyrosine, p-carboxymethyl-L-phenylalanine, p-cyano-L-phenylalanine, m- cyano-L-phenylalanine, biphenylalanine, 3-amino-L-tyrosine, bipyridyl alanine, p-(2-amino- 1 -hydroxyethyl)-L-phenylalanine, p-isopropylthiocarbonyl-L-phenylalanine, 3-nitro-L- tyrosine and p-nitro-L-phenylalanine. Also, a p-propargyloxyphenylalanine, a 3,4-dihydroxy- L-phenyalanine (DHP), a 3,4,6-trihydroxy-L-phenylalanine, a 3,4,5-trihydroxy-L- phenylalanine, 4-nitro-phenylalanine, a p-acetyl-L-phenylalanine, O-methyl-L-tyrosine, an L- 3-(2-naphthyl)alanine, a 3-methyl-phenylalanine, an O-4-allyl-L-tyrosine, a 4-propyl-L- tyrosine, a 3 -nitro-tyrosine, a 3-thiol-tyrosine, a tri-O-acetyl-GlcNAcβ-serine, an L-Dopa, a fluorinated phenylalanine, an isopropyl-L-phenylalanine, a p-azido-L-phenylalanine, a p- acyl-L-phenylalanine, a p-benzoyl-L-phenylalanine, an L-phosphoserine, a phosphonoserine, a phosphonotyrosine, a p-iodo-phenylalanine, a p-bromophenylalanine, a p-amino-L- phenylalanine, and an isopropyl-L-phenylalanine, and the like.
[00412] In a further aspect, Yn comprises one or more D-amino acids.
[00413] In a further aspect, Yn comprises one or more β-amino acids.
[00414] In a further aspect, Yn comprises one or more N-methylated amino acids.
[00415] In various aspects, n is 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9. In a further aspect, n is 0, 1,
2, 3, 4, 5, 6, 7, or 8. In a still further aspect, n is 0, 1, 2, 3, 4, 5, 6, or 7. In a still further aspect, n is 0, 1, 2, 3, 4, 5, or 6. In a still further aspect, n is 0, 1, 2, 3, 4, or 5. In a still further aspect, n is 0, 1, 2, 3, or 4. In a still further aspect, n is 0, 1, 2, or 3. In a still further aspect, n is 0, 1, or 2. In a yet further aspect, n is 0 or 1. In a yet further aspect, n is 0. In a yet further aspect, n is 1. In a yet further aspect, n is 2. In a yet further aspect, n is 3. In a yet further aspect, n is 4.
In a yet further aspect, n is 5. In a yet further aspect, n is 6. In a yet further aspect, n is 7. In a yet further aspect, n is 8. In a yet further aspect, n is 9. c. Z GROUPS
[00416] In various aspects, Z is an aspartic acid residue, a glutamic acid residue, a hydroxy-glutamic acid residue, 2-amino-3-(2H-tetrazol-5-yl)propanoic acid, or a carboxyl- functionalized amino acid residue.
[00417] In a further aspect, Z is aspartic acid or glutamic acid. In a yet further aspect, Z is aspartic acid. In a yet further aspect, Z is glutamic acid.
[00418] In a further aspect, Z is a hydroxy-glutamic acid residue.
[00419] In a further aspect, Z is 2-amino-3-(2H-tetrazol-5-yl)propanoic acid.
[00420] In a further aspect, Z is a a carboxyl-functionalized amino acid residue.
Examples of carboxyl-functionalized amino acid residues include, but are not limited to, (2S,3S)-2-amino-3-methylsuccinic acid, (2S,3R)-2-amino-3 -methylsuccinic acid, (2S,3S)-2- amino-3-methylpentanedioic acid, (2S,3R)-2-amino-3-methylpentanedioic acid, (2S,4S)-2- amino-4-methylpentanedioic acid, (2S,4R)-2-amino-4-methylpentanedioic acid, and homoglutamic acid. d. YA GROUPS
[00421] In a further aspect, Ya is a series of amino acid residues where a = 0, 1, 2, 3, 4,
5, 6, or 7.
[00422] Examples of amino acid residues include, but are not limited to, natural amino acid residues, unnatural amino acid residues, D-amino acid residues, β-amino acid residues, and N-methylated amino acid residues.
[00423] Natural amino acid residues may include glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan, serine, cysteine, threonine, tyrosine, asparagine, glutamine, aspartic acid, glutamic acid, arginine, histidine, and lysine. [00424] Unnatural amino acid residues may include, but are not limited to, p- ethylthiocarbonyl-L-phenylalanine, p-(3-oxobutanoyl)-L-phenylalanine, 1 ,5-dansyl-alanine, 7-amino-coumarin amino acid, 7-hydroxy-coumarin amino acid, nitrobenzyl-serine, O-(2- nitrobenzyl)-L-tyrosine, p-carboxymethyl-L-phenylalanine, p-cyano-L-phenylalanine, m- cyano-L-phenylalanine, biphenylalanine, 3-amino-L-tyrosine, bipyridyl alanine, p-(2-amino-
1 -hydroxyethyl)-L-phenylalanine, p-isopropylthiocarbonyl-L-phenylalanine, 3-nitro-L- tyrosine and p-nitro-L-phenylalanine. Also, a p-propargyloxyphenylalanine, a 3,4-dihydroxy-
L-phenyalanine (DHP), a 3,4,6-trihydroxy-L-phenylalanine, a 3,4,5-trihydroxy-L- phenylalanine, 4-nitro-phenylalanine, a p-acetyl-L-phenylalanine, O-methyl-L-tyrosine, an L- 3-(2-naphthyl)alanine, a 3-methyl-phenylalanine, an O-4-allyl-L-tyrosine, a 4-propyl-L- tyrosine, a 3 -nitro-tyrosine, a 3-thiol-tyrosine, a tri-O-acetyl-GlcNAcβ-serine, an L-Dopa, a fluorinated phenylalanine, an isopropyl-L-phenylalanine, a p-azido-L-phenylalanine, a p- acyl-L-phenylalanine, a p-benzoyl-L-phenylalanine, an L-phosphoserine, a phosphonoserine, a phosphonotyrosine, a p-iodo-phenylalanine, a p-bromophenylalanine, a p-amino-L- phenylalanine, and an isopropyl-L-phenylalanine, and the like.
[00425] In a further aspect, Ya comprises one or more D-amino acids.
[00426] In a further aspect, Ya comprises one or more β-amino acids.
[00427] In a further aspect, Ya comprises one or more N-methylated amino acids.
[00428] In a further aspect, a is 0, 1, 2, 3, 4, 5, 6, or 7. In a further aspect, a is 0, 1, 2, 3,
4, 5, or 6. In a still further aspect, a is 0, 1, 2, 3, 4, or 5. In a still further aspect, a is 0, 1, 2, 3, or 4. In a still further aspect, a is 0, 1, 2, or 3. In a still further aspect, a is 0, 1, or 2. In a yet further aspect, a is 0 or 1. In a yet further aspect, a is 0. In a yet further aspect, a is 1. In a yet further aspect, a is 2. In a yet further aspect, a is 3. In a yet further aspect, a is 4. In a yet further aspect, a is 5. In a yet further aspect, a is 6. In a yet further aspect, a is 7. e. YB GROUPS
[00429] In a further aspect, Yb is a series of amino acid residues where b = 0, 1, 2, 3, 4,
5, 6, or 7.
[00430] Examples of amino acid residues include, but are not limited to, natural amino acid residues, unnatural amino acid residues, D-amino acid residues, β-amino acid residues, and N-methylated amino acid residues.
[00431] Natural amino acid residues may include glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan, serine, cysteine, threonine, tyrosine, asparagine, glutamine, aspartic acid, glutamic acid, arginine, histidine, and lysine. [00432] Unnatural amino acid residues may include, but are not limited to, p- ethylthiocarbonyl-L-phenylalanine, p-(3-oxobutanoyl)-L-phenylalanine, 1 ,5-dansyl-alanine, 7-amino-coumarin amino acid, 7-hydroxy-coumarin amino acid, nitrobenzyl-serine, O-(2- nitrobenzyl)-L-tyrosine, p-carboxymethyl-L-phenylalanine, p-cyano-L-phenylalanine, m- cyano-L-phenylalanine, biphenylalanine, 3-amino-L-tyrosine, bipyridyl alanine, p-(2-amino-
1 -hydroxyethyl)-L-phenylalanine, p-isopropylthiocarbonyl-L-phenylalanine, 3-nitro-L- tyrosine and p-nitro-L-phenylalanine. Also, a p-propargyloxyphenylalanine, a 3,4-dihydroxy-
L-phenyalanine (DHP), a 3,4,6-trihydroxy-L-phenylalanine, a 3,4,5-trihydroxy-L- phenylalanine, 4-nitro-phenylalanine, a p-acetyl-L-phenylalanine, O-methyl-L-tyrosine, an L- 3-(2-naphthyl)alanine, a 3-methyl-phenylalanine, an O-4-allyl-L-tyrosine, a 4-propyl-L- tyrosine, a 3 -nitro-tyrosine, a 3-thiol-tyrosine, a tri-O-acetyl-GlcNAcβ-serine, an L-Dopa, a fluorinated phenylalanine, an isopropyl-L-phenylalanine, a p-azido-L-phenylalanine, a p- acyl-L-phenylalanine, a p-benzoyl-L-phenylalanine, an L-phosphoserine, a phosphonoserine, a phosphonotyrosine, a p-iodo-phenylalanine, a p-bromophenylalanine, a p-amino-L- phenylalanine, and an isopropyl-L-phenylalanine, and the like.
[00433] In a further aspect, Yb comprises one or more D-amino acids.
[00434] In a further aspect, Yb comprises one or more β-amino acids.
[00435] In a further aspect, Yb comprises one or more N-methylated amino acids.
[00436] In a further aspect, b is 0, 1, 2, 3, 4, 5, 6, or 7. In a further aspect, b is 0, 1, 2,
3, 4, 5, or 6. In a still further aspect, b is 0, 1, 2, 3, 4, or 5. In a still further aspect, b is 0, 1, 2, 3, or 4. In a still further aspect, b is 0, 1, 2, or 3. In a still further aspect, b is 0, 1, or 2. In a yet further aspect, b is 0 or 1. In a yet further aspect, b is 0. In a yet further aspect, b is 1. In a yet further aspect, b is 2. In a yet further aspect, b is 3. In a yet further aspect, b is 4. In a yet further aspect, b is 5. In a yet further aspect, b is 6. In a yet further aspect, b is 7. f. Yx GROUPS
[00437] In a further aspect, Yx is a series of amino acid residues where x = 0, 1, 2, 3, 4,
5, or 6.
[00438] Examples of amino acid residues include, but are not limited to, natural amino acid residues, unnatural amino acid residues, D-amino acid residues, β-amino acid residues, and N-methylated amino acid residues.
[00439] Natural amino acid residues may include glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan, serine, cysteine, threonine, tyrosine, asparagine, glutamine, aspartic acid, glutamic acid, arginine, histidine, and lysine. [00440] Unnatural amino acid residues may include, but are not limited to, p- ethylthiocarbonyl-L-phenylalanine, p-(3-oxobutanoyl)-L-phenylalanine, 1 ,5-dansyl-alanine, 7-amino-coumarin amino acid, 7-hydroxy-coumarin amino acid, nitrobenzyl-serine, O-(2- nitrobenzyl)-L-tyrosine, p-carboxymethyl-L-phenylalanine, p-cyano-L-phenylalanine, m- cyano-L-phenylalanine, biphenylalanine, 3-amino-L-tyrosine, bipyridyl alanine, p-(2-amino-
1 -hydroxyethyl)-L-phenylalanine, p-isopropylthiocarbonyl-L-phenylalanine, 3-nitro-L- tyrosine and p-nitro-L-phenylalanine. Also, a p-propargyloxyphenylalanine, a 3,4-dihydroxy-
L-phenyalanine (DHP), a 3,4,6-trihydroxy-L-phenylalanine, a 3,4,5-trihydroxy-L- phenylalanine, 4-nitro-phenylalanine, a p-acetyl-L-phenylalanine, O-methyl-L-tyrosine, an L- 3-(2-naphthyl)alanine, a 3-methyl-phenylalanine, an O-4-allyl-L-tyrosine, a 4-propyl-L- tyrosine, a 3 -nitro-tyrosine, a 3-thiol-tyrosine, a tri-O-acetyl-GlcNAcβ-serine, an L-Dopa, a fluorinated phenylalanine, an isopropyl-L-phenylalanine, a p-azido-L-phenylalanine, a p- acyl-L-phenylalanine, a p-benzoyl-L-phenylalanine, an L-phosphoserine, a phosphonoserine, a phosphonotyrosine, a p-iodo-phenylalanine, a p-bromophenylalanine, a p-amino-L- phenylalanine, and an isopropyl-L-phenylalanine, and the like.
[00441] In a further aspect, Yx comprises one or more D-amino acids.
[00442] In a further aspect, Yx comprises one or more β-amino acids.
[00443] In a further aspect, Yx comprises one or more N-methylated amino acids.
[00444] In a further aspect, x is 0, 1, 2, 3, 4, 5, or 6. In a still further aspect, x is 0, 1, 2,
3, 4, or 5. In a still further aspect, x is 0, 1, 2, 3, or 4. In a still further aspect, x is 0, 1, 2, or 3. In a still further aspect, x is 0, 1, or 2. In a yet further aspect, x is 0 or 1. In a yet further aspect, x is 0. In a yet further aspect, x is 1. In a yet further aspect, x is 2. In a yet further aspect, x is 3. In a yet further aspect, x is 4. In a yet further aspect, x is 5. In a yet further aspect, x is 6. g. YY GROUPS
[00445] In a further aspect, Yy is a series of amino acid residues where y = 0, 1, 2, 3, 4,
5, 6, 7, or 8.
[00446] Examples of amino acid residues include, but are not limited to, natural amino acid residues, unnatural amino acid residues, D-amino acid residues, β-amino acid residues, and N-methylated amino acid residues.
[00447] Natural amino acid residues may include glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan, serine, cysteine, threonine, tyrosine, asparagine, glutamine, aspartic acid, glutamic acid, arginine, histidine, and lysine. [00448] Unnatural amino acid residues may include, but are not limited to, p- ethylthiocarbonyl-L-phenylalanine, p-(3-oxobutanoyl)-L-phenylalanine, 1 ,5-dansyl-alanine, 7-amino-coumarin amino acid, 7-hydroxy-coumarin amino acid, nitrobenzyl-serine, O-(2- nitrobenzyl)-L-tyrosine, p-carboxymethyl-L-phenylalanine, p-cyano-L-phenylalanine, m- cyano-L-phenylalanine, biphenylalanine, 3-amino-L-tyrosine, bipyridyl alanine, p-(2-amino-
1 -hydroxyethyl)-L-phenylalanine, p-isopropylthiocarbonyl-L-phenylalanine, 3-nitro-L- tyrosine and p-nitro-L-phenylalanine. Also, a p-propargyloxyphenylalanine, a 3,4-dihydroxy-
L-phenyalanine (DHP), a 3,4,6-trihydroxy-L-phenylalanine, a 3,4,5-trihydroxy-L- phenylalanine, 4-nitro-phenylalanine, a p-acetyl-L-phenylalanine, O-methyl-L-tyrosine, an L- 3-(2-naphthyl)alanine, a 3-methyl-phenylalanine, an O-4-allyl-L-tyrosine, a 4-propyl-L- tyrosine, a 3 -nitro-tyrosine, a 3-thiol-tyrosine, a tri-O-acetyl-GlcNAcβ-serine, an L-Dopa, a fluorinated phenylalanine, an isopropyl-L-phenylalanine, a p-azido-L-phenylalanine, a p- acyl-L-phenylalanine, a p-benzoyl-L-phenylalanine, an L-phosphoserine, a phosphonoserine, a phosphonotyrosine, a p-iodo-phenylalanine, a p-bromophenylalanine, a p-amino-L- phenylalanine, and an isopropyl-L-phenylalanine, and the like.
[00449] In a further aspect, Yy comprises one or more D-amino acids.
[00450] In a further aspect, Yy comprises one or more β-amino acids.
[00451] In a further aspect, Yy comprises one or more N-methylated amino acids.
[00452] In a further aspect, y is 0, 1, 2, 3, 4, 5, 6, 7, or 8. In a still further aspect, y is 0,
1, 2, 3, 4, 5, 6, or 7. In a still further aspect, y is 0, 1, 2, 3, 4, 5, or 6. In a still further aspect, y is 0, 1, 2, 3, 4, or 5. In a still further aspect, y is 0, 1, 2, 3, or 4. In a still further aspect, y is 0, 1, 2, or 3. In a still further aspect, y is 0, 1, or 2. In a yet further aspect, y is 0 or 1. In a yet further aspect, y is 0. In a yet further aspect, y is 1. In a yet further aspect, y is 2. In a yet further aspect, y is 3. In a yet further aspect, y is 4. In a yet further aspect, y is 5. In a yet further aspect, y is 6. In a yet further aspect, y is 7. In a yet further aspect, y is 8. h. Yz GROUPS
[00453] In a further aspect, Yz is a series of amino acid residues where z = 0, 1 , 2, 3, 4,
5, 6, or 7.
[00454] Examples of amino acid residues include, but are not limited to, natural amino acid residues, unnatural amino acid residues, D-amino acid residues, β-amino acid residues, and N-methylated amino acid residues.
[00455] Natural amino acid residues may include glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan, serine, cysteine, threonine, tyrosine, asparagine, glutamine, aspartic acid, glutamic acid, arginine, histidine, and lysine. [00456] Unnatural amino acid residues may include, but are not limited to, p- ethylthiocarbonyl-L-phenylalanine, p-(3-oxobutanoyl)-L-phenylalanine, 1 ,5-dansyl-alanine, 7-amino-coumarin amino acid, 7-hydroxy-coumarin amino acid, nitrobenzyl-serine, O-(2- nitrobenzyl)-L-tyrosine, p-carboxymethyl-L-phenylalanine, p-cyano-L-phenylalanine, m- cyano-L-phenylalanine, biphenylalanine, 3-amino-L-tyrosine, bipyridyl alanine, p-(2-amino- 1 -hydroxyethyl)-L-phenylalanine, p-isopropylthiocarbonyl-L-phenylalanine, 3-nitro-L-
tyrosine and p-nitro-L-phenylalanine. Also, a p-propargyloxyphenylalanine, a 3,4-dihydroxy- L-phenyalanine (DHP), a 3,4,6-trihydroxy-L-phenylalanine, a 3,4,5-trihydroxy-L- phenylalanine, 4-nitro-phenylalanine, a p-acetyl-L-phenylalanine, O-methyl-L-tyrosine, an L- 3-(2-naphthyl)alanine, a 3-methyl-phenylalanine, an O-4-allyl-L-tyrosine, a 4-propyl-L- tyrosine, a 3 -nitro-tyrosine, a 3-thiol-tyrosine, a tri-O-acetyl-GlcNAcβ-serine, an L-Dopa, a fluorinated phenylalanine, an isopropyl-L-phenylalanine, a p-azido-L-phenylalanine, a p- acyl-L-phenylalanine, a p-benzoyl-L-phenylalanine, an L-phosphoserine, a phosphonoserine, a phosphonotyrosine, a p-iodo-phenylalanine, a p-bromophenylalanine, a p-amino-L- phenylalanine, and an isopropyl-L-phenylalanine, and the like.
[00457] In a further aspect, Yz comprises one or more D-amino acids.
[00458] In a further aspect, Yz comprises one or more β-amino acids.
[00459] In a further aspect, Yz comprises one or more N-methylated amino acids.
[00460] In a further aspect, z is 0, 1, 2, 3, 4, 5, 6, or 7. In a further aspect, z is 0, 1, 2, 3,
4, 5, or 6. In a still further aspect, z is 0, 1, 2, 3, 4, or 5. In a still further aspect, z is 0, 1, 2, 3, or 4. In a still further aspect, z is 0, 1, 2, or 3. In a still further aspect, z is 0, 1, or 2. In a yet further aspect, z is 0 or 1. In a yet further aspect, z is 0. In a yet further aspect, z is 1. In a yet further aspect, z is 2. In a yet further aspect, z is 3. In a yet further aspect, z is 4. In a yet further aspect, z is 5. In a yet further aspect, z is 6. In a yet further aspect, z is 7.
[00461] It is contemplated that each disclosed derivative can be optionally further substituted. It is also contemplated that any one or more derivative can be optionally omitted from the invention. It is understood that a disclosed compound can be provided by the disclosed methods.
2. EXAMPLE PEPTIDE PRODUCTS
[00462] In one aspect, the invention relates to product compounds having a structure selected from:
G. METHODS OF CHEMICALLY MODIFYING A PEPTIDE SEQUENCE
[00465] The compounds of this invention can be prepared by employing reactions as shown in the following schemes, in addition to other standard manipulations that are known
in the literature, exemplified in the experimental sections or clear to one skilled in the art. For clarity, examples having a single substituent are shown where multiple substituents are allowed under the definitions disclosed herein.
[00466] Preferred methods include, but are not limited to, those described below.
During any of the following synthetic sequences, it may be necessary and/or desirable to protect sensitive or reactive groups on any of the molecules concerned. This can be achieved by means of conventional protecting groups, such as those described in T. W. Greene, Protective Groups in Organic Chemistry, John Wiley & Sons, 1981; and T. W. Greene and P. G. M. Wuts, Protective Groups in Organic Chemistry, John Wiley & Sons, 1991, which are hereby incorporated by reference.
[00467] Reactions used to generate the compounds of this invention are prepared by employing reactions as shown in the following Reaction Schemes, as described and exemplified below. The following examples are provided so that the invention might be more fully understood, are illustrative only, and should not be construed as limiting.
[00468] In one aspect, the invention relates to methods of chemically modifying a peptide sequence to install a thioether linkage, the method comprising reacting the peptide substrate with PapB.
[00469] In a further aspect, the peptide sequence further comprises a leader sequence of LKQINVIAGVKEPIRAYG (SEQ ID NO: 2) or LKQINVIAGVKPIRAYG (SEQ ID NO: 3). The leader sequence facilitates recognition of the full peptide sequence by PapB. However, the leader sequence is not required.
[00470] In a further aspect, the method further comprises addition of a protease. In a further aspect, the peptide sequence comprises a protease recognition sequence. A protease in conjunction with a peptide sequence comprising a protease recognition sequence allows for cleavage of a desired product from the leader sequence. In a further aspect, the protease is a TEV protease. TEV protease in conjunction with a peptide sequence comprising a TEV protease recognition sequence allows for cleavage of a desired product from the leader sequence. In a further aspect, the peptide sequence comprises a TEV protease recognition sequence. In a still further aspect, the TEV protease recognition sequence is EXLYZQ (SEQ ID NO: 1), in which X is any amino acid and Z is any amino acid that contains a hydrophobic residue. In yet a further aspect, the TEV protease recognition sequence is ENLYFQ (SEQ ID NO: 1).
[00471] In a further aspect, the method further comprises addition of a reducing agent. Examples of reducing agents include, but are not limited to, dithionite, flavodoxin,
flavodoxin reductase, titanium citrate, reduced nicotinamide adenine dinucleotide phosphate, and Hantzsch esters. In a still further aspect, the reducing agent comprises comprises dithionite, flavodoxin, flavodoxin reductase, titanium citrate, reduced nicotinamide adenine dinucleotide phosphate, or any combination thereof. In a still further aspect, the reducing agent comprises dithionite, flavodoxin, flavodoxin reductase, and titanium citrate. In a still further aspect, the reducing agent comprises dithionite, flavodoxin, and flavodoxin reductase. In a still further aspect, the reducing agent comprises dithionite and flavodoxin. In a still further aspect, the reducing agent comprises reduced nicotinamide adenine dinucleotide phosphate, titanium citrate, flavodoxin reductase, and flavodoxin. In a still further aspect, the reducing agent comprises reduced nicotinamide adenine dinucleotide phosphate, titanium citrate, and flavodoxin reductase. In a still further aspect, the reducing agent comprises reduced nicotinamide adenine dinucleotide phosphate and titanium citrate. In a still further aspect, the reducing agent comprises dithionite. In a still further aspect, the reducing agent comprises flavodoxin. In a still further aspect, the reducing agent comprises flavodoxin reductase. In a still further aspect, the reducing agent comprises titanium citrate. In a still further aspect, the reducing agent comprises reduced nicotinamide adenine dinucleotide phosphate.
[00472] In a further aspect, PapB installs two or more thioether linkages in the peptide sequence. For example, the peptide sequence may comprise the the sequence C-Ya-C-D-Yb- D; wherein C is a cysteine residue, D is an aspartic acid residue, Y is a series of amino acid residues, a = 1, 2, 3, 4, 5, 6, or 7, and b= 0, 1, 2, 3, 4, 5, 6, or 7. In this example, thioether linkages are installed between the first cysteine residue and the first aspartic acid residue and between the second cysteine residue and the second aspartic acid residue, yielding nested crosslinks. By way of example, the peptide sequence may also comprise the sequence C-Yx- D-Yy-C-Yz-D; wherein C is a cysteine residue, D is an aspartic acid residue, Y is a series of amino acid residues, x = 0, 1, 2, 3, 4, 5, 6, or 7, y = 1, 2, 3, 4, 5, 6, 7, or 8, and z = 0, 1, 2, 3, 4, 5, 6, or 7. In this example, thioether linkages are installed between the first cysteine residue and the first aspartic acid residue and between the second cysteine residue and the second aspartic acid residue, yielding in-line crosslinks.
H. METHODS OF CHEMICALLY MODIFYING A MODIFIED PAP A SEQUENCE
[00473] In one aspect, the invention relates to methods of chemically modifying a modified PapA sequence to install a thioether linkage, the method comprising reacting the modified PapA sequence with PapB.
[00474] In a further aspect, the modified PapA sequence comprises minimal substrate
PapA.
[00475] In a further aspect, the modified PapA sequence further comprises a leader sequence of LKQINVIAGVKEPIRAYG (SEQ ID NO: 2) or LKQ INVIA GVKPIRAYG (SEQ ID NO: 3). The leader sequence facilitates recognition of the full peptide sequence by PapB. However, the leader sequence is not required.
[00476] In a further aspect, the method further comprises addition of a protease. In a further aspect, the modified PapA sequence comprises a protease recognition sequence. A protease in conjunction with a modified PapA sequence comprising a protease recognition sequence allows for cleavage of a desired product from the leader sequence. In a further aspect, the protease is a TEV protease. TEV protease in conjunction with a modified PapA sequence comprising a TEV protease recognition sequence allows for cleavage of a desired product from the leader sequence. In a further aspect, the modified PapA sequence comprises a TEV protease recognition sequence. In a still further aspect, the TEV protease recognition sequence is EXLYZQ (SEQ ID NO: 1), in which X is any amino acid and Z is any amino acid that contains a hydrophobic residue. In yet a further aspect, the TEV protease recognition sequence is ENLYFQ (SEQ ID NO: 1).
[00477] In a further aspect, the method further comprises addition of a reducing agent. Examples of reducing agents include, but are not limited to, dithionite, flavodoxin, flavodoxin reductase, titanium citrate, reduced nicotinamide adenine dinucleotide phosphate, and Hantzsch esters. In a still further aspect, the reducing agent comprises comprises dithionite, flavodoxin, flavodoxin reductase, titanium citrate, reduced nicotinamide adenine dinucleotide phosphate, or any combination thereof. In a still further aspect, the reducing agent comprises dithionite, flavodoxin, flavodoxin reductase, and titanium citrate. In a still further aspect, the reducing agent comprises dithionite, flavodoxin, and flavodoxin reductase. In a still further aspect, the reducing agent comprises dithionite and flavodoxin. In a still further aspect, the reducing agent comprises reduced nicotinamide adenine dinucleotide phosphate, titanium citrate, flavodoxin reductase, and flavodoxin. In a still further aspect, the reducing agent comprises reduced nicotinamide adenine dinucleotide phosphate, titanium citrate, and flavodoxin reductase. In a still further aspect, the reducing agent comprises reduced nicotinamide adenine dinucleotide phosphate and titanium citrate. In a still further aspect, the reducing agent comprises dithionite. In a still further aspect, the reducing agent comprises flavodoxin. In a still further aspect, the reducing agent comprises flavodoxin reductase. In a still further aspect, the reducing agent comprises titanium citrate. In a still
further aspect, the reducing agent comprises reduced nicotinamide adenine dinucleotide phosphate.
[00478] In a further aspect, PapB installs two or more thioether linkages in the peptide sequence. For example, the peptide sequence may comprise the the sequence C-Ya-C-D-Yb- D; wherein C is a cysteine residue, D is an aspartic acid residue, Y is a series of amino acid residues, a = 1, 2, 3, 4, 5, 6, or 7, and b= 0, 1, 2, 3, 4, 5, 6, or 7. In this example, thioether linkages are installed between the first cysteine residue and the first aspartic acid residue and between the second cysteine residue and the second aspartic acid residue, yielding nested crosslinks. By way of example, the peptide sequence may also comprise the sequence C-Yx- D-Yy-C-Yz-D; wherein C is a cysteine residue, D is an aspartic acid residue, Y is a series of amino acid residues, x = 0, 1, 2, 3, 4, 5, 6, or 7, y = 1, 2, 3, 4, 5, 6, 7, or 8, and z = 0, 1, 2, 3, 4, 5, 6, or 7. In this example, thioether linkages are installed between the first cysteine residue and the first aspartic acid residue and between the second cysteine residue and the second aspartic acid residue, yielding in-line crosslinks.
I. EXAMPLES
[00479] The following examples are put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how the compounds, compositions, articles, devices and/or methods claimed herein are made and evaluated, and are intended to be purely exemplary of the invention and are not intended to limit the scope of what the inventors regard as their invention. Efforts have been made to ensure accuracy with respect to numbers (e.g., amounts, temperature, etc.), but some errors and deviations should be accounted for. Unless indicated otherwise, parts are parts by weight, temperature is in °C or is at ambient temperature, and pressure is at or near atmospheric.
[00480] The Examples are provided herein to illustrate the invention, and should not be construed as limiting the invention in any way. Examples are provided herein to illustrate the invention and should not be construed as limiting the invention in any way.
1. METHODS a. CLONING AND EXPRESSION OF PAPB
[00481 ] The plasmids PapB and pPH 151 were co-transformed into Escherichia coli
BL21 (DE3) T1 resistant cells (NEB C2527). The plasmid pPH151 contains the suf operon that encodes for sufABCDE proteins that aid in sulfur liberation, act as a Fe-S scaffold, and donate Fe-S clusters to apo proteins. The suf operon is frequently included with radical SAM
enzymes as it assists in assembling iron-sulfur clusters in heterologously expressed proteins. The transformation mixture was suspended in SOC recovery media and shaken at 200 rpm for 1 h at 37 °C. The mixture was plated on agar Lennox broth (LB) plates containing 34 μg/mL chloramphenicol and 34 μg/mL kanamycin and placed in an oven set to 37 °C for 16 h. An overnight culture (0.15 L) of LB containing 34 μg/mL chloramphenicol and 34 μg/mL kanamycin was inoculated with a single colony from the plate. Twelve aliquots (12 mL each) of overnight culture were used to inoculate twelve 2.8 L Fembach flasks containing 1 L each of LB supplemented with 34 μg/mL chloramphenicol and 34 μg/mL kanamycin. The cultures were grown at 37 °C and 180 rpm to an OD600nm of -0.35, at which point 0.1 mM iron(III) chloride (0.1 mM) and L-cysteine hydrochloride monohydrate (0.1 mM) were added. At OD600nm of ~0.5, the flasks were immersed in an ice bath and cooled for 20 min before inducing with 1 mM isopropyl β-D-1 -thiogalactopyranoside (IPTG). The cultures were grown overnight (-16 h), and the cells were harvested by centrifugation at 6500 x g. Typical yield is ~45 g of wet cell paste per 12 L of growth. The cell pellets were flash-frozen in liquid N2 and stored at -80 °C until use. b. PURIFICATION OF PAPB
[00482] PapB was purified inside of a Coy Laboratories anaerobic chamber maintained with a 98% N2/2% H2 atmosphere. Cell paste (15 g) was resuspended in a metal beaker with 0.1 L of 0.05 M KPi (pH 7.4) buffer containing 0.5 M KC1, 0.05 M imidazole, 20% glycerol (v/v) 0.1 mg/mL lysozyme, 10 μg/mL DNAse and 2 cOmplete EDTA-free Protease Inhibitor Cocktail tablets (Fisher Scientific NC0939481). The suspension was stirred for 30 min on ice after which the cells were lysed with a Branson digital sonifier operated at 50% amplitude for a total of 17 min (25 s on/35 s off) while stirring on ice. The resulting liquid was centrifuged at 18,442 x g for 45 min at 4 °C. Three 5 mL HisTrap HP columns (GE healthcare) charged with nickel sulfate were serially connected and equilibrated with loading buffer containing 0.05 M KPi (pH 7.4), 0.5 M KC1, 20% glycerol (v/v) and 0.05 M imidazole. The clarified lysate was loaded onto the columns at 3 mL/min. The columns were washed with eight column volumes (CV) of loading buffer and PapB was eluted with a linear gradient over 8 CV to 0.5 M imidazole in the loading buffer. Fractions containing PapB were identified by brown color and sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) gel.
[00483] The pooled fractions were further purified by an amylose resin (NEB E8022S) column equilibrated in loading buffer containing 0.05 M KPi (pH 7.4), 0.5 M KC1, and 0.05
M imidazole. The column was washed with 0.15 L of loading buffer and eluted with loading buffer containing 10 mM maltose. The resulting dark brown fractions were pooled and solid dithiothreitol (DTT) powder was added to combined fractions to a final concentration of 2 mM. An aliquot (1 mL) of 90 μM TEV protease was added, and the mixture was stirred for 14 h at room temperature. The cleaved MBP was removed from PapB through three serially connected 5 mL HisTraps equilibrated in loading buffer. The flowthrough from this column contained cleaved PapB. The resulting PapB protein was desalted into buffer containing 0.05 M PIPES·NaOH (pH 7.4), 0.3 M NaCl, 2 mM DTT and 20% glycerol (v/v). The concentration of PapB was determined by the Bradford method using bovine serum albumin (BSA) as a standard. PapB was reconstituted by mixing 12 molar equivalents of 0.1 M FeCh hexahydrate and NaiS nonahydrate as follows. Aliquots (5 μL) of the FeCh hexahydrate and Na2S were added individually, allowing 15 sec between additions to ensure thorough mixing. The FeCh was added to completion first, following by the addition of the Na2S. The reconstitution mixture was stirred for 4 h at room temperature. The resulting solution was centrifuged for 10 min at 16,000 xg for 10 min to remove any debris and desalted on a BioGel P6 DG desalting gel 100-200 mesh (wet) (Bio-Rad) into buffer containing 0.05 M PIPES·NaOH (pH 7.4), 0.3 M NaCl, 2 mM DTT, and 20% glycerol (v/v). The protein was concentrated to -3 mL with an Amicon concentrator under N2 with a YM-10 membrane
(Millipore).
[00484] The reconstituted PapB was further purified by a Cytiva XK26 (1000mm) S- 300 column equilibrated with buffer containing 0.05 M PIPES’NaOH (pH 7.4), 0.3 M KC1, 2 mM DTT and 10% glycerol (v/v). The protein was eluted isocratically at 2.7 mL/min, and fractions containing PapB were identified by dark brown color and visual inspection of a Coomassie-stained SDS-PAGE gel. The pooled fractions were concentrated to ~0.5 mL. Aliquots were flash-frozen in liquid N2 and stored at -80 °C. PapB was quantified by Bradford assay with BSA as a standard. A typical yield from the purification outlined above is 16.5 mg pure protein for 15 g wet cell paste. c. AMINO ACID ANALYSIS AND IRON CONCENTRATION
DETERMINATION
[00485] The correction factor for the Bradford assays was determined by direct amino acid analysis on three independent preparations of the protein. Amino acid analysis was carried out by the Molecular Structure Facility at the University of California-Davis as follows. A 0.1 mL aliquot of concentrated PapB was desalted into solution containing 10
mM NaOH using an Illustra NICK column (GE Healthcare). The protein samples were hydrolyzed in a solution containing 6 M HC1 and 1% phenol at 110 °C in a vacuum and resuspended in a norleucine solution as an internal standard. The PapB samples were analyzed by Hitachi 8800 amino acid analyzer that was calibrated with amino acid standards for protein hydrolysate on the Na-based Hitachi 8800 (Sigma, A-9906). These standards were verified by the National Institute of Standards and Technology (NIST) standard reference material 2389a. PapB samples were sent through a Concise ion-exchange column (AminoSep Beckman Style Na+, part # AAA-99-6312) with a secondary ninhydrin reaction for detection using Pickering Na buffers. The correction factor for the Bradford assays was determined to be 0.60 based on results from the three independent purifications and this factor was used in all subsequent protein concentration determinations to correct the values obtained from the less cumbersome Bradford determinations
[00486] The iron content of reconstituted PapB was determined through inductively coupled plasma-mass spectrometer (ICP-MS) on the same three separate enzyme preparations. This was done at the Center for Water, Ecosystems and Climate Science in the Department of Geology and Geophysics at the University of Utah as follows. PapB preparations were diluted to a concentration of 2-5 μM with 10% trace metal grade nitric acid before submission. The iron concentration was performed with a triple quadrupole inductively coupled plasma-mass spectrometer (ICP-MS, Agilent 8900, Santa Clara, CA). A 10 nm In/mL was added as an internal standard. An external calibration curve was prepared from 1000 mg/L single elemental standard (Inorganic Ventures, Christiansburg, VA). Fe Concentrations in six calibration solutions were 0, 8.3, 20.7, 66.2, 165.5 and 331.1 ng Fe/mL; all solutions contained 10 ng In/mL. The blanks, calibration solutions and diluted samples were run by ICP-MS using a double-pass quartz spray chamber, PTFE nebulizer and dual- syringe introduction system (Teledyne, AVX72000), platinum cones, and sapphire injector in a quartz platinum-shielded torch. In and Fe were detected at masses of 115 and 56, with a flow of 8 mL He/min in a collision cell. The Certified Reference Manual CRM 1643f
(National Institute of Standards and Technology, Gaithersburg, MD) was diluted 1 :20 and run with the samples as well as the calibration curve as a quality control for the calibration. The Fe in the CRM 1643f was measured to be 10% within the certified value. d. TEV PROTEASE PURIFICATION
[00487] SGI 200008 pRARE chemically competent cells were transformed with pNB512. The transformation was suspended in SOC recovery media and shaken at 200 rpm
for 45 min at 37 °C. The mixture was plated on agar Lennox broth (LB) plates containing 34 ug/mL chloramphenicol and 100 mg/mL ampicillin and placed in an oven set to 37 °C for 16 h. An overnight culture (0.15 L) of LB containing 34 mg/mL chloramphenicol and 34 mg/mL ampicillin was inoculated with a single colony from the plate. Twelve aliquots (0.010 L each) of overnight culture were used to inoculate twelve 2.8 L Fembach flasks containing 1 L each of LB supplemented with 34 μg/mL chloramphenicol and 100 μg/mL ampicillin. The cultures were grown at 37 °C and 175 rpm to an OD600nm of ~0.49, at which point 1 mM IPTG was added to each flask and the temperature was turned down to 16 °C. The cultures were grown overnight (~16 h), and the cells were harvested by centrifugation at 6500 xg. The cell pellets were flash-frozen in liquid N22 and stored at -80 °C until use.
[00488] Cell paste (15 g) was resuspended in a metal beaker with 0.1 L of 0.05 M KPi (pH 7.4) buffer containing 0.5M KC1, 0.05 M imidazole, 100 mg/mL lysozyme, 10 mg/mL PMSF, and 20% (v/v) glycerol. The suspension was stirred for 2 h at 4 °C. The cells were lysed with a Branson digital sonifier operated at 50% amplitude for a total of 15 min (10 s on/20 s off) while stirring on ice. The resulting liquid was centrifuged at 18,442 xg for 50 min at 4 °C. Two 5 mL HisTrap HP columns (GE healthcare) charged with nickel sulfate were serially connected and equilibrated with loading buffer containing 0.05 M KPi (pH 7.4), 0.5 M KC1, and 0.05 M imidazole. The clarified lysate was loaded onto the columns at 3 mL/min. The columns were washed with eight column volumes (CV) of loading buffer and TEV protease was eluted with a linear gradient over 8 CV to 0.5 M imidazole in the loading buffer. Fractions containing TEV protease were identified by SDS-PAGE, pooled and dialyzed three times against 4 L of 0.05 M KPi (pH 7.4), 0.5 M KC1, 0.05 M imidazole and 20% glycerol. The dialyzed protein was concentrated to a minimal volume followed by the addition of glycerol to a final concentration of 50% glycerol (v/v). The aliquots were flash frozen in liquid nitrogen and stored at -80 °C until use. e. SYNTHESIS OF MINIMAL SUBSTRATE PAP A (MSPAPA) AND
VARIANTS
[00489] PapA peptides were synthesized on either a PS3 peptide synthesizer (Protein Technologies Inc.) or a Prelude peptide synthesizer (Protein Technologies Inc.). Compared to the previously reported msPapA peptide (Van der Donk, W. A.; Bindman, N. A. Nat. Prod.: Discourse, Delivery, and Design, John Wiley & Sons: Oxford, 2014; pp 197-218), the N- terminal methionine was removed in all peptide syntheses. The syntheses used standard Fmoc procedures from the manufacturer and were carried out on a 0.025 mmol scale. All
natural Fmoc-amino acids were purchased from Protein Technologies Inc. N-Alpha-Fmoc-S- trityl-D-cysteine and Fmoc-D-aspartic acid a-tert-butyl ester were purchased from Chem Impex (04314). For the synthesis, 150 mg of 2-chlorotrityl chloride resin 100-200 mesh (ChemPep) was loaded with 9.3 mg of Fmoc-Ala-OH (~0.2 mmol/g resin). The resin was washed three times with 5 mL DMF and three times with 5 mL dichloromethane (DCM). The 9.3 mg of Fmoc-Ala-OH was dissolved in 1 mL of 1:1 dichloromethane (DCM):N,N- dimethylformaide (DMF) with 0.15 mmol diisopropylethylamine (DIPEA). This solution was added to the resin and gently shaken for 1 h. The Fmoc-Ala/DIPEA solution was then removed, and the resin was washed three times with 5 mL of DCM. The uncapped sites on the resin were capped by washing the resin with 20 mL of 17:2:1 DCM:methanol:DIPEA. The resin was then washed three times with 5 mL of DCM and three times with 5 mL of
DMF. The resin was then transferred to the reaction vessel.
[00490] All Fmoc-amino acids (0.15 mmol, 6 equivalents) were coupled by in situ activation withN-[(dimethylamino)-1H-1,2,3-triazo[4,5-b]pyridin-1-ylmethylene]-N- methylmethanaminium hexafluorophosphate-N-oxide (HATU) (0.15 mmol, 6 equivalents; ChemPep) in 0.6 M N-methylmorpholine. The peptides were deprotected and cleaved from the resin by adding 5 mL of cleavage solution (87.5% (v/v) TFA, 5% (v/v) thioanisole, 3% (v/v) ethane dithiol, 2.5% (v/v) triisopropylsilane, and 2% (v/v) anisole) followed by stirring for 2 h at room temperature. The cleavage reaction was filtered into 30 mL of ice-cold diethyl ether to precipitate the peptide. The solution was poured over a Buchner funnel filter and vacuumed to collect the peptide precipitate. The peptide dried on the vacuum for 15 min before being washed with 80 mL of ice-cold diethyl ether. After drying for an additional hour, the peptide was resuspended in 20 mL of water and sonicated for 15 min to aid in the dissolution of the peptide. The solution was then flash frozen in liquid nitrogen and lyophilized.
[00491] The peptides were purified using high-performance liquid chromatography (HPLC) with a Phenomenex Jupiter C18 preparative column (21.2 mm x 250 mm, 5 μm particle size, 300Å pore size) with buffer A as 0.1% trifluoroacetic acid (TFA, HPLC grade) in nanopure water and buffer B as 0.1% TFA (HPLC Grade) in acetonitrile (ACN, HPLC grade). The separation was carried out at a flow rate of 5mL/min with a linear gradient of buffer A from 88 to 60% over 65 min. Fractions were analyzed by LC-MS using one of two HPLC-MS/MS setups (Vanquish UHPLC with a diode array detector connected to a Q- Exactive or an Ultimate 3000 HPLC with a diode array detector interfaced to a LTQ OrbiTrap XL mass spectrometer) fitted with a Hypersil GOLD Cl 8 column (2.1 mm x 150
mm, 1.9 μm particle size) for separations at 0. 2mL/min. The LC-MS program for peptide fraction identification was set up as follows: buffer A was LC-MS Optima water (Fisher)/0.1% (v/v) LC-MS Optima TFA (Fisher) and buffer B was LC-MS Optima acetonitrile (Fisher)/0.1% (v/v) LC-MS Optima TFA (Fisher). The 12 min separation consisted of washing the column with 100% A for 3 min, followed by a linear gradient to 100% B from 3 to 6 min, followed by washing the column with 100% B from 6 to 9 min, and finally reequilibration in 100% A from 9 to 12 min. The MS detectors operated in positive ion mode and the FT analyzer settings are as follows: 70,000 resolution for the Q-Exactive and 100,000 resolution for the LTQ OrbiTrap, 1 microscan, and 200 ms maximum injection time. MS data analysis used Xcalibur software (Thermo Fisher). f. ENZYMATIC REACTIONS OF MSPAPA PEPTIDES WITH PAPB
[00492] Assays were conducted in a Coy Laboratories anaerobic chamber with 98% N2/2% H2 atmosphere at room temperature. All reactions contained 0.05 M PIPES·NaOH (pH 7.4), 2mM DTT, 2.4 mM SAM (enzymatically synthesized and purified as previously described (deGruyter, J. N., et al. Biochem. 2017, 56 (30), 3863-3873)), ~100-400 μM msPapA variants (concentration determined by peptide dry weight or by spectroscopic analysis in the case of Y19W), and 430nm-10 μM PapB. Either dithionite (dT) or flavodoxin (FldA), flavodoxin reductase (FPR) and NADPH were used to reduce PapB. For the assays that used chemical reductant, the total concentration was 2mM dT. For the assays that used the biological reducing system, the mixtures contained 25 μM FldA, 2 μM FPR and 2 mM NADPH. The total volume of the reactions ranged from 0.1 mL for initial screenings to 0.5 mL for MS/MS Collision Induced Dissociation (CID) fragmentation experiments described below. Control reactions in the absence of dT, SAM and PapB were also conducted.
Reactions were initiated with the addition of PapB and quenched at times ranging from 15 s to 2 h by the addition of 10% of the reaction volume of 30% (w/v) trichloroacetic acid (TCA, ACS grade). The samples were centrifuged at 16,000 xg for 10 min in a microcentrifuge to pellet the precipitated PapB. g. ALKYLATION OF MSPAPA PEPTIDES AND VARIANTS
[00493] After initial incubation, half of the enzymatic reaction and half of the control reaction was aliquoted for alkylation by iodoacetic acid. A 500 mM stock of iodoacetic acid (IAC) was prepared in the dark and added to the enzymatic reaction to a final concentration of 10mM (5x excess of the DTT concentration). These reactions were allowed to incubate in
the dark for six additional hours before quenching with the addition of 10% of the reaction volume of 30% (w/v) TCA. The samples were then centrifuged at 16,000 xg for 10 min in a microcentrifuge to pellet the precipitated PapB. h. TEV PROTEASE CLEAVAGE OF PEPTIDES
[00494] Where TEV cleavage was necessary, 90 μM TEV protease was added directly to the full PapB assay after initial incubation in a 1:1 volume ratio. The TEV-assay combination incubated for 4 h before quenching by the addition of 10% of the reaction volume of 30% (w/v) TCA. The samples were then centrifuged at 16,000 xg for 10 min in a microcentrifuge to pellet the precipitated PapB and TEV protease. i. U/HPLC-MS ANALYSIS OF ENZYMATIC REACTIONS AND
CONTROLS
[00495] The assays were analyzed using either a Vanquish UHPLC with a diode-array detector connected to a Q-Exactive mass spectrometer or an Ultimate 3000 HPLC with a diode-array detector connected to a LTQ OrbiTrap XL mass spectrometer. Each was operated in positive ion mode, the FT analyzer was set to 100,000 resolution, 1 microscan, and 200 ms maximum injection time. Xcalibur software was used to analyze data. A 20 pL aliquot was injected onto a Hypersil GOLD C18 column (2.1 mm x 150 mm, 1.9 μm particle size) (Thermo Fisher) pre-equilibrated in 0.1% (v/v) LC-MS Optima TEA (Fisher in LC-MS Optima water (Fisher). Chromatographic steps were carried out at 0.2 mL/min with buffer A containing 0.1% (v/v) TFA in Optima water and buffer B containing Optima grade acetonitrile with 0.1% (v/v) TFA. The separation consisted of washing with 100% A from 0 to 3 min, followed by a linear gradient from 100% to 0% A from 3 to 6 min, washing with 0% A from 6 to 10 min, and reequilibration with 100% A from 10 to 14 min. j . COLLISION-INDUCED DISSOCIATION (CID) FRAGMENTATION OF
UNMODIFIED AND MODIFIED PAPB
[00496] Enzymatic reactions were conducted on a 0.5 mL scale as described above to obtain sufficient material. After quenching the reaction with TCA and centrifugation to remove precipitated protein, the reaction mixtures were desalted using C18 ZipTips (Millipore) following the manufacturer’s protocols. The analyzer was first tuned to the mass of each msPapA peptide. The 3+ charge state corresponding to each msPapA peptide was isolated in the CID cell using an isolation width of 1.7-2.4 m/z (depending on complete or incomplete peptide turnover), 0.1 ms activation time, a resolution of 70,000, and fragmented
using a Normalized Collision Energy (NCE) of 25. The fragmentation analysis used mMass software.
2. CHARACTERIZATION OF PURIFIED PAPB
[00497] PapB was obtained to homogeneity using His6 affinity chromatography for the initial separation, followed by TEV cleavage and amylose chromatography to remove the MBP, reconstitution with Fe/S. Gel filtration was used to remove higher molecular weight complexes (FIG. 3). Since previous sequence analysis and ferrozine assays indicate that PapB likely has three [4Fe-4S] clusters — a 12-fold molar excess of iron and sulfide were added to the maturase for reconstitution. Amino acid and ICP-MS analysis of protein from multiple independent purifications show that the purified protein obtained by this procedure contains 13.5±0.3 mol of iron per mol of PapB. This is consistent with three [4Fe-4S] clusters per polypeptide chain. The enzymatic activity of PapB was established with HPLC- purified msPapA (FIG. 4A and FIG. 4B). The peptide elutes at 8 min under the conditions used in the separation (FIG. 4A, top left) and HR-MS/MS reveals two clearly visible charge states (FIG. 4A, bottom). Expansion of the +3-charge state (FIG. 4A, top right) reveals an isotopic envelope with the monoisotopic peak at m/z of 844.1201, which is within 0.5 ppm of the calculated unmodified peptide (calc: m/z 844.1197). In the presence of PapB, dithionite (dT), and SAM the monoisotopic peak of the +3 charge state shifts by 0.6716, which corresponds to a loss of 2 Da from the peptide. This is within 0.8 ppm of the expected mass for a singly crosslinked peptide. To validate that a thioether cross link had formed the modification reaction was carried out in bulk and the resulting sample was desalted and subjected to HR-MS/MS analysis. As expected for a crosslinked ranthipeptide (Precord, T. W., et al. ACS Chem. Biol. 2019, 14 (9), 1981-89), no fragments are seen between the C19 and D23 in the reacted peptide. Additionally, a 2 Da loss was observed in the b23 ion and in every y series ion above y7 (FIG. 4B). The fragmentation data is shown in Table 1 below. It is noted that under these conditions, complete conversion of msPapA to a singly crosslinked peptide is routinely observed using 0.1 nmol PapB and 20 mmol msPapA in 5 min.
TABLE 1.
Unmodified indicates z = 1 charge state.
Underline indicates z = 2 charge state.
Italics indicates z = 3 charge state.
* Indicates the fragment originated from the modified peptide
[00498] Next, the kinetics of the modification reaction catalyzed by PapB in the presence of dT or a biological reducing system (FldA/FPR/NADPH) were assessed (FIG. 5). In these experiments, the enzyme concentration was kept low (430 nM) relative to the peptide (191 μM; established by tryptophan absorbance). Under these conditions, both show robust turnover, with dT showing roughly 3-fold faster kinetics than that observed with the biological reducing system. Three replicate runs were conducted using either dT or the biological reducing system of FldA/FPR/NADPH. At each timepoint, an aliquot was taken from the initial assay batch quenched with TCA. Both the biological reducing system and the chemical reducing system had 100% substrate conversion after 300 s. At 15 s, 2- and 4-fold increases in the concentration of PapB results in conversion of unmodified msPapA to modified msPapA that is roughly 2- and 4-fold greater than initial conditions, suggesting that activity is proportional to PapB concentration. However, increasing the msPapA concentration by 2- and 4-fold did not alter the distribution of the reaction, suggesting that peptide concentration was saturating. Therefore, the rate that is measured in these experiments is a good approximation of kcat for PapB (FIG. 5-FIG. 7). Using the linear portions of the curves turnover numbers of 7.4 ± 0.1 s-1 with dT and 2.6 ± 0.2 s-1 are estimated with the biological reducing system.
3. LEVERAGING SUBSTRATE PROMISCUITY OF A RADICAL SAM RiPP
MATURASE TOWARDS INTRAMOLECULAR PEPTIDE CROSSLINKING
APPLICATIONS a. PAPB MODIFIES EXPANDED AND CONTRACTED C(X3)D MOTIFS
[00499] To assess the sequence dependence of the modification, minimal substrates containing 0-6 amino acids between the crosslinked Cys and Asp were synthesized and incubated with PapB (FIG. 8A). In each case, a loss of 2 Da is observed upon the addition of PapB (compare FIG. 8B and FIG. 8C). While the reactions with 1-5 intervening residues appear to go to completion, CXQD (FIG. 8B) and CX6D (FIG. 8C) did not fully react — suggesting that PapB does not processes these motifs efficiently. The observed monoisotopic masses for each processed and unprocessed species of peptide agree (to within < 4 ppm error) with the expected monoisotopic masses (Table 2).
TABLE 2.
z = 3 in all cases
[00500] Treatment with iodoacetic acid (IAC) suggests that no free thiols are present in the treated samples, other than the C in the unmodified portion of CX0D and CX6D (FIG 9-
FIG. 14). This shows that PapB has introduced a thioether crosslink in each peptide.
[00501] The location of modification in each msPapA peptide variant was investigated by collision-induced dissociation (CID) MS/MS. The modified msPapA peptides were analyzed and compared to the unmodified control peptides. In each case, the samples were
introduced to the mass spectrometer by direct infusion after quenching with TCA and removal of excess salts. The +3 charge state envelope was isolated and fragmented in the CID cell of the instrument. The fragmentation data showing all b and y ions that could be identified are shown in Tables 3-8 below. In general, the unmodified peptides all displayed fragmentation between Cys and Asp residues. Upon modification, by the addition of PapB, no fragmentation peaks are observable between those two residues. In the case of the b fragments, no change of mass is observed until after the Asp residue, after which a -2 Da loss is seen in each fragment. By contrast, a -2 Da loss is observed after the Cys residue in each y fragment.
Unmodified indicates z = 1 charge state.
Underline indicates z = 2 charge state.
Italics indicates z = 3 charge state.
* Indicates the fragment originated from the modified peptide
Unmodified indicates z = 1 charge state.
Underline indicates z = 2 charge state.
Italics indicates z = 3 charge state.
* Indicates the fragment originated from the modified peptide
Unmodified indicates z = 1 charge state.
Underline indicates z = 2 charge state.
Italics indicates z = 3 charge state.
* Indicates the fragment originated from the modified peptide
Unmodified indicates z = 1 charge state.
Underline indicates z = 2 charge state.
Italics indicates z = 3 charge state.
* Indicates the fragment originated from the modified peptide
Unmodified indicates z = 1 charge state.
Underline indicates z = 2 charge state.
Italics indicates z = 3 charge state.
* Indicates the fragment originated from the modified peptide
Unmodified indicates z = 1 charge state.
Underline indicates z = 2 charge state.
Italics indicates z = 3 charge state.
* Indicates the fragment originated from the modified peptide
[00502] The MS/MS data is consistent with the formation of thioether crosslinks in non-a positions. In mild CID conditions, sactipeptide (sulfur-to-a carbon thioether crosslinked peptides) MS/MS spectra generally produce fragments at each residue location but contain a 2 Da loss at the acceptor (non-Cys) residue (Rea, M. C., et al. Proc. Natl. Acad.
Sci. U. S. A. 2010, 707 (20), 9352-9357 and Lohans, C. T. J. Antibiot. 2014, 67, 23-30).
Conversely, CB- and Cy-thioether crosslinked peptides do not produce fragments within the macrocycle in mild CID conditions (Hudson, G. A., et al. J. Am. Chem. Soc. 2019, 141, 8228-
8238). Previous work by the Mitchell lab calculated the zero-point energy between Ca- and
Cβ-thioethers and revealed the Cβ-linkage electronic energy to be 12 kcal/mol more stable than that of the Ca-linkage. This energy difference provides an explanation for the difference seen in MS/MS spectra for these classes of RiPPs. All MS/MS spectra of the msPapA thioether motif expansions and contractions demonstrate a stable macrocycle — i.e., no fragments are found between the Cys and Asp residues.
[00503] The reactions with CX0D and CX6D peptides did not go to completion; therefore, unmodified peptide fragments are also seen in these reactions revealing cleavage between the C and D residues in the unmodified portion of the isolated envelope, serving as internal controls (FIG. 4D). b. PAPB TOLERATES EXTENSIONS FROM THE LEADER PEPTIDE AND
PROCESSES IN-LINE AND NESTED CROSSLINKS INDEPENDENTLY
[00504] We next explored whether the sequence context of the CX3D sequence in the natural peptide, and the specific amino acids within the motif are essential for recognition and crosslinking (FIG. ISA). We did not test an exhaustive number of modifications, as the addition of 3 or 4 Ala residues immediately adjacent to the recognition motif clearly did not impair cross linking activity. As FIG. 15B shows, all the peptides that were examined were efficiently crosslinked by the enzyme. FIG. 15C demonstrates that the crosslink occurs within the CX3D sequence, even if an alternate D residue is available downstream.
[00505] The naturally occurring PapA peptide is processed by PapB to introduce six ranthionine linkages, which are either in line with the Cys and Asp residues within a CX3D motif being crosslinked, or nested with the C residue occurring within one CX3D motif crosslinking with an Asp residue located C-terminal to it (Precord, T. W., et al. ACS Chem. Biol. 2019, 14 (9), 1981-89). As FIG. 15 shows, both nested and in-line variants of the peptide were able to be crosslinked by simply repositioning the CX3D element within the peptide. In the case of the in-line and nested crosslinks, the treatment with PapB results in the loss of 4 Da from the peptide (FIG. 15B). The observed monoisotopic masses for these species are < 5ppm of the expected monoisotopic masses (Table 9).
[00506] Tandem mass spectrometry reveals a similar pattern in the b and y fragments; a mass loss of 2 Da is seen in each b fragment after D and in each y fragment after C. The fragmentation data for all identifiable peaks are shown in Tables 10-14, a stable macrocycle is seen in each peptide. In all these cases, treatment with IAA resulted in no carboxymethylation of the modified peptide (FIG. 16-20).
Unmodified indicates z = 1 charge state.
Underline indicates z = 2 charge state.
Italics indicates z = 3 charge state.
* Indicates the fragment originated from the modified peptide
Unmodified indicates z = 1 charge state.
Underline indicates z = 2 charge state.
Italics indicates z = 3 charge state.
* Indicates the fragment originated from the modified peptide
Unmodified indicates z = 1 charge state.
Underline indicates z = 2 charge state.
Italics indicates z = 3 charge state.
* Indicates the fragment originated from the modified peptide
Unmodified indicates z = 1 charge state.
Underline indicates z = 2 charge state.
Italics indicates z = 3 charge state.
* Indicates the fragment originated from the modified peptide
Unmodified indicates z = 1 charge state.
Underline indicates z = 2 charge state.
Italics indicates z = 3 charge state.
* Indicates the fragment originated from the modified peptide
[00507] The results with expansions of the CXnD motif in the previous section demonstrate a lack of defined specificity in the recognition sequence, beyond the preference for Cys and Asp. The data with the nested crosslinks above extend this to include distance from the leader peptide recognition sequence, as well as the individual amino acids within the processed peptide. These observations suggest that the only elements that guide binding and crosslinking activity is the presence of proximal Cys and Asp residues, and the leader sequence, which is presumed to be required for RiPP maturases (Precord, T. W., et al. ACS Chem. Biol. 2019, 14 (9), 1981-89). These observations support the notion that PapB may be able to be used widely to introduce thioether crosslinks in peptides that are completely unrelated to the naturally occurring PapA substrate.
[00508] Indeed, PapB has been used recently to prepare peptide products that are capable of binding single protein targets, such as the SARS-CoV-2 spike receptor binding domain (King, A.M., et al. Nat. Commun. 2021, 12, 6343). The peptide in that design contained a leader sequence, which through a TEV protease recognition sequence is connected to a minimal substrate containing two CX3E motifs. The initial report on PapB had demonstrated that both Asp and Glu are crosslinked by the enzyme (Precord, T. W., et al. ACS Chem. Biol. 2019, 14 (9), 1981-89). In the more recent article, however, while the peptide contained two potential crosslinking motifs, only a single crosslink was observed. Considering the in vitro data with highly active protein shows that the topology of the modification can be essentially directed, this result was revisited to determine if the absence of the second crosslinks reflects the in vivo system employed rather than inherent to PapB. A synthetic peptide that is identical to the unnatural peptide used to target the SARS-CoV-2 spike receptor-binding domain was synthesized and treated with the protein as described above (FIG. 21A). As the mass spectrum of the peptide shows, the enzyme installs two crosslinks, as evidenced by the loss of 4 Da in the modified peptide (FIG. 21B). Next, TEV cleavage of the resulting product was carried out to release mature peptide and as the MS shows, it also exhibits a 4 Da loss, localizing the modification to the peptide. The observed monoisotopic masses for both the full length and the TEV cleaved peptide are < 3 ppm of the expected values (Table 15).
TABLE 15.
[00509] Tandem mass spectrometry shows a fragmentation pattern that is indicative of two thioether events occurring; one between Cys3 and Glu7, and the other between Cys9 and
Glul3 (FIG. 21C, Table 16). Therefore, the presence of a single cross link in the reported peptide was likely due to the in vivo conditions employed.
Unmodified indicates z = 1 charge state.
Underline indicates z = 2 charge state.
Italics indicates z = 3 charge state.
* Indicates the fragment originated from the modified peptide c. D-AMTNO ACIDS ARE PROCESSED BY PAPB
[00510] In initial experiments with PapA/PapB the CX3D spacing was intriguing, which suggested that the enzyme recognizes the Cys and Asp residues as part of a helical fragment, since Cys and Asp sidechains would be expected to be located on the same face of an alpha helix. However, the expansions and contractions of the motif clearly demonstrated that the spacing is immaterial. The expansion and contraction results suggest that only the identity of the amino acid or specific chemical moieties is important. Therefore, whether
PapB can process msPapA when Cys and Asp are replaced with their dextrorotatory enantiomers was explored (FIG. 22A). With the leader-DCSANDA peptide, full conversion to the crosslinked peptide is seen, as evidenced by the loss of 2 Da (FIG. 22B). With the leader-CSANDDA peptide, significant substrate turnover is observed as well, but the conversion is not complete (FIG. 22B). The leader-DCSANDDA is processed inefficiently under these conditions, though some product is clearly observed in the MS. While it is
possible to suggest that the small amount of product observed with this peptide is due to contaminating L-amino acids in the commercially available sources, that impurity would only account to 1-2% of product turnover. Based on the MS data, at least ~15% of the substrate is converted to product, arguing that the modification represents a bona fide DCys to DAsp thioether cross link. Finally, CID MS/MS spectrometry shows a loss of 2 Da in each y- fragment after the C residue and in the single fragment after the D residue in all three D- peptide scenarios (FIG. 22C, Tables 17-19) — showing a stable macrocycle. Control experiments show that treatment with IAA results in no carboxymethylation in the C19DC peptide (FIG. 23). In the case of the D23DD and C19DC / D23DD peptides, carboxymethylation is present upon IAA treatment due to incomplete turnover (FIG. 23-25).
However, the carboxymethylated species does not show any evidence of a 2 Da loss, which gives evidence that the Cys thiol is participating in the newly installed bond in these unnatural peptides.
Unmodified indicates z = 1 charge state.
Underline indicates z = 2 charge state.
Italics indicates z = 3 charge state.
* Indicates the fragment originated from the modified peptide
TABLE 18.
y9* (-2 Da) AYGCSANDDA* 869.3094 869.3070 2.76 y8 YGCSAN'DA 800.2879 800.2884 -0.65 y8* (-2 Da) YGCSANDDA* 798.2723 798.2739 -1.99 y7 GCSANDDA 637.2246 637.2263 -2.66 y7* (-2 Da) GCSANDDA* 635.2090 635.2099 -1.48 y6 CSANDDA 580.2032 580.2044 -2.08 y6* (-2 Da) CSANDDA* 578.1875 578.1887 -2.00 y5 SANDDA 477.1940 477.1941 -0.27 y5* (-1 Da) SANDDA* 476.1861 Not Found N/A y4 ANDDA 390.1619 390.1619 -0.06 y4* (-1 Da) ANDDA* 389.1541 Not Found N/A
Unmodified indicates z = 1 charge state.
Underline indicates z = 2 charge state.
Italics indicates z = 3 charge state.
* Indicates the fragment originated from the modified peptide
Unmodified indicates z = 1 charge state.
Underline indicates z = 2 charge state.
Italics indicates z = 3 charge state.
* Indicates the fragment originated from the modified peptide
[00511] Next, it was attempted to transpose the Cys and Asp residues by using a DSANCA motif attached to the leader. However, we were unable to observe any crosslinked product with the transposed peptides, either with L- or D-amino acids (FIG. 26). These data support the notion that the active site has substantial flexibility with regards to the Cys, but that the interaction with Asp limit the range of available productive conformations. Previous studies have shown that mutation of a conserved PapB Arg residue, which may be near the PapA peptide Asp binding site, to an Ala abolishes activity. Inversion of the sidechain would similarly eliminate the interaction leading to no crosslinking. d. PAPB PROCESSES SEQUENCES UNRELATED TO THE WILD TYPE
PEPTIDE SEQUENCE
[00512] The results presented in the previous sections highlight the remarkable lack of sequence specificity in PapB, suggesting that the enzyme may be able to cross link virtually any sequence that is tethered to the leader sequence, so long as a Cys and a downstream Asp/Glu residue is present in the peptide. As a proof of concept, the use of PapB to generate an analog of octreotide was explored. Octreotide is an FDA-approved drug used to treat excessive human growth hormone production, to control symptoms in several types of cancers, and to treat gastrointestinal bleeding (Lamberts, S. W. J., et al. Ear. J. Endocrinol. 2019, 181, R173-R183). Octreotide has two D-amino acids, making it less susceptible to protease degradation in vivo (Muttenthaler, M., et al. Nat. Rev. DrugDiscov. 2021, 20, 309- 325).
[00513] Octreotide is an 8-mer peptide with the sequence DFCFDWKTCT, with D- amino acids at the first and fourth positions. The two C residues form a disulfide-linked macrocycle. It has been noted that WT-PapA contains positively charged, nonpolar, polar uncharged, and bulky side-chain residues between the six donor and acceptor residue motifs (Precord, T. W., et al. ACS Chem. Biol. 2019, 14 (9), 1981-89). This observation, in combination with the successful crosslinking in an expanded motif, suggested that PapB may be able to introduce disulfide mimetic bonds via a thioether in a variety of peptide substrates so long as a thiol and carboxylate moiety are present. As a proof of concept, two octreotide analogs were synthesized. Both designs dispensed with the C-terminal Cys in favor of a Glu, which was used to cross link to the Cys with PapB. In the first design attempt, the sequence was further simplified by replacing DW4 with Ala (FIG. 27A). The octreotide analog sequence was covalently attached to the PapA leader peptide by solid-phase peptide synthesis (SPPS). The second design contained only the C7E replacement, but to facilitate removal of
the leader peptide, an ENLYFQ sequence was incorporated between the leader and the peptide to provide a convenient site for TEV cleavage. The incubation of either of the designed octreotide analogs with PapB leads to formation of a new product. In each case, the product is 2 Da lighter than the starting material, consistent with the formation of a crosslink (FIG. 27B). An intrapeptide disulfide can be eliminated as the source of this loss because the peptide only contains one Cys residue. It is noted that the reaction is ~75% complete with this analog, as assessed from the isotopic envelope. However, the observed monoisotopic masses for each peptide products species are in good agreement with the expected monoisotopic masses for a single cross link (< 3 ppm, FIG. 27B and FIG. 27C, Table 20). A structure of the synthesized octreotide analog can be found in FIG. 28. Subsequent MS/MS analysis corroborates the initial mass spectrometry data; the fragmentation pattern of the peptide depicts small fragments between the crosslinked Cys and Glu due to incomplete crosslinking. There is a clear 2 Da loss pattern in every y-fragment after the Cys and a 2 Da loss in every b- fragment after the C-terminal Glu with the modified peptide (Tables 21-22).
Unmodified indicates z = 1 charge state.
Underline indicates z = 2 charge state.
Italics indicates z = 3 charge state.
* Indicates the fragment originated from the modified peptide
Unmodified indicates z = 1 charge state.
Underline indicates z = 2 charge state.
Italics indicates z = 3 charge state.
* Indicates the fragment originated from the modified peptide
[00514] Next, it was attempted to use TEV protease to release the modified peptide to show the feasibility of the use of this method to generate a novel octreotide analog. Other than Pro, the TEV protease can accommodate other amino acids at the P1 ’ position (Kapust,
R.B., et al. Biochem. Biophy. Res. Commun. 2002, 294 (5), 949-955), but G or S are preferred. Residues other than G or S are acceptable at the P1 ’ position, though they result in diminished enzymatic efficacy. A crystal structure of a catalytically inactive form of TEV protease that was co-crystallized with an oligopeptide substrate revealed that the side chain of the residue at the P1 ’ position is partially exposed to solvent (Phan, J., et al. J. Biol. Chem.
2002, 277 (52), 50564-60672). D-amino acids have likely not been tested at the P1’ position.
When treated with TEV protease, the peptide containing the TEV cleavage site undergoes
cleavage to release the C-terminal fragment (FIG. 27C). The results highlight that DF is tolerated the P1 ’ position. This proof-of-concept experiment demonstrates that PapB and TEV protease can be used together to generate therapeutic analogs from synthetic peptide substrates that contain both Cys and Asp/Glu residues, where PapB installs thioether bond(s) between Cys and Asp/Glu to replace disulfide bridges.
[00515] These findings support the notion that PapB can modify peptides with large spacing between the thiol and carboxylate moieties as well as sequences unrelated to PapA. These initial results indicate that PapB has utility as a bimoiety-dependent thioether installation tool; (1) it is tolerant of a variety of sidechains spanning the peptide between the donor and acceptor Cys and Asp/Glu residues, (2) the orientation as well as spacing of the carboxylate and thiol moieties is flexible, and (3) TEV recognition sequences can be introduced to allow for modified peptides to be isolated from the leader sequence. e. DISCUSSION
[00516] In the 20 years since Sofia and coworkers established the RS superfamily (Sofia, H. J., et al. Nucleic Acids Res. 2001, 29 (5), 1097-1106), there has been an explosion of complex transformations that are attributable to RS enzymes. RS enzymes vastly expand the biochemical reaction repertoire because of their ability to activate C-H bonds for a variety of transformations, which can range from epimerizations to bonds to other carbon atoms or to heteroatoms. PapB catalyzes one such transformation, which entails activation of the carbon adjacent to a carboxylate moiety to crosslink to the thiol sidechain of Cys (Precord, T. W., et al. ACS Chem. Biol. 2019, 14 (9), 1981-89). The mechanistic details of thioether crosslink formation remain to be elucidated. However, these results highlights hitherto unknown promiscuity in PapA/B that will have implications in mechanism of substrate recognition.
[00517] Based on all available structural and biochemical data on RiPP maturase proteins, one would expect that the leader sequence binds to the RiPP recognition element (RRE) domain and directs the peptide to the active site of the protein to be modified. Implicit in this is the assumption that the specificity in the substrate selection is governed by the binding energy of interactions with the leader sequence to the RRE domain. A conserved Asn sidechain in the leader sequence has been proposed previously as being required for the peptide-RRE interaction. The results that show PapB can accept substrates with Cys-to-Asp separation ranging from 0-6 amino acids, perhaps is evidence for this in that the binding energy for interactions with the leader sequence is leveraged towards reactivity. However, when these studies initially began, it was assumed that the 3 amino acid separation likely
meant that the peptide has helical structure, as has been proposed previously (King, A.M., et al. Nat. Commun. 2021, 72, 6343), which would place the sidechains of the Cys and Asp residues near one another in three-dimensional space. The observation that the enzyme can accept substrate with variable Cys-to-Asp spacing, however, suggests that the recognition relies on the specific sidechain and not the secondary structure. In other words, the enzyme specifically recognizes the Cys and Asp/Glu sidechains. Since there are no structural data on this enzyme, it is difficult to know how this may be accomplished, but it is noted that it has been proposed that in the thioether crosslinking enzymes, the thiolate of the Cys can interact with one of the auxiliary Fe-S clusters. One can imagine that the recognition of the Asp/Glu may involve a hydrophilic or positively charged patch of residues. An Arg residue in PapB (Arg372) has previously been implicated by sequence alignments, mutation of which abolished crosslinking activity. Therefore, the model for recognition that best fits the data is one where the peptide to be modified has only two albeit very specific interactions with the enzymes, outside of the leader sequence.
[00518] The observation that crosslinking efficiency is decreased when the separation is zero or six likely results from either constraint on the degrees of freedom in the shorter span, or the presence of too many possible conformations in the longer separation, both of which would lead to fewer productive interactions between the residues to be crosslinked and the specific locations in which they bind. Additional evidence for the absence of significant sequence dependence, other than the identity of the Cys and Asp/Glu is the fact that D-amino acids are tolerated. Outside of the leader sequence, it is proposed that there are no specific interactions between the enzyme and the rest of the peptide other than the binding of the thiolate and carboxylate. As with other RS enzymes, one can anticipate that the binding occurs to place the 5 ’-position of SAM within or near van Der Waals radius of the H-atom to be abstracted, which is in turn, within close proximity of the crosslinking Cys sulfur.
[00519] While there are several examples of promiscuity by rSAM RiPP maturases including hybrid RiPPs produced using chimeric leader peptides (Burkhart, B. J, et al. ACS Cent. Set. 2017, 3 (6), 629-638), RiPPs with acceptor residue alterations (Himes, P. M., et al. ACS Chem. Biol. 2016, 11 (6), 1737-1744), truncated (i.e., leaderless) peptides being accepted by epimerases (Himes, P. M., et al. ACS Chem. Biol. 2016, 77 (6), 1737-1744), changes in the macrocycle core being tolerated (King, A.M, et al. Nat. Commun. 2021, 12, 6343), and residue-epimerization shifts based on core sequence changes (Komeli, M., et al. ACS Synth. Biol. 2021, 10 (2), 236-242), this work demonstrates a highly predictable pattern for crosslink formation.
[00520] FIG. 29 provides a brief summary of successful PapB-mediated thioether crosslinks in tested peptide sequences.
4. THE LEADER PEPTIDE SEQUENCE IS NOT REQUIRED
[00521] As shown in FIG. 30, the leader peptide sequence is not required for modification via PapB. PapB “leaderless” sequences that contain non-proteinogenic amino acids still demonstrated thioether linkages via mass spectrometry.
5. INTERPEPTIDE CROSSLINK STUDIES
[00522] As shown in FIG. 31 -FIG. 35, mass spectrometry results reveal evidence that interpeptide crosslinking can also be achieved with PapB. FIG. 31 shows mass spectrometry data for a one-to-one interpeptide crosslink as well as polymerization-like addition of X-mer peptide subunits. FIG. 32 shows results for a general assay peptide before and after PapB, demonstrating the presence of interpeptide products. FIG. 33 shows mass spectrometry results showing evidence of simple and complex mass envelopes. As a proof-of-concept for interpeptide crosslinking using PapB, thioether insulin analogs were synthesized. The results showing the crosslinked products can be found in FIG. 34 and FIG. 35.
6. ADDITIONAL STUDIES
[00523] Results showing the crosslinking that occurs in studies where the peptide sequence contains (S,E)-5-aminohex-2-enedioic acid are shown in FIG. 36. Tandem mass spectrometry results of dADo + msPapA adduct are shown in FIG. 37.
[00524] Studies demonstrating that thioether crosslinking occurs via PapB in peptides that contain a selenopeptide sequence were performed, and the results (mass spectrometry and EXAFS) are shown in FIG. 38. The tandem mass spectrometry results of C19U msPapA is shown in FIG. 39.
[00525] Donor and acceptor substitutions were explored, and the successfill crosslinking results are shown in FIG. 40 and FIG. 41. However, it was found that interchanging the position of the model system C and D residues resulted in no crosslinking (FIG. 42).
[00526] Studies were then conducted to explore the nature of the electron transfer reaction occurring during modification. The results are shown in FIG. 43, and demonstrate that the system is active in both tested reduction systems, with only loss of activity when flavodoxin mononucleotide is removed from the reducing system control series. Following
these results, “prereduced” PapB studies were carried out to characterize the turnover in the absence of reductant. These results are shown in FIG. 44- FIG. 46.
[00527] A concept schematic for a bioreactor setup for peptide modification via PapB is shown in FIG. 47.
7. PAPB TOLERATES C-TERMINAL GLYCINE
[00528] The sequence of leader-CX3G for recognition and crosslinking was explored (FIG. 48A-B). Labeling experiments with deuterated glycine show the crosslinked peptide with a corresponding loss of 3 Da indicating that the thioether crosslink occurred on the carbon adjacent to the carboxylic acid (FIG. 49A-B).
[00529] However, the sequence of leader-CX3(CH2)C(O)NH2 did not demonstrate thioether crosslinking (FIG. 50A-B). The loading, synthesis, and cleavage of the C-terminal glycine carboxamide peptide, which differs from that described earlier as to all alternative C- terminal carboxylate peptides, is detailed below.
[00530] Briefly, 150 mg of Rink Amide Resin was swelled in 5 mL of DMF for 1 h in a polyprep chromatography column. Nitrogen gas was then used to remove DMF from the column. The Fmoc protecting group on the resin was deprotected by adding 10 mL of 20% (v/v) of piperidine in DMF and rocking the resin for 1 h. Nitrogen gas was used to remove the piperidine mixture from the resin. The resin was then washed three times with 5 mL of DMF. Next, 0.03 mmole of Fmoc-Gly was weighed added to 0.15 mmole HATU and 0.15 mmole HOAt in a glass scintillation vial. 10 mL of 20% N-methylmorpholine in DMF was added to the vial and gently shaken to dissolve the materials. The mixture was then added to the deprotected resin and rocked at room temperature for 4 h. The amino acid solution was then pushed out of the column with nitrogen gas and washed three times with 5 mL of DMF. The resin was then capped with a 3:2 ratio of acetic anhydride and pyridine. 10 mL of the mixture was added to the column and the resin mixture was shaken for 30 m at room temperature. The capping solution was pushed out of the column with nitrogen gas and washed 3x with 5 mL of DMF. All Fmoc-amino acids in the peptide synthesis (0.15 mmol, 6 equivalents) were coupled by in situ activation with N-[(dimethylamino)- 1H-1 ,2,3-triazo[4,5- b]pyridin-1-ylmethylene]-N-methylmethanaminium hexafluorophosphate-N-oxide (HATU) (0.15 mmol, 6 equivalents; ChemPep) in 0.4 M N-methylmorpholine. After synthesis completion, the peptide was cleaved from the resin with 10 mL of 18:1:1 TFA:H2O:triisopropylsilane. The solution was rocked at room temperature for 1 h. The cleavage reaction was filtered into 30 mL of ice-cold diethyl ether to precipitate the peptide.
The solution was poured over a Buchner funnel filter and vacuumed to collect the peptide precipitate. The peptide dried on the vacuum for 15 min before being washed with 80 mL of ice-cold diethyl ether. After drying for an additional hour, the peptide was resuspended in 20 mL of water and sonified for 15 min to aid in the dissolution of the peptide. The solution was then flash frozen in liquid nitrogen and lyophilized.
[00531] After lyophilization, the peptide was resuspended in 0.05 M PIPES’NaOH (pH 7.4), 2 mM DTT, 300 mM KC1, and 15% glycerol buffer solution. The peptide was then assayed with PapB using the following parameters: 6.1 uM PapB, 100 uM msPapA C- terminal Gly carboxamide, 2 mM DTT, 2.1 mM SAM, and 15% glycerol in 100 uL total volume. The negative control (no PapB) and overnight complete assay (+ PapB) were quenched by adding 11 uL of 30% (w/v) TCA to the mixture. The quenched assays were spun at 16,000 xg for 10 m to pellet any precipitated enzyme or PIPES. The assays were analyzed using a Vanquish UHPLC with a diode-array detector connected to a Q-Exactive mass spectrometer operated in positive ion mode, the FT analyzer was set to 70,000 resolution, 1 microscan, and 200 ms maximum injection time. Xcalibur software was used to analyze data. A 20 uL aliquot was injected onto a Hypersil GOLD C18 column (msPapA) (2.1 mm x 150 mm, 1.9 μm particle size) (Thermo Fisher) pre-equilibrated in 0.1% (v/v) LC- MS Optima TFA (Fisher) in LC-MS Optima water (Fisher). Chromatographic steps were carried out at 0.2 mL/min with buffer A containing 0.1% (v/v) TFA in Optima water and buffer B containing Optima grade acetonitrile with 0.1% (v/v) TFA. The separation consisted of washing with 100% A from 0 to 3 min, followed by a linear gradient from 100% to 0% A from 3 to 6 min, washing with 0% A from 6 to 10 min, and reequilibration with 100% A from 10 to 14 min.
8. PAPB TOLERATES C-TERMINAL β-AMINO ACIDS
[00532] The sequence of leader-CX3(β-amino acids) for recognition and crosslinking was explored (FIG. 51A). The sequence of leader-CX3(L-3 -aminobutyric acid) was probed and treatment with PapB resulted in loss of 2Da indicating the thioether crosslinked product (FIG. 51B-C)
[00533] Alternative β-amino acids were explored to probe the sequence requirements for recognition and crosslinking. The sequences of leader-CX3(3-amino-2,2- dimethylbutanoicacid) (FIG. 52 A) and leader-CX3((R)-3-amino-2-methylpropanoic acid) (FIG. 52B) did not show evidence of a 2Da loss, which indicated that the thioether crosslinked product was not formed.
[00534] The sequence of leader-CX3 ((S)-3-amino-2-methylpropanoic acid) (FIG.
52C) did show evidence of a 2Da loss (FIG. 52D), which gives indication that the thioether crosslinked product was formed. Without wishing to be bound by theory, the culmination of these selective methylations on a C-terminal B-amino acid demonstrates that only one position is not amenable to substitution. The position alpha to the carboxylate must contain an H-atom in the pro-R position. The reference point to the pro-R position described is a singly methylated alpha-to-the-carboxylate moiety. In any scenario where the absolute stereochemistry priority may shift due to a substituent priority change, the scenario of the sidechains must be compared to that singly methylated case.
[00535] The sequence of leader-CX3(β-amino acids) was further explored utilizing analogs representative of natural amino acids (FIG. 53). A representative analog with B- tryptophan (FIG. 54A) did show evidence of a 2Da loss (FIG. 54B), which gives indication that the thioether crosslinked product was formed.
[00536] The sequence of leader-CX3(β-amino acids) was further explored by probing the reaction with N-methylated β-amino acids (FIG. 55A-B). A representative analog with sequence of leader-CX3(2-methyl-3-(methylamino)propanoic acid) was treated under reaction conditions with PapB (FIG. 55C). The reaction showed evidence of a 2Da loss (FIG. 55D), which gives indication that the thioether crosslinked product was formed.
9. PAPB TOLERATES C-TERMINAL D-AMIXO ACIDS
[00537] To assess the requirements of the C-terminal carboxylic acid D and L variants, Leader-CSADLA and Leader-CSADDA were prepared and incubated with PapB (FIG. 56A- D) In the case of L-alanine (FIG. 56A) a loss of 2 Da was not observed indicating that thioether crosslinked product was not formed (FIG. 56B). In the case of D-alanine (FIG. 56C) a loss of 2 Da was observed indicating that thioether crosslinked product was formed (FIG. 56D).
[00538] Further exploration with deuterated D-alanine at the carbon adjacent to the carboxylic acid incorporated into the leader-CSADDA sequence (FIG. 57A) showed evidence of a 3 Da loss, which indicated that the thioether crosslinked product was formed adjacent to the carboxylic acid (FIG. 57B). Similarly, deuterated D-methionine at the carbon adjacent to the carboxylic acid leader-CSADDM sequence (FIG. 58A) showed evidence of a 3 Da loss, which indicated that the thioether crosslinked product was formed adjacent to the carboxylic acid (FIG. 58B).
[00539] However, deuterated D-valine at the carbon adjacent to the carboxylic acid leader-CSADDV sequence (FIG. 59A) showed evidence of a 2 Da loss, which indicated that the thioether crosslinked product was formed (FIG. 59B) but the thioether crosslinked product was not at the carbon adjacent to the carboxylic acid. A deuterated D-valine at the tertiary sidechain carbon leader-CSADDV sequence (FIG. 60A) showed evidence of a 3 Da loss (FIG. 60B), which indicated that the thioether crosslinked product was formed at the tertiary carbon
[00540] Deuterium labeled phenyl alanine at the carbon adjacent to the carboxylic acid was incorporated into leader-CSADDF sequence (FIG. 61A) and incubated with PapB. The product showed evidence of a 2 Da loss (FIG. 61B). This is again indicative of successful thioether crosslinked product but the thioether crosslinked product was not at the carbon adjacent to the carboxylic acid. Similarly, phenyl alanine d5 was incorporated into leader- CSADDF sequence (FIG. 61C) and incubated with PapB. The product showed evidence of a 2 Da loss (FIG. 61D). This is again indicative of successful thioether crosslinked product but the thioether crosslinked product was not formed with the aromatic group.
[00541] Phenyl alanine d8 was incorporated into leader-CSADDF sequence (FIG. 62A) and incubated with PapB. The product showed evidence of a 3 Da loss (FIG. 62B). This is indicative of successful thioether crosslinked product to the methylene of the phenylalanine side chain.
[00542] The sequence of CX3(C-terminal D-amino acids) was further explored utilizing analogs representative of natural amino acids to form sactipeptides (FIG. 63). [00543] The sequence of CX3(C-terminal D-amino acids) of certain amino acids provided ranthipeptides (FIG. 64).
10. PAPB TOLERATES NON PEPTIDE ANALOGS
[00544] The rings size of non-peptide sequences was explored for carbon linked analogs (FIG. 65-70). In the case of Leader-CG, no crosslinking is observed (FIG. 65A). This places a lower limit on amenable crosslink formation (FIG. 65B). In the case of Leader- (hCys)-(Gly) (FIG. 66A), a 2 Da loss is observed in the mass spectrum upon reaction with PapB (FIG. 67B). Similarly, with Leader-(Cys)-(B-Ala) (FIG. 67A), thioether crosslinking is observed upon addition of PapB (67B). The smallest ring observed is a 7-membered ring. Extensions of a single CH2 group on either the thiol- or carboxylate-containing moiety from the baseline Leader-CG scenario resulted in crosslinking. Further, the ring size can be expanded with additional CH2 groups. See FIG. 68A (Leader-hCys-βAla) for the sequence
and FIG. 68B for the representative mass spectra illustrating a 2 Da loss, FIG. 69A (Leader- Cys-gamma amino butyric acid) for the sequence and FIG. 69B for the representative mass spectra illustrating a 2 Da loss, and FIG. 70A (Leader-hCys-gamma amino butyric acid) for the sequence and FIG. 70B for the representative mass spectra illustrating a 2 Da loss. [00545] The maximum ring size of non-peptide sequences was explored using PEG- based moieties terminating in a carboxylate (FIG. 71-72). In the case of 3x PEGylation (FIG. 71A), thioether crosslinking is observed upon addition of PapB (FIG. 71B). 4x PEGylation (FIG. 72A) shows thioether formation upon addition of PapB (FIG. 72B), thioether formation is observed.
[00546] Alternate scaffolds were also explored (FIG. 73-75) to establish that both aromatics and heterocycles can be included in the thioether macrocycle. FIG. 73A demonstrates the structures of a peptide chain containing a substituted aniline in the peptide backbone and the reacted thioether macrocycle. FIG. 73B shows the mass spectra of the unreacted (top, without PapB) and reacted (bottom, with PapB) aniline-containing peptide chain. FIG. 74A demonstrates the structures of a peptide chain containing a substituted benzylamine in the peptide and the reacted thioether macrocycle. FIG. 74B shows the mass spectra of the reacted benzylamine-containing peptide chain. FIG. 75A show the structures of a modified courmarin-containing peptide. The coumarin-like moiety is the most C- terminal aspect of the peptide. FIG. 75B demonstrates the mass spectra of the coumarin- containing peptide both unreacted (top, without PapB) and reacted (bottom, with PapB).
11. PAPB UTILIZED TO PREPARE THIOETHER CROSSLINKED PEPTIDOMIMETICS
[00547] Setmalanotide, an MC4R agonist, is an FDA-approved drug indicated for chronic weight management (FIG. 76A). A similar peptidomimetic analog based on the PapB reaction to form thioether crosslinked products was envisioned (FIG. 76B). The sequence of Leader-CDDAHDDFRWX (FIG. 76C, where X = 13- Ala) was explored. The reaction showed evidence of a 2Da loss (FIG. 76D), which gives indication that the thioether crosslinked product was formed.
[00548] An orally available crosslinked thioether peptidiomimetic was recently disclosed (J. Med. Chem. 2021, 64, 5, 2622-2633) (FIG. 77A) A similar peptidomimetic analog based on the disclosed PapB reaction to formed thioether crosslinked products was envisioned (FIG. 77B). The sequence of Leader-CBXBXF (FIG. 77C, where B = norleucine, X = N-methyl norleucine) was explored. The reaction showed evidence of a 2Da loss (FIG. 77D), which gives indication that the thioether crosslinked product was formed.
[00549] Bremelanotide, an agonist of MCI R, MC4R, MC3R, MC5R, and MC2R, is an
FDA-approved drug that treats hypoactive sexual desire in premenopausal women. A similar peptidomimetic analog with the sequence Leader-BCDFRWZ (where B = norleucine, and Z = ε-ACP) was generated (FIG. 78A). The envisioned thioether crosslinked analog is shown in FIG. 78B. The PapB transformation is shown in FIG. 78C. The mass spectra showing the reaction both in the absence (top) and presence (bottom) of PapB is shown in FIG. 78D. The proposed thioether analog is shown in FIG. 78B.
12. PAPB CROSSLINKS EXTENDED SIDECHAINS OF THIOL- AND CARBOXYLATE-
CONTAINING RESIDUES.
[00550] Previous studies have demonstrated that PapB can tolerate extended sidechains of the acidic residue, as both CX3D and CX3E sequences are crosslinked. PapB forms crosslinks in msPapA with a homocysteine (hCys) substitution at position 19 and an Asp in position 23, C19hCys and D23E, and C19hCys and homoglutamate (hGlu) at position 23 (FIG. 79A). MS analysis confirmed that PapB catalyzes formation of a crosslink in peptides containing hCys at position 19 and Asp (FIG. 79B), hCys at position 19 and Glu (FIG. 79C), or hCys at position 19 and hGlu at position 23 (FIG. 79D). Previous results demonstrated that changing either the donor or acceptor residue was amenable, however in the case of D-amino acids, little crosslinking was seen when both residues were altered simultaneously. Changing both the donor and acceptor residue and observing efficient substrate conversion is uncommon in RiPP maturation. The MS analysis of the reaction products show that each substrate undergoes crosslinking, as evidenced by a 2 Da loss relative to the substrate due to the loss of one H from the hCys thiol and a second H from the sidechain of the carboxylate-containing sidechain at residue 23. As a proof of concept for the use of PapB in generating macrocyclized peptides, in the case of C19hCys/D23hGlu variant, the leader sequence has been shown to be cleaved to generate the macrocyclized core peptide (FIG. 79E).
[00551] The observation of a 2 Da shift, when taken together with the MS/MS data and the loss of the lAM-sensitivity clearly shows that a thioether crosslink has formed. However, further confirmation that as with the wildtype substrate, the crosslink was formed at the carbon atom that is alpha to the carboxylate of the sidechain was required. To this end, either unmodified or crosslinked C19hCys/D23hGlu was treated with TEV protease to liberate the modified core from the leader peptide, purified by HPLC, and subjected to both one- and two-dimensional NMR analysis. The ID NMR spectrum of the peptide prior to the treatment
with PapB reveals a resonance at 2.41 ppm, which is composed of a doublet of triplets integrating to two protons (FIG. 80). This feature can reasonably be assigned to Hε of hGlu. In the NMR spectrum of the treated peptide (FIG. 81), this resonance is absent and a new triplet at 3.34 ppm integrating to a single hydrogen is observed. This new resonance is consistent with thioether installation at the position alpha to the carboxylate in the hGlu sidechain. To further correlate the positions, the modified and unmodified peptides were subjected to ROESY analysis to establish through-space correlations of protons. In the unmodified peptide, the resonance at 2.41 ppm is coupled to resonances at 1.66, 1.75, and 1.88 ppm, corresponding to through-space coupling of Hε in the hGlu sidechain to Hy and Hβ of hGlu, respectively (see FIG. 82). In the modified peptide, the new resonance at 3.25 ppm is coupled to resonances at 1.8- 1.9 ppm (see FIG. 83). The comparison of the linear and the cyclized peptide ID spectra show a ppm shift of ~0.2 in the resonances between 1.6 and 2.0, which would make the coupled resonances in the modified peptide ROESY spectra consistent with a through-space coupling of Hε of the hGlu sidechain to HB and Hy of hGlu. Additionally, a weak coupling between 3.25 to 2.10 and 2.79-2.98 is observed. These cross peaks are consistent with Hε of the hGlu sidechain coupling with HB (2.10 ppm) and Hy (2.79-2.98) of the hCys sidechain. Without wishing to be bound by theory, these results demonstrate that PapB can crosslink extended sidechains of thiol- and carboxylate-containing residues.
[00552] While various levels of promiscuity have been reported in the rSAM RiPP field, changing both the identity of both the donor and acceptor residues and forming a chemically consistent product is unprecedented. These findings expand the understanding of the capabilities of PapB and have implications for the biosynthesis of RiPPs that contain such extended sidechains.
13. CROSSLINKING WITH A TETRAZOLE MOIETY: IMPLICATIONS FOR PEPTIDE-
BASED THERAPIES
[00553] While the data with hCys, hGlu, and D-amino acid containing peptides all suggest a high level of tolerance in PapB for various substrates, the examples shown are limited by the fact that they all contain a thiol- and carboxylate-containing amino acid. It has been demonstrated herein that selenocysteine peptides are processed by PapB, but no examples of an isostere of a carboxylate moiety being processed by an rSAM RiPP maturase currently exist. Tetrazole moieties are commonly used as a bioisostere of carboxylic acids in small-molecule drug development. Tetrazoles improve the bioavailability of drugs, increase
their lipophilicity, and reduce side-effects when compared to carboxylate-containing compounds. This is due to the metabolic stability of tetrazole moieties — metabolic transformations of carboxylic acids are driven in part by microsomes of the liver, many of which are evaded by using a tetrazole isostere. The tetrazole pharmacophore has been used in a variety of drug classes, including nonsteroidal anti-inflammatory drugs, angiotensin receptor blockers, and proton pump inhibitors.
[00554] (2H-tetrazol-5-yl)propanoic acid (T4Az) was incorporated into msPapA (D23T4Az) by SPPS and incubated it with PapB to determine if the carboxylate-containing amino acid of msPapA can be replaced with the isosteric tetrazole-containing amino acid. The structures of the linear and cyclized peptides are shown in FIG. 84A. Upon reaction with PapB, a 2 Da shift is clearly observed (FIG. 84B), suggesting formation of a crosslink. The MS/MS spectra of D23T4Az msPapA reveals no fragmentation between the Cys and T4Az residue (FIG. 84C), but the anticipated 2 Da loss is observed in the b-ions after T4Az, and in the y-ions after the Cys residue (see FIG. 85for the MS/MS spectra and all found fragments). The regiochemistry for crosslink is unconfirmed, however, several unusual peaks are present in the MS/MS that are potentially informative. For example, peaks are observed (FIG. 84C, y ’ and b') that are consistent with loss of the tetrazole sidechain moiety (see also FIG. 86 for additional y and b -fragments). Without wishing to be bound by theory, these fragments suggest that the thioether forms alpha to the tetrazole.
[00555] This data with the tetrazole analog is the first-demonstrated ability of rSAM RiPP maturases to crosslink with a tetrazole moiety, opening new avenues for the development of peptide-based therapeutics. By using the tetrazole moiety in place of a carboxylate, the metabolic stability and pharmacokinetic properties of potential peptide therapeutics can be improved. This finding greatly expands the scope of rSAM RiPP maturases.
14. REFERENCES
[00556] (1) Van der Donk, W. A.; Bindman, N. A. Nat. Prod.: Discourse, Delivery, and Design, John Wiley & Sons: Oxford, 2014; pp 197-218.
[00557] (2) Sberro, H.; Fremin, B. J.; Zlitni, S.; Edfors, F.; Greenfield, N.; Snyder, M.
P.; Pavlopoulos, G. A.; Kyrpides, N. C.; Bhatt, A. S. Large-Scale Analyses of Human Microbiomes Reveal Thousands of Small, Novel Genes. Cell. 2019, 178 (5), 1245-1259.
[00558] (3) Hetrick, K. J.; van der Donk, W. A. Ribosomally synthesized and post- translationally modified peptide natural product discovery in the genomic era. Curr. Opin. Chem. Biol. 2017, 38, 36-44.
[00559] (4) Siissmuth, R. D.; Mainz, A. Nonribosotnal Peptide Synthesis — Principles and Prospects Angew. Chem. Int. Ed. Engl. 2017, 56, 3770.
[00560] (5) deGruyter, J. N.; Malins, L. R.; Baran, P. S. Residue-Specific Peptide
Modifications: A Chemist’s Guide. Biochem. 2017, 56 (30), 3863-3873.
[00561] (6) Benjdia, A.; Baity, C..; Berteau, O. Radical SAM Enzymes in the
Biosynthesis of Ribosomally Synthesized and Post-translationally Modified Peptides (RiPPs). Front. Chem. 2017, 5 (87).
[00562] (7) Sofia, H. J.; Chen, G.; Hetzler, B. G.; Reyes-Spindola, J.F., Miller, N.E.
Radical SAM, a novel protein superfamily linking unresolved steps in familiar biosynthetic pathways with radical mechanisms: functional characterization using new analysis and information visualization methods. Nucleic Acids Res. 2001, 29 (5), 1097-1106.
[00563] (8) Duffus, B. J.; Duschene, K. S.; Shepard, E. M.; Broderick, J. B. Radical
S-Adenosylmethionine Enzymes. Chem. Rev. 2014, 114 (8), 4229-4317.
[00564] (9) Wecksler, S. R.; Stoll, S.; Tran, H.; Magnusson, O. T.; Wu, S. P.; King,
D.; et al. Pyrroloquinoline quinone biogenesis: demonstration that PqqE from Klebsiella pneumoniae is a radical S-adenosyl-L-methionine enzyme. Biochem. 2009, 48, 10151-10161.
[00565] (10) Schramma, K. R.; Bushin, L. B.; and Seyedsayamdost, M. R. Structure and biosynthesis of a macrocyclic peptide containing an unprecedented lysine-to-tryptophan crosslink. Nat. Chem. 2015, 7, 431—437.
[00566] (11) Khaliullin, B.; Aggarwal, P.; Bubas, M.; Eaton, G. R.; Eaton, S. S.;
Latham, J. A. Mycofactocin biosynthesis: modification of the peptide MftA by the radical S- adenosylmethionine protein MftC. FEBSLett. 2016, 590, 2538-2548.
[00567] (12) Bushin, L. B.; Clark, K. A.; Pelczer, I.; and Seyedsayamdost, M. R.
Charting an Unexplored Streptococcal Biosynthetic Landscape Reveals a Unique Peptide Cyclization Motif. J. Am. Chem. Soc. 2018, 140, 17674-17684.
[00568] (13) Imai, Y.; Meyer, K. J.; linishi, A.; Favre-Godal, Q.; Green, R.; Manuse,
S.; et al. A New Antibiotic Selectively Kills Gram-Negative Pathogens. Nature 2019, 576, 459-464.
[00569] (14) Clark, K. A.; Bushin, L. B.; and Seyedsayamdost, M. R. Aliphatic Ether
Bond Formation Expands the Scope of Radical SAM Enzymes in Natural Product Biosynthesis. J. Am. Chem. Soc. 2019, 141, 10610-10615.
[00570] (15) Fluhe, L.; Burghaus, O.; Wieckowski, B. M.; Giessen, T. W.; Linne, U.; and Marahiel, M. A. (2013). Two [4Fe-4S] Clusters Containing Radical SAM Enzyme SkfB Catalyze Thioether Bond Formation during the Maturation of the Sporulation Killing Factor. J. Am. Chem. Soc. 2013, 135, 959-962.
[00571] (16) Baity, C.; Guillot, A.; Fradale, L.; Brewee, C.; Boulay, M.; Kubiak, X.; et al. Ruminococcin C, an Anti-clostridial Sactipeptide Produced by a Prominent Member of the Human Microbiota Ruminococcus Gnavus. J. Biol. Chem. 2019, 294, 14512-14525.
[00572] (17) Caruso, A., Bushin, L. B., Clark, K. A., Martinie, R. J., and
Seyedsayamdost, M. R. Radical Approach to Enzymatic β-Thioether Bond Formation. J. Am. Chem. Soc. 2019, 141, 990-997.
[00573] (18) Hudson, G. A.; Burkhart, B. J.; DiCaprio, A. J.; Schwalen, C. J.; Kille,
B.; Pogorelev, T. V.; Mitchell, D. A. Bioinformatic Mapping of S-Adenosylmethionine- Dependent Ribosomally Synthesized and Post-Translationally Modified Peptides Identifies new Cα, Cβ, or Cγ-linked Thioether-Containing Peptides. J. Am. Chem. Soc. 2019, 141, 8228-8238.
[00574] (19) Bmender, N. A.; Wilcoxen, J.; Britt, R. D.; and Bandarian, V.
Biochemical and Spectroscopic Characterization of a Radical S-Adenosyl-L-Methionine Enzyme Involved in the Formation of a Peptide Thioether Cross-Link. Biochem. 2016, 55, 2122-2134.
[00575] (20) Freeman, M. F.; Gurgui, C.; Helf, M. J.; Morinaka, B. I.; Uria, A. R.;
Oldham, N. J.; et al. Metagenome Mining Reveals Polytheonamides as Posttranslationally Modified Ribosomal Peptides. Science 2012, 338, 387-390.
[00576] (21) Vagstad, A. L.; Kuranaga, T.; Piintener, S.; Pattabiraman, V. R.; Bode,
J. W.; and Piel, J. (2019). Introduction of D -Amino Acids in Minimalistic Peptide Substrates by an S-Adenosyl-L-Methionine Radical Epimerase. Angew. Chem., Int. Ed. 2019, 58, 2246- 2250.
[00577] (22) Popp, P. F.; Friebel, A. L.; Benjdia, A.; Guillot, A.; Berteau, O.; and
Mascher, T. (2021). The Epipeptide Biosynthesis Locus epeXEPAB Is Widely Distributed in Firmicutes and Triggers Intrinsic Cell Envelope Stress. Microb. Physiol. 2021, 1-12.
[00578] (23) Lewis, J. K.; Jochimsen, A. S.; Lefave, S. J.; Young, A. P.; Kincannon,
W. M.; Roberts, A. G.; Kieber-Emmons, M. T.; Bandarian V. New Role for Radical SAM Enzymes in the Biosynthesis of Thio(seleno)oxazole RiPP Natural Products. Biochem. 2021, 60 (45), 3347-3361.
[00579] (24) Precord, T. W.; Mahanata, N.; Mitchell, D. A. Reconstitution and
Substrate Specificity of the Thioether-Forming Radical S-Adenosylmethionine Enzyme in Freyrasin Biosynthesis. ACS Chem. Biol. 2019, 14 (9), 1981-89.
[00580] (25) Outten, F.W.; Djaman, O.; Storz, G. A suf operon requirement for Fe-S cluster assembly during iron starvation in Escherichia coli. Mol. Microbio. 2004, 52, 861- 872.
[00581] (26) Wollers, S.; Layer, G.; Garcia-Serres, R.; Signor, L.; Clemancey, M.;
Latour, J. M.; Fontecave, M.; Ollagnier de Choudens, S. Iron-sulfur (Fe-S) cluster assembly: the SufBCD complex is a new type of Fe-S scaffold with a flavin redox cofactor. J. Biol. Chem. 2010, 285 (30), 23331-23341.
[00582] (27) Rea, M. C.; Sit, C. S.; Clayton, E.; O’Connor, P. M.; Whittai, R. M.;
Zheng, J.; Vederas, J. C.; Ross, R. P.; Hill, C. Thuricin CD, a posttranslationally modified bacteriocin with a narrow spectrum of activity against Clostridium difficile. Proc. Natl. Acad. Sci. U. S. A. 2010, 707 (20), 9352-9357.
[00583] (28) Lohans, C. T.; Vederas, J. C. Structural characterization of thioether- bridged bacteriocins. J. Antibiot. 2014, 67, 23-30.
[00584] (29) King, A.M.; Anderson, D.A.; Glassey, E.; Segall-Shapiro, T. H.; Zhang,
Z.; Niquille, D. L.; Embree, A. C.; Pratt, K.; Williams, T. L.; Gordon, D. B.; Voigt, C. A. Selection for constrained peptides that bind to a single target protein. Nat. Commun. 2021, 72, 6343.
[00585] (30) Lamberts, S. W. J.; Hofland, L. J. Octreotide, 40 years later. Eur. J.
Endocrinol. 2019, 181, R173-R183.
[00586] (31) Muttenthaler, M.; King, G. F.; Adams, D. J.; Alewood, P. F. Trends in peptide drug discovery. Nat. Rev. Drug Discov. 2021, 20, 309-325.
[00587] (32) Kapust, R.B.; Tozser, J.; Copeland, T. D.; Waugh, D. S. The Pl’ specificity of tobacco etch virus protease. Biochem. Biophy. Res. Commun. 2002, 294 (5), 949-955.
[00588] (33) Phan, J.; Zdanov, A.; Evdokimov, A. G.; Tropea, J. E.; Peters, H. K.;
Kapust, R. B.; Li, M.; Wlodawer, A.; Waugh, D. S. Structural basis for the substrate specificity of tobacco etch vims protease. J. Biol. Chem. 2002, 277 (52), 50564-60672.
[00589] (34) Grove, T. L.; Himes, P. M.; Hwang, S.; Yumerefendi, H.; Bonanno, J. B.;
Kuhlman, B.; Almo, S. C.; Bowers, A. B. Structural Insights into Thioether Bond Formation in the Biosynthesis of Sactipeptides. J. Am. Chem. Soc. 2017, 139 (34), 11734-11744.
[00590] (35) Lanz, N. D. Booker, S. J. Identification and function of auxiliary iron- sulfur clusters in radical SAM enzymes. Biochim. Biophys. Acta. 2012, 1824 (11), 1196- 1212.
[00591] (36) Grell, T. A. J., Goldman, P. J., Drennan, C. L. SPASM and Twitch
Domains in S-Adenosylmethionine (SAM) Radical Enzymes. J. Biol. Chem. 2015, 290 (T), 3964-3971.
[00592] (37) Haft, D. H., Basu, M. K. Biological systems discovery in silico: radical S- adenosylmethionine protein families and their target peptides for posttranslational modification. J. Bacterial. 2011, 193 (11), 2745-2755.
[00593] (38) Grell, T. A. J., Kincannon, W. M., Bruender, N. A., Blaesi, E. J., Krebs,
C., Bandarian, V., Drennan, C. L. Structural and spectroscopic analyses of the sporulation killing factor biosynthetic enzyme SkfB, a bacterial AdoMet radical sactisynthase. J. Biol. Chem. 2018, 293 (45),17349-17361.
[00594] (39) Burkhart, B. J.; Kakkar, N.; Hudson, G. A.; van der Donk, W. A.;
Mitchell, D. A. Chimeric Leader Peptides for the Generation of Non-Natural Hybrid RiPP Products. ACS Cent. Sei. 2017, 3 (6), 629-638.
[00595] (40) Himes, P. M.; Allen, S. E.; Hwang, S.; Bowers, A. A. Production of
Sactipeptides in Escherichia coli; Probing the Substrate Promiscuity of Subtilosin A Biosynthesis. ACS Chem. Biol. 2016, 77 (6), 1737-1744.
[00596] (41) Vagstad, A. L.; Kuranaga, T.; Piintener, S.; Pattabiraman, V. R.; Bode, J.
W.; Piel, J. Introduction of D-Amino Acids in Minimalistic Peptide Substrates by an S- Adenosyl-L-Methionine Radical Epimerase. Angew. Chem., Int. Ed. 2018, 58, 2246-2250. [00597] (42) Komeli, M.; Fuchs, S. W.; Felder, K.; Ernst, C.; Zinsli, L. V.; Piel, J.
Promiscuous Installation of D-Amino Acids in Gene-Encoded Peptides. ACS Synth. Biol.
2021, 70 (2), 236-242.
[00598] (43) Chen, D.; Ruzicka, F. J.; Frey, P. A. A novel lysine 2,3 -aminomutase encoded by the yodO gene of Bacillus subtilis; characterization and the observation of organic radical intermediates. Biochem. J. 2000, 348, 539-549.
[00599] (44) Crain, A. V.; Broderick, J. B. Pyruvate Formate-lyase and Its Activation by Pyruvate Formate-lyase Activating Enzyme. J. Biol. Chem. 2014, 289 (9), 5723-5729. [00600] It will be apparent to those skilled in the art that various modifications and variations can be made in the present invention without departing from the scope or spirit of the invention. Other embodiments of the invention will be apparent to those skilled in the art from consideration of the specification and practice of the invention disclosed herein. It is
intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the invention being indicated by the following claims.
Claims
1. A method of chemically modifying a compound to install a thioether linkage, the method comprising reacting the compound with PapB, wherein the compound has a structure represented by a formula:
wherein o is 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein p is 1 or 2; wherein t is an integer from 0 to 500; wherein v is 1 , 2, 3, 4, or 5; wherein A is S or Se; wherein Q1 is a leader sequence; wherein Q2 is a cleavable moiety; wherein R1 is selected from -CO2H, -C(O)NHOH, -SO2NH2, -SO2NHC(O)CH3, -SO3H, NHC(O)NHSO2CH3, P(O)(OH)2, and a structure selected from:
wherein R4 is selected from hydrogen and methyl; wherein each occurrence of R5 and R5’, when present, is independently a residue of a side chain of amino acid; wherein each occurrence of R6 and R6 , when present, is independently selected from hydrogen and methyl, or wherein R6 or R6’ is covalently bonded to R5 or R5’, respectively, and, together with the intermediate atoms, comprise an unsubstituted 5-membered heterocycle; wherein each of R7a and R7b, when present, is independently selected from hydrogen and
C1-C4 alkyl; and wherein R8 is selected from hydrogen and methyl, provided that the compound is not PapA.
2. The method of claim 1, wherein 0 is independently 0, 1, 2, 3, 4, 5, 6, or 7.
3. The method of claim 1 or claim 2, wherein t is 0.
4. The method of any one of claims 1 to 3, wherein v is 1 or 2.
6. The method of any one of claims 1 to 4, wherein R1 is CO2H.
7. The method of any one of claims 1 to 6, wherein the cleavable moiety is -CO2-(C4-
C8 alkylene)-OC(O)-.
8. The method of any one of claims 1 to 6, wherein the cleavable moiety is CO2CH2CH=CHCH2OC(O)-.
9. The method of any one of claims 1 to 6, wherein the cleavable moiety is a protease recognition sequence.
10. The method of claim 9, wherein the protease recognition sequence is TEV recognition sequence.
11. The method of any one of claims 1 to 6, wherein the compound comprises one or more D-amino acid residues.
12. The method of any one of claims 1 to 11, wherein the compound comprises one or more p-amino acid residues.
13. The method of any one of claims 1 to 12, wherein the compound comprises one or more N-methylated amino acid residues.
14. The method of any one of claims 1 to 13, wherein PapB installs a single thioether linkage in the compound.
15. The method of any one of claims 1 to 14, wherein PapB installs two or more thioether linkages in the compound.
17. The method of claim 1, wherein the compound has a structure represented by a formula:
19. The method of any one of claims 1 to 18, wherein the method further comprises addition of a reducing agent.
20. The method of claim 19, wherein the method further comprises addition of a protease.
22. The method of claim 21, wherein the thioether compound is selected from:
23. A thioether compound produced by the method of any one of claims 20 to 22.
24. A thioether compound produced by the method of any one of claims 1 to 19.
25. A method of chemically modifying a compound to install a thioether linkage, the method comprising reacting the compound with PapB, wherein the compound has a structure represented by a formula:
wherein 0 is 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein p is 1 or 2; wherein t is an integer from 0 to 500; wherein v is 1, 2, 3, 4, or 5; wherein A is S or Se; wherein R1 is selected from C02H, C(O)NHOH, SO2NH2, SO2NHC(O)CH3, SO3H, -NHC(O)NHSO2CH3, -P(O)(OH)2, and a structure selected from:
wherein R4 is selected from hydrogen and methyl; wherein each occurrence of R5 and R5 , when present, is independently a residue of a side chain of amino acid; wherein each occurrence of R6 and R6’, when present, is independently selected from hydrogen and methyl, or wherein R6 or R6’ is covalently bonded to R5 or R5’, respectively, and, together with the intermediate atoms, comprise an unsubstituted 5-membered heterocycle; wherein each of R7a and R7b, when present, is independently selected from hydrogen and
C1-C4 alkyl; and wherein R8 is selected from hydrogen and methyl, provided that the compound is not PapA.
26. A method of chemically modifying a compound to install a thioether linkage, the method comprising reacting the compound with PapB, wherein the compound has a structure represented by a formula:
wherein m is 0, 1, 2, 3, or 4; wherein n is 0 or 1 ; wherein each of o and o’ is independently 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein p is 1 or 2; wherein A is S or Se; wherein L, when present, is selected from C2-C4 alkyl, -(C1-C4 alkyl)(OCH2CH2)q, and a structure selected from:
wherein q is 1, 2, 3, or 4; wherein Q1 is a leader sequence; wherein Q2 is a cleavable moiety; wherein R1 is selected from -CO2H, -C(O)NHOH, -SO2NH2, -SO2NHC(O)CH3, -SO3H, -NHC(O)NHSO2CH3, -P(O)(OH)2, and a structure selected from:
wherein R2 is a residue of a side chain of amino acid, provided that the amino acid is not isoleucine or threonine; wherein each of R3a and R3b, when present, is independently selected from C2-C5 alkynyl, C1-C5 azido, and a residue of a side chain of an amino acid; wherein R4 is selected from hydrogen and methyl;
wherein each occurrence of R5 and R5 , when present, is independently a residue of a side chain of amino acid; wherein each occurrence of R6 and R6’, when present, is independently selected from hydrogen and methyl, or wherein R6 or R6’ is covalently bonded to R5 or R5’, respectively, and, together with the intermediate atoms, comprise an unsubstituted 5-membered heterocycle; wherein each of R7a and R7b, when present, is independently selected from hydrogen and
C1-C4 alkyl, provided that the compound is not PapA.
27. The method of claim 26, wherein m is 0.
28. The method of claim 26 or claim 27, wherein m is 1.
29. The method of any one of claims 26 to 28, wherein n is 0.
30. The method of any one of claims 26 to 28, wherein n is 1.
31. The method of any one of claims 26 to 30, wherein o is 0, 1, 2, 3, 4, 5, 6, or 7.
32. The method of any one of claims 26 to 30, wherein o is 1, 2, 3, 4, 5, 6, 7, 8, or 9.
33. The method of any one of claims 26 to 30, wherein o is 1, 2, 3, or 4.
34. The method of any one of claims 26 to 33, wherein p is 1.
35. The method of any one of claims 26 to 33, wherein p is 2.
36. The method of any one of claims 26 to 35, wherein A is S.
37. The method of any one of claims 26 to 35, wherein A is Se.
38. The method of any one of claims 26 to 37, wherein L is C2-C4 alkyl.
39. The method of any one of claims 26 to 37, wherein L is -(C1-C4 alkyl)(OCH2CH2)q.
40. The method of any one of claims 26 to 37, wherein L is a structure selected from:
41. The method of any one of claims 26 to 40, wherein the cleavable moiety is a protease recognition sequence.
42. The method of claim 41, wherein the protease recognition sequence is a TEV protease recognition sequence.
43. The method of claim 42, wherein the TEV protease recognition sequence is ENLYFQ (SEQ ID NO: 1).
44. The method of any one of claims 26 to 43, wherein the leader sequence is LKQINVIAGVKEPIRAYG (SEQ ID NO: 2) or LKQINVIAGVKPIRAYG (SEQ ID NO: 3).
45. The method of any one of claims 26 to 43, wherein the leader sequence is LKQINVIAGVKEPIRAYG (SEQ ID NO: 2).
47. The method of any one of claims 26 to 45, wherein R1 is -CO2H.
48. The method of any one of claims 26 to 47, wherein R2 is a residue of a side chain of an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
49. The method of any one of claims 26 to 47, wherein R2 is a residue of a side chain of an amino acid selected from alanine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, and glycine.
50. The method of any one of claims 26 to 49, wherein one of R3a and R3b, when present, is hydrogen, and one of R3a and R3b, when present, is a residue of a side chain of an amino
acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
51. The method of any one of claims 26 to 50, wherein R4 is hydrogen.
52. The method of any one of claims 26 to 50, wherein R4 is methyl.
53. The method of any one of claims 26 to 52, wherein each occurrence of R5, when present, is independently a residue of a side chain of an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
54. The method of any one of claims 26 to 53, wherein each occurrence of R6, when present, is hydrogen.
55. The method of any one of claims 26 to 53, wherein each occurrence of R6, when present, is methyl.
56. The method of any one of claims 26 to 55, each of R7a and R7b, when present, is hydrogen.
57. The method of any one of claims 26 to 55, each of R7a and R7b, when present, is methyl.
59. The method of claim 26, wherein the compound has a structure represented by a formula:
63. The method of claim 26, wherein the compound has a structure represented by a formula:
66. The method of claim 65, wherein o is 1, 2, 3, 4, 5, 6, 7, 8, or 9.
67. The method of claim 65 or claim 66, wherein one of R3a and R3b, when present, is hydrogen, and one of R3a and R3b, when present, is a residue of a side chain of an amino acid selected from alanine, valine, leucine, serine, cysteine, methionine, arginine, lysine, asparagine, glycine, phenylalanine, tyrosine, and tryptophan.
68. The method of claim 26, wherein the compound has a structure represented by a formula:
72. The method of claim 71, wherein o is 1, 2, 3, 4, 5, 6, 7, 8, or 9.
76. The method of any one of claims 26 to75, wherein PapB installs a single thioether
linkage in the compound.
77. The method of any one of claims 26 to 75, wherein PapB installs two or more thioether linkages in the compound.
89. The method of claim 78, wherein the thioether compound has a structure represented by a formula selected from:
90. The method of claim 78, wherein the thioether compound has a structure represented by a formula selected from:
91. The method of claim 78, wherein the thioether compound has a structure represented by a formula selected from:
92. The method of claim 78, wherein the thioether compound is a sactipeptide.
93. The method of claim 92, wherein the sactipeptide has a structure represented by a formula selected from:
94. The method of claim 78, wherein the thioether compound is a ranthipeptide.
95. The method of claim 94, wherein the ranthipeptide has a structure represented by a formula selected from:
96. The method of any one of claims 23 to 95, wherein the method further comprises addition of a reducing agent.
97. The method of claim 96, wherein the reducing agent comprises dithionite, flavodoxin, flavodoxin reductase, titanium citrate, reduced nicotinamide adenine dinucleotide phosphate, or any combination thereof.
98. The method of claim 96 or claim 97, wherein the method further comprises addition of a protease.
99. The method of claim 98, wherein the protease is TEV protease.
101. The method of claim 98 or claim 99, wherein the thioether compound has a structure represented by a formula:
111. The method of claim 98 or claim 99, wherein the thioether compound has a structure represented by a formula selected from:
112. The method of claim 98 or claim 99, wherein the thioether compound has a structure represented by a formula selected from:
113. The method of claim 98 or claim 99, wherein the thioether compound has a structure represented by a formula selected from:
114. The method of claim 98 or claim 99, wherein the thioether compound is a sactipeptide.
115. The method of claim 114, wherein the sactipeptide has a structure represented by a formula selected from:
116. The method of claim 98 or claim 99, wherein the thioether compound is a ranthipeptide.
118. The method of claim 98 or claim 99, wherein the thioether compound is selected from:
119. The method of claim 98 or claim 99, wherein the thioether compound is selected from:
120. A thioether compound produced by the method of any one of claims 98 to 119.
121. A thioether compound produced by the method of any one of claims 26 to 97.
122. A method of chemically modifying a compound to install a thioether linkage, the method comprising reacting the compound with PapB, wherein the compound has a structure represented by a formula:
wherein m is 0, 1, 2, 3, or 4;
wherein n is 0 or 1 ; wherein each of o and o’ is independently 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein p is 1 or 2; wherein A is S or Se; wherein L, when present, is selected from C2-C4 alkyl, -(C1-C4 alkyl)(OCH2CH2)q, and a structure selected from:
wherein q is 1, 2, 3, or 4; wherein R1 is selected from -CO2H, -C(O)NHOH, -SO2NH2, -SO2NHC(O)CH3, -SO3H, -NHC(O)NHSO2CH3, -P(O)(OH)2, and a structure selected from:
wherein R2 is a residue of a side chain of amino acid, provided that the amino acid is not isoleucine or threonine;
wherein each of R3a and R3b, when present, is independently selected from C2-C5 alkynyl, C1-C5 azido, and a residue of a side chain of an amino acid; wherein R4 is selected from hydrogen and methyl; wherein each occurrence of R5 and R5’, when present, is independently a residue of a side chain of amino acid; wherein each occurrence of R6 and R6 , when present, is independently selected from hydrogen and methyl, or wherein R6 or R6’ is covalently bonded to R5 or R5’, respectively, and, together with the intermediate atoms, comprise an unsubstituted 5-membered heterocycle; wherein each of R7a and R7b, when present, is independently selected from hydrogen and
C1-C4 alkyl, provided that the compound is not PapA.
123. A method of chemically modifying a peptide sequence to install a thioether linkage, the method comprising reacting the peptide sequence with PapB, wherein the peptide sequence comprises X-Yn-Z, wherein n is 0, 1, 2, 3, 4, 5, 6, 7, 8, or 9; wherein X is an amino acid residue comprising a -SH or -SeH group; wherein each occurrence of Y, when present, is independently an amino acid residue; and wherein Z is an amino acid residue that is carboxyl-functionalized or tetrazolyl- functionalized, provided that the peptide sequence is not PapA.
124. The method of claim 123, wherein n is 0, 1, 2, 3, 4, 5, 6, or 7.
125. The method of claim 123 or claim 124, wherein X is a cysteine residue, a homocysteine residue, a selenocysteine residue, a homoselenocysteine residue, or a
penicillamine residue.
126. The method of claim 123 or claim 124, wherein X is a cysteine residue.
127. The method of claim 123 or claim 124, wherein X is a D-amino acid residue.
128. The method of any one of claims 123 to 127, wherein at least one occurrence of Y is a D-amino acid residue.
129. The method of any one of claims 123 to 127, wherein at least one occurrence of Y is a β-amino acid residue.
130. The method of any one of claims 123 to 127, wherein at least one occurrence of Y is an N-methylated amino acid residue.
131. The method of any one of claims 123 to 130, wherein Z is a D-amino acid residue.
132. The method of any one of claims 123 to 130, wherein Z is a β-amino acid residue.
133. The method of any one of claims 123 to 130, wherein Z is an N-methylated amino acid residue.
134. The method of any one of claims 123 to 130, wherein Z is an aspartic acid residue, a glutamic acid residue, a hydroxy-glutamic acid residue, or a 2-amino-3-(2H-tetrazol-5- yl)propanoic acid residue.
135. The method of any one of claims 123 to 130, wherein Z is a carboxyl-fimctionalized amino acid residue.
136. The method of any one of claims 123 to 130, wherein Z is an aspartic acid residue.
137. The method of any one of claims 123 to 130, wherein Z is a glutamic acid residue.
138. The method of any one of claims 123 to 137, wherein the peptide sequence comprises one or more D-amino acid residues.
139. The method of any one of claims 123 to 138, wherein the peptide sequence comprises one or more β-amino acid residues.
140. The method of any one of claims 123 to 139, wherein the peptide sequence comprises
one or more N-methylated amino acid residues.
141. The method of any one of claims 123 to 140, wherein the peptide sequence comprises DFCFDWKTET (SEQ ID NO: 4).
142. The method of any one of claims 123 to 141, wherein the peptide sequence comprises FCFAKTETA (SEQ ID NO: 5).
143. The method of any one of claims 123 to 142, wherein the peptide sequence further comprises a protease recognition sequence.
144. The method of claim 143, wherein the protease recognition sequence is a TEV protease recognition sequence.
145. The method of claim 144, wherein the TEV protease recognition sequence is ENLYFQ (SEQ ID NO: 1).
146. The method of any one of claims 123 to 145, wherein the peptide sequence further comprises a leader sequence.
147. The method of claim 146, wherein the leader sequence is LKQINVIAGVKEPIRAYG (SEQ ID NO: 2) or LKQINVIAGVKPIRAYG (SEQ ID NO: 3).
148. The method of claim 146, wherein the leader sequence is LKQINVIAGVKEPIRAYG (SEQ ID NO: 2).
149. The method of any one of claims 123 to 148, wherein PapB installs a single thioether linkage in the peptide sequence.
150. The method of any one of claims 123 to 148, wherein PapB installs two or more thioether linkages in the peptide sequence.
151. The method of claim 150, wherein the peptide sequence comprises the sequence C- Ya-C-D-Yb-D, wherein a is 1, 2, 3, 4, 5, 6, or 7, and wherein b is 0, 1, 2, 3, 4, 5, 6, or 7.
152. The method of claim 150, wherein peptide sequence comprises the sequence C-Yx-D- Yy-C-Yz-D, wherein x is 0, 1, 2, 3, 4, 5, 6, or 7; wherein y is 1, 2, 3, 4, 5, 6, 7, or 8; and wherein z is 0, 1, 2, 3, 4, 5, 6, or 7.
153. The method of any one of claims 123 to 152, wherein the method further comprises addition of a protease.
154. The method of claim 153, wherein the protease is a TEV protease.
155. The method of claim 153 or claim 154, wherein the method further comprises addition of a reducing agent.
156. The method of claim 155, wherein the reducing agent is one or more of dithionite, flavodoxin, flavodoxin reductase, titanium citrate, and reduced nicotinamide adenine dinucleotide phosphate.
157. A thiother compound produced by the method of any one of claims 123 to 156.
158. The thioether compound of claim 157, wherein the thioether compound is an analog of a peptide therapeutic.
159. The thioether compound of claim 158, wherein the peptide-based therapeutic is octreotide, setmalanotide, romidepsin, bremelanotide, pramlintide, oxytocin, setmelanotide, or cyclosporin.
163. A pharmaceutical composition comprising an effective amount of the compound of claim 162 or a pharmaceutically acceptable salt thereof and a pharmaceutically acceptable carrier.
164. A compound selected from:
165. A pharmaceutical composition comprising an effective amount of the compound of claim 164 or a pharmaceutically acceptable salt thereof and a pharmaceutically acceptable carrier.
166. A method of chemically modifying a modified PapA sequence to install a thioether linkage, the method comprising reacting the modified PapA sequence with PapB, wherein the modified PapA sequence comprises Cys-Yn-Asp, wherein Y is a series of amino acid residues, and wherein n is 0, 1, 2, 4, 5, 6, or 7.
167. The method of claim 166, wherein the modified PapA sequence comprises minimal substrate PapA.
168. The method of claim 166 or claim 167, wherein the modified PapA sequence is LKQINVIAGVKEPIRAYGCDSNNAANA (SEQ ID NO: 6), LKQINVIAGVKEPIRAYGCSDNNAAA (SEQ ID NO: 7), LKQINVTAGVKEPTR AYGCSNDA A A (SEQ ID NO: 8), LKQINVIAGVKEPIRAYGCSAANDA (SEQ ID NO: 9), LKQINVIAGVKEPIRAYGCSAAANDA (SEQ ID NO: 10), or LKQINVIAGVKEPIRAYGCSAAAANDA (SEQ ID NO: 11).
169. The method of claim 166 or claim 167, wherein the modified PapA sequence is
LKQINVIAGVKFPIRAYGAAACSANDA (SEQ ID NO: 12), LKQINVIAGVKEPIRAYGAAACSANDACSANDA (SEQ ID NO: 13), LKQINVIAGVKEPIRAYGAAACSACDAADA (SEQ ID NO: 14), LKQINVIAGVKEPIRAYGAAAASACDAADA (SEQ ID NO: 15), or LKQINVIAGVKEPIRAYGAAACSAADAAADA (SEQ ID NO: 16).
170. The method of any one of claims 166 to 169, wherein the modified PapA sequence comprises a protease recognition sequence.
171. The method of claim 170, wherein the protease recognition sequence is a TEV protease recognition sequence.
172. The method of claim 171, wherein the TEV protease recognition sequence is ENLYFQ (SEQ ID NO: 1).
173. The method of any one of claims 166 to 172, wherein the method further comprises
addition of a protease.
174. The method of claim 173, wherein the protease is TEV protease.
175. The method of any one of claims 166 to 174, wherein the modified PapA sequence further comprises a leader sequence of LKQINVIAGVKEPIRAYG (SEQ ID NO: 2) or LKQINVIAGVKPIRAYG (SEQ ID NO: 3).
176. The method of any one of claims 166 to 174, wherein the modified PapA sequence further comprises a leader sequence of LKQINVIAGVKEPIRAYG (SEQ ID NO: 2).
177. The method of any one of claims 166 to 174, wherein the method further comprises addition of a reducing agent and the modified PapA sequence further comprises a leader sequence of LKQINVIAGVKEPIRAYG (SEQ ID NO: 2) or LKQ INVIA GVKPIRAYG (SEQ ID NO: 3).
178. The method of any one of claims 166 to 177, wherein the method further comprises addition of a reducing agent.
179. The method of claim 178, wherein the reducing agent comprises dithionite, flavodoxin, flavodoxin reductase, titanium citrate, reduced nicotinamide adenine dinucleotide phosphate, or any combination thereof.
180. The method of any one of claims 166 to 179, wherein the modified PapA sequence comprises one or more D-amino residues.
181. The method of any one of claims 166 to 180, wherein the modified PapA sequence comprises one or more β-amino acid residues.
182. The method of any one of claims 166 to 181, wherein the modified PapA sequence comprises one or more N-methylated amino acid residues.
183. The method of any one of claims 166 to 182, wherein PapB installs two or more thioether linkages in the modified PapA sequence.
184. The method of claim 183, wherein the modified PapA sequence comprises the sequence C-Ya-C-D-Yb-D; wherein C is a cysteine residue , D is an aspartic acid residue, Y is a series of amino acid residues, a = 1, 2, 3, 4, 5, 6, or 7, and b= 0, 1, 2, 3, 4, 5, 6, or 7.
185. The method of claim 183, wherein the modified PapA sequence comprises the sequence C-Yx-D-Yy-C-Yz-D; wherein C is a cysteine residue, D is an aspartic acid residue, Y is a series of amino acid residues, x = 0, 1, 2, 3, 4, 5, 6, or 7, y = 1, 2, 3, 4, 5, 6, 7, or 8, and z = 0, 1, 2, 3, 4, 5, 6, or 7.
186. A compound produced by the method of any one of claims 166 to 185.
Applications Claiming Priority (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202263331393P | 2022-04-15 | 2022-04-15 | |
US63/331,393 | 2022-04-15 | ||
US202263337029P | 2022-04-29 | 2022-04-29 | |
US63/337,029 | 2022-04-29 | ||
US202263393174P | 2022-07-28 | 2022-07-28 | |
US63/393,174 | 2022-07-28 | ||
US202363446589P | 2023-02-17 | 2023-02-17 | |
US63/446,589 | 2023-02-17 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2023201372A2 true WO2023201372A2 (en) | 2023-10-19 |
WO2023201372A3 WO2023201372A3 (en) | 2024-05-30 |
Family
ID=88330424
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2023/065825 WO2023201372A2 (en) | 2022-04-15 | 2023-04-14 | Papb as a bimoiety-dependent thioether installation tool |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2023201372A2 (en) |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20220389466A1 (en) * | 2019-11-05 | 2022-12-08 | Nitro Biosciences, Inc. | Biosynthesis of para-nitro-l-phenylalanine |
-
2023
- 2023-04-14 WO PCT/US2023/065825 patent/WO2023201372A2/en active Search and Examination
Also Published As
Publication number | Publication date |
---|---|
WO2023201372A3 (en) | 2024-05-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Montalbán-López et al. | New developments in RiPP discovery, enzymology and engineering | |
Bird et al. | Synthesis and Biophysical Characterization of Stabilized α‐Helices of BCL‐2 Domains | |
TWI579296B (en) | Cyclization of Peptide Compounds | |
Lin et al. | Atroposelective total synthesis of darobactin A | |
Batjargal et al. | Synthesis of thioester peptides for the incorporation of thioamides into proteins by native chemical ligation | |
WO2014127052A1 (en) | Perfluoro-tert-butyl hydroxyproline | |
CN113226375A (en) | Methods and compositions | |
Mollo et al. | P450 monooxygenase ComJ catalyses side chain phenolic cross-coupling during complestatin biosynthesis | |
US10421954B2 (en) | Hybrid polymers, pharmaceutical compositions and methods of synthesizing the same | |
US9957297B2 (en) | Template-fixed peptidomimetics as inhibitors of FPR1 | |
EP0383190B1 (en) | Ribonucleotide reductase inhibitors | |
Jiang et al. | Design, synthesis and biological activity of novel demethylvancomycin dimers against vancomycin-resistant enterococcus faecalis | |
Noden et al. | Total synthesis of Paenibacterin and its analogues | |
WO2023201372A2 (en) | Papb as a bimoiety-dependent thioether installation tool | |
US7279304B2 (en) | Methods for preparation of macrocyclic molecules and macrocyclic molecules prepared thereby | |
Gu et al. | Structural characterization of a lipopeptide antibiotic A54145E (Asn3Asp9) produced by a genetically engineered strain of Streptomyces fradiae | |
von Nussbaum et al. | Total synthesis and initial structure–activity relationships of longicatenamycin A | |
WO2020201450A1 (en) | Peptide synthesis and modification | |
Van de Vijver et al. | Antibacterial 5′-O-(N-dipeptidyl)-sulfamoyladenosines | |
CN116670151A (en) | Antibacterial cyclic peptides targeting acinetobacter and other gram negative pathogens | |
Byerly-Duke et al. | Thioimidate Solutions to Thioamide Problems during Peptide Synthesis | |
US9376700B2 (en) | Methods of modifying N-termini of a peptide or protein using transferases | |
Ho et al. | Exploring the selectivity and engineering potential of an NRPS condensation domain involved in the biosynthesis of the thermophilic siderophore fuscachelin | |
Xie et al. | Chemical Synthesis of K48‐Linked Diubiquitin by Incorporation of a Lysine‐Linked Auxiliary Handle | |
EP3906935A1 (en) | Peptides for preventing or treating viral infections |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23789207 Country of ref document: EP Kind code of ref document: A2 |
|
DPE1 | Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101) |