AU2017298565B2 - Insulin analogs - Google Patents
Insulin analogs
- Publication number
- AU2017298565B2 AU2017298565A
- Authority
- AU
- Australia
- Prior art keywords
- absent
- leu
- val
- glu
- cys
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical class N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 title claims abstract description 557
- 108090001061 Insulin Proteins 0.000 claims abstract description 188
- 238000000034 method Methods 0.000 claims abstract description 176
- 102000004877 Insulin Human genes 0.000 claims abstract description 150
- 229940125396 insulin Drugs 0.000 claims abstract description 142
- 102000003746 Insulin Receptor Human genes 0.000 claims abstract description 139
- 108010001127 Insulin Receptor Proteins 0.000 claims abstract description 139
- 239000002435 venom Substances 0.000 claims abstract description 57
- 231100000611 venom Toxicity 0.000 claims abstract description 57
- 210000001048 venom Anatomy 0.000 claims abstract description 57
- 239000013078 crystal Substances 0.000 claims abstract description 42
- 230000001225 therapeutic effect Effects 0.000 claims abstract description 14
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 381
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 claims description 280
- 239000004026 insulin derivative Substances 0.000 claims description 212
- 235000001014 amino acid Nutrition 0.000 claims description 206
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 claims description 199
- 150000001413 amino acids Chemical class 0.000 claims description 189
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 claims description 175
- 150000001875 compounds Chemical class 0.000 claims description 120
- 125000000174 L-prolyl group Chemical group [H]N1C([H])([H])C([H])([H])C([H])([H])[C@@]1([H])C(*)=O 0.000 claims description 119
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 claims description 118
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 109
- 238000006467 substitution reaction Methods 0.000 claims description 86
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 claims description 85
- PBGKTOXHQIOBKM-FHFVDXKLSA-N insulin (human) Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@H]1CSSC[C@H]2C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@H](C(N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=3C=CC(O)=CC=3)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=3C=CC(O)=CC=3)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=3C=CC(O)=CC=3)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=3NC=NC=3)NC(=O)[C@H](CO)NC(=O)CNC1=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O)=O)CSSC[C@@H](C(N2)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](NC(=O)CN)[C@@H](C)CC)[C@@H](C)CC)[C@@H](C)O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C=CC=CC=1)C(C)C)C1=CN=CN1 PBGKTOXHQIOBKM-FHFVDXKLSA-N 0.000 claims description 83
- 101000976075 Homo sapiens Insulin Proteins 0.000 claims description 81
- FDKWRPBBCBCIGA-REOHCLBHSA-N (2r)-2-azaniumyl-3-$l^{1}-selanylpropanoate Chemical compound [Se]C[C@H](N)C(O)=O FDKWRPBBCBCIGA-REOHCLBHSA-N 0.000 claims description 74
- FDKWRPBBCBCIGA-UWTATZPHSA-N D-Selenocysteine Natural products [Se]C[C@@H](N)C(O)=O FDKWRPBBCBCIGA-UWTATZPHSA-N 0.000 claims description 74
- ZKZBPNGNEQAJSX-UHFFFAOYSA-N selenocysteine Natural products [SeH]CC(N)C(O)=O ZKZBPNGNEQAJSX-UHFFFAOYSA-N 0.000 claims description 74
- 229940055619 selenocysteine Drugs 0.000 claims description 74
- 235000016491 selenocysteine Nutrition 0.000 claims description 74
- 229920001184 polypeptide Polymers 0.000 claims description 66
- UHBYWPGGCSDKFX-UHFFFAOYSA-N carboxyglutamic acid Chemical compound OC(=O)C(N)CC(C(O)=O)C(O)=O UHBYWPGGCSDKFX-UHFFFAOYSA-N 0.000 claims description 64
- 210000004369 blood Anatomy 0.000 claims description 49
- 239000008280 blood Substances 0.000 claims description 49
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 claims description 45
- -1 His Chemical compound 0.000 claims description 44
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 claims description 43
- 229960002591 hydroxyproline Drugs 0.000 claims description 43
- 239000000178 monomer Substances 0.000 claims description 43
- FGMPLJWBKKVCDB-UHFFFAOYSA-N trans-L-hydroxy-proline Natural products ON1CCCC1C(O)=O FGMPLJWBKKVCDB-UHFFFAOYSA-N 0.000 claims description 43
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims description 41
- 239000008103 glucose Substances 0.000 claims description 40
- 125000003118 aryl group Chemical group 0.000 claims description 38
- 230000000694 effects Effects 0.000 claims description 38
- 230000004913 activation Effects 0.000 claims description 37
- 125000001931 aliphatic group Chemical group 0.000 claims description 37
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropanoic acid Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 claims description 36
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 claims description 36
- 102000004169 proteins and genes Human genes 0.000 claims description 34
- 108090000623 proteins and genes Proteins 0.000 claims description 34
- 241000282414 Homo sapiens Species 0.000 claims description 33
- 235000018102 proteins Nutrition 0.000 claims description 33
- 150000003839 salts Chemical class 0.000 claims description 33
- 239000008194 pharmaceutical composition Substances 0.000 claims description 32
- 102000005962 receptors Human genes 0.000 claims description 28
- 108020003175 receptors Proteins 0.000 claims description 28
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims description 27
- 235000018417 cysteine Nutrition 0.000 claims description 27
- 239000003814 drug Substances 0.000 claims description 20
- 210000004027 cell Anatomy 0.000 claims description 18
- 238000004519 manufacturing process Methods 0.000 claims description 17
- 230000004071 biological effect Effects 0.000 claims description 15
- 239000003937 drug carrier Substances 0.000 claims description 15
- 239000000816 peptidomimetic Substances 0.000 claims description 15
- 238000012360 testing method Methods 0.000 claims description 15
- 206010067584 Type 1 diabetes mellitus Diseases 0.000 claims description 14
- 238000011156 evaluation Methods 0.000 claims description 13
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 claims description 13
- BVAUMRCGVHUWOZ-ZETCQYMHSA-N (2s)-2-(cyclohexylazaniumyl)propanoate Chemical compound OC(=O)[C@H](C)NC1CCCCC1 BVAUMRCGVHUWOZ-ZETCQYMHSA-N 0.000 claims description 12
- 238000012216 screening Methods 0.000 claims description 12
- 208000001072 type 2 diabetes mellitus Diseases 0.000 claims description 12
- LDUWTIUXPVCEQF-LURJTMIESA-N (2s)-2-(cyclopentylamino)propanoic acid Chemical compound OC(=O)[C@H](C)NC1CCCC1 LDUWTIUXPVCEQF-LURJTMIESA-N 0.000 claims description 11
- 239000000556 agonist Substances 0.000 claims description 11
- 230000003247 decreasing effect Effects 0.000 claims description 11
- 238000000126 in silico method Methods 0.000 claims description 11
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 claims description 11
- 235000000346 sugar Nutrition 0.000 claims description 11
- 210000004899 c-terminal region Anatomy 0.000 claims description 10
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 claims description 9
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 claims description 9
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 claims description 9
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 claims description 9
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 claims description 9
- 201000001421 hyperglycemia Diseases 0.000 claims description 9
- 229930182817 methionine Natural products 0.000 claims description 9
- 238000000302 molecular modelling Methods 0.000 claims description 8
- 241000237972 Conus geographus Species 0.000 claims description 7
- 206010022489 Insulin Resistance Diseases 0.000 claims description 7
- 102220623459 Pentraxin-4_H10E_mutation Human genes 0.000 claims description 7
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Chemical compound CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 claims description 7
- DQLHSFUMICQIMB-VIFPVBQESA-N (2s)-2-amino-3-(4-methylphenyl)propanoic acid Chemical compound CC1=CC=C(C[C@H](N)C(O)=O)C=C1 DQLHSFUMICQIMB-VIFPVBQESA-N 0.000 claims description 6
- 102220472978 Cytochrome c oxidase subunit 6B1_H10Q_mutation Human genes 0.000 claims description 6
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 claims description 6
- 102200072130 rs139340178 Human genes 0.000 claims description 6
- 125000001433 C-terminal amino-acid group Chemical group 0.000 claims description 5
- 101500025353 Homo sapiens Insulin A chain Proteins 0.000 claims description 5
- 238000000379 bremsstrahlung isochromat spectroscopy Methods 0.000 claims description 5
- 208000004104 gestational diabetes Diseases 0.000 claims description 5
- 108700028250 desoctapeptide- insulin Proteins 0.000 claims description 3
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 2
- 238000013461 design Methods 0.000 abstract description 13
- 241001638933 Cochlicella barbara Species 0.000 abstract description 4
- 230000000069 prophylactic effect Effects 0.000 abstract description 2
- 229940024606 amino acid Drugs 0.000 description 185
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 154
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 151
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 135
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 103
- 230000027455 binding Effects 0.000 description 65
- 239000000243 solution Substances 0.000 description 52
- 239000000203 mixture Substances 0.000 description 45
- 235000002639 sodium chloride Nutrition 0.000 description 40
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 35
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 27
- ODKSFYDXXFIFQN-BYPYZUCNSA-N L-arginine Chemical compound OC(=O)[C@@H](N)CCCN=C(N)N ODKSFYDXXFIFQN-BYPYZUCNSA-N 0.000 description 24
- 125000003275 alpha amino acid group Chemical group 0.000 description 24
- 101000852815 Homo sapiens Insulin receptor Proteins 0.000 description 23
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 23
- 229910001868 water Inorganic materials 0.000 description 23
- 102000047882 human INSR Human genes 0.000 description 22
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 19
- 125000004429 atom Chemical group 0.000 description 17
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 16
- 239000002904 solvent Substances 0.000 description 16
- YMWUJEATGCHHMB-UHFFFAOYSA-N Dichloromethane Chemical compound ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 description 15
- ZMXDDKWLCZADIW-UHFFFAOYSA-N N,N-Dimethylformamide Chemical compound CN(C)C=O ZMXDDKWLCZADIW-UHFFFAOYSA-N 0.000 description 15
- 238000007792 addition Methods 0.000 description 15
- 239000003153 chemical reaction reagent Substances 0.000 description 15
- 239000000872 buffer Substances 0.000 description 14
- 206010012601 diabetes mellitus Diseases 0.000 description 13
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 12
- UCMIRNVEIXFBKS-UHFFFAOYSA-N beta-alanine Chemical class NCCC(O)=O UCMIRNVEIXFBKS-UHFFFAOYSA-N 0.000 description 12
- 239000003795 chemical substances by application Substances 0.000 description 12
- 230000011664 signaling Effects 0.000 description 12
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 11
- 239000011347 resin Substances 0.000 description 11
- 229920005989 resin Polymers 0.000 description 11
- 239000006196 drop Substances 0.000 description 10
- 230000004481 post-translational protein modification Effects 0.000 description 10
- 210000001789 adipocyte Anatomy 0.000 description 9
- 238000004458 analytical method Methods 0.000 description 9
- 238000003556 assay Methods 0.000 description 9
- 230000015572 biosynthetic process Effects 0.000 description 9
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 9
- 230000003993 interaction Effects 0.000 description 9
- 230000004048 modification Effects 0.000 description 9
- 238000012986 modification Methods 0.000 description 9
- 239000007983 Tris buffer Substances 0.000 description 8
- 230000006870 function Effects 0.000 description 8
- 239000004615 ingredient Substances 0.000 description 8
- 239000012071 phase Substances 0.000 description 8
- 230000026731 phosphorylation Effects 0.000 description 8
- 238000006366 phosphorylation reaction Methods 0.000 description 8
- 239000000523 sample Substances 0.000 description 8
- 239000011780 sodium chloride Substances 0.000 description 8
- 239000000126 substance Substances 0.000 description 8
- 238000003786 synthesis reaction Methods 0.000 description 8
- 239000011575 calcium Substances 0.000 description 7
- BTCSSZJGUNDROE-UHFFFAOYSA-N gamma-aminobutyric acid Chemical class NCCCC(O)=O BTCSSZJGUNDROE-UHFFFAOYSA-N 0.000 description 7
- 239000003446 ligand Substances 0.000 description 7
- 239000002245 particle Substances 0.000 description 7
- 238000011282 treatment Methods 0.000 description 7
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 6
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 6
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 6
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 6
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 6
- 108090000723 Insulin-Like Growth Factor I Proteins 0.000 description 6
- 125000000998 L-alanino group Chemical group [H]N([*])[C@](C([H])([H])[H])([H])C(=O)O[H] 0.000 description 6
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 6
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 6
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 6
- KWYUFKZDYYNOTN-UHFFFAOYSA-M Potassium hydroxide Chemical compound [OH-].[K+] KWYUFKZDYYNOTN-UHFFFAOYSA-M 0.000 description 6
- DNIAPMSPPWPWGF-UHFFFAOYSA-N Propylene glycol Chemical compound CC(O)CO DNIAPMSPPWPWGF-UHFFFAOYSA-N 0.000 description 6
- QWCKQJZIFLGMSD-UHFFFAOYSA-N alpha-aminobutyric acid Chemical class CCC(N)C(O)=O QWCKQJZIFLGMSD-UHFFFAOYSA-N 0.000 description 6
- 230000009435 amidation Effects 0.000 description 6
- 238000007112 amidation reaction Methods 0.000 description 6
- 230000008901 benefit Effects 0.000 description 6
- 238000006243 chemical reaction Methods 0.000 description 6
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 6
- XVOYSCVBGLVSOL-UHFFFAOYSA-N cysteic acid Chemical class OC(=O)C(N)CS(O)(=O)=O XVOYSCVBGLVSOL-UHFFFAOYSA-N 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 238000009472 formulation Methods 0.000 description 6
- 229910017053 inorganic salt Inorganic materials 0.000 description 6
- FSYKKLYZXJSNPZ-UHFFFAOYSA-N sarcosine Chemical class C[NH2+]CC([O-])=O FSYKKLYZXJSNPZ-UHFFFAOYSA-N 0.000 description 6
- 230000007017 scission Effects 0.000 description 6
- 239000007787 solid Substances 0.000 description 6
- 208000024891 symptom Diseases 0.000 description 6
- 229940123452 Rapid-acting insulin Drugs 0.000 description 5
- 108010026951 Short-Acting Insulin Proteins 0.000 description 5
- 239000013543 active substance Substances 0.000 description 5
- 125000003636 chemical group Chemical group 0.000 description 5
- 238000003776 cleavage reaction Methods 0.000 description 5
- 238000010511 deprotection reaction Methods 0.000 description 5
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 5
- 208000035475 disorder Diseases 0.000 description 5
- 230000004153 glucose metabolism Effects 0.000 description 5
- 239000002953 phosphate buffered saline Substances 0.000 description 5
- 239000000843 powder Substances 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 241000894007 species Species 0.000 description 5
- 238000013519 translation Methods 0.000 description 5
- 230000014616 translation Effects 0.000 description 5
- FUOOLUPWFVMBKG-UHFFFAOYSA-N 2-Aminoisobutyric acid Chemical class CC(C)(N)C(O)=O FUOOLUPWFVMBKG-UHFFFAOYSA-N 0.000 description 4
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 4
- NLXLAEXVIDQMFP-UHFFFAOYSA-N Ammonia chloride Chemical compound [NH4+].[Cl-] NLXLAEXVIDQMFP-UHFFFAOYSA-N 0.000 description 4
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 4
- 208000002705 Glucose Intolerance Diseases 0.000 description 4
- 102100030668 Glutamate receptor 4 Human genes 0.000 description 4
- 101710087627 Glutamate receptor 4 Proteins 0.000 description 4
- 101500025354 Homo sapiens Insulin B chain Proteins 0.000 description 4
- 241000124008 Mammalia Species 0.000 description 4
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 4
- 238000002441 X-ray diffraction Methods 0.000 description 4
- 238000002835 absorbance Methods 0.000 description 4
- 230000021736 acetylation Effects 0.000 description 4
- 238000006640 acetylation reaction Methods 0.000 description 4
- 239000002253 acid Substances 0.000 description 4
- 230000009471 action Effects 0.000 description 4
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 4
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 4
- 235000011130 ammonium sulphate Nutrition 0.000 description 4
- 239000012491 analyte Substances 0.000 description 4
- 230000000903 blocking effect Effects 0.000 description 4
- 210000000170 cell membrane Anatomy 0.000 description 4
- 230000001413 cellular effect Effects 0.000 description 4
- 150000005829 chemical entities Chemical class 0.000 description 4
- 238000002425 crystallisation Methods 0.000 description 4
- 238000012217 deletion Methods 0.000 description 4
- 230000037430 deletion Effects 0.000 description 4
- 238000001212 derivatisation Methods 0.000 description 4
- 239000000539 dimer Substances 0.000 description 4
- 201000010099 disease Diseases 0.000 description 4
- 239000006185 dispersion Substances 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 229960002989 glutamic acid Drugs 0.000 description 4
- 235000011187 glycerol Nutrition 0.000 description 4
- 230000013595 glycosylation Effects 0.000 description 4
- 238000006206 glycosylation reaction Methods 0.000 description 4
- 238000004128 high performance liquid chromatography Methods 0.000 description 4
- 230000002209 hydrophobic effect Effects 0.000 description 4
- 238000011534 incubation Methods 0.000 description 4
- 238000002347 injection Methods 0.000 description 4
- 239000007924 injection Substances 0.000 description 4
- 238000001990 intravenous administration Methods 0.000 description 4
- 150000002632 lipids Chemical class 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 239000012528 membrane Substances 0.000 description 4
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 4
- 238000000329 molecular dynamics simulation Methods 0.000 description 4
- 238000010647 peptide synthesis reaction Methods 0.000 description 4
- 229920000642 polymer Polymers 0.000 description 4
- 201000009104 prediabetes syndrome Diseases 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 239000000047 product Substances 0.000 description 4
- 230000006337 proteolytic cleavage Effects 0.000 description 4
- 230000009467 reduction Effects 0.000 description 4
- 230000001105 regulatory effect Effects 0.000 description 4
- 238000004007 reversed phase HPLC Methods 0.000 description 4
- 239000001632 sodium acetate Substances 0.000 description 4
- 235000017281 sodium acetate Nutrition 0.000 description 4
- PUZPDOWCWNUUKD-UHFFFAOYSA-M sodium fluoride Chemical compound [F-].[Na+] PUZPDOWCWNUUKD-UHFFFAOYSA-M 0.000 description 4
- 235000011121 sodium hydroxide Nutrition 0.000 description 4
- 239000001488 sodium phosphate Substances 0.000 description 4
- 229910000162 sodium phosphate Inorganic materials 0.000 description 4
- 235000011008 sodium phosphates Nutrition 0.000 description 4
- 229940124597 therapeutic agent Drugs 0.000 description 4
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 4
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 4
- MRTPISKDZDHEQI-YFKPBYRVSA-N (2s)-2-(tert-butylamino)propanoic acid Chemical class OC(=O)[C@H](C)NC(C)(C)C MRTPISKDZDHEQI-YFKPBYRVSA-N 0.000 description 3
- NPDBDJFLKKQMCM-SCSAIBSYSA-N (2s)-2-amino-3,3-dimethylbutanoic acid Chemical class CC(C)(C)[C@H](N)C(O)=O NPDBDJFLKKQMCM-SCSAIBSYSA-N 0.000 description 3
- ZXSBHXZKWRIEIA-JTQLQIEISA-N (2s)-3-(4-acetylphenyl)-2-azaniumylpropanoate Chemical compound CC(=O)C1=CC=C(C[C@H](N)C(O)=O)C=C1 ZXSBHXZKWRIEIA-JTQLQIEISA-N 0.000 description 3
- OGNSCSPNOLGXSM-UHFFFAOYSA-N 2,4-diaminobutyric acid Chemical compound NCCC(N)C(O)=O OGNSCSPNOLGXSM-UHFFFAOYSA-N 0.000 description 3
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 3
- SLXKOJJOQWFEFD-UHFFFAOYSA-N 6-aminohexanoic acid Chemical class NCCCCCC(O)=O SLXKOJJOQWFEFD-UHFFFAOYSA-N 0.000 description 3
- WVDDGKGOMKODPV-UHFFFAOYSA-N Benzyl alcohol Chemical compound OCC1=CC=CC=C1 WVDDGKGOMKODPV-UHFFFAOYSA-N 0.000 description 3
- 101100228200 Caenorhabditis elegans gly-5 gene Proteins 0.000 description 3
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 3
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 3
- 239000004471 Glycine Substances 0.000 description 3
- 102000004218 Insulin-Like Growth Factor I Human genes 0.000 description 3
- 102000048143 Insulin-Like Growth Factor II Human genes 0.000 description 3
- SNDPXSYFESPGGJ-BYPYZUCNSA-N L-2-aminopentanoic acid Chemical class CCC[C@H](N)C(O)=O SNDPXSYFESPGGJ-BYPYZUCNSA-N 0.000 description 3
- AHLPHDHHMVZTML-BYPYZUCNSA-N L-Ornithine Chemical class NCCC[C@H](N)C(O)=O AHLPHDHHMVZTML-BYPYZUCNSA-N 0.000 description 3
- ZGUNAGUHMKGQNY-ZETCQYMHSA-N L-alpha-phenylglycine zwitterion Chemical class OC(=O)[C@@H](N)C1=CC=CC=C1 ZGUNAGUHMKGQNY-ZETCQYMHSA-N 0.000 description 3
- RHGKLRLOHDJJDR-BYPYZUCNSA-N L-citrulline Chemical class NC(=O)NCCC[C@H]([NH3+])C([O-])=O RHGKLRLOHDJJDR-BYPYZUCNSA-N 0.000 description 3
- XIGSAGMEBXLVJJ-YFKPBYRVSA-N L-homocitrulline Chemical class NC(=O)NCCCC[C@H]([NH3+])C([O-])=O XIGSAGMEBXLVJJ-YFKPBYRVSA-N 0.000 description 3
- SNDPXSYFESPGGJ-UHFFFAOYSA-N L-norVal-OH Chemical class CCCC(N)C(O)=O SNDPXSYFESPGGJ-UHFFFAOYSA-N 0.000 description 3
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical class CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 3
- OFOBLEOULBTSOW-UHFFFAOYSA-N Malonic acid Chemical compound OC(=O)CC(O)=O OFOBLEOULBTSOW-UHFFFAOYSA-N 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- JGFZNNIVVJXRND-UHFFFAOYSA-N N,N-Diisopropylethylamine (DIPEA) Chemical compound CCN(C(C)C)C(C)C JGFZNNIVVJXRND-UHFFFAOYSA-N 0.000 description 3
- SJRJJKPEHAURKC-UHFFFAOYSA-N N-Methylmorpholine Chemical compound CN1CCOCC1 SJRJJKPEHAURKC-UHFFFAOYSA-N 0.000 description 3
- RHGKLRLOHDJJDR-UHFFFAOYSA-N Ndelta-carbamoyl-DL-ornithine Chemical class OC(=O)C(N)CCCNC(N)=O RHGKLRLOHDJJDR-UHFFFAOYSA-N 0.000 description 3
- AHLPHDHHMVZTML-UHFFFAOYSA-N Orn-delta-NH2 Chemical class NCCCC(N)C(O)=O AHLPHDHHMVZTML-UHFFFAOYSA-N 0.000 description 3
- UTJLXEIPEHZYQJ-UHFFFAOYSA-N Ornithine Chemical class OC(=O)C(C)CCCN UTJLXEIPEHZYQJ-UHFFFAOYSA-N 0.000 description 3
- MUBZPKHOEPUJKR-UHFFFAOYSA-N Oxalic acid Chemical compound OC(=O)C(O)=O MUBZPKHOEPUJKR-UHFFFAOYSA-N 0.000 description 3
- 241000700159 Rattus Species 0.000 description 3
- 108091006300 SLC2A4 Proteins 0.000 description 3
- 108010077895 Sarcosine Chemical class 0.000 description 3
- 102000013275 Somatomedins Human genes 0.000 description 3
- 101000993800 Sus scrofa Insulin Proteins 0.000 description 3
- 239000006180 TBST buffer Substances 0.000 description 3
- YXFVVABEGXRONW-UHFFFAOYSA-N Toluene Chemical compound CC1=CC=CC=C1 YXFVVABEGXRONW-UHFFFAOYSA-N 0.000 description 3
- 238000010521 absorption reaction Methods 0.000 description 3
- 229960002684 aminocaproic acid Drugs 0.000 description 3
- 239000001166 ammonium sulphate Substances 0.000 description 3
- 238000013103 analytical ultracentrifugation Methods 0.000 description 3
- 239000005557 antagonist Substances 0.000 description 3
- 239000003242 anti bacterial agent Substances 0.000 description 3
- 239000007864 aqueous solution Substances 0.000 description 3
- 235000010323 ascorbic acid Nutrition 0.000 description 3
- 229960005070 ascorbic acid Drugs 0.000 description 3
- 239000011668 ascorbic acid Substances 0.000 description 3
- 238000005574 benzylation reaction Methods 0.000 description 3
- 229940000635 beta-alanine Drugs 0.000 description 3
- 230000006287 biotinylation Effects 0.000 description 3
- 238000007413 biotinylation Methods 0.000 description 3
- 238000005251 capillar electrophoresis Methods 0.000 description 3
- 239000000969 carrier Substances 0.000 description 3
- 229960002173 citrulline Drugs 0.000 description 3
- 235000013477 citrulline Nutrition 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 230000008025 crystallization Effects 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 230000018109 developmental process Effects 0.000 description 3
- 239000002612 dispersion medium Substances 0.000 description 3
- 230000009881 electrostatic interaction Effects 0.000 description 3
- 235000013305 food Nutrition 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 229960003692 gamma aminobutyric acid Drugs 0.000 description 3
- UHBYWPGGCSDKFX-VKHMYHEASA-N gamma-carboxy-L-glutamic acid Chemical compound OC(=O)[C@@H](N)CC(C(O)=O)C(O)=O UHBYWPGGCSDKFX-VKHMYHEASA-N 0.000 description 3
- 235000013922 glutamic acid Nutrition 0.000 description 3
- 239000004220 glutamic acid Substances 0.000 description 3
- 229940093915 gynecological organic acid Drugs 0.000 description 3
- 235000011167 hydrochloric acid Nutrition 0.000 description 3
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 238000001727 in vivo Methods 0.000 description 3
- 238000001802 infusion Methods 0.000 description 3
- 230000000670 limiting effect Effects 0.000 description 3
- 229910001629 magnesium chloride Inorganic materials 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 150000007522 mineralic acids Chemical class 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 230000000869 mutational effect Effects 0.000 description 3
- 150000007524 organic acids Chemical class 0.000 description 3
- 235000005985 organic acids Nutrition 0.000 description 3
- 229960003104 ornithine Drugs 0.000 description 3
- 239000000546 pharmaceutical excipient Substances 0.000 description 3
- NLKNQRATVPKPDG-UHFFFAOYSA-M potassium iodide Chemical compound [K+].[I-] NLKNQRATVPKPDG-UHFFFAOYSA-M 0.000 description 3
- 230000001376 precipitating effect Effects 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 229940043230 sarcosine Drugs 0.000 description 3
- 238000007423 screening assay Methods 0.000 description 3
- 230000019491 signal transduction Effects 0.000 description 3
- 150000003384 small molecules Chemical class 0.000 description 3
- FVAUCKIRQBBSSJ-UHFFFAOYSA-M sodium iodide Chemical compound [Na+].[I-] FVAUCKIRQBBSSJ-UHFFFAOYSA-M 0.000 description 3
- 230000009870 specific binding Effects 0.000 description 3
- 238000007920 subcutaneous administration Methods 0.000 description 3
- HNKJADCVZUBCPG-UHFFFAOYSA-N thioanisole Chemical compound CSC1=CC=CC=C1 HNKJADCVZUBCPG-UHFFFAOYSA-N 0.000 description 3
- ZGYICYBLPGRURT-UHFFFAOYSA-N tri(propan-2-yl)silicon Chemical compound CC(C)[Si](C(C)C)C(C)C ZGYICYBLPGRURT-UHFFFAOYSA-N 0.000 description 3
- 239000003981 vehicle Substances 0.000 description 3
- 238000002424 x-ray crystallography Methods 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- 125000003088 (fluoren-9-ylmethoxy)carbonyl group Chemical group 0.000 description 2
- INEWUCPYEUEQTN-UHFFFAOYSA-N 3-(cyclohexylamino)-2-hydroxy-1-propanesulfonic acid Chemical compound OS(=O)(=O)CC(O)CNC1CCCCC1 INEWUCPYEUEQTN-UHFFFAOYSA-N 0.000 description 2
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 2
- ROUFCTKIILEETD-UHFFFAOYSA-N 5-nitro-2-[(5-nitropyridin-2-yl)disulfanyl]pyridine Chemical compound N1=CC([N+](=O)[O-])=CC=C1SSC1=CC=C([N+]([O-])=O)C=N1 ROUFCTKIILEETD-UHFFFAOYSA-N 0.000 description 2
- 241000271566 Aves Species 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- 239000008001 CAPS buffer Substances 0.000 description 2
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 2
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 2
- 241000700199 Cavia porcellus Species 0.000 description 2
- 239000006144 Dulbecco's modified Eagle's medium Substances 0.000 description 2
- VZCYOOQTPOCHFL-OWOJBTEDSA-N Fumaric acid Chemical compound OC(=O)\C=C\C(O)=O VZCYOOQTPOCHFL-OWOJBTEDSA-N 0.000 description 2
- 108010010803 Gelatin Proteins 0.000 description 2
- 102000058061 Glucose Transporter Type 4 Human genes 0.000 description 2
- AEMRFAOFKBGASW-UHFFFAOYSA-N Glycolic acid Chemical compound OCC(O)=O AEMRFAOFKBGASW-UHFFFAOYSA-N 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 101000599951 Homo sapiens Insulin-like growth factor I Proteins 0.000 description 2
- 102000003839 Human Proteins Human genes 0.000 description 2
- 108090000144 Human Proteins Proteins 0.000 description 2
- 108090001117 Insulin-Like Growth Factor II Proteins 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- AFVFQIVMOAPDHO-UHFFFAOYSA-N Methanesulfonic acid Chemical compound CS(O)(=O)=O AFVFQIVMOAPDHO-UHFFFAOYSA-N 0.000 description 2
- BZLVMXJERCGZMT-UHFFFAOYSA-N Methyl tert-butyl ether Chemical compound COC(C)(C)C BZLVMXJERCGZMT-UHFFFAOYSA-N 0.000 description 2
- 241000699666 Mus <mouse, genus> Species 0.000 description 2
- 241000699670 Mus sp. Species 0.000 description 2
- 208000003926 Myelitis Diseases 0.000 description 2
- LRHPLDYGYMQRHN-UHFFFAOYSA-N N-Butanol Chemical compound CCCCO LRHPLDYGYMQRHN-UHFFFAOYSA-N 0.000 description 2
- AMQJEAYHLZJPGS-UHFFFAOYSA-N N-Pentanol Chemical compound CCCCCO AMQJEAYHLZJPGS-UHFFFAOYSA-N 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- NQRYJNQNLNOLGT-UHFFFAOYSA-N Piperidine Chemical compound C1CCNCC1 NQRYJNQNLNOLGT-UHFFFAOYSA-N 0.000 description 2
- ZLMJMSJWJFRBEC-UHFFFAOYSA-N Potassium Chemical compound [K] ZLMJMSJWJFRBEC-UHFFFAOYSA-N 0.000 description 2
- WCUXLLCKKVVCTQ-UHFFFAOYSA-M Potassium chloride Chemical compound [Cl-].[K+] WCUXLLCKKVVCTQ-UHFFFAOYSA-M 0.000 description 2
- 102000001708 Protein Isoforms Human genes 0.000 description 2
- 108010029485 Protein Isoforms Proteins 0.000 description 2
- LCTONWCANYUPML-UHFFFAOYSA-N Pyruvic acid Chemical compound CC(=O)C(O)=O LCTONWCANYUPML-UHFFFAOYSA-N 0.000 description 2
- 241000700157 Rattus norvegicus Species 0.000 description 2
- PMZURENOXWZQFD-UHFFFAOYSA-L Sodium Sulfate Chemical compound [Na+].[Na+].[O-]S([O-])(=O)=O PMZURENOXWZQFD-UHFFFAOYSA-L 0.000 description 2
- KDYFGRWQOYBRFD-UHFFFAOYSA-N Succinic acid Natural products OC(=O)CCC(O)=O KDYFGRWQOYBRFD-UHFFFAOYSA-N 0.000 description 2
- 229930006000 Sucrose Natural products 0.000 description 2
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-L Sulfate Chemical compound [O-]S([O-])(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-L 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- 150000001447 alkali salts Chemical class 0.000 description 2
- 150000001371 alpha-amino acids Chemical class 0.000 description 2
- 235000008206 alpha-amino acids Nutrition 0.000 description 2
- 230000004075 alteration Effects 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 235000019270 ammonium chloride Nutrition 0.000 description 2
- 230000000844 anti-bacterial effect Effects 0.000 description 2
- 229940121375 antifungal agent Drugs 0.000 description 2
- 239000003429 antifungal agent Substances 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- WPYMKLBDIGXBTP-UHFFFAOYSA-N benzoic acid Chemical compound OC(=O)C1=CC=CC=C1 WPYMKLBDIGXBTP-UHFFFAOYSA-N 0.000 description 2
- 239000011230 binding agent Substances 0.000 description 2
- 238000006664 bond formation reaction Methods 0.000 description 2
- 239000007853 buffer solution Substances 0.000 description 2
- KDYFGRWQOYBRFD-NUQCWPJISA-N butanedioic acid Chemical compound O[14C](=O)CC[14C](O)=O KDYFGRWQOYBRFD-NUQCWPJISA-N 0.000 description 2
- YKYOUMDCQGMQQO-UHFFFAOYSA-L cadmium dichloride Chemical compound Cl[Cd]Cl YKYOUMDCQGMQQO-UHFFFAOYSA-L 0.000 description 2
- 239000001110 calcium chloride Substances 0.000 description 2
- 229910001628 calcium chloride Inorganic materials 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 239000002775 capsule Substances 0.000 description 2
- 229910052799 carbon Inorganic materials 0.000 description 2
- 229910002092 carbon dioxide Inorganic materials 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 238000012512 characterization method Methods 0.000 description 2
- OSASVXMJTNOKOY-UHFFFAOYSA-N chlorobutanol Chemical compound CC(C)(O)C(Cl)(Cl)Cl OSASVXMJTNOKOY-UHFFFAOYSA-N 0.000 description 2
- 235000015165 citric acid Nutrition 0.000 description 2
- 238000000576 coating method Methods 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 238000012790 confirmation Methods 0.000 description 2
- 238000005859 coupling reaction Methods 0.000 description 2
- 230000002939 deleterious effect Effects 0.000 description 2
- 239000003599 detergent Substances 0.000 description 2
- XBDQKXXYIPTUBI-UHFFFAOYSA-N dimethylselenoniopropionate Natural products CCC(O)=O XBDQKXXYIPTUBI-UHFFFAOYSA-N 0.000 description 2
- 238000000132 electrospray ionisation Methods 0.000 description 2
- 238000002330 electrospray ionisation mass spectrometry Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000011067 equilibration Methods 0.000 description 2
- DNJIEGIFACGWOD-UHFFFAOYSA-N ethyl mercaptane Natural products CCS DNJIEGIFACGWOD-UHFFFAOYSA-N 0.000 description 2
- DEFVIWRASFVYLL-UHFFFAOYSA-N ethylene glycol bis(2-aminoethyl)tetraacetic acid Chemical compound OC(=O)CN(CC(O)=O)CCOCCOCCN(CC(O)=O)CC(O)=O DEFVIWRASFVYLL-UHFFFAOYSA-N 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 230000002349 favourable effect Effects 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 238000005755 formation reaction Methods 0.000 description 2
- 235000019253 formic acid Nutrition 0.000 description 2
- 238000004108 freeze drying Methods 0.000 description 2
- 239000008273 gelatin Substances 0.000 description 2
- 229920000159 gelatin Polymers 0.000 description 2
- 235000019322 gelatine Nutrition 0.000 description 2
- 235000011852 gelatine desserts Nutrition 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- 230000004190 glucose uptake Effects 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 238000010348 incorporation Methods 0.000 description 2
- 150000007529 inorganic bases Chemical class 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- 239000007951 isotonicity adjuster Substances 0.000 description 2
- WQVJUBFKFCDYDQ-BBWFWOEESA-N leubethanol Natural products C1=C(C)C=C2[C@H]([C@H](CCC=C(C)C)C)CC[C@@H](C)C2=C1O WQVJUBFKFCDYDQ-BBWFWOEESA-N 0.000 description 2
- 125000005647 linker group Chemical group 0.000 description 2
- KWGKDLIKAYFUFQ-UHFFFAOYSA-M lithium chloride Chemical compound [Li+].[Cl-] KWGKDLIKAYFUFQ-UHFFFAOYSA-M 0.000 description 2
- IIPYXGDZVMZOAP-UHFFFAOYSA-N lithium nitrate Chemical compound [Li+].[O-][N+]([O-])=O IIPYXGDZVMZOAP-UHFFFAOYSA-N 0.000 description 2
- INHCSSUBVCNVSK-UHFFFAOYSA-L lithium sulfate Chemical compound [Li+].[Li+].[O-]S([O-])(=O)=O INHCSSUBVCNVSK-UHFFFAOYSA-L 0.000 description 2
- 239000006166 lysate Substances 0.000 description 2
- 239000012139 lysis buffer Substances 0.000 description 2
- YIXJRHPUWRPCBB-UHFFFAOYSA-N magnesium nitrate Chemical compound [Mg+2].[O-][N+]([O-])=O.[O-][N+]([O-])=O YIXJRHPUWRPCBB-UHFFFAOYSA-N 0.000 description 2
- HQKMJHAJHXVSDF-UHFFFAOYSA-L magnesium stearate Chemical compound [Mg+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O HQKMJHAJHXVSDF-UHFFFAOYSA-L 0.000 description 2
- BJEPYKJPYRNKOW-UHFFFAOYSA-L malate(2-) Chemical compound [O-]C(=O)C(O)CC([O-])=O BJEPYKJPYRNKOW-UHFFFAOYSA-L 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- OSWPMRLSEDHDFF-UHFFFAOYSA-N methyl salicylate Chemical compound COC(=O)C1=CC=CC=C1O OSWPMRLSEDHDFF-UHFFFAOYSA-N 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 238000000324 molecular mechanic Methods 0.000 description 2
- 238000012900 molecular simulation Methods 0.000 description 2
- 239000006199 nebulizer Substances 0.000 description 2
- 230000007935 neutral effect Effects 0.000 description 2
- 239000002674 ointment Substances 0.000 description 2
- 150000007530 organic bases Chemical class 0.000 description 2
- 230000003647 oxidation Effects 0.000 description 2
- 238000007254 oxidation reaction Methods 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- 230000036470 plasma concentration Effects 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 108091033319 polynucleotide Proteins 0.000 description 2
- 102000040430 polynucleotide Human genes 0.000 description 2
- 239000002157 polynucleotide Substances 0.000 description 2
- 229910052700 potassium Inorganic materials 0.000 description 2
- SCVFZCLFOSHCOH-UHFFFAOYSA-M potassium acetate Chemical compound [K+].CC([O-])=O SCVFZCLFOSHCOH-UHFFFAOYSA-M 0.000 description 2
- IOLCXVTUBQKXJR-UHFFFAOYSA-M potassium bromide Chemical compound [K+].[Br-] IOLCXVTUBQKXJR-UHFFFAOYSA-M 0.000 description 2
- NROKBHXJSPEDAR-UHFFFAOYSA-M potassium fluoride Chemical compound [F-].[K+] NROKBHXJSPEDAR-UHFFFAOYSA-M 0.000 description 2
- 235000011118 potassium hydroxide Nutrition 0.000 description 2
- FGIUAXJPYTZDNR-UHFFFAOYSA-N potassium nitrate Chemical compound [K+].[O-][N+]([O-])=O FGIUAXJPYTZDNR-UHFFFAOYSA-N 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 238000002953 preparative HPLC Methods 0.000 description 2
- 230000002265 prevention Effects 0.000 description 2
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 2
- 230000002035 prolonged effect Effects 0.000 description 2
- 125000006239 protecting group Chemical group 0.000 description 2
- 238000003259 recombinant expression Methods 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 230000029964 regulation of glucose metabolic process Effects 0.000 description 2
- YGSDEFSMJLZEOE-UHFFFAOYSA-N salicylic acid Chemical compound OC(=O)C1=CC=CC=C1O YGSDEFSMJLZEOE-UHFFFAOYSA-N 0.000 description 2
- 238000004062 sedimentation Methods 0.000 description 2
- 239000011734 sodium Substances 0.000 description 2
- HELHAJAZNSDZJO-OLXYHTOASA-L sodium L-tartrate Chemical compound [Na+].[Na+].[O-]C(=O)[C@H](O)[C@@H](O)C([O-])=O HELHAJAZNSDZJO-OLXYHTOASA-L 0.000 description 2
- JHJLBTNAGRQEKS-UHFFFAOYSA-M sodium bromide Chemical compound [Na+].[Br-] JHJLBTNAGRQEKS-UHFFFAOYSA-M 0.000 description 2
- IHQKEDIOMGYHEB-UHFFFAOYSA-M sodium dimethylarsinate Chemical compound [Na+].C[As](C)([O-])=O IHQKEDIOMGYHEB-UHFFFAOYSA-M 0.000 description 2
- 239000011775 sodium fluoride Substances 0.000 description 2
- 235000013024 sodium fluoride Nutrition 0.000 description 2
- VWDWKYIASSYTQR-UHFFFAOYSA-N sodium nitrate Chemical compound [Na+].[O-][N+]([O-])=O VWDWKYIASSYTQR-UHFFFAOYSA-N 0.000 description 2
- DAEPDZWVDSPTHF-UHFFFAOYSA-M sodium pyruvate Chemical compound [Na+].CC(=O)C([O-])=O DAEPDZWVDSPTHF-UHFFFAOYSA-M 0.000 description 2
- 229940074404 sodium succinate Drugs 0.000 description 2
- ZDQYSKICYIVCPN-UHFFFAOYSA-L sodium succinate (anhydrous) Chemical compound [Na+].[Na+].[O-]C(=O)CCC([O-])=O ZDQYSKICYIVCPN-UHFFFAOYSA-L 0.000 description 2
- 239000001433 sodium tartrate Substances 0.000 description 2
- 229960002167 sodium tartrate Drugs 0.000 description 2
- 235000011004 sodium tartrates Nutrition 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 238000003756 stirring Methods 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 239000005720 sucrose Substances 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- 150000003573 thiols Chemical group 0.000 description 2
- JOXIMZWYDAKGHI-UHFFFAOYSA-N toluene-4-sulfonic acid Chemical compound CC1=CC=C(S(O)(=O)=O)C=C1 JOXIMZWYDAKGHI-UHFFFAOYSA-N 0.000 description 2
- 230000000699 topical effect Effects 0.000 description 2
- 238000006257 total synthesis reaction Methods 0.000 description 2
- VZCYOOQTPOCHFL-UHFFFAOYSA-N trans-butenedioic acid Natural products OC(=O)C=CC(O)=O VZCYOOQTPOCHFL-UHFFFAOYSA-N 0.000 description 2
- 230000005945 translocation Effects 0.000 description 2
- 210000003462 vein Anatomy 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- 239000003643 water by type Substances 0.000 description 2
- JIAARYAFYJHUJI-UHFFFAOYSA-L zinc dichloride Chemical compound [Cl-].[Cl-].[Zn+2] JIAARYAFYJHUJI-UHFFFAOYSA-L 0.000 description 2
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 2
- QBYIENPQHBMVBV-HFEGYEGKSA-N (2R)-2-hydroxy-2-phenylacetic acid Chemical compound O[C@@H](C(O)=O)c1ccccc1.O[C@@H](C(O)=O)c1ccccc1 QBYIENPQHBMVBV-HFEGYEGKSA-N 0.000 description 1
- XJRUHYXXHXECDI-HSZRJFAPSA-N (2r)-2-(9h-fluoren-9-ylmethoxycarbonylamino)-5-[(2-methylpropan-2-yl)oxy]-4-[(2-methylpropan-2-yl)oxycarbonyl]-5-oxopentanoic acid Chemical compound C1=CC=C2C(COC(=O)N[C@H](CC(C(=O)OC(C)(C)C)C(=O)OC(C)(C)C)C(O)=O)C3=CC=CC=C3C2=C1 XJRUHYXXHXECDI-HSZRJFAPSA-N 0.000 description 1
- DVBUCBXGDWWXNY-SFHVURJKSA-N (2s)-5-(diaminomethylideneamino)-2-(9h-fluoren-9-ylmethoxycarbonylamino)pentanoic acid Chemical compound C1=CC=C2C(COC(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C3=CC=CC=C3C2=C1 DVBUCBXGDWWXNY-SFHVURJKSA-N 0.000 description 1
- VRYALKFFQXWPIH-PBXRRBTRSA-N (3r,4s,5r)-3,4,5,6-tetrahydroxyhexanal Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)CC=O VRYALKFFQXWPIH-PBXRRBTRSA-N 0.000 description 1
- BJEPYKJPYRNKOW-REOHCLBHSA-N (S)-malic acid Chemical compound OC(=O)[C@@H](O)CC(O)=O BJEPYKJPYRNKOW-REOHCLBHSA-N 0.000 description 1
- XYGVIBXOJOOCFR-BTJKTKAUSA-N (z)-but-2-enedioic acid;8-chloro-6-(2-fluorophenyl)-1-methyl-4h-imidazo[1,5-a][1,4]benzodiazepine Chemical compound OC(=O)\C=C/C(O)=O.C12=CC(Cl)=CC=C2N2C(C)=NC=C2CN=C1C1=CC=CC=C1F XYGVIBXOJOOCFR-BTJKTKAUSA-N 0.000 description 1
- WBYWAXJHAXSJNI-VOTSOKGWSA-M .beta-Phenylacrylic acid Natural products [O-]C(=O)\C=C\C1=CC=CC=C1 WBYWAXJHAXSJNI-VOTSOKGWSA-M 0.000 description 1
- RYHBNJHYFVUHQT-UHFFFAOYSA-N 1,4-Dioxane Chemical compound C1COCCO1 RYHBNJHYFVUHQT-UHFFFAOYSA-N 0.000 description 1
- VHJLVAABSRFDPM-UHFFFAOYSA-N 1,4-dithiothreitol Chemical compound SCC(O)C(O)CS VHJLVAABSRFDPM-UHFFFAOYSA-N 0.000 description 1
- UGBLISDIHDMHJX-UHFFFAOYSA-N 1-(4-fluorophenyl)-4-[4-(2-methoxyphenyl)piperazin-1-yl]butan-1-one;hydrochloride Chemical compound [Cl-].COC1=CC=CC=C1N1CC[NH+](CCCC(=O)C=2C=CC(F)=CC=2)CC1 UGBLISDIHDMHJX-UHFFFAOYSA-N 0.000 description 1
- DURPTKYDGMDSBL-UHFFFAOYSA-N 1-butoxybutane Chemical compound CCCCOCCCC DURPTKYDGMDSBL-UHFFFAOYSA-N 0.000 description 1
- IIZPXYDJLKNOIY-JXPKJXOSSA-N 1-palmitoyl-2-arachidonoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCC\C=C/C\C=C/C\C=C/C\C=C/CCCCC IIZPXYDJLKNOIY-JXPKJXOSSA-N 0.000 description 1
- ABEXEQSGABRUHS-UHFFFAOYSA-N 16-methylheptadecyl 16-methylheptadecanoate Chemical compound CC(C)CCCCCCCCCCCCCCCOC(=O)CCCCCCCCCCCCCCC(C)C ABEXEQSGABRUHS-UHFFFAOYSA-N 0.000 description 1
- QZTKDVCDBIDYMD-UHFFFAOYSA-N 2,2'-[(2-amino-2-oxoethyl)imino]diacetic acid Chemical compound NC(=O)CN(CC(O)=O)CC(O)=O QZTKDVCDBIDYMD-UHFFFAOYSA-N 0.000 description 1
- XZXYQEHISUMZAT-UHFFFAOYSA-N 2-[(2-hydroxy-5-methylphenyl)methyl]-4-methylphenol Chemical compound CC1=CC=C(O)C(CC=2C(=CC=C(C)C=2)O)=C1 XZXYQEHISUMZAT-UHFFFAOYSA-N 0.000 description 1
- HCZMHWVFVZAHCR-UHFFFAOYSA-N 2-[2-(2-sulfanylethoxy)ethoxy]ethanethiol Chemical compound SCCOCCOCCS HCZMHWVFVZAHCR-UHFFFAOYSA-N 0.000 description 1
- VFXZKNGPBLVKPC-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid;sodium Chemical compound [Na].OCCN1CCN(CCS(O)(=O)=O)CC1 VFXZKNGPBLVKPC-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- ZBMRKNMTMPPMMK-UHFFFAOYSA-N 2-amino-4-[hydroxy(methyl)phosphoryl]butanoic acid;azane Chemical compound [NH4+].CP(O)(=O)CCC(N)C([O-])=O ZBMRKNMTMPPMMK-UHFFFAOYSA-N 0.000 description 1
- KIUMMUBSPKGMOY-UHFFFAOYSA-N 3,3'-Dithiobis(6-nitrobenzoic acid) Chemical compound C1=C([N+]([O-])=O)C(C(=O)O)=CC(SSC=2C=C(C(=CC=2)[N+]([O-])=O)C(O)=O)=C1 KIUMMUBSPKGMOY-UHFFFAOYSA-N 0.000 description 1
- DVLFYONBTKHTER-UHFFFAOYSA-N 3-(N-morpholino)propanesulfonic acid Chemical compound OS(=O)(=O)CCCN1CCOCC1 DVLFYONBTKHTER-UHFFFAOYSA-N 0.000 description 1
- BMYNFMYTOJXKLE-UHFFFAOYSA-N 3-azaniumyl-2-hydroxypropanoate Chemical compound NCC(O)C(O)=O BMYNFMYTOJXKLE-UHFFFAOYSA-N 0.000 description 1
- ZLGORAQNGFCUFU-UHFFFAOYSA-N 5-(4-chlorophenyl)-2-phenylpyrazol-3-amine Chemical compound NC1=CC(C=2C=CC(Cl)=CC=2)=NN1C1=CC=CC=C1 ZLGORAQNGFCUFU-UHFFFAOYSA-N 0.000 description 1
- DDFHBQSCUXNBSA-UHFFFAOYSA-N 5-(5-carboxythiophen-2-yl)thiophene-2-carboxylic acid Chemical compound S1C(C(=O)O)=CC=C1C1=CC=C(C(O)=O)S1 DDFHBQSCUXNBSA-UHFFFAOYSA-N 0.000 description 1
- 239000007988 ADA buffer Substances 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- QGZKDVFQNNGYKY-UHFFFAOYSA-O Ammonium Chemical compound [NH4+] QGZKDVFQNNGYKY-UHFFFAOYSA-O 0.000 description 1
- USFZMSVCRYTOJT-UHFFFAOYSA-N Ammonium acetate Chemical compound N.CC(O)=O USFZMSVCRYTOJT-UHFFFAOYSA-N 0.000 description 1
- 239000005695 Ammonium acetate Substances 0.000 description 1
- 241000238421 Arthropoda Species 0.000 description 1
- 241000416162 Astragalus gummifer Species 0.000 description 1
- 210000002237 B-cell of pancreatic islet Anatomy 0.000 description 1
- 239000007989 BIS-Tris Propane buffer Substances 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 239000005711 Benzoic acid Substances 0.000 description 1
- 101001011741 Bos taurus Insulin Proteins 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 238000009010 Bradford assay Methods 0.000 description 1
- 239000008000 CHES buffer Substances 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 241000282472 Canis lupus familiaris Species 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 241000700198 Cavia Species 0.000 description 1
- 241000282994 Cervidae Species 0.000 description 1
- 241000251556 Chordata Species 0.000 description 1
- WBYWAXJHAXSJNI-SREVYHEPSA-N Cinnamic acid Chemical compound OC(=O)\C=C/C1=CC=CC=C1 WBYWAXJHAXSJNI-SREVYHEPSA-N 0.000 description 1
- 102000029816 Collagenase Human genes 0.000 description 1
- 108060005980 Collagenase Proteins 0.000 description 1
- 241000237970 Conus <genus> Species 0.000 description 1
- 241000237980 Conus tulipa Species 0.000 description 1
- 229920002261 Corn starch Polymers 0.000 description 1
- 241000699800 Cricetinae Species 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- 150000008574 D-amino acids Chemical class 0.000 description 1
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 1
- FEWJPZIEWOKRBE-JCYAYHJZSA-N Dextrotartaric acid Chemical compound OC(=O)[C@H](O)[C@@H](O)C(O)=O FEWJPZIEWOKRBE-JCYAYHJZSA-N 0.000 description 1
- LTMHDMANZUZIPE-AMTYYWEZSA-N Digoxin Natural products O([C@H]1[C@H](C)O[C@H](O[C@@H]2C[C@@H]3[C@@](C)([C@@H]4[C@H]([C@]5(O)[C@](C)([C@H](O)C4)[C@H](C4=CC(=O)OC4)CC5)CC3)CC2)C[C@@H]1O)[C@H]1O[C@H](C)[C@@H](O[C@H]2O[C@@H](C)[C@H](O)[C@@H](O)C2)[C@@H](O)C1 LTMHDMANZUZIPE-AMTYYWEZSA-N 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241000283074 Equus asinus Species 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 241000160765 Erebia ligea Species 0.000 description 1
- 108091006020 Fc-tagged proteins Proteins 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- 101000976092 Gallus gallus Insulin Proteins 0.000 description 1
- 239000007821 HATU Substances 0.000 description 1
- 101001076292 Homo sapiens Insulin-like growth factor II Proteins 0.000 description 1
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 1
- 235000003332 Ilex aquifolium Nutrition 0.000 description 1
- 241000209027 Ilex aquifolium Species 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 229940079288 Insulin receptor agonist Drugs 0.000 description 1
- 102100037852 Insulin-like growth factor I Human genes 0.000 description 1
- 229910021578 Iron(III) chloride Inorganic materials 0.000 description 1
- 241000764238 Isis Species 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 241000270322 Lepidosauria Species 0.000 description 1
- WHXSMMKQMYFTQS-UHFFFAOYSA-N Lithium Chemical compound [Li] WHXSMMKQMYFTQS-UHFFFAOYSA-N 0.000 description 1
- 239000007993 MOPS buffer Substances 0.000 description 1
- 244000246386 Mentha pulegium Species 0.000 description 1
- 235000016257 Mentha pulegium Nutrition 0.000 description 1
- 235000004357 Mentha x piperita Nutrition 0.000 description 1
- 229920000168 Microcrystalline cellulose Polymers 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 101001055320 Myxine glutinosa Insulin-like growth factor Proteins 0.000 description 1
- FSVCELGFZIQNCK-UHFFFAOYSA-N N,N-bis(2-hydroxyethyl)glycine Chemical compound OCCN(CCO)CC(O)=O FSVCELGFZIQNCK-UHFFFAOYSA-N 0.000 description 1
- MKWKNSIESPFAQN-UHFFFAOYSA-N N-cyclohexyl-2-aminoethanesulfonic acid Chemical compound OS(=O)(=O)CCNC1CCCCC1 MKWKNSIESPFAQN-UHFFFAOYSA-N 0.000 description 1
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 1
- 229910021586 Nickel(II) chloride Inorganic materials 0.000 description 1
- GRYLNZFGIOXLOG-UHFFFAOYSA-N Nitric acid Chemical compound O[N+]([O-])=O GRYLNZFGIOXLOG-UHFFFAOYSA-N 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 229930040373 Paraformaldehyde Natural products 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 102000015731 Peptide Hormones Human genes 0.000 description 1
- 108010038988 Peptide Hormones Proteins 0.000 description 1
- 108010043958 Peptoids Proteins 0.000 description 1
- 229920002685 Polyoxyl 35CastorOil Polymers 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- 108010005991 Pork Regular Insulin Proteins 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- IWYDHOAUDWTVEP-UHFFFAOYSA-N R-2-phenyl-2-hydroxyacetic acid Natural products OC(=O)C(O)C1=CC=CC=C1 IWYDHOAUDWTVEP-UHFFFAOYSA-N 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 1
- DWAQJAXMDSEUJJ-UHFFFAOYSA-M Sodium bisulfite Chemical compound [Na+].OS([O-])=O DWAQJAXMDSEUJJ-UHFFFAOYSA-M 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- 241000282898 Sus scrofa Species 0.000 description 1
- FEWJPZIEWOKRBE-UHFFFAOYSA-N Tartaric acid Natural products [H+].[H+].[O-]C(=O)C(O)C(O)C([O-])=O FEWJPZIEWOKRBE-UHFFFAOYSA-N 0.000 description 1
- GXDLGHLJTHMDII-WISUUJSJSA-N Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(O)=O GXDLGHLJTHMDII-WISUUJSJSA-N 0.000 description 1
- 229920001615 Tragacanth Polymers 0.000 description 1
- GSEJCLTVZPLZKY-UHFFFAOYSA-N Triethanolamine Chemical compound OCCN(CCO)CCO GSEJCLTVZPLZKY-UHFFFAOYSA-N 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 239000003875 Wang resin Substances 0.000 description 1
- PTFCDOFLOPIGGS-UHFFFAOYSA-N Zinc dication Chemical compound [Zn+2] PTFCDOFLOPIGGS-UHFFFAOYSA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 239000003070 absorption delaying agent Substances 0.000 description 1
- 150000001242 acetic acid derivatives Chemical class 0.000 description 1
- ZOIORXHNWRGPMV-UHFFFAOYSA-N acetic acid;zinc Chemical compound [Zn].CC(O)=O.CC(O)=O ZOIORXHNWRGPMV-UHFFFAOYSA-N 0.000 description 1
- 150000008043 acidic salts Chemical class 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 239000004480 active ingredient Substances 0.000 description 1
- 125000002015 acyclic group Chemical group 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 210000000577 adipose tissue Anatomy 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 239000000443 aerosol Substances 0.000 description 1
- 239000000783 alginic acid Substances 0.000 description 1
- 235000010443 alginic acid Nutrition 0.000 description 1
- 229920000615 alginic acid Polymers 0.000 description 1
- 229960001126 alginic acid Drugs 0.000 description 1
- 150000004781 alginic acids Chemical class 0.000 description 1
- 230000029936 alkylation Effects 0.000 description 1
- 238000005804 alkylation reaction Methods 0.000 description 1
- 230000002009 allergenic effect Effects 0.000 description 1
- PMMURAAUARKVCB-UHFFFAOYSA-N alpha-D-ara-dHexp Natural products OCC1OC(O)CC(O)C1O PMMURAAUARKVCB-UHFFFAOYSA-N 0.000 description 1
- BJEPYKJPYRNKOW-UHFFFAOYSA-N alpha-hydroxysuccinic acid Natural products OC(=O)C(O)CC(O)=O BJEPYKJPYRNKOW-UHFFFAOYSA-N 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 150000001412 amines Chemical class 0.000 description 1
- 235000019257 ammonium acetate Nutrition 0.000 description 1
- 229940043376 ammonium acetate Drugs 0.000 description 1
- SWLVFNYSXGMGBS-UHFFFAOYSA-N ammonium bromide Chemical compound [NH4+].[Br-] SWLVFNYSXGMGBS-UHFFFAOYSA-N 0.000 description 1
- VZTDIZULWFCMLS-UHFFFAOYSA-N ammonium formate Chemical compound [NH4+].[O-]C=O VZTDIZULWFCMLS-UHFFFAOYSA-N 0.000 description 1
- 229940107816 ammonium iodide Drugs 0.000 description 1
- 230000003444 anaesthetic effect Effects 0.000 description 1
- 239000004599 antimicrobial Substances 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 235000006708 antioxidants Nutrition 0.000 description 1
- 239000012062 aqueous buffer Substances 0.000 description 1
- 239000012131 assay buffer Substances 0.000 description 1
- 230000035578 autophosphorylation Effects 0.000 description 1
- NGPGDYLVALNKEG-UHFFFAOYSA-N azanium;azane;2,3,4-trihydroxy-4-oxobutanoate Chemical compound [NH4+].[NH4+].[O-]C(=O)C(O)C(O)C([O-])=O NGPGDYLVALNKEG-UHFFFAOYSA-N 0.000 description 1
- 230000003385 bacteriostatic effect Effects 0.000 description 1
- 230000004888 barrier function Effects 0.000 description 1
- 235000015278 beef Nutrition 0.000 description 1
- PXXJHWLDUBFPOL-UHFFFAOYSA-N benzamidine Chemical compound NC(=N)C1=CC=CC=C1 PXXJHWLDUBFPOL-UHFFFAOYSA-N 0.000 description 1
- 235000010233 benzoic acid Nutrition 0.000 description 1
- 235000019445 benzyl alcohol Nutrition 0.000 description 1
- 239000007998 bicine buffer Substances 0.000 description 1
- 239000003833 bile salt Substances 0.000 description 1
- 229940093761 bile salts Drugs 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 230000008033 biological extinction Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 238000005460 biophysical method Methods 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- OWMVSZAMULFTJU-UHFFFAOYSA-N bis-tris Chemical compound OCCN(CCO)C(CO)(CO)CO OWMVSZAMULFTJU-UHFFFAOYSA-N 0.000 description 1
- HHKZCCWKTZRCCL-UHFFFAOYSA-N bis-tris propane Chemical compound OCC(CO)(CO)NCCCNC(CO)(CO)CO HHKZCCWKTZRCCL-UHFFFAOYSA-N 0.000 description 1
- 230000036765 blood level Effects 0.000 description 1
- 238000010504 bond cleavage reaction Methods 0.000 description 1
- IXIBAKNTJSCKJM-BUBXBXGNSA-N bovine insulin Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@H]1CSSC[C@H]2C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@H](C(=O)N[C@H](C(N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=3C=CC(O)=CC=3)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=3C=CC(O)=CC=3)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=3C=CC(O)=CC=3)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=3NC=NC=3)NC(=O)[C@H](CO)NC(=O)CNC1=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O)=O)CSSC[C@@H](C(N2)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](NC(=O)CN)[C@@H](C)CC)C(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C=CC=CC=1)C(C)C)C1=CN=CN1 IXIBAKNTJSCKJM-BUBXBXGNSA-N 0.000 description 1
- 235000021152 breakfast Nutrition 0.000 description 1
- QCUOBSQYDGUHHT-UHFFFAOYSA-L cadmium sulfate Chemical compound [Cd+2].[O-]S([O-])(=O)=O QCUOBSQYDGUHHT-UHFFFAOYSA-L 0.000 description 1
- AIYUHDOJVYHVIT-UHFFFAOYSA-M caesium chloride Chemical compound [Cl-].[Cs+] AIYUHDOJVYHVIT-UHFFFAOYSA-M 0.000 description 1
- FLJPGEWQYJVDPF-UHFFFAOYSA-L caesium sulfate Chemical compound [Cs+].[Cs+].[O-]S([O-])(=O)=O FLJPGEWQYJVDPF-UHFFFAOYSA-L 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- VSGNNIFQASZAOI-UHFFFAOYSA-L calcium acetate Chemical compound [Ca+2].CC([O-])=O.CC([O-])=O VSGNNIFQASZAOI-UHFFFAOYSA-L 0.000 description 1
- 235000011092 calcium acetate Nutrition 0.000 description 1
- 239000001639 calcium acetate Substances 0.000 description 1
- 229960005147 calcium acetate Drugs 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 239000001569 carbon dioxide Substances 0.000 description 1
- 150000007942 carboxylates Chemical class 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000002738 chelating agent Substances 0.000 description 1
- 238000012412 chemical coupling Methods 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 239000013626 chemical specie Substances 0.000 description 1
- 229960004926 chlorobutanol Drugs 0.000 description 1
- 235000013985 cinnamic acid Nutrition 0.000 description 1
- 229930016911 cinnamic acid Natural products 0.000 description 1
- 150000001860 citric acid derivatives Chemical class 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- GVPFVAHMJGGAJG-UHFFFAOYSA-L cobalt dichloride Chemical compound [Cl-].[Cl-].[Co+2] GVPFVAHMJGGAJG-UHFFFAOYSA-L 0.000 description 1
- 230000019771 cognition Effects 0.000 description 1
- 229960002424 collagenase Drugs 0.000 description 1
- 229940075614 colloidal silicon dioxide Drugs 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 238000005094 computer simulation Methods 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- 239000008120 corn starch Substances 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 239000006071 cream Substances 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000002050 diffraction method Methods 0.000 description 1
- 238000009792 diffusion process Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- LTMHDMANZUZIPE-PUGKRICDSA-N digoxin Chemical compound C1[C@H](O)[C@H](O)[C@@H](C)O[C@H]1O[C@@H]1[C@@H](C)O[C@@H](O[C@@H]2[C@H](O[C@@H](O[C@@H]3C[C@@H]4[C@]([C@@H]5[C@H]([C@]6(CC[C@@H]([C@@]6(C)[C@H](O)C5)C=5COC(=O)C=5)O)CC4)(C)CC3)C[C@@H]2O)C)C[C@@H]1O LTMHDMANZUZIPE-PUGKRICDSA-N 0.000 description 1
- 229960005156 digoxin Drugs 0.000 description 1
- LTMHDMANZUZIPE-UHFFFAOYSA-N digoxine Natural products C1C(O)C(O)C(C)OC1OC1C(C)OC(OC2C(OC(OC3CC4C(C5C(C6(CCC(C6(C)C(O)C5)C=5COC(=O)C=5)O)CC4)(C)CC3)CC2O)C)CC1O LTMHDMANZUZIPE-UHFFFAOYSA-N 0.000 description 1
- UGMCXQCYOVCMTB-UHFFFAOYSA-K dihydroxy(stearato)aluminium Chemical compound CCCCCCCCCCCCCCCCCC(=O)O[Al](O)O UGMCXQCYOVCMTB-UHFFFAOYSA-K 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 229940042399 direct acting antivirals protease inhibitors Drugs 0.000 description 1
- WPUMTJGUQUYPIV-JIZZDEOASA-L disodium (S)-malate Chemical compound [Na+].[Na+].[O-]C(=O)[C@@H](O)CC([O-])=O WPUMTJGUQUYPIV-JIZZDEOASA-L 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 150000002019 disulfides Chemical class 0.000 description 1
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 1
- 239000002552 dosage form Substances 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000009510 drug design Methods 0.000 description 1
- 238000007876 drug discovery Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 239000003480 eluent Substances 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- RDYMFSUJUZBWLH-UHFFFAOYSA-N endosulfan Chemical compound C12COS(=O)OCC2C2(Cl)C(Cl)=C(Cl)C1(Cl)C2(Cl)Cl RDYMFSUJUZBWLH-UHFFFAOYSA-N 0.000 description 1
- 229940088598 enzyme Drugs 0.000 description 1
- 201000010063 epididymitis Diseases 0.000 description 1
- CCIVGXIOQKPBKL-UHFFFAOYSA-M ethanesulfonate Chemical compound CCS([O-])(=O)=O CCIVGXIOQKPBKL-UHFFFAOYSA-M 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 239000012091 fetal bovine serum Substances 0.000 description 1
- 210000002950 fibroblast Anatomy 0.000 description 1
- 239000000834 fixative Substances 0.000 description 1
- 239000000796 flavoring agent Substances 0.000 description 1
- 235000013355 food flavoring agent Nutrition 0.000 description 1
- 235000003599 food sweetener Nutrition 0.000 description 1
- 239000001530 fumaric acid Substances 0.000 description 1
- 235000011087 fumaric acid Nutrition 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- IECPWNUMDGFDKC-MZJAQBGESA-N fusidic acid Chemical class O[C@@H]([C@@H]12)C[C@H]3\C(=C(/CCC=C(C)C)C(O)=O)[C@@H](OC(C)=O)C[C@]3(C)[C@@]2(C)CC[C@@H]2[C@]1(C)CC[C@@H](O)[C@H]2C IECPWNUMDGFDKC-MZJAQBGESA-N 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 229940083124 ganglion-blocking antiadrenergic secondary and tertiary amines Drugs 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 239000007903 gelatin capsule Substances 0.000 description 1
- 238000001415 gene therapy Methods 0.000 description 1
- 229940049906 glutamate Drugs 0.000 description 1
- 208000035474 group of disease Diseases 0.000 description 1
- CLTXFEAAEJABQN-UHFFFAOYSA-N heptane-1,1,1-triol Chemical compound CCCCCCC(O)(O)O CLTXFEAAEJABQN-UHFFFAOYSA-N 0.000 description 1
- 239000000833 heterodimer Substances 0.000 description 1
- ACCCMOQWYVYDOT-UHFFFAOYSA-N hexane-1,1-diol Chemical compound CCCCCC(O)O ACCCMOQWYVYDOT-UHFFFAOYSA-N 0.000 description 1
- SAMYCKUDTNLASP-UHFFFAOYSA-N hexane-2,2-diol Chemical compound CCCCC(C)(O)O SAMYCKUDTNLASP-UHFFFAOYSA-N 0.000 description 1
- 238000013537 high throughput screening Methods 0.000 description 1
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 235000001050 hortel pimenta Nutrition 0.000 description 1
- 102000044162 human IGF1 Human genes 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- NPZTUJOABDZTLV-UHFFFAOYSA-N hydroxybenzotriazole Substances O=C1C=CC=C2NNN=C12 NPZTUJOABDZTLV-UHFFFAOYSA-N 0.000 description 1
- 238000005417 image-selected in vivo spectroscopy Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000000099 in vitro assay Methods 0.000 description 1
- 238000005462 in vivo assay Methods 0.000 description 1
- 239000003701 inert diluent Substances 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 239000007972 injectable composition Substances 0.000 description 1
- 108010033606 insulin dimers Proteins 0.000 description 1
- 230000006362 insulin response pathway Effects 0.000 description 1
- 238000012739 integrated shape imaging system Methods 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- RBTARNINKXHZNM-UHFFFAOYSA-K iron trichloride Chemical compound Cl[Fe](Cl)Cl RBTARNINKXHZNM-UHFFFAOYSA-K 0.000 description 1
- 230000006122 isoprenylation Effects 0.000 description 1
- 238000002334 isothermal calorimetry Methods 0.000 description 1
- 238000009533 lab test Methods 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 239000000787 lecithin Substances 0.000 description 1
- 235000010445 lecithin Nutrition 0.000 description 1
- 229940067606 lecithin Drugs 0.000 description 1
- 230000029226 lipidation Effects 0.000 description 1
- 229910052744 lithium Inorganic materials 0.000 description 1
- XIXADJRWDQXREU-UHFFFAOYSA-M lithium acetate Chemical compound [Li+].CC([O-])=O XIXADJRWDQXREU-UHFFFAOYSA-M 0.000 description 1
- 244000144972 livestock Species 0.000 description 1
- 239000000314 lubricant Substances 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- UEGPKNKPLBYCNK-UHFFFAOYSA-L magnesium acetate Chemical compound [Mg+2].CC([O-])=O.CC([O-])=O UEGPKNKPLBYCNK-UHFFFAOYSA-L 0.000 description 1
- 239000011654 magnesium acetate Substances 0.000 description 1
- 235000011285 magnesium acetate Nutrition 0.000 description 1
- 229940069446 magnesium acetate Drugs 0.000 description 1
- 235000011147 magnesium chloride Nutrition 0.000 description 1
- 159000000003 magnesium salts Chemical class 0.000 description 1
- 235000019359 magnesium stearate Nutrition 0.000 description 1
- GMDNUWQNDQDBNQ-UHFFFAOYSA-L magnesium;diformate Chemical compound [Mg+2].[O-]C=O.[O-]C=O GMDNUWQNDQDBNQ-UHFFFAOYSA-L 0.000 description 1
- 239000006249 magnetic particle Substances 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- VZCYOOQTPOCHFL-UPHRSURJSA-N maleic acid Chemical compound OC(=O)\C=C/C(O)=O VZCYOOQTPOCHFL-UPHRSURJSA-N 0.000 description 1
- 239000011976 maleic acid Substances 0.000 description 1
- 239000001630 malic acid Substances 0.000 description 1
- 235000011090 malic acid Nutrition 0.000 description 1
- 229960002510 mandelic acid Drugs 0.000 description 1
- 235000012054 meals Nutrition 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 229940098779 methanesulfonic acid Drugs 0.000 description 1
- STZCRXQWRGQSJD-GEEYTBSJSA-M methyl orange Chemical compound [Na+].C1=CC(N(C)C)=CC=C1\N=N\C1=CC=C(S([O-])(=O)=O)C=C1 STZCRXQWRGQSJD-GEEYTBSJSA-M 0.000 description 1
- 229940012189 methyl orange Drugs 0.000 description 1
- 235000010270 methyl p-hydroxybenzoate Nutrition 0.000 description 1
- WBYWAXJHAXSJNI-UHFFFAOYSA-N methyl p-hydroxycinnamate Natural products OC(=O)C=CC1=CC=CC=C1 WBYWAXJHAXSJNI-UHFFFAOYSA-N 0.000 description 1
- 229960001047 methyl salicylate Drugs 0.000 description 1
- 125000000250 methylamino group Chemical group [H]N(*)C([H])([H])[H] 0.000 description 1
- 235000019813 microcrystalline cellulose Nutrition 0.000 description 1
- 239000008108 microcrystalline cellulose Substances 0.000 description 1
- 229940016286 microcrystalline cellulose Drugs 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 108091005601 modified peptides Proteins 0.000 description 1
- 238000003032 molecular docking Methods 0.000 description 1
- 239000003068 molecular probe Substances 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 239000002324 mouth wash Substances 0.000 description 1
- 229940051866 mouthwash Drugs 0.000 description 1
- 230000007498 myristoylation Effects 0.000 description 1
- 239000007923 nasal drop Substances 0.000 description 1
- 239000007922 nasal spray Substances 0.000 description 1
- 239000006218 nasal suppository Substances 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- QMMRZOWCJAIUJA-UHFFFAOYSA-L nickel dichloride Chemical compound Cl[Ni]Cl QMMRZOWCJAIUJA-UHFFFAOYSA-L 0.000 description 1
- 229910017604 nitric acid Inorganic materials 0.000 description 1
- 239000012457 nonaqueous media Substances 0.000 description 1
- 239000000346 nonvolatile oil Substances 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 235000006408 oxalic acid Nutrition 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- QUANRIQJNFHVEU-UHFFFAOYSA-N oxirane;propane-1,2,3-triol Chemical compound C1CO1.OCC(O)CO QUANRIQJNFHVEU-UHFFFAOYSA-N 0.000 description 1
- 230000026792 palmitoylation Effects 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- FJKROLUGYXJWQN-UHFFFAOYSA-N papa-hydroxy-benzoic acid Natural products OC(=O)C1=CC=C(O)C=C1 FJKROLUGYXJWQN-UHFFFAOYSA-N 0.000 description 1
- 229920002866 paraformaldehyde Polymers 0.000 description 1
- 238000007911 parenteral administration Methods 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- UWJJYHHHVWZFEP-UHFFFAOYSA-N pentane-1,1-diol Chemical compound CCCCC(O)O UWJJYHHHVWZFEP-UHFFFAOYSA-N 0.000 description 1
- 239000000813 peptide hormone Substances 0.000 description 1
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 1
- 229940124531 pharmaceutical excipient Drugs 0.000 description 1
- 230000000144 pharmacologic effect Effects 0.000 description 1
- 229960003742 phenol Drugs 0.000 description 1
- 235000021317 phosphate Nutrition 0.000 description 1
- 150000003013 phosphoric acid derivatives Chemical class 0.000 description 1
- 239000002504 physiological saline solution Substances 0.000 description 1
- 239000006187 pill Substances 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 229920000729 poly(L-lysine) polymer Polymers 0.000 description 1
- 239000008389 polyethoxylated castor oil Substances 0.000 description 1
- 229920005862 polyol Polymers 0.000 description 1
- 150000003077 polyols Chemical class 0.000 description 1
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 1
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- 239000011591 potassium Substances 0.000 description 1
- 235000011056 potassium acetate Nutrition 0.000 description 1
- 239000001103 potassium chloride Substances 0.000 description 1
- 235000011164 potassium chloride Nutrition 0.000 description 1
- 239000011698 potassium fluoride Substances 0.000 description 1
- 235000003270 potassium fluoride Nutrition 0.000 description 1
- WFIZEGIEIOHZCP-UHFFFAOYSA-M potassium formate Chemical compound [K+].[O-]C=O WFIZEGIEIOHZCP-UHFFFAOYSA-M 0.000 description 1
- 229960004839 potassium iodide Drugs 0.000 description 1
- 239000004323 potassium nitrate Substances 0.000 description 1
- 235000010333 potassium nitrate Nutrition 0.000 description 1
- 229940093928 potassium nitrate Drugs 0.000 description 1
- OTYBMLCTZGSZBG-UHFFFAOYSA-L potassium sulfate Chemical compound [K+].[K+].[O-]S([O-])(=O)=O OTYBMLCTZGSZBG-UHFFFAOYSA-L 0.000 description 1
- 229910052939 potassium sulfate Inorganic materials 0.000 description 1
- 239000001120 potassium sulphate Substances 0.000 description 1
- 235000011151 potassium sulphates Nutrition 0.000 description 1
- 239000001472 potassium tartrate Substances 0.000 description 1
- 229940111695 potassium tartrate Drugs 0.000 description 1
- 235000011005 potassium tartrates Nutrition 0.000 description 1
- ZNNZYHKDIALBAK-UHFFFAOYSA-M potassium thiocyanate Chemical compound [K+].[S-]C#N ZNNZYHKDIALBAK-UHFFFAOYSA-M 0.000 description 1
- 229940116357 potassium thiocyanate Drugs 0.000 description 1
- 230000036515 potency Effects 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 150000003141 primary amines Chemical class 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 239000003380 propellant Substances 0.000 description 1
- 238000011321 prophylaxis Methods 0.000 description 1
- 235000019260 propionic acid Nutrition 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 238000000159 protein binding assay Methods 0.000 description 1
- 230000002685 pulmonary effect Effects 0.000 description 1
- 229940107700 pyruvic acid Drugs 0.000 description 1
- 238000004451 qualitative analysis Methods 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000004445 quantitative analysis Methods 0.000 description 1
- IUVKMZGDUIUOCP-BTNSXGMBSA-N quinbolone Chemical compound O([C@H]1CC[C@H]2[C@H]3[C@@H]([C@]4(C=CC(=O)C=C4CC3)C)CC[C@@]21C)C1=CCCC1 IUVKMZGDUIUOCP-BTNSXGMBSA-N 0.000 description 1
- 239000002516 radical scavenger Substances 0.000 description 1
- 238000001525 receptor binding assay Methods 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000033458 reproduction Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- CVHZOJJKTDOEJC-UHFFFAOYSA-N saccharin Chemical compound C1=CC=C2C(=O)NS(=O)(=O)C2=C1 CVHZOJJKTDOEJC-UHFFFAOYSA-N 0.000 description 1
- 229940081974 saccharin Drugs 0.000 description 1
- 235000019204 saccharin Nutrition 0.000 description 1
- 239000000901 saccharin and its Na,K and Ca salt Substances 0.000 description 1
- 229960004889 salicylic acid Drugs 0.000 description 1
- 238000003345 scintillation counting Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 239000012679 serum free medium Substances 0.000 description 1
- 239000005288 shirasu porous glass Substances 0.000 description 1
- 229910052710 silicon Inorganic materials 0.000 description 1
- 239000010703 silicon Substances 0.000 description 1
- 238000001542 size-exclusion chromatography Methods 0.000 description 1
- 239000003711 snail venom Substances 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- IRHWMYKYLWNHTL-UHFFFAOYSA-M sodium 2-(N-morpholino)ethanesulfonate Chemical compound [Na+].[O-]S(=O)(=O)CCN1CCOCC1 IRHWMYKYLWNHTL-UHFFFAOYSA-M 0.000 description 1
- 235000019265 sodium DL-malate Nutrition 0.000 description 1
- 229910000030 sodium bicarbonate Inorganic materials 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 235000010267 sodium hydrogen sulphite Nutrition 0.000 description 1
- 235000009518 sodium iodide Nutrition 0.000 description 1
- 239000001394 sodium malate Substances 0.000 description 1
- PRWXGRGLHYDWPS-UHFFFAOYSA-L sodium malonate Chemical compound [Na+].[Na+].[O-]C(=O)CC([O-])=O PRWXGRGLHYDWPS-UHFFFAOYSA-L 0.000 description 1
- 239000004317 sodium nitrate Substances 0.000 description 1
- 235000010344 sodium nitrate Nutrition 0.000 description 1
- 229940054269 sodium pyruvate Drugs 0.000 description 1
- 229910052938 sodium sulfate Inorganic materials 0.000 description 1
- 235000011152 sodium sulphate Nutrition 0.000 description 1
- HELHAJAZNSDZJO-UHFFFAOYSA-L sodium tartrate Chemical compound [Na+].[Na+].[O-]C(=O)C(O)C(O)C([O-])=O HELHAJAZNSDZJO-UHFFFAOYSA-L 0.000 description 1
- VGTPCRGMBIAPIM-UHFFFAOYSA-M sodium thiocyanate Chemical compound [Na+].[S-]C#N VGTPCRGMBIAPIM-UHFFFAOYSA-M 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 238000004611 spectroscopical analysis Methods 0.000 description 1
- 239000007921 spray Substances 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 150000005846 sugar alcohols Polymers 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 229910021653 sulphate ion Inorganic materials 0.000 description 1
- 238000002198 surface plasmon resonance spectroscopy Methods 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 239000003765 sweetening agent Substances 0.000 description 1
- 238000007910 systemic administration Methods 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
- 239000003826 tablet Substances 0.000 description 1
- 235000002906 tartaric acid Nutrition 0.000 description 1
- 239000011975 tartaric acid Substances 0.000 description 1
- 125000004213 tert-butoxy group Chemical group [H]C([H])([H])C(O*)(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 125000000999 tert-butyl group Chemical group [H]C([H])([H])C(*)(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 239000012085 test solution Substances 0.000 description 1
- WROMPOXWARCANT-UHFFFAOYSA-N tfa trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F.OC(=O)C(F)(F)F WROMPOXWARCANT-UHFFFAOYSA-N 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- RTKIYNMVFMVABJ-UHFFFAOYSA-L thimerosal Chemical compound [Na+].CC[Hg]SC1=CC=CC=C1C([O-])=O RTKIYNMVFMVABJ-UHFFFAOYSA-L 0.000 description 1
- 229940033663 thimerosal Drugs 0.000 description 1
- 230000034005 thiol-disulfide exchange Effects 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 238000011269 treatment regimen Methods 0.000 description 1
- UYPYRKYUKCHHIB-UHFFFAOYSA-N trimethylamine N-oxide Chemical compound C[N+](C)(C)[O-] UYPYRKYUKCHHIB-UHFFFAOYSA-N 0.000 description 1
- 125000002221 trityl group Chemical group [H]C1=C([H])C([H])=C([H])C([H])=C1C([*])(C1=C(C(=C(C(=C1[H])[H])[H])[H])[H])C1=C([H])C([H])=C([H])C([H])=C1[H] 0.000 description 1
- 229960005486 vaccine Drugs 0.000 description 1
- 238000001291 vacuum drying Methods 0.000 description 1
- 238000009777 vacuum freeze-drying Methods 0.000 description 1
- 238000011179 visual inspection Methods 0.000 description 1
- 239000008215 water for injection Substances 0.000 description 1
- 238000005303 weighing Methods 0.000 description 1
- 239000002023 wood Substances 0.000 description 1
- 239000004246 zinc acetate Substances 0.000 description 1
- 235000013904 zinc acetate Nutrition 0.000 description 1
- 239000011592 zinc chloride Substances 0.000 description 1
- 235000005074 zinc chloride Nutrition 0.000 description 1
- NWONKYPBYAMBJT-UHFFFAOYSA-L zinc sulfate Chemical compound [Zn+2].[O-]S([O-])(=O)=O NWONKYPBYAMBJT-UHFFFAOYSA-L 0.000 description 1
- 239000011686 zinc sulphate Substances 0.000 description 1
- 235000009529 zinc sulphate Nutrition 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/575—Hormones
- C07K14/62—Insulins
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B15/00—ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
- G16B15/30—Drug targeting using structural data; Docking or binding prediction
-
- C—CHEMISTRY; METALLURGY
- C30—CRYSTAL GROWTH
- C30B—SINGLE-CRYSTAL GROWTH; UNIDIRECTIONAL SOLIDIFICATION OF EUTECTIC MATERIAL OR UNIDIRECTIONAL DEMIXING OF EUTECTOID MATERIAL; REFINING BY ZONE-MELTING OF MATERIAL; PRODUCTION OF A HOMOGENEOUS POLYCRYSTALLINE MATERIAL WITH DEFINED STRUCTURE; SINGLE CRYSTALS OR HOMOGENEOUS POLYCRYSTALLINE MATERIAL WITH DEFINED STRUCTURE; AFTER-TREATMENT OF SINGLE CRYSTALS OR A HOMOGENEOUS POLYCRYSTALLINE MATERIAL WITH DEFINED STRUCTURE; APPARATUS THEREFOR
- C30B29/00—Single crystals or homogeneous polycrystalline material with defined structure characterised by the material or by their shape
- C30B29/54—Organic compounds
- C30B29/58—Macromolecular compounds
-
- C—CHEMISTRY; METALLURGY
- C30—CRYSTAL GROWTH
- C30B—SINGLE-CRYSTAL GROWTH; UNIDIRECTIONAL SOLIDIFICATION OF EUTECTIC MATERIAL OR UNIDIRECTIONAL DEMIXING OF EUTECTOID MATERIAL; REFINING BY ZONE-MELTING OF MATERIAL; PRODUCTION OF A HOMOGENEOUS POLYCRYSTALLINE MATERIAL WITH DEFINED STRUCTURE; SINGLE CRYSTALS OR HOMOGENEOUS POLYCRYSTALLINE MATERIAL WITH DEFINED STRUCTURE; AFTER-TREATMENT OF SINGLE CRYSTALS OR A HOMOGENEOUS POLYCRYSTALLINE MATERIAL WITH DEFINED STRUCTURE; APPARATUS THEREFOR
- C30B7/00—Single-crystal growth from solutions using solvents which are liquid at normal temperature, e.g. aqueous solutions
- C30B7/02—Single-crystal growth from solutions using solvents which are liquid at normal temperature, e.g. aqueous solutions by evaporation of the solvent
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B15/00—ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2299/00—Coordinates from 3D structures of peptides, e.g. proteins or enzymes
Abstract
The present invention relates to insulin analogs, particularly insulin analogs having shortened B chains. The present invention also relates to the crystal structure of insulin from the venom of cone snails and to methods of using the crystal and related structural information to screen for and design insulin analogs that interact with or modulate the insulin receptor. The present invention also relates to therapeutic and prophylactic methods using insulin analogs.
Description
INSULIN ANALOGS
The present application claims priority from Australian provisional application number AU2016902883, filed 22 July 2016, which is hereby incorporated by reference in its entirety. The present application also claims priority from US provisional application number 62/483,118, filed 4 April 2017, which is hereby incorporated by reference in its entirety.
STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH
This invention was made, in part, with government support under GM 48677 awarded by the National Institutes of Health. The government has certain rights in the invention.
FIELD OF THE INVENTION
The present invention relates generally to insulin analogs. More particularly, the present invention relates to rapid acting insulin analogs having shortened B chains. The present invention also relates to the crystal structure of insulin from the venom of cone snails and to methods of using the crystal and related structural information to screen for and design insulin analogs that interact with or modulate the insulin receptor.
BACKGROUND TO THE INVENTION
Insulin is a polypeptide hormone that plays a central role in the regulation of glucose metabolism, reproduction and cognition. The human insulin monomer consists of two polypeptide chains, the A- and B-chains, which are covalently linked by two disulfide bridges (CysA7-CysB7 and CysA20-CysB19). The A-chain consists of 21 amino acids and the B-chain consists of 30 amino acids. A third disulfide bridge is located within the A-chain (CysA6-CysA11). In the body, insulin exists as monomers, dimers and hexamers. The hexamer consists of three insulin dimers held together by two central zinc ions. Human insulin is stored in pancreatic β-cells as the hexamer. The biologically active form that binds the insulin receptor is monomeric. Insulin hexamer-monomer conversion is crucial to its bioavailability.
Disturbance of insulin regulation is associated with often severe clinical manifestations, such as diabetes mellitus, hyperglycemia and other similar conditions. Diabetes mellitus (referred to as diabetes) is a group of disorders that is characterized by high blood sugar levels over a prolonged period of time. Diabetes can arise if the pancreas does not produce enough insulin or if the body does not respond properly to insulin. Administration of insulin or insulin analogs remains the most effective method of treating conditions such as diabetes. Treatment of diabetes often involves administration of a combination of rapid-acting, pre-prandial insulin as well as a longer-acting insulin to maintain basal levels of the hormone.
Rapid-acting insulin analogs have a fast onset of activity. Typically, they are either monomeric or rapidly dissociate into the monomeric form on injection into an affected individual. Structurally, these insulin analogs differ from normal human insulin by having modifications within the B-chain C-terminal region (residues B26-B30) that are deleterious to insulin multimerization. However, further C-terminal truncation of the B chain in order to abolish self-association has led to near complete loss of activity, presumably because PheB24 is critical for activity. For example, des-octapeptide[B23-B30] insulin (DOI), a monomeric analogue, preserves less than 0.1% bioactivity (Bao et al., 1997). PheB24 lies immediately C-terminal to a Type 1 β-turn formed by residues GlyB20-GluB21-ArgB22-GlyB23, with both the triplet PheB24-PheB25-TyrB26 and the Type 1 β-turn being highly conserved in vertebrate insulins.
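The residue numbering used above can be made concrete with a short sketch. It assumes the standard one-letter sequence of the human insulin B chain; the helper names `residues` and `delete_c_terminal` are hypothetical and are introduced only for illustration.

```python
# Human insulin B chain in one-letter codes, residues B1..B30.
HUMAN_B_CHAIN = "FVNQHLCGSHLVEALYLVCGERGFFYTPKT"


def residues(chain: str, first: int, last: int) -> str:
    """Return residues first..last (1-based, inclusive) of a chain."""
    if not 1 <= first <= last <= len(chain):
        raise ValueError("residue range out of bounds")
    return chain[first - 1:last]


def delete_c_terminal(chain: str, first_deleted: int) -> str:
    """Drop every residue from position first_deleted (1-based) to the C-terminus."""
    if not 1 <= first_deleted <= len(chain):
        raise ValueError("position out of bounds")
    return chain[:first_deleted - 1]


# Type 1 beta-turn GlyB20-GluB21-ArgB22-GlyB23
beta_turn = residues(HUMAN_B_CHAIN, 20, 23)
# Aromatic triplet PheB24-PheB25-TyrB26
aromatic_triplet = residues(HUMAN_B_CHAIN, 24, 26)
# B chain of des-octapeptide[B23-B30] insulin (DOI): residues B1..B22 remain
doi_b_chain = delete_c_terminal(HUMAN_B_CHAIN, 23)
```

The sketch only manipulates sequence strings; it says nothing about folding, disulfide pairing or activity of the truncated chain.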
There is a need for new insulin analogs, and methods for designing such analogs, which are monomeric, fast acting, and retain the human insulin receptor signalling activity.
SUMMARY OF THE INVENTION
The inventors have characterised the newly identified insulin Con-Ins G1 from the venom of Conus geographus and show that it is monomeric, but still binds to the human insulin receptor and retains signalling activity. The inventors further successfully produced crystals of Con-Ins G1 and elucidated its three-dimensional structure using X-ray crystallography. The structural data presented herein have now enabled identification, for the first time, of the key amino acid positions and interactions that permit Con-Ins G1 to retain its activity, despite lacking the aromatic triplet, PheB24-PheB25-TyrB26, of human insulin.
In an aspect, the present invention provides an insulin analog comprising an A chain peptide and a B chain peptide, wherein the B chain comprises an aromatic or large aliphatic residue at a position corresponding to amino acid number 20 of the B chain of human insulin and/or an aromatic or large aliphatic residue at a position corresponding to amino acid number 15 of the B chain of human insulin, wherein the analog comprises at least one amino acid found in human insulin but lacking in the corresponding position of Conus geographus venom insulin, and wherein the A chain peptide and the B chain peptide are bonded together across at least one pair of cysteine residues.
In some embodiments, the aromatic residue is selected from the group consisting of tyrosine, phenylalanine, tryptophan, histidine and 4-methylphenylalanine. In some embodiments, the large aliphatic residue is selected from the group consisting of isoleucine, cyclohexylalanine, cyclopentylalanine and methionine. In some embodiments, the aromatic or large aliphatic residue at a position corresponding to amino acid number 20 of the B chain of human insulin is selected from the group consisting of tyrosine, phenylalanine, 4-methylphenylalanine, histidine, tryptophan, methionine, cyclopentylalanine and cyclohexylalanine. In some embodiments, the aromatic or large aliphatic residue at a position corresponding to amino acid number 15 of the B chain of human insulin is selected from the group consisting of tyrosine, phenylalanine, 4-methylphenylalanine, histidine, tryptophan, methionine, cyclopentylalanine and cyclohexylalanine.
In some embodiments, the B chain is truncated at the C-terminal end when compared to human insulin. In some embodiments, the B chain is lacking one or more or all of the nine C-terminal amino acids of human insulin. In some embodiments, the B chain is at least lacking PheB24 of human insulin. In some embodiments, the B chain is at least lacking the human B chain aromatic triplet (amino acids PheB24-PheB25-TyrB26 of human insulin).
In some embodiments, the insulin analog comprises an A chain peptide comprising the sequence Gly-XA2-XA3-XA4-XA5-CysA6-CysA7-XA8-XA9-XA10-CysA11-XA12-XA13-XA14-XA15-XA16-XA17-XA18-XA19-CysA20-XA21-XA22-XA23-XA24-XA25-XA26-XA27-XA28-XA29-XA30-XA31-XA32-XA33-XA34, wherein XA2 = Val or Ile; XA3 = Val or Ala; XA4 = Glu, Asp, Cys or gamma carboxyglutamate; XA5 = Gln, Glu, gamma carboxyglutamate, His or Val; CysA6, CysA7, and CysA11 are independently Cys or selenocysteine; XA8 = Thr, His, Asp, Gln, Tyr, Lys, Ala or Val; XA9 = Ser, Arg, Asn, Gly, His or Lys; XA10 = Ile, Pro, Tyr, Ala, Ser, Val, Phe, His or Thr; XA12 = Ser or Thr; XA13 = Leu, Asn, Val, Arg or Asp; XA14 = Tyr, Ala, Gln, His, Asp or Glu; XA15 = Gln, Glu or Thr; XA16 = Phe, Leu, or Ala; XA17 = Glu, Gln, Lys, Arg, Ile, Met, Thr or Ser; XA18 = Lys, Ser, Thr, Asn, Gln or Glu; XA19 = Tyr or Phe; CysA20 = Cys, selenocysteine, amidated Cys, or amidated selenocysteine; XA21 = Asn, Pro, His, Ser, Gly, Ala, or is absent; XA22 = Pro, Asn, Thr, Leu, Ser or is absent; XA23 = Thr, Leu, Val, Ser or is absent; XA24 = Arg, Thr, Met, Gln, Leu or is absent; XA25 = Glu, Gly or is absent; XA26 = Ser, Leu or is absent; XA27 to XA31 are independently Ser or are absent; XA32 = Ala, Ser or is absent; XA33 = Ala, Val or is absent; and XA34 = Ala or is absent (SEQ ID NO: 1); and a B chain peptide comprising the sequence XB1-XB2-XB3-XB4-XB5-XB6-XB7-XB8-CysB9-XB10-XB11-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23-XB24-XB25-XB26-XB27-XB28-XB29-XB30-XB31-XB32-XB33-XB34-XB35-XB36-XB37-XB38-XB39, wherein XB1 = Thr, Asn, Ser or is absent; XB2 = Phe, Ser, Asn, Thr, Gln or is absent; XB3 = Ala, Asp, Gly, Pro, Leu, Phe, or His; XB4 = Ala, Thr, Pro, Asp, Val or Gly; XB5 = Asn, Pro, His, Thr, Arg, Lys, Ser or hydroxyproline; XB6 = Lys, Glu, Asn, Asp, Arg, Gln or Gly; XB7 = His, Tyr, Arg or Ile; XB8 = Arg, Thr, Ile, Ser, Leu, Tyr or Lys; CysB9 = Cys or selenocysteine; XB10 = Gly, Gln or Asp; XB11 = Ser, Leu, Gly or Pro; XB12 = His, Glu, gamma carboxyglutamate, Asp, or Asn; XB13 = Ile, Leu, Asp, Val or Ala; XB14 = Thr, Ala, Pro, Val or Arg; XB15 = Asn, Asp, Ala, Val, Thr, Pro or Glu; XB16 = Ala, Ser, Gln, His, Thr, Tyr, Arg or Gly; XB17 = Thr, Tyr, Pro, Leu or Gly; XB18 = Tyr, Met, Val, Gln, Ile, Asp, Gly, Asn or Leu; XB19 = Leu, Asp, Gln, Gly, Lys, Glu, Arg, Ser or Thr; XB20 = Val, Leu or Lys; CysB21 = Cys, amidated Cys, selenocysteine or amidated selenocysteine; XB22 = Val, Tyr, Phe, His, Gly, Gln, Leu, amidated His, amidated Val or is absent; XB23 = Glu, Asp, Arg, Ser, Gly or is absent; XB24 = Arg, Asp, Val or is absent; XB25 = Gly, Leu, Val or is absent; XB26 = Phe, Val, Ile or is absent; XB27 = Phe, Asn, Pro, Glu or is absent; XB28 = Tyr, Cys, His or is absent; XB29 = Thr, His, Ile, Leu, Ser, Tyr or is absent; XB30 = Pro, Glu, Leu, Ile, Arg or is absent; XB31 = Ile, Lys or is absent; XB32 = Ala, Asp, Ser, Thr, Lys, Leu, Gln or is absent; XB33 = Cys or is absent; XB34 = Glu, Pro, Val or is absent; XB35 = Glu, Gly or is absent; XB36 = Glu, Gly or is absent; XB37 = Glu, Val or is absent; XB38 = Ala, Asp or is absent; and XB39 = Ala or is absent (SEQ ID NO: 2).
In some embodiments, the B chain peptide comprises the sequence XB1-XB2-XB3-XB4-XB5-XB6-XB7-XB8-CysB9-XB10-XB11-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23-XB24-XB25-XB26-XB27-XB28-XB29-XB30-XB31-XB32-XB33-XB34-XB35-XB36-XB37-XB38-XB39, wherein XB1 = Thr, Asn, Ser or is absent; XB2 = Phe, Ser, Asn, Thr, Gln or is absent; XB3 = Asp, Gly, Pro, Leu, Phe, or His; XB4 = Thr, Pro, Asp, Val or Gly; XB5 = Asn, Pro, His, Thr, Arg, Ser or hydroxyproline; XB6 = Lys, Glu, Asn, Asp, Arg, Gln or Gly; XB7 = His, Tyr, Arg or Ile; XB8 = Arg, Thr, Ile, Ser, Leu, Tyr or Lys; CysB9 = Cys or selenocysteine; XB10 = Gly, Gln or Asp; XB11 = Ser, Leu, Gly or Pro; XB12 = His, Glu, gamma carboxyglutamate, Asp, or Asn; XB13 = Ile, Leu, Asp, Val or Ala; XB14 = Thr, Ala, Pro, Val or Arg; XB15 = Asn, Asp, Ala, Val, Thr, Pro or Glu; XB16 = Ala, Ser, Gln, His, Tyr, Arg or Gly; XB17 = Tyr or Leu; XB18 = Tyr, Met, Val, Gln, Ile, Asp, Gly, Asn or Leu; XB19 = Leu, Asp, Gln, Gly, Lys, Glu, Arg or Thr; XB20 = Val, Leu or Lys; CysB21 = Cys, amidated Cys, selenocysteine or amidated selenocysteine; XB22 = Tyr, Phe or His; XB23 = Glu, Arg, Ser, Gly or is absent; XB24 = Arg, Asp, Val or is absent; XB25 = Gly, Leu, Val or is absent; XB26 = Phe, Val, Ile or is absent; XB27 = Phe, Asn, Pro, Glu or is absent; XB28 = Tyr, Cys, His or is absent; XB29 = Thr, His, Leu, Tyr or is absent; XB30 = Pro, Glu, Leu, Ile, Arg or is absent; XB31 = Ile, Lys or is absent; XB32 = Thr, Lys, Leu, Gln or is absent; XB33 = Cys or is absent; XB34 = Glu, Pro, Val or is absent; XB35 = Glu, Gly or is absent; XB36 = Glu, Gly or is absent; XB37 = Glu, Val or is absent; XB38 = Ala, Asp or is absent; and XB39 = Ala or is absent (SEQ ID NO: 3).
In some embodiments, the B chain peptide comprises the sequence XB1-XB2-XB3-XB4-XB5-XB6-XB7-XB8-CysB9-XB10-XB11-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23-XB24-XB25-XB26-XB27-XB28-XB29-XB30-XB31-XB32-XB33-XB34-XB35-XB36-XB37-XB38-XB39, wherein XB1 = Thr, Asn, Ser or is absent; XB2 = Phe, Ser, Asn, Thr, Gln or is absent; XB3 = Asp, Gly, Pro, Leu, Phe, or His; XB4 = Thr, Pro, Asp, Val or Gly; XB5 = Asn, Pro, His, Thr, Arg, Ser or hydroxyproline; XB6 = Lys, Glu, Asn, Asp, Arg, Gln or Gly; XB7 = His, Tyr, Arg or Ile; XB8 = Arg, Thr, Ile, Ser, Leu, Tyr or Lys; CysB9 = Cys or selenocysteine; XB10 = Gly, Gln or Asp; XB11 = Ser, Leu, Gly or Pro; XB12 = His, Glu, gamma carboxyglutamate, Asp, or Asn; XB13 = Ile, Leu, Asp, Val or Ala; XB14 = Thr, Ala, Pro, Val or Arg; XB15 = Asn, Asp, Ala, Val, Thr, Pro or Glu; XB16 = Ala, Ser, Gln, His, Tyr, Arg or Gly; XB17 = Tyr or Leu; XB18 = Tyr, Met, Val, Gln, Ile, Asp, Gly, Asn or Leu; XB19 = Leu, Asp, Gln, Gly, Lys, Glu, Arg or Thr; XB20 = Val, Leu or Lys; CysB21 = Cys, amidated Cys, selenocysteine or amidated selenocysteine; XB22 = Tyr, Phe or His; XB23 = Glu, Arg, Ser, Gly or is absent; and XB24, XB25, XB26, XB27, XB28, XB29, XB30, XB31, XB32, XB33, XB34, XB35, XB36, XB37, XB38 and XB39 are absent (SEQ ID NO: 4).
In some embodiments, the B chain peptide comprises the sequence XB1-XB2-XB3-XB4-XB5-XB6-XB7-XB8-CysB9-Gly-Ser-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23-XB24-XB25-XB26-XB27-XB28-XB29-XB30-XB31-XB32-XB33-XB34-XB35-XB36-XB37-XB38-XB39, where XB1 = Thr, Asn, Ser or is absent; XB2 = Phe, Ser, Asn, Thr, Gln or is absent; XB3 = Asp, Gly, Pro, Leu, Phe, or His; XB4 = Thr, Pro, Asp, Val or Gly; XB5 = Asn, Pro, His, Thr, Arg, Ser or hydroxyproline; XB6 = Lys, Glu, Asn, Asp, Arg, Gln or Gly; XB7 = His, Tyr, Arg or Ile; XB8 = Arg, Thr, Ile, Ser, Leu, Tyr or Lys; CysB9 = Cys or selenocysteine; XB12 = His, Glu, gamma carboxyglutamate, Asp, or Asn; XB13 = Ile, Leu, Asp, Val or Ala; XB14 = Thr, Ala, Pro, Val or Arg; XB15 = Asn, Asp, Ala, Val, Thr, Pro or Glu; XB16 = Ala, Ser, Gln, His, Tyr, Arg or Gly; XB17 = Tyr or Leu; XB18 = Tyr, Met, Val, Gln, Ile, Asp, Gly, Asn or Leu; XB19 = Leu, Asp, Gln, Gly, Lys, Glu, Arg or Thr; XB20 = Val, Leu or Lys; CysB21 = Cys, amidated Cys, selenocysteine or amidated selenocysteine; XB22 = Tyr, Phe or His; XB23 = Glu, Arg, Ser, Gly or is absent; XB24 = Arg, Asp, Val or is absent; XB25 = Gly, Leu, Val or is absent; XB26 = Phe, Val, Ile or is absent; XB27 = Phe, Asn, Pro, Glu or is absent; XB28 = Tyr, Cys, His or is absent; XB29 = Thr, His, Leu, Tyr or is absent; XB30 = Pro, Glu, Leu, Ile, Arg or is absent; XB31 = Ile, Lys or is absent; XB32 = Thr, Lys, Leu, Gln or is absent; XB33 = Cys or is absent; XB34 = Glu, Pro, Val or is absent; XB35 = Glu, Gly or is absent; XB36 = Glu, Gly or is absent; XB37 = Glu, Val or is absent; XB38 = Ala, Asp or is absent; and XB39 = Ala or is absent (SEQ ID NO: 5).
In some embodiments, the B chain peptide comprises the sequence XB1-XB2-XB3-XB4-XB5-XB6-XB7-XB8-CysB9-Gly-Ser-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23-XB24-XB25-XB26-XB27-XB28-XB29-XB30-XB31-XB32-XB33-XB34-XB35-XB36-XB37-XB38-XB39, where XB1 = Thr, Asn, Ser or is absent; XB2 = Phe, Ser, Asn, Thr, Gln or is absent; XB3 = Asp, Gly, Pro, Leu, Phe, or His; XB4 = Thr, Pro, Asp, Val or Gly; XB5 = Asn, Pro, His, Thr, Arg, Ser or hydroxyproline; XB6 = Lys, Glu, Asn, Asp, Arg, Gln or Gly; XB7 = His or Tyr; XB8 = Arg, Thr, Ile, Ser, Leu, Tyr or Lys; CysB9 = Cys or selenocysteine; XB12 = His, Glu, gamma carboxyglutamate, Asp, or Asn; XB13 = Ile, Leu, Asp, Val or Ala; XB14 = Thr, Ala, Pro, Val or Arg; XB15 = Asn, Asp, Ala, Val, Thr, Pro or Glu; XB16 = Ala, Ser, Gln, His, Tyr, Arg or Gly; XB17 = Tyr or Leu; XB18 = Tyr, Met, Val, Gln, Ile, Asp, Gly, Asn or Leu; XB19 = Leu, Asp, Gln, Gly, Lys, Glu, Arg or Thr; XB20 = Val or Leu; CysB21 = Cys, amidated Cys, selenocysteine or amidated selenocysteine; XB22 = Tyr, Phe or His; XB23 = Glu, Arg, Ser, Gly or is absent; XB24 = Arg, Asp, Val or is absent; XB25 = Gly, Leu, Val or is absent; XB26 = Phe, Val, Ile or is absent; XB27 = Phe, Asn, Pro, Glu or is absent; XB28 = Tyr, Cys, His or is absent; XB29 = Thr, His, Leu, Tyr or is absent; XB30 = Pro, Glu, Leu, Ile, Arg or is absent; XB31 = Ile, Lys or is absent; XB32 = Thr, Lys, Leu, Gln or is absent; XB33 = Cys or is absent; XB34 = Glu, Pro, Val or is absent; XB35 = Glu, Gly or is absent; XB36 = Glu, Gly or is absent; XB37 = Glu, Val or is absent; XB38 = Ala, Asp or is absent; and XB39 = Ala or is absent (SEQ ID NO: 6).
In some embodiments, the B chain peptide comprises the sequence XB1-XB2-XB3-XB4-XB5-XB6-XB7-XB8-CysB9-Gly-Ser-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23, wherein XB1 = Thr, Asn or is absent; XB2 = Phe, Ser or is absent; XB3 = Phe or Asp; XB4 = Thr or Val; XB5 = Pro, Asn or hydroxyproline; XB6 = Lys, Asn or Gln; XB7 = His or Tyr; XB8 = Arg, Ile or Leu; XB12 = His, Asp, Glu or gamma carboxyglutamate; CysB9 = Cys or selenocysteine; XB13 = Val, Ile or Leu; XB14 = Thr, Ala, Pro, or Val; XB15 = Glu, Val, Asn or Asp; XB16 = Ser, Gln, Tyr or Ala; XB17 = Tyr or Leu; XB18 = Tyr, Asp, Met or Val; XB19 = Leu, Asp, Gln or Lys; XB20 = Leu or Val; CysB21 = Cys, amidated Cys, selenocysteine or amidated selenocysteine; XB22 = Tyr or Gly; and XB23 = Glu, Arg, Gly or is absent (SEQ ID NO: 7).
In some embodiments, the B chain peptide comprises the sequence XB1-XB2-XB3-XB4-XB5-XB6-His-XB8-CysB9-Gly-Ser-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23, wherein XB1 = Thr or is absent; XB2 = Phe or is absent; XB3 = Phe or Asp; XB4 = Thr or Val; XB5 = Pro, Asn or hydroxyproline; XB6 = Lys or Gln; XB8 = Arg or Leu; XB12 = His, Glu or gamma carboxyglutamate; XB13 = Ile or Leu; XB14 = Thr or Val; XB15 = Glu or Asn; XB16 = Ser or Ala; XB17 = Tyr or Leu; XB18 = Tyr or Met; XB19 = Leu or Asp; XB20 = Leu or Val; XB22 = Tyr; and XB23 = Glu or Arg (SEQ ID NO: 8).
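A position-by-position formula of this kind can be checked mechanically. The following is a minimal sketch, not part of the specification, that encodes the B chain formula of SEQ ID NO: 8 as a list of allowed residues per position and reports which positions of a candidate violate it. The three-letter tokens "Hyp" (hydroxyproline) and "Gla" (gamma carboxyglutamate), the use of `None` for an absent position, and the function name `mismatches` are illustrative assumptions. The cysteine positions are restricted to plain Cys here, which is also an assumption.

```python
from typing import List, Optional, Sequence, Set

ABSENT = None  # marks a position that is left out of the peptide

# Allowed residues for positions 1..23 of the B chain formula (SEQ ID NO: 8).
SEQ8_B_CHAIN: List[Set[Optional[str]]] = [
    {"Thr", ABSENT},        # XB1
    {"Phe", ABSENT},        # XB2
    {"Phe", "Asp"},         # XB3
    {"Thr", "Val"},         # XB4
    {"Pro", "Asn", "Hyp"},  # XB5 (Hyp = hydroxyproline)
    {"Lys", "Gln"},         # XB6
    {"His"},                # position 7, fixed
    {"Arg", "Leu"},         # XB8
    {"Cys"},                # CysB9 (assumption: plain Cys only)
    {"Gly"},                # position 10, fixed
    {"Ser"},                # position 11, fixed
    {"His", "Glu", "Gla"},  # XB12 (Gla = gamma carboxyglutamate)
    {"Ile", "Leu"},         # XB13
    {"Thr", "Val"},         # XB14
    {"Glu", "Asn"},         # XB15
    {"Ser", "Ala"},         # XB16
    {"Tyr", "Leu"},         # XB17
    {"Tyr", "Met"},         # XB18
    {"Leu", "Asp"},         # XB19
    {"Leu", "Val"},         # XB20
    {"Cys"},                # CysB21 (assumption: plain Cys only)
    {"Tyr"},                # XB22
    {"Glu", "Arg"},         # XB23
]


def mismatches(candidate: Sequence[Optional[str]],
               pattern: Sequence[Set[Optional[str]]] = SEQ8_B_CHAIN) -> List[int]:
    """Return the 1-based positions at which candidate violates pattern."""
    if len(candidate) != len(pattern):
        raise ValueError("candidate must list every position, using None for absent ones")
    return [i + 1 for i, (res, allowed) in enumerate(zip(candidate, pattern))
            if res not in allowed]


# Hypothetical candidate: XB1 and XB2 absent, Tyr at position 22.
candidate = [None, None, "Phe", "Val", "Asn", "Gln", "His", "Leu", "Cys", "Gly",
             "Ser", "His", "Leu", "Val", "Glu", "Ala", "Leu", "Tyr", "Leu", "Val",
             "Cys", "Tyr", "Glu"]
violations = mismatches(candidate)

# The same candidate with Gly at position 22 falls outside the formula.
gly22 = list(candidate)
gly22[21] = "Gly"
violations_gly22 = mismatches(gly22)
```

Listing every position explicitly, with `None` for absent residues, keeps the numbering of the formula and the candidate aligned without any alignment step.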
In some embodiments, XB17 and XB22 are Tyr. In some embodiments, XB22 is Tyr. In some embodiments, XB17 is Tyr.
In some embodiments, the A chain peptide comprises the sequence Gly-XA2-XA3-XA4-XA5-CysA6-CysA7-XA8-XA9-XA10-CysA11-XA12-XA13-XA14-XA15-Phe-XA17-XA18-XA19-CysA20-XA21-XA22-XA23-XA24-XA25-XA26-XA27-XA28-XA29-XA30-XA31-XA32-XA33-XA34, wherein XA2 = Val or Ile; XA3 = Val or Ala; XA4 = Glu, gamma carboxyglutamate or Cys; XA5 = Gln, Glu, gamma carboxyglutamate, His or Val; CysA6, CysA7, and CysA11 are independently Cys or selenocysteine; XA8 = Thr, His, Asp, Gln, Tyr, Lys or Val; XA9 = Ser, Arg, Asn, His or Lys; XA10 = Ile, Pro, Tyr, Ala, Ser, Phe, His or Thr; XA12 = Ser or Thr; XA13 = Leu, Asn, Val or Asp; XA14 = Tyr, Ala, Gln, Asp or Glu; XA15 = Gln, Glu or Thr; or Ala; XA17 = Glu, Lys, Arg, Ile, Met, Thr or Ser; XA18 = Lys, Thr, Asn, Gln or Glu; XA19 = Tyr or Phe; CysA20 = Cys, selenocysteine, amidated Cys, or amidated selenocysteine; XA21 = Asn, Pro, His, Ser, Gly, Ala, or is absent; XA22 = Pro, Asn, Thr, Leu, Ser or is absent; XA23 = Thr, Leu, Val, Ser or is absent; XA24 = Arg, Thr, Met, Gln, Leu or is absent; XA25 = Glu, Gly or is absent; XA26 = Ser, Leu or is absent; XA27 to XA31 are independently Ser or are absent; XA32 = Ala, Ser or is absent; XA33 = Ala, Val or is absent; and XA34 = Ala or is absent (SEQ ID NO: 9).
In some embodiments, the insulin analog comprises an A chain peptide comprising a sequence Gly-XA2-Val-XA4-XA5-CysA6-CysA7-XA8-XA9-XA10-CysA11-Ser-XA13-XA14-XA15-XA16-XA17-XA18-Tyr-CysA20-XA21, wherein XA2 is Val or Ile, XA4 is Glu or gamma carboxyglutamate, XA5 is His or Gln, XA8 is His or Thr, XA9 is Arg or Ser, XA10 is Pro or Ile, XA13 is Asn or Leu, XA14 is Ala or Tyr, XA15 is Glu or Gln, XA16 is Phe or Leu, XA17 is Lys or Glu, XA18 is Lys or Asn and XA21 is Asn or absent (SEQ ID NO: 10); and a B chain peptide comprising the sequence XB1-XB2-XB3-XB4-XB5-XB6-His-XB8-CysB9-Gly-Ser-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23, wherein XB1 = Thr or is absent; XB2 = Phe or is absent; XB3 = Phe or Asp; XB4 = Thr or Val; XB5 = Pro, Asn or hydroxyproline; XB6 = Lys or Gln; XB8 = Arg or Leu; XB12 = His, Glu or gamma carboxyglutamate; XB13 = Ile or Leu; XB14 = Thr or Val; XB15 = Glu or Asn; XB16 = Ser or Ala; XB17 = Tyr or Leu; XB18 = Tyr or Met; XB19 = Leu or Asp; XB20 = Leu or Val; XB22 = Gly or Tyr; and XB23 = Glu or Arg (SEQ ID NO: 11).
In some embodiments, the insulin analog comprises an A chain peptide comprising a sequence Gly-XA2-Val-XA4-XA5-CysA6-CysA7-XA8-XA9-XA10-CysA11-Ser-XA13-XA14-XA15-Phe-XA17-XA18-Tyr-CysA20-XA21, wherein XA2 is Val or Ile, XA4 is Glu or gamma carboxyglutamate, XA5 is His or Gln, XA8 is His or Thr, XA9 is Arg or Ser, XA10 is Pro or Ile, XA13 is Asn or Leu, XA14 is Ala or Tyr, XA15 is Glu or Gln, XA17 is Lys or Glu, XA18 is Lys or Asn and XA21 is Asn or absent (SEQ ID NO: 12); and a B chain peptide comprising the sequence XB1-XB2-XB3-XB4-XB5-XB6-His-XB8-CysB9-Gly-Ser-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-Tyr-XB23, wherein XB1 = Thr or is absent; XB2 = Phe or is absent; XB3 = Phe or Asp; XB4 = Thr or Val; XB5 = Pro, Asn or hydroxyproline; XB6 = Lys or Gln; XB8 = Arg or Leu; XB12 = His, Glu or gamma carboxyglutamate; XB13 = Ile or Leu; XB14 = Thr or Val; XB15 = Glu or Asn; XB16 = Ser or Ala; XB17 = Tyr or Leu; XB18 = Tyr or Met; XB19 = Leu or Asp; XB20 = Leu or Val; and XB23 = Glu or Arg (SEQ ID NO: 13).
In some embodiments, the insulin analog comprises an A chain peptide comprising a sequence Gly-Val-Val-XA4-XA5-CysA6-CysA7-XA8-XA9-XA10-CysA11-Ser-XA13-XA14-XA15-Phe-XA17-XA18-Tyr-CysA20-XA21, wherein XA4 is Glu or gamma carboxyglutamate, XA5 is His or Gln, XA8 is His or Thr, XA9 is Arg or Ser, XA10 is Pro or Ile, XA13 is Asn or Leu, XA14 is Ala or Tyr, XA15 is Glu or Gln, XA17 is Lys or Glu, XA18 is Lys or Asn and XA21 is Asn or absent (SEQ ID NO: 14); and a B chain peptide comprising the sequence XB1-XB2-XB3-XB4-XB5-XB6-His-Arg-CysB9-Gly-Ser-XB12-Ile-XB14-XB15-XB16-XB17-XB18-XB19-Leu-CysB21-Tyr-XB23, wherein XB1 = Thr or is absent; XB2 = Phe or is absent; XB3 = Phe or Asp; XB4 = Thr or Val; XB5 = Pro, Asn or hydroxyproline; XB6 = Lys or Gln; XB12 = His, Glu or gamma carboxyglutamate; XB14 = Thr or Val; XB15 = Glu or Asn; XB16 = Ser or Ala; XB17 = Tyr or Leu; XB18 = Tyr or Met; XB19 = Leu or Asp; and XB23 = Glu or Arg (SEQ ID NO: 15).
In an embodiment, the insulin analog is identical to human insulin with the exception of a truncated B-chain at the C-terminus and an aromatic residue or large aliphatic residue at amino acid number 15 and/or 20 of the B chain. Examples include, but are not limited to, the three embodiments below, where Xaa is an aromatic residue or large aliphatic residue.
In some embodiments, the insulin analog comprises an A chain peptide comprising the sequence Gly-Ile-Val-Glu-Gln-Cys-Cys-Thr-Ser-Ile-Cys-Ser-Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Asn (SEQ ID NO: 16); and a B chain peptide comprising the sequence Phe-Val-Asn-Gln-His-Leu-Cys-Gly-Ser-His-Leu-Val-Glu-Ala-Xaa-Tyr-Leu-Val-Cys-Gly-Glu, where Xaa is an aromatic residue or large aliphatic residue (SEQ ID NO: 17).
In some embodiments, the insulin analog comprises an A chain peptide comprising the sequence Gly-Ile-Val-Glu-Gln-Cys-Cys-Thr-Ser-Ile-Cys-Ser-Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Asn (SEQ ID NO: 18); and a B chain peptide comprising the sequence Phe-Val-Asn-Gln-His-Leu-Cys-Gly-Ser-His-Leu-Val-Glu-Ala-Leu-Tyr-Leu-Val-Cys-Xaa-Glu, where Xaa is an aromatic residue or large aliphatic residue (SEQ ID NO: 19).
In some embodiments, the insulin analog comprises an A chain peptide comprising the sequence Gly-Ile-Val-Glu-Gln-Cys-Cys-Thr-Ser-Ile-Cys-Ser-Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Asn (SEQ ID NO: 20); and a B chain peptide comprising the sequence Phe-Val-Asn-Gln-His-Leu-Cys-Gly-Ser-His-Leu-Val-Glu-Ala-Xaa-Tyr-Leu-Val-Cys-Xaa-Glu (SEQ ID NO: 21), where Xaa is an aromatic residue or large aliphatic residue.
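The three B chain formulas above can be enumerated programmatically. The sketch below is illustrative only: it starts from residues B1 to B21 of the human B chain in one-letter codes, restricts Xaa to the standard residues named in this specification (Tyr, Phe, Trp, His, Ile, Met) because the non-standard residues have no one-letter code, and assumes that the two Xaa positions of the third formula vary independently. The names `substitute` and `variants` are hypothetical.

```python
from itertools import product
from typing import Dict, List, Sequence

# Human insulin B chain truncated after GluB21 (one-letter codes).
TRUNCATED_HUMAN_B = "FVNQHLCGSHLVEALYLVCGE"

# Standard aromatic or large aliphatic residues: Tyr, Phe, Trp, His, Ile, Met.
# Non-standard options (e.g. cyclohexylalanine) are left out of this sketch.
STANDARD_CHOICES = "YFWHIM"


def substitute(chain: str, changes: Dict[int, str]) -> str:
    """Replace the residues at the given 1-based positions."""
    seq = list(chain)
    for position, residue in changes.items():
        seq[position - 1] = residue
    return "".join(seq)


def variants(chain: str, positions: Sequence[int],
             choices: str = STANDARD_CHOICES) -> List[str]:
    """Enumerate every combination of choices at the given positions."""
    return [substitute(chain, dict(zip(positions, combo)))
            for combo in product(choices, repeat=len(positions))]


xaa15 = variants(TRUNCATED_HUMAN_B, [15])           # Xaa at B15 only
xaa20 = variants(TRUNCATED_HUMAN_B, [20])           # Xaa at B20 only
xaa15_and_20 = variants(TRUNCATED_HUMAN_B, [15, 20])  # Xaa at both positions
```

With six standard choices this yields six candidates for each single-site formula and thirty-six for the double-site formula.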
In some embodiments, the insulin analog comprises a number of modified amino acids. In some embodiments, the insulin analog comprises one or more or all of the following:
i) XA4 = gamma carboxyglutamate;
ii) XB5 = hydroxyproline; and
iii) XB12 = gamma carboxyglutamate.
In some embodiments, CysB9 of the B chain peptide is bonded to CysA6 of the A chain peptide. In some embodiments, CysB21 of the B chain peptide is bonded to CysA20 of the A chain peptide. In some embodiments, CysA7 is bonded to CysA11.
In some embodiments, the A chain peptide and the B chain peptide are linked together at one pair of their respective terminal ends. In some embodiments, the A chain peptide and the B chain peptide are linked together at both terminal ends.
In some embodiments, the insulin analog has an IC50 against the human IR-B receptor of less than 10⁻⁶ M. In some embodiments, the insulin analog does not bind human IGF-IR or binds IGF-IR weakly. In some embodiments, the analog has an affinity (Kd) for human IGF-IR of weaker than 100 nM.
In some embodiments, the insulin analog is predominantly monomeric. In some embodiments, at least 75% of the analog is monomeric in solution.
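The numerical thresholds in the two preceding paragraphs can be combined into a simple screening filter. This is a sketch under stated assumptions: the embodiments above are recited separately, so applying all three thresholds at once is a choice made here for illustration; the class `AssayResult`, its field names and the example values are hypothetical and are not measured data. An affinity "weaker than 100 nM" is read as a Kd greater than 100 nM.

```python
from dataclasses import dataclass


@dataclass
class AssayResult:
    """Hypothetical in vitro measurements for one analog."""
    name: str
    ic50_ir_b_molar: float   # IC50 against human IR-B, in mol/L
    kd_igf1r_molar: float    # Kd for human IGF-IR, in mol/L
    monomer_fraction: float  # fraction of analog that is monomeric in solution


def meets_in_vitro_criteria(result: AssayResult) -> bool:
    """Apply the three thresholds together (an assumption of this sketch)."""
    binds_ir = result.ic50_ir_b_molar < 1e-6           # IC50 below 10^-6 M
    weak_igf1r = result.kd_igf1r_molar > 100e-9        # Kd weaker than 100 nM
    mostly_monomer = result.monomer_fraction >= 0.75   # at least 75% monomeric
    return binds_ir and weak_igf1r and mostly_monomer


# Made-up example values, for illustration only.
passing = AssayResult("example-1", ic50_ir_b_molar=5e-8,
                      kd_igf1r_molar=5e-7, monomer_fraction=0.90)
failing = AssayResult("example-2", ic50_ir_b_molar=5e-8,
                      kd_igf1r_molar=1e-8, monomer_fraction=0.90)
```

Keeping the units explicit in the field names avoids mixing molar and nanomolar values when results from different assays are collected.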
In some embodiments, the insulin analog has increased bioavailability when administered to a human when compared to human insulin. In some embodiments, the insulin analog has a peak bioavailability within 0.5 to 3 hours of administration to a human. In some embodiments, the insulin analog has an onset of activity within 10 minutes of administration.
In a further aspect, the present invention provides a pharmaceutical composition, comprising the insulin analog as defined herein or a pharmaceutically acceptable salt thereof and one or more pharmaceutically acceptable carriers.
In still a further aspect, the present invention provides a method for treating and/or preventing an insulin-related condition, comprising administering a therapeutically effective amount of the insulin analog as defined herein to a subject in need thereof. In some embodiments, the insulin related condition is hyperglycemia, insulin resistance, type-1 diabetes, gestational diabetes or type-2 diabetes.
In still a further aspect, the present invention provides a method for decreasing blood glucose levels, comprising administering a therapeutically effective amount of the insulin analog as defined herein to a subject in need thereof.
In still a further aspect, the present invention provides use of the insulin analog as defined herein in the manufacture of a medicament for treating and/or preventing an insulin-related condition in a subject. In still a further aspect, the present invention provides use of the insulin analog as defined herein in the manufacture of a medicament for decreasing blood glucose levels in a subject.
In still a further aspect, the present invention provides an insulin analog as defined herein for use in treating and/or preventing an insulin-related condition in a subject. In still a further aspect, the present invention provides an insulin analog as defined herein for use in decreasing blood glucose levels in a subject.
In still a further aspect, the present invention provides peptides comprising an insulin A chain peptide and an insulin B chain peptide, wherein the B chain peptide comprises a substitution at amino acid 10 and amino acid 20. In some instances, the substitution at amino acid 20 is G20Y, G20F, or G20P. In some instances, the substitution at amino acid 10 is H10E, H10D or H10Q.
In some embodiments, the peptides comprise an insulin A chain peptide and an insulin B chain peptide, wherein the B chain peptide comprises a substitution at amino acid 10 and amino acid 20, further comprising at least one substitution in the A chain peptide. In some instances, the at least one substitution in the A chain peptide is T8H, T8Y, T8K, or S9R.
In some embodiments, the peptides comprise an insulin A chain peptide and an insulin B chain peptide, wherein the B chain peptide comprises a substitution at amino acid 10 and amino acid 20, further comprising at least two substitutions in the A chain peptide. In some instances, the at least two substitutions in the A chain peptide are two of the substitutions selected from: T8H, T8Y, T8K, and S9R.
In some embodiments, the peptide is a des-octapeptide insulin. In some instances, the B chain peptide comprises the sequence of FVNQHLCGSELVEALYLVCYER (SEQ ID NO: 30). In some instances, the A chain comprises the sequence of GIVEQCCHRICSLYQLENYCN (SEQ ID NO: 39).
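The substitution codes used in the preceding paragraphs (for example H10E or G20Y) name the wild-type residue, its position, and the replacement. As a minimal sketch, assuming the standard one-letter sequences of the human A chain and of the des-octapeptide B chain (B1 to B22), the substitutions can be applied as follows; the function name `apply_substitutions` is hypothetical.

```python
import re
from typing import Iterable

HUMAN_A_CHAIN = "GIVEQCCTSICSLYQLENYCN"       # human insulin A chain, A1..A21
DES_OCTA_B_CHAIN = "FVNQHLCGSHLVEALYLVCGER"   # human B chain truncated to B1..B22


def apply_substitutions(chain: str, codes: Iterable[str]) -> str:
    """Apply codes such as 'G20Y' (wild type, 1-based position, replacement)."""
    seq = list(chain)
    for code in codes:
        match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", code)
        if match is None:
            raise ValueError(f"unrecognised substitution code: {code}")
        wild_type, position, replacement = match.group(1), int(match.group(2)), match.group(3)
        if not 1 <= position <= len(seq):
            raise ValueError(f"{code}: position outside the chain")
        if seq[position - 1] != wild_type:
            raise ValueError(f"{code}: chain has {seq[position - 1]} at position {position}")
        seq[position - 1] = replacement
    return "".join(seq)


b_chain = apply_substitutions(DES_OCTA_B_CHAIN, ["H10E", "G20Y"])
a_chain = apply_substitutions(HUMAN_A_CHAIN, ["T8H", "S9R"])
```

Applied this way, H10E and G20Y on the truncated B chain give the sequence recited as SEQ ID NO: 30, and T8H and S9R on the A chain give the sequence recited as SEQ ID NO: 39. Checking the wild-type letter before substituting guards against off-by-one numbering errors.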
In some embodiments, the A chain peptide and B chain peptide are bonded via at least one disulfide bond. In some embodiments, the peptide is a monomer.
In some embodiments, the insulin A chain peptide is at least 70% identical to wild type human insulin A chain peptide.
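Percent identity depends on how the two sequences are compared. The sketch below uses the simplest possible definition, an ungapped position-by-position comparison of two chains of equal length; this definition is an assumption made for illustration, and alignment-based definitions would be needed for chains of different length. The function name `percent_identity` is hypothetical.

```python
HUMAN_A_CHAIN = "GIVEQCCTSICSLYQLENYCN"    # wild type human insulin A chain
VARIANT_A_CHAIN = "GIVEQCCHRICSLYQLENYCN"  # A chain recited as SEQ ID NO: 39


def percent_identity(a: str, b: str) -> float:
    """Ungapped percent identity of two equal-length sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


identity = percent_identity(HUMAN_A_CHAIN, VARIANT_A_CHAIN)
meets_threshold = identity >= 70.0
```

Under this definition the two chains differ at two of 21 positions, so the variant is well above the 70% threshold.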
In yet another aspect, the present invention provides pharmaceutical compositions comprising an insulin analog, peptide or compound as defined herein. In some embodiments, the present invention provides pharmaceutical compositions comprising a peptide comprising an insulin A chain peptide and an insulin B chain peptide, wherein the B chain peptide comprises a substitution at amino acid 10 and amino acid 20 and a pharmaceutically acceptable carrier.
In yet another aspect, the present invention provides methods of increasing insulin receptor activation in a subject comprising administering a therapeutically effective amount of an insulin analog, peptide or compound as defined herein. In some embodiments, the present invention provides methods of increasing insulin receptor activation in a subject comprising administering a therapeutically effective amount of a peptide comprising an insulin A chain peptide and an insulin B chain peptide, wherein the B chain peptide comprises a substitution at amino acid 10 and amino acid 20 to a subject in need thereof.
In yet another aspect, the present invention provides methods of lowering the blood sugar in a subject comprising administering a therapeutically effective amount of an insulin analog, peptide or compound as defined herein. In some embodiments, the present invention provides methods of lowering the blood sugar in a subject comprising administering a therapeutically effective amount of a peptide comprising an insulin A chain peptide and an insulin B chain peptide, wherein the B chain peptide comprises a substitution at amino acid 10 and amino acid 20 to a subject in need thereof.
In yet another aspect, the present invention provides methods of treating type 1 diabetes in a subject comprising administering a therapeutically effective amount of an insulin analog, peptide or compound as defined herein. In some embodiments, the present invention provides methods of treating type 1 diabetes in a subject comprising administering a therapeutically effective amount of a peptide comprising an insulin A chain peptide and an insulin B chain peptide, wherein the B chain peptide comprises a substitution at amino acid 10 and amino acid 20 to a subject in need thereof. In some instances, the subject has been diagnosed with type 1 diabetes prior to administering the peptide.
In still a further aspect, there is provided a therapeutic protein having an A chain peptide bonded to a B chain peptide via at least one disulfide bond, wherein the A chain comprises the sequence of GIVEQCCHRICSLYQLENYCN (SEQ ID NO: 39), and wherein the B chain peptide comprises the sequence of FVNQHLCGSELVEALYLVCYER (SEQ ID NO: 30).
In still a further aspect, the present invention provides a method of redesigning or modifying a polypeptide which is known to bind to an insulin receptor (IR) comprising performing structure-based evaluation of a structure defined by the atomic coordinates of Appendix I or a subset thereof and redesigning or chemically modifying the polypeptide as a result of the evaluation. In some embodiments, the structure-based evaluation comprises comparison of the structure defined by the atomic coordinates of Appendix I or a subset thereof, with the atomic coordinates of insulin or a subset thereof. In some embodiments, the structure-based evaluation further comprises molecular modelling of a complex formed between the structure defined by the atomic coordinates of Appendix I or a subset thereof with the atomic coordinates of an insulin receptor or a subset thereof. In some embodiments, the method further comprises synthesising or obtaining the redesigned or chemically modified polypeptide and testing for its ability to bind IR. In some embodiments, the method further comprises synthesising or obtaining the redesigned or chemically modified polypeptide and determining the ability of the redesigned or chemically modified polypeptide to modulate IR activation. In some embodiments, the method further comprises synthesising or obtaining the redesigned or chemically modified polypeptide and determining the ability of the redesigned or chemically modified polypeptide to lower blood glucose levels. In some embodiments, the polypeptide which is known to bind to IR is insulin. In some embodiments, the insulin is human insulin. In another aspect, there is also provided a polypeptide which has been redesigned or modified by the method as defined herein. In some embodiments, the polypeptide is monomeric.
In another aspect, the present invention provides an isolated molecule which is an IR agonist, wherein the molecule is identified and/or designed based on the 3D structure of Con-Ins G1 defined by the atomic coordinates of Appendix I or a subset thereof. In some embodiments, the molecule is a peptide, polypeptide or peptidomimetic. In some embodiments, the molecule is monomeric. In some embodiments, the molecule has an IC50 against the human IR-B receptor of less than 10⁻⁶ M.
In another aspect, the present invention provides a method of identifying a compound which binds IR, the method comprising:
i) generating a three-dimensional structure model of a polypeptide having
a) a structure defined by the atomic coordinates of Appendix I or a subset thereof, or
b) a structure having a root mean square deviation less than about 2.0 Å when superimposed on the corresponding backbone atoms of a), and
ii) designing or screening for a compound which potentially binds the IR.
In some embodiments, generating a three-dimensional structure model comprises generating a model of the polypeptide bound to IR or regions thereof. In some embodiments, the method further comprises synthesising the compound which potentially binds the IR. In some embodiments, the compound modulates at least one biological activity of IR. In some embodiments, the compound is monomeric. In some embodiments, the method further comprises testing the compound designed or screened for in ii) for its ability to modulate blood glucose levels. In some embodiments, steps i) and ii) are performed in silico.
In another aspect, the present invention provides a computer-based method of identifying a compound which mimics insulin activity, the method comprising
i) generating a three-dimensional structure model of a polypeptide having
a) a structure defined by the atomic coordinates of Appendix I or a subset thereof, or
b) a structure having a root mean square deviation less than about 2.0 Å when superimposed on the corresponding backbone atoms of a), and
ii) designing or screening for a compound which mimics insulin activity.
In some embodiments, generating a three-dimensional structure model comprises generating a model of the polypeptide bound to IR or regions thereof. In some embodiments, the method further comprises synthesising the compound which potentially binds the IR. In some embodiments, the compound modulates at least one biological activity of IR. In some embodiments, the compound is monomeric. In some embodiments, the method further comprises testing the compound designed or screened for in ii) for its ability to modulate blood glucose levels. In some embodiments, steps i) and ii) are performed in silico.
In a further aspect, the present invention provides a compound identified using a method defined herein.
In a further aspect, the present invention provides a crystal of Con-Ins G1 polypeptide having a space group P432 with unit cell dimensions of a = b = c = 74.91 Å with up to about 2% variation in any cell dimension.
In a further aspect, the present invention provides the structure of Con-Ins G1 polypeptide as defined by the atomic coordinates of Appendix I.
In a further aspect, the present invention provides for the use of the structure of Con-Ins G1 polypeptide as defined by the atomic coordinates of Appendix I as a structural model. In some embodiments, the structural model is used for identification of insulin analogs. The present invention also provides insulin analogs identified by the use defined herein.
In yet another aspect, the present invention provides a pharmaceutical composition comprising the insulin analog, polypeptide molecule and/or compound as defined herein.
Any embodiment herein shall be taken to apply mutatis mutandis to any other embodiment unless specifically stated otherwise. For instance, as the skilled person would understand, examples of insulin analogs, peptides and health conditions outlined herein for the methods of the invention equally apply to the use and pharmaceutical compositions of the invention. It is also intended that embodiments of the present invention include manufacturing steps such as incorporating the compound into a pharmaceutical composition in the manufacture of a medicament.
Throughout this specification, unless specifically stated otherwise or the context requires otherwise, reference to a single step, composition of matter, group of steps or group of compositions of matter shall be taken to encompass one and a plurality (i.e. one or more) of those steps, compositions of matter, groups of steps or groups of compositions of matter.
Additional advantages of the disclosed method and compositions will be set forth in part in the description which follows, and in part will be understood from the description, or may be learned by practice of the disclosed method and compositions. The advantages of the disclosed method and compositions will be realized and attained by means of the elements and combinations particularly pointed out in the appended claims.
The invention is hereinafter described by way of the following non-limiting Examples and with reference to the accompanying figures. The present invention is not to be limited in scope by the specific embodiments described herein, which are intended for the purpose of exemplification only. Functionally-equivalent products, compositions and methods are clearly within the scope of the invention, as described herein.
BRIEF DESCRIPTION OF THE ACCOMPANYING FIGURES
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate several embodiments of the disclosed method and compositions and together with the description, serve to explain the principles of the disclosed method and compositions.
Figure 1: Sequence comparison with human insulin. The sequence of Con-Ins G1 and comparison with human insulin. The conserved cysteine residues and aromatic triplet are shaded grey. The disulphide bonds are indicated with a solid line connecting the two cysteine residues. γ: γ-carboxylated glutamate; O: hydroxyproline; *: C-terminal amidation.
Figure 2: Characterization of Con-Ins G1. Competition binding analysis to human IR (isoform B) of Con-Ins G1 (n=9), PTM-free sCon-Ins G1 (n=9) and human insulin (n=21). Error bars depict SEM (when absent, error bars are smaller than marker size). sCon-Ins G1: Con-Ins G1 with a diselenide bond between SecA6 and SecA10.
Figure 3: Insulin signalling measured by Akt phosphorylation analysis. Akt phosphorylation analysis of sCon-Ins G1, PTM-free sCon-Ins G1 and hIns (n=4). Error bars depict SEM (when absent, error bars are smaller than marker size).
Figure 4a: The x-ray crystal structure of Con-Ins G1. Superposition of Con-Ins G1 and hIns (PDB entry 1MSO). The backbones of the Con-Ins G1 B- and A chains are in grey, and the backbones of the hIns B- and A chains are in black and white (respectively).
Figure 4b: The hydrophobic core of Con-Ins G1. Core structure of Con-Ins G1 compared to that of hIns. The backbones of the Con-Ins G1 B- and A chains are in grey, and the backbones of the hIns B- and A chains are in black and white (respectively).
Figure 4c, 4d and 4e: Post-translational modifications of Con-Ins G1. Side chain interactions of GlaA4, GlaB10 and HypB3 (respectively, with that of GlaA4 being compared to that of hIns GluA4). The backbones of the Con-Ins G1 B- and A chains are in grey, and the backbones of the hIns B- and A chains are in black and white (respectively).
Figure 5: The x-ray crystal structure of Con-Ins G1. Stereo image showing the arrangement within the crystal of Con-Ins G1 monomers around the crystallographic four-fold axis. Four sulphate molecules (centre) are modelled, with unrestrained coordinates and an effective occupancy of 0.25 each, into a relatively featureless blob of difference electron density on the four-fold axis. The sulphate ion forms part of a charge-compensated cluster comprising the amino-terminal group of GlyA1 and a side-chain carboxylate of GlaA4 from each Con-Ins G1 monomer.
Figure 6: Con-Ins G1 binding hIR. Molecular model of Con-Ins G1 in the context of the primary insulin binding site of the human insulin receptor. hIns residues B22-B27 (in their hIR-bound form from PDB entry 4OGA) are overlaid in black. The Figure illustrates how the side chain of Con-Ins G1 TyrB15, once rotated from its receptor-free conformation, may together with that of Con-Ins G1 TyrB20 act as a surrogate for that of hIns PheB24 in formation of the Con-Ins G1 / hIR complex. The A chain of Con-Ins G1 (foreground) is transparent for clarity.
Figure 7: TyrB15 and TyrB20. Isosurface representation of the (2mFobs-DFcalc) difference electron density in the vicinity of Con-Ins G1 residues TyrB15 and TyrB20, contoured at the 1.5 σ level. The side chains of both these residues appear disordered compared to those of neighbouring residues (for example, that of TyrA19).
Figure 8: GlaA4 of Con-Ins G1. Schematic diagram showing the interaction of the side chain of Con-Ins G1 PTM residue GlaA4 with the side chain of hIR αCT residue Asn711, as observed within the molecular model of the complex of Con-Ins G1 with the primary binding site of hIR. The molecular surface is that of the hIR L1 domain.
Figure 9: Insulin signalling measured by Akt phosphorylation analysis. Akt phosphorylation analysis of hIns, hIns[DOI] and hIns[TyrB15, TyrB20, DOI].
Figure 10: Characterization of Con-Ins G1. Sedimentation equilibrium analysis of Con-Ins G1 at 30,000 rpm (black points) and 45,000 rpm (grey points) with the best fit (lines) to a single species of apparent MW 5380 ± 55 g/mol.
Figure 11: Con-Ins G1 in co-complex with Fv83-7.IR310.T and IR-A704-719. Two orthogonal views of an overlay of the two copies in the asymmetric unit of the crystal structure of Con-Ins G1 in co-complex with Fv83-7.IR310.T and IR-A704-719. Within each Panel, one copy is shown as a grey Cα trace with relatively thick linkages and the other as a black Cα trace with relatively thin linkages. The overlay is based on common residues within the IR310.T moiety. The CR domains and their attached Fv83-7 are omitted for clarity.
Figure 12: Con-Ins G1 in co-complex with Fv83-7.IR310.T and IR-A704-719. Overlay of the crystal structure of Con-Ins G1 in co-complex with Fv83-7.IR310.T and IR-A704-719 with the crystal structure of hIns in co-complex with Fab83-7.IR310.T and IR-A704-719 (PDB entry 4OGA). The Con-Ins G1 complex is shown as a light grey Cα trace (with thicker lines) and the hIns complex as a black Cα trace (with thinner lines, except for residues B22-B30 which are shown in thick black). The overlay is based on common residues within the IR310.T moiety. The cysteine-rich domain of IR310.T and its attached antibody fragment are omitted for clarity.
Figure 13: TyrB15 and TyrB20. Overlay of hIns in complex with Fab83-7, IR310.T and IR-A704-719 (PDB entry 4OGA; labelled) and Con-Ins G1 in complex with Fv83-7, IR310.T and IR-A704-719 (underlined labels) based on the common domain L1 of IR310.T. The L1 domain of IR310.T is shown in cartoon ribbon representation, while hIns, Con-Ins G1 and IR-A704-719 are shown in Cα trace representation. The CR domain of IR310.T is omitted for clarity. The side chains and Cα atoms of hIR Phe714, hIns LeuB15 and PheB24 and Con-Ins G1 TyrB15 are shown in ball-and-stick representation. The spatial correspondence of the side chains of hIns PheB24 and Con-Ins G1 TyrB15 is evident. No interpretable electron density is present for Con-Ins G1 TyrB20. The respective A chains of Con-Ins G1 and of hIns are omitted for clarity.
Figure 14: Molecular modelling of hIns[DOI] bound to components that comprise the primary binding site (site 1) of the hIR. Molecular model of hIns[DOI] in complex with the IR L1 domain (residues Gly5 to Cys155) and the IR-A704-719 segment (residues Phe705 to Ser719 of the IR-A isoform). The IR-A704-719 segment is shown in cartoon ribbon representation and is coloured dark grey. The hIns[DOI] A chain and B chain are shown in cartoon ribbon representation and are labelled. The transparent molecular surface is that of the hIR L1 domain. The hIR L1 domain is shown in cartoon ribbon representation with the side chain of Tyr67 shown.
Figure 15: Molecular modelling of hIns[TyrB15, DOI] bound to components that comprise the primary binding site (site 1) of the hIR. Molecular model of hIns[TyrB15, DOI] in complex with the IR L1 domain (residues Gly5 to Cys155) and the IR-A segment Phe705 to Ser719 (IR-A704-719). The IR-A704-719 segment is shown in cartoon ribbon representation and is coloured dark grey. The hIns[TyrB15, DOI] A chain and B chain are shown in cartoon ribbon representation and are labelled. The molecular surface is that of the hIR L1 domain. The figure illustrates how the side chain of TyrB15 projects into the hydrophobic core of the DOI-(IR-A704-719)-L1 interface, occupying space otherwise occupied by hIns LeuB15.
Figure 16: Molecular modelling of hIns[TyrB20, DOI] bound to components that comprise the primary binding site (site 1) of the hIR. Molecular model of hIns[TyrB20, DOI] in complex with the IR L1 domain (residues Gly5 to Cys155) and the IR-A704-719 segment. The figure illustrates how the side chain of TyrB20 remained in the hIns B24 binding site, with all other interactions with the receptor appearing native-like. The IR-A704-719 segment is shown in cartoon ribbon representation and is coloured dark grey. The hIns[TyrB20, DOI] A chain and B chain are shown in cartoon ribbon representation and are labelled. The transparent molecular surface is that of the hIR L1 domain. The hIR L1 domain is shown in cartoon ribbon representation with the side chain of Tyr67 shown.
Figure 17: Positional scan of hIns[DOI]. The resultant mutational ΔΔG (kcal/mol) contribution at each site of hIns[DOI].
Figure 18: Positional scan of hIns[TyrB15, DOI]. The resultant mutational ΔΔG (kcal/mol) contribution at each site of hIns[TyrB15, DOI].
Figure 19: Positional scan of hIns[TyrB20, DOI]. The resultant mutational ΔΔG (kcal/mol) contribution at each site of hIns[TyrB20, DOI].
Figure 20: Schematic of insulin multimer equilibrium. Figure shows that insulin monomerization slows absorption rate.
Figure 21: Chemical total synthesis of human DOI insulin. Figure shows the chemical total synthesis of human DOI insulin. Thr-Ser isopeptide (boxed in red) was used to increase the solubility of insulin A chain.
Figure 22: Insulin signalling activation by exemplified insulin analogs. Figure shows the effects of B15 Tyr and B20 Tyr on hIR activation. The sequence for each peptide used is also shown.
Figure 23: Insulin signalling activation by exemplified insulin analogs. Figure shows the effects of B10 Glu, B20 Tyr on hIR activation. The sequence for each peptide used is also shown.
Figure 24: Insulin signalling activation by exemplified insulin analogs. Figure 24A and 24B show peptide sequences/modified amino acids and effects of B20 residues in activating insulin signaling, respectively.
Figure 25: Insulin signalling activation by exemplified insulin analogs. Figure shows the effects of A8 His, A9 Arg on hIR activation. The sequence for each peptide used is also shown.
Figure 26: Insulin signalling activation by exemplified insulin analogs. Figure shows the individual effect of A8, A9, B10 and B20 on hIR activation.
Figure 27: Insulin signalling activation by venom insulins. Figure shows the insulin signaling activation of several venom insulins with similar potencies to Con-Ins G1 (top panel). Sequence alignment of these venom insulins (bottom panel). Residues at position 9 and 10 in the A chain and 10 and 20 in the B chain are highlighted. γ and * denote post-translational modifications (gamma-carboxyglutamate and C-terminal amidation, respectively).
KEY TO THE SEQUENCE LISTING
SEQ ID NOs: 1-41: Insulin analogs, peptides and/or compounds according to embodiments of the present disclosure.
DETAILED DESCRIPTION OF THE INVENTION
General Techniques and Definitions
Unless specifically defined otherwise, all technical and scientific terms used herein shall be taken to have the same meaning as commonly understood by one of ordinary skill in the art (e.g., molecular genetics, pharmacology, protein crystallography, protein chemistry, biochemistry and the like).
Unless otherwise indicated, the techniques utilized in the present invention are standard procedures, well known to those skilled in the art. Such techniques are described and explained throughout the literature in sources such as, J. Perbal, A Practical Guide to Molecular Cloning, John Wiley and Sons (1984), J. Sambrook et al. Molecular Cloning: A Laboratory Manual, Cold Spring Harbour Laboratory Press (1989), T.A. Brown (editor), Essential Molecular Biology: A Practical Approach, Volumes 1 and 2, IRL Press (1991), D.M. Glover and B.D. Hames (editors), DNA Cloning: A Practical Approach, Volumes 1-4, IRL Press (1995 and 1996), and F.M. Ausubel et al. (editors), Current Protocols in Molecular Biology, Greene Pub. Associates and Wiley-Interscience (1988, including all updates until present), Ed Harlow and David Lane (editors) Antibodies: A Laboratory Manual, Cold Spring Harbour Laboratory, (1988), and J.E. Coligan et al. (editors) Current Protocols in Immunology, John Wiley & Sons (including all updates until present).
Throughout this specification the word "comprise", or variations such as "comprises" or "comprising", will be understood to imply the inclusion of a stated element, integer or step, or group of elements, integers or steps, but not the exclusion of any other element, integer or step, or group of elements, integers or steps.
The term "and/or", e.g., "X and/or Y" shall be understood to mean either "X and Y" or "X or Y" and shall be taken to provide explicit support for both meanings or for either meaning. In addition, the articles "a" and "an" as used in this application and the appended claims may generally be construed to mean "one or more" unless specified otherwise or clear from context to be directed to a singular form.
The term "insulin" means human insulin, pig insulin, guinea pig insulin, chicken insulin, mouse insulin, beef insulin or venom insulin. In some embodiments, insulin means human insulin. The term "venom insulin" means a cone snail venom insulin. Preferably, venom insulin means Con-Ins G1.
The term "insulin analog" as used herein refers to any agent that is capable of mimicking the activity of insulin. In some embodiments, the insulin analog is at least an insulin receptor agonist. In some embodiments, the insulin analog binds to the insulin receptor. Preferably insulin analogs may be peptides, polypeptides, proteins or peptidomimetics. In some embodiments, the insulin analog is a peptide. As the person skilled in the art would understand, unless the context indicates otherwise, the terms "insulin analog", "peptide" and "insulin peptide" are used interchangeably. Insulin analogs also include the IR agonists, molecules, compounds and the like identified by the methods disclosed herein.
The term "peptide," as used herein, refers to a polymer of amino acids ranging from two to about fifty amino acids (e.g., 4, 6, 8, 10, 12, 15, 20, 25, 30, 35, 40, or 45 amino acids in length). The term peptide encompasses unmodified peptides, modified peptides, and otherwise chemically derivatized peptides (for example phosphorylated, sulphated, amidated and the like). In some embodiments, the peptide may be an unnatural peptide oligomer, such as those described in Sadowsky et al. (2005) and Sadowsky et al. (2007). The term "polypeptide," or "protein" as used interchangeably herein, refers to a polymer of amino acids generally greater than about 50 amino acids in total length and typically having stable characteristic secondary and tertiary structures. The term "polypeptide" or "protein" may also include a combination of such polymers (for example two or more) associating into a stable tertiary or quaternary structure through either non-covalent or covalent association.
In some embodiments, the peptide, protein or polypeptide comprises amino acids that occur naturally in the subject to be treated. In some embodiments, the peptide or polypeptide comprises one or more unnatural amino acids, modified amino acids or synthetic amino acid analogues. Such amino acids include, but are not limited to, the D-isomers of the common amino acids, 2,4-diaminobutyric acid, α-amino isobutyric acid, 4-aminobutyric acid, 2-aminobutyric acid, 6-amino hexanoic acid, 2-amino isobutyric acid, 3-amino propionic acid, ornithine, norleucine, norvaline, hydroxyproline, sarcosine, citrulline, homocitrulline, cysteic acid, t-butylglycine, t-butylalanine, phenylglycine, cyclohexylalanine, cyclopentylalanine, β-alanine, fluoro-amino acids, designer amino acids such as β-methyl amino acids, Cα-methyl amino acids, Nα-methyl amino acids, and amino acid analogues in general. Also included within the scope are peptides, polypeptides or proteins which are differentially modified during or after synthesis, for example, by biotinylation, benzylation, glycosylation, acetylation, phosphorylation, amidation, derivatization by known protecting/blocking groups, proteolytic cleavage, linkage to an antibody molecule or other cellular ligand, etc. These modifications may serve to increase the stability and/or bioactivity of the peptide, protein or polypeptide.
The term amino acid "modification" refers to a substitution of an amino acid, or the derivation of an amino acid by the addition and/or removal of chemical groups to/from the amino acid, and includes substitution with any of the 20 amino acids commonly found in human proteins, as well as atypical or non-naturally occurring amino acids. Commercial sources of atypical amino acids include Sigma-Aldrich (Milwaukee, Wis.), ChemPep Inc. (Miami, Fla.), and Genzyme Pharmaceuticals (Cambridge, Mass.). Atypical amino acids may be purchased from commercial suppliers, synthesized de novo, or chemically modified or derivatized from naturally occurring amino acids.
As used herein, an amino acid "substitution" refers to the replacement of one amino acid residue by a different amino acid residue. The substituted amino acid may be any of the 20 amino acids commonly found in human proteins, as well as atypical or non-naturally occurring amino acids.
The terms "A chain peptide" and "B chain peptide" are interchangeable with "insulin A chain peptide" and "insulin B chain peptide."
As used herein, reference to a compound that is a "derivative thereof" refers to a compound that is adapted or modified from an ancestral compound and has a similar but new structure and which has a similar biological activity as the ancestral compound. In some embodiments, the ancestral compound is a small molecule, a peptide, polypeptide, protein or an insulin analog as described herein. In some embodiments, the ancestral compound is a peptide, polypeptide, protein or insulin analog which may be modified to include any chemical modification, comprise single or multiple substitutions, deletions and/or additions of any molecules associated with the protein or peptide, such as carbohydrates, lipids and/or proteins or peptides. In one embodiment, "derivatives" of proteins, polypeptides or peptides include those modified analogues resulting from glycosylation, acetylation, phosphorylation, amidation, palmitoylation, myristoylation, isoprenylation, lipidation, alkylation, derivatization, introduction of protective/blocking groups, proteolytic cleavage or binding to an antibody or to another cellular ligand.
Throughout the application, all references to a particular amino acid position by letter and number (e.g. position A5 or B5) refer to the amino acid at that position of either the A chain (e.g. position A5) or the B chain (e.g. position B5) in the respective A chain or B chain of venom insulin Con-Ins G1 from Conus geographus, or the corresponding amino acid position in any analogs thereof. For example, a reference herein to "position B17" absent any further elaboration would mean the corresponding position B15 of the B chain of human insulin, as Con-Ins G1 has two additional N-terminal B chain residues.
As used herein, the phrase "at a position corresponding to amino acid number" refers to the relative position of the amino acid compared to surrounding amino acids with reference to a defined amino acid sequence. For instance, in some embodiments, when compared to human insulin (see Figure 1) the B chain of the insulin analog of the invention may have one or two additional N-terminal amino acids, such as present in Con-Ins G1. In an example, upon performing a protein alignment the skilled person would readily comprehend that the leucine (15th amino acid) of the B chain of naturally occurring human insulin corresponds to the 17th amino acid of the B chain of Con-Ins G1 (see Figure 1). In a preferred embodiment of the invention, this 15th amino acid of the B chain of naturally occurring human insulin is an aromatic residue or a large aliphatic residue and/or the 20th amino acid of the B chain of naturally occurring human insulin is an aromatic residue or a large aliphatic residue.
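By way of illustration only, the positional correspondence described above may be sketched as a fixed offset of two between B chain positions. In the following Python sketch, the human insulin B chain sequence is the commonly published one and is not reproduced in this passage, so it is an assumption here; the Con-Ins G1 B chain follows SEQ ID NO: 23 as given in this specification, with O and γ used as single-character stand-ins for hydroxyproline and γ-carboxylated glutamate. The function names are hypothetical.

```python
# Illustrative sketch only: B chain numbering offset between human insulin and Con-Ins G1.
# O = hydroxyproline, γ = γ-carboxylated glutamate (single-character stand-ins).
HUMAN_B_CHAIN = "FVNQHLCGSHLVEALYLVCGERGFFYTPKT"  # assumption: commonly published human B chain
CON_INS_G1_B_CHAIN = "TFDTOKHRCGSγITNSYMDLCYR"     # SEQ ID NO: 23 as given in this specification
N_TERMINAL_EXTENSION = 2  # additional N-terminal B chain residues in Con-Ins G1


def residue_at(sequence: str, position: int) -> str:
    """Return the residue at a 1-based position in a chain."""
    if not 1 <= position <= len(sequence):
        raise IndexError(f"position {position} is outside the chain")
    return sequence[position - 1]


def g1_index_for_human_position(human_position: int) -> int:
    """Map a human insulin B chain position to the sequential Con-Ins G1 index."""
    return human_position + N_TERMINAL_EXTENSION


if __name__ == "__main__":
    # Print human position/residue alongside the corresponding Con-Ins G1 index/residue.
    for human_pos in (7, 15, 19, 20):
        g1_pos = g1_index_for_human_position(human_pos)
        print(
            human_pos,
            residue_at(HUMAN_B_CHAIN, human_pos),
            g1_pos,
            residue_at(CON_INS_G1_B_CHAIN, g1_pos),
        )
```

A fixed offset is sufficient here only because the two B chains align without internal gaps over the region of interest; for analogs with insertions or deletions, a proper sequence alignment would be needed instead.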
The term "monomeric insulin" refers to insulin and insulin analogs that are less prone to forming higher order species (such as dimers, tetramers, hexamers etc) than human insulin. Preferably, the insulin or insulin analog is fully or substantially monomeric, e.g., at least 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99%, or 100% monomeric.
As would be understood by the person skilled in the art, the term "therapeutic" refers to a treatment, therapy, or drug that can treat a disease or condition or that can ameliorate one or more symptoms associated with a disease or condition. As used herein, a therapeutic can refer to a therapeutic compound, including, but not limited to proteins, peptides, nucleic acids (e.g. CpG oligonucleotides), small molecules, vaccines, allergenic extracts, antibodies, gene therapies, other biologics or small molecules.
As used herein, the term "subject" refers to any organism susceptible to insulin related disorders. As would be understood by the person skilled in the art, the term "subject" and "patient" can be used interchangeably. For example, the subject can be a mammal, avian, arthropod, chordate, amphibian or reptile. Exemplary subjects include but are not limited to human, primate, livestock (e.g. sheep, cow, chicken, horse, donkey, pig), companion animals (e.g. dogs, cats), laboratory test animals (e.g. mice, rabbits, rats, guinea pigs, hamsters), captive wild animal (e.g. fox, deer). In one example, the subject is a mammal. In one example, the subject is human.
The term "treating" as used herein, includes prophylaxis of the specific disorder or condition, or alleviation of the symptoms associated with a specific disorder or condition and/or preventing or eliminating said symptoms. For example, as used herein the term "treating diabetes" will refer in general to maintaining glucose blood levels within acceptable levels and may include increasing or decreasing blood glucose levels depending on a given situation.
As the skilled person would understand, insulin analogs will be administered in a therapeutically effective amount. The terms "effective amount" or "therapeutically effective amount," as used herein, refer to an amount of an insulin analog being administered sufficient to relieve to some extent one or more of the symptoms of the disease or condition being treated. The result can be reduction and/or alleviation of the signs, symptoms, or causes of a disease, or any other desired alteration of a biological system. For example, one desired result would be the prevention or treatment of hyperglycemia. An "effective amount" of an insulin analog is an amount effective to achieve a desired pharmacologic effect or therapeutic improvement without undue adverse side effects. By way of example only, therapeutically effective amounts may be determined by routine experimentation, including but not limited to a dose escalation clinical trial. The term "therapeutically effective amount" includes, for example, a prophylactically effective amount. It is understood that "an effective amount" or "a therapeutically effective amount" can vary from subject to subject, due to variation in metabolism of the compound and in any of age, weight, general condition of the subject, the condition being treated, the severity of the condition being treated, and the judgment of the prescribing physician. Thus, it is not always possible to specify an exact "effective amount." However, an appropriate "effective" amount in any individual case may be determined by one of ordinary skill in the art using routine experimentation. Where more than one therapeutic agent is used in combination, a "therapeutically effective amount" of each therapeutic agent can refer to an amount of the therapeutic agent that would be therapeutically effective when used on its own, or may refer to a reduced amount that is therapeutically effective by virtue of its combination with one or more additional therapeutic agents.
The term "onset" of activity, as used herein refers to the length of time before insulin reaches the blood stream and begins to lower blood glucose levels, "peak" refers to the time period when the insulin analog best lowers blood glucose levels and "duration" refers to how long the insulin continues to work, i.e. lower blood glucose levels. The person skilled in the art would be aware that onset, peak and duration of an insulin analog may vary depending on factors such as the patient, the condition of the patient, and the route of administration.
The term "IR" as used herein includes wild-type IR and variants thereof including allelic variants and naturally occurring mutations and genetically engineered variants. It will be readily apparent to the skilled person that IR may be derived from other species not specifically disclosed herein. Furthermore, the skilled person will have no difficulties identifying such other suitable IR given the known conservation of IR sequences from primitive organisms through to mammals and humans.
Venom insulin crystals
In an aspect, the present invention provides a crystal comprising venom insulin. As used herein, the term "crystal" means a structure (such as a three dimensional (3D) solid aggregate) in which the plane faces intersect at definite angles and in which there is a regular structure (such as internal structure) of the constituent chemical species. The term "crystal" refers in particular to a solid physical crystal form such as an experimentally prepared crystal.
Crystals according to the invention may be prepared using venom insulin from organisms in the genus Conus, such as Conus geographus and Conus tulipa. Some embodiments relate to insulins from the venom of Conus geographus. However, the venom insulin may also be from other species. Typically, these insulins comprise a 20 amino acid A chain and a 23 amino acid B chain; however, the length of the A and B chains can vary. The amino acids in the A and B chains may be post-translationally modified; example post-translational modifications include, but are not limited to, replacement of glutamic acid by γ-carboxylated glutamic acid (also referred to as the conjugate base gamma-carboxyglutamate), replacement of proline by hydroxyproline, amidation of the C-terminus, and replacement of cysteine by selenocysteine. The person skilled in the art will recognize that other post-translational modifications are possible.
In a preferred embodiment the venom insulin is Con-Ins G1 and has the sequence shown below:
A-chain: GVVγHCCHRPCSNAEFKKYC* (SEQ ID NO: 22)
B-chain: TFDTOKHRCGSγITNSYMDLCYR (SEQ ID NO: 23)
where γ is γ-carboxylated glutamic acid, O is hydroxyproline and * indicates that the C-terminus of the A-chain is amidated. However, the insulin polypeptide may also be obtained from other species or a non-native designed sequence.
Crystals may be constructed with wild-type sequences or variants thereof, including naturally occurring mutations as well as genetically engineered variants. Typically, variants have at least 90, 95 or 98% sequence identity with a corresponding wild-type venom insulin.
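As a rough illustration of the identity thresholds, the sketch below computes percent identity for two sequences that are already aligned and of equal length. The specification does not state an alignment algorithm, so this position-by-position definition is an assumption, and the variant sequence is hypothetical.

```python
def percent_identity(ref: str, variant: str) -> float:
    """Percent of identical positions between two pre-aligned, equal-length sequences."""
    if len(ref) != len(variant):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(1 for r, v in zip(ref, variant) if r == v)
    return 100.0 * matches / len(ref)

wild_type = "GVVyHCCHRPCSNAEFKKYC"   # A chain of SEQ ID NO: 22 (y = gamma-carboxyglutamate)
variant = "GVVyHCCHRPCSNAEFKRYC"     # hypothetical single substitution at position 18
identity = percent_identity(wild_type, variant)
print(f"{identity:.1f}% identity")
```

A single substitution in a 20 residue chain leaves 19 of 20 positions identical, so shorter chains tolerate very few changes before falling below the stated thresholds.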
The production of a crystal comprising venom insulin is described below.
In preferred embodiments, the present invention provides a crystal of Con-Ins G1 having a space group P432 with unit cell dimensions of a = b = c = 74.91 Å with up to about 2% variation in any cell dimension.
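The cell-dimension tolerance can be checked numerically. The sketch below treats "about 2%" as a hard cutoff, which is a simplifying assumption; the function name and the test values are hypothetical.

```python
REFERENCE_EDGE = 74.91   # angstroms, a = b = c for the cubic cell described above
TOLERANCE = 0.02         # "up to about 2%" variation, treated here as a hard limit

def cell_within_tolerance(a: float, b: float, c: float) -> bool:
    """True if every cell edge is within TOLERANCE of REFERENCE_EDGE."""
    return all(abs(x - REFERENCE_EDGE) <= TOLERANCE * REFERENCE_EDGE for x in (a, b, c))

close_cell = cell_within_tolerance(75.5, 75.5, 75.5)   # about 0.8% larger
far_cell = cell_within_tolerance(77.0, 77.0, 77.0)     # about 2.8% larger
print(close_cell, far_cell)
```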
In a preferred embodiment, a crystal comprising venom insulin has the atomic coordinates set forth in Appendix I. As used herein, the term "atomic coordinates" refers to a set of values which define the position of one or more atoms with reference to a system of axes. It will be understood by those skilled in the art that atomic coordinates may be varied without significantly affecting the accuracy of models derived therefrom; thus, although the invention provides a very precise definition of a preferred atomic structure, it will be understood that minor variations are envisaged and the claims are intended to encompass such variations. Preferred are variants in which the root mean square deviation (RMSD) of the x, y and z co-ordinates for all backbone atoms other than hydrogen is less than 2.0 Å (preferably less than 1.5 Å, 1.3 Å, 1 Å, 0.7 Å or less than 0.3 Å) compared with the coordinates given in Appendix I. It will be readily appreciated by those skilled in the art that a 3D rigid body rotation and/or translation of the atomic coordinates does not alter the structure of the molecule concerned.
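The last point can be illustrated with a minimal sketch: applying a rotation and a translation to a set of points leaves every interatomic distance unchanged. The coordinates below are hypothetical and are not taken from Appendix I; a rotation about the z axis is used only for brevity.

```python
import math

def rotate_z(p, angle):
    """Rotate point p = (x, y, z) about the z axis by angle (radians)."""
    x, y, z = p
    c, s = math.cos(angle), math.sin(angle)
    return (c * x - s * y, s * x + c * y, z)

def translate(p, t):
    """Shift point p by vector t."""
    return tuple(pi + ti for pi, ti in zip(p, t))

def pairwise_distances(points):
    """All unique interatomic distances, in a fixed order."""
    return [math.dist(p, q) for i, p in enumerate(points) for q in points[i + 1:]]

# Hypothetical coordinates (angstroms), not taken from Appendix I.
atoms = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (2.0, 1.4, 0.3), (3.4, 1.6, -0.8)]
moved = [translate(rotate_z(p, math.radians(37.0)), (10.0, -4.0, 2.5)) for p in atoms]

before = pairwise_distances(atoms)
after = pairwise_distances(moved)
unchanged = all(math.isclose(d0, d1, abs_tol=1e-9) for d0, d1 in zip(before, after))
print(unchanged)
```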
Crystal Structure of Venom Insulin
In further aspects, a crystal structure of a venom insulin, or a region thereof, is also provided. In some embodiments, the venom insulin is Con-Ins G1. In some embodiments, the crystal structure of a venom insulin is the structure of Con-Ins G1 as defined by the atomic coordinates of Appendix I.
The atomic coordinates obtained experimentally for venom insulin are shown in Appendix I. However, a person skilled in the art will appreciate that a set of atomic coordinates determined by X-ray crystallography is not without standard error. Accordingly, any set of structure coordinates for venom insulin that has a root mean square deviation of protein backbone atoms of less than 0.75 Å when superimposed (using backbone atoms) on the atomic coordinates listed in Appendix I shall be considered identical.
The present invention also comprises the atomic coordinates of venom insulin that substantially conform to the atomic coordinates listed in Appendix I. A structure that "substantially conforms" to a given set of atomic coordinates is a structure wherein at least about 50% of such structure has an RMSD of less than about 2.0 Å for the backbone atoms in secondary structure elements in each domain, preferably less than about 1.5 Å for the backbone atoms in secondary structure elements in each domain, and more preferably, less than about 1.3 Å for the backbone atoms in secondary structure elements in each domain, and, in increasing preference, less than about 1.0 Å, less than about 0.7 Å, less than about 0.5 Å, and most preferably, less than about 0.3 Å for the backbone atoms in secondary structure elements in each domain.
In a more preferred embodiment, a structure that substantially conforms to a given set of atomic coordinates is a structure wherein at least about 75% of such structure has the recited RMSD value, and more preferably, at least about 90% of such structure has the recited RMSD value, and most preferably, about 100% of such structure has the recited RMSD value.
In an even more preferred embodiment, the above definition of "substantially conforms" can be extended to include atoms of amino acid side chains. As used herein, the phrase "common amino acid side chains" refers to amino acid side chains that are common to both the structure which substantially conforms to a given set of atomic coordinates and the structure that is actually represented by such atomic coordinates.
It will be appreciated that a set of atomic coordinates for a polypeptide is a relative set of points that define a shape in three dimensions. Thus, it is possible that an entirely different set of coordinates could define a similar or identical shape.
Moreover, slight variations in the individual coordinates will have little effect on overall shape.
The variations in coordinates may be generated due to mathematical manipulations of the structure coordinates. For example, the structure coordinates set forth in Appendix I could be manipulated by crystallographic permutations of the structure coordinates, fractionalisation of the structure coordinates, integer additions or subtractions to sets of the structure coordinates, inversion of the structure coordinates, or any combination thereof.
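Two of these manipulations, fractionalisation and integer additions, can be sketched for a cubic cell, where fractional coordinates are simply the orthogonal coordinates divided by the cell edge. The sketch assumes the conventional orthogonal axes aligned with the cell axes; the atom position and function names are hypothetical.

```python
A_EDGE = 74.91  # angstroms; cubic cell, so all three axes share this length

def to_fractional(xyz):
    """Orthogonal (angstrom) -> fractional coordinates for a cubic cell."""
    return tuple(v / A_EDGE for v in xyz)

def to_orthogonal(frac):
    """Fractional -> orthogonal (angstrom) coordinates for a cubic cell."""
    return tuple(f * A_EDGE for f in frac)

def wrap_into_cell(frac):
    """Map fractional coordinates into [0, 1) by integer subtraction."""
    return tuple(f % 1.0 for f in frac)

atom = (80.00, -5.00, 37.455)                               # hypothetical position, angstroms
frac = to_fractional(atom)
shifted = tuple(f + n for f, n in zip(frac, (1, 0, -2)))    # integer lattice shift
same_site = all(
    abs(a - b) < 1e-9 for a, b in zip(wrap_into_cell(frac), wrap_into_cell(shifted))
)
print(frac, same_site)
```

The shifted coordinates look entirely different as numbers yet describe the same site in the crystal, which is why such variants fall within the same structure.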
Alternatively, modification in the crystal structure due to mutations, additions, substitutions, and/or deletions of amino acids, or other changes in any of the components that make up the crystal could also account for variations in structure coordinates.
Various computational analyses are used to determine whether a molecular complex or a portion thereof is sufficiently similar to all or parts of the structure of the venom insulin described above. Such analyses may be carried out using software known to the person skilled in the art, for example PDBeFOLD (Krissinel and Henrick, 2004), DALI (Holm and Rosenstrom, 2010), LSQMAN (Kleywegt and Jones, 1994) and CHIMERA (Pettersen et al. 2004).
Comparisons typically involve calculation of the optimum translations and rotations required such that the root mean square difference of the fit over the specified pairs of equivalent atoms is an absolute minimum. This number is given in angstroms. Accordingly, structural coordinates of venom insulin within the scope of the present invention include structural coordinates related to the atomic coordinates listed in Appendix I by whole body translations and/or rotations. Accordingly, RMSD values listed above assume that at least the backbone atoms of the structures are optimally superimposed which may require translation and/or rotation to achieve the required optimal fit from which to calculate the RMSD value.
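Written out, the quantity being minimised is the standard least-squares superposition RMSD. The symbols are introduced here for illustration only: $\mathbf{x}_i$ and $\mathbf{y}_i$ are the positions of the $N$ paired equivalent atoms in the two structures, $R$ is a rotation matrix and $\mathbf{t}$ a translation vector.

```latex
\[
\mathrm{RMSD} \;=\; \min_{R,\,\mathbf{t}}
\sqrt{\frac{1}{N}\sum_{i=1}^{N}
\left\lVert R\,\mathbf{x}_i + \mathbf{t} - \mathbf{y}_i \right\rVert^{2}}
\]
```

Because the minimum is taken over all rotations and translations, the value does not depend on the frame in which either coordinate set happens to be expressed.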
In some embodiments, there are also provided subsets of said atomic coordinates listed in Appendix I and subsets that conform substantially thereto. Preferred subsets define one or more regions of the venom insulin, for example: (i) the A chain; (ii) the B chain; (iii) the hydrophobic core (for example, in Con-Ins G1 the hydrophobic core comprises the side chains of residues ValA2, CysA6, CysA11, PheA16, TyrA19, ArgB6, IleB11, TyrB15 and LeuB18); (iv) the PTM and residues interacting with the PTM; (v) the receptor binding surface (for example, the IR binding surface of Con-Ins G1); (vi) TyrB15 and residues interacting with TyrB15; (vii) TyrB20 and residues interacting with TyrB20; (viii) PheA16 and residues interacting with PheA16; and (ix) subsets of residues in the immediate vicinity of the respective A-chain termini.
A three dimensional structure of a venom insulin or region thereof which substantially conforms to a specified set of atomic coordinates can be modelled by a suitable modelling computer program such as MODELER (Sali and Blundell, 1993), as implemented in the Insight II Homology software package (Insight II (97.0), MSI, San Diego), using information, for example, derived from the following data: (1) the amino acid sequence of the venom insulin; (2) the amino acid sequence of the related portion(s) of the protein represented by the specified set of atomic coordinates having a three dimensional configuration; and, (3) the atomic coordinates of the specified three dimensional configuration. A three dimensional structure of a venom insulin which substantially conforms to a specified set of atomic coordinates can also be calculated by a method such as molecular replacement, which is described in detail below.
Structure coordinates/atomic coordinates are typically loaded onto a machine-readable medium for subsequent computational manipulation. Thus models and/or atomic coordinates are advantageously stored on machine-readable media, such as magnetic or optical media and random-access or read-only memory, including tapes, diskettes, hard disks, CD-ROMs and DVDs, flash memory cards or chips, servers and the internet. The machine is typically a computer. The present invention also provides a computer-readable medium having recorded thereon data representing a model and/or the atomic coordinates of Con-Ins G1. Also provided is a computer-readable medium having recorded thereon coordinate data according to Appendix I, or a subset thereof, where said coordinate data define a three dimensional structure of Con-Ins G1 or a region of Con-Ins G1.
The present invention also provides a set of atomic coordinates as shown in Appendix I, or a subset thereof, in which said coordinates define a three dimensional structure of a venom insulin. The structure coordinates/atomic coordinates may be used in a computer to generate a representation, e.g. an image, of the three-dimensional structure of the cone snail insulin crystal which can be displayed by the computer and/or represented in an electronic file.
The structure coordinates/atomic coordinates and models derived therefrom may also be used for a variety of purposes such as drug discovery, biological reagent (binding protein) selection and X-ray crystallographic analysis of other protein crystals. In one aspect, the use of the structure of Con-Ins G1 as a structural model is provided. In some embodiments, the structural model may be used for identification of insulin analogs. The present invention also encompasses insulin analogs identified using the structure of Con-Ins G1 as a structural model.
The three-dimensional structure of Con-Ins G1 or a region thereof may be used to develop models useful for drug design and in silico screening of candidate compounds that interact with and/or modulate IR. Other physicochemical characteristics may also be used in developing the model, e.g. bonding, electrostatics etc.
Generally the term "in silico" refers to the creation in a computer memory, i.e., on a silicon or other like chip. Unless stated otherwise "in silico" means "virtual." When used herein the term "in silico" is intended to refer to screening methods based on the use of computer models rather than in vitro or in vivo experiments.
Molecular replacement
The crystal structure of Con-Ins G1 provided herein may also be used to model/solve the structure of a new crystal using molecular replacement.
In some aspects, the present invention also provides the use of the structure of venom insulin, or a subset thereof, as a structural model.
In some embodiments, the atomic coordinates of venom insulin, such as those set forth in Appendix I, or a region of venom insulin can be used for determining at least a portion of the three-dimensional structure of a molecular complex which contains at least some structural features similar to the venom insulin. In particular, structural information about another crystallised venom insulin or insulin analog may be obtained. This may be achieved by any of a number of well-known techniques, including molecular replacement.
Methods of molecular replacement are generally known by those of skill in the art, for example PHASER (McCoy et al. 2007). Methods of molecular replacement are generally described in Brunger, 1997; Navaza and Saludjian, 1997; Tong and Rossmann, 1997; Bentley, 1997; Lattman, 1985; Rossmann, 1972.
Generally, X-ray diffraction data are collected from the crystal of a crystallised target structure. The X-ray diffraction data is transformed to calculate a Patterson function. The Patterson function of the crystallised target structure is compared with a Patterson function calculated from a known structure (referred to herein as a search structure). The Patterson function of the crystallised target structure is rotated on the search structure Patterson function to determine the correct orientation of the crystallised target structure in the crystal. The translation function is then calculated to determine the location of the target structure with respect to the crystal axes. Once the crystallised target structure has been correctly positioned in the unit cell, initial phases for the experimental data can be calculated. These phases are necessary for calculation of an electron density map from which structural differences can be observed and for refinement of the structure. Preferably, the structural features (e.g., amino acid sequence, conserved di-sulphide bonds, and beta-strands or beta-sheets) of the search molecule are related to the crystallised target structure.
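For reference, the Patterson function mentioned above has a standard textbook form that depends only on the measured amplitudes and not on the phases. The symbols are introduced here for illustration: $V$ is the unit cell volume, $F(hkl)$ the structure factor for reflection $hkl$, and $(u, v, w)$ a position in Patterson space.

```latex
\[
P(u, v, w) \;=\; \frac{1}{V}\sum_{h}\sum_{k}\sum_{l}
\left| F(hkl) \right|^{2}
\cos\!\left[ 2\pi \left( hu + kv + lw \right) \right]
\]
```

Since only $|F(hkl)|^{2}$ enters, the map can be computed directly from diffraction intensities, which is why it is usable before any phases are known.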
The electron density map can, in turn, be subjected to any well-known model building and structure refinement techniques to provide a final, accurate structure of the unknown crystallised molecular complex (e.g. see Jones et al. 1991; Brunger et al. 1998).
Obtaining accurate values for the phases, by methods other than molecular replacement, is a time-consuming process that involves iterative cycles of approximations and refinements and greatly hinders the solution of crystal structures. However, when the crystal structure of a protein containing at least a homologous portion has been solved, the phases from the known structure provide a satisfactory estimate of the phases for the unknown structure. By using molecular replacement, all or part of the structure coordinates of venom insulin provided herein (and set forth in Appendix I) can be used to determine the structure of a crystallised molecule/molecular complex whose structure is unknown more rapidly and efficiently than attempting to determine such information ab initio.
The structure of any portion of any crystallised molecule/molecular complex that is sufficiently homologous to any portion of the venom insulin may be solved by this method. This method is especially useful in determining the structure of insulin analogs that were designed using the methods described herein.
All of the molecules/molecular complexes referred to herein may be studied using well-known X-ray diffraction techniques and may be refined versus 1.5-3.5 Å resolution X-ray data to an R value of about 0.25 or less using computer software, such as X-PLOR (Yale University, distributed by Molecular Simulations, Inc.; see Brunger, 1996), REFMAC (Murshudov et al. 1997), PHENIX (Adams et al. 2010) and BUSTER (distributed by Global Phasing Ltd, Bricogne et al. 2011). This information may thus be used to optimize known insulin analogs, and more importantly, to design new or improved insulin analogs.
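The R value referred to here is conventionally the summed absolute difference between observed and calculated structure-factor amplitudes, divided by the summed observed amplitudes. The sketch below is a minimal illustration that assumes the two amplitude lists are already on a common scale; the amplitudes are hypothetical and the function is not the API of any refinement package named above.

```python
def r_value(f_obs, f_calc):
    """Conventional crystallographic R value for amplitudes already on a common scale."""
    if len(f_obs) != len(f_calc) or not f_obs:
        raise ValueError("need two non-empty amplitude lists of equal length")
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    return numerator / denominator

# Hypothetical structure-factor amplitudes, for illustration only.
f_obs = [120.0, 85.0, 40.0, 200.0, 15.0]
f_calc = [110.0, 90.0, 35.0, 210.0, 20.0]
r = r_value(f_obs, f_calc)
print(round(r, 3), r <= 0.25)
```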
Methods for Crystallising Venom Insulin
The present invention also provides a method for producing crystals comprising venom insulin. Crystal forms of venom insulin disclosed herein can be obtained by the following crystallization methods.
In one aspect, the present invention provides a method for crystallizing venom insulin comprising the steps:
(a) providing an aqueous solution of venom insulin;
(b) optionally, concentrating the solution of step (a); and
(c) diluting the solution of step (a) or step (b) with a precipitant solution so that the final concentration of the venom insulin is in the range of 0.5 mg/mL to 10 mg/mL.
The aqueous solution of venom insulin provided in step (a) may be buffered or unbuffered. Any suitable buffer known to a person skilled in the art may be used. Non-limiting examples of buffers for the venom insulin solution include Tris-HCl, sodium phosphate, and triethanolamine buffer. If the desired pH value is not obtained, the pH can be adjusted by addition of a base or acid such as ammonium chloride, sodium acetate, sodium hydroxide, potassium hydroxide, or hydrochloric acid. Preferably, the solution of step (a) contains 10 mM HCl.
Optionally, the solution may then be concentrated. In some embodiments, the solution is concentrated by centrifugal filtration. However, any suitable technique known to a person skilled in the art may be used. After the concentration step (b) the concentration of the venom insulin should be in the range of 1.0 mg/mL to 20 mg/mL. Preferably, the concentration of the venom insulin is approximately 4.0 mg/mL. This is the concentration of the venom insulin at the end of step (b) before dilution with the buffered precipitant solution of step (c). The concentration of the venom insulin during or after the concentration step (b) can be determined, for instance, by the Bradford protein assay or by other routine methods including UV absorbance spectroscopy.
In step (c) a precipitant solution is added to the solution of step (b). Preferably the solution of step (b) is diluted with the precipitant solution at a ratio of between 1:4 and 4:1 (solution of step (b):buffered precipitant solution), most preferably at a ratio of 1:1.
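The arithmetic linking steps (b) and (c) can be sketched as follows, assuming that volumes are additive on mixing. The function names are hypothetical; the numbers are the preferred values quoted above.

```python
def diluted_concentration(protein_mg_ml: float, protein_parts: float,
                          precipitant_parts: float) -> float:
    """Concentration after mixing protein solution with precipitant at a volume ratio."""
    return protein_mg_ml * protein_parts / (protein_parts + precipitant_parts)

def in_target_range(conc_mg_ml: float, low: float = 0.5, high: float = 10.0) -> bool:
    """True if the final concentration lies in the range recited for step (c)."""
    return low <= conc_mg_ml <= high

final = diluted_concentration(4.0, 1, 1)   # preferred 4.0 mg/mL at the preferred 1:1 ratio
print(final, in_target_range(final))

# The two extremes of the recited ratio range, again starting from 4.0 mg/mL.
most_dilute = diluted_concentration(4.0, 1, 4)
least_dilute = diluted_concentration(4.0, 4, 1)
print(most_dilute, least_dilute)
```

Starting from the preferred 4.0 mg/mL, every ratio between 1:4 and 4:1 gives a final concentration inside the 0.5 mg/mL to 10 mg/mL window of step (c).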
The precipitant solution may comprise at least one small organic amphiphilic molecule and/or at least one inorganic salt. In some embodiments, the small organic amphiphilic molecule may be selected from the group comprising or consisting of glycerol, ethylene glycol, butyl ether, benzamidine, dioxane, ethanol, isopropanol, butanol, pentanol, methyl pentanediol, pentanediol, hexanediol, heptanetriol, dithiothreitol, MES, Tris, Tris-HCl, Bis-Tris, imidazole, bicine, trimethylamine N-oxide, succinic acid, DL-malate, CHES, CAPS, glycine, CAPSO, ADA, MOPS, Bis-Tris propane, SPG, MIB, PCB, MMT, N-[2-hydroxyethyl]piperazine-N'-[2-ethanesulfonic acid] (HEPES), TRIS, sucrose and combinations thereof. In a preferred embodiment, the small organic amphiphilic molecule comprises DL-malate, TRIS and MES. In some embodiments, the small organic amphiphilic molecules may act as a buffer. In some embodiments, the pH of the small organic amphiphilic molecules in solution may be adjusted to between about 2.0 and about 11.0 and any pH in between, for example 2.0, 2.5, 3.0, 3.5, 4.0, 4.5, 5.0, 5.5, 6.0, 6.5, 7.0, 7.5, 8.0, 8.5, 9.0, 9.5, 10.0, 10.5 and 11.0. In some embodiments, the pH is between about 8.0 and about 10.0. In a preferred embodiment, the pH is 9.0.
In some embodiments, the amount of all small organic amphiphilic molecules in the precipitant solution is between 1% weight/volume and 50% weight/volume, preferably between 5% weight/volume and 20% weight/volume, and more preferably between 8% weight/volume and 12% weight/volume of the small organic amphiphilic molecule in regard to the total weight of the buffered precipitant solution. In a preferred embodiment, the buffered precipitant solution comprises 10% weight/volume of the small organic amphiphilic molecule.
The term "at least one small organic amphiphilic molecule" means that mixtures of the afore-mentioned small organic amphiphilic molecules may be used. In case mixtures of small organic amphiphilic molecules are used, the amount used refers to the total amount of all small organic amphiphilic molecules together.
The precipitant solution may also comprise an inorganic salt. In some embodiments, the inorganic salt is selected from the group comprising or consisting of ammonium chloride, ammonium sulphate, ammonium acetate, ammonium fluoride, ammonium bromide, ammonium iodide, ammonium nitride, cadmium chloride, cadmium sulphate, calcium chloride, calcium acetate, cesium chloride, cesium sulphate, cobalt chloride, ferric chloride, lithium acetate, lithium chloride, lithium nitrate, lithium sulphate, magnesium acetate, magnesium formate, magnesium nitrate, nickel chloride, potassium acetate, potassium bromide, potassium fluoride, potassium formate, potassium iodide, potassium nitrate, potassium thiocyanate, potassium/sodium tartrate, sodium acetate, sodium bromide, sodium fluoride, sodium iodide, sodium nitrate, sodium phosphate, sodium sulphate, sodium thiocyanate, zinc chloride, zinc sulphate, zinc acetate, magnesium chloride, sodium chloride, sodium cacodylate, sodium hydroxide, potassium chloride, potassium hydroxide, potassium sulphate, sodium tartrate, ammonium formate, di-ammonium tartrate, sodium malonate, di-sodium tartrate, sodium succinate and combinations thereof. In a preferred embodiment, the inorganic salt is ammonium sulphate.
In some embodiments, the amount of the at least one inorganic salt in the precipitant solution is between about 50 mM and about 4000 mM, between about 1000 mM and about 3000 mM, between about 1500 mM and about 2500 mM or about 2000 mM. In case more than one inorganic salt is used, the afore-mentioned concentration refers to the concentration of all inorganic salts together and not to the concentration of each single salt used in the mixture of inorganic salts.
In some embodiments, the precipitant solution may be buffered. Any buffer known to a person skilled in the art may be used. In some embodiments, the buffer comprises or consists of citric acid, sodium acetate, sodium citrate, sodium cacodylate, HEPES sodium, TRIS HCl, CAPSO, CAPS, sodium malate, sodium MES and the like and combinations thereof. In some embodiments, the precipitant solution may have a pH between about 2.0 and about 11.0 and any pH in between, for example 2.0, 2.5, 3.0, 3.5, 4.0, 4.5, 5.0, 5.5, 6.0, 6.5, 7.0, 7.5, 8.0, 8.5, 9.0, 9.5, 10.0, 10.5 and 11.0. In some embodiments, the precipitant solution may have a pH between about 8.0 and about 10.0. In a preferred embodiment, the precipitant solution may have a pH of 9.0.
In some embodiments, the buffered precipitant solution contains more than 1 M of at least one inorganic salt and/or between 5% by weight and 20% by weight of at least one small organic amphiphilic molecule. In a preferred embodiment, the precipitant solution comprises 2.0 M ammonium sulphate and 10% DL-malate-MES-Tris (pH 9.0).
In theory, at least one precipitating agent in the diluted solution of step (c) competes with the protein molecules for water, thus leading to supersaturation of the protein. Crystals can normally only grow from supersaturated states, and thus they can grow from precipitates. Salts, polymers, and organic solvents are suitable precipitating agents. In addition to the components of the buffered precipitant solution listed above, the solution of step (c) may contain further precipitating agents.
In some embodiments, the hanging drop or the sitting drop methods are used for crystallization. The "hanging drop vapor diffusion" technique is the most popular method for the crystallization of macromolecules. By this method, a drop composed of a mixture of sample and reagent is placed in vapor equilibration with a liquid reservoir of reagent. Typically the drop contains a lower reagent concentration than the reservoir.
To achieve equilibrium, water vapor leaves the drop and eventually ends up in the reservoir. As water leaves the drop, the sample undergoes an increase in relative supersaturation. Both the sample and reagent increase in concentration as water leaves the drop for the reservoir. Equilibration is reached when the reagent concentration in the drop is approximately the same as that in the reservoir.
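The equilibration described above can be put into numbers with an idealised model. The sketch assumes that only water leaves the drop, that the reservoir is large enough for its concentration to stay constant, and that volumes are additive; the drop volumes are hypothetical.

```python
def equilibrate_drop(protein_mg_ml, reservoir_molar, protein_vol_ul, reservoir_vol_ul):
    """Idealised hanging-drop model: only water leaves the drop and the
    reservoir concentration stays constant."""
    drop_vol = protein_vol_ul + reservoir_vol_ul
    start_precip = reservoir_molar * reservoir_vol_ul / drop_vol
    start_protein = protein_mg_ml * protein_vol_ul / drop_vol
    shrink = start_precip / reservoir_molar          # final volume / initial volume
    return {
        "start_precipitant_M": start_precip,
        "start_protein_mg_ml": start_protein,
        "final_volume_ul": drop_vol * shrink,
        "final_protein_mg_ml": start_protein / shrink,
    }

# 1 uL of 4.0 mg/mL protein mixed with 1 uL of 2.0 M reservoir (hypothetical volumes).
result = equilibrate_drop(4.0, 2.0, 1.0, 1.0)
print(result)
```

In this idealisation a 1:1 drop starts at half the reservoir precipitant concentration and loses half its volume, so the protein returns to its pre-mixing concentration while the precipitant reaches the full reservoir value.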
Insulin Analogs
Wild type insulin comprises an A chain peptide and a B chain peptide. Wild type human insulin A chain is represented by the sequence GIVEQCCTSICSLYQLENYCN (SEQ ID NO: 24). Wild type human insulin B chain is represented by the sequence FVNQHLCGSHLVEALYLVCGERGFFYTPKT (SEQ ID NO: 25).
The present inventors have determined the three-dimensional structure of Con-Ins G1, a monomeric insulin that lacks an equivalent to the aromatic triplet PheB24-PheB25-TyrB26 of human insulin. Without wishing to be bound by theory, it is thought that the side chain of TyrB15 may compensate for the absence of the critical human insulin PheB24 in terms of IR engagement. It is also thought that the side chain of Con-Ins G1 TyrB20 may be involved in compensating for the lack of an equivalent to human insulin PheB24. The potential importance of these residues could not be predicted based on sequence analysis. The structural findings provided herein provide a platform for the design of a novel class of therapeutic human insulin analogues that are intrinsically monomeric and rapid-acting.
In one aspect, the present invention provides an insulin analog comprising an A chain peptide and a B chain peptide, wherein the B chain comprises an aromatic or large aliphatic residue at a position corresponding to amino acid number 15 of the B chain of human insulin and/or an aromatic or large aliphatic residue at a position corresponding to amino acid number 20 of the B chain of human insulin, wherein the analog comprises at least one amino acid found in human insulin but lacking in the corresponding position of Conus geographus insulin, and wherein the A chain peptide and the B chain peptide are bonded together across at least one pair of cysteine residues. In some embodiments, the aromatic residue or large aliphatic residue can be a natural or a non-natural amino acid.
The co-crystal structure of an insulin-IR complex revealed that the side chain of PheB24 plays a unique role as an "anchor" within a nonpolar pocket (referred to as the B24 related binding pocket) defined by the IR and insulin B chain (Menting et al. 2014). Without wishing to be bound by theory, the present invention envisions that the large aliphatic or aromatic substitutions at position 15 and/or position 20 of the B chain of human insulin may compensate for the lack of PheB24 by inserting in the B24 related binding pocket. It is contemplated that the side-chains of large aliphatic or aromatic residues may be physically and chemically compatible with the B24 related binding pocket.
In some embodiments, the aromatic or large aliphatic residue at a position corresponding to amino acid number 15 of the B chain of human insulin is selected from the group consisting of tyrosine, phenylalanine, 4-methylphenylalanine, histidine, tryptophan, methionine, cyclopentylalanine and cyclohexylalanine. In some embodiments, the aromatic or large aliphatic residue at a position corresponding to amino acid number 20 of the B chain of human insulin is selected from the group consisting of tyrosine, phenylalanine, 4-methylphenylalanine, histidine, tryptophan, methionine, cyclopentylalanine and cyclohexylalanine. The aromatic residue or large aliphatic residue may be a natural or non-natural amino acid.
In some embodiments, the B chain comprises an aromatic residue at a position corresponding to amino acid number 15 of the B chain of human insulin and/or an aromatic residue at a position corresponding to amino acid number 20 of the B chain of human insulin. The aromatic residue may be a natural or non-natural amino acid. For example, the aromatic amino acid can be tyrosine, phenylalanine, tryptophan, histidine, 4-acetylphenylalanine and the like.
In some embodiments, the insulin analog has a tyrosine at a position corresponding to amino acid number 15 of the B chain of human insulin. In some embodiments, the insulin analog has a tyrosine at a position corresponding to amino acid number 20 of the B chain of human insulin. In some embodiments, the insulin analog has a tyrosine at a position corresponding to amino acid number 15 of the B chain of human insulin and/or a tyrosine at a position corresponding to amino acid number 20 of the B chain of human insulin. In some embodiments, the insulin analog has a phenylalanine at a position corresponding to amino acid number 15 of the B chain of human insulin and/or a phenylalanine at a position corresponding to amino acid number 20 of the B chain of human insulin. In some embodiments, the insulin analog has a tryptophan at a position corresponding to amino acid number 15 of the B chain of human insulin and/or a tryptophan at a position corresponding to amino acid number 20 of the B chain of human insulin. In some embodiments, the insulin analog has a 4-acetylphenylalanine at a position corresponding to amino acid number 15 of the B chain of human insulin and/or a 4-acetylphenylalanine at a position corresponding to amino acid number 20 of the B chain of human insulin.
In some embodiments, the B chain comprises a large aliphatic residue at a position corresponding to amino acid number 15 of the B chain of human insulin and/or a large aliphatic residue at a position corresponding to amino acid number 20 of the B chain of human insulin. As used herein, a "large aliphatic" residue has a side-chain that is larger than the leucine (naturally occurring in human insulin at position 15 and 20) side-chain. For example, in some embodiments the side-chain of the large aliphatic residue may have the same number or more non-hydrogen atoms compared to leucine. In some embodiments, the side-chain of the large aliphatic residue has a greater side-chain volume when compared to leucine. In some embodiments, the side-chain of the large aliphatic residue has a greater molecular weight when compared to leucine. In some embodiments, the side-chain of the large aliphatic residue has more conformational flexibility when compared to leucine. The "large aliphatic" residue may be a natural or non-natural amino acid. For example, the large aliphatic residue may be methionine, isoleucine, cyclopentylalanine or cyclohexylalanine and the like.
In some embodiments, the insulin analog has a methionine at a position corresponding to amino acid number 15 of the B chain of human insulin. In some embodiments, the insulin analog has a methionine at a position corresponding to amino acid number 20 of the B chain of human insulin. In some embodiments, the insulin analog has a cyclohexylalanine at a position corresponding to amino acid number 15 of the B chain of human insulin and/or a cyclohexylalanine at a position corresponding to amino acid number 20 of the B chain of human insulin. In some embodiments, the insulin analog has a cyclopentylalanine at a position corresponding to amino acid number 15 of the B chain of human insulin and/or a cyclopentylalanine at a position corresponding to position 20 of the B chain of human insulin.
In some embodiments, the insulin analogs have modified B chains which lack the aromatic triplet PheB24-PheB25-TyrB26 thought essential for IR binding, and have, in most cases, shorter B-chains compared to human insulin. In some embodiments, the B chain is truncated at the C-terminal end when compared to human insulin. In some embodiments, the B chain is lacking one or more of the nine C-terminal amino acids of human insulin, for example, the B chain is lacking one, two, three, four, five, six, seven, eight or nine C-terminal amino acids of human insulin. In some embodiments, the B chain is at least lacking PheB24 of human insulin. In some embodiments, the B chain is at least lacking the human B chain aromatic triplet (amino acids PheB24-PheB25-TyrB26 of human insulin). These residues may be absent or may be substituted such that the amino acids at a position corresponding to amino acid number 24, 25 and/or 26 of the B chain of human insulin are not phenylalanine, phenylalanine or tyrosine, respectively.
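The truncations described above can be enumerated against the human B chain sequence given earlier (SEQ ID NO: 25). The sketch below is illustrative only; the helper names are hypothetical and positions use the 1-based numbering of the text.

```python
HUMAN_B = "FVNQHLCGSHLVEALYLVCGERGFFYTPKT"   # SEQ ID NO: 25

def residue(chain: str, position: int) -> str:
    """Residue at a 1-based position."""
    return chain[position - 1]

def truncate_c_terminus(chain: str, n_removed: int) -> str:
    """Remove n_removed residues from the C-terminal end."""
    if not 0 <= n_removed <= len(chain):
        raise ValueError("n_removed out of range")
    return chain[: len(chain) - n_removed]

triplet = "".join(residue(HUMAN_B, p) for p in (24, 25, 26))
print("B24-B26:", triplet)

for n in range(1, 10):
    shortened = truncate_c_terminus(HUMAN_B, n)
    status = "lacks PheB24" if len(shortened) < 24 else "retains PheB24"
    print(n, shortened, status)
```

With pure C-terminal truncation, PheB24 is only removed once seven or more residues are deleted; shorter truncations would need a substitution at B24 to meet the embodiments that lack it.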
In some embodiments, the insulin analog comprises an A chain peptide comprising the sequence Gly-XA2-XA3-XA4-XA5-CysA6-CysA7-XA8-XA9-XA10-CysA11-XA12-XA13-XA14-XA15-XA16-XA17-XA18-XA19-CysA20-XA21-XA22-XA23-XA24-XA25-XA26-XA27-XA28-XA29-XA30-XA31-XA32-XA33-XA34, wherein XA2 = Val or Ile; XA3 = Val or Ala; XA4 = Glu, Asp, gamma carboxyglutamate or Cys; XA5 = Gln, Glu, gamma carboxyglutamate, His or Val; CysA6, CysA7, and CysA11 are independently Cys or selenocysteine; XA8 = Thr, His, Asp, Gln, Tyr, Lys, Ala or Val; XA9 = Ser, Arg, Asn, Gly, His or Lys; XA10 = Ile, Pro, Tyr, Ala, Ser, Val, Phe, His or Thr; XA12 = Ser or Thr; XA13 = Leu, Asn, Val, Arg or Asp; XA14 = Tyr, Ala, Gln, His, Asp or Glu; XA15 = Gln, Glu or Thr; XA16 = Phe, Leu, or Ala; XA17 = Glu, Gln, Lys, Arg, Ile, Met, Thr or Ser; XA18 = Lys, Ser, Thr, Asn, Gln or Glu; XA19 = Tyr or Phe; CysA20 = Cys, selenocysteine, amidated Cys, or amidated selenocysteine; XA21 = Asn, Pro, His, Ser, Gly, Ala, or is absent; XA22 = Pro, Asn, Thr, Leu, Ser or is absent; XA23 = Thr, Leu, Val, Ser or is absent; XA24 = Arg, Thr, Met, Gln, Leu or is absent; XA25 = Glu, Gly or is absent; XA26 = Ser, Leu or is absent; XA27 to XA31 are independently Ser or are absent; XA32 = Ala, Ser or is absent; XA33 = Ala, Val or is absent; and XA34 = Ala or is absent (SEQ ID NO: 1); and
a B chain peptide comprising the sequence XB1-XB2-XB3-XB4-XB5-XB6-XB7-XB8-CysB9-XB10-XB11-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23-XB24-XB25-XB26-XB27-XB28-XB29-XB30-XB31-XB32-XB33-XB34-XB35-XB36-XB37-XB38-XB39, wherein XB1 = Thr, Asn, Ser or is absent; XB2 = Phe, Ser, Asn, Thr, Gln or is absent; XB3 = Ala, Asp, Gly, Pro, Leu, Phe, or His; XB4 = Ala, Thr, Pro, Asp, Val or Gly; XB5 = Asn, Pro, His, Thr, Arg, Lys, Ser or hydroxyproline; XB6 = Lys, Glu, Asn, Asp, Arg, Gln or Gly; XB7 = His, Tyr, Arg or Ile; XB8 = Arg, Thr, Ile, Ser, Leu, Tyr or Lys; CysB9 = Cys or selenocysteine; XB10 = Gly, Gln or Asp; XB11 = Ser, Leu, Gly or Pro; XB12 = His, Glu, gamma carboxyglutamate, Asp, or Asn; XB13 = Ile, Leu, Asp, Val or Ala; XB14 = Thr, Ala, Pro, Val or Arg; XB15 = Asn, Asp, Ala, Val, Thr, Pro or Glu; XB16 = Ala, Ser, Gln, His, Thr, Tyr, Arg or Gly; XB17 = Thr, Tyr, Pro, Leu or Gly; XB18 = Tyr, Met, Val, Gln, Ile, Asp, Gly, Asn or Leu; XB19 = Leu, Asp, Gln, Gly, Lys, Glu, Arg, Ser or Thr; XB20 = Val, Leu or Lys; CysB21 = Cys, amidated Cys, selenocysteine or amidated selenocysteine; XB22 = Val, Tyr, Phe, His, Gly, Gln, Leu, amidated His, amidated Val or is absent; XB23 = Glu, Asp, Arg, Ser, Gly or is absent; XB24 = Arg, Asp, Val or is absent; XB25 = Gly, Leu, Val or is absent; XB26 = Phe, Val, Ile or is absent; XB27 = Phe, Asn, Pro, Glu or is absent; XB28 = Tyr, Cys, His or is absent; XB29 = Thr, His, Ile, Leu, Ser, Tyr or is absent; XB30 = Pro, Glu, Leu, Ile, Arg or is absent; XB31 = Ile, Lys or is absent; XB32 = Ala, Asp, Ser, Thr, Lys, Leu, Gln or is absent; XB33 = Cys or is absent; XB34 = Glu, Pro, Val or is absent; XB35 = Glu, Gly or is absent; XB36 = Glu, Gly or is absent; XB37 = Glu, Val or is absent; XB38 = Ala, Asp or is absent; and XB39 = Ala or is absent (SEQ ID NO: 2).
In some embodiments, the insulin analog comprises a B chain peptide comprising the sequence XB1-XB2-XB3-XB4-XB5-XB6-XB7-XB8-CysB9-XB10-XB11-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23-XB24-XB25-XB26-XB27-XB28-XB29-XB30-XB31-XB32-XB33-XB34-XB35-XB36-XB37-XB38-XB39, wherein XB1 = Thr, Asn, Ser or is absent; XB2 = Phe, Ser, Asn, Thr, Gln or is absent; XB3 = Asp, Gly, Pro, Leu, Phe, or His; XB4 = Thr, Pro, Asp, Val or Gly; XB5 = Asn, Pro, His, Thr, Arg, Ser or hydroxyproline; XB6 = Lys, Glu, Asn, Asp, Arg, Gln or Gly; XB7 = His, Tyr, Arg or Ile; XB8 = Arg, Thr, Ile, Ser, Leu, Tyr or Lys; CysB9 = Cys or selenocysteine; XB10 = Gly, Gln or Asp; XB11 = Ser, Leu, Gly or Pro; XB12 = His, Glu, gamma carboxyglutamate, Asp, or Asn; XB13 = Ile, Leu, Asp, Val or Ala; XB14 = Thr, Ala, Pro, Val or Arg; XB15 = Asn, Asp, Ala, Val, Thr, Pro or Glu; XB16 = Ala, Ser, Gln, His, Tyr, Arg or Gly; XB17 = Tyr or Leu; XB18 = Tyr, Met, Val, Gln, Ile, Asp, Gly, Asn or Leu; XB19 = Leu, Asp, Gln, Gly, Lys, Glu, Arg or Thr; XB20 = Val, Leu or Lys; CysB21 = Cys, amidated Cys, selenocysteine or amidated selenocysteine; XB22 = Tyr, Phe or His; XB23 = Glu, Arg, Ser, Gly or is absent; XB24 = Arg, Asp, Val or is absent; XB25 = Gly, Leu, Val or is absent; XB26 = Phe, Val, Ile or is absent; XB27 = Phe, Asn, Pro, Glu or is absent; XB28 = Tyr, Cys, His or is absent; XB29 = Thr, His, Leu, Tyr or is absent; XB30 = Pro, Glu, Leu, Ile, Arg or is absent; XB31 = Ile, Lys or is absent; XB32 = Thr, Lys, Leu, Gln or is absent; XB33 = Cys or is absent; XB34 = Glu, Pro, Val or is absent; XB35 = Glu, Gly or is absent; XB36 = Glu, Gly or is absent; XB37 = Glu, Val or is absent; XB38 = Ala, Asp or is absent; and XB39 = Ala or is absent (SEQ ID NO: 3).
In some embodiments, the insulin analog comprises a B chain peptide comprising the sequence XB1-XB2-XB3-XB4-XB5-XB6-XB7-XB8-CysB9-XB10-XB11-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23-XB24-XB25-XB26-XB27-XB28-XB29-XB30-XB31-XB32-XB33-XB34-XB35-XB36-XB37-XB38-XB39, wherein XB1 = Thr, Asn, Ser or is absent; XB2 = Phe, Ser, Asn, Thr, Gln or is absent; XB3 = Asp, Gly, Pro, Leu, Phe, or His; XB4 = Thr, Pro, Asp, Val or Gly; XB5 = Asn, Pro, His, Thr, Arg, Ser or hydroxyproline; XB6 = Lys, Glu, Asn, Asp, Arg, Gln or Gly; XB7 = His, Tyr, Arg or Ile; XB8 = Arg, Thr, Ile, Ser, Leu, Tyr or Lys; CysB9 = Cys or selenocysteine; XB10 = Gly, Gln or Asp; XB11 = Ser, Leu, Gly or Pro; XB12 = His, Glu, gamma carboxyglutamate, Asp, or Asn; XB13 = Ile, Leu, Asp, Val or Ala; XB14 = Thr, Ala, Pro, Val or Arg; XB15 = Asn, Asp, Ala, Val, Thr, Pro or Glu; XB16 = Ala, Ser, Gln, His, Tyr, Arg or Gly; XB17 = Tyr or Leu; XB18 = Tyr, Met, Val, Gln, Ile, Asp, Gly, Asn or Leu; XB19 = Leu, Asp, Gln, Gly, Lys, Glu, Arg or Thr; XB20 = Val, Leu or Lys; CysB21 = Cys, amidated Cys, selenocysteine or amidated selenocysteine; XB22 = Tyr, Phe or His; XB23 = Glu, Arg, Ser, Gly or is absent; and XB24, XB25, XB26, XB27, XB28, XB29, XB30, XB31, XB32, XB33, XB34, XB35, XB36, XB37, XB38 and XB39 are absent (SEQ ID NO: 4).
In some embodiments, the insulin analog comprises a B chain peptide comprising the sequence XB1-XB2-XB3-XB4-XB5-XB6-XB7-XB8-CysB9-Gly-Ser-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23-XB24-XB25-XB26-XB27-XB28-XB29-XB30-XB31-XB32-XB33-XB34-XB35-XB36-XB37-XB38-XB39, wherein XB1 = Thr, Asn, Ser or is absent; XB2 = Phe, Ser, Asn, Thr, Gln or is absent; XB3 = Asp, Gly, Pro, Leu, Phe, or His; XB4 = Thr, Pro, Asp, Val or Gly; XB5 = Asn, Pro, His, Thr, Arg, Ser or hydroxyproline; XB6 = Lys, Glu, Asn, Asp, Arg, Gln or Gly; XB7 = His, Tyr, Arg or Ile; XB8 = Arg, Thr, Ile, Ser, Leu, Tyr or Lys; CysB9 = Cys or selenocysteine; XB12 = His, Glu, gamma carboxyglutamate, Asp, or Asn; XB13 = Ile, Leu, Asp, Val or Ala; XB14 = Thr, Ala, Pro, Val or Arg; XB15 = Asn, Asp, Ala, Val, Thr, Pro or Glu; XB16 = Ala, Ser, Gln, His, Tyr, Arg or Gly; XB17 = Tyr or Leu; XB18 = Tyr, Met, Val, Gln, Ile, Asp, Gly, Asn or Leu; XB19 = Leu, Asp, Gln, Gly, Lys, Glu, Arg or Thr; XB20 = Val, Leu or Lys; CysB21 = Cys, amidated Cys, selenocysteine or amidated selenocysteine; XB22 = Tyr, Phe or His; XB23 = Glu, Arg, Ser, Gly or is absent; XB24 = Arg, Asp, Val or is absent; XB25 = Gly, Leu, Val or is absent; XB26 = Phe, Val, Ile or is absent; XB27 = Phe, Asn, Pro, Glu or is absent; XB28 = Tyr, Cys, His or is absent; XB29 = Thr, His, Leu, Tyr or is absent; XB30 = Pro, Glu, Leu, Ile, Arg or is absent; XB31 = Ile, Lys or is absent; XB32 = Thr, Lys, Leu, Gln or is absent; XB33 = Cys or is absent; XB34 = Glu, Pro, Val or is absent; XB35 = Glu, Gly or is absent; XB36 = Glu, Gly or is absent; XB37 = Glu, Val or is absent; XB38 = Ala, Asp or is absent; and XB39 = Ala or is absent (SEQ ID NO: 5).
In some embodiments, the insulin analog comprises a B chain peptide comprising the sequence XB1-XB2-XB3-XB4-XB5-XB6-XB7-XB8-CysB9-Gly-Ser-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23-XB24-XB25-XB26-XB27-XB28-XB29-XB30-XB31-XB32-XB33-XB34-XB35-XB36-XB37-XB38-XB39, wherein XB1 = Thr, Asn, Ser or is absent; XB2 = Phe, Ser, Asn, Thr, Gln or is absent; XB3 = Asp, Gly, Pro, Leu, Phe, or His; XB4 = Thr, Pro, Asp, Val or Gly; XB5 = Asn, Pro, His, Thr, Arg, Ser or hydroxyproline; XB6 = Lys, Glu, Asn, Asp, Arg, Gln or Gly; XB7 = His or Tyr; XB8 = Arg, Thr, Ile, Ser, Leu, Tyr or Lys; CysB9 = Cys or selenocysteine; XB12 = His, Glu, gamma carboxyglutamate, Asp, or Asn; XB13 = Ile, Leu, Asp, Val or Ala; XB14 = Thr, Ala, Pro, Val or Arg; XB15 = Asn, Asp, Ala, Val, Thr, Pro or Glu; XB16 = Ala, Ser, Gln, His, Tyr, Arg or Gly; XB17 = Tyr or Leu; XB18 = Tyr, Met, Val, Gln, Ile, Asp, Gly, Asn or Leu; XB19 = Leu, Asp, Gln, Gly, Lys, Glu, Arg or Thr; XB20 = Val or Leu; CysB21 = Cys, amidated Cys, selenocysteine or amidated selenocysteine; XB22 = Tyr, Phe or His; XB23 = Glu, Arg, Ser, Gly or is absent; XB24 = Arg, Asp, Val or is absent; XB25 = Gly, Leu, Val or is absent; XB26 = Phe, Val, Ile or is absent; XB27 = Phe, Asn, Pro, Glu or is absent; XB28 = Tyr, Cys, His or is absent; XB29 = Thr, His, Leu, Tyr or is absent; XB30 = Pro, Glu, Leu, Ile, Arg or is absent; XB31 = Ile, Lys or is absent; XB32 = Thr, Lys, Leu, Gln or is absent; XB33 = Cys or is absent; XB34 = Glu, Pro, Val or is absent; XB35 = Glu, Gly or is absent; XB36 = Glu, Gly or is absent; XB37 = Glu, Val or is absent; XB38 = Ala, Asp or is absent; and XB39 = Ala or is absent (SEQ ID NO: 6).
In some embodiments, the insulin analog comprises a B chain peptide comprising the sequence XB1-XB2-XB3-XB4-XB5-XB6-XB7-XB8-CysB9-Gly-Ser-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23, wherein XB1 = Thr, Asn or is absent; XB2 = Phe, Ser or is absent; XB3 = Phe or Asp; XB4 = Thr or Val; XB5 = Pro, Asn or hydroxyproline; XB6 = Lys, Asn or Gln; XB7 = His or Tyr; XB8 = Arg, Ile or Leu; CysB9 = Cys or selenocysteine; XB12 = His, Asp, Glu or gamma carboxyglutamate; XB13 = Val, Ile or Leu; XB14 = Thr, Ala, Pro, or Val; XB15 = Glu, Val, Asn or Asp; XB16 = Ser, Gln, Tyr or Ala; XB17 = Tyr or Leu; XB18 = Tyr, Asp, Met or Val; XB19 = Leu, Asp, Gln or Lys; XB20 = Leu or Val; CysB21 = Cys, amidated Cys, selenocysteine or amidated selenocysteine; XB22 = Tyr or Gly; and XB23 = Glu, Arg, Gly or is absent (SEQ ID NO: 7).
In some embodiments, the insulin analog comprises a B chain peptide comprising the sequence XB1-XB2-XB3-XB4-XB5-XB6-His-XB8-CysB9-Gly-Ser-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23, wherein XB1 = Thr or is absent; XB2 = Phe or is absent; XB3 = Phe or Asp; XB4 = Thr or Val; XB5 = Pro, Asn or hydroxyproline; XB6 = Lys or Gln; XB8 = Arg or Leu; XB12 = His, Glu or gamma carboxyglutamate; XB13 = Ile or Leu; XB14 = Thr or Val; XB15 = Glu or Asn; XB16 = Ser or Ala; XB17 = Tyr or Leu; XB18 = Tyr or Met; XB19 = Leu or Asp; XB20 = Leu or Val; XB22 = Tyr; and XB23 = Glu or Arg (SEQ ID NO: 8).
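A position-wise formula such as SEQ ID NO: 8 can be read as a table of allowed residues per position. The following is a minimal sketch of checking a candidate B chain against that table. The three-letter tokens `Hyp` (hydroxyproline), `Gla` (gamma carboxyglutamate) and `-` (absent) are notational assumptions of this sketch, as is treating CysB9 and CysB21 as plain Cys, since SEQ ID NO: 8 does not restate their options. The candidate sequences are illustrative only.

```python
# Allowed residues per position, transcribed from the SEQ ID NO: 8 definition.
SEQ8_ALLOWED = [
    {"Thr", "-"},            # XB1
    {"Phe", "-"},            # XB2
    {"Phe", "Asp"},          # XB3
    {"Thr", "Val"},          # XB4
    {"Pro", "Asn", "Hyp"},   # XB5
    {"Lys", "Gln"},          # XB6
    {"His"},                 # position 7 (fixed)
    {"Arg", "Leu"},          # XB8
    {"Cys"},                 # CysB9 (assumed plain Cys in this sketch)
    {"Gly"},                 # position 10 (fixed)
    {"Ser"},                 # position 11 (fixed)
    {"His", "Glu", "Gla"},   # XB12
    {"Ile", "Leu"},          # XB13
    {"Thr", "Val"},          # XB14
    {"Glu", "Asn"},          # XB15
    {"Ser", "Ala"},          # XB16
    {"Tyr", "Leu"},          # XB17
    {"Tyr", "Met"},          # XB18
    {"Leu", "Asp"},          # XB19
    {"Leu", "Val"},          # XB20
    {"Cys"},                 # CysB21 (assumed plain Cys in this sketch)
    {"Tyr"},                 # XB22
    {"Glu", "Arg"},          # XB23
]


def mismatches(candidate: list[str]) -> list[tuple[int, str]]:
    """Return (1-based position, residue) for every residue the formula does not allow."""
    if len(candidate) != len(SEQ8_ALLOWED):
        raise ValueError("candidate must list all 23 positions, using '-' for absent")
    return [
        (pos, res)
        for pos, (res, allowed) in enumerate(zip(candidate, SEQ8_ALLOWED), start=1)
        if res not in allowed
    ]


# Human B chain lacking its nine C-terminal residues, with Tyr at formula position 22.
with_tyr = "- - Phe Val Asn Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr Leu Val Cys Tyr Glu".split()
# The same chain with the native Gly at formula position 22.
with_gly = "- - Phe Val Asn Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr Leu Val Cys Gly Glu".split()

print(mismatches(with_tyr))
print(mismatches(with_gly))
```

The first candidate satisfies every position, while the second fails only at position 22, where SEQ ID NO: 8 requires Tyr.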
In some embodiments, the residues at positions XB17 and XB22 are tyrosine. In some embodiments, XB22 is Tyr. In some embodiments, XB17 is Tyr.
In some embodiments, the insulin analog comprises an A chain peptide comprising the sequence Gly-XA2-XA3-XA4-XA5-CysA6-CysA7-XA8-XA9-XA10-CysA11-XA12-XA13-XA14-XA15-Phe-XA17-XA18-XA19-CysA20-XA21-XA22-XA23-XA24-XA25-XA26-XA27-XA28-XA29-XA30-XA31-XA32-XA33-XA34, wherein XA2 = Val or Ile; XA3 = Val or Ala; XA4 = Glu, gamma carboxyglutamate or Cys; XA5 = Gln, Glu, gamma carboxyglutamate, His or Val; CysA6, CysA7, and CysA11 are independently Cys or selenocysteine; XA8 = Thr, His, Asp, Gln, Tyr, Lys or Val; XA9 = Ser, Arg, Asn, His or Lys; XA10 = Ile, Pro, Tyr, Ala, Ser, Phe, His or Thr; XA12 = Ser or Thr; XA13 = Leu, Asn, Val or Asp; XA14 = Tyr, Ala, Gln, Asp or Glu; XA15 = Gln, Glu or Thr; or Ala; XA17 = Glu, Lys, Arg, Ile, Met, Thr or Ser; XA18 = Lys, Thr, Asn, Gln or Glu; XA19 = Tyr or Phe; CysA20 = Cys, selenocysteine, amidated Cys, or amidated selenocysteine; XA21 = Asn, Pro, His, Ser, Gly, Ala, or is absent; XA22 = Pro, Asn, Thr, Leu, Ser or is absent; XA23 = Thr, Leu, Val, Ser or is absent; XA24 = Arg, Thr, Met, Gln, Leu or is absent; XA25 = Glu, Gly or is absent; XA26 = Ser, Leu or is absent; XA27 to XA31 are independently Ser or are absent; XA32 = Ala, Ser or is absent; XA33 = Ala, Val or is absent; and XA34 = Ala or is absent (SEQ ID NO: 9).
In some embodiments, the insulin analog comprises an A chain peptide comprising a sequence Gly-XA2-Val-XA4-XA5-CysA6-CysA7-XA8-XA9-XA10-CysA11-Ser-XA13-XA14-XA15-XA16-XA17-XA18-Tyr-CysA20-XA21, wherein XA2 is Val or Ile, XA4 is Glu or gamma carboxyglutamate, XA5 is His or Gln, XA8 is His or Thr, XA9 is Arg or Ser, XA10 is Pro or Ile, XA13 is Asn or Leu, XA14 is Ala or Tyr, XA15 is Glu or Gln, XA16 is Phe or Leu, XA17 is Lys or Glu, XA18 is Lys or Asn and XA21 is Asn or absent (SEQ ID NO: 10); and
a B chain peptide comprising the sequence XB1-XB2-XB3-XB4-XB5-XB6-His-XB8-CysB9-Gly-Ser-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23, wherein XB1 = Thr or is absent; XB2 = Phe or is absent; XB3 = Phe or Asp; XB4 = Thr or Val; XB5 = Pro, Asn or hydroxyproline; XB6 = Lys or Gln; XB8 = Arg or Leu; XB12 = His, Glu or gamma carboxyglutamate; XB13 = Ile or Leu; XB14 = Thr or Val; XB15 = Glu or Asn; XB16 = Ser or Ala; XB17 = Tyr or Leu; XB18 = Tyr or Met; XB19 = Leu or Asp; XB20 = Leu or Val; XB22 = Gly or Tyr; and XB23 = Glu or Arg (SEQ ID NO: 11).
In some embodiments, the insulin analog comprises an A chain peptide comprising a sequence Gly-XA2-Val-XA4-XA5-CysA6-CysA7-XA8-XA9-XA10-CysA11-Ser-XA13-XA14-XA15-Phe-XA17-XA18-Tyr-CysA20-XA21, wherein XA2 is Val or Ile, XA4 is Glu or gamma carboxyglutamate, XA5 is His or Gln, XA8 is His or Thr, XA9 is Arg or Ser, XA10 is Pro or Ile, XA13 is Asn or Leu, XA14 is Ala or Tyr, XA15 is Glu or Gln, XA17 is Lys or Glu, XA18 is Lys or Asn and XA21 is Asn or absent (SEQ ID NO: 12); and a B chain peptide comprising the sequence XB1-XB2-XB3-XB4-XB5-XB6-His-XB8-CysB9-Gly-Ser-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-Tyr-XB23, wherein XB1 = Thr or is absent; XB2 = Phe or is absent; XB3 = Phe or Asp; XB4 = Thr or Val; XB5 = Pro, Asn or hydroxyproline; XB6 = Lys or Gln; XB8 = Arg or Leu; XB12 = His, Glu or gamma carboxyglutamate; XB13 = Ile or Leu; XB14 = Thr or Val; XB15 = Glu or Asn; XB16 = Ser or Ala; XB17 = Tyr or Leu; XB18 = Tyr or Met; XB19 = Leu or Asp; XB20 = Leu or Val; and XB23 = Glu or Arg (SEQ ID NO: 13).
In some embodiments, the insulin analog comprises an A chain peptide comprising a sequence Gly-Val-Val-XA4-XA5-CysA6-CysA7-XA8-XA9-XA10-CysA11-Ser-XA13-XA14-XA15-Phe-XA17-XA18-Tyr-CysA20-XA21, wherein XA4 is Glu or gamma carboxyglutamate, XA5 is His or Gln, XA8 is His or Thr, XA9 is Arg or Ser, XA10 is Pro or Ile, XA13 is Asn or Leu, XA14 is Ala or Tyr, XA15 is Glu or Gln, XA17 is Lys or Glu, XA18 is Lys or Asn and XA21 is Asn or absent (SEQ ID NO: 14); and a B chain peptide comprising the sequence XB1-XB2-XB3-XB4-XB5-XB6-His-Arg-CysB9-Gly-Ser-XB12-Ile-XB14-XB15-XB16-XB17-XB18-XB19-Leu-CysB21-Tyr-XB23, wherein XB1 = Thr or is absent; XB2 = Phe or is absent; XB3 = Phe or Asp; XB4 = Thr or Val; XB5 = Pro, Asn or hydroxyproline; XB6 = Lys or Gln; XB12 = His, Glu or gamma carboxyglutamate; XB14 = Thr or Val; XB15 = Glu or Asn; XB16 = Ser or Ala; XB17 = Tyr or Leu; XB18 = Tyr or Met; XB19 = Leu or Asp; and XB23 = Glu or Arg (SEQ ID NO: 15).
In some embodiments, the insulin analog comprises an A chain peptide comprising a sequence Gly-XA2-Val-XA4-XA5-CysA6-CysA7-XA8-XA9-XA10-CysA11-Ser-XA13-XA14-XA15-Phe-XA17-XA18-Tyr-CysA20-XA21, wherein XA2 is Val or Ile, XA4 is Glu or gamma carboxyglutamate, XA5 is His or Gln, XA8 is His or Thr, XA9 is Arg or Ser, XA10 is Pro or Ile, XA13 is Asn or Leu, XA14 is Ala or Tyr, XA15 is Glu or Gln, XA17 is Lys or Glu, XA18 is Lys or Asn and XA21 is Asn or absent (SEQ ID NO: 26); and a B chain peptide comprising the sequence XB1-XB2-XB3-XB4-XB5-XB6-His-XB8-CysB9-Gly-Ser-XB12-XB13-XB14-XB15-XB16-Tyr-XB18-XB19-XB20-CysB21-XB22-XB23, wherein XB1 = Thr or is absent; XB2 = Phe or is absent; XB3 = Phe or Asp; XB4 = Thr or Val; XB5 = Pro, Asn or hydroxyproline; XB6 = Lys or Gln; XB8 = Arg or Leu; XB12 = His, Glu or gamma carboxyglutamate; XB13 = Ile or Leu; XB14 = Thr or Val; XB15 = Glu or Asn; XB16 = Ser or Ala; XB18 = Tyr or Met; XB19 = Leu or Asp; XB20 = Leu or Val; XB22 = Gly or Tyr; and XB23 = Glu or Arg (SEQ ID NO: 27).
In some embodiments, the insulin analog comprises an A chain peptide comprising a sequence Gly-Val-Val-XA4-XA5-CysA6-CysA7-XA8-XA9-XA10-CysA11-Ser-XA13-XA14-XA15-Phe-XA17-XA18-Tyr-CysA20-XA21, wherein XA4 is Glu or gamma carboxyglutamate, XA5 is His or Gln, XA8 is His or Thr, XA9 is Arg or Ser, XA10 is Pro or Ile, XA13 is Asn or Leu, XA14 is Ala or Tyr, XA15 is Glu or Gln, XA17 is Lys or Glu, XA18 is Lys or Asn and XA21 is Asn or absent (SEQ ID NO: 28); and a B chain peptide comprising the sequence XB1-XB2-XB3-XB4-XB5-XB6-His-Arg-CysB9-Gly-Ser-XB12-Ile-XB14-XB15-XB16-Tyr-XB18-XB19-Leu-CysB21-XB22-XB23, wherein XB1 = Thr or is absent; XB2 = Phe or is absent; XB3 = Phe or Asp; XB4 = Thr or Val; XB5 = Pro, Asn or hydroxyproline; XB6 = Lys or Gln; XB12 = His, Glu or gamma carboxyglutamate; XB14 = Thr or Val; XB15 = Glu or Asn; XB16 = Ser or Ala; XB18 = Tyr or Met; XB19 = Leu or Asp; XB22 = Gly or Tyr; and XB23 = Glu or Arg (SEQ ID NO: 29).
In some embodiments, the insulin analog comprises an A chain peptide comprising the sequence Gly-Ile-Val-Glu-Gln-Cys-Cys-Thr-Ser-Ile-Cys-Ser-Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Asn (SEQ ID NO: 16); and a B chain peptide comprising the sequence Phe-Val-Asn-Gln-His-Leu-Cys-Gly-Ser-His-Leu-Val-Glu-Ala-Xaa-Tyr-Leu-Val-Cys-Gly-Glu, where Xaa is an aromatic residue or large aliphatic residue (SEQ ID NO: 17).
In some embodiments, the insulin analog comprises an A chain peptide comprising the sequence Gly-Ile-Val-Glu-Gln-Cys-Cys-Thr-Ser-Ile-Cys-Ser-Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Asn (SEQ ID NO: 18); and a B chain peptide comprising the sequence Phe-Val-Asn-Gln-His-Leu-Cys-Gly-Ser-His-Leu-Val-Glu-Ala-Leu-Tyr-Leu-Val-Cys-Xaa-Glu, where Xaa is an aromatic residue or large aliphatic residue (SEQ ID NO: 19).
In some embodiments, the insulin analog comprises an A chain peptide comprising the sequence Gly-Ile-Val-Glu-Gln-Cys-Cys-Thr-Ser-Ile-Cys-Ser-Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Asn (SEQ ID NO: 20); and a B chain peptide comprising the sequence Phe-Val-Asn-Gln-His-Leu-Cys-Gly-Ser-His-Leu-Val-Glu-Ala-Xaa-Tyr-Leu-Val-Cys-Xaa-Glu, where Xaa is an aromatic residue or large aliphatic residue (SEQ ID NO: 21).
A number of modifications of human insulin have previously been identified which increase the affinity of the analog for IR. For example, replacing ThrA8 of human insulin with histidine leads to a three-fold increase in affinity for IR (Glendorf et al. (2011)). In some embodiments, the insulin analog has a histidine residue at the position corresponding to ThrA8 of human insulin.
The insulin analogs may comprise one or more unnatural amino acids, modified amino acids or synthetic amino acid analogues, some of which are indicated with the sequences herein. For example, such amino acids include, but are not limited to, the D-isomers of the common amino acids, 2,4-diaminobutyric acid, α-amino isobutyric acid, 4-aminobutyric acid, 2-aminobutyric acid, 6-amino hexanoic acid, 2-amino isobutyric acid, 3-amino propionic acid, ornithine, norleucine, norvaline, hydroxyproline, sarcosine, citrulline, gamma carboxyglutamate, homocitrulline, cysteic acid, t-butylglycine, t-butylalanine, phenylglycine, cyclohexylalanine, cyclopentylalanine, selenocysteine, amidated cysteine, amidated selenocysteine, β-alanine, fluoro-amino acids, designer amino acids such as β-methyl amino acids, Cα-methyl amino acids, Nα-methyl amino acids, and amino acid analogues in general. Also included within the scope are peptides which are differentially modified during or after synthesis, for example, by biotinylation, benzylation, glycosylation, acetylation, phosphorylation, amidation, derivatization by known protecting/blocking groups, proteolytic cleavage, linkage to an antibody molecule or other cellular ligand, etc.
As disclosed herein, a synthetic analogue of Con-Ins G1 containing PTMs was four times more active against the human IR-B than a PTM-free analogue (see examples). In some embodiments, one or more Glu residues can be replaced with gamma-carboxyglutamate (Gla), for example at the XA4 or XA5 Glu positions of various A chains, or at the XB12 Glu position of some B chains. In some embodiments, one or more Pro residues can be replaced with hydroxyproline (Hyp), for example at the XB5 Pro position of certain B chains. In some embodiments, the C-terminal ends can be amidated, such as an amidated Cys (*) at the terminal end of various A chains, among others. In some embodiments, the insulin analog comprises one or more of the following: (i) XA4 = gamma carboxyglutamate; (ii) XB5 = hydroxyproline; and (iii) XB12 = gamma carboxyglutamate. The present inventors have found that at least some of these PTMs contribute to Con-Ins G1 binding to the IR.
The insulin analogs have an A chain peptide and a B chain peptide that are bonded together across at least one pair of cysteine residues. In some embodiments, CysB9 of the B chain peptide is bonded to CysA6 of the A chain peptide, CysB21 of the B chain peptide is bonded to CysA20 of the A chain peptide and/or CysA7 is bonded to CysA11.
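The cysteine pairing can be checked against concrete sequences with a short sketch. It uses the A chain of SEQ ID NO: 16 and the human B chain lacking its nine C-terminal residues, both in one-letter code. The offset of two between formula numbering (CysB9, CysB21) and the residue index of this B chain is an assumption of the sketch, based on XB1 and XB2 being absent; the function names are hypothetical.

```python
A_CHAIN = "GIVEQCCTSICSLYQLENYCN"  # SEQ ID NO: 16 in one-letter code
B_CHAIN = "FVNQHLCGSHLVEALYLVCGE"  # human B chain lacking B22-B30

# Assumption: XB1 and XB2 are absent, so formula position n is residue n - 2.
B_OFFSET = 2

# Bonding pattern described above, as (chain, formula position) pairs.
DISULFIDES = [
    (("B", 9), ("A", 6)),
    (("B", 21), ("A", 20)),
    (("A", 7), ("A", 11)),
]


def residue_at(chain_id: str, formula_pos: int) -> str:
    """Look up the residue at a 1-based formula position in the given chain."""
    if chain_id == "A":
        return A_CHAIN[formula_pos - 1]
    return B_CHAIN[formula_pos - B_OFFSET - 1]


def all_pairs_are_cysteines() -> bool:
    """True if both partners of every listed disulfide are Cys."""
    return all(
        residue_at(*first) == "C" and residue_at(*second) == "C"
        for first, second in DISULFIDES
    )


print(all_pairs_are_cysteines())
```

The check confirms only that a cysteine sits at each listed position; it says nothing about whether the bonds actually form in a folded peptide.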
In some embodiments, the A chain peptide and the B chain peptide can be linked together at one or more terminal ends. In some embodiments the A chain peptide and the B chain peptide are linked together at a terminal end. For example, the N-terminus of the A chain peptide can be linked to the N- or C-terminus of the B chain peptide, the C-terminus of the A chain peptide can be linked to the N- or C-terminus of the B chain peptide, the C-terminus of the A chain peptide can be linked to the N-terminus of the A chain peptide or the C-terminus of the B chain peptide can be linked to the N-terminus of the B chain peptide. In some embodiments, the A chain peptide and the B chain peptide are linked together at both terminal ends. Thus, while in some embodiments the insulin analog is acyclic, in other cases the insulin analogs may be cyclic, while retaining the Cys-bonding pattern described. In such embodiments, the insulin analog has a cyclized backbone such that the A and B chains have no free N- or C-terminus (for the embodiment whereby both terminal ends are linked). The linkage at the one or more terminal ends can be directly between amino acids of the A and B chain peptide backbones, or there can be a linker of one or more amino acids or other linker molecules bonded therebetween.
In some embodiments, chemical groups, residues or groups of residues known to the person skilled in the art to improve stability can be added to the C-terminus and/or N-terminus. In some embodiments, chemical groups, residues or groups of residues known to the person skilled in the art to improve bioavailability can be added to the C-terminus and/or N-terminus. In some embodiments, such residues or groups that can be added to the N-terminus can also replace Gly within the insulin analogs. In some embodiments, fluorescent tags may be attached to either the C- or N-terminus.
Insulin analog peptides as disclosed herein can be made by any technique or method known to the person skilled in the art, and any of such techniques or methods are considered to be within the present scope. For example, techniques include, but are not limited to, chemical synthesis, solid phase peptide synthesis, recombinant expression, a combination of peptide synthesis and recombinant expression, and the like. Example techniques are described further in the Examples section. The insulin analog may be prepared in various forms, for example native, fusions, glycosylated, lipidated, etc.
The insulin analogs are preferably prepared in substantially pure form (i.e. substantially free from host cell proteins or other contaminants). Typically, the insulin analog is substantially pure when it is at least 60%, by weight, of total protein present. For example, the insulin analog is at least 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, more preferably at least 90%, by weight, of total protein present.
The present disclosure also provides salts or derivatives of the insulin analogs. The term "salt", as used herein, denotes acidic and/or basic salts, formed with inorganic or organic acids and/or bases, preferably basic salts. While pharmaceutically acceptable salts are generally preferred, particularly when employing the insulin analogs as medicaments, other salts find utility, for example, in processing these compounds, or where non-medicament-type uses are contemplated. Salts of these compounds may be prepared by any technique known to a person skilled in the art.
The term "pharmaceutically acceptable salt" as used herein, refers to salts of compounds that retain the biological activity of the parent compound, and which are not biologically or otherwise undesirable. Many of the compounds disclosed herein are capable of forming acid and/or base salts by virtue of the presence of amino and/or carboxyl groups or groups similar thereto. Pharmaceutically acceptable base addition salts can be prepared from inorganic and organic bases. Salts derived from inorganic bases include, by way of example only, sodium, potassium, lithium, ammonium, calcium and magnesium salts. Salts derived from organic bases include, but are not limited to, salts of primary, secondary and tertiary amines. Pharmaceutically acceptable acid addition salts may be prepared from inorganic and organic acids. Salts derived from inorganic acids include those of hydrochloric acid, hydrobromic acid, sulfuric acid, nitric acid, phosphoric acid, and the like. Salts derived from organic acids include those of acetic acid, propionic acid, glycolic acid, pyruvic acid, oxalic acid, malic acid, malonic acid, succinic acid, maleic acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid, and the like.
The term "derivative" as used herein includes alpha amino acids wherein one or more side groups found in the naturally occurring alpha-amino acids have been modified. Thus, for example the naturally-occurring amino acids may be replaced with a variety of uncoded or modified amino acids such as the corresponding D-amino acid or N-methyl amino acid. Other modifications are known to the person skilled in the art and include substitution of hydroxyl, thiol, amino and carboxyl functional groups with chemically similar groups, for example substitution of -SH with -SeH in cysteine.
In some embodiments, the insulin analog, salt or derivative thereof is able to bind the IR. Preferably, the IR is the human IR-B receptor. In some embodiments, the IC50 or affinity (Kd) against the human IR-B receptor is less than 10⁻⁴ M, 10⁻⁵ M, 10⁻⁶ M, 10⁻⁷ M, 10⁻⁸ M, 10⁻⁹ M or 10⁻¹⁰ M. Preferably, the IC50 against the human IR-B receptor is less than 10⁻⁶ M.
In some embodiments, the insulin analog, salt or derivative thereof does not bind IGF-IR or binds the IGF-IR weakly. As used herein, "weakly" refers to an insulin analog that does not bind with sufficient affinity to the IGF-IR to result in activation of the IGF-IR and/or cause signal transduction via IGF-IR. In some embodiments, the insulin analog has an IC50 or Kd for IGF-IR of weaker than 10⁻⁹ M, 10⁻⁸ M, 10⁻⁷ M, 10⁻⁶ M, 10⁻⁵ M or 10⁻⁴ M. Preferably, the insulin analog has an affinity (Kd) for IGF-IR of weaker than 100 nM.
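The two preferred thresholds can be combined into a simple screen, sketched below. A "weaker" affinity corresponds to a numerically larger Kd. The function name and the example values are hypothetical and are not measured data from the disclosure.

```python
def meets_preferred_binding_profile(ic50_ir_b_molar: float, kd_igf_ir_molar: float) -> bool:
    """Check the preferred IR-B and IGF-IR thresholds, with both values in mol/L.

    IR-B:   IC50 below 1e-6 M.
    IGF-IR: Kd weaker than 100 nM, i.e. numerically greater than 100e-9 M.
    """
    binds_ir_b = ic50_ir_b_molar < 1e-6
    weak_igf_ir = kd_igf_ir_molar > 100e-9
    return binds_ir_b and weak_igf_ir


# Hypothetical values for illustration only.
print(meets_preferred_binding_profile(5e-9, 2e-6))   # strong IR-B binding, weak IGF-IR binding
print(meets_preferred_binding_profile(5e-9, 1e-9))   # IGF-IR binding too strong
print(meets_preferred_binding_profile(5e-6, 2e-6))   # IR-B binding too weak
```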
In some embodiments, the insulin analog, salt or derivative thereof is predominantly monomeric in solution. In some embodiments, at least 50% of the insulin analog is a monomer, at least 60% of the insulin analog is a monomer, at least 70% of the insulin analog is a monomer, at least 75% of the insulin analog is a monomer, at least 80% of the insulin analog is a monomer, at least 85% of the insulin analog is a monomer, at least 90% of the insulin analog is a monomer, at least 95% of the insulin analog is a monomer, at least 98% of the insulin analog is a monomer, at least 99% of the insulin analog is a monomer or approximately 100% of the insulin analog is a monomer. In some embodiments, the insulin analog is monomeric. In some embodiments, the insulin analog may be at least partially monomeric and dissociate into monomeric form upon administration to a subject. In some embodiments, the insulin analog is monomeric or dissociates into a monomeric form in a subject's bloodstream.
In some embodiments, the insulin analog, salt or derivative thereof is a rapid acting insulin analog. In some embodiments, the insulin analog, salt or derivative thereof has increased bioavailability when administered to a human compared to human insulin. In some embodiments, the insulin analog, salt or derivative thereof has a peak bioavailability within 10 minutes to 6 hours of administration to a human. In some embodiments, the maximum plasma concentration of the insulin analog after administration occurs earlier than the maximum plasma concentration of human insulin after administration. For example, the peak availability of the insulin analog, salt or derivative thereof occurs within 10 minutes to 4 hours of administration, within 15 minutes to 3 hours of administration, within 30 minutes to 1 hour of administration or within 40 to 55 minutes of administration. In some embodiments, the insulin analog, salt or derivative thereof has an onset of activity within 2 minutes, 5 minutes, 10 minutes, 15 minutes, 20 minutes or 30 minutes of administration. Preferably, the onset of activity is within 10 to 30 minutes of administration.
Insulin analogs of the present invention also include those designed or identified using a method of the invention and those which are capable of recognising and binding to a target binding site.
Target binding sites include physiological binding partners of insulin, such as the IR, as well as regions of physiological binding partners. For example, a target binding site may be a short polypeptide defining an epitope (e.g. corresponding to a loop structure identified below as a target binding site) or a mimetic, e.g. a peptidomimetic, mimicking a loop structure.
In some embodiments, the target binding site is the insulin receptor, preferably human insulin receptor. In some embodiments, the target binding site is a region of IR involved in insulin docking to the receptor. In some embodiments, the region of the IR includes low affinity target binding sites comprising one or more of the following: the L1 domain, the CT peptide and the CR domain of the IR ectodomain. With regard to the L1 domain, the target binding site preferably comprises portions of the molecular surface of the central β-sheet of L1 and portions of the molecular surface of the second LRR which contain Phe39 or the loop in the fourth LRR rung of L1, or preferably both. With regard to the CR domain, the target binding site preferably comprises module 6 of the CR domain.
In some embodiments, the low affinity target binding site may comprise one or more amino acids from one or more of the following amino acid sequences: (i) amino acids 1-156; (ii) amino acids 704-719; and (iii) amino acids 157-310.
With regard to amino acids 1-156, the target binding site preferably comprises at least one amino acid from the amino acid sequence 1-68, preferably 1-55, and more preferably amino acid sequence 27-55. The target binding site preferably comprises at least one amino acid selected from Arg14, Asn15, Gln34, Leu36, Leu37, Phe39, Pro43-Phe46, Phe64, Leu87, Phe88, Asn90 and Phe89, more preferably at least one amino acid selected from Arg14, Asn15, Gln34, Leu37, Phe39, Pro43-Phe46, Phe64, yet more preferably at least one amino acid selected from Phe39 and Pro43-Phe46, and most preferably at least Phe39.
With regard to amino acids 157-310, the target binding site preferably comprises at least one amino acid from the amino acid sequence 192-310, more preferably at least one amino acid from the sequence 227-303, yet more preferably at least one amino acid selected from the sequence 259-284.
In another aspect, the present invention also provides peptides comprising an insulin A chain peptide and an insulin B chain peptide, wherein the B chain peptide comprises a substitution at amino acid 10 and amino acid 20. Disclosed are peptides comprising an A chain peptide and a B chain peptide, wherein the B chain peptide comprises a substitution at amino acid 10 and amino acid 20 compared to wild type human insulin. In some instances, any conservative amino acid substitution can be present at positions 10, 20, or both positions. For example, another hydrophilic amino acid, polar amino acid, or aliphatic amino acid could be substituted at one or both positions.
In some instances of the disclosed peptides, the substitution at amino acid 20 of the B chain peptide can be G20Y, G20F, or G20P. In some instances, the substitution at amino acid 20 is G20Y. In some instances, the substitution at amino acid 20 can be G20P and the peptide further comprises a substitution at amino acid 21, wherein the substitution at amino acid 21 can be G21H. In some instances, the amino acid substitution can be any conservative substitution from glycine.
In some instances of the disclosed peptides, the substitution at amino acid 10 of the B chain peptide can be H10E, H10D or H10Q. In some instances, the substitution at amino acid 10 is H10E. In some instances, the amino acid substitution can be any conservative substitution from histidine.
In some instances, both the insulin A chain peptide and the B chain peptide can contain substitutions compared to wild type insulin. Disclosed are peptides comprising an insulin A chain peptide and an insulin B chain peptide, wherein the B chain peptide comprises a substitution at amino acid 10 and amino acid 20 and further comprising at least one substitution in the A chain peptide. In some instances, the at least one substitution can be found at position 8 or 9. In some instances, the at least one substitution in the A chain peptide can be T8H, T8Y, T8K, or S9R. In some instances, any conservative amino acid substitution can be present at position 8 or 9 or both positions. For example, another hydrophilic amino acid could be substituted or other polar amino acids could be substituted.
Disclosed are peptides comprising an insulin A chain peptide and an insulin B chain peptide, wherein the B chain peptide comprises a substitution at amino acid 10 and amino acid 20 and further comprising at least two substitutions in the A chain peptide. In some instances, the at least two substitutions can be found at positions 8 and 9. In some instances, the at least two substitutions in the A chain peptide can be selected from: T8H, T8Y, T8K, and S9R. In some instances, any conservative amino acid substitution can be present at position 8 or 9 or both positions. For example, another hydrophilic amino acid could be substituted or other polar amino acids could be substituted at one or both positions.
In some instances, the B chain peptide is lacking one or more, up to eight, of the C-terminal amino acids compared to wild type. Thus, the disclosed peptides can be des-octapeptide insulin peptides (missing the last 8 amino acids of the C-terminus of the human insulin B chain). For example, in some instances the disclosed peptides can have a B chain peptide that comprises the sequence of:
FVNQHLCGSELVEALYLVCYER (SEQ ID NO: 30),
FVNQHLCGSELVEALYLVCFER (SEQ ID NO: 31),
FVNQHLCGSELVEALYLVCPER (SEQ ID NO: 32),
FVNQHLCGSDLVEALYLVCYER (SEQ ID NO: 33),
FVNQHLCGSDLVEALYLVCFER (SEQ ID NO: 34),
FVNQHLCGSDLVEALYLVCPER (SEQ ID NO: 35),
FVNQHLCGSQLVEALYLVCYER (SEQ ID NO: 36),
FVNQHLCGSQLVEALYLVCFER (SEQ ID NO: 37), or
FVNQHLCGSQLVEALYLVCPER (SEQ ID NO: 38).
In some instances, the disclosed peptides can have an A chain comprising the sequence of GIVEQCCHRICSLYQLENYCN (SEQ ID NO: 39),
GIVEQCCYRICSLYQLENYCN (SEQ ID NO: 40), or
GIVEQCCKRICSLYQLENYCN (SEQ ID NO: 41).
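By way of illustration only, the substitution notation used above (for example H10E, G20Y, T8H, S9R) can be sketched in a few lines of Python. The wild type human insulin A and B chain sequences are the standard ones; the helper name `apply_substitutions` is hypothetical and is not part of the disclosure. The sketch simply checks that applying the named substitutions to the wild type chains reproduces two of the sequences listed above.

```python
# Wild type human insulin chains in one-letter code.
WT_A = "GIVEQCCTSICSLYQLENYCN"
WT_B = "FVNQHLCGSHLVEALYLVCGERGFFYTPKT"


def apply_substitutions(seq, subs):
    """Apply substitutions written like 'H10E' (1-based positions) to seq."""
    residues = list(seq)
    for sub in subs:
        old, pos, new = sub[0], int(sub[1:-1]), sub[-1]
        if residues[pos - 1] != old:
            raise ValueError(f"{sub}: found {residues[pos - 1]} at position {pos}")
        residues[pos - 1] = new
    return "".join(residues)


# des-octapeptide B chain: remove the last 8 C-terminal residues.
DES_OCTA_B = WT_B[:-8]

b_chain = apply_substitutions(DES_OCTA_B, ["H10E", "G20Y"])  # compare with SEQ ID NO: 30
a_chain = apply_substitutions(WT_A, ["T8H", "S9R"])          # compare with SEQ ID NO: 39
```

The position check inside the helper raises an error if a substitution names a wild type residue that is not actually present at that position, which is a convenient guard against numbering mistakes.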
In some instances of the disclosed peptides, the A chain peptide and B chain peptide can be bonded via at least one disulfide bond. In some instances, the A chain peptide and B chain peptide can be bonded via at least two disulfide bonds.
In some instances, the disclosed peptides are monomers. In other words, in some instances, the disclosed peptides are less likely to form dimers, tetramers, hexamers, etc.
In some instances of the disclosed peptides, the insulin A chain peptide can be at least 70% identical to wild type human insulin A chain peptide. In some instances, the insulin A chain peptide can be at least 60, 65, 70, 75, 80, 85, 90, 95, 99% identical to wild type human insulin A chain peptide. In some instances, the percent identity can be reached by the deletion of one or more amino acids from the N-terminus or C-terminus end of the disclosed peptides.
In some instances of the disclosed peptides, the insulin B chain peptide can be at least 70% identical to wild type human insulin B chain peptide. In some instances, the insulin B chain peptide can be at least 60, 65, 70, 75, 80, 85, 90, 95, 99% identical to wild type human insulin B chain peptide. In some instances, the percent identity can be reached by the deletion of one or more amino acids from the N-terminus or C-terminus end of the disclosed peptides.
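As a rough illustration of how a percent identity figure might be obtained, the following sketch counts matching positions without gaps and divides by the length of the wild type reference. This particular definition, and the function name `percent_identity`, are assumptions made for the example; the text above does not prescribe an alignment method or a denominator.

```python
WT_A = "GIVEQCCTSICSLYQLENYCN"
WT_B = "FVNQHLCGSHLVEALYLVCGERGFFYTPKT"


def percent_identity(candidate, reference):
    """Ungapped, position-by-position identity relative to the reference length.

    Residues are compared from the N-terminus; positions missing from a
    shorter candidate (e.g. a C-terminal truncation) count as non-identical.
    """
    matches = sum(1 for c, r in zip(candidate, reference) if c == r)
    return 100.0 * matches / len(reference)


# A chain with T8H and S9R: 19 of 21 positions match wild type.
identity_a = percent_identity("GIVEQCCHRICSLYQLENYCN", WT_A)

# des-octapeptide B chain with H10E and G20Y: 20 of 30 positions match wild type.
identity_b = percent_identity("FVNQHLCGSELVEALYLVCYER", WT_B)
```

Under this assumed definition, the C-terminal deletion lowers the computed identity of the B chain, consistent with the statement that a given percent identity can be reached by terminal deletions.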
In some instances, the disclosed peptides can comprise one or more unnatural amino acids, modified amino acids or synthetic amino acid analogues. Such amino acids include, but are not limited to, the D-isomers of the common amino acids, 2,4-diaminobutyric acid, α-amino isobutyric acid, 4-aminobutyric acid, 2-aminobutyric acid, 6-amino hexanoic acid, 2-amino isobutyric acid, 3-amino propionic acid, ornithine, norleucine, norvaline, hydroxyproline, sarcosine, citrulline, homocitrulline, cysteic acid, t-butylglycine, t-butylalanine, phenylglycine, cyclohexylalanine, cyclopentylalanine, β-alanine, fluoro-amino acids, designer amino acids such as β-methyl amino acids, Cα-methyl amino acids, Nα-methyl amino acids, and amino acid analogues in general. Also included within the scope are peptides which are differentially modified during or after synthesis, for example, by biotinylation, benzylation, glycosylation, acetylation, phosphorylation, amidation, derivatization by known protecting/blocking groups, proteolytic cleavage, linkage to an antibody molecule or other cellular ligand, etc. These modifications may serve to increase the stability and/or bioactivity of the peptide.
In further instances of the disclosed peptides, provided are therapeutic proteins having an A chain peptide bonded to a B chain peptide via at least one disulfide bond, wherein the A chain comprises the sequence of GIVEQCCHRICSLYQLENYCN (SEQ ID NO: 39), and wherein the B chain peptide comprises the sequence of FVNQHLCGSELVEALYLVCYER (SEQ ID NO: 30).
It is appreciated that the disclosed therapeutic proteins can be employed in pharmaceutical compositions and used in connection with treatment of disorders including diabetes.
Methods for the Design of Insulin Analogs
The three-dimensional structure of venom insulin provided by the present invention may be used to design insulin analogs (also referred to herein as IR agonists, molecules and compounds), particularly rapid acting insulin analogs. In one aspect, there is provided the use of the structure of Con-Ins G1 as defined by the atomic coordinates of Appendix I as a structural model. In some embodiments, the structural model is used for identification of insulin analogs.
In some embodiments, a method of identifying, designing or screening for a compound that can potentially interact with IR is provided. The method comprises performing structure-based identification, design or screening of a compound based on the compound's interactions with an IR structure defined by the three-dimensional structure of Con-Ins G1, or a subset thereof.
The present invention is also useful for improving the properties of known IR binding molecules. For example, known IR binding molecules can be screened against the 3D structure of Con-Ins G1 defined by the atomic coordinates of Appendix I or a portion thereof, and an assessment made of the ability to self-associate and the potential to interact with IR. In view of this assessment the known IR binding molecule could be redesigned (i.e. chemically modified) so as to impart one or more of the following properties: (i) reduce self-association, (ii) improve its affinity for the low affinity binding site of IR (i.e. the binding site governing selectivity), (iii) improve its affinity for the high affinity binding site for IR (i.e. the binding site governing signal transduction) and (iv) lower its affinity for binding to IGF-1R.
In some embodiments, a method of redesigning or modifying a polypeptide which is known to bind to IR, or a region of the IR, is provided. The method comprises performing structure-based evaluation of a structure defined by the atomic coordinates of Appendix I or a subset thereof, and redesigning or chemically modifying the polypeptide as a result of the evaluation. In some embodiments, structure-based evaluation comprises comparison of the structure defined by the atomic coordinates of Appendix I or a subset thereof, with the atomic coordinates of insulin or a subset thereof. In some embodiments, structure-based evaluation further comprises molecular modelling of a complex formed between the structure defined by the atomic coordinates of Appendix I or a subset thereof with the atomic coordinates of an insulin receptor or a subset thereof. In some embodiments, the model is defined by the atomic coordinates of Appendix II or a subset thereof.
In some embodiments, the method further comprises synthesising or obtaining the redesigned or chemically modified polypeptide and testing for its ability to bind IR. In some embodiments, the ability of the redesigned or chemically modified polypeptide to modulate IR activation is determined. In some embodiments, the ability of the redesigned or chemically modified polypeptide to lower blood glucose levels may be determined.
In some embodiments, the polypeptide which is known to bind to IR is an insulin-like growth factor (IGF). Suitable IGFs include those from humans, pigs, cattle, birds, mice and the like. In some embodiments the IGF is human IGF-I or IGF-II.
In a preferred embodiment, the polypeptide which is known to bind to IR is insulin. Suitable insulin includes human insulin, porcine insulin, bovine insulin, ovine insulin, murine insulin, guinea pig insulin and the like. In some embodiments, the insulin is human insulin.
As used herein, the term "modelling" includes the quantitative and qualitative analysis of molecular structure and/or function based on atomic structural information and interaction models. The term "modelling" includes conventional numeric-based molecular dynamic and energy minimisation models, interactive computer graphic models, modified molecular mechanics models, distance geometry and other structure-based constraint models.
Molecular modelling techniques can be applied to the atomic coordinates of Con-Ins G1 or a region thereof to derive a range of 3D models and to investigate the structure of binding sites, such as the binding site of IR and other protein targets.
A region of Con-Ins G1 as referred to herein may be defined by a single amino acid (or side-chain thereof), by a continuous amino acid sequence or by two or more separate amino acids and/or stretches of amino acids. Such separate amino acids and/or stretches of amino acids may exist in close spatial proximity to one another in the three dimensional structure or may have the potential to be brought into close spatial proximity, for example, upon the binding of a suitable ligand. Suitably, regions of Con-Ins G1 comprise amino acid sequences involved in the binding of IR, both the initial selective low affinity binding and the subsequent high affinity binding to the other monomer in the IR dimer.
These techniques may also be used to screen for or design small and large chemical entities which are capable of binding IR and modulating the activity of IR. The screen may employ a solid 3D screening system or a computational screening system.
In some embodiments, such modelling methods are used to design or select chemical entities (for example a polypeptide, peptide, peptidomimetic, compound and the like) that possess stereochemical complementarity to particular regions of IR. By "stereochemical complementarity" we mean that the compound or a portion thereof makes a sufficient number of energetically favourable contacts with the receptor as to have a net reduction of free energy on binding to the receptor.
It will be appreciated that it is not necessary that the complementarity between chemical entities and the receptor site extend over all residues of the receptor site in order to inhibit binding of a molecule or complex that naturally interacts with IR ectodomain.
A number of methods may be used to identify chemical entities possessing stereo-complementarity to a region of the IR. For instance, the process may begin by visual inspection of Con-Ins G1 structure on a computer screen based on the atomic coordinates of Con-Ins G1, or region thereof, in Appendix I generated from the machine-readable storage medium. In some embodiments, the process may begin by molecular modelling of a complex formed between the structure defined by the atomic coordinates of Appendix I or a subset thereof with the atomic coordinates of an insulin receptor or a subset thereof. Modelling software that is well known and available in the art may be used, for example MODELLER (v9.15) (Webb and Sali, 2014). This modelling step may be followed by energy minimization with standard molecular mechanics force fields such as CHARMM (Guvench et al. 2011; Best et al. 2012). Modelling and energy minimization may be followed by molecular dynamics simulations using software known in the art, for example NAMD (Phillips et al. 2005). In addition, there are a number of more specialized computer programs to assist in the process of selecting the binding moieties of this invention.
There is also provided a polypeptide or salt or analog thereof which has been redesigned or modified by the method of redesigning or modifying a polypeptide defined herein. Preferably, the redesigned or modified polypeptide is monomeric, or dissociates to a monomer when administered to a subject.
Preferred regions of the IR are those governing specificity, for example those described as target binding sites above.
In another aspect, there is provided an isolated molecule which is an IR agonist wherein the molecule is identified and/or designed based on the 3D structure of Con-Ins G1 defined by the atomic coordinates of Appendix I or a subset thereof. Suitable molecules include peptides, polypeptides or peptidomimetics.
The term "peptidomimetic", as used herein, is a molecule that mimics the biological activity of a peptide but is no longer completely peptidic in chemical nature. By strict definition, a peptidomimetic is a molecule that no longer contains any peptide bonds (that is, amide bonds between amino acids). However, the term peptide mimetic can be used to describe molecules that are no longer completely peptidic in nature, such as pseudo-peptides, semi-peptides and peptoids. Whether completely or partially non-peptide, peptidomimetics of the invention provide a spatial arrangement of reactive chemical moieties that closely resembles the three-dimensional arrangement of active groups in the peptide on which the peptidomimetic is based. As a result of this similar active-site geometry, the peptidomimetic has effects on biological systems which are similar to the biological activity of the peptide.
Suitable peptidomimetics based on venom insulin can be developed using readily available techniques and/or the methods described herein. Thus, for example, peptide bonds can be replaced by non-peptide bonds that allow the peptidomimetic to adopt a similar structure, and therefore biological activity, to the original peptide. Further modifications can also be made by replacing chemical groups of the amino acids with other chemical groups of similar structure. The development of peptidomimetics derived from venom insulin can be aided by reference to the three dimensional structure of these residues as provided in Appendix I or a subset thereof. This structural information can be used to search three-dimensional databases to identify molecules having a similar structure, using programs such as MACCS-3D and ISIS/3D (Molecular Design Ltd., San Leandro, CA), ChemDBS-3D (Chemical Design Ltd., Oxford, U.K.), and Sybyl/3DB Unity (Tripos Associates, St. Louis, MO).
Those skilled in the art will recognize that the design of a peptidomimetic may require slight structural alteration or adjustment of a chemical structure designed or identified using the methods of the invention. In general, mimetics identified or designed using the methods of the invention can be synthesized chemically and then tested for ability to modulate insulin receptor activity using any of the methods described herein. The methods of the invention are particularly useful because they can be used to greatly decrease the number of potential mimetics which must be screened for their ability to modulate insulin receptor activity.
In some embodiments, the isolated molecule is able to bind the IR. Preferably, the IR is the human IR-B receptor. In some embodiments, the IC50 or Kd against the human IR-B receptor is stronger than 10⁻⁴ M, 10⁻⁵ M, 10⁻⁶ M, 10⁻⁷ M, 10⁻⁸ M, 10⁻⁹ M or 10⁻¹⁰ M. Preferably, the IC50 or Kd against the human IR-B receptor is stronger than 10⁻⁶ M. In some embodiments, the isolated molecule does not bind IGF-IR or binds IGF-IR weakly. Preferably, the isolated molecule has an affinity (Kd) for IGF-IR of weaker than 100 nM.
In some embodiments, the isolated molecule is predominantly monomeric in solution. In some embodiments, at least 50% of the isolated molecule is a monomer, at least 60% of the isolated molecule is a monomer, at least 70% of the isolated molecule is a monomer, at least 75% of the isolated molecule is a monomer, at least 80% of the isolated molecule is a monomer, at least 85% of the isolated molecule is a monomer, at least 90% of the isolated molecule is a monomer, at least 95% of the isolated molecule is a monomer, at least 98% of the isolated molecule is a monomer, at least 99% of the isolated molecule is a monomer or approximately 100% of the isolated molecule is a monomer. In some embodiments, the isolated molecule may be at least partially monomeric and dissociate into monomeric form upon administration to a subject. In some embodiments, the isolated molecule is monomeric or dissociates into a monomeric form in a subject's bloodstream.
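The preferred binding profile described above can be expressed as a simple check. In the sketch below, "stronger than" is read as a numerically smaller Kd and "weaker than" as a numerically larger Kd; the function name and argument names are hypothetical, and the default cut-offs are merely the preferred values recited above.

```python
def meets_binding_profile(kd_ir_molar, kd_igf1r_molar,
                          ir_cutoff=1e-6, igf1r_cutoff=100e-9):
    """Return True if IR-B binding is stronger than ir_cutoff (smaller Kd)
    and IGF-IR binding is weaker than igf1r_cutoff (larger Kd).
    All values are in mol/L."""
    return kd_ir_molar < ir_cutoff and kd_igf1r_molar > igf1r_cutoff


# Hypothetical values chosen only to exercise the check, not measured data.
selective = meets_binding_profile(kd_ir_molar=5e-9, kd_igf1r_molar=1e-6)
cross_reactive = meets_binding_profile(kd_ir_molar=5e-9, kd_igf1r_molar=10e-9)
```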
The present invention is also useful in the identification and/or design of insulin analogs which do not bind or only bind weakly to IGF-IR. For example, insulin analogs identified using the methods of this invention can be screened in silico, in vitro and/or in vivo for their ability to bind the IGF-IR. Any insulin analogs found or suspected to bind to IGF-IR can be redesigned so as to be more selective for IR.
In some embodiments, an insulin analog (which includes an isolated molecule, compound or IR agonist) identified by the methods herein does not bind IGF-IR or binds the IGF-IR weakly. In some embodiments, the insulin analog has an IC50 or Kd for IGF-IR of weaker than 10⁻⁹ M, 10⁻⁸ M, 10⁻⁷ M, 10⁻⁶ M, 10⁻⁵ M or 10⁻⁴ M. Preferably, the insulin analog has an affinity (Kd) for IGF-IR of weaker than 100 nM.
The present disclosure also provides a method of identifying a compound which binds IR, the method comprising:
i) generating a three-dimensional structure model of a polypeptide having
a) a structure defined by the atomic coordinates of Appendix I or a subset thereof, or
b) a structure having a root mean square deviation less than about 2.0 Å when superimposed on the corresponding backbone atoms of a), and
ii) designing or screening for a compound which potentially binds the IR.
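The root mean square deviation criterion in b) can be sketched as follows. The example assumes the two sets of backbone atom coordinates have already been superimposed (for instance by a least-squares fit performed in separate software) and are listed in corresponding order; the function name `backbone_rmsd` and the toy coordinates are illustrative assumptions, not values from Appendix I.

```python
import math


def backbone_rmsd(coords_a, coords_b):
    """RMSD in Å between two equal-length lists of (x, y, z) positions.

    The coordinate sets are assumed to be already superimposed and to list
    corresponding backbone atoms in the same order.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


# Toy coordinates: every atom of the second set is displaced by 1 Å along y.
reference = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 0.0, 0.0)]
candidate = [(0.0, 1.0, 0.0), (1.5, 1.0, 0.0), (3.0, 1.0, 0.0)]

rmsd = backbone_rmsd(reference, candidate)
within_threshold = rmsd < 2.0  # the "less than about 2.0 Å" criterion
```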
In some embodiments, generating a three-dimensional structure model comprises generating a model of the polypeptide bound to IR or regions thereof. Preferred regions of the IR are those governing specificity, for example those described as target binding sites above. In some embodiments, the model is defined by the atomic coordinates of Appendix II or a subset thereof.
The model may be adaptive in a sense that it allows for slight surface changes to improve the fit between the candidate compound and the protein, e.g. by small movements in side chains or main chain.
In some embodiments, the methods further comprise synthesising the compound which potentially binds the IR. In some embodiments, the compound modulates at least one biological activity of IR. In some embodiments, the method may further comprise testing the compound designed or screened for in ii) for its ability to modulate at least one biological activity of IR. For example, the method may further comprise testing the compound designed or screened for in ii) for its ability to modulate blood glucose levels. In some embodiments, steps i) and ii) are performed in silico.
The present invention also provides a computer-based method of identifying a compound which mimics insulin activity, the method comprising
i) generating a three-dimensional structure model of a polypeptide having
a) a structure defined by the atomic coordinates of Appendix I or a subset thereof, or
b) a structure having a root mean square deviation less than about 2.0 Å when superimposed on the corresponding backbone atoms of a), and
ii) designing or screening for a compound which mimics insulin activity.
In some embodiments, generating a three-dimensional structure model comprises generating a model of the polypeptide bound to IR or regions thereof. In some embodiments, the model is defined by the atomic coordinates of Appendix II or a subset thereof. Preferred regions of the IR are those governing specificity, for example those described as target binding sites above.
In some embodiments, the methods further comprise synthesising the compound which potentially binds the IR. In some embodiments, the compound modulates at least one biological activity of IR. In some embodiments, the method may further comprise testing the compound designed or screened for in ii) for its ability to modulate at least one biological activity of IR. For example, the method may further comprise testing the compound designed or screened for in ii) for its ability to modulate blood glucose levels. In some embodiments, steps i) and ii) are performed in silico.
In some embodiments, the compounds identified by the methods of the present invention are able to bind the IR. Preferably, the IR is the human IR-B receptor. In some embodiments, the IC50 against the human IR-B receptor is less than 10⁻⁴ M, 10⁻⁵ M, 10⁻⁶ M, 10⁻⁷ M, 10⁻⁸ M, 10⁻⁹ M or 10⁻¹⁰ M. Preferably, the IC50 against the human IR-B receptor is less than 10⁻⁶ M.
In some embodiments, the compounds identified by the methods of the present invention do not bind IGF-IR or bind the IGF-IR weakly. In some embodiments, the insulin analog has an IC50 or Kd for IGF-IR of weaker than 10⁻⁹ M, 10⁻⁸ M, 10⁻⁷ M, 10⁻⁶ M, 10⁻⁵ M or 10⁻⁴ M. Preferably, the insulin analog has an affinity (Kd) for IGF-IR of weaker than 100 nM.
In some embodiments, the compounds identified by the methods of the present invention are predominantly monomeric in solution. In some embodiments, at least 50% of the compound is a monomer, at least 60% of the compound is a monomer, at least 70% of the compound is a monomer, at least 75% of the compound is a monomer, at least 80% of the compound is a monomer, at least 85% of the compound is a monomer, at least 90% of the compound is a monomer, at least 95% of the compound is a monomer, at least 98% of the compound is a monomer, at least 99% of the compound is a monomer or approximately 100% of the compound is a monomer. In some embodiments, the compound may be at least partially monomeric and dissociate into monomeric form upon administration to a subject. In some embodiments, the compound is monomeric or dissociates into a monomeric form in a subject's bloodstream.
As will be readily understood by those skilled in this field, the methods of the present invention provide a rational method for designing and selecting insulin analog proteins which interact with the insulin receptor. In some cases these proteins may require further development in order to increase activity. Such further development is routine in this field and will be assisted by the structural information provided in this application. It is intended that in particular embodiments the methods of the present invention include such further developmental steps.
Once an insulin analog has been designed or selected by the above methods, the efficiency with which that insulin analog may bind to a target such as the IR can be tested and optimised by computational evaluation. For example, an insulin analog that has been designed or selected to bind the IR must also preferably traverse a volume not overlapping that occupied by the binding site when it is bound to the native IR. An insulin analog designed or selected as binding to IR may be further computationally optimised so that in its bound state it would preferably lack repulsive electrostatic interaction with the target protein. Such non-complementary (e.g., electrostatic) interactions include repulsive charge-charge, dipole-dipole and charge-dipole interactions. Specifically, the sum of all electrostatic interactions between the compound and the insulin analog when the insulin analog is bound to IR preferably makes a neutral or favourable contribution to the enthalpy of binding.
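The idea of summing electrostatic interactions can be illustrated with a minimal pairwise Coulomb sum over point charges. This is only a sketch under stated assumptions: partial charges in units of the elementary charge, distances in Å, a uniform dielectric constant, and a conversion factor of approximately 332.06 kcal·Å/(mol·e²). The function name and the example charges are hypothetical; dedicated packages such as those named below use far more complete energy functions.

```python
import math

# Approximate conversion factor so that q1*q2/r (charges in e, r in Å)
# gives an energy in kcal/mol. Force fields use slightly different values.
COULOMB_KCAL = 332.06


def electrostatic_energy(atoms_a, atoms_b, dielectric=1.0):
    """Sum pairwise Coulomb energies between two sets of point charges.

    Each atom is a (charge, (x, y, z)) tuple. A negative total indicates a
    net attractive (favourable) interaction; a positive total is repulsive.
    """
    total = 0.0
    for charge_a, pos_a in atoms_a:
        for charge_b, pos_b in atoms_b:
            distance = math.dist(pos_a, pos_b)
            total += COULOMB_KCAL * charge_a * charge_b / (dielectric * distance)
    return total


# Hypothetical single-charge example: opposite charges 4 Å apart.
analog_atoms = [(+1.0, (0.0, 0.0, 0.0))]
receptor_atoms = [(-1.0, (0.0, 0.0, 4.0))]

energy = electrostatic_energy(analog_atoms, receptor_atoms)
favourable = energy <= 0.0
```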
Once an insulin analog has been optimally selected or designed, as described above, substitutions may then be made in some of its atoms or side groups to improve or modify its binding properties. Generally, initial substitutions are conservative, i.e., the replacement group will have approximately the same size, shape, hydrophobicity and charge as the original group. It should, of course, be understood that components known in the art to alter conformation should be avoided. Such substituted insulin analogs may then be analysed for efficiency of fit to IR by the same computer methods described in detail above.
Specific computer software is available in the art to evaluate compound deformation energy and electrostatic interaction. Examples of programs designed for such uses include: Gaussian 92, revision C (Frisch, Gaussian, Inc., Pittsburgh, PA); AMBER, version 4.0 (Kollman, University of California at San Francisco); QUANTA/CHARMM (Molecular Simulations, Inc., Burlington, MA); and Insight II/Discover (Biosym Technologies Inc., San Diego, CA).
The present invention encompasses insulin analogs (including IR agonists, molecules, compounds and the like) identified using a method described herein. Some embodiments also relate to pharmaceutical compositions comprising the insulin analogs (including IR agonists, molecules, compounds and the like) identified using a method described herein.
Screening Assays and Confirmation of Binding and Biological Activity
Insulin analogs (which includes compounds and molecules identified using the methods of the present disclosure) of the present invention are preferably assessed by a number of in vitro and in vivo assays of IR and/or IGF-1R function to confirm their ability to interact with and modulate IR and/or IGF-1R activity. For example, compounds may be tested for their ability to bind to IR and/or IGF-1R and/or for their ability to modulate e.g. activate or disrupt IR and/or IGF-1R signal transduction.
Where the screening assay is a binding assay, IR or IGF-1R may be joined to a label, where the label can directly or indirectly provide a detectable signal. Various labels include radioisotopes, fluorescent molecules, chemiluminescent molecules, enzymes, specific binding molecules, particles, e.g., magnetic particles, and the like. Specific binding molecules include pairs, such as biotin and streptavidin, digoxin and antidigoxin, etc. For the specific binding members, the complementary member would normally be labelled with a molecule that provides for detection, in accordance with known procedures.
A variety of other reagents may be included in the screening assay. These include reagents like salts, neutral proteins, e.g., albumin, detergents, etc., which are used to facilitate optimal protein-protein binding and/or reduce non-specific or background interactions. Reagents that improve the efficiency of the assay, such as protease inhibitors, nuclease inhibitors, antimicrobial agents, etc., may be used. The components are added in any order that produces the requisite binding. Incubations are performed at any temperature that facilitates optimal activity, typically between 4 and 40 °C.
Direct binding of compounds to IR or IGF-1R can also be done by Surface Plasmon Resonance (BIAcore) (reviewed in Morton and Myszka, 1998). Here the receptor is immobilized on a CM5 or other sensor chip by either direct chemical coupling using amine or thiol-disulphide exchange coupling (Nice and Catimel, 1999) or by capturing the receptor ectodomain as an Fc fusion protein to an appropriately derivatised sensor surface (Morton and Myszka, 1998). The potential insulin analog (called an analyte) is passed over the sensor surface at an appropriate flow rate and a range of concentrations. The classical method of analysis is to collect responses for a wide range of analyte concentrations. A range of concentrations provides sufficient information about the reaction, and by using a fitting algorithm such as CLAMP (see Morton and Myszka, 1998), rate constants can be determined (Morton and Myszka, 1998; Nice and Catimel, 1999). Normally, the ligand surface is regenerated at the end of each analyte binding cycle. Surface regeneration ensures that the same number of ligand binding sites is accessible to the analyte at the beginning of each cycle.
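To show how rate constants relate to the measured response and to the equilibrium dissociation constant, the following sketch uses a simple 1:1 interaction model. The choice of this model is an assumption made for illustration (the text above refers only to fitting with an algorithm such as CLAMP), and the function names and numerical values are hypothetical.

```python
import math


def equilibrium_constant(ka, kd):
    """Equilibrium dissociation constant KD (M) from the association rate
    constant ka (1/(M*s)) and the dissociation rate constant kd (1/s)."""
    return kd / ka


def association_response(t, conc, ka, kd, rmax):
    """Sensor response at time t (s) during analyte injection, assuming
    1:1 binding with no mass transport limitation.

    conc is the analyte concentration (M); rmax is the response at full
    occupancy of the immobilised receptor.
    """
    k_obs = ka * conc + kd                 # observed rate of approach to equilibrium
    r_eq = rmax * ka * conc / k_obs        # steady-state response at this concentration
    return r_eq * (1.0 - math.exp(-k_obs * t))


# Hypothetical rate constants chosen only to exercise the functions.
ka, kd, rmax = 1e5, 1e-3, 100.0
KD = equilibrium_constant(ka, kd)          # 1e-8 M, i.e. 10 nM

# At an analyte concentration equal to KD the steady-state response
# approaches half of rmax.
plateau = association_response(t=1e6, conc=KD, ka=ka, kd=kd, rmax=rmax)
```

Collecting responses over a range of concentrations, as described above, constrains both rate constants because the observed rate and the plateau each depend on concentration.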
Incubation periods are selected for optimum activity, but may also be optimized to facilitate rapid high-throughput screening. Normally, between 0.1 and 1 hour will be sufficient. In general, a plurality of assay mixtures is run in parallel with different test agent concentrations to obtain a differential response to these concentrations. Typically, one of these concentrations serves as a negative control, i.e. at zero concentration or below the level of detection.
The basic format of an in vitro competitive receptor binding assay as the basis of a heterogeneous screen for insulin analog replacements for native insulin may be as follows: occupation of the active site of IR or IGF-IR is quantified by time-resolved fluorometric detection (TRFD) as described by Denley et al. (2004). R⁻IR-A, R⁻IR-B and P6 cells are used as sources of IR-A, IR-B and IGF-IR respectively. Cells are lysed with lysis buffer (20 mM HEPES, 150 mM NaCl, 1.5 mM MgCl2, 10% (v/v) glycerol, 1% (v/v) Triton X-100, 1 mM EGTA pH 7.5) for 1 hour at 4°C. Lysates are centrifuged for 10 minutes at 3500 rpm and then 100 μl is added per well to a white Greiner Lumitrac 600 plate previously coated with anti-insulin receptor antibody 83-7 or anti-IGF-1R antibody 24-31. Neither capture antibody interferes with receptor binding by insulin, IGF-I or IGF-II. Approximately 100,000 fluorescent counts of europium-labelled insulin or europium-labelled IGF-I are added to each well along with various amounts of unlabelled competitor insulin analog and incubated for 16 hours at 4°C. Wells are washed with 20 mM Tris, 150 mM NaCl, 0.05% (v/v) Tween 20 (TBST) and DELFIA enhancement solution (100 μl/well) is added. Time-resolved fluorescence is measured using 340 nm excitation and 612 nm emission filters with a BMG Lab Technologies Polarstar™ Fluorimeter or a Wallac Victor II (EG & G Wallac, Inc.).
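One simple way an IC50 might be estimated from such a competition experiment is by interpolating, on a logarithmic concentration scale, between the two competitor concentrations that bracket 50% of the signal measured without competitor. The sketch below is an assumption for illustration only; curve fitting to a sigmoidal model is more usual in practice, and the function name and the data points are hypothetical rather than measured.

```python
import math


def ic50_by_interpolation(concentrations, percent_bound):
    """Estimate the IC50 (M) by log-linear interpolation.

    concentrations: competitor concentrations in ascending order (M).
    percent_bound: labelled-ligand signal at each concentration, expressed
    as a percentage of the signal without competitor.
    Returns None if the data never cross 50%.
    """
    for i in range(len(concentrations) - 1):
        s_lo, s_hi = percent_bound[i], percent_bound[i + 1]
        if s_lo >= 50.0 >= s_hi and s_lo != s_hi:
            fraction = (s_lo - 50.0) / (s_lo - s_hi)
            log_lo = math.log10(concentrations[i])
            log_hi = math.log10(concentrations[i + 1])
            return 10 ** (log_lo + fraction * (log_hi - log_lo))
    return None


# Hypothetical competition data, for illustration only.
competitor_conc = [1e-10, 1e-9, 1e-8, 1e-7]
signal_percent = [95.0, 75.0, 25.0, 5.0]

ic50 = ic50_by_interpolation(competitor_conc, signal_percent)
```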
Examples of other suitable assays which may be employed to assess the binding and biological activity of compounds to and on IR are well known in the art. For example, suitable assays may be found in PCT International Publication Number WO 03/027246. Examples of suitable assays include the following:
(i) Receptor autophosphorylation (as described by Denley et al. (2004). R" IR-
A, R IR-B cells or P6 cells are plated in a Falcon 96 well flat bottom plate at 2.5 x 104 cells/well and grown overnight at 37°C, 5% C(¾. Cells are washed for 4 hours in serum-free medium before treating with one of either insulin, IGF-I or IGF-II in ΙΟΟμΙ DMEM with 1% BSA for 10 minutes at 37°C, 5% C02. Lysis buffer containing 2mM Na3V04 and 1 mg/ml NaF is added to cells and receptors from lysates are captured on 96 well plates precoated with antibody 83-7 or 24-31 and blocked with lx TBST/0.5 BSA. After overnight incubation at 4°C, the plates are washed with 1 x TBST. Phosphorylated receptor is detected with europium-labelled antiphosphotyrosine antibody PY20 (130 ng/well, room temperature, 2 hours). DELFIA enhancement solution (100 μΐ/well) is added and time resolved fluorescence detected as described above.
(ii) Glucose uptake using 2-deoxy-[U-14C] glucose (as described by Olefsky, 1978). Adipocytes between days 8-12 post-differentiation in 24-well plates are washed twice in Krebs-Ringer Bicarbonate Buffer (25 mM Hepes, pH 7.4 containing 130 mM NaCl, 5 mM KCl, KH2PO4, 1.3 mM MgSO4·7H2O, 25 mM NaHCO3 and 1.15 mM CaCl2) supplemented with 1% (w/v) RIA-grade BSA and 2 mM sodium pyruvate. Adipocytes are equilibrated for 90 min at 37°C prior to insulin addition, or for 30 min prior to agonist or antagonist addition. Insulin (Actrapid, Novogen) is added over a concentration range of 0.7 to 70 nM for 30 min at 37°C. Agonist or antagonist (0 to 500 mM) is added to adipocytes for 90 min followed by the addition of native insulin in the case of antagonists. Uptake of 50 mM 2-deoxy glucose and 0.5 mCi 2-deoxy-[U-14C] glucose (NEN, PerkinElmer Life Sciences) per well is measured over the final 10 min of agonist stimulation by scintillation counting.
(iii) Glucose transporter GLUT4 translocation using plasma membrane lawns (as described by Robinson and James (1992) and Marsh et al. (1995)).
(iv) GLUT4 translocation using plasma membrane lawns (as described by Marsh et al. 1995). 3T3-L1 fibroblasts are grown on glass coverslips in 6-well plates and differentiated into adipocytes. After 8-12 days post-differentiation, adipocytes are serum-starved for 18 hrs in DMEM containing 0.5% FBS. Cells are washed twice in Krebs-Ringer Bicarbonate Buffer, pH 7.4 and equilibrated for 90 min at 37°C prior to insulin (100 nM) addition, or for 30 min prior to compound (100 μM) addition. After treatments, adipocytes are washed in 0.5 mg/ml poly-L-lysine in PBS, then shocked hypotonically by three washes in 1:3 (v/v) membrane buffer (30 mM Hepes, pH 7.2 containing 70 mM KCl, 5 mM MgCl2, 3 mM EGTA and freshly added 1 mM DTT and 2 mM PMSF) on ice. The washed cells are then sonicated using a probe sonicator (Microson) at setting 0 in 1:1 (v/v) membrane buffer on ice, to generate a lawn of plasma membrane fragments that remain attached to the coverslip. The fragments are fixed in 2% (w/v) paraformaldehyde in membrane buffer for 20 min at 22°C and the fixative quenched by 100 mM glycine in PBS. The plasma membrane fragments are then blocked in 1% (w/v) Blotto in membrane buffer for 60 min at 22°C and immunolabelled with an in-house rabbit affinity purified anti-GLUT4 polyclonal antibody (clone R10, generated against a peptide encompassing the C-terminal 19 amino acids of GLUT4) and Alexa 488 goat anti-rabbit secondary antibody (Molecular Probes; 1:200). Coverslips are mounted onto slides using FluoroSave reagent (Calbiochem), and imaged using an OptiScan confocal laser scanning immunofluorescence microscope (Optiscan, VIC, Australia). Data are analysed using ImageJ (NIH) imaging software. At least six fields are examined within each experiment for each condition, and the confocal microscope gain settings over the period of experiments are maintained to minimise between-experiment variability.
Insulin analog activity may be determined using an adipocyte assay. Insulin increases uptake of 3H glucose into adipocytes and its conversion into lipid. Incorporation of 3H into a lipid phase is determined by partitioning of the lipid phase into a scintillant mixture, which excludes water-soluble 3H products. The effect of insulin analogs on the incorporation of 3H glucose at a sub-maximal insulin dose is determined. The method is adapted from Moody et al. (1974). Mouse epididymal fat pads are dissected out, minced into digestion buffer (Krebs-Ringer 25 mM HEPES, 4% HSA, 1.1 mM glucose, 0.4 mg/ml Collagenase Type 1, pH 7.4), and digested for up to 1.5 hours at 36.5°C. After filtration, washing (Krebs-Ringer HEPES, 1% HSA) and resuspension in assay buffer (Krebs-Ringer HEPES, 1% HSA), free fat cells are pipetted into 96-well Picoplates containing test solution.
The assay is started by addition of 3H glucose (e.g. ex. Amersham TRK 239), in a final concentration of 0.45 mM glucose. The assay is incubated for 2 hours at 36.5°C, in a Labshaker incubation tower, 400 rpm, then terminated by the addition of Permablend/Toluene scintillant (or equivalent), and the plates sealed before standing for at least 1 hour and detection in a Packard Top Counter or equivalent. A full native insulin standard curve (8 dose) is run as control on each plate.
Data are presented graphically, as the effect of the insulin analog on 3H glucose uptake, with data compared to a native insulin response. The assay can also be run at basal or maximal insulin concentration.
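One way of comparing analog data to the native insulin response is to express each well as a percentage of the insulin-stimulated window on the same plate. The sketch below is illustrative only: the normalisation scheme, the function names and the counts are assumptions and are not taken from Moody et al. (1974).

```python
from statistics import mean


def percent_of_insulin_response(sample_cpm, basal_cpm, max_insulin_cpm):
    """Scale a sample count so that basal = 0% and maximal native insulin = 100%."""
    span = max_insulin_cpm - basal_cpm
    if span <= 0:
        raise ValueError("maximal insulin counts must exceed basal counts")
    return 100.0 * (sample_cpm - basal_cpm) / span


# Hypothetical replicate counts per minute from a single plate.
basal = mean([480, 520])
maximal = mean([4400, 4600])
analog_wells = [2400, 2600]
analog_percent = percent_of_insulin_response(mean(analog_wells), basal, maximal)
```

With these example values the analog wells sit halfway between the basal and maximal insulin controls.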
To test the in vivo activity of an insulin analog, an intravenous blood glucose test may be carried out on Wistar rats as follows. Male Mol:Wistar rats, weighing about 300 g, are divided into two groups. A 10 μl sample of blood is taken from the tail vein for determination of blood glucose concentration. The rats are then anaesthetized (e.g. with Hypnorm/Dormicum) at t = -30 min and blood glucose measured again at t = -20 min and at t = 0 min. After the t = 0 sample is taken, the rats are injected into the tail vein with vehicle or test substance in an isotonic aqueous buffer at a concentration corresponding to a 1 ml/kg volume of injection. Blood glucose is measured at times 5, 10, 20, 30, 40, 60, 80, 120, and 180 min. The anaesthetic administration is repeated at 20 min intervals.
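The resulting time course may, for example, be summarised as a percentage of the t = 0 value and as an area under the curve. The following is a minimal sketch under stated assumptions: normalising to the t = 0 sample and integrating by the trapezoidal rule are choices made for the example, and the glucose values are hypothetical.

```python
# Sampling times in minutes: the t = 0 sample plus the post-injection samples.
SAMPLE_TIMES_MIN = [0, 5, 10, 20, 30, 40, 60, 80, 120, 180]


def percent_of_baseline(glucose_mm):
    """Express each reading as a percentage of the first (t = 0) reading."""
    baseline = glucose_mm[0]
    return [100.0 * g / baseline for g in glucose_mm]


def trapezoid_auc(times, values):
    """Area under the curve by the trapezoidal rule (handles unequal spacing)."""
    if len(times) != len(values):
        raise ValueError("times and values must have the same length")
    return sum(
        (t1 - t0) * (v0 + v1) / 2.0
        for t0, t1, v0, v1 in zip(times, times[1:], values, values[1:])
    )


# Hypothetical blood glucose readings (mM) for one animal.
glucose = [6.0, 5.1, 4.2, 3.3, 3.0, 3.2, 3.9, 4.6, 5.4, 5.9]
relative = percent_of_baseline(glucose)
auc = trapezoid_auc(SAMPLE_TIMES_MIN, relative)
```

A vehicle-treated animal whose glucose stayed at baseline would give an area of 100 × 180 = 18000 in these units, which provides a reference against which treated animals can be compared.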
Insulin analogs, compounds and/or molecules designed or selected according to the methods of the present invention may also be assessed by a number of biophysical methods. Suitable methods include x-ray crystallography, analytical ultracentrifugation, size exclusion chromatography, isothermal calorimetry and the like. For example, insulin analogs (which includes compounds and molecules identified using the methods of the present disclosure) may be subjected to further confirmation by crystallization of the analog and structural determination, as described herein. For example, the multimerisation state in solution may be determined by analytical ultracentrifugation.
Analytical ultracentrifugation is carried out at 20°C using a Beckman XLI analytical centrifuge in 12 mm path-length cells. The sample containing the compound is diluted in 10 mM HCl into 10 mM Tris, 50 mM NaCl, pH 7.4 to a final concentration of 100 μg/mL. The person skilled in the art will appreciate that other buffers, salts and the like may be used. For example, samples may also be prepared that contain 0.2 mM ZnCl2, 2 mM CaCl2, 1 mM sodium phosphate (pH 7.4) or 0.1 M ammonium sulfate. An equal volume of 10 mM NaOH is added to neutralize any pH change. A total sample volume of 100 μl is used.
Radial concentration distributions are measured by absorbance at 220 nm. Sedimentation equilibrium is established at 30,000 and 45,000 rpm, as assessed by sequential absorbance scans 1 h apart. Data at both speeds are jointly fitted to a single ideal sedimenting species in SEDPHAT (Houtman et al. 2007) using values of solution density and compound partial specific volume estimated from composition using SEDNTERP (Laue et al. 1992). With the exception of the disulfides, all post-translational modifications are neglected in the estimation of compound partial specific volume.
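For a single ideal species at sedimentation equilibrium, the absorbance profile follows A(r) = A(r0)·exp[M(1 − v̄ρ)ω²(r² − r0²)/(2RT)], so the molar mass M can be read from the slope of ln A against r². The sketch below illustrates that relationship only; it is not the SEDPHAT global fit described above, it treats a single rotor speed, and the function names and numerical values are assumptions chosen for the example.

```python
import math

R_ERG = 8.314462618e7  # gas constant in erg mol^-1 K^-1 (CGS units)


def angular_velocity(rpm):
    """Convert rotor speed in rpm to angular velocity in rad/s."""
    return rpm * 2.0 * math.pi / 60.0


def ideal_species_absorbance(r_cm, r_ref_cm, a_ref, molar_mass, vbar, rho, rpm, temp_k):
    """Equilibrium absorbance at radius r_cm for one ideal, non-interacting species."""
    omega = angular_velocity(rpm)
    sigma = molar_mass * (1.0 - vbar * rho) * omega ** 2 / (2.0 * R_ERG * temp_k)
    return a_ref * math.exp(sigma * (r_cm ** 2 - r_ref_cm ** 2))


def molar_mass_from_profile(radii_cm, absorbances, vbar, rho, rpm, temp_k):
    """Least-squares slope of ln(A) versus r^2, converted to molar mass in g/mol."""
    x = [r * r for r in radii_cm]
    y = [math.log(a) for a in absorbances]
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    slope = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y)) / sum(
        (xi - mean_x) ** 2 for xi in x
    )
    omega = angular_velocity(rpm)
    return 2.0 * R_ERG * temp_k * slope / ((1.0 - vbar * rho) * omega ** 2)


# Synthetic profile for an assumed 5800 g/mol species at 30,000 rpm and 20 degrees C.
radii = [6.80 + 0.02 * i for i in range(16)]
profile = [
    ideal_species_absorbance(r, 6.80, 0.2, 5800.0, 0.73, 1.0, 30000, 293.15)
    for r in radii
]
fitted_mass = molar_mass_from_profile(radii, profile, 0.73, 1.0, 30000, 293.15)
```

Radii are in cm, the partial specific volume `vbar` in mL/g and the solution density `rho` in g/mL. An apparent mass well above the monomer value would suggest self-association under the conditions tested.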
Clinical Indications, Routes of Administration, Dosages and Pharmaceutical Compositions
The insulin analogs, compounds and molecules with which the present disclosure is concerned are of value in the treatment of conditions which are responsive to activation and/or modulation of IR. These conditions include those for which regulation of glucose metabolism and/or blood glucose levels is indicated.
Conditions which are responsive to activation and/or modulation of IR include, but are not limited to, diabetes mellitus (e.g. type 1 diabetes, type 2 diabetes, gestational diabetes), hyperglycemia, insulin resistance, impaired glucose tolerance and the like. In some embodiments, the condition is an insulin-related condition. Insulin-related conditions include, but are not limited to, hyperglycemia, insulin resistance, type 1 diabetes, gestational diabetes or type 2 diabetes.
The insulin analogs of the present invention are suitable for use in a subject, such as a mammal including a human, in order to regulate glucose metabolism.
Accordingly, there are provided methods for regulating glucose metabolism. Some embodiments relate to a method for regulating glucose metabolism by administering to a subject in need thereof a therapeutically effective amount of such an insulin analog. Some embodiments relate to a method for treating an insulin-related condition, comprising administering a therapeutically effective amount of the insulin analog as defined herein to a subject in need thereof. Some embodiments relate to a method for treating diabetes by administering to a subject in need thereof a therapeutically effective amount of an insulin analog. Diabetes includes, but is not limited to, type 1 diabetes, type 2 diabetes or gestational diabetes. Some embodiments relate to a method for treating hyperglycemia by administering to a subject in need thereof a therapeutically effective amount of such an insulin analog. Some embodiments relate to a method for treating insulin resistance by administering to a subject in need thereof a therapeutically effective amount of such an insulin analog. Some embodiments relate to a method for treating impaired glucose tolerance by administering to a subject in need thereof a therapeutically effective amount of such an insulin analog. Some embodiments relate to a method for decreasing blood glucose levels in a subject by administering to a subject in need thereof a therapeutically effective amount of such an insulin analog.
For example, disclosed are methods of treating type 1 diabetes in a subject comprising administering a therapeutically effective amount of a peptide comprising an insulin A chain peptide and an insulin B chain peptide, wherein the B chain peptide comprises a substitution at amino acid 10 and amino acid 20 to a subject in need thereof. In some instances, the substitution at amino acid 20 of the B chain peptide can be G20Y, G20F, or G20P. In some instances of the disclosed peptides, the substitution at amino acid 10 of the B chain peptide can be H10E, H10D or H10Q. In some instances, any combination of the B chain substitutions at amino acid 10 and 20 can be present. In some instances, the A chain of the administered peptide can also comprise at least one substitution. For example, in some instances, the at least one substitution in the A chain peptide can be T8H, T8Y, T8K, or S9R. In some instances, the amino acid substitution can be present at position 8 or 9 or both positions. Thus, in some instances, any combination of the disclosed B chain peptide substitutions and A chain peptide substitutions can be present. Also disclosed herein are methods of treating type 1 diabetes in a subject comprising administering a therapeutically effective amount of an insulin analog as defined herein. In some instances, the subject has been diagnosed with type 1 diabetes prior to administering the peptide. In some instances, the subject has been diagnosed with being at risk for developing type 1 diabetes prior to administering the peptide.
Use of an insulin analog, compound or molecule as defined herein in the manufacture of a medicament is also provided. Some embodiments relate to the use of the insulin analog as defined herein in the manufacture of a medicament for regulating glucose metabolism in a subject. Some embodiments relate to the use of the insulin analog as defined herein in the manufacture of a medicament for treating and/or preventing an insulin-related condition in a subject. Some embodiments relate to the use of the insulin analog as defined herein in the manufacture of a medicament for treating and/or preventing diabetes in a subject. Some embodiments relate to the use of the insulin analog as defined herein in the manufacture of a medicament for treating and/or preventing hyperglycemia in a subject. Some embodiments relate to the use of the insulin analog as defined herein in the manufacture of a medicament for treating and/or preventing insulin resistance in a subject. Some embodiments relate to the use of the insulin analog as defined herein in the manufacture of a medicament for treating and/or preventing impaired glucose tolerance in a subject. Some embodiments relate to the use of the insulin analog as defined herein in the manufacture of a medicament for decreasing blood glucose levels in a subject.
Some embodiments also relate to an insulin analog as defined herein for use in regulating glucose metabolism in a subject. Some embodiments relate to an insulin analog as defined herein for use in treating and/or preventing an insulin-related condition in a subject. Some embodiments relate to an insulin analog as defined herein for use in treating and/or preventing diabetes in a subject. Some embodiments relate to an insulin analog as defined herein for use in treating and/or preventing hyperglycemia in a subject. Some embodiments relate to an insulin analog as defined herein for use in treating and/or preventing insulin resistance in a subject. Some embodiments relate to an insulin analog as defined herein for use in treating and/or preventing impaired glucose tolerance in a subject. Some embodiments relate to an insulin analog as defined herein for use in decreasing blood glucose levels in a subject.
In some embodiments, administration of the insulin analog results in a decrease in blood glucose levels. Preferably, the insulin analog is a rapid acting insulin analog such that administration of the insulin analog results in a decrease in blood glucose levels within 60 minutes of administration. In some embodiments, administration of the insulin analog results in a decrease in blood glucose levels within minutes, 40 minutes, 30 minutes, 20 minutes, 10 minutes or 5 minutes of administration. In some embodiments, administration of the insulin analog results in a decrease in blood glucose levels for 1 hour, 2 hours, 3 hours, 4 hours, 5 hours, 6 hours, 7 hours, 8 hours, 9 hours, 10 hours, 11 hours, or 12 hours. In any of the methods disclosed herein, the preferred compounds that have been discussed in detail herein could be administered. Blood glucose levels can be measured by any means known to one of skill in the art.
Methods of Increasing Insulin Receptor Activation
Disclosed are methods of increasing insulin receptor activation in a subject comprising administering a therapeutically effective amount of any one of the disclosed insulin analogs, peptides, compounds, molecules or pharmaceutical compositions to a subject in need thereof. In some instances, a subject in need thereof can be a subject known to have decreased insulin receptor activation compared to a standard activation level. In some instances, a standard activation level of insulin receptor activation can be based on established levels in healthy individuals. In some instances, a standard activation level of insulin receptor activation can be based on established levels in the subject being treated prior to the determination of a need for increased insulin receptor activation.
For example, disclosed are methods of increasing insulin receptor activation in a subject comprising administering a therapeutically effective amount of a peptide comprising an insulin A chain peptide and an insulin B chain peptide, wherein the B chain peptide comprises a substitution at amino acid 10 and amino acid 20 to a subject in need thereof. In some instances, the substitution at amino acid 20 of the B chain peptide can be G20Y, G20F, or G20P. In some instances of the disclosed peptides, the substitution at amino acid 10 of the B chain peptide can be H10E, H10D or H10Q. In some instances, any combination of the B chain substitutions at amino acid 10 and 20 can be present. In some instances, the A chain of the administered peptide can also comprise at least one substitution. For example, in some instances, the at least one substitution in the A chain peptide can be T8H, T8Y, T8K, or S9R. In some instances, the amino acid substitution can be present at position 8 or 9 or both positions. Thus, in some instances, any combination of the disclosed B chain peptide substitutions and A chain peptide substitutions can be present. Also disclosed are methods of increasing insulin receptor activation in a subject comprising administering a therapeutically effective amount of an insulin analog as defined herein.
Methods of Lowering Blood Sugar
Disclosed are methods of lowering the blood sugar in a subject comprising administering a therapeutically effective amount of any one of the disclosed insulin analogs, peptides, compounds or pharmaceutical compositions to a subject in need thereof.
In some instances, a subject in need thereof can be a subject known to have increased blood sugar compared to a standard blood sugar level. In some instances, a standard blood sugar level can be based on established levels in healthy individuals. In some instances, a standard blood sugar level can be based on established levels in the subject being treated prior to the determination of a need for lowered blood sugar.
For example, disclosed are methods of lowering the blood sugar in a subject comprising administering a therapeutically effective amount of a peptide comprising an insulin A chain peptide and an insulin B chain peptide, wherein the B chain peptide comprises a substitution at amino acid 10 and amino acid 20 to a subject in need thereof. In some instances, the substitution at amino acid 20 of the B chain peptide can be G20Y, G20F, or G20P. In some instances of the disclosed peptides, the substitution at amino acid 10 of the B chain peptide can be H10E, H10D or H10Q. In some instances, any combination of the B chain substitutions at amino acid 10 and 20 can be present. In some instances, the A chain of the administered peptide can also comprise at least one substitution. For example, in some instances, the at least one substitution in the A chain peptide can be T8H, T8Y, T8K, or S9R. In some instances, the amino acid substitution can be present at position 8 or 9 or both positions. Thus, in some instances, any combination of the disclosed B chain peptide substitutions and A chain peptide substitutions can be present. Also disclosed are methods of lowering the blood sugar in a subject comprising administering a therapeutically effective amount of an insulin analog as defined herein.
Administration and dosages
The routes of administration and dosages described are intended only as a guide.
The person skilled in the art will understand, based on the disclosure set forth herein, that specific dosage regimens for compositions, formulations, methods and uses encompassed herein may be determined empirically through clinical and/or pharmacokinetic experimentation, and that such dosages may be adjusted according to prespecified effectiveness and/or toxicity criteria. It will also be understood that a specific dosage and treatment regimen for any particular patient will depend upon a variety of factors, including the activity and concentration of the specific compounds employed, the characteristics of the patient such as age, weight and response of the particular patient, active combination, the judgment of the treating physician and the nature and severity of the condition being treated. The below dosages are exemplary of the average case. There can, of course, be individual instances where higher or lower dosage ranges are merited, and such are within the scope of this invention.
In accordance with the methods and uses as described herein, a subject may receive a therapeutically effective amount of an insulin analog in one or more doses. The actual amount administered, and the rate and time-course of administration, will vary with the route of administration, the nature of the benefit required, the nature and severity of the condition being treated, the condition of the subject being treated and will ultimately be at the discretion of the attendant veterinarian or medical professional. Prescription of treatment, e.g. decisions on dosage, timing, etc., is within the responsibility of the attendant veterinarian or medical professional and typically takes account of the nature of the disorder, the condition of the individual patient, the site of delivery, the method of administration and other factors known to practitioners.
A person skilled in the art will understand that one or more analog(s) together or separately can be administered on any appropriate schedule, e.g., from one or more times per day to one or more times per week; including once every other day, for any number of days or weeks, or any variation thereon. Normally, the insulin analog is administered one, two, three, four or more times daily. However, the insulin analog can also be administered on an as-needed basis, for example when blood glucose levels are above the normal range or when blood glucose levels need to be reduced. Generally, the compound or composition of the present invention is administered with, before or after ingesting food. Preferably, the insulin analog is administered prior to every meal, for example breakfast, lunch and dinner. In some embodiments, the compound or composition is administered 5, 10, 20, 30, 40, 50 or 60 minutes before ingesting food. In some embodiments, the compound or composition is administered immediately before ingesting food.
The insulin analog, as well as pharmaceutical compositions as described herein, can be administered by any route known to one of skill in the art, the parenteral route being of most interest. Accordingly, in one embodiment of the invention the insulin analogs are administered by the parenteral route, such as by injection or infusion. Other suitable administration routes, for example enteral (e.g. oral administration), are within the scope of the present invention. In a specific embodiment, the parenteral route is preferred and includes intravenous, intraarticular, intraperitoneal, subcutaneous, intramuscular, intrasternal injection and infusion as well as administration by the sublingual, transdermal, topical, transmucosal including nasal route, or by inhalation such as, e.g., pulmonary inhalation.
The insulin analogs can be administered in a suitable vehicle or they can be administered in the form of a suitable pharmaceutical composition. Such compositions are also within the scope of the invention. Suitable pharmaceutical compositions are described in the following.
Pharmaceutical compositions
A person skilled in the art will appreciate that the insulin analogs described herein may be formulated in pharmaceutical compositions. Such compositions may include the insulin analog and one or more pharmaceutically acceptable carriers.
As used herein, the term "pharmaceutically acceptable carrier" includes any and all solids or solvents (such as phosphate buffered saline buffers, water, saline), dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like, compatible with pharmaceutical administration. The pharmaceutically acceptable carriers must be 'acceptable' in the sense of being compatible with the other ingredients of the composition and not deleterious to the recipient thereof. Suitable pharmaceutical carriers are described in "Remington's Pharmaceutical Sciences" by E. W. Martin (Mack Publishing Co., Easton, Pa.); Gennaro, A. R., Remington: The Science and Practice of Pharmacy, 20th Edition, (Lippincott, Williams and Wilkins), 2000; Lieberman, et al. Eds., Pharmaceutical Dosage Forms, Marcel Dekker, New York, N.Y., 1980; and Kibbe, et al. Eds., Handbook of Pharmaceutical Excipients (3rd Ed.), American Pharmaceutical Association, Washington, 1999. Generally, suitable pharmaceutically acceptable carriers are known in the art and are selected based on the end use application. For example, pharmaceutically acceptable carriers that may be used in the present invention include, but are not limited to, those suitable for injectable or infusion compositions. Supplementary active compounds can also be incorporated into the compositions. The use of such media and agents for pharmaceutically active substances is well known in the art. Pharmaceutical compositions are described in a number of sources that are well known and readily available to those skilled in the art, for example, Remington's Pharmaceutical Sciences (Martin E. W., Easton Pa., Mack Publishing Company, 19th ed., 1995).
The amount of pharmaceutically acceptable carrier will depend upon the level of the compound and any other optional ingredients that a person skilled in the art would classify as distinct from the carrier (e.g., other active agents). The formulations of the present invention may comprise, for example, from about 5% to 99.99%, or 25% to about 99.9%, or from 30% to 90% by weight of the composition, of a pharmaceutically acceptable carrier. The pharmaceutically acceptable carrier can, in the absence of other adjuncts, form the balance of the composition.
Optionally, the pharmaceutical composition of the present disclosure further comprises other additional components, for example therapeutic and/or prophylactic ingredients. The invention thus relates in a further aspect to a pharmaceutical composition comprising the compound of the present invention and one or more pharmaceutically acceptable carriers together with one or more other active agents. Generally, the amount of other active agent present in the pharmaceutical composition is sufficient to provide an additional benefit either alone or in combination with the other ingredients in the composition.
It will be understood by the person skilled in the art that these optional components may be categorized by their therapeutic or aesthetic benefit or their postulated mode of action. However, it is also understood that these optional components may, in some instances, provide more than one therapeutic or aesthetic benefit or operate via more than one mode of action. Therefore, classifications herein are made for the sake of convenience and are not intended to limit the component to that particular application or applications listed. Also, when applicable, the pharmaceutically-acceptable salts of the components are useful herein.
When other active agents are present in the pharmaceutical formulation of the present invention, the dose of the compound may either be the same as or differ from that employed when the other additional components are not present. Appropriate doses will be readily appreciated by those skilled in the art.
A pharmaceutical composition is formulated to be compatible with its intended route of administration, e.g., local or systemic. Examples of routes of administration include parenteral, e.g., intravenous, intradermal, subcutaneous, oral, nasal, topical, transdermal, transmucosal, and rectal administration. Oral and nasal administration include administration via inhalation. Solutions or suspensions used for parenteral, intradermal, or subcutaneous application can include the following components: a sterile diluent, such as water for injection, saline solution, fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic acid; buffers such as acetates, citrates or phosphates and agents for the adjustment of tonicity such as sodium chloride or dextrose. The pH can be adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be enclosed in ampoules, disposable syringes or multiple dose vials made of glass or plastic.
Pharmaceutical compositions suitable for injectable use include sterile aqueous solutions (where water soluble) or dispersions, non-aqueous solutions and sterile powders for the extemporaneous preparation of sterile injectable solutions or dispersions. For intravenous administration, suitable carriers include physiological saline, bacteriostatic water, Cremophor EL (BASF, Parsippany, N.J.) or phosphate buffered saline (PBS). In all cases, the composition must be sterile and should be fluid to the extent that easy syringability exists. It should be stable under the conditions of manufacture and storage and be preserved against the contaminating action of microorganisms such as bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity can be maintained, for example, by the use of a coating such as lecithin, by the maintenance of the required particle size in the case of dispersion and by the use of surfactants. Prevention of the action of microorganisms can be achieved by various antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. Isotonic agents, for example, sugars, polyalcohols such as mannitol, sorbitol, sodium chloride can also be included in the composition. Prolonged absorption of the injectable compositions can be brought about by including in the composition an agent that delays absorption, such as aluminum monostearate or gelatin.
Sterile injectable solutions can be prepared by incorporating the active compound in the required amount in an appropriate solvent with one or a combination of ingredients enumerated above, as required, followed by filter sterilization. Generally, dispersions are prepared by incorporating the active compound into a sterile vehicle, which contains a basic dispersion medium and the required other ingredients from those enumerated above. In the case of sterile powders for the preparation of sterile injectable solutions, suitable methods of preparation include vacuum drying and freeze-drying, which yield a powder of the active ingredient plus any additional desired ingredient from a previously sterile-filtered solution thereof.
Oral compositions generally include an inert diluent or an edible carrier. For the purpose of oral therapeutic administration, the active compound can be incorporated with excipients and used in the form of tablets, troches, or capsules, e.g., gelatin capsules. Oral compositions can also be prepared using a fluid carrier for use as a mouthwash. Pharmaceutically compatible binding agents, and/or adjuvant materials can be included as part of the composition. The tablets, pills, capsules, troches and the like can contain any of the following ingredients, or compounds of a similar nature: a binder such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or lactose, a disintegrating agent such as alginic acid, PRIMOGEL, or corn starch; a lubricant such as magnesium stearate or sterotes; a glidant such as colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; or a flavouring agent such as peppermint, methyl salicylate, or orange flavouring.
Formulations suitable for administration by nasal inhalation, where the carrier is a solid, include a coarse powder having a particle size, for example, in the range of about 20 to about 500 microns, which is administered in the manner in which snuff is taken, i.e., by rapid inhalation through the nasal passage from a container of the powder held close up to the nose. Suitable formulations wherein the carrier is a liquid, for administration by nebulizer, include aqueous or oily solutions of the agent. For administration by inhalation, the agent(s) can also be delivered in the form of drops or an aerosol spray from a pressurized container or dispenser that contains a suitable propellant, e.g., a gas such as carbon dioxide, or a nebulizer. Such methods include those described in U.S. 6,468,798.
Systemic administration can also be by transmucosal or transdermal means. For transmucosal or transdermal administration, penetrants appropriate to the barrier to be permeated are used in the formulation. Such penetrants are generally known in the art, and include, for example, for transmucosal administration, detergents, bile salts, and fusidic acid derivatives. Transmucosal administration can be accomplished through the use of nasal sprays, drops, or suppositories. For transdermal administration, the active compounds (e.g., polynucleotides of the invention) are formulated into ointments, salves, gels, or creams, as generally known in the art.
The pharmaceutical compositions may be prepared by any of the methods well known to a person skilled in pharmaceutical formulation. Generally, the compositions are prepared by contacting the insulin analog, molecule or compound uniformly and intimately with liquid carriers or finely divided solid carriers or both. Then, if necessary, the product is shaped into the desired formulation. For parenteral administration, in one embodiment the insulin analogs, compounds or molecules of the invention can be formulated by mixing them at the desired degree of purity, in a unit dosage injectable form (solution, suspension, or emulsion), with a pharmaceutically acceptable carrier.
In some embodiments, the pharmaceutical composition comprises a therapeutically effective amount of an insulin analog, peptide, compound or molecule according to the invention. The content of the insulin analog, peptide, compound or molecule of the invention in a pharmaceutical composition of the invention is e.g. from about 0.1 to about 100% w/w of the pharmaceutical composition.
Kits
The materials described above as well as other materials can be packaged together in any suitable combination as a kit useful for performing, or aiding in the performance of, the disclosed method. It is useful if the kit components in a given kit are designed and adapted for use together in the disclosed method. Disclosed are kits comprising one or more of the disclosed insulin analogs. For example, disclosed are kits comprising one or more of the disclosed insulin analogs, peptides, compounds or pharmaceutical compositions.
The present invention will now be described further with reference to the following examples, which are illustrative only and non-limiting. The examples refer to the figures.
EXAMPLES
Example 1 - Peptide Synthesis of Con-Ins G1
Solid phase peptide synthesis of Con-Ins G1 chain A and chain B.
Both A and B chains of Con-Ins G1 were synthesized with Fmoc (9-fluorenylmethyloxycarbonyl) chemistry on a CEM Liberty 1 automated microwave peptide synthesizer (CEM Corporation, Matthews, NC). For the synthesis of the A chain, pre-loaded Fmoc-Cys(Trt)-Rink Amide MBHA resin (0.21 mmol/g) (Peptides International, Louisville, KY) was used; for the synthesis of the B chain, pre-loaded Fmoc-Arg(Pbf)-Wang resin (0.4 mmol/g) (AnaSpec, EGT, Fremont, CA) was used. Fmoc-N-protected amino acids with side chain protection were from commercial sources: Bachem Inc. (Torrance, CA), Chem-Impex International (Wood Dale, IL), Genzyme (Cambridge, MA), Novabiochem (San Diego, CA), P3 Biosystem (Louisville, KY) and Reanal (Budapest, Hungary). Fmoc-γ-carboxy-L-glutamic acid γ,γ-di-t-butyl ester (Fmoc-Gla(OtBu)2-OH) was synthesized in house (Rivier et al. 1987). Side-chain protection for the amino acids was as follows: Lys, tert-butyloxycarbonyl (Boc); Hyp, Ser, Thr and Tyr, tert-butyl ether (tBu); Asn, Cys, and His, trityl (Trt); Arg, 2,2,4,6,7-pentamethyl-dihydrobenzofuran-5-sulfonyl (Pbf); Glu, Gla, and Asp, tert-butyl ester (OtBu); Cys, acetamidomethyl (Acm); Cys, 4-methoxytrityl (Mmt); Cys, S-tert-butylthionyl (S-t-Bu).
To be able to make the correct intra- and intermolecular disulfide bonds, the side chain protecting group of CysA7 and CysB7 was Acm, the side chain protecting group of CysA20 and CysB19 was Trt, the side chain of CysA11 was Mmt protected, and the side chain of CysA6 was S-t-Bu protected. Both chains were synthesized on a 0.1 mmol scale. Coupling reactions were performed on the resin in the presence of 5-fold molar excess of Fmoc-protected amino acids dissolved in DMF (except His in NMP) with activation by HATU [2-(1H-9-azabenzotriazol-1-yl)-1,1,3,3-tetramethylaminium hexafluorophosphate] : DIEA [N,N-diisopropylethylamine] : AA [protected amino acids] (0.9 : 2 : 1) at 0 W for 2 min then at 35 W with a maximum temperature of 60 °C for 10 min. Arg was always double-coupled at room temperature for 25 min then at 15 W with a maximum temperature of 50 °C for 12 min. Cys, His, and Gla were coupled at 40 W with a maximum temperature of 50 °C for 6 min. Deprotection of the Fmoc group was performed with 20% piperidine containing 0.1 M HOBt in DMF in two stages (using a fresh reagent each time): an initial deprotection of 2 min at 35 W followed by a 5 min deprotection at 35 W with a maximum temperature of 60 °C.
Con-Ins G1 chain A: intramolecular disulfide bond formation, cleavage and purification.
The intramolecular disulfide bridge between CysA6 and CysA11 was formed on the resin using a non-oxidative method (Galande et al. 2005). In the first step, the S-t-Bu group of CysA6 was removed by reduction to liberate the free thiol by treating the resin (760 mg) with 20% mercaptoethanol (ME) (Fluka) and 1% N-methylmorpholine (NMM) in dimethylformamide (DMF) (8 mL) overnight at room temperature. The resin was washed with DMF and dried. The resin was then reacted with a 10-fold excess (~1 mmol) of 2,2'-dithiobis(5-nitropyridine) (DTNP) (Sigma-Aldrich, St Louis, MO) in dichloromethane (DCM) (8 mL) for 1 h to form the S-5-nitropyridine-sulfenyl (5-Npys) protected CysA6. After washing out the excess of the reagents with DCM, the resin was treated with 1% trifluoroacetic acid (TFA) in DCM (8 mL) in the presence of 2 μL triisopropylsilane (TIS) as a scavenger for 20 min to deprotect CysA11 (Mmt) and to form the disulfide bridge between CysA6 and CysA11 at the same time.
Cleavage from the resin (720 mg) and simultaneous deprotection of chain A were performed by stirring the resin with 10 mL of a reagent containing TFA/water/TIS (95/2.5/2.5) for 2 h. This was followed by precipitation of the peptide using ice-cold anhydrous ethyl ether, then extraction with 0.1% TFA/40% water/60% acetonitrile and lyophilization. The peptide was purified by preparative Waters HPLC (Milford, MA) on a Waters PrepPak cartridge (2.5 x 10 cm) packed with Bondapak C18 (15-20 μm particle size, 300 Å) in solvent system A: 0.1% TFA/water, B: 0.1% TFA/40% water/60% ACN with a linear gradient ranging from 5% to 65% solvent B in 60 min at a flow rate of 20 mL/min. 18.7 mg (7.7 μmol) of chain A was obtained. The mass of the peptide was confirmed by electrospray ionization (ESI)-MS measured on a ThermoScientific LTQ Orbitrap XL (Waltham, MA) instrument (calculated monoisotopic MH+1: 2422.03 Da; determined monoisotopic MH+1 value 2422.01 Da).
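The agreement between the calculated and determined masses quoted above can be expressed as a relative mass error. The following is a minimal illustrative sketch, not part of the described procedure; the function name `ppm_error` is hypothetical and only the two mass values are taken from the text.

```python
def ppm_error(measured: float, calculated: float) -> float:
    """Relative mass error in parts per million (measured vs. calculated)."""
    return (measured - calculated) / calculated * 1e6

# Chain A MH+1 values quoted in the text (Da).
chain_a_error = ppm_error(measured=2422.01, calculated=2422.03)
print(f"chain A mass error: {chain_a_error:.2f} ppm")
```

A negative value indicates that the determined mass is slightly below the calculated one.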
Con-Ins G1 chain B: cleavage and purification.
Cleavage from the resin and simultaneous deprotection of chain B were performed by stirring 500 mg resin with 10 mL reagent containing TFA/thioanisole/3,6-dioxa-1,8-octanedithiol (DODT, TCI America, Portland, OR)/water (87.5/5/2.5/5) for 2 h, followed by precipitation of the peptide using ice-cold anhydrous ethyl ether, then extraction with 0.1% TFA/40% water/60% ACN and lyophilization. The peptide was purified by preparative HPLC (as described for purification of chain A, except that the gradient ranged from 20 to 80% solvent B in 60 min). 25.9 mg (9 μmol) of chain B was obtained. The mass of the peptide was confirmed by ESI-MS (calculated monoisotopic MH+1 value 2868.24 Da; determined monoisotopic MH+1 value 2868.22 Da).
DMSO-assisted chain A and chain B ligation to form partially folded Con-Ins G1 (containing one disulfide bond).
Chain A and chain B (7 μmol each) were dissolved together in 0.1% TFA/water solution (7.1 mL) and added to a mixture of 14.5 mL DMSO, 14.25 mL water, 35.6 mL 0.2 M Tris containing 2 mM EDTA, pH 7.5. The oxidation was monitored by analytical HPLC. After 25 h at room temperature, the reaction was quenched with 8% formic acid (1 mL), diluted with 0.1% TFA to a total volume of 225 mL and purified by preparative HPLC with a gradient ranging from 15 to 75% B in 60 min. 4.5 mg (0.85 μmol, 12.1% yield based on the starting amount of 7 μmol) of heterodimer was obtained. The identity of the peptide was confirmed by ESI-MS (calculated monoisotopic MH+1: 5287.25 Da; determined monoisotopic MH+1: 5287.19 Da).
I2-assisted oxidation to form fully-oxidized Con-Ins G1.
4.5 mg (0.85 μmol) of Con-Ins G1 (partially folded) was dissolved in 6.2 mL of 2.5% TFA/water solution, and 55 μL of I2 solution (50 mg I2 in 5 mL MeOH) was added and stirred for 60 min. The reaction was quenched by adding 1 M ascorbic acid solution until the yellow colour of the solution disappeared. The reaction was diluted with 60 mL water and loaded on a preparative RP-HPLC column. 1.5 mg (0.29 μmol) of fully-oxidized Con-Ins G1 was obtained (yield 35% based on the starting amount of the partially folded product containing one interchain disulfide bond and 4% based on the starting amount of purified chain A). The identity of the peptide was confirmed by ESI-MS on a ThermoScientific LTQ Orbitrap XL (Waltham, MA) mass spectrometer (calculated monoisotopic MH+1: 5143.16 Da; determined monoisotopic MH+1: 5143.16 Da).
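The step yields quoted for the ligation and oxidation steps are simple mole ratios. The sketch below is illustrative only; the helper name `percent_yield` is hypothetical, and the inputs are the rounded micromole amounts reported in the text, so the results may differ slightly from the quoted percentages.

```python
def percent_yield(product_umol: float, starting_umol: float) -> float:
    """Molar yield of a step, as a percentage of the starting amount."""
    if starting_umol <= 0:
        raise ValueError("starting amount must be positive")
    return 100.0 * product_umol / starting_umol

# Amounts in micromoles, as reported in the text.
ligation = percent_yield(0.85, 7.0)    # heterodimer from 7 umol of each chain
oxidation = percent_yield(0.29, 0.85)  # fully oxidized product from heterodimer
overall = percent_yield(0.29, 7.0)     # fully oxidized product from chain A used

print(f"ligation {ligation:.1f}%, oxidation {oxidation:.1f}%, overall {overall:.1f}%")
```

With these rounded inputs the ligation yield reproduces the reported 12.1%, while the oxidation and overall values come out close to the reported 35% and 4%.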
Purity of the peptide was assessed by RP-HPLC and capillary electrophoresis. Quantitative RP-HPLC was performed using a GE Healthcare AKTApurifier 10 (Pittsburgh, PA) and a Phenomenex (Torrance, CA) Kinetex XB-C18 column (4.6 x 100 mm, 5.0 μm particle size, 100 Å pore size). The solvent system comprised solvent A = 0.1% TFA in water and solvent B = 60% ACN, 40% A. A gradient was performed from 20 to 80% B in 30 min at a flow rate of 1.0 mL/min. Detection was at 214 and 280 nm. The purity of the peptide was determined to be 89%. Capillary electrophoresis (CE) was performed using a Groton Biosystems GPA 100 instrument (Boxborough, MA). The electrophoresis buffer was 0.1 M sodium phosphate (15% acetonitrile), pH 2.5. Separation was accomplished by application of 20 kV to the capillary (0.75 μm x 100 cm). Detection was at 214 nm. The assessed purity of the peptide was 80%.
Synthesis and purification of sCon-Ins G1.
Con-Ins G1 containing CysA6,A11 to SecA6,A11 modifications in the A chain (referred to as sCon-Ins G1; Sec = selenocysteine) was chemically synthesized, purified and oxidized as described by Safavi-Hemami et al. (2015), with the exception that corrected extinction coefficients were used for quantification of the B chain (2,980 M⁻¹·cm⁻¹) and fully oxidized sCon-Ins G1 (4,470 M⁻¹·cm⁻¹). Synthesis of sCon-Ins G1[GluA4, ProB3, GluB10] was performed as described for sCon-Ins G1 (Safavi-Hemami et al. (2015)). The stepwise formation of disulfide bonds of sCon-Ins G1[GluA4, ProB3, GluB10] is described in detail below.
sCon-Ins G1[GluA4] chain A: cleavage, DTT reduction and purification.
The peptide was cleaved from 125 mg of resin for 1.5 h using 1 mL of enriched Reagent K (TFA/water/phenol/thioanisole/1,2-ethanedithiol, 82.5/5.0/5.0/5.0/2.5 by volume), which was prepared using 2 mL TFA (Fisher Scientific, Fair Lawn, NJ), 66 μL H2O, 12 mg 2,2'-dithiobis(5-nitropyridine) (DTNP; Aldrich; Saint Louis, MO), and 150 mg phenol, followed by addition of 25 μL thioanisole. The cleavage mixture was filtered and precipitated with 10 mL of cold methyl-tert-butyl ether (MTBE; Fisher Scientific, Fair Lawn, NJ). The crude peptide was precipitated by centrifugation at 7,000 x g for 6 min and washed once with 10 mL cold MTBE. To induce intramolecular diselenide bond formation (SecA6 to SecA10), the washed peptide pellet was dissolved in 50% ACN (Fisher Scientific; Fair Lawn, NJ) (vol/vol) in water, and 2 mL of 100 mM dithiothreitol (DTT, EMD Chemicals, Gibbstown, NJ) in 1 mL of 0.2 M Tris-HCl (Sigma, St Louis, MO) containing 2 mM EDTA (Mallinckrodt, St. Louis, MO), pH 7.5, and 1 mL of water were added. The mixture was vortexed gently, and the reaction was allowed to proceed for 2 h. It was then quenched with 8% formic acid (vol/vol) (Fisher Scientific, Fair Lawn, NJ), diluted with 0.1% TFA (vol/vol) in water, and purified by reversed-phase (RP) HPLC using a semi-preparative C18 Vydac column (218TP510, 250 x 10 mm, 5-μm particle size; Grace, Columbia, MD) eluted with a linear gradient ranging from 10 to 40% solvent B in 30 min at a flow rate of 4 mL/min. The HPLC solvents were 0.1% (vol/vol) TFA in water (solvent A) and 0.1% TFA (vol/vol) in 90% aqueous ACN (vol/vol) (solvent B). UV absorbance was measured at 220 and 280 nm to monitor the eluent. Purity of the peptide was assessed by analytical RP-HPLC on a C18 Vydac column (218TP54, 250 x 4.6 mm, 5 μm particle size, Grace, Columbia, MD) using a linear gradient ranging from 10 to 40% of solvent B in 30 min with a flow rate of 1 mL/min. The peptide was quantified by UV absorbance at 280 nm using an extinction coefficient (ε) of 1,490 M⁻¹·cm⁻¹. From 135 mg of the resin, 3.8 mg of chain A was obtained. The mass of the peptide was confirmed by electrospray ionization (ESI)-MS (calculated monoisotopic MH+1: 2,473.674; determined monoisotopic MH+1: 2,472.924).
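Quantification by UV absorbance at 280 nm relies on the Beer-Lambert law, A = ε·c·l. The following is a minimal sketch of that conversion; the function names, the 1 cm path length, and the example absorbance, volume and molar mass are illustrative assumptions and are not values reported in the text (only ε = 1,490 M⁻¹·cm⁻¹ is).

```python
def molar_concentration(absorbance: float, epsilon: float, path_cm: float = 1.0) -> float:
    """Beer-Lambert law: concentration (mol/L) = A / (epsilon * path length)."""
    return absorbance / (epsilon * path_cm)

def peptide_mass_mg(absorbance: float, epsilon: float, volume_ml: float,
                    molar_mass: float, path_cm: float = 1.0) -> float:
    """Peptide mass (mg) in a solution of known volume and A280."""
    conc = molar_concentration(absorbance, epsilon, path_cm)  # mol/L
    moles = conc * volume_ml / 1000.0                          # mL -> L
    return moles * molar_mass * 1000.0                         # g -> mg

# Hypothetical reading: A280 = 0.50 in 2 mL, chain A epsilon from the text,
# and an assumed molar mass of about 2474 g/mol.
print(peptide_mass_mg(0.50, 1490.0, 2.0, 2474.0))
```

The same functions apply to the B chain and the heterodimer by substituting their extinction coefficients (2,980 and 4,470 M⁻¹·cm⁻¹).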
Molecular masses were calculated using ProteinProspector (version 5.12.1).
sCon-Ins G1[ProB3, GluB10] chain B: cleavage and purification.
The peptide was cleaved from 94 mg resin by a 3 h treatment with 1 mL of Reagent K and subsequently filtered, precipitated, and washed as described above. The washed peptide pellet was purified as described above with the exception that the gradient ranged from 15 to 45% solvent B. The same gradient was used to assess the purity of the linear peptide as described above, and peptide quantitation was carried out using an ε value of 2,980 M⁻¹·cm⁻¹. From 94 mg of the cleaved resin, 2.37 mg of chain B was obtained. The mass of the peptide was confirmed by ESI-MS (calculated monoisotopic MH+1: 2,808.24; determined monoisotopic MH+1: 2,808.25).
Copper-assisted chain A and chain B ligation to form sCon-Ins G1[GluA4, ProB3, GluB10].
A total of 100 nmol of each chain was combined and dried using a SpeedVac. The peptide mixture was dissolved in 100 μL of 0.1% TFA (vol/vol) and added to a mixture of 800 μL CuCl2·H2O (J.T. Baker, Phillipsburg, NJ) and 100 μL 1 M Tris-HCl containing 10 nM EDTA, pH 7.5. The final peptide concentration was 100 μM. The reaction was left for 24 h at room temperature and then quenched with 8% formic acid (vol/vol), diluted with 0.1% TFA and purified by RP-HPLC using a preparative C18 Vydac column eluted with a linear gradient ranging from 15 to 45% of solvent B in 30 min at a flow rate of 4 mL/min. The purity of sCon-Ins G1 was assessed by analytical RP-HPLC using the same gradient as for the semi-preparative purification, at a flow rate of 1 mL/min. sCon-Ins G1[GluA4, ProB3, GluB10] was quantified at 280 nm using an ε value of 4,470 M⁻¹·cm⁻¹. The yield of the reaction was 28%. From 900 nmol of the 1 : 1 mixture of chain A and B, 1.36 mg of the desired product was obtained. The identity of the peptide was confirmed by ESI-MS (calculated monoisotopic MH+1: 5,278.15; determined monoisotopic MH+1: 5,278.15).
Iodine (I2)-assisted formation of fully folded sCon-Ins G1[GluA4, ProB3, GluB10].
A solution of I2 (Acros Organics, Geel, Belgium) was prepared as follows: 10 mg of I2 was added to 5 mL of ACN. After 20 min of stirring, the I2 was completely dissolved, and 15 mL of water and 600 μL of TFA were added. A total of 300 μL of the I2 mixture was added to 149 nmol (90% purity) and 106 nmol (72% purity) of partially folded sCon-Ins G1[GluA4, Cys(Acm)7, ProB3, C(Acm)B7, GluB10] dissolved in 300 μL of 0.1% TFA each. Reactions were incubated for 5 min, quenched with 10 μL of 1 M L-ascorbic acid (Sigma, St. Louis, MO), diluted with 0.1% TFA in water to a total volume of 4.5 mL and purified as described for partially folded sCon-Ins G1[GluA4, ProB3, GluB10]. The purity of the final product (fully-folded sCon-Ins G1[GluA4, ProB3, GluB10]) was assessed by analytical RP-HPLC on a C18 Vydac column (218TP54, 250 x 4.6 mm, 5 μm particle size) using the same gradient as for the semi-preparative purification, at a flow rate of 1 mL/min, and was determined to be 97%. sCon-Ins G1[GluA4, ProB3, GluB10] was quantified as described for the partially folded product. The yield of the reaction was 14%, with 0.18 mg of the desired product being obtained. The identity of the peptide was confirmed by ESI-MS (calculated monoisotopic MH+1: 5,134.84; determined monoisotopic MH+1: 5,134.07).
Synthesis and Purification of hIns[DOI] and hIns[TyrB15, TyrB20, DOI]
hIns[DOI] is a monomeric analogue lacking residues 23 to 30 of the B chain of human insulin. hIns[DOI] and hIns[TyrB15, TyrB20, DOI] were chemically synthesized, purified and oxidized following standard procedures.
Example 2 - Con-Ins G1 Human Insulin Receptor Binding and Signalling Activation
Insulin receptor binding.
The ability of Con-Ins G1 to bind to the human insulin receptor (hIR) was measured by a binding competition assay. Competition binding assays were performed using solubilised immuno-captured human IR (isoform B) with europium-labelled human insulin and increasing concentrations of venom insulin as previously described (Denley et al. 2004). Time-resolved fluorescence was measured using 340-nm excitation and 612-nm emission filters with a Polarstar Fluorimeter (BMG Lab Technologies, Mornington, Australia). IC50 values were calculated, using Prism 6, by curve-fitting with a non-linear regression (one-site) analysis. At least three assays were performed with three replicates per data point.
The C. geographus venom insulin sCon-Ins G1 was found to be only thirty-fold less active against the human IR-B receptor than hIns (sCon-Ins G1: log [IC50 (nM)] = 1.24 ± 0.06; hIns: log [IC50 (nM)] = -0.26 ± 0.02) despite lacking any equivalent to ArgB23 through to ThrB30 of hIns (Figures 1 and 2).
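The "thirty-fold" figure follows directly from the difference of the two reported log IC50 values. The sketch below shows that arithmetic; the function names are hypothetical and the uncertainties (± values) are ignored for simplicity.

```python
def ic50_nM(log_ic50: float) -> float:
    """Convert log10[IC50 (nM)] to IC50 in nM."""
    return 10.0 ** log_ic50

def fold_difference(log_ic50_test: float, log_ic50_ref: float) -> float:
    """How many times higher the test IC50 is than the reference IC50."""
    return 10.0 ** (log_ic50_test - log_ic50_ref)

# log IC50 values quoted in the text for sCon-Ins G1 and hIns.
print(f"sCon-Ins G1 IC50: {ic50_nM(1.24):.1f} nM")
print(f"hIns IC50: {ic50_nM(-0.26):.2f} nM")
print(f"fold difference: {fold_difference(1.24, -0.26):.1f}")
```

The same conversion can be applied to log EC50 values from the signalling assay.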
Insulin signalling activation assay.
The ability of Con-Ins G1 to induce insulin signalling was assessed by Akt phosphorylation analysis. Briefly, pAkt Ser473 levels were measured in a mouse fibroblast cell line, NIH 3T3, overexpressing human IR-B. The cell line was cultured in DMEM with 10% fetal bovine serum (FBS), 100 U/mL penicillin-streptomycin and 2 μg/mL puromycin. For the assay, 40,000 cells per well were plated in 96-well plates with culture media containing 1% FBS. 24 h later, 50 μL of insulin solution was pipetted into each well after the removal of the original media. After a 30-min treatment, the insulin solution was removed and the HTRF pAkt Ser473 kit (Cisbio, Massachusetts, USA) was used to measure the intracellular level of pAkt Ser473. Briefly, the cells were first treated with cell lysis buffer (50 μL per well) for 1 h under mild shaking. 16 μL of cell lysate was then added to 4 μL of detecting reagent in a white 384-well plate. After 4-h incubation, the plate was read in a Synergy Neo plate reader (BioTek, Vermont, USA) and the data processed according to the manufacturer's protocol. The assays were repeated a total of four times. EC50 values were calculated (using Prism 6) by curve-fitting with a non-linear regression (one-site) analysis.
It was found that Con-Ins G1 is only ca. ten-fold less active than hIns in an Akt phosphorylation assay (sCon-Ins G1: log [EC50 (nM)] = 0.90 ± 0.05; Con-Ins G1: log [EC50 (nM)] = 0.78 ± 0.15; hIns: log [EC50 (nM)] = -0.20 ± 0.20; Figure 3). Our results highlight the existence within Con-Ins G1 of structural motifs that enable potent activity despite the venom protein's lack of an equivalent to either the canonical aromatic triplet or the B-chain C-terminal segment as a whole.
Example 3 - Crystal Structure of Con-Ins G1
Crystallisation and data collection.
Con-Ins G1 was synthesised as described above. Con-Ins G1 was prepared for crystallization in 10 mM HCl at a concentration of 4 mg/mL.
Initial crystallization trials employed a robotic 192-condition sparse-matrix hanging-drop screen conducted at the CSIRO Collaborative Crystallisation Centre (Parkville, Australia). A single crystal was extracted from a condition comprising 2.0 M ammonium sulphate plus 10% DL-malate-MES-Tris (pH 9.0) and then mounted directly (without cryo-protective agent) in a cryo-loop for diffraction data collection at 100 K on the MX2 beamline at the Australian Synchrotron (λ = 0.9537 Å). Data were processed to a resolution of 1.95 Å using XDS (Kabsch (2010)). The space group is P432 with unit cell dimensions a = b = c = 74.91 Å. From the apparent molecular mass of 5143 Da per monomer and 1 molecule per asymmetric unit, the solvent content is estimated as 64%.
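The solvent content estimate can be reproduced with a Matthews-coefficient calculation. The sketch below is illustrative and rests on two assumptions not stated in the text: that space group P432 contributes 24 symmetry-equivalent positions per unit cell, and that the solvent fraction is approximated by the commonly used relation 1 - 1.23/V_M. The text does not say which method was actually used; all function names are hypothetical.

```python
def matthews_coefficient(cell_edge_A: float, n_sym_ops: int,
                         mol_per_asu: int, mass_da: float) -> float:
    """V_M in cubic angstroms per dalton for a cubic unit cell."""
    cell_volume = cell_edge_A ** 3
    return cell_volume / (n_sym_ops * mol_per_asu * mass_da)

def solvent_fraction(v_m: float) -> float:
    """Approximate solvent fraction from V_M (assumes typical protein density)."""
    return 1.0 - 1.23 / v_m

# Cell edge and monomer mass from the text; 24 operators assumed for P432.
v_m = matthews_coefficient(cell_edge_A=74.91, n_sym_ops=24, mol_per_asu=1, mass_da=5143.0)
print(f"V_M = {v_m:.2f} A^3/Da, solvent = {100 * solvent_fraction(v_m):.0f}%")
```

Under these assumptions the calculation gives a value consistent with the 64% quoted above.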
Structure solution and refinement
The structure was solved by molecular replacement using as starting model an insulin monomer from PDB entry 3I3Z and using the PHASER software (McCoy et al. 2007). Crystallographic refinement employed PHENIX (Adams et al. 2010) iterated with model building within COOT (Emsley and Cowtan, 2004). The single sulphate ion observed close to the four-fold axis was modelled without restraint upon its orientation or position, effected in PHENIX by setting its occupancy to unity rather than 0.25. Data processing and refinement statistics are presented in Table 1. The final model had a Rwork / Rfree of 0.208 / 0.217. All residues in the final model lay in the favoured region of the Ramachandran plot.
Table 1: X-ray data processing and refinement statistics
X-ray data processing
  Space group               P432
  Cell dimensions
    a, b, c (Å)             74.91, 74.91, 74.91
    α, β, γ (°)             90, 90, 90
  Resolution (Å)            33.5 - 1.95 (2.02 - 1.95)a
  Rmerge                    0.368 (2.67)
  <I/σ(I)>                  6.46 (0.90)
  Completeness (%)          99.5 (95.8)
  Redundancy                8.9 (8.2)
X-ray refinement
  Resolution (Å)            33.5 - 1.95 (2.04
  No. reflections           5645 (635)
  Rwork / Rfree             0.211 / 0.233
  No. atoms
    protein                 349
    sulfate ion             1
    water                   35
  B factors (Å²)
    protein                 35
    sulfate ion             25
    water                   38
  r.m.s. deviations
    Bond lengths (Å)        0.013
    Bond angles (°)         1.39
  Ramachandran plot (%)     100.0 / 0.0 / 0.0
a Numbers in parentheses refer to the outer resolution shell.
b Data were included to the maximum resolution at which the CC1/2 correlation statistic remained significant at the p = 0.001 level of significance.
Example 4 - Con-Ins G1 Structure
The structure reveals that the overall Con-Ins G1 secondary structure is similar to that of hIns, with the N-terminal residues of the B-chain following an extended path similar to that of the classical T-state hIns (Figure 4a) (the T-state is characterized by residues B1-B8 being in an extended conformation, folded back against the A-chain helical assembly). As anticipated from the absence of residues equivalent to hIns B22-B30, there is no interface within the crystallographic unit cell resembling the hIns dimer interface. All monomer-monomer interfaces within the crystal are sparse, bar those formed between Con-Ins G1 monomers packed around the four-fold axis, each of which buries ~440 Å² of molecular surface. The four monomers coordinate an apparent sulphate ion lying close to the four-fold axis, which forms part of a charge-compensated cluster with the amides of GlyA1 and a single side-chain carboxylate group of each GlaA4 (Figure 5). Based on the sedimentation equilibrium data below, the inventors conclude that this association is an artefact of crystallization.
The hydrophobic core of Con-Ins G1 involves the side chains of residues ValA2, CysA6, CysA11, PheA16, TyrA19, ArgB6, IleB11, TyrB15 and LeuB18 (Figure 4b). Of these, three are identical in human insulin (CysA6, CysA11 and TyrA19), three differ conservatively (ValA2→Ile, IleB11→Leu and LeuB18→Val) and three are markedly different (ArgB6→Leu, PheA16→Leu and TyrB15→Leu). In hIns, the LeuB15 equivalent of TyrB15 in Con-Ins G1 packs in part against the core and in part against the side chain of hIns PheB24; substitution by Tyr reduces somewhat the exposed hydrophobic surface of the Con-Ins G1 monomer in the absence of an equivalent to hIns B24. The bulkier side chain of Con-Ins G1 PheA16 (compared to that of hIns LeuA16) appears to be associated with the change at TyrB15: the side chain of TyrB15 is further away from the core of the protein compared to its hIns counterpart, with the (larger) PheA16 aromatic ring compensating for this in terms of packing (Figure 4b).
Example 5 - Homology Modelling and Molecular Dynamics
To gain insight into the structural principles that enable Con-Ins G1 activity against the vertebrate insulin receptor in the absence of an equivalent of the key receptor-engaging residue hIns PheB24, the inventors created a model of Con-Ins G1 bound to the elements of the human insulin receptor (hIR) that form the primary binding site for the hormone. Models of Con-Ins G1 in complex with the IR L1-CR module (residues Gly5 to Lys310) and the IR αCT segment (residues Phe705 to Ser719 of the IR-A isoform) were created using MODELLER (v9.15) (Webb and Sali 2014) with the templates being the above crystal structure of Con-Ins G1, the crystal structure of the IR site 1 components in complex with hIns (PDB entry 4OGA; Menting et al. 2014), and the NMR structure of the A-chain of insulin (PDB entry 2HIU; Hua et al. 1995). All models included the post-translational modifications of Con-Ins G1 and a single N-linked N-acetyl-D-glucosamine residue at each of the IR residues Asn16, Asn25, Asn111, Asn215 and Asn255 (Sparrow et al. 2008).
Molecular dynamics (MD) simulations employed GROMACS (v5.0.4) (Pronk et al. 2013) with the CHARMM36 force field (Guvench et al. 2011; Best et al. 2012) and were initiated with the model of the Con-Ins G1 / IR complex that had the lowest MODELLER objective function. Ionizable residues, including the carboxy-glutamic acids, were assumed to be in their charged state. Each system was solvated using the TIP3P water model in a cubic box extending 10 Å beyond all atoms. Sodium and chloride ions were added to neutralize the system and provide a final ionic strength of 0.1 M. The protein and solvent (including ions) were coupled separately with velocity rescaling to a thermal bath at 300 K applied with a coupling time of 0.1 ps. All simulations were performed with a single non-bonded cut-off of 10 Å and applying the Verlet neighbour searching cut-off scheme with a neighbour-list update frequency of 25 steps (50 fs); the time step used in all the simulations was 2 fs.
Periodic boundary conditions were used with the particle-mesh Ewald method used to account for long-range electrostatics, applying a grid width of 1.2 Å and a sixth-order spline interpolation. All bond lengths were constrained using the P-LINCS algorithm. Simulations consisted of an initial minimization, followed by 50 ps of MD with all protein atoms restrained. Following positionally-restrained MD, MD simulations were continued for a further 10 ns applying positional restraints on the Cα atoms of the IR excluding the C-terminal residues of αCT (residues Val715 to Ser719). Following the Cα atom-restrained MD, the simulations were continued without restraints for a further 50 ns. The coordinates of the final model are provided in Appendix II.
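The simulation stages described above can be translated into integration step counts from the stated 2 fs time step. The sketch below is a bookkeeping aid only; the stage labels and function name are hypothetical, and it does not reproduce any actual GROMACS input file.

```python
TIME_STEP_FS = 2.0         # integration time step stated in the text
NEIGHBOUR_LIST_STEPS = 25  # neighbour-list update interval stated in the text

def n_steps(duration_ps: float, dt_fs: float = TIME_STEP_FS) -> int:
    """Number of integration steps needed to cover a duration in picoseconds."""
    return round(duration_ps * 1000.0 / dt_fs)

# Stage durations in picoseconds, taken from the protocol described above.
stages = {
    "all protein atoms restrained": 50.0,
    "IR C-alpha atoms restrained": 10_000.0,
    "unrestrained": 50_000.0,
}

for name, duration_ps in stages.items():
    print(f"{name}: {n_steps(duration_ps):,} steps")

# The neighbour-list interval of 25 steps corresponds to 25 * 2 fs = 50 fs.
print(f"neighbour-list update every {NEIGHBOUR_LIST_STEPS * TIME_STEP_FS:.0f} fs")
```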
TyrB15
A salient feature that emerges from the model is that the side chain of Con-Ins G1 TyrB15 is rotated with respect to its conformation in our crystal structure in order to avoid steric clash with the hIR αCT residue Phe714. The rotation directs the side chain of Con-Ins G1 TyrB15 into the pocket occupied by hIns PheB24 in the receptor complex, suggesting that Con-Ins G1 TyrB15 is thus a surrogate for hIns PheB24 in terms of receptor engagement (Figure 6). Such rotation of the TyrB15 side chain also permits the key hIR αCT residue Phe714 to engage the venom protein core (Figure 6).
By contrast, vertebrate insulins have leucine at position B15, which is strictly conserved.
TyrB20
The side chain of Con-Ins G1 TyrB20 is adjacent to that of Con-Ins G1 TyrB15 and may also be involved in compensating for the lack of an equivalent to hIns PheB24. We note that the crystallographic difference electron density associated with the TyrB15 side chain is somewhat poorly defined, compatible with such mobility (Figure 7). By contrast, vertebrate insulins have a glycine at position B20, which is strictly conserved.
The PTMs
Con-Ins G1 contains the following four post-translational modifications (PTMs): residues A4 and B10 are γ-carboxyglutamates (Gla) as opposed to Glu and His (respectively) in hIns, residue B3 is hydroxyproline (Hyp) as opposed to Asn in hIns, and the A-chain C-terminal residue CysA20 is amidated (Figure 1; note that the Con-Ins G1 B-chain numbering begins at -1 to allow comparison with hIns). Such modifications are commonly observed in conotoxins but have not been detected previously in insulins (Safavi-Hemami et al. 2015). A synthetic analogue of Con-Ins G1 containing PTMs was four times more active against the human IR-B than a PTM-free analogue (Figure 2) and induced Akt phosphorylation at eight-fold greater efficiency than the PTM-free analogue (Figure 3).
Examination of the four PTMs within the Con-Ins G1 structure reveals that all, with the exception of amidation of CysA20, are likely to play a role in stabilizing the structure of Con-Ins G1. Both side-chain carboxylates of GlaA4 are in polar interactions with the N-terminal amino group of GlyA1, as is the single side-chain carboxylate of hIns GluA4 (Figure 4c). These additional interactions in Con-Ins G1 may assist in stabilizing the short A-chain N-terminal helix. One side-chain carboxylate group of GlaB10 forms a hydrogen bond to the backbone amide of CysB7 and may play a role in stabilizing the B-chain N-terminal region; there is no equivalent interaction within hIns, hIns HisB10 being involved in hexamer formation (Figure 4d).
The side-chain hydroxyl group of HypB3 is equivalently located to the side-chain amide oxygen of hIns AsnB3 and may be able to form a (long) H-bond to the backbone amide of SerA12 (Figure 4e). The C-terminal amide of CysA20 makes no interaction with the remainder of the venom protein.
Within the model presented in Example 5, the only PTM residue interacting with the receptor is GlaB4, its interaction being equivalent to that between hIns GluB4 and αCT Asn711 (Figure 8).
Example 6 - hIns G1 DPI L15Y.G20Y Signalling Activation
The ability of hIns[DOI] and hIns[TyrB15, TyrB20, DOI] to induce insulin signalling was assessed by Akt phosphorylation analysis as described above. Briefly, pAkt Ser473 levels were measured in a mouse fibroblast cell line, NIH 3T3, overexpressing human IR-B. The cell line was cultured in DMEM with 10% fetal bovine serum (FBS), 100 U/mL penicillin-streptomycin and 2 μg/mL puromycin. For the assay, 40,000 cells per well were plated in 96-well plates with culture media containing 1% FBS. 24 h later, 50 μL of insulin solution was pipetted into each well after the removal of the original media. After a 30-min treatment, the insulin solution was removed and the HTRF pAkt Ser473 kit (Cisbio, Massachusetts, USA) was used to measure the intracellular level of pAkt Ser473. Briefly, the cells were first treated with cell lysis buffer (50 μL per well) for 1 h under mild shaking. 16 μL of cell lysate was then added to 4 μL of detecting reagent in a white 384-well plate. After 4-h incubation, the plate was read in a Synergy Neo plate reader (BioTek, Vermont, USA) and the data processed according to the manufacturer's protocol. The assays were repeated a total of four times. EC50 values were calculated (using Prism 6) by curve-fitting with a non-linear regression (one-site) analysis.
It was found that hIns[TyrB15, TyrB20, DOI] was ca. five-fold more active than hIns[DOI] in an Akt phosphorylation assay (Figure 9). Our results highlight the ability of mutations at positions 15 and 20 of the human insulin B chain to at least partially compensate for the lack of the 8 C-terminal residues.
Example 7 - Solution Properties of Con-Ins Gl
The self-association state of Con-Ins Gl in solution at 100 μg/mL was analysed using sedimentation equilibrium analysis. Briefly, analytical ultracentrifugation was conducted at 20°C using a Beckman XLI analytical centrifuge in 12 mm path-length cells. Con-Ins Gl was diluted from a 10 mg/mL stock in 10 mM HCl into 10 mM Tris, 50 mM NaCl, pH 7.4 to a final concentration of 100 μg/mL. An equal volume of 10 mM NaOH was added to neutralize any pH change. A total sample volume of 100 μL was used. Identical samples were prepared also containing 0.2 mM ZnCl2, 2 mM CaCl2, 1 mM sodium phosphate (pH 7.4) or 0.1 M ammonium sulfate. Radial concentration distributions were measured by absorbance at 220 nm. Sedimentation equilibrium was established at 30,000 and 45,000 rpm, as assessed by sequential absorbance scans 1 h apart. Data at both speeds were jointly fitted to a single ideal sedimenting species in SEDPHAT (Houtman et al. 2007) using values of solution density and solvent partial specific volume estimated from composition using SEDNTERP (Laue et al. 1992). With the exception of the disulfides, all post-translational modifications were neglected in the estimation of Con-Ins Gl partial specific volume. Reported errors describe the precision of the fit at 0.68 confidence level, estimated from Monte Carlo simulations as implemented in SEDPHAT.
The data (obtained at 30,000 and 45,000 rpm) are well described by Con-Ins Gl being a single sedimenting species of apparent MW 5380 ± 55 g/mol (Figure 10). Although the fit is excellent (reduced χ² = 0.95), the best-fit mass is slightly higher than expected. It is likely that this reflects an inaccurate estimate of the protein partial specific volume, which the inventors have determined from amino acid composition, neglecting the post-translational modifications present. It is possible, however, that this increase in apparent mass is the result of a small amount of higher molecular weight species being present. At most this would equate to the presence of 5% dimeric Con-Ins Gl. Based on a calculated theoretical mass of 5143, the inventors conclude that Con-Ins Gl is predominantly monomeric in solution. The monomeric nature of Con-Ins Gl is therefore in agreement with its lack of an equivalent to the C-terminal part of human insulin chain B (amino acids B22-B30), which has been shown to be critical for the oligomerisation of insulin.
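The upper bound of about 5% dimer can be reproduced with a short calculation. This is a sketch only: it assumes that the apparent mass behaves as a weight-average of monomer and dimer, which is a simplification of the full sedimentation equilibrium analysis, and the function name is hypothetical.

```python
def dimer_weight_fraction(apparent_mw, monomer_mw):
    """Weight fraction w of dimer, assuming Mw = (1 - w) * M + w * (2 * M)."""
    return apparent_mw / monomer_mw - 1.0

monomer_mw = 5143.0   # calculated theoretical mass quoted in the text
apparent_mw = 5380.0  # best-fit apparent MW quoted in the text
w = dimer_weight_fraction(apparent_mw, monomer_mw)
print(f"dimer weight fraction ~ {100 * w:.1f}%")
```

Under that assumption the excess mass corresponds to a dimer weight fraction a little below 5%, in line with the bound stated above.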
The inventors also tested whether Zn²⁺, Ca²⁺, SO₄²⁻ or PO₄³⁻ altered the aggregation state of Con-Ins Gl; in particular, in the case of Zn²⁺ to test whether the ion might mediate Con-Ins Gl multimerization as it does for hIns, and in the case of SO₄²⁻ to test whether the ion might be involved in mediating the tetrameric arrangement observed in the crystal. In the presence of each of these respective ions similar sedimentation equilibrium profiles were observed, equally well described by single sedimenting species and with no significant change in apparent MW (data not shown). Accordingly, the inventors conclude that Con-Ins Gl remains predominantly monomeric in the presence of each of these ions, at least at concentrations up to 100 μg/mL.
Example 8 - Crystal Structure of Con-Ins Gl in complex with human insulin receptor fragments that reconstitute the primary hormone binding site of the receptor
Crystallisation and data collection.
The primary insulin binding site ("Site 1") of the human insulin receptor (hIR) can be re-created in a domain-minimized form suitable for crystallographic analysis of the interaction of insulin or insulin analogues with Site 1 (Menting et al. 2013; Lawrence et al. 2016). Integral to this process is the further attachment of fragments of the monoclonal antibody 83-7 (Soos et al. 1986) to the CR domain of the receptor fragment to assist crystallization. This technique was used to generate crystals of Con-Ins Gl in co-complex with the elements that re-create hIR Site 1.
Con-Ins Gl was synthesised as described in Example 1. Con-Ins Gl was resuspended in 10 mM HCl.
hIR construct IR310.T (i.e., residues 1-310 of hIR followed by the N-terminal remnant Leu-Val-Pro-Arg of a thrombin cleavage site and inclusive of the population variant Tyr144His) was produced and purified as described in Menting et al. 2013. Fv83-7, the variable domain module of the monoclonal antibody 83-7 (Soos et al. 1986), was produced and purified as described in Lawrence et al. 2016. IR310.T was then complexed with Fv83-7 and the complex purified as described in Lawrence et al. 2016. The Fv83-7.IR310.T complex was then subjected to endoglycosidase H treatment as described in Lawrence et al. 2016.
Peptide IR-A704-719 of the A isoform of hIR was synthesized under contract by Genscript (USA).
Con-Ins Gl in complex with human insulin receptor fragments that reconstitute the primary hormone binding site of the receptor was prepared by combining EndoH-treated Fv83-7.IR310.T, IR-A704-719 and Con-Ins Gl as shown in Table 2.
Table 2: Preparation of Con-Ins Gl in complex with human insulin receptor fragments
Component | Buffer and concentration | Mol equivalent (relative to EndoH-treated Fv83-7.IR310.T)
EndoH-treated Fv83-7.IR310.T | 3 mg/ml in 10 mM HEPES-NaOH buffer pH 7.5 plus 0.02% sodium azide | 1
IR-A704-719 | 10 mM HCl | 3
Con-Ins Gl | 10 mM HCl | 3
Initial crystallization trials employed a robotic 576-condition sparse-matrix sitting-drop screen conducted at the CSIRO Collaborative Crystallisation Centre (Parkville, Australia). Each drop contained a 1:1 well:complex volume ratio. Crystal growth was observed using 1.8 to 2.0 M ammonium sulfate, either alone or with any one of 0.1 M Tris-HCl, pH 7.5; MOPS-NaOH, pH 7.0; and MES-NaOH, pH 6.5.
Crystallisation conditions were optimised and single crystals of the same protein:peptide:Con-Ins Gl mixture were grown using hanging drop format in Linbro 24 plates and a reservoir buffer consisting of 1.8 to 2.0 M ammonium sulfate, either alone or with 0.1 M Tris-HCl, pH 7.5. Other buffers (e.g. MOPS-NaOH, pH 7.0 and MES-NaOH, pH 6.5) were also trialled and produced similarly diffracting crystals. A single crystal grown in a solution of 1.7 M ammonium sulfate, 50 mM MOPS-NaOH pH 7.0 gave the best diffraction data.
The single crystal of Con-Ins Gl in co-complex with IR-A704-719 and the EndoH-treated Fv83-7.IR310.T complex was cryo-protected by transfer to a solution consisting of reservoir buffer plus 30% glycerol and extraction into a cryo-loop. The loop was then plunged directly into a bath of liquid nitrogen. Diffraction data to 3.25 Å resolution were collected at beamline MX2 at the Australian Synchrotron (Melbourne, Australia). Data were integrated and merged using the XDS package (Kabsch, 2010). Data processing statistics are presented in Table 3. The space group is I222 with unit cell dimensions a=106.16, b=227.12 and c=228.70 Å. From the apparent molecular mass of 63440 Da per complex and 2 complexes per asymmetric unit, the solvent content is estimated as 77%.
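The 77% solvent content can be checked with a Matthews-coefficient calculation. The sketch below assumes the conventional factor of 1.23 Å³/Da for the protein fraction and eight symmetry operators per unit cell for the body-centred orthorhombic space group I222; the function name and argument names are hypothetical.

```python
def matthews_solvent_content(a, b, c, n_sym_ops, copies_per_asu, mass_da,
                             protein_const=1.23):
    """Return (Vm in A^3/Da, solvent fraction) for an orthorhombic cell."""
    cell_volume = a * b * c                      # all angles are 90 degrees
    vm = cell_volume / (n_sym_ops * copies_per_asu * mass_da)
    solvent_fraction = 1.0 - protein_const / vm  # 1.23 A^3/Da is an assumed convention
    return vm, solvent_fraction

vm, solvent = matthews_solvent_content(
    a=106.16, b=227.12, c=228.70,  # unit cell lengths in angstroms (from the text)
    n_sym_ops=8,                   # I222: 4 point-group operations x 2 for centring
    copies_per_asu=2,              # complexes per asymmetric unit (from the text)
    mass_da=63440.0,               # apparent mass per complex (from the text)
)
print(f"Vm ~ {vm:.2f} A^3/Da, solvent ~ {100 * solvent:.0f}%")
```

A Matthews coefficient this high is consistent with the loosely packed, high-solvent crystal described above.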
Structure solution and refinement
The structure was solved by molecular replacement using PHASER (McCoy et al. 2007), employing as the search model a single Fv83-7.IR310.T component of PDB entry 4OGA (Menting et al. 2014). Two copies were located in the asymmetric unit. Electron density corresponding to two copies of Con-Ins Gl (bound separately to the two respective L1+IR-A704-719 fragments of IR) was visible in the difference electron density map. Residues of Con-Ins Gl were then built objectively into the electron density. X-ray crystallographic refinement employed PHENIX (Adams et al. 2010). Final stages of refinement included TLS refinement, restrained individual B-factor refinement and torsional NCS restraints. Final refinement statistics are presented in Table 3. The final model included the residues detailed in Table 4.
Table 3: X-ray data processing and refinement statistics
X-ray data processing
Space group I222
a, b, c (Å) 106.16, 227.12, 228.70
Resolution (Å) 50-3.25 (3.35-3.25)
Rmerge 0.259 (2.748)
I/σ(I) 6.44 (0.59)
CC1/2 0.992 (0.163)
Completeness (%) 99.8 (99.9)
Redundancy 5.8 (6.0)
X-ray refinement
Resolution (Å) 50-3.25
No. reflections 43845
Rwork / Rfree 0.2285 / 0.2786
No. atoms:
Protein 8969
Carbohydrate 282
Solvent 20
<B> (Å²):
Protein 124.9
Carbohydrate 153.9
Solvent 127.4
Root-mean-square deviations:
Bond lengths (Å) 0.002
Bond angles (°) 0.5
Ramachandran plot:
Allowed (%) 92.0
Favoured (%) 7.6
Outlier (%) 0.4
a Numbers in parentheses refer to the outer resolution shell.
b Data were included to the maximum resolution at which the CC1/2 correlation statistic remained significant at the p=0.001 level of significance.
Table 4: Residues included in the final model of Con-Ins Gl in complex with Fv83-7.IR310.T and IR-A704-719
The final model contains two copies (Complex 1 and Complex 2) of the complex in the asymmetric unit.
Component | Complex 1 | Complex 2
Con-Ins Gl A chain | 1-20 | 1-20
Con-Ins Gl B chain | 4-19 | 4-19
IR310.T | 5-159, 168-265, 276-309 | 5-159, 168-265, 276-309
IR-A704-719 | 705-719 | 705-715
Fv83-7 Heavy chain | 1-118 | 1-117
Fv83-7 Light chain | 1-114 | 1-111
No. of N-linked glycan residues | 11 | 11
No. of ions | 4 sulfate ions
Example 9 - Con-Ins Gl in complex with human insulin receptor fragments
The two copies of the complex within the crystallographic asymmetric unit showed only limited differences in overall structure (Figure 11), and the description here is thus restricted to Complex 1.
An overlay of the crystal structure of the Con-Ins Gl complex with the crystal structure of hIns in co-complex with Fab83-7.IR310.T and IR-A704-719 (PDB entry 4OGA; Menting et al. 2014) is provided in Figure 12. The overall structure of Con-Ins Gl in complex with human insulin receptor fragments is similar to that of the equivalent sub-structure within the structure of hIns complexed with IR310.T, IR-A704-719 and Fab 83-7. A salient difference is that for the Con-Ins Gl complex interpretable electron density was apparent for three residues N-terminal to Con-Ins Gl B7. The final model thus includes Con-Ins Gl residues B4, B5 and B6. In contrast, in the human insulin co-complex structure, the B chain of hIns could be modelled only from residue B7 onwards in the C-terminal direction.
Figure 13 shows an overlay of the structure of the Con-Ins Gl complex determined here with that of human insulin complexed with IR310.T, IR-A704-719 and Fab 83-7, focussing on residue TyrB15 of Con-Ins Gl. As is evident, the side-chain of TyrB15 is rotated from its receptor-free position to be positioned in the same location that is occupied by that of hIns PheB24 in the human insulin complex. The complex structure supports the conclusion from Example 5 that TyrB15 helps compensate for the lack of PheB24. No interpretable electron density is present for Con-Ins Gl TyrB20.
Example 10 - Molecular modelling of hIns[DOI], hIns[TyrB15, DOI] and hIns[TyrB20, DOI] with the components that comprise the primary binding site (Site 1) of the human insulin receptor
Models of des-octa-insulin (hIns[DOI]: a human insulin that lacks the eight C-terminal residues of the B chain) in complex with the IR L1 domain (residues Gly5 to Cys155) and IR-A705-714 were created with MODELLER (v9.16) (Webb & Sali, 2014) using the crystal structure of the IR Site 1 components in complex with human insulin (hIns) (PDB entry 4OGA; Menting et al. 2014) and the NMR structure of the A chain of insulin (PDB entry 2KJJ). All models included the post-translational modifications of a single respective N-linked N-acetyl-D-glucosamine residue at each of the IR residues Asn16, Asn25 and Asn111.
Molecular dynamics (MD) simulations were conducted using the GROMACS (v5.1.2) (Abraham et al. 2015) suite of programs and a modified version of the CHARMM36 (Guvench et al. 2011; Best et al. 2012) force field, initiated with the model of the hIns[DOI]-IR complex that had the lowest MODELLER objective function. Each system was solvated with TIP3P water in a cubic box extending 10 Å beyond all atoms, with periodic boundary conditions applied along all axes. Ionizable residues were assumed to be in their charged state and a final ionic strength of 0.1 M was obtained by neutralizing the system and adding sufficient sodium and chloride ions. Temperature coupling was conducted in 2 groups, with the protein and solvent coupled independently to a velocity-rescaling thermostat (Bussi et al. 2007) at 300 K, both groups utilising a time constant of 0.1 ps. Isotropic pressure coupling was implemented with the Berendsen (Berendsen et al. 1984) technique using a reference pressure of 1 bar and a time constant of 0.5 ps. All simulations were performed with a universal 12 Å non-bonded interactions cut-off, with long-range electrostatics accounted for using the particle-mesh Ewald method (Essmann et al. 1995) with a grid width of 1.0 Å and a sixth-order spline interpolation. The Verlet neighbour searching cut-off scheme was applied with a neighbour-list update frequency of 25 steps (50 fs); the time step used in all the simulations was 2 fs. All bond lengths were constrained with the P-LINCS algorithm (Hess, 2008). Simulations underwent an initial steepest descent minimization followed by 50 ps of MD with all protein atoms restrained. Following positionally restrained MD, MD simulations were continued for a further 100 ns.
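The run parameters described above map naturally onto a GROMACS parameter (.mdp) file. The following sketch collects them in a Python dictionary and renders them as text. It is an illustration under assumptions rather than the inventors' actual input: the coupling group names, the compressibility value and the helper function names are not given in the specification and are hypothetical.

```python
def build_mdp(sim_time_ns=100.0, dt_ps=0.002):
    """Collect the production-run settings described in the text (lengths in nm)."""
    nsteps = round(sim_time_ns * 1000.0 / dt_ps)  # 100 ns at a 2 fs time step
    return {
        "integrator": "md",
        "dt": dt_ps,
        "nsteps": nsteps,
        "pbc": "xyz",
        "cutoff-scheme": "Verlet",
        "nstlist": 25,                    # neighbour list updated every 25 steps
        "rvdw": 1.2,                      # 12 angstrom cut-off
        "rcoulomb": 1.2,
        "coulombtype": "PME",
        "fourierspacing": 0.1,            # 1.0 angstrom grid width
        "pme-order": 6,                   # sixth-order spline interpolation
        "tcoupl": "V-rescale",
        "tc-grps": "Protein Non-Protein", # assumed group names
        "tau-t": "0.1 0.1",
        "ref-t": "300 300",
        "pcoupl": "Berendsen",
        "pcoupltype": "isotropic",
        "tau-p": 0.5,
        "ref-p": 1.0,
        "compressibility": 4.5e-5,        # assumed value for water; not in the text
        "constraints": "all-bonds",
        "constraint-algorithm": "lincs",
    }

def render_mdp(params):
    """Render the dictionary in 'key = value' form, one setting per line."""
    return "\n".join(f"{key:<22} = {value}" for key, value in params.items())

mdp = build_mdp()
mdp_text = render_mdp(mdp)
print(mdp_text)
```

Keeping the settings in one structure makes it easy to verify derived quantities, for example that 25 steps of 2 fs give the stated 50 fs neighbour-list interval.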
Analysis of the insulin analogue interaction with the receptor was made using the FoldX suite of programs (Schymkowitz et al. 2005) after 100 ns MD. The FoldX RepairPDB utility was used to ensure the structures had no unreasonable torsion angles or van der Waals' clashes before the position scan was conducted, indicating the resultant mutational ΔΔG contribution at each site of the hIns[DOI].
Des-octa-insulin (hIns[DOI])
The interactions made by hIns[DOI] conserve those observed at the interface between insulin and the receptor in the X-ray crystal structure of native hIns with the receptor (PDB entry 4OGA), particularly interactions made within the hydrophobic pocket generated by B domain residues ValB12, LeuB15 and receptor residues Asn15, Leu37, Phe39, and Phe714. The absence of B-chain C-terminal residues results in the B-chain helix no longer unwinding, resulting in a transient salt bridge between ArgB22 and GluA17. This also allows the B-chain helix to shift closer to IR-L1, resulting in π-π parallel displaced stacking between TyrB16 and IR-L1 Tyr67. The final model is shown in Figure 14.
Des-octa-insulin-TyrB15 (hIns[TyrB15, DOI])
Over a 100 ns time period the interactions made by hIns[TyrB15, DOI] are similar to those made by hIns[DOI]. The B-chain helix mirrors the shift closer to the IR L1 domain with similar π-π stacking between TyrB16 and IR-L1 Tyr67. The flexibility of C-terminal B-chain residues similarly allows the transient salt bridge between ArgB22 and GluA17. The Tyr at the B15 position projects into the hydrophobic core of the DOI-(IR-A704-719)-L1 interface, occupying space otherwise occupied by hIns LeuB15. The final model is shown in Figure 15.
Des-octa-insulin-TyrB20 (hIns[TyrB20, DOI])
The initial comparative model included a restraint to ensure TyrB20 occupied the hIns B24 binding site. Following 100 ns of MD, TyrB20 remained in the hIns B24 binding site, with all other interactions with the receptor appearing native-like. Unlike hIns[DOI], the native π-π parallel displaced stacking between TyrB16 and IR L1 Phe39 was maintained; however, the salt bridge between ArgB22 and GluA17, caused by the lack of B-chain C-terminal residues, was the same as that observed with hIns[DOI]. The final model is shown in Figure 16.
FoldX mutational position scan analysis
The FoldX position scan utility provides a qualitative interpretation as to the positions within hIns[DOI] that can accommodate mutations, and those that cannot (Figures 17, 18 and 19). Within hIns[DOI] and hIns[TyrB20, DOI] the solvent-exposed residues within the A chain, particularly residues GluA4, GlnA5 and ThrA8, show little net positive or negative effect from most mutations. This differs from the hIns[TyrB15, DOI] analogue, for which a positive impact was instead predicted on mutation of these residues. Mutations occurring within the hydrophobic core at the interaction face, not surprisingly, appeared to have a greater effect in all models. All models also suggested that disulfide-bonded cysteines could undergo mutations with significant ΔΔG advantages; however, these data were removed because of the importance of the disulfide interactions to the overall structure of the analogues. Mutations at residues TyrA14, AsnA18, HisB5 and GluB21 were indicated to be advantageous in the DOI-TyrB20 analogue where they were not in the DOI analogue, indicating a more favourable interaction. The DOI-TyrB15 analogue, however, provided results analogous to an amalgam of the results of DOI and DOI-TyrB20, suggesting only minor differences at all sites, with the exception of HisB5. Mutations at positions AsnB3 and GlnB4 indicated the inverse; however, this is presumably due to the orientation of the B chain in proximity to the B chain helix at this frame, and is therefore an artefact of the sampling technique.
Conclusions
The MD simulations of the hIns analogues, hIns[DOI], hIns[TyrB15, DOI] and hIns[TyrB20, DOI], in complex with the IR L1 domain and IR-A705-71, together with the subsequent mutational analysis, provide insight into how hIns[DOI] binds despite the lack of key receptor engagement residues, particularly PheB24. The simulations indicate that the tyrosine substitution at position B20 can act as a substitute for PheB24 in hIns, occupying approximately the same location, and is stable over the simulated time. The simulation of hIns[DOI] (which lacks PheB24 and any attempt to replace it by mutation at B15 or B20) in interaction with the insulin receptor fragments shows that its engagement is broadly similar to that of hIns with IR L1 and IR-A705-719 but with subtle differences at sites between the IR L1 domain and the analogue B chain. The shift of the B chain towards the IR L1 domain is not observed in the hIns[TyrB20, DOI] complex due to the steric clash that would occur; however, it is observed in hIns[TyrB15, DOI], suggesting that the impact of substitution at B20 is different to that at B15. The mutational FoldX calculations suggest that similarly favourable mutations exist at sites unaffected by the resultant change in binding pose upon B20 mutation, with these differences localized distal to the hydrophobic core.
Example 11 - Development of monomeric human insulin analogs
Research Strategy for New Insulin Analogs
As described herein, the recent discovery of a monomeric insulin variant (Con-Ins-Gl) in the venom of a predatory snail has helped propel the research behind the disclosed peptides. Methods have been developed and data has been obtained that explain how Con-Ins-Gl both avoids dimerization and maintains receptor binding and insulin signaling, and thereby acts very quickly. Furthermore, insights from fundamental discoveries have been used to develop a protein that only differs from the sequence of human insulin at four amino acid positions yet is monomeric, fast acting, and displays potency comparable to that of authentic human insulin.
The insights gained from study of Con-Ins-Gl were used to develop a monomeric human insulin that displays only four amino acid substitutions from the human "shortened" protein.
To circumvent the constraints of human insulin's structure, solutions were taken from nature: fish-hunting cone snails, Conus geographus, have evolved the use of a specialized insulin in their venom that induces paralyzing hypoglycemic shock in fish within seconds. The sequence of venomous insulin, Con-Ins-Gl, was elucidated using a combination of genome sequencing and mass spectrometry (Figure 1). Notably, four post-translational modifications were observed: A4 Glu and B10 Glu to gamma-carboxyglutamic acid, Gla; B3 Pro to 2-hydroxyproline; and a C-terminal amide on the A-chain. Due to the low abundance of this venomous insulin, it cannot be isolated from the animal. Instead, a synthetic analogue (sCon-Ins-Gl) was obtained in which, for ease of synthesis, a diselenide bond replaced the intra-molecular disulfide bond in the A chain. sCon-Ins-Gl induces hypoglycemic shock when it is injected into fish, and it slows fish motility when it is present in the water. Other than its effects on fish, the most notable feature of Con-Ins-Gl is that it is the shortest insulin molecule reported to date, with a "shortened" B chain. Because a shortened human insulin (des-octapeptide insulin, DOI) is monomeric, this indicated that Con-Ins-Gl is monomeric and can be used as a UFI (ultra-fast acting insulin). Con-Ins-Gl lacks two segments that in human insulin are involved in binding to the human insulin receptor (hIR). First, A21 Asn of human insulin contacts hIR binding site 1 and its removal causes a 100-fold reduction in binding affinity. Second, the aromatic triplet (B24-B26) is a key element by which human insulin binds hIR, through contacts at hIR binding site 1. Removal of these residues leads to a 1,000-fold reduction in affinity.
Despite these concerns, Con-Ins-Gl (instead of the selenium analogue) was chemically synthesized and it was found that it binds to hIR with only 30-fold less affinity than human insulin. This surprising result raised a key question: how does Con-Ins-Gl bind to hIR without the key aromatic residues used by human insulin? The structure of Con-Ins-Gl was found to display a nearly identical backbone to that of human insulin. By fitting the Con-Ins-Gl structure into a published human insulin-hIR co-structure, it was inferred that Con-Ins-Gl B15 Tyr and B20 Tyr (Leu and Gly in human insulin) interact with human IR to substitute for the role played by human B24 Phe. These strong results provide a rational basis to develop a human monomeric UFI based on the snail insulin structure.
Develop human monomeric insulin analogs as therapeutic leads.
The development of ultra-fast acting insulin (UFI) represents the next major advance in insulin analogue development. The fundamental challenge in redesigning human insulin is that the same residues involved in receptor binding also mediate dimer formation. Thus, the discovery of the venomous insulin Con-Ins-Gl represents an important step forward in the creation of a monomeric, ultrafast-acting insulin because it lacks these residues (and thus does not dimerize) but retains the ability to bind and activate the insulin receptor. There is concern, however, that the low sequence identity between Con-Ins-Gl and human insulin could give rise to an immune response, especially given that diabetes is a chronic disease that requires daily insulin injections. Therefore, instead of developing a UFI based on the venomous insulin, one can start with the scaffold of human DOI (Des-octapeptide (B23-30) human insulin) because it is monomeric and because close analogs of this truncated human insulin are likely to be tolerated by the human immune system, as indicated by the current clinical use of insulin analogues displaying two or three mutations. The challenge, however, is that DOI is nearly inactive (1,000-fold weaker than human insulin). Data indicate that Con-Ins-Gl uses the B15 Tyr and/or B20 Tyr to compensate for the loss of B24 Phe, and further indicate additional modifications that enhance the affinity of Con-Ins-Gl. Leveraging these insights, DOI can be developed into an active UFI analogue as a therapeutic lead for diabetes treatment.
Develop human DOI into a bioactive monomeric insulin
Traditionally, DOI was synthesized enzymatically by trypsin cleavage of human insulin, which is not suitable for analogue synthesis. Therefore, a modular synthetic route to access DOI has been developed. The primary challenge for the synthesis of human insulin is the hydrophobic character of the A chain. By using an isoacyl peptide pair on the A8-A9 Thr-Ser, an extra charged residue (amine) was introduced to the A chain to increase its solubility (Figure 21). After disulfide bond formation, the isoacyl peptide underwent an O-to-N acyl shift at pH 8 to yield the DOI sequence. This synthetic DOI has the same molecular weight (from MALDI) and hIR activation activity as the enzymatically produced DOI, which supports the reliability of the developed method.
It has been demonstrated that the two Tyr on B15 and B20 of Con-Ins-Gl are important for hIR activation. To test the hypothesis that mutations on these two sites will increase the potency of hIR activation, three DOI analogues with B15 Leu and/or B20 Gly mutated to Tyr were synthesized. As shown in Figure 22, the two analogues with B20 Tyr have 5-fold increased potency in hIR activation while the B15 Tyr DOI analogue is similar to DOI. This demonstrates that B20 Tyr alone can increase the potency of DOI, likely due to compensation for loss of B24 Phe. To further increase potency, a DOI analogue that additionally displays B10 Glu was synthesized, which is the B10 substitution that gives the strongest hIR binding. This provided another 5-fold increase in potency compared to B20 Tyr alone, and has a similar potency as Con-Ins-Gl (Figure 23). This demonstrates that mutations from the venomous insulin can be grafted onto human DOI to develop bioactive analogues.
The crystal structures of Con-Ins-Gl and of Con-Ins Gl in complex with the site 1 fragments of the human insulin receptor lack clear electron density for the B20 residue, which indicates that it may be flexible. It is also possible that the side chain of a tyrosine substituent at this position may not optimally engage the insulin receptor. Therefore, the hypothesis that substitutions other than Tyr can further increase potency was tested by synthesizing a series of B10E, B20X DOI analogues with X being aromatic amino acids (Figure 24A). Interestingly, large substituents such as indole (Trp) and biphenyl groups led to higher potency in hIR activation (Figure 24B). The biphenyl analogue has 10% of the potency of human insulin (3-fold higher than B10E, B20Y DOI). This demonstrates the power of the interdisciplinary approach using both protein engineering and structural biology. The potency of DOI has been increased by 100-fold by mutating two positions. Halogen-substituted naphthyl and biphenyl groups on B20 can be used to further optimize DOI analogue potency.
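The stepwise potency gains quoted in this example can be combined in a short calculation. This is only a rough sketch using the rounded fold changes stated in the text, with human insulin set to a relative potency of 1; the variable names are hypothetical and no new data are implied.

```python
# Relative potency of unmodified DOI (1,000-fold weaker than human insulin).
doi_potency = 1 / 1000

# Rounded fold increases quoted in the text, applied in sequence.
steps = [
    ("B20 Tyr", 5),
    ("B10 Glu added to B20 Tyr", 5),
    ("B20 biphenyl in place of B20 Tyr", 3),
]

potency = doi_potency
total_fold = 1
for label, fold in steps:
    potency *= fold
    total_fold *= fold
    print(f"{label}: ~{100 * potency:.1f}% of human insulin")

print(f"cumulative gain ~{total_fold}-fold")
```

With these rounded inputs the product comes to several percent of human insulin potency, the same order of magnitude as the roughly 10% and 100-fold figures reported above.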
Because the A8 position is important for interacting with hIR binding site 2, the A8 His mutation can be introduced into the current lead analogue and assayed for hIR activation. Both A8 His and A9 Arg (original residues on Con-Ins-Gl) were introduced to the DOI analogue with B10 Glu and B20 Tyr, the lead analogue (Figure 23). This quadruple DOI mutant has potency for hIR activation that is comparable to that of human insulin (Figure 25). The mutations on Con-Ins-Gl promote binding to IR site 2. X-ray crystallography can be used to study the interaction between insulin and binding site 2. Protein engineering efforts can be expanded to the A8-A10 triplet to further optimize interaction with hIR binding site 2 by using a medicinal chemistry approach similar to the work on B20. Currently, the best DOI analog varies from the parent human insulin sequence at only 4 residues, so it is likely that the immunogenicity of the monomeric DOI analogues will be similar to that of the FDA-approved insulin analogues that are in clinical use.
It was demonstrated that each of the mutations at A8, A9, B10 and B20 individually affects hIR activation (Figure 26).
Evaluate monomeric insulin leads in STZ treated diabetic mouse models.
After potent monomeric insulin analogues are identified, the in vivo properties can be evaluated. An insulin tolerance test can be performed in STZ-treated mice to confirm the in vivo glucose-lowering ability. The two key features for a UFI analogue are fast onset and short duration of action. UFI analogue serum levels will be measured using HPLC coupled with mass spectrometry (LC/MS/MS) in diabetic mice after subcutaneous injections to measure the absorption rate (using insulin lispro as a control). For monomeric insulins, a faster absorption rate can be seen compared to the dimeric insulin lispro. Furthermore, glycemic clamp experiments can be used to quantify the onset and duration of UFI analogues in vivo by determining the amount of glucose infusion required to maintain a targeted glucose level. The glucose clamp study can show that UFI analogues have a shorter onset and duration of action due to their reduced depot effects in subcutaneous tissue. The combination of these properties can greatly reduce the risk of hypoglycemia.
It will be appreciated by persons skilled in the art that numerous variations and/or modifications may be made to the invention as shown in the specific embodiments without departing from the spirit or scope of the invention as broadly described. The present embodiments are, therefore, to be considered in all respects as illustrative and not restrictive. Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the method and compositions described herein. Such equivalents are intended to be encompassed by the following claims.
All publications discussed and/or referenced herein are incorporated herein in their entirety.
Any discussion of documents, acts, materials, devices, articles or the like which has been included in the present specification is solely for the purpose of providing a context for the present invention. It is not to be taken as an admission that any or all of these matters form part of the prior art base or were common general knowledge in the field relevant to the present invention as it existed before the priority date of each claim of this application.
REFERENCES
Abraham et al. (2015) SoftwareX 1-2, 19-25.
Adams et al. (1969) Nature 224, 491-495.
Adams et al. (2010) Acta Crystallogr. D. Biol. Crystallogr. 66, 213-221.
Bao et al. (1997) Proc. Natl. Acad. Sci. USA 94, 2975-2980.
Bentley (1997) Meth. Enzym., 276, 611-619.
Berendsen et al. (1984) J. Chem. Phys. 81, 3684-3690.
Best et al. (2012) J. Chem. Theory Comput. 8, 3257-3273.
Bricogne et al. (2011) BUSTER version 2.10, Cambridge, United Kingdom: Global Phasing Ltd.
Brunger (1996). X-PLOR reference manual 3.851 (Yale Univ., New Haven, CT).
Brunger (1997) Meth. Enzym., 276, 558-580.
Brunger et al. (1998) Acta Crystallogr. D. Biol. Crystallogr., 54, 905-921.
Bussi et al. (2007). J. Chem. Phys. 126: 014101.
Chen et al. (1998) J. Biol. Chem. 273, 16248-16258.
Cnudde et al. (2011) J. Biol. Inorg. Chem. 16, 257-266.
Dai et al. (2011) J. Inorg. Biochem. 105, 52-57.
Denley et al. (2004) Mol. Endocrinol. 18, 2502-2512.
Dodson and Steiner (1998) Curr. Opin. Struct. Biol. 8, 189-194.
Emsley and Cowtan (2004) Acta Crystallogr. D. Biol. Crystallogr. 60, 2126-2132.
Essmann et al. (1995) J Chem Phys 103, 8577-8593.
Galande et al. (2005) J. Comb. Chem. 7, 174-177.
Glendorf et al. (2011) PLoS One. 6, e20288.
Guvench et al. (2011) J. Chem. Theory Comput. 7, 3162-3180.
Heni et al. (2015) Nat. Rev. Endocrinol. 11, 701-711.
Hess (2008) J. Chem. Theory Comput. 4, 116-122.
Holm and Rosenstrom (2010) Nucl. Acids Res. 38, W545-549.
Houtman et al. (2007) Protein Sci. 16, 30-42.
Hua et al. (1995) Nat. Struct. Biol. 2, 129-138.
Jones et al. (1991) Acta Crystallogr., A 47, 110-119.
Kabsch (2010) Acta Crystallogr. D. Biol. Crystallogr. 66, 133-144.
Kleywegt and Jones (1994). CCP4/ESF-EACBM Newsletter on Protein Crystallography, 31, November 1994, 9-14. [http://xray.bmc.uu.se/usf/factory_4.html]
Krissinel and Henrick (2004) Acta Cryst. D60, 2256-2268.
King (2011) Expert Opin. Biol. Ther. 11, 1469-1484.
Lattman (1985) Meth. Enzymol., 115, 55-77.
Laue et al. (1992) in Analytical Ultracentrifugation in Biochemistry and Polymer Science 90-125.
Lawrence et al. (2016). J. Biol. Chem. 291, 15473-15481.
McCoy et al. (2007) J. Appl. Crystallogr. 40, 658-674.
Marsh et al. (1995) J. Cell Biol., 130, 1081-1091.
Menting et al. (2013) Nature 493, 241-245.
Menting et al. (2014) Proc. Natl. Acad. Sci. USA 111, E3395-E3404.
Moody et al. (1974) Horm. Metab. Res. 6(1), 12-6,
Morton and Myszka (1998) Methods Enzymol, 295, 268-294.
Murshudov et al. (1997) Acta Crystallogr. D. Biol. Crystallogr. 53, 240-255.
Muttenthaler et al. (2010) Biopolymers 94, 423-432.
Navaza and Saludjian (1997) Meth. Enzym. 276, 581-594.
Nice and Catimel (1999) Bioessays, 21, 339-352,
Olefsky (1978) Biochem. J., 172, 137-145.
Owens (2002) Nat. Rev. Drug Discov. 1, 529-540.
Pettersen et al. (2004) J Comput Chem. 25, 1605-12.
Phillips et al. (2005) J Comput Chem. 26, 1781-1802.
Pronk et al. (2013) Bioinformatics 29, 845-854.
Rivier et al. (1987) Biochemistry 26, 8508-8512.
Robinson and James (1992) Am. J. Physiol., 263, E383-E393.
Rossmann (1972) Int. Sci. Rev. Ser., No. 13, Gordon & Breach, New York.
Sambrook et al. (2001) Molecular Cloning: A Laboratory Manual, 3rd ed. Cold Spring
Harbor Laboratory Press.
Safavi-Hemami et al. (2015) Proc. Natl. Acad. Sci. USA 112, 1743-1748.
Schymkowitz et al. (2005) Nucleic Acids Res. 33, W382-8.
Smith et al. (2003) D. Biol. Crystallogr. 59, 474-482.
Soos et al. (1986). Biochem. J. 235, 199-208.
Sparrow et al. (2008) Struct. Funct. Bioinform. 71, 426-439.
Tong and Rossmann (1997) Meth. Enzym. 276:594-611.
Walewska et al. (2009) Angew. Chem. Int. Ed. Engl. 48, 2221-2224.
Webb and Sali (2014) Curr. Protoc. Bioinformatics 47, 5.6.1-5.6.32.
Weiss (2009) Vitam. Horm. 80, 33-49.
Zambelli et al. (2016) Pharmacol. Res., epub ahead of print.
APPENDIX I: ATOMIC COORDINATES FOR CON-INS G1 INSULIN
(CHAIN A AND CHAIN B)
Format comprises the following colon-separated fields: (1) atom name, (2) residue type, (3) chain name, (4) residue number, (5-7) xyz coordinates, (8) thermal B-factor.
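The eight-field record format described above can be read with a few lines of Python. The following is a minimal sketch, not part of the specification: the names `AtomRecord`, `parse_record` and `parse_line` are hypothetical, and it assumes that each printed line holds one or more whitespace-separated records and that no record contains internal spaces.

```python
from typing import List, NamedTuple


class AtomRecord(NamedTuple):
    """One record: fields (1)-(8) of the colon-separated format."""
    atom_name: str
    residue_type: str
    chain: str
    residue_number: int
    x: float
    y: float
    z: float
    b_factor: float


def parse_record(record: str) -> AtomRecord:
    """Parse a single colon-separated record such as 'N:GLY:A:1:15.439:38.785:34.309:26.76'."""
    fields = record.strip().split(":")
    if len(fields) != 8:
        raise ValueError(f"expected 8 colon-separated fields, got {len(fields)}: {record!r}")
    name, residue, chain, number, x, y, z, b = fields
    # Residue numbers may be zero or negative (e.g. B:-1), so int() is applied directly.
    return AtomRecord(name, residue, chain, int(number), float(x), float(y), float(z), float(b))


def parse_line(line: str) -> List[AtomRecord]:
    """Parse every whitespace-separated record on one printed line (assumed layout)."""
    return [parse_record(token) for token in line.split()]
```

For example, `parse_line("N:GLY:A:1:15.439:38.785:34.309:26.76 OE21:CGU:A:4:18.448:39.819:35.754:54.62")` yields two records, the first for atom N of GLY A1 and the second for atom OE21 of CGU A4.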
N:GLY:A:1:15.439:38.785:34.309:26.76 OE21:CGU:A:4:18.448:39.819:35.754:54.62
CA:GLY:A:1:14.11:39.14:33.733:22.21 OE22:CGU:A:4:18.871:41.181:37.42:85.42
C:GLY:A:1:14.251:39.976:32.492:21.3 C:CGU:A:4:18.255:43.089:31.618:20.98
O:GLY:A:1:15.353:40.312:32.093:22.73 O:CGU:A:4:19.303:43.691:31.269:21.08
H1:GLY:A:1:15.395:38.813:35.197:32.11 HG:CGU:A:4:18.338:43.105:35.863:33.4
H2:GLY:A:1:15.666:37.965:34.049:32.11 HB3:CGU:A:4:17.661:41.115:33.568:26.58
H3:GLY:A:1:16.051:39.366:34.024:32.11 HB2:CGU:A:4:19.22:41.867:33.754:26.58
HA2:GLY:A:1:13.625:38.331:33.508:26.65 HA:CGU:A:4:18.079:44.185:33.551:25.82
HA3:GLY:A:1:13.595:39.639:34.386:26.65 N:HIS:A:5:17.548:42.333:30.786:21.02
N:VAL:A:2:13.113:40.325:31.895:21.05 CA:HIS:A:5:18.007:41.966:29.443:20.65
CA:VAL:A:2:13.122:41.169:30.72:22.11 C:HIS:A:5:17.659:43.013:28.373:19.86
C:VAL:A:2:13.696:42.561:31.01:20.41 O:HIS:A:5:18.285:43.07:27.334:23.44
O:VAL:A:2:14.274:43.187:30.116:20.06 CB:HIS:A:5:17.391:40.584:29.071:20.16
CB:VAL:A:2:11.698:41.225:30.132:25.46 CG:HIS:A:5:18.115:39.864:27.981:47.58
CG1:VAL:A:2:10.72:41.927:31.074:21.3 ND1:HIS:A:5:19.277:39.15:28.201:43.05
CG2:VAL:A:2:11.725:41.902:28.839:27.88 CD2:HIS:A:5:17.832:39.734:26.661:45.41
H:VAL:A:2:12.33:40.084:32.154:25.26 CE1:HIS:A:5:19.678:38.617:27.058:50.46
HA:VAL:A:2:13.693:40.756:30.053:26.54 NE2:HIS:A:5:18.82:38.957:26.111:52.67
HB:VAL:A:2:11.38:40.32:29.989:30.55 HA:HIS:A:5:18.972:41.867:29.458:24.79
HG11:VAL:A:2:9.841:41.939:30.665:25.56 HB2:HIS:A:5:17.402:40.017:29.858:24.19
HG12:VAL:A:2:10.688:41.442:31.914:25.56 HB3:HIS:A:5:16.476:40.72:28.778:24.19
HG13:VAL:A:2:11.026:42.835:31.227:25.56 HD2:HIS:A:5:17.106:40.104:26.213:54.49
HG21:VAL:A:2:10.824:41.931:28.48:33.45 HE1:HIS:A:5:20.437:38.093:26.94:60.55
HG22:VAL:A:2:12.061:42.804:28.963:33.45 HE2:HIS:A:5:18.873:38.729:25.284:63.2
HG23:VAL:A:2:12.306:41.412:28.238:33.45 H:HIS:A:5:16.775:42.01:30.98:25.23
N:VAL:A:3:13.604:43.056:32.249:26.31 N:CYS:A:6:16.641:43.84:28.616:19.88
CA:VAL:A:3:14.092:44.414:32.525:22.38 CA:CYS:A:6:16.113:44.731:27.591:19.76
C:VAL:A:3:15.618:44.365:32.529:26.64 C:CYS:A:6:16.063:46.184:28.028:21.58
O:VAL:A:3:16.283:45.246:31.982:21.94 O:CYS:A:6:15.909:47.06:27.165:20.58
CB:VAL:A:3:13.517:44.989:33.842:28.85 CB:CYS:A:6:14.695:44.308:27.194:21.07
CG1:VAL:A:3:14.192:46.325:34.21:23.35 SG:CYS:A:6:14.678:42.722:26.317:26
CG2:VAL:A:3:11.967:45.154:33.731:22.25 H:CYS:A:6:16.238:43.902:29.373:23.85
H:VAL:A:3:13.276:42.642:32.928:31.58 HA:CYS:A:6:16.676:44.674:26.803:23.71
HA:VAL:A:3:13.816:44.998:31.802:26.85 HB2:CYS:A:6:14.154:44.217:27.994:25.29
HB:VAL:A:3:13.695:44.361:34.559:34.62 HB3:CYS:A:6:14.315:44.981:26.608:25.29
HG11:VAL:A:3:13.808:46.654:35.037:28.02 N:CYS:A:7:16.128:46.463:29.323:21.05
HG12:VAL:A:3:15.144:46.176:34.323:28.02 CA:CYS:A:7:16.186:47.837:29.839:21.74
HG13:VAL:A:3:14.039:46.963:33.496:28.02 C:CYS:A:7:17.553:48.188:30.41:23.78
HG21:VAL:A:3:11.628:45.515:34.565:26.7 O:CYS:A:7:18.103:49.215:30.042:30.88
HG22:VAL:A:3:11.767:45.762:33.001:26.7 CB:CYS:A:7:15.098:48.081:30.895:21.43
HG23:VAL:A:3:11.569:44.287:33.558:26.7 SG:CYS:A:7:15.221:49.718:31.732:27.59
N:CGU:A:4:16.193:43.318:33.099:21.34 H:CYS:A:7:16.14:45.864:29.941:25.26
CA:CGU:A:4:17.68:43.233:33.107:21.52 HA:CYS:A:7:16.016:48.446:29.103:26.09
CB:CGU:A:4:18.136:42.041:33.939:22.15 HB2:CYS:A:7:14.229:48.033:30.465:25.71
CG:CGU:A:4:17.867:42.164:35.47:27.83 HB3:CYS:A:7:15.165:47.394:31.576:25.71
CD1:CGU:A:4:16.419:42.187:35.88:25.08 N:HIS:A:8:18.157:47.348:31.264:28.49
OE12:CGU:A:4:16.135:42.812:36.937:28.6 CA:HIS:A:8:19.509:47.657:31.747:25.02
OE11:CGU:A:4:15.531:41.542:35.211:22.71 C:HIS:A:8:20.564:47.453:30.657:30.05
CD2:CGU:A:4:18.45:40.98:36.25:48.38 O:HIS:A:8:21.558:48.179:30.604:34.16
CB:HIS:A:8:19.847:46.811:32.972:27.96 SG:CYS:A:11:15.186:43.358:24.401:23.25
CG:HIS:A:8:18.916:47.03:34.125:33.4 H:CYS:A:11:17.296:45.881:23.98:24.33
ND1:HIS:A:8:18.619:46.043:35.044:34.99 HA:CYS:A:11:18.598:43.683:23.076:24.13
CD2:HIS:A:8:18.234:48.129:34.521:27.24 HB2:CYS:A:11:17.048:42.175:23.763:24.7
CE1:HIS:A:8:17.767:46.516:35.936:33.7 HB3:CYS:A:11:17.368:43.045:25.045:24.7
NE2:HIS:A:8:17.527:47.782:35.648:38.53 N:SER:A:12:17.349:43.443:21.002:19.95
H:HIS:A:8:17.818:46.619:31.57:34.19 CA:SER:A:12:16.64:43.374:19.734:20.24
HA:HIS:A:8:19.538:48.589:32.014:30.03 C:SER:A:12:15.198:42.876:19.921:19.9
HB2:HIS:A:8:19.802:45.874:32.728:33.55 O:SER:A:12:14.82:42.314:20.95:19.48
HB3:HIS:A:8:20.744:47.033:33.268:33.55 CB:SER:A:12:17.365:42.442:18.794:20.66
HD2:HIS:A:8:18.233:48.959:34.103:32.68 OG:SER:A:12:17.226:41.115:19.254:20.36
HE1:HIS:A:8:17.417:46.045:36.658:40.44 H:SER:A:12:17.993:42.877:21.068:23.94
HE2:HIS:A:8:17.009:48.305:36.092:46.23 HA:SER:A:12:16.611:44.256:19.332:24.29
N:ARG:A:9:20.377:46.477:29.813:32.17 HB2:SER:A:12:16.978:42.517:17.908:24.79
CA:ARG:A:9:21.026:46.221:28.542:24.61 HB3:SER:A:12:18.306:42.677:18.772:24.79
C:ARG:A:9:19.985:46.413:27.464:24.55 HG:SER:A:12:17.626:40.589:18.736:24.43
O:ARG:A:9:18.838:46.063:27.712:22.58 N:ASN:A:13:14.386:43.089:18.896:20.29
CB:ARG:A:9:21.524:44.776:28.434:42.6 CA:ASN:A:13:13.05:42.486:18.857:20.21
CG:ARG:A:9:22.966:44.545:28.786:89.14 C:ASN:A:13:13.113:40.969:18.999:19.99
CD:ARG:A:9:23.608:43.537:27.831:118.62 O:ASN:A:13:12.303:40.375:19.72:19.76
NE:ARG:A:9:25.059:43.702:27.796:137.92 CB:ASN:A:13:12.338:42.827:17.571:20.89
CZ:ARG:A:9:25.928:42.745:27.485:154.54 CG:ASN:A:13:11.993:44.254:17.455:21.34
NH1:ARG:A:9:25.517:41.519:27.184:170.53 OD1:ASN:A:13:12.063:45.022:18.425:21.1
NH2:ARG:A:9:27.224:43.02:27.49:152.73 ND2:ASN:A:13:11.561:44.637:16.254:22.2
H:ARG:A:9:19.797:45.864:29.977:38.61 H:ASN:A:13:14.577:43.575:18.213:24.35
HA:ARG:A:9:21.763:46.836:28.4:29.53 HA:ASN:A:13:12.523:42.835:19.593:24.25
HB2:ARG:A:9:20.99:44.224:29.027:51.12 HB2:ASN:A:13:12.913:42.598:16.823:25.07
HB3:ARG:A:9:21.399:44.478:27.519:51.12 HB3:ASN:A:13:11.515:42.316:17.522:25.07
HG2:ARG:A:9:23.451:45.382:28.719:106.97 HD21:ASN:A:13:11.343:45.457:16.116:26.64
HG3:ARG:A:9:23.024:44.192:29.688:106.97 HD22:ASN:A:13:11.502:44.063:15.616:26.64
HD2:ARG:A:9:23.41:42.637:28.132:142.34 N:ALA:A:14:14.073:40.323:18.327:20.53
HD3:ARG:A:9:23.263:43.677:26.935:142.34 CA:ALA:A:14:14.136:38.865:18.34:20.59
HE:ARG:A:9:25.375:44.477:27.991:165.51 C:ALA:A:14:14.514:38.344:19.713:20.2
HH11:ARG:A:9:24.676:41.336:27.181:204.63 O:ALA:A:14:13.966:37.34:20.17:20.42
HH12:ARG:A:9:26.09:40.909:26.987:204.63 CB:ALA:A:14:15.122:38.388:17.293:21.21
HH21:ARG:A:9:27.495:43.812:27.686:183.27 H:ALA:A:14:14.689:40.704:17.863:24.64
HH22:ARG:A:9:27.794:42.407:27.293:183.27 HA:ALA:A:14:13.262:38.509:18.115:24.71
N:PRO:A:10:20.315:46.922:26.273:21.04 HB1:ALA:A:14:15.156:37.419:17.311:25.45
CA:PRO:A:10:19.325:46.947:25.186:22.55 HB2:ALA:A:14:14.827:38.694:16.421:25.45
C:PRO:A:10:19.153:45.573:24.54:20.8 HB3:ALA:A:14:15.997:38.755:17.494:25.45
O:PRO:A:10:20.049:44.721:24.565:22.83 N:GLU:A:15:15.393:39.048:20.418:19.99
CB:PRO:A:10:19.914:47.951:24.205:21.44 CA:GLU:A:15:15.674:38.714:21.807:20.64
CG:PRO:A:10:21.394:47.783:24.393:23.92 C:GLU:A:15:14.459:38.979:22.702:20.25
CD:PRO:A:10:21.581:47.551:25.858:26.07 O:GLU:A:15:14.126:38.165:23.56:19.51
HA:PRO:A:10:18.468:47.268:25.508:27.06 CB:GLU:A:15:16.886:39.507:22.306:24.27
HB2:PRO:A:10:19.65:47.726:23.299:25.72 CG:GLU:A:15:18.221:38.954:21.864:52.08
HB3:PRO:A:10:19.633:48.849:24.439:25.72 CD:GLU:A:15:18.481:37.555:22.399:68.3
HG2:PRO:A:10:21.703:47.018:23.883:28.7 OE1:GLU:A:15:18.414:37.354:23.638:37.34
HG3:PRO:A:10:21.853:48.59:24.112:28.7 OE2:GLU:A:15:18.75:36.655:21.573:92.14
HD2:PRO:A:10:22.324:46.948:26.013:31.28 H:GLU:A:15:15.838:39.719:20.116:23.99
HD3:PRO:A:10:21.706:48.395:26.32:31.28 HA:GLU:A:15:15.889:37.77:21.867:24.77
N:CYS:A:11:17.965:45.34:23.974:20.27 HB2:GLU:A:15:16.82:40.416:21.974:29.12
CA:CYS:A:11:17.735:44.077:23.278:20.11 HB3:GLU:A:15:16.876:39.513:23.276:29.12
C:CYS:A:11:17.021:44.299:21.95:20.29 HG2:GLU:A:15:18.242:38.914:20.895:62.5
O:CYS:A:11:16.249:45.24:21.766:20.43 HG3:GLU:A:15:18.927:39.535:22.188:62.5
CB:CYS:A:11:16.959:43.053:24.166:20.58 N:PHE:A:16:13.813:40.136:22.56:19.08
CA:PHE:A:16:12.627:40.394:23.379:19.01 HD2:LYS:A:18:16.284:35.711:23.875:37
C:PHE:A:16:11.58:39.275:23.258:19.21 HD3:LYS:A:18:16.163:34.218:24.424:37
O:PHE:A:16:10.937:38.891:24.259:19.32 HE2:LYS:A:18:16.468:33.491:22.125:45.86
CB:PHE:A:16:12:41.738:23.008:19.12 HE3:LYS:A:18:16.902:34.987:21.8:45.86
CG:PHE:A:16:10.869:42.118:23.896:19.26 HZ1:LYS:A:18:18.739:33.696:22.207:74.4
CD2:PHE:A:16:11.086:42.876:25.025:19.26 HZ2:LYS:A:18:18.658:34.713:23.236:74.4
CD1:PHE:A:16:9.575:41.712:23.603:19.56 HZ3:LYS:A:18:18.263:33.351:23.531:74.4
CE2:PHE:A:16:10.018:43.216:25.879:19.59 N:TYR:A:19:10.88:36.384:25.687:25.2
CE1:PHE:A:16:8.496:42.075:24.429:19.92 CA:TYR:A:19:10.118:36.604:26.923:20.68
CZ:PHE:A:16:8.717:42.809:25.564:19.95 C:TYR:A:19:8.575:36.613:26.737:31.44
H:PHE:A:16:14.029:40.767:22.018:22.9 O:TYR:A:19:7.832:36.875:27.687:29.68
HA:PHE:A:16:12.898:40.443:24.309:22.81 CB:TYR:A:19:10.538:37.943:27.555:20.37
HB2:PHE:A:16:12.677:42.43:23.073:22.95 CG:TYR:A:19:11.901:37.935:28.109:23.44
HB3:PHE:A:16:11.663:41.689:22.1:22.95 CD1:TYR:A:19:13:38.247:27.323:19.93
HD2:PHE:A:16:11.95:43.143:25.241:23.11 CD2:TYR:A:19:12.117:37.579:29.428:23.18
HD1:PHE:A:16:9.416:41.209:22.837:23.48 CE1:TYR:A:19:14.278:38.224:27.859:26.81
HE2:PHE:A:16:10.173:43.731:26.637:23.51 CE2:TYR:A:19:13.379:37.562:29.96:25.73
HE1:PHE:A:16:7.635:41.79:24.221:23.9 CZ:TYR:A:19:14.444:37.864:29.184:22.56
HZ:PHE:A:16:8.006:43.046:26.114:23.93 OH:TYR:A:19:15.682:37.826:29.75:22.36
N:LYS:A:17:11.394:38.736:22.051:19.63 H:TYR:A:19:11.025:37.107:25.245:30.24
CA:LYS:A:17:10.38:37.7:21.84:19.97 HA:TYR:A:19:10.335:35.898:27.552:24.82
C:LYS:A:17:10.614:36.46:22.684:20.12 HB2:TYR:A:19:10.498:38.635:26.876:24.45
O:LYS:A:17:9.672:35.685:22.893:20.5 HB3:TYR:A:19:9.926:38.154:28.276:24.45
CB:LYS:A:17:10.329:37.289:20.362:20.31 HD2:TYR:A:19:11.39:37.364:29.968:27.82
CG:LYS:A:17:9.504:38.28:19.498:20.56 HD1:TYR:A:19:12.877:38.483:26.431:23.92
CD:LYS:A:17:9.689:38.041:18.044:21.03 HE2:TYR:A:19:13.505:37.32:30.849:30.88
CE:LYS:A:17:9.094:39.21:17.228:23.58 HE1:TYR:A:19:15.015:38.436:27.332:32.17
NZ:LYS:A:17:9.222:39.104:15.74:26.95 HH:TYR:A:19:16.265:38.03:29.18:26.83
H:LYS:A:17:11.837:38.95:21.346:23.56 O:CY3:A:20:6.334:34.016:25.918:46.46
HA:LYS:A:17:9.512:38.061:22.078:23.96 C:CY3:A:20:5.968:35.121:26.067:39.6
HB2:LYS:A:17:11.232:37.262:20.009:24.37 N1:CY3:A:20:4.848:35.386:26.929:43.25
HB3:LYS:A:17:9.917:36.413:20.291:24.37 CA:CY3:A:20:6.666:36.307:25.294:21.87
HG2:LYS:A:17:8.562:38.173:19.703:24.67 N:CY3:A:20:8.084:36.362:25.53:22.5
HG3:LYS:A:17:9.788:39.186:19.694:24.67 CB:CY3:A:20:6.382:35.984:23.834:33.52
HD2:LYS:A:17:10.636:37.977:17.845:25.23 SG:CY3:A:20:6.832:37.458:22.767:39.89
HD3:LYS:A:17:9.232:37.224:17.789:25.23 H:CY3:A:20:8.534:36.217:24.812:27
HE2:LYS:A:17:8.148:39.275:17.433:28.3 HA:CY3:A:20:6.252:37.151:25.536:26.25
HE3:LYS:A:17:9.539:40.028:17.499:28.3 HB2:CY3:A:20:5.439:35.782:23.725:40.22
HZ1:LYS:A:17:8.855:39.816:15.352:32.34 HB3:CY3:A:20:6.908:35.214:23.567:40.22
HZ2:LYS:A:17:10.081:39.061:15.511:32.34 HN11:CY3:A:20:4.569:36.191:27.039:51.9
HZ3:LYS:A:17:8.808:38.371:15.451:32.34 HN12:CY3:A:20:4.467:34.738:27.347:51.9
N:LYS:A:18:11.856:36.224:23.113:20.04 O:THR:B:-1:16.056:46.908:9.868:60.34
CA:LYS:A:18:12.144:35.116:24.033:21.82 N:THR:B:-1:16.406:48.652:7.605:78.37
C:LYS:A:18:11.301:35.19:25.294:20.55 CA:THR:B:-1:17.132:48.822:8.897:76.87
O:LYS:A:18:11.071:34.163:25.936:21.12 C:THR:B:-1:17.109:47.519:9.689:64.38
CB:LYS:A:18:13.624:35.111:24.441:20.42 CB:THR:B:-1:16.526:49.975:9.74:82.36
CG:LYS:A:18:14.518:34.782:23.279:20.61 OG1:THR:B:-1:17.021:51.231:9.25:103.8
CD:LYS:A:18:16.015:34.803:23.664:30.84 CG2:THR:B:-1:16.88:49.85:11.228:68.98
CE:LYS:A:18:16.866:34.3:22.484:38.22 H1:THR:B:-1:16.283:49.449:7.227:94.05
NZ:LYS:A:18:18.269:33.984:22.906:62 H2:THR:B:-1:16.886:48.137:7.061:94.05
H:LYS:A:18:12.546:36.685:22.889:24.05 H3:THR:B:-1:15.617:48.27:7.756:94.05
HA:LYS:A:18:11.951:34.276:23.589:26.19 HA:THR:B:-1:18.058:49.042:8.712:92.25
HB2:LYS:A:18:13.867:35.989:24.772:24.51 HB:THR:B:-1:15.56:49.958:9.656:98.83
HB3:LYS:A:18:13.763:34.442:25.13:24.51 HG1:THR:B:-1:16.698:51.862:9.7:124.55
HG2:LYS:A:18:14.303:33.894:22.954:24.73 HG21:THR:B:-1:16.486:50.584:11.725:82.78
HG3:LYS:A:18:14.379:35.436:22.576:24.73 HG22:THR:B:-1:16.538:49.013:11.58:82.78
HG23:THR:B:-1:17.843:49.871:11.343:82.78 HD22:HYP:B:3:21.39:46.89:18.507:36.81
O:PHE:B:0:17.693:47.018:12.852:32.11 HG:HYP:B:3:22.156:45.447:20.171:41.58
N:PHE:B:0:18.285:47.117:10.17:47.49 HD1:HYP:B:3:21.078:43.754:19.188:44.43
CA:PHE:B:0:18.434:45.859:10.88:33.82 HB3:HYP:B:3:20.807:47.074:21.115:55.65
C:PHE:B:0:17.777:45.945:12.258:30.41 HB2:HYP:B:3:20.201:45.739:21.749:55.65
CB:PHE:B:0:19.916:45.528:11.007:28.4 HA:HYP:B:3:18.572:45.748:20.503:36.08
CG:PHE:B:0:20.185:44.138:11.419:27.64 O:LYS:B:4:16.271:47.848:23.837:25.52
CD2:PHE:B:0:20.443:43.17:10.471:29.44 N:LYS:B:4:17.521:47.837:21.368:29.31
CD1:PHE:B:0:20.187:43.786:12.753:26.31 CA:LYS:B:4:16.899:49.058:21.865:26.28
CE2:PHE:B:0:20.706:41.883:10.832:27.99 C:LYS:B:4:16.634:48.93:23.365:21.83
CE1:PHE:B:0:20.454:42.477:13.135:27.97 CB:LYS:B:4:15.558:49.322:21.159:37.68
CZ:PHE:B:0:20.702:41.517:12.166:27.51 CG:LYS:B:4:15.666:49.595:19.675:65.23
H:PHE:B:0:19.016:47.564:10.094:56.99 CD:LYS:B:4:14.269:49.657:19.051:94.75
HA:PHE:B:0:18.003:45.151:10.377:40.59 CE:LYS:B:4:14.289:50.089:17.586:109.16
HB2:PHE:B:0:20.342:45.669:10.147:34.08 NZ:LYS:B:4:12.904:50.217:17.038:105.24
HB3:PHE:B:0:20.311:46.115:11.67:34.08 H:LYS:B:4:17.109:47.124:21.616:35.17
HD1:PHE:B:0:20.017:44.431:13.402:31.58 HA:LYS:B:4:17.488:49.813:21.713:31.53
HD2:PHE:B:0:20.445:43.402:9.571:35.33 HB2:LYS:B:4:14.99:48.544:21.273:45.21
HE1:PHE:B:0:20.45:42.243:14.035:33.57 HB3:LYS:B:4:15.139:50.094:21.569:45.21
HE2:PHE:B:0:20.874:41.244:10.177:33.58 HG2:LYS:B:4:16.106:50.447:19.533:78.28
HZ:PHE:B:0:20.881:40.638:12.411:33.01 HG3:LYS:B:4:16.163:48.879:19.248:78.28
O:ASP:B:1:18.276:43.923:15.696:23.3 HD2:LYS:B:4:13.864:48.777:19.098:113.71
N:ASP:B:1:17.325:44.791:12.774:25.54 HD3:LYS:B:4:13.732:50.296:19.544:113.71
CA:ASP:B:1:16.642:44.721:14.072:24.29 HE2:LYS:B:4:14.727:50.951:17.511:130.99
C:ASP:B:1:17.702:44.882:15.159:25.42 HE3:LYS:B:4:14.763:49.424:17.062:130.99
CB:ASP:B:1:15.863:43.408:14.192:30.99 HZ1:LYS:B:4:12.937:50.469:16.185:126.28
CG:ASP:B:1:15.425:43.112:15.613:33.34 HZ2:LYS:B:4:12.482:49.435:17.093:126.28
OD1:ASP:B:1:15.172:44.1:16.35:26.15 HZ3:LYS:B:4:12.449:50.825:17.502:126.28
OD2:ASP:B:1:15.36:41.913:16.004:22.1 O:HIS:B:5:14.227:50.871:24.963:32.55
H:ASP:B:1:17.404:44.028:12.384:30.65 N:HIS:B:5:16.734:50.025:24.109:21.79
HA:ASP:B:1:16.014:45.456:14.145:29.15 CA:HIS:B:5:16.363:50.01:25.519:21.56
HB2:ASP:B:1:15.068:43.46:13.639:37.19 C:HIS:B:5:14.857:50.045:25.607:22.3
HB3:ASP:B:1:16.427:42.677:13.894:37.19 CB:HIS:B:5:16.937:51.202:26.275:22.18
O:THR:B:2:17.54:46.872:18.038:27.96 CG:HIS:B:5:18.423:51.204:26.337:22.64
N:THR:B:2:18.026:46.134:15.452:24.07 ND1:HIS:B:5:19.131:50.545:27.328:28.27
CA:THR:B:2:19.15:46.377:16.357:23.86 CD2:HIS:B:5:19.345:51.765:25.516:25.48
C:THR:B:2:18.686:46.539:17.782:24.18 CE1:HIS:B:5:20.428:50.699:27.11:22.51
CB:THR:B:2:19.911:47.632:15.99:31.47 NE2:HIS:B:5:20.585:51.441:26.022:28.72
OG1:THR:B:2:18.958:48.664:15.749:33.31 H:HIS:B:5:17.013:50.786:23.824:26.15
CG2:THR:B:2:20.815:47.367:14.758:33.17 HA:HIS:B:5:16.682:49.193:25.934:25.87
H:THR:B:2:17.631:46.838:15.155:28.88 HB2:HIS:B:5:16.657:52.018:25.832:26.62
HA:THR:B:2:19.762:45.626:16.318:28.64 HB3:HIS:B:5:16.601:51.188:27.185:26.62
HB:THR:B:2:20.479:47.889:16.733:37.77 HD1:HIS:B:5:18.784:50.097:27.975:33.93
HG1:THR:B:2:19.352:49.376:15.542:39.98 HD2:HIS:B:5:19.173:52.273:24.756:30.58
HG21:THR:B:2:21.302:48.173:14.525:39.81 HE1:HIS:B:5:21.112:50.36:27.64:27.01
HG22:THR:B:2:21.45:46.662:14.959:39.81 N:ARG:B:6:14.278:49.126:26.37:20.97
HG23:THR:B:2:20.272:47.097:14.001:39.81 CA:ARG:B:6:12.838:49.07:26.561:24.52
C:HYP:B:3:18.617:47.825:20.609:37.46 C:ARG:B:6:12.608:49.07:28.058:21.26
O:HYP:B:3:19.172:48.867:20.197:33.28 O:ARG:B:6:12.884:48.066:28.727:23.78
CA:HYP:B:3:19.208:46.443:20.27:30.06 CB:ARG:B:6:12.238:47.832:25.893:20.69
CB:HYP:B:3:20.366:46.23:20.929:46.37 CG:ARG:B:6:12.796:47.582:24.53:26.83
CG:HYP:B:3:21.21:45.388:19.961:34.65 CD:ARG:B:6:12.576:46.115:24.096:33.07
OD1:HYP:B:3:20.787:44.116:19.901:37.02 NE:ARG:B:6:13.308:45.832:22.86:31.36
CD:HYP:B:3:20.889:46.065:18.606:30.67 CZ:ARG:B:6:12.807:45.919:21.638:20.49
N:HYP:B:3:19.59:46.315:18.708:25.2 NH1:ARG:B:6:11.527:46.276:21.431:28.98
HD23:HYP:B:3:21.065:45.458:17.869:36.81 NH2:ARG:B:6:13.594:45.629:20.612:25.07
H:ARG:B:6:14.707:48.515:26.795:25.17 HA:CGU:B:10:7.113:50.935:25.873:33.66
HA:ARG:B:6:12.425:49.861:26.181:29.42 N:ILE:B:11:8.048:48.421:27.803:25.25
HB2:ARG:B:6:12.426:47.054:26.441:24.83 CA:ILE:B:11:8.061:46.962:27.816:21.79
HB3:ARG:B:6:11.279:47.953:25.806:24.83 C:ILE:B:11:6.652:46.486:27.505:22.28
HG2:ARG:B:6:12.352:48.159:23.89:32.19 O:ILE:B:11:6.473:45.595:26.699:22.22
HG3:ARG:B:6:13.75:47.758:24.537:32.19 CB:ILE:B:11:8.584:46.402:29.14:29.42
HD2:ARG:B:6:12.903:45.519:24.789:39.68 CG1:ILE:B:11:10.089:46.697:29.267:30.34
HD3:ARG:B:6:11.632:45.963:23.935:39.68 CG2:ILE:B:11:8.328:44.893:29.247:23.17
HE:ARG:B:6:14.13:45.591:22.934:37.64 CD1:ILE:B:11:10.655:46.443:30.653:34.61
HH11:ARG:B:6:11.019:46.461:22.1:34.78 H:ILE:B:11:8.15:48.789:28.574:30.3
HH12:ARG:B:6:11.218:46.324:20.63:34.78 HA:ILE:B:11:8.645:46.647:27.108:26.14
HH21:ARG:B:6:14.411:45.398:20.748:30.08 HB:ILE:B:11:8.12:46.846:29.867:35.31
HH22:ARG:B:6:13.289:45.676:19.809:30.08 HG12:ILE:B:11:10.571:46.133:28.643:36.41
N:CYS:B:7:12.125:50.193:28.588:22.09 HG13:ILE:B:11:10.243:47.63:29.052:36.41
CA:CYS:B:7:12.018:50.387:30.028:24.41 HG21:ILE:B:11:8.671:44.574:30.097:27.8
C:CYS:B:7:10.61:50.801:30.421:28.77 HG22:ILE:B:11:7.373:44.731:29.195:27.8
O:CYS:B:7:9.878:51.453:29.662:23.73 HG23:ILE:B:11:8.781:44.443:28.517:27.8
CB:CYS:B:7:12.995:51.452:30.534:25.61 HD11:ILE:B:11:11.603:46.651:30.649:41.53
SG:CYS:B:7:14.735:51.095:30.241:32.01 HD12:ILE:B:11:10.194:47.009:31.291:41.53
H:CYS:B:7:11.851:50.864:28.126:26.51 HD13:ILE:B:11:10.522:45.509:30.881:41.53
HA:CYS:B:7:12.223:49.551:30.477:29.3 N:THR:B:12:5.642:47.089:28.133:23.69
HB2:CYS:B:7:12.791:52.292:30.093:30.73 CA:THR:B:12:4.264:46.785:27.759:27.8
HB3:CYS:B:7:12.874:51.551:31.492:30.73 C:THR:B:12:4.051:46.987:26.269:32.55
N:GLY:B:8:10.246:50.438:31.636:23.3 O:THR:B:12:3.511:46.106:25.58:24.04
CA:GLY:B:8:8.941:50.832:32.13:24.26 CB:THR:B:12:3.292:47.642:28.58:33.52
C:GLY:B:8:7.848:50.372:31.198:27.94 OG1:THR:B:12:3.502:47.358:29.965:32.74
O:GLY:B:8:7.783:49.176:30.882:23.54 CG2:THR:B:12:1.803:47.355:28.223:26.3
H:GLY:B:8:10.721:49.975:32.183:27.96 H:THR:B:12:5.726:47.666:28.765:28.42
HA2:GLY:B:8:8.79:50.442:33.005:29.11 HA:THR:B:12:4.082:45.854:27.961:33.36
HA3:GLY:B:8:8.898:51.798:32.208:29.11 HB:THR:B:12:3.471:48.581:28.413:40.23
N:SER:B:9:7.045:51.325:30.7:25.43 HG1:THR:B:12:2.979:47.818:30.435:39.29
CA:SER:B:9:5.825:51.05:29.947:28.83 HG21:THR:B:12:1.22:47.914:28.761:31.56
C:SER:B:9:6.127:50.664:28.495:33.52 HG22:THR:B:12:1.644:47.545:27.285:31.56
O:SER:B:9:5.335:49.988:27.841:31.13 HG23:THR:B:12:1.594:46.424:28.396:31.56
CB:SER:B:9:4.914:52.281:29.986:36.62 N:ASN:B:13:4.453:48.15:25.751:24.48
OG:SER:B:9:5.573:53.396:29.4:36.38 CA:ASN:B:13:4.301:48.421:24.321:25.34
H:SER:B:9:7.199:52.166:30.793:30.51 C:ASN:B:13:4.974:47.353:23.491:24.13
HA:SER:B:9:5.353:50.312:30.364:34.6 O:ASN:B:13:4.462:46.938:22.439:24.08
HB2:SER:B:9:4.104:52.092:29.487:43.94 CB:ASN:B:13:4.909:49.776:23.943:28.08
HB3:SER:B:9:4.698:52.488:30.908:43.94 CG:ASN:B:13:4.169:50.958:24.563:43.36
HG:SER:B:9:6.279:53.567:29.822:43.65 OD1:ASN:B:13:3.038:50.826:25.036:39.56
N:CGU:B:10:7.28:51.095:27.998:28.76 ND2:ASN:B:13:4.82:52.121:24.575:47.85
CA:CGU:B:10:7.81:50.596:26.686:28.05 H:ASN:B:13:4.812:48.79:26.2:29.38
CB:CGU:B:10:9.239:51.112:26.411:26.61 HA:ASN:B:13:3.358:48.435:24.096:30.41
CG:CGU:B:10:9.333:52.622:26.497:48.2 HB2:ASN:B:13:5.829:49.805:24.249:33.69
CD1:CGU:B:10:10.796:53.099:26.309:43.27 HB3:ASN:B:13:4.877:49.876:22.978:33.69
OE12:CGU:B:10:10.997:54.252:25.818:32.43 HD21:ASN:B:13:4.449:52.819:24.913:57.42
OE11:CGU:B:10:11.775:52.332:26.659:28.59 HD22:ASN:B:13:5.612:52.174:24.244:57.42
CD2:CGU:B:10:8.34:53.322:25.507:61.07 N:SER:B:14:6.155:46.925:23.918:22.66
OE21:CGU:B:10:8.191:52.877:24.334:60.62 CA:SER:B:14:6.85:45.888:23.173:21.94
OE22:CGU:B:10:7.665:54.311:25.914:71.45 C:SER:B:14:6.098:44.563:23.22:21.87
C:CGU:B:10:7.888:49.082:26.653:29.41 O:SER:B:14:5.951:43.898:22.193:22.01
O:CGU:B:10:7.883:48.518:25.527:30.26 CB:SER:B:14:8.269:45.758:23.696:25.08
HG:CGU:B:10:9.037:52.937:27.534:57.84 OG:SER:B:14:9.033:46.775:23.091:24.01
HB3:CGU:B:10:9.957:50.658:27.117:31.93 H:SER:B:14:6.565:47.209:24.618:27.19
HB2:CGU:B:10:9.575:50.754:25.412:31.93 HA:SER:B:14:6.904:46.16:22.243:26.33
HB2:SER:B:14:8.274:45.874:24.659:30.1 O:LEU:B:18:5.69:39.304:19.308:24.4
HB3:SER:B:14:8.63:44.891:23.453:30.1 CB:LEU:B:18:7.286:41.806:20.002:22.61
HG:SER:B:14:9.828:46.735:23.358:28.81 CG:LEU:B:18:7.948:43.093:19.512:21.46
N:TYR:B:15:5.629:44.144:24.4:21.89 CD1:LEU:B:18:9.302:43.234:20.155:24.5
CA:TYR:B:15:4.768:42.964:24.454:22.13 CD2:LEU:B:18:8.07:43.128:17.966:28.1
C:TYR:B:15:3.609:43.089:23.478:22.94 H:LEU:B:18:5.088:43.006:20.671:27.19
O:TYR:B:15:3.283:42.128:22.771:23.01 HA:LEU:B:18:5.935:41.652:18.476:26.53
CB:TYR:B:15:4.193:42.718:25.852:22.53 HB2:LEU:B:18:7.249:41.832:20.971:27.14
CG:TYR:B:15:5.115:42.148:26.893:22.01 HB3:LEU:B:18:7.823:41.055:19.704:27.14
CD1:TYR:B:15:5.679:40.889:26.759:37.65 HG:LEU:B:18:7.407:43.85:19.787:25.75
CD2:TYR:B:15:5.351:42.85:28.079:48.14 HD11:LEU:B:18:9.717:44.053:19.841:29.4
CE1:TYR:B:15:6.507:40.375:27.747:62.89 HD12:LEU:B:18:9.192:43.267:21.118:29.4
CE2:TYR:B:15:6.165:42.337:29.078:52.81 HD13:LEU:B:18:9.848:42.471:19.91:29.4
CZ:TYR:B:15:6.748:41.113:28.908:60.82 HD21:LEU:B:18:8.494:43.959:17.702:33.72
OH:TYR:B:15:7.558:40.637:29.929:54.81 HD22:LEU:B:18:8.607:42.375:17.675:33.72
H:TYR:B:15:5.789:44.514:25.16:26.27 HD23:LEU:B:18:7.183:43.071:17.578:33.72
HA:TYR:B:15:5.287:42.185:24.203:26.56 N:CYS:B:19:4.481:40.285:20.912:26.31
HB2:TYR:B:15:3.868:43.565:26.195:27.03 CA:CYS:B:19:4.018:39.001:21.403:23.2
HB3:TYR:B:15:3.448:42.104:25.765:27.03 C:CYS:B:19:2.573:38.743:21.045:29.55
HD1:TYR:B:15:5.53:40.399:25.982:45.19 O:CYS:B:19:2.204:37.602:20.812:27.61
HD2:TYR:B:15:4.966:43.688:28.195:57.77 CB:CYS:B:19:4.199:38.915:22.923:25.28
HE1:TYR:B:15:6.897:39.537:27.638:75.46 SG:CYS:B:19:5.942:39.114:23.454:33.59
HE2:TYR:B:15:6.328:42.83:29.85:63.37 H:CYS:B:19:4.174:40.968:21.334:31.58
HH:TYR:B:15:7.597:41.201:30.551:65.77 HA:CYS:B:19:4.553:38.3:21:27.84
N:MET:B:16:2.967:44.255:23.427:23.8 HB2:CYS:B:19:3.677:39.617:23.342:30.34
CA:MET:B:16:1.803:44.423:22.544:28.09 HB3:CYS:B:19:3.891:38.047:23.227:30.34
C:MET:B:16:2.197:44.329:21.083:26.65 O:TYR:B:20:0.453:40.693:18.643:58.94
O:MET:B:16:1.471:43.748:20.275:36.43 N:TYR:B:20:1.763:39.78:20.969:30.88
CB:MET:B:16:1.118:45.775:22.768:30.99 CA:TYR:B:20:0.33:39.624:20.788:36.15
CG:MET:B:16:0.499:45.977:24.148:42.33 C:TYR:B:20:-0.046:39.793:19.324:48.21
SD:MET:B:16:-0.954:44.949:24.404:68.25 CB:TYR:B:20:-0.44:40.615:21.651:29.56
CE:MET:B:16:-0.206:43.67:25.426:42.42 CG:TYR:B:20:-0.651:40.122:23.065:55.92
H:MET:B:16:3.177:44.953:23.883:28.56 CD1:TYR:B:20:-1.597:39.145:23.34:71.32
HA:MET:B:16:1.157:43.726:22.74:33.71 CD2:TYR:B:20:0.094:40.621:24.118:59.77
HB2:MET:B:16:1.775:46.477:22.639:37.19 CE1:TYR:B:20:-1.8:38.679:24.621:72.77
HB3:MET:B:16:0.409:45.872:22.114:37.19 CE2:TYR:B:20:-0.098:40.161:25.411:73.5
HG2:MET:B:16:1.153:45.748:24.826:50.79 CZ:TYR:B:20:-1.058:39.185:25.659:74.33
HG3:MET:B:16:0.232:46.905:24.243:50.79 OH:TYR:B:20:-1.29:38.681:26.931:61.61
HE1:MET:B:16:-0.883:43.016:25.657:50.9 H:TYR:B:20:2.02:40.599:21.019:37.06
HE2:MET:B:16:0.508:43.245:24.926:50.9 HA:TYR:B:20:0.074:38.728:21.06:43.38
HE3:MET:B:16:0.15:44.078:26.231:50.9 HB2:TYR:B:20:0.055:41.447:21.695:35.48
N:ASP:B:17:3.344:44.897:20.719:29.89 HB3:TYR:B:20:-1.312:40.768:21.254:35.48
CA:ASP:B:17:3.758:44.849:19.322:34.2 HD1:TYR:B:20:-2.106:38.797:22.644:85.58
C:ASP:B:17:4.362:43.503:18.931:30.46 HD2:TYR:B:20:0.735:41.274:23.956:71.73
O:ASP:B:17:4.224:43.095:17.781:28.87 HE1:TYR:B:20:-2.44:38.024:24.782:87.33
CB:ASP:B:17:4.769:45.957:19.002:41.66 HE2:TYR:B:20:0.409:40.506:26.11:88.2
CG:ASP:B:17:4.239:47.347:19.307:78.78 HH:TYR:B:20:-0.786:39.058:27.486:73.93
OD1:ASP:B:17:3:47.511:19.374:89.35 O:ARG:B:21:-2.375:41.115:18.001:73.56
OD2:ASP:B:17:5.064:48.277:19.478:72.64 N:ARG:B:21:-0.904:38.888:18.851:48.09
H:ASP:B:17:3.887:45.306:21.246:35.87 CA:ARG:B:21:-1.505:38.926:17.518:54.86
HA:ASP:B:17:2.978:44.994:18.764:41.04 C:ARG:B:21:-2.02:40.306:17.132:67.04
HB2:ASP:B:17:5.569:45.818:19.534:49.99 CB:ARG:B:21:-2.675:37.949:17.448:56.88
HB3:ASP:B:17:4.989:45.92:18.058:49.99 CG:ARG:B:21:-2.44:36.661:16.704:66.58
N:LEU:B:18:5.062:42.822:19.831:22.66 CD:ARG:B:21:-3.714:36.364:15.961:87.2
CA:LEU:B:18:5.868:41.669:19.444:22.1 NE:ARG:B:21:-3.936:34.955:15.663:104.85
C:LEU:B:18:5.314:40.329:19.885:22.41 CZ:ARG:B:21:-5.147:34.397:15.619:123.5
NH1:ARG:B:21:-6.239:35.129:15.88:136.71 HE:ARG:B:21:-3.252:34.434:15.65:125.82
NH2:ARG:B:21:-5.27:33.101:15.333:115.82 HH11:ARG:B:21:-6.162:35.965:16.065:164.05
OXT:ARG:B:21:-2.121:40.62:15.939:68.47 HH12:ARG:B:21:-7.017:34.764:15.855:164.05
H:ARG:B:21:-1.164:38.209:19.309:57.7 HH21:ARG:B:21:-4.571:32.628:15.17:138.98
HA:ARG:B:21:-0.844:38.656:16.862:65.83 HH22:ARG:B:21:-6.049:32.738:15.311:138.98
HB2:ARG:B:21:-2.926:37.714:18.355:68.25
HB3:ARG:B:21:-3.418:38.398:17.016:68.25
HG2:ARG:B:21:-1.716:36.768:16.067:79.9
HG3:ARG:B:21:-2.258:35.94:17.327:79.9
HD2:ARG:B:21:-4.462:36.672:16.495:104.64
HD3:ARG:B:21:-3.698:36.842:15.117:104.64
APPENDIX II: ATOMIC COORDINATES FOR MODEL OF
CON-INS G1 INSULIN (CHAINS A AND B) BOUND TO INSULIN RECEPTOR
(CHAINS E, F AND X)
Format comprises the following colon-separated fields: (1) atom name, (2) residue type, (3) chain name, (4) residue number, (5-7) xyz coordinates.
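Records in this seven-field format (no B-factor) can be indexed by chain, residue number and atom name, after which interatomic distances follow directly from the xyz coordinates. The following Python sketch is illustrative only: `parse_model_records` and `distance` are hypothetical names, records are assumed to be whitespace-separated with no internal spaces, and distances are returned in the same units as the coordinates (presumably ångströms, which the format line does not state).

```python
import math
from typing import Dict, Tuple

AtomKey = Tuple[str, int, str]          # (chain name, residue number, atom name)
Coordinates = Tuple[float, float, float]


def parse_model_records(text: str) -> Dict[AtomKey, Coordinates]:
    """Index seven-field records by (chain, residue number, atom name)."""
    atoms: Dict[AtomKey, Coordinates] = {}
    for token in text.split():
        fields = token.split(":")
        if len(fields) != 7:
            raise ValueError(f"expected 7 colon-separated fields, got {len(fields)}: {token!r}")
        name, _residue, chain, number, x, y, z = fields
        atoms[(chain, int(number), name)] = (float(x), float(y), float(z))
    return atoms


def distance(a: Coordinates, b: Coordinates) -> float:
    """Euclidean distance between two coordinate triples."""
    return math.dist(a, b)


# Usage: separation of the sulfur atoms of Cys A6 and Cys A11, using two records from this appendix.
model = parse_model_records("SG:CYS:A:6:52.15:68.87:51.77 SG:CYS:A:11:53.26:70.5:51.18")
sg_separation = distance(model[("A", 6, "SG")], model[("A", 11, "SG")])
```

Keying on (chain, residue number, atom name) rather than on residue type keeps lookups independent of force-field-specific residue names such as HSP.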
N:GLY:A:1:50.25:62.37:57.42 CB:HSP:A:5:52.31:67.31:55.81
CA:GLY:A:1:50.21:62.09:55.93 CD2:HSP:A:5:54.53:67.05:57.13
C:GLY:A:1:50.37:63.37:55.13 CG:HSP:A:5:53.68:67.72:56.33
O:GLY:A:1:50.44:64.47:55.71 NE2:HSP:A:5:55.52:67.95:57.54
N:VAL:A:2:50.36:63.33:53.84 ND1:HSP:A:5:54.12:69.02:56.28
CA:VAL:A:2:50.39:64.45:52.97 CE1:HSP:A:5:55.24:69.11:57.01
CB:VAL:A:2:50.68:64.02:51.52 C:HSP:A:5:50.95:69.36:55.26
CG1:VAL:A:2:49.47:63.39:50.84 O:HSP:A:5:51.53:70.39:55.44
HG11:VAL:A:2:49.15:62.42:51.28 N:CYS:A:6:50.13:69.07:54.23
HG12:VAL:A:2:48.57:64.02:50.95 CA:CYS:A:6:50.12:70.02:53.1
HG13:VAL:A:2:49.67:63.37:49.74 CB:CYS:A:6:50.43:69.37:51.79
CG2:VAL:A:2:51.09:65.27:50.71 SG:CYS:A:6:52.15:68.87:51.77
HG21:VAL:A:2:51.18:64.92:49.66 C:CYS:A:6:48.76:70.68:52.84
HG22:VAL:A:2:50.29:66.03:50.8 O:CYS:A:6:48.67:71.65:52.16
HG23:VAL:A:2:52.02:65.72:51.11 N:CYS:A:7:47.63:70.09:53.34
C:VAL:A:2:49.28:65.39:53.13 CA:CYS:A:7:46.24:70.54:53.04
O:VAL:A:2:49.47:66.66:53.13 CB:CYS:A:7:45.1:69.69:53.59
N:VAL:A:3:48.03:64.83:53.43 SG:CYS:A:7:43.48:69.8:52.7
CA:VAL:A:3:46.92:65.71:53.69 C:CYS:A:7:45.9:71.97:53.55
CB:VAL:A:3:45.62:64.98:53.83 O:CYS:A:7:45.19:72.7:52.82
CG1:VAL:A:3:44.49:66.01:54.29 N:HIS:A:8:46.4:72.37:54.71
HG11:VAL:A:3:43.5:65.55:54.43 CA:HIS:A:8:45.79:73.56:55.37
HG12:VAL:A:3:44.78:66.38:55.31 CB:HIS:A:8:45.46:73.16:56.86
HG13:VAL:A:3:44.46:66.82:53.53 ND1:HIS:A:8:43.21:72.06:57
CG2:VAL:A:3:45.4:64.44:52.42 CG:HIS:A:8:44.57:71.94:56.89
HG21:VAL:A:3:45.44:65.23:51.65 CE1:HIS:A:8:42.72:70.85:57.06
HG22:VAL:A:3:46.13:63.62:52.26 NE2:HIS:A:8:43.69:69.92:56.99
HG23:VAL:A:3:44.4:63.97:52.44 CD2:HIS:A:8:44.85:70.62:56.98
C:VAL:A:3:47.13:66.53:54.97 C:HIS:A:8:46.71:74.82:55.23
O:VAL:A:3:47.09:67.77:54.94 O:HIS:A:8:46.22:75.95:55.27
N:CGU:A:4:47.56:65.91:56.11 N:ARG:A:9:48:74.57:55.05
CA:CGU:A:4:47.83:66.64:57.38 CA:ARG:A:9:49.01:75.52:54.99
CB:CGU:A:4:48.2:65.74:58.59 CB:ARG:A:9:49.72:75.72:56.41
CG:CGU:A:4:47.15:64.64:59.01 CG:ARG:A:9:48.86:76.27:57.56
CD1:CGU:A:4:46.83:63.56:58.09 CD:ARG:A:9:48.32:77.67:57.25
OE11:CGU:A:4:45.67:63.06:58.15 NE:ARG:A:9:49.56:78.48:56.93
OE12:CGU:A:4:47.69:63.15:57.27 CZ:ARG:A:9:49.65:79.62:56.2
CD2:CGU:A:4:47.66:63.83:60.16 NH1:ARG:A:9:48.56:80.19:55.69
OE21:CGU:A:4:48.79:63.44:60.04 NH2:ARG:A:9:50.8:80.13:55.88
OE22:CGU:A:4:47.02:63.74:61.23 C:ARG:A:9:50.08:74.83:54.07
C:CGU:A:4:48.94:67.66:57.17 O:ARG:A:9:50.03:73.62:54.02
O:CGU:A:4:48.88:68.76:57.7 N:PRO:A:10:50.96:75.61:53.49
N:HSP:A:5:49.99:67.35:56.35 CD:PRO:A:10:50.86:77.07:53.54
CA:HSP:A:5:51.16:68.26:56.32 CA:PRO:A:10:51.91:75.19:52.5
CB:PRO:A:10:52.56:76.52:52.07 CG:LYS:A:17:62.75:64.81:48.61
CG:PRO:A:10:51.54:77.6:52.31 CD:LYS:A:17:62.61:64.47:47.08
C:PRO:A:10:52.88:74.29:53.1 CE:LYS:A:17:63.89:63.73:46.59
O:PRO:A:10:53.1:74.37:54.31 NZ:LYS:A:17:63.71:63.37:45.1
N:CYS:A:11:53.41:73.39:52.3 C:LYS:A:17:60.1:63.81:49.85
CA:CYS:A:11:54.53:72.61:52.57 O:LYS:A:17:60.47:62.72:49.41
CB:CYS:A:11:54.09:71.06:52.69 N:LYS:A:18:59.51:63.87:51.09
SG:CYS:A:11:53.26:70.5:51.18 CA:LYS:A:18:59.36:62.7:51.98
C:CYS:A:11:55.53:72.94:51.45 CB:LYS:A:18:58.96:63.08:53.42
O:CYS:A:11:55.12:73.21:50.35 CD:LYS:A:18:61.31:63.29:54.45
N:SER:A:12:56.8:72.88:51.78 CD:LYS:A:18:61.31:63.29:54.45
CA:SER:A:12:57.88:73.04:50.78 CE:LYS:A:18:62.48:64.29:54.85
CB:SER:A:12:59.21:73.42:51.47 NZ:LYS:A:18:63.76:63.61:54.99
OG:SER:A:12:59.57:72.46:52.46 C:LYS:A:18:58.23:61.74:51.51
C:SER:A:12:58.11:71.8:49.87 O:LYS:A:18:58.08:60.61:51.89
O:SER:A:12:57.59:70.71:50.1 N:TYR:A:19:57.3:62.26:50.64
N:ASN:A:13:58.93:71.94:48.84 CA:TYR:A:19:56.22:61.46:50.05
CA:ASN:A:13:59.38:70.76:48.15 CB:TYR:A:19:54.96:62.33:49.8
CB:ASN:A:13:60.35:71.19:46.96 CG:TYR:A:19:54.68:63.01:51.05
CG:ASN:A:13:59.8:71.95:45.78 CD1:TYR:A:19:54.62:64.38:51.09
OD1:ASN:A:13:60.32:72.95:45.28 CE1:TYR:A:19:54.43:65.14:52.32
ND2:ASN:A:13:58.63:71.47:45.27 CZ:TYR:A:19:54.13:64.37:53.45
C:ASN:A:13:60.16:69.73:49.01 OH:TYR:A:19:53.83:65.03:54.66
O:ASN:A:13:60.08:68.53:48.85 CD2:TYR:A:19:54.35:62.29:52.2
N:ALA:A:14:61.02:70.12:49.92 CE2:TYR:A:19:54.07:62.94:53.42
CA:ALA:A:14:61.68:69.29:50.94 C:TYR:A:19:56.57:60.84:48.73
CB:ALA:A:14:62.6:70.18:51.71 O:TYR:A:19:55.82:60.08:48.13
C:ALA:A:14:60.74:68.46:51.87 N:CYS:A:20:57.76:61.14:48.23
O:ALA:A:14:61.02:67.28:52.14 CA:CYS:A:20:58.16:60.66:46.93
N:GLU:A:15:59.62:69.04:52.35 CB:CYS:A:20:59.26:61.58:46.16
CA:GLU:A:15:58.47:68.37:52.89 SG:CYS:A:20:58.71:63.23:45.66
CB:GLU:A:15:57.49:69.39:53.57 C:CYS:A:20:58.91:59.31:47.13
CG:GLU:A:15:58.09:70.2:54.8 NT:CYS:A:20:58.67:58.33:46.22
CD:GLU:A:15:57.33:71.55:54.96 O:CYS:A:20:59.61:59.11:48.11
OE1:GLU:A:15:57.68:72.55:54.29 N:THR:B:-1:64.69:74.65:39.71
OE2:GLU:A:15:56.54:71.67:55.93 CA:THR:B:-1:65.2:73.71:40.76
C:GLU:A:15:57.71:67.43:51.92 CB:THR:B:-1:66.52:74.1:41.37
O:GLU:A:15:57.42:66.29:52.27 OG1:THR:B:-1:67.48:74.53:40.42
N:PHE:A:16:57.55:67.8:50.61 CG2:THR:B:-1:67.16:72.85:42.06
CA:PHE:A:16:56.93:66.92:49.6 HG21:THR:B:-1:67.99:73.25:42.68
CB:PHE:A:16:56.65:67.71:48.26 HG22:THR:B:-1:66.33:72.49:42.7
CG:PHE:A:16:55.52:67.12:47.54 HG23:THR:B:-1:67.4:72.02:41.37
CD1:PHE:A:16:54.3:67.71:47.66 C:THR:B:-1:64.12:73.54:41.84
CE1:PHE:A:16:53.2:67.06:47.12 O:THR:B:-1:63.46:72.51:41.95
CZ:PHE:A:16:53.35:65.96:46.22 N:PHE:B:0:63.74:74.7:42.52
CD2:PHE:A:16:55.72:66.01:46.68 CA:PHE:B:0:62.74:74.78:43.62
CE2:PHE:A:16:54.6:65.4:46.07 CB:PHE:B:0:63.39:75.47:44.88
C:PHE:A:16:57.92:65.77:49.39 CG:PHE:B:0:64.56:74.7:45.18
O:PHE:A:16:57.51:64.65:48.97 CD1:PHE:B:0:65.84:75.3:45.05
N:LYS:A:17:59.2:66.01:49.58 CE1:PHE:B:0:67:74.52:45.26
CA:LYS:A:17:60.17:65.1:49.09 CZ:PHE:B:0:66.87:73.25:45.9
CB:LYS:A:17:61.57:65.7:49.13 CD2:PHE:B:0:64.44:73.34:45.64
CE2:PHE:B:0:65.63:72.67:46.03 CD:ARG:B:6:51.26:71.53:46.5
C:PHE:B:0:61.59:75.64:43.29 NE:ARG:B:6:52.72:71.85:46.15
O:PHE:B:0:61.64:76.7:42.61 CZ:ARG:B:6:53.08:72.5:45.03
N:ASP:B:1:60.4:75.21:43.77 NH1:ARG:B:6:52.31:72.87:44
CA:ASP:B:1:59.15:75.98:43.64 NH2:ARG:B:6:54.36:72.88:44.97
CB:ASP:B:1:58.08:74.84:43.54 C:ARG:B:6:47.65:69.78:48.71
CG:ASP:B:1:58.26:73.88:42.4 O:ARG:B:6:47.82:69.06:49.68
OD1:ASP:B:1:58.25:74.2:41.23 N:CYS:B:7:46.44:69.85:48.2
OD2:ASP:B:1:58.29:72.65:42.73 CA:CYS:B:7:45.34:69.09:48.69
C:ASP:B:1:59:76.84:44.9 CB:CYS:B:7:44.48:69.99:49.69
O:ASP:B:1:59.69:76.66:45.91 SG:CYS:B:7:43.61:68.9:50.86
N:THR:B:2:57.96:77.74:44.9 C:CYS:B:7:44.51:68.71:47.4
CA:THR:B:2:57.54:78.49:46.12 O:CYS:B:7:44.7:69.41:46.41
CB:THR:B:2:56.98:79.95:45.82 N:GLY:B:8:43.6:67.71:47.42
OG1:THR:B:2:55.81:80.02:44.98 CA:GLY:B:8:42.92:67.32:46.24
CG2:THR:B:2:58.1:80.75:45.16 C:GLY:B:8:43.79:66.93:45.05
HG21:THR:B:2:59:80.68:45.8 O:GLY:B:8:44.82:66.24:45.2
HG22:THR:B:2:58.33:80.32:44.16 N:SER:B:9:43.35:67.38:43.85
HG23:THR:B:2:57.63:81.75:45.05 CA:SER:B:9:44.03:67.09:42.55
C:THR:B:2:56.44:77.62:46.85 CB:SER:B:9:43.2:67.82:41.43
O:THR:B:2:55.58:77.05:46.16 OG:SER:B:9:43.07:69.22:41.72
N:HYP:B:3:56.41:77.52:48.18 C:SER:B:9:45.53:67.5:42.52
CA:HYP:B:3:55.62:76.6:48.99 O:SER:B:9:46.42:66.82:41.94
CB:HYP:B:3:56:76.88:50.44 N:CGU:B:10:45.83:68.72:43.12
CG:HYP:B:3:57.39:77.4:50.38 CA:CGU:B:10:47.09:69.37:43.2
OD1:HYP:B:3:58.2:76.26:50.04 CB:CGU:B:10:46.95:70.75:43.85
CD:HYP:B:3:57.32:78.25:49.04 CG:CGU:B:10:46.26:71.75:42.93
C:HYP:B:3:54.14:76.75:48.68 CD1:CGU:B:10:46:73.11:43.47
O:HYP:B:3:53.65:77.85:48.47 OE11:CGU:B:10:46.6:73.47:44.54
N:LYS:B:4:53.39:75.63:48.59 OE12:CGU:B:10:45.19:73.92:42.89
CA:LYS:B:4:51.96:75.7:48.3 CD2:CGU:B:10:47.12:72.03:41.77
CB:LYS:B:4:51.68:75.31:46.8 OE21:CGU:B:10:48.28:72.52:41.89
CG:LYS:B:4:52.23:76.3:45.71 OE22:CGU:B:10:46.68:71.76:40.62
CD:LYS:B:4:51.28:77.47:45.48 C:CGU:B:10:48.15:68.65:43.91
CE:LYS:B:4:51.75:78.6:44.56 O:CGU:B:10:49.32:68.74:43.58
NZ:LYS:B:4:51.76:78.34:43.11 N:ILE:B:11:47.8:67.7:44.92
C:LYS:B:4:51.33:74.75:49.05 CA:ILE:B:11:48.8:66.72:45.33
O:LYS:B:4:51.97:73.79:49.52 CB:ILE:B:11:48.41:65.79:46.46
N:HIS:B:5:50.05:74.91:49.33 CG2:ILE:B:11:49.6:65.08:47
CA:HIS:B:5:49.21:73.89:49.97 HG21:ILE:B:11:50.09:64.42:46.24
CB:HIS:B:5:48.13:74.62:50.83 HG22:ILE:B:11:50.34:65.69:47.54
ND1:HIS:B:5:47.21:76.49:49.4 HG23:ILE:B:11:49.29:64.33:47.77
CG:HIS:B:5:47.02:75.21:49.99 CG1:ILE:B:11:47.56:66.48:47.58
CE1:HIS:B:5:46.03:76.76:48.79 HG11:ILE:B:11:48.27:67.03:48.23
NE2:HIS:B:5:45.14:75.76:48.88 HG12:ILE:B:11:46.86:67.21:47.12
CD2:HIS:B:5:45.71:74.81:49.71 CD:ILE:B:11:46.86:65.41:48.35
C:HIS:B:5:48.62:72.89:49.05 C:ILE:B:11:49.34:65.88:44.19
O:HIS:B:5:47.89:73.33:48.15 O:ILE:B:11:50.56:65.65:44.14
N:ARG:B:6:49.14:71.68:49.06 N:THR:B:12:48.53:65.33:43.22
CA:ARG:B:6:48.7:70.67:48.09 CA:THR:B:12:48.88:64.49:42.06
CB:ARG:B:6:49.87:69.72:47.69 CB:THR:B:12:47.73:63.83:41.3
CG:ARG:B:6:51.23:70.54:47.59 OG1:THR:B:12:47.03:62.84:42.01
CG2:THR:B:12:48.36:63.09:40.05 CD1:LEU:B:18:59.8:66.71:45.38
HG21:THR:B:12:48.85:63.73:39.29 CD2:LEU:B:18:60.5:68.79:44.05
HG22:THR:B:12:49.04:62.32:40.48 C:LEU:B:18:59.26:65.87:42.26
HG23:THR:B:12:47.63:62.42:39.55 O:LEU:B:18:60.49:65.71:42.29
C:THR:B:12:49.79:65.29:41.11 N:CYS:B:19:58.41:64.8:42.21
O:THR:B:12:50.84:64.87:40.6 CA:CYS:B:19:58.9:63.42:42.29
N:ASN:B:13:49.45:66.58:40.89 CB:CYS:B:19:57.92:62.32:42.81
CA:ASN:B:13:50.11:67.48:39.98 SG:CYS:B:19:57.19:62.86:44.35
CB:ASN:B:13:49.2:68.71:39.88 C:CYS:B:19:59.72:62.92:41.1
CG:ASN:B:13:47.88:68.42:39.08 O:CYS:B:19:60.62:62.13:41.22
OD1:ASN:B:13:47.81:67.41:38.44 N:TYR:B:20:59.48:63.5:39.89
ND2:ASN:B:13:46.85:69.37:39.23 CA:TYR:B:20:60.21:63.21:38.64
C:ASN:B:13:51.5:67.78:40.54 CB:TYR:B:20:59.23:63.06:37.55
O:ASN:B:13:52.46:67.76:39.78 CG:TYR:B:20:58.26:61.89:37.68
N:SER:B:14:51.66:68:41.88 CD1:TYR:B:20:58.46:60.68:36.92
CA:SER:B:14:53.01:68.18:42.46 CE1:TYR:B:20:57.53:59.62:37.09
CB:SER:B:14:52.89:68.82:43.85 CZ:TYR:B:20:56.54:59.69:38.03
OG:SER:B:14:54.2:69.12:44.45 OH:TYR:B:20:55.4:58.82:37.99
C:SER:B:14:53.68:66.9:42.61 CD2:TYR:B:20:57.1:61.98:38.49
O:SER:B:14:54.89:66.86:42.55 CE2:TYR:B:20:56.33:60.85:38.81
N:TYR:B:15:52.99:65.72:42.68 C:TYR:B:20:61.19:64.32:38.26
CA:TYR:B:15:53.58:64.41:42.53 O:TYR:B:20:61.74:64.35:37.15
CB:TYR:B:15:52.71:63.21:43.15 N:ARG:B:21:61.42:65.28:39.1
CG:TYR:B:15:53.26:61.84:42.94 CA:ARG:B:21:62.44:66.31:39.01
CD1:TYR:B:15:53.92:61.2:44.03 CB:ARG:B:21:62:67.59:39.64
CE1:TYR:B:15:54.48:59.89:43.86 CG:ARG:B:21:62.87:68.85:39.35
CZ:TYR:B:15:54.47:59.29:42.59 CD:ARG:B:21:62.45:70.08:40.08
OH:TYR:B:15:54.87:57.98:42.46 NE:ARG:B:21:60.98:70.45:39.88
CD2:TYR:B:15:53.16:61.18:41.71 CZ:ARG:B:21:60.21:71.04:40.75
CE2:TYR:B:15:53.74:59.87:41.56 NH1:ARG:B:21:60.62:71.6:41.81
C:TYR:B:15:54.27:64.21:41.11 NH2:ARG:B:21:58.96:71.22:40.38
O:TYR:B:15:55.36:63.71:41 C:ARG:B:21:63.71:65.92:39.82
N:MET:B:16:53.69:64.59:39.93 OT1:ARG:B:21:64.84:66.14:39.3
CA:MET:B:16:54.25:64.42:38.62 OT2:ARG:B:21:63.54:65.45:40.89
CB:MET:B:16:53.21:64.31:37.49 N:GLY:E:5:44.17:26.88:38.71
CG:MET:B:16:52.44:62.94:37.6 CA:GLY:E:5:44.62:28.14:37.98
SD:MET:B:16:51.33:62.6:36.24 C:GLY:E:5:45.67:29.03:38.58
CE:MET:B:16:50.3:64.09:36.2 O:GLY:E:5:45.77:29.25:39.8
C:MET:B:16:55.3:65.43:38.3 N:GLU:E:6:46.45:29.68:37.73
O:MET:B:16:56.12:65.26:37.42 CA:GLU:E:6:47.55:30.45:38.02
N:ASP:B:17:55.4:66.49:39.16 CB:GLU:E:6:48.31:30.93:36.77
CA:ASP:B:17:56.33:67.56:39.06 CG:GLU:E:6:49.65:31.81:36.96
CB:ASP:B:17:55.67:68.84:39.59 CD:GLU:E:6:50.72:31.2:37.82
CG:ASP:B:17:56.3:70.13:39.08 OE1:GLU:E:6:51.83:30.94:37.18
OD1:ASP:B:17:57.42:70.07:38.45 OE2:GLU:E:6:50.53:31.06:39.04
OD2:ASP:B:17:55.79:71.18:39.46 C:GLU:E:6:47.17:31.63:38.85
C:ASP:B:17:57.53:67.26:39.91 O:GLU:E:6:46.05:32.15:38.64
O:ASP:B:17:58.57:66.88:39.4 N:VAL:E:7:48.05:32.06:39.79
N:LEU:B:18:57.42:67.34:41.25 CA:VAL:E:7:47.92:33.11:40.79
CA:LEU:B:18:58.6:67.29:42.17 CB:VAL:E:7:47.8:32.46:42.09
CB:LEU:B:18:58.13:67.79:43.54 CG1:VAL:E:7:47.47:33.5:43.22
CG:LEU:B:18:59.29:67.99:44.61 HG11:VAL:E:7:48.36:34.16:43.27
HG12:VAL:E:7:46.54:34.06:43.01 C:ILE:E:13:49.19:51.48:38.98
HG13:VAL:E:7:47.38:32.94:44.19 O:ILE:E:13:48.05:51.89:39.06
CG2:VAL:E:7:46.75:31.33:42.07 N:ARG:E:14:50.24:52.28:38.72
HG21:VAL:E:7:45.89:31.45:41.38 CA:ARG:E:14:50.15:53.74:38.76
HG22:VAL:E:7:47.19:30.36:41.76 CB:ARG:E:14:50.85:54.26:40.04
HG23:VAL:E:7:46.37:31.24:43.11 CG:ARG:E:14:50.26:53.93:41.42
C:VAL:E:7:48.91:34.21:40.76 CD:ARG:E:14:50.99:54.45:42.68
O:VAL:E:7:50.05:33.9:40.88 NE:ARG:E:14:50.54:53.62:43.82
N:CYS:E:8:48.39:35.44:40.73 CZ:ARG:E:14:50.7:53.9:45.12
CA:CYS:E:8:49.15:36.68:40.85 NH1:ARG:E:14:51.52:54.77:45.58
CB:CYS:E:8:48.55:37.79:39.98 NH2:ARG:E:14:50.03:53.14:45.96
SG:CYS:E:8:49.07:37.75:38.31 C:ARG:E:14:50.92:54.39:37.62
C:CYS:E:8:48.97:37.14:42.3 O:ARG:E:14:52.07:54.1:37.32
O:CYS:E:8:47.85:37.24:42.79 N:ASN:E:15:50.33:55.33:36.91
N:PRO:E:9:50.03:37.49:43.04 CA:ASN:E:15:50.95:56.14:35.79
CD:PRO:E:9:51.35:36.96:42.94 CB:ASN:E:15:52.03:57.17:36.14
CA:PRO:E:9:49.99:38.5:44.1 CG:ASN:E:15:51.71:58.12:37.33
CB:PRO:E:9:51.47:38.6:44.53 OD1:ASN:E:15:52.48:58.06:38.3
CG:PRO:E:9:51.97:37.17:44.28 ND2:ASN:E:15:50.61:58.92:37.34
C:PRO:E:9:49.37:39.91:43.73 C:ASN:E:15:51.35:55.45:34.52
O:PRO:E:9:49.23:40.18:42.53 O:ASN:E:15:51.35:56.07:33.48
N:GLY:E:10:49.09:40.77:44.74 N:NLG:E:16:51.7:54.11:34.56
CA:GLY:E:10:48.48:42.11:44.71 CA:NLG:E:16:52.14:53.31:33.41
C:GLY:E:10:49.18:43.08:43.72 CB:NLG:E:16:53.74:53.04:33.52
O:GLY:E:10:50.4:43.08:43.54 CG:NLG:E:16:54.55:54.25:33.28
N:MET:E:11:48.38:43.94:43.11 OD1:NLG:E:16:55.23:54.79:34.16
CA:MET:E:11:48.8:44.79:42.01 ND2:NLG:E:16:54.62:54.59:32.04
CB:MET:E:11:48.14:44.37:40.71 C:NLG:E:16:51.47:52.01:33.48
CG:MET:E:11:48.83:43.2:40.06 O:NLG:E:16:51.17:51.47:34.52
SD:MET:E:11:48.37:43.08:38.35 N:LEU:E:17:51.28:51.34:32.29
CE:MET:E:11:49.73:42:37.84 CA:LEU:E:17:50.62:50.03:32.18
C:MET:E:11:48.45:46.28:42.19 CB:LEU:E:17:49.62:49.98:30.99
O:MET:E:11:47.32:46.68:41.95 CG:LEU:E:17:48.56:51.12:30.98
N:ASP:E:12:49.46:47.1:42.5 CD1:LEU:E:17:47.88:51.34:29.61
CA:ASP:E:12:49.39:48.55:42.73 CD2:LEU:E:17:47.53:50.89:32.13
CB:ASP:E:12:50.23:48.78:44.03 C:LEU:E:17:51.6:48.76:32.2
CG:ASP:E:12:50.28:50.19:44.51 O:LEU:E:17:51.17:47.65:32.24
OD1:ASP:E:12:49.47:51.04:44.03 N:THR:E:18:52.92:49.03:32.35
OD2:ASP:E:12:51.11:50.52:45.4 CA:THR:E:18:54.03:48.06:32.4
C:ASP:E:12:49.95:49.27:41.51 CB:THR:E:18:55.22:48.91:32.71
O:ASP:E:12:51.17:49.44:41.38 OG1:THR:E:18:55.32:50.07:31.91
N:ILE:E:13:49.09:49.66:40.55 CG2:THR:E:18:56.54:48.09:32.6
CA:ILE:E:13:49.46:49.96:39.22 HG21:THR:E:18:56.48:47.27:33.33
CB:ILE:E:13:48.86:49.13:38.07 HG22:THR:E:18:56.75:47.76:31.56
CG2:ILE:E:13:49.61:49.46:36.79 HG23:THR:E:18:57.44:48.66:32.89
HG21:ILE:E:13:50.7:49.22:36.83 C:THR:E:18:53.88:46.94:33.43
HG22:ILE:E:13:49.12:48.94:35.95 O:THR:E:18:54.32:45.77:33.15
HG23:ILE:E:13:49.67:50.56:36.61 N:ARG:E:19:53.38:47.24:34.65
CG1:ILE:E:13:48.84:47.63:38.4 CA:ARG:E:19:53.32:46.27:35.73
HG11:ILE:E:13:48.24:47.55:39.33 CB:ARG:E:19:53.84:46.86:37.09
HG12:ILE:E:13:48.37:47.12:37.54 CG:ARG:E:19:55.34:47.16:37.29
CD:ILE:E:13:50.14:46.87:38.62 CD:ARG:E:19:56.12:45.89:36.99
NE:ARG:E:19:57.58:45.99:37.47 CB:NLG:E:25:49.46:33.4:32.36
CZ:ARG:E:19:58.34:44.97:37.76 CG:NLG:E:25:50.26:33.64:31.05
NH1:ARG:E:19:57.93:43.71:37.62 OD1:NLG:E:25:51.31:34.22:31.07
NH2:ARG:E:19:59.58:45.08:38.23 ND2:NLG:E:25:49.73:33.18:29.93
C:ARG:E:19:51.97:45.64:35.83 C:NLG:E:25:47.58:33.79:33.93
O:ARG:E:19:51.71:44.7:36.6 O:NLG:E:25:47.29:32.64:34.1
N:LEU:E:20:51.01:45.99:34.97 N:CYS:E:26:47.42:34.76:34.84
CA:LEU:E:20:49.73:45.38:34.78 CA:CYS:E:26:46.75:34.59:36.12
CB:LEU:E:20:48.58:46.44:34.43 CB:CYS:E:26:47.08:35.77:37.12
CG:LEU:E:20:47.18:45.93:34.02 SG:CYS:E:26:48.82:35.84:37.54
CD1:LEU:E:20:46.63:44.91:35.12 C:CYS:E:26:45.25:34.49:36.05
CD2:LEU:E:20:46.24:47.12:33.76 O:CYS:E:26:44.55:35.41:35.59
C:LEU:E:20:49.83:44.33:33.66 N:SER:E:27:44.62:33.38:36.47
O:LEU:E:20:49.04:43.37:33.64 CA:SER:E:27:43.17:33.16:36.63
N:HIS:E:21:50.87:44.44:32.8 CB:SER:E:27:42.77:31.66:36.55
CA:HIS:E:21:51.26:43.45:31.72 OG:SER:E:27:43.19:31.06:35.33
CB:HIS:E:21:52.32:44.01:30.81 C:SER:E:27:42.63:33.72:37.92
ND1:HIS:E:21:50.67:45.42:29.41 O:SER:E:27:41.51:34.19:38.03
CG:HIS:E:21:51.96:45.17:29.95 N:VAL:E:28:43.4:33.87:39
CE1:HIS:E:21:50.89:46.48:28.7 CA:VAL:E:28:42.99:34.56:40.27
NE2:HIS:E:21:52.22:46.95:28.74 CB:VAL:E:28:42.59:33.58:41.43
CD2:HIS:E:21:52.93:46.12:29.56 CG1:VAL:E:28:42.19:34.29:42.71
C:HIS:E:21:51.65:42.03:32.3 HG11:VAL:E:28:41.64:33.62:43.4
O:HIS:E:21:51.79:41.06:31.55 HG12:VAL:E:28:43.09:34.63:43.24
N:GLU:E:22:51.73:41.85:33.65 HG13:VAL:E:28:41.45:35.1:42.51
CA:GLU:E:22:52:40.72:34.5 CG2:VAL:E:28:41.42:32.65:41.01
CB:GLU:E:22:52.77:41.19:35.78 HG21:VAL:E:28:41.81:32.06:40.14
CG:GLU:E:22:54.11:41.96:35.36 HG22:VAL:E:28:41.01:31.93:41.75
CD:GLU:E:22:54.97:42.45:36.46 HG23:VAL:E:28:40.55:33.24:40.67
OE1:GLU:E:22:56.1:42.99:36.1 C:VAL:E:28:44.17:35.51:40.72
OE2:GLU:E:22:54.76:42.22:37.65 O:VAL:E:28:45.31:35.18:40.45
C:GLU:E:22:50.75:39.98:34.88 N:ILE:E:29:43.92:36.61:41.42
O:GLU:E:22:50.92:39.02:35.67 CA:ILE:E:29:44.82:37.49:42.11
N:LEU:E:23:49.6:40.37:34.4 CB:ILE:E:29:44.91:38.98:41.59
CA:LEU:E:23:48.38:39.69:34.78 CG2:ILE:E:29:43.58:39.47:41.07
CB:LEU:E:23:47.2:40.71:34.94 HG21:ILE:E:29:43.68:40.56:40.89
CG:LEU:E:23:45.8:40.32:35.37 HG22:ILE:E:29:43.11:38.88:40.24
CD1:LEU:E:23:45.79:39.81:36.85 HG23:ILE:E:29:42.94:39.48:41.97
CD2:LEU:E:23:44.97:41.61:35.39 CG1:ILE:E:29:45.52:39.99:42.5
C:LEU:E:23:47.84:38.71:33.73 HG11:ILE:E:29:44.8:40.41:43.23
O:LEU:E:23:46.83:38.01:33.9 HG12:ILE:E:29:46.34:39.49:43.07
N:GLU:E:24:48.48:38.62:32.56 CD:ILE:E:29:46.25:41.12:41.69
CA:GLU:E:24:47.98:37.85:31.44 C:ILE:E:29:44.59:37.51:43.56
CB:GLU:E:24:48.88:38.01:30.19 O:ILE:E:29:43.43:37.67:43.99
CG:GLU:E:24:48.81:39.38:29.55 N:GLU:E:30:45.57:37.26:44.46
CD:GLU:E:24:49.11:39.42:28.07 CA:GLU:E:30:45.35:37.01:45.93
OE1:GLU:E:24:48.17:39.55:27.25 CB:GLU:E:30:46.19:35.77:46.34
OE2:GLU:E:24:50.26:39.36:27.66 CG:GLU:E:30:45.46:34.52:45.94
C:GLU:E:24:47.6:36.38:31.71 CD:GLU:E:30:46.09:33.23:46.57
O:GLU:E:24:46.56:35.88:31.32 OE1:GLU:E:30:45.52:32.14:46.28
N:NLG:E:25:48.43:35.55:32.37 OE2:GLU:E:30:47.19:33.39:47.15
CA:NLG:E:25:48.16:34.14:32.55 C:GLU:E:30:45.52:38.26:46.78
O:GLU:E:30:45.52:38.2:48.04 O:LEU:E:36:47.33:55.34:37.98
N:GLY:E:31:45.67:39.42:46.25 N:LEU:E:37:45.75:56.85:38.52
CA:GLY:E:31:45.63:40.68:46.9 CA:LEU:E:37:46.35:58.08:38.12
C:GLY:E:31:44.64:41.64:46.22 CB:LEU:E:37:47.39:58.58:39.17
O:GLY:E:31:43.65:41.22:45.63 CG:LEU:E:37:47.05:58.73:40.71
N:HIS:E:32:44.93:42.94:46.34 CD1:LEU:E:37:48.2:59.09:41.69
CA:HIS:E:32:44.09:43.91:45.6 CD2:LEU:E:37:45.9:59.68:40.71
CB:HIS:E:32:44.29:45.27:46.26 C:LEU:E:37:46.89:58.12:36.68
ND1:HIS:E:32:46.29:45.37:47.77 O:LEU:E:37:47.99:58.52:36.49
CG:HIS:E:32:45.73:45.58:46.55 N:MET:E:38:46.21:57.62:35.72
CE1:HIS:E:32:47.61:45.65:47.7 CA:MET:E:38:46.78:57.54:34.36
NE2:HIS:E:32:47.86:46.15:46.48 CB:MET:E:38:46.36:56.19:33.9
CD2:HIS:E:32:46.7:46.09:45.75 CG:MET:E:38:46.72:54.88:34.71
C:HIS:E:32:44.48:44.03:44.18 SD:MET:E:38:48.25:54.02:34.16
O:HIS:E:32:45.63:43.77:43.9 CE:MET:E:38:48.04:52.91:35.6
N:LEU:E:33:43.58:44.46:43.27 C:MET:E:38:46.31:58.64:33.46
CA:LEU:E:33:43.92:44.98:42 O:MET:E:38:45.15:58.79:33.04
CB:LEU:E:33:43.35:44.1:40.84 N:PHE:E:39:47.22:59.59:33.19
CG:LEU:E:33:43.46:44.63:39.39 CA:PHE:E:39:46.98:60.86:32.61
CD1:LEU:E:33:44.89:44.98:39.04 CB:PHE:E:39:47.82:61.96:33.33
CD2:LEU:E:33:42.84:43.75:38.34 CG:PHE:E:39:47.17:62.38:34.59
C:LEU:E:33:43.52:46.46:41.99 CD1:PHE:E:39:47.56:61.78:35.78
O:LEU:E:33:42.4:46.94:41.9 CE1:PHE:E:39:46.78:61.94:36.94
N:GLN:E:34:44.52:47.33:42.05 CZ:PHE:E:39:45.65:62.74:36.95
CA:GLN:E:34:44.28:48.79:41.96 CD2:PHE:E:39:46.1:63.2:34.6
CB:GLN:E:34:44.56:49.49:43.32 CE2:PHE:E:39:45.37:63.49:35.81
CG:GLN:E:34:45.98:49.64:43.91 C:PHE:E:39:47.23:60.86:31.04
CD:GLN:E:34:46.05:50.31:45.35 O:PHE:E:39:46.59:61.52:30.23
OE1:GLN:E:34:45.12:50.12:46.13 N:LYS:E:40:48.32:60.12:30.69
NE2:GLN:E:34:47.2:50.89:45.72 CA:LYS:E:40:48.87:60.12:29.32
C:GLN:E:34:45.01:49.34:40.8 CB:LYS:E:40:50.4:60.44:29.23
O:GLN:E:34:46.18:49.08:40.59 CG:LYS:E:40:50.82:61.72:29.99
N:ILE:E:35:44.3:50.27:40.09 CD:LYS:E:40:50.48:62.98:29.2
CA:ILE:E:35:44.82:50.86:38.85 CE:LYS:E:40:51.31:63.06:27.97
CB:ILE:E:35:44.1:50.26:37.55 NZ:LYS:E:40:50.8:64.2:27.2
CG2:ILE:E:35:44.72:51.01:36.31 C:LYS:E:40:48.46:58.86:28.47
HG21:ILE:E:35:44.6:52.11:36.29 O:LYS:E:40:48.81:58.71:27.28
HG22:ILE:E:35:45.79:50.73:36.17 N:THR:E:41:47.7:57.89:29.06
HG23:ILE:E:35:44.18:50.51:35.46 CA:THR:E:41:47.25:56.64:28.41
CG1:ILE:E:35:44.27:48.75:37.39 CB:THR:E:41:46.37:55.75:29.37
HG11:ILE:E:35:45.23:48.59:36.86 OG1:THR:E:41:45.42:56.51:30.07
HG12:ILE:E:35:44.42:48.35:38.41 CG2:THR:E:41:47.34:55.04:30.31
CD:ILE:E:35:43.21:47.93:36.75 HG21:THR:E:41:46.68:54.58:31.08
C:ILE:E:35:44.57:52.4:39.03 HG22:THR:E:41:48.08:54.45:29.72
O:ILE:E:35:43.41:52.83:39.05 HG23:THR:E:41:47.95:55.79:30.85
N:LEU:E:36:45.66:53.2:39.12 C:THR:E:41:46.44:56.94:27.15
CA:LEU:E:36:45.62:54.56:39.52 O:THR:E:41:45.72:57.92:27.1
CB:LEU:E:36:46.28:54.72:40.92 N:ARG:E:42:46.58:56.23:26.03
CG:LEU:E:36:46.11:53.54:41.94 CA:ARG:E:42:45.68:56.36:24.89
CD1:LEU:E:36:46.86:53.81:43.25 CB:ARG:E:42:46.31:55.86:23.54
CD2:LEU:E:36:44.71:53.24:42.31 CG:ARG:E:42:47.27:56.83:22.91
C:LEU:E:36:46.31:55.62:38.62 CD:ARG:E:42:48.62:57.03:23.61
NE:ARG:E:42:49.12:55.64:23.9 N:ASP:E:48:43.12:47.69:22.82
CZ:ARG:E:42:50.18:55.04:23.35 CA:ASP:E:48:43.87:46.47:22.59
NH1:ARG:E:42:50.91:55.57:22.42 CB:ASP:E:48:45.29:46.7:22
NH2:ARG:E:42:50.62:53.9:23.84 CG:ASP:E:48:45.3:47.4:20.66
C:ARG:E:42:44.44:55.53:25.2 OD1:ASP:E:48:44.31:47.58:19.95
O:ARG:E:42:44.55:54.67:26.08 OD2:ASP:E:48:46.45:47.75:20.21
N:PRO:E:43:43.22:55.73:24.64 C:ASP:E:48:43.85:45.54:23.8
CD:PRO:E:43:42.86:56.97:23.97 O:ASP:E:48:43.77:44.33:23.54
CA:PRO:E:43:42.08:54.82:24.8 N:LEU:E:49:43.86:45.99:25.12
CB:PRO:E:43:40.88:55.68:24.31 CA:LEU:E:49:43.92:45.05:26.31
CG:PRO:E:43:41.42:56.78:23.35 CB:LEU:E:49:44.12:45.74:27.59
C:PRO:E:43:42.3:53.68:23.9 CG:LEU:E:49:45.45:46.44:27.69
O:PRO:E:43:41.76:52.62:24.17 CD1:LEU:E:49:45.49:47.5:28.72
N:GLU:E:44:43.05:53.82:22.78 CD2:LEU:E:49:46.64:45.5:28.04
CA:GLU:E:44:43.63:52.75:22.01 C:LEU:E:49:42.76:44.09:26.53
CB:GLU:E:44:44.58:53.36:20.94 O:LEU:E:49:41.61:44.54:26.45
CG:GLU:E:44:43.81:54.33:20.01 N:SER:E:50:43.06:42.9:26.96
CD:GLU:E:44:42.77:53.64:19.09 CA:SER:E:50:42.06:41.9:27.34
OE1:GLU:E:44:42.97:52.58:18.47 CB:SER:E:50:41.35:40.95:26.23
OE2:GLU:E:44:41.71:54.22:18.94 OG:SER:E:50:40.23:40.19:26.79
C:GLU:E:44:44.35:51.73:22.81 C:SER:E:50:42.73:41.01:28.28
O:GLU:E:44:44.31:50.56:22.53 O:SER:E:50:43.85:40.57:28
N:ASP:E:45:45.01:52.13:23.96 N:PHE:E:51:42.15:40.71:29.47
CA:ASP:E:45:45.76:51.19:24.83 CA:PHE:E:51:42.64:39.68:30.42
CB:ASP:E:45:46.57:51.97:25.87 CB:PHE:E:51:42.95:40.28:31.87
CG:ASP:E:45:47.68:52.8:25.2 CG:PHE:E:51:43.94:41.46:31.66
OD1:ASP:E:45:47.94:53.94:25.66 CD1:PHE:E:51:45.28:41.17:31.53
OD2:ASP:E:45:48.36:52.37:24.25 CE1:PHE:E:51:46.29:42.13:31.31
C:ASP:E:45:44.85:50.17:25.4 CZ:PHE:E:51:45.82:43.48:31.21
O:ASP:E:45:45.05:48.97:25.29 CD2:PHE:E:51:43.51:42.84:31.61
N:PHE:E:46:43.69:50.61:25.95 CE2:PHE:E:51:44.46:43.83:31.4
CA:PHE:E:46:42.72:49.78:26.57 C:PHE:E:51:41.61:38.55:30.66
CB:PHE:E:46:41.96:50.49:27.68 O:PHE:E:51:40.94:38.53:31.72
CG:PHE:E:46:42.82:50.9:28.82 N:PRO:E:52:41.44:37.59:29.72
CD1:PHE:E:46:43.19:50.01:29.84 CD:PRO:E:52:42.29:37.43:28.49
CE1:PHE:E:46:43.89:50.42:30.98 CA:PRO:E:52:40.35:36.64:29.81
CZ:PHE:E:46:44.14:51.76:31.13 CB:PRO:E:52:40.17:36.16:28.29
CD2:PHE:E:46:43.24:52.23:28.88 CG:PRO:E:52:41.62:36.25:27.76
CE2:PHE:E:46:43.8:52.67:30.13 C:PRO:E:52:40.47:35.48:30.75
C:PHE:E:46:41.88:49.09:25.49 O:PRO:E:52:39.47:34.8:30.92
O:PHE:E:46:41.48:47.97:25.73 O:PRO:E:52:39.47:34.8:30.92
N:ARG:E:47:41.62:49.64:24.3 CA:LYS:E:53:41.74:34.15:32.27
CA:ARG:E:47:40.99:48.99:23.16 CB:LYS:E:53:43.29:33.68:32.32
CB:ARG:E:47:40.88:49.94:21.95 CG:LYS:E:53:43.82:33.09:31.02
CG:ARG:E:47:39.82:50.99:22.14 CD:LYS:E:53:43.54:31.63:30.66
CD:ARG:E:47:39.77:52.09:21.06 CE:LYS:E:53:43.98:30.7:31.81
NE:ARG:E:47:38.86:53.12:21.64 NZ:LYS:E:53:43.85:29.29:31.42
CZ:ARG:E:47:38.81:54.43:21.31 C:LYS:E:53:41.2:34.55:33.7
NH1:ARG:E:47:39.65:55.06:20.49 O:LYS:E:53:40.57:33.75:34.39
NH2:ARG:E:47:37.88:55.13:21.88 N:LEU:E:54:41.18:35.86:34.07
C:ARG:E:47:41.75:47.7:22.83 CA:LEU:E:54:40.73:36.34:35.37
O:ARG:E:47:41.02:46.71:22.71 CB:LEU:E:54:40.99:37.86:35.58
CG:LEU:E:54:40.62:38.49:36.93 CG:ASP:E:59:42.59:41.71:49.28
CD1:LEU:E:54:41.39:37.9:38.13 OD1:ASP:E:59:41.93:40.81:49.85
CD2:LEU:E:54:40.71:40:36.85 OD2:ASP:E:59:43.83:41.9:49.55
C:LEU:E:54:39.25:36.1:35.69 C:ASP:E:59:39.77:43.09:47.01
O:LEU:E:54:38.35:36.6:35.03 O:ASP:E:59:38.94:42.77:46.12
N:ILE:E:55:38.94:35.27:36.69 N:TYR:E:60:39.97:44.42:47.18
CA:ILE:E:55:37.59:34.9:37.07 CA:TYR:E:60:39.2:45.41:46.47
CB:ILE:E:55:37.4:33.38:37.22 CB:TYR:E:60:39.13:46.74:47.25
CG2:ILE:E:55:36.77:32.88:35.92 CG:TYR:E:60:40.48:47.26:47.61
HG21:ILE:E:55:35.68:33.1:35.9 CD1:TYR:E:60:41.07:48.23:46.82
HG22:ILE:E:55:37.38:33.15:35.04 CE1:TYR:E:60:42.35:48.69:47.11
HG23:ILE:E:55:36.71:31.78:35.76 CZ:TYR:E:60:43.06:48.23:48.27
CG1:ILE:E:55:38.7:32.66:37.67 OH:TYR:E:60:44.39:48.57:48.54
HG11:ILE:E:55:39.26:32.45:36.74 CD2:TYR:E:60:41.1:46.91:48.82
HG12:ILE:E:55:39.25:33.39:38.3 CE2:TYR:E:60:42.47:47.29:49.02
CD:ILE:E:55:38.42:31.37:38.43 C:TYR:E:60:39.71:45.67:44.98
C:ILE:E:55:37.26:35.64:38.32 O:TYR:E:60:40.8:45.32:44.63
O:ILE:E:55:36.06:35.88:38.51 N:LEU:E:61:38.86:46.14:44.09
N:MET:E:56:38.26:35.96:39.16 CA:LEU:E:61:39.16:46.54:42.71
CA:MET:E:56:37.89:36.54:40.46 CB:LEU:E:61:38.38:45.77:41.63
CB:MET:E:56:37.29:35.56:41.5 CG:LEU:E:61:38.29:46.56:40.26
CG:MET:E:56:38:34.23:41.78 CD1:LEU:E:61:39.65:46.6:39.56
SD:MET:E:56:37.21:33.18:43.07 CD2:LEU:E:61:37.33:45.79:39.34
CE:MET:E:56:38.09:34.22:44.4 C:LEU:E:61:38.93:48.05:42.69
C:MET:E:56:39:37.4:41.13 O:LEU:E:61:37.89:48.51:43.05
O:MET:E:56:40.17:37.09:41.05 N:LEU:E:62:39.99:48.75:42.47
N:ILE:E:57:38.64:38.47:41.87 CA:LEU:E:62:40.1:50.2:42.71
CA:ILE:E:57:39.53:39.44:42.41 CB:LEU:E:62:41.04:50.59:43.93
CB:ILE:E:57:39.3:40.81:41.91 CG:LEU:E:62:41.6:52.03:44.02
CG2:ILE:E:57:40.15:41.82:42.72 CD1:LEU:E:62:40.47:53.07:43.99
HG21:ILE:E:57:39.94:41.93:43.81 CD2:LEU:E:62:42.43:52.32:45.21
HG22:ILE:E:57:41.22:41.52:42.61 C:LEU:E:62:40.55:50.78:41.4
HG23:ILE:E:57:40.03:42.81:42.24 O:LEU:E:62:41.62:50.58:40.8
CG1:ILE:E:57:39.44:40.97:40.34 N:LEU:E:63:39.75:51.73:40.83
HG11:ILE:E:57:40.47:40.72:40.04 CA:LEU:E:63:40.08:52.48:39.68
HG12:ILE:E:57:38.79:40.26:39.79 CB:LEU:E:63:39.11:52.18:38.48
CD:ILE:E:57:39.17:42.37:39.86 CG:LEU:E:63:39.13:50.72:37.89
C:ILE:E:57:39.33:39.28:43.89 CD1:LEU:E:63:37.78:50.32:37.32
O:ILE:E:57:38.25:39.33:44.36 CD2:LEU:E:63:40.26:50.66:36.98
N:THR:E:58:40.4:38.99:44.66 C:LEU:E:63:40.04:53.96:40.01
CA:THR:E:58:40.36:38.57:46.09 O:LEU:E:63:38.96:54.4:40.43
CB:THR:E:58:41.66:38:46.59 N:PHE:E:64:41.2:54.71:39.92
OG1:THR:E:58:41.98:36.72:46.05 CA:PHE:E:64:41.17:56.08:40.32
CG2:THR:E:58:41.82:37.76:48.14 CB:PHE:E:64:41.87:56.05:41.7
HG21:THR:E:58:41.77:38.75:48.65 CG:PHE:E:64:42.42:57.37:42.33
HG22:THR:E:58:40.93:37.13:48.37 CD1:PHE:E:64:41.59:58.52:42.41
HG23:THR:E:58:42.73:37.17:48.36 CE1:PHE:E:64:42.06:59.71:43.12
C:THR:E:58:39.99:39.73:47.04 CZ:PHE:E:64:43.39:59.78:43.52
O:THR:E:58:39.09:39.5:47.86 CD2:PHE:E:64:43.69:57.52:42.86
N:ASP:E:59:40.67:40.86:47 CE2:PHE:E:64:44.21:58.63:43.47
CA:ASP:E:59:40.48:42.03:47.82 C:PHE:E:64:41.86:56.99:39.35
CB:ASP:E:59:41.82:42.63:48.34 O:PHE:E:64:43.08:56.91:39.16
N:ARG:E:65:41.06:57.82:38.63 CD:GLU:E:70:36.57:53.72:24.36
CA:ARG:E:65:41.56:58.79:37.67 OE1:GLU:E:70:35.83:54.29:23.52
CB:ARG:E:65:42.22:60.02:38.43 OE2:GLU:E:70:37.51:52.94:23.99
CG:ARG:E:65:41.15:60.76:39.38 C:GLU:E:70:34.78:52.04:28.64
CD:ARG:E:65:41.84:61.77:40.31 O:GLU:E:70:33.63:52.37:28.77
NE:ARG:E:65:40.74:62.4:41.07 N:SER:E:71:35.26:50.86:29.01
CZ:ARG:E:65:40.93:63.01:42.23 CA:SER:E:71:34.37:49.81:29.59
NH1:ARG:E:65:42.12:63.26:42.75 CB:SER:E:71:33.38:49.22:28.48
NH2:ARG:E:65:39.9:63.29:42.95 OG:SER:E:71:32.36:48.38:29.03
C:ARG:E:65:42.33:58.29:36.46 C:SER:E:71:35.24:48.7:30.23
O:ARG:E:65:43.42:58.6:36.18 O:SER:E:71:36.42:48.61:30.02
N:VAL:E:66:41.62:57.48:35.62 N:LEU:E:72:34.6:47.88:31.04
CA:VAL:E:66:42.09:56.92:34.36 CA:LEU:E:72:35.29:46.69:31.64
CB:VAL:E:66:42.11:55.42:34.25 CB:LEU:E:72:34.87:46.55:33.13
CG1:VAL:E:66:42.69:54.84:32.95 CG:LEU:E:72:35.19:47.76:34.07
HG11:VAL:E:66:43.02:53.78:33.04 CD1:LEU:E:72:34.75:47.53:35.51
HG12:VAL:E:66:41.93:55.01:32.15 CD2:LEU:E:72:36.75:47.85:34.12
HG13:VAL:E:66:43.68:55.23:32.62 C:LEU:E:72:35.01:45.37:30.93
CG2:VAL:E:66:42.98:54.94:35.45 O:LEU:E:72:35.49:44.29:31.39
HG21:VAL:E:66:43.88:55.56:35.65 N:LYS:E:73:34.12:45.38:29.89
HG22:VAL:E:66:42.3:54.87:36.33 CA:LYS:E:73:33.63:44.13:29.2
HG23:VAL:E:66:43.38:53.91:35.32 CB:LYS:E:73:32.5:44.45:28.23
C:VAL:E:66:41.47:57.46:33.14 CG:LYS:E:73:32.82:45.51:27.22
O:VAL:E:66:40.25:57.61:33.05 CD:LYS:E:73:31.45:45.92:26.5
N:TYR:E:67:42.39:57.99:32.26 CE:LYS:E:73:31.35:47.41:25.98
CA:TYR:E:67:42.06:58.66:31.03 NZ:LYS:E:73:32.61:47.79:25.23
CB:TYR:E:67:42.9:59.97:30.71 C:LYS:E:73:34.63:43.29:28.43
CG:TYR:E:67:42.75:61.03:31.76 O:LYS:E:73:34.68:42.05:28.5
CD1:TYR:E:67:42.61:62.36:31.25 N:ASP:E:74:35.49:43.94:27.61
CE1:TYR:E:67:42.6:63.51:32.1 CA:ASP:E:74:36.46:43.2:26.93
CZ:TYR:E:67:42.49:63.33:33.48 CB:ASP:E:74:36.67:43.95:25.62
OH:TYR:E:67:42.38:64.34:34.39 CG:ASP:E:74:35.47:44:24.76
CD2:TYR:E:67:42.66:60.93:33.17 OD1:ASP:E:74:34.43:43.33:24.84
CE2:TYR:E:67:42.62:62.06:33.99 OD2:ASP:E:74:35.57:44.72:23.79
C:TYR:E:67:41.96:57.69:29.82 C:ASP:E:74:37.8:43.16:27.68
O:TYR:E:67:43.01:57.21:29.32 O:ASP:E:74:38.65:42.34:27.37
N:GLY:E:68:40.72:57.24:29.42 N:LEU:E:75:38:44.05:28.65
CA:GLY:E:68:40.45:56.3:28.3 CA:LEU:E:75:39.13:44.06:29.56
C:GLY:E:68:40.26:54.78:28.51 CB:LEU:E:75:39.15:45.4:30.46
O:GLY:E:68:40.45:54.03:27.57 CG:LEU:E:75:40.48:45.69:31.21
N:LEU:E:69:39.73:54.45:29.73 CD1:LEU:E:75:41.67:45.84:30.19
CA:LEU:E:69:39.15:53.17:30.02 CD2:LEU:E:75:40.48:46.84:32.22
CB:LEU:E:69:39.33:52.71:31.44 C:LEU:E:75:39.23:42.9:30.53
CG:LEU:E:69:38.84:51.19:31.68 O:LEU:E:75:40.21:42.27:30.67
CD1:LEU:E:69:39.49:50.15:30.71 N:PHE:E:76:38.05:42.55:31.13
CD2:LEU:E:69:39.09:50.77:33.14 CA:PHE:E:76:38.04:41.44:32.04
C:LEU:E:69:37.63:53.29:29.72 CB:PHE:E:76:37.99:41.88:33.59
O:LEU:E:69:36.87:53.86:30.47 CD1:PHE:E:76:40.44:42.32:34.11
N:GLU:E:70:37.12:52.79:28.61 CE1:PHE:E:76:41.51:43.15:34.52
CA:GLU:E:70:35.79:53.03:28.13 CE1 :PHE:E:76:41.51 :43.15:34.52
CB:GLU:E:70:35.66:52.99:26.61 CZ:PHE:E:76:41.26:44.46:34.98
CG:GLU:E:70:36.44:54.07:25.8 CD2:PHE:E:76:38.89:44.13:34.53
CE2:PHE:E:76:39.96:44.98:34.94 CG2:ILE:E:82:35.61:41.98:44.69
C:PHE:E:76:36.8:40.59:31.69 HG21:ILE:E:82:35.55:41.71:45.76
O:PHE:E:76:35.75:40.68:32.4 HG22:ILE:E:82:36.63:41.79:44.28
N:PRO:E:77:36.77:39.85:30.55 HG23:ILE:E:82:35.57:43.07:44.44
CD:PRO:E:77:37.84:39.75:29.56 CG1:ILE:E:82:34.55:41.71:42.25
CA:PRO:E:77:35.59:39.03:30.14 HG11:ILE:E:82:35.6:41.75:41.89
CB:PRO:E:77:36.1:38.17:29 HG12:ILE:E:82:33.99:41.03:41.56
CG:PRO:E:77:37.16:39.11:28.4 CD:ILE:E:82:33.94:43.1:42.07
C:PRO:E:77:34.9:38.19:31.25 C:ILE:E:82:34.67:39.31:45.35
O:PRO:E:77:33.71:38.32:31.52 O:ILE:E:82:33.61:39.29:45.93
N:ASN:E:78:35.66:37.41:32.03 N:ARG:E:83:35.78:38.82:45.97
CA:ASN:E:78:35.12:36.28:32.8 CA:ARG:E:83:35.64:38.02:47.23
CB:ASN:E:78:36.06:35.1:32.53 CB:ARG:E:83:36.67:36.87:47.3
CG:ASN:E:78:35.85:34.68:31.09 CG:ARG:E:83:37.15:36.64:48.75
OD1:ASN:E:78:34.71:34.86:30.55 CD:ARG:E:83:36.09:36.03:49.7
ND2:ASN:E:78:36.82:34.22:30.3 NE:ARG:E:83:36.67:35.9:51.09
C:ASN:E:78:35.05:36.5:34.32 CZ:ARG:E:83:36.49:36.75:52.07
O:ASN:E:78:34.55:35.7:35.1 NH1:ARG:E:83:35.76:37.87:51.99
N:LEU:E:79:35.34:37.73:34.69 NH2:ARG:E:83:36.99:36.46:53.23
CA:LEU:E:79:35.14:38.24:36.03 C:ARG:E:83:35.71:39.11:48.34
CB:LEU:E:79:35.75:39.66:36.1 O:ARG:E:83:35.22:38.95:49.46
CG:LEU:E:79:35.62:40.38:37.42 N:GLY:E:84:36.38:40.25:48.05
CD1:LEU:E:79:36.23:39.48:38.55 CA:GLY:E:84:36.54:41.41:49.02
CD2:LEU:E:79:36.47:41.62:37.39 C:GLY:E:84:37.07:41.05:50.32
C:LEU:E:79:33.7:38.28:36.64 O:GLY:E:84:36.48:41.51:51.33
O:LEU:E:79:32.84:39.08:36.16 N:SER:E:85:38.2:40.21:50.45
N:THR:E:80:33.48:37.49:37.66 CA:SER:E:85:38.65:39.69:51.72
CA:THR:E:80:32.2:37.09:38.15 CB:SER:E:85:39.74:38.56:51.55
CB:THR:E:80:31.82:35.67:37.61 OG:SER:E:85:40.23:38:52.72
OG1:THR:E:80:30.49:35.32:38.01 C:SER:E:85:39.2:40.76:52.64
CG2:THR:E:80:32.88:34.65:38.12 O:SER:E:85:38.85:40.68:53.82
HG21:THR:E:80:33.86:34.79:37.61 N:ARG:E:86:39.9:41.78:52.16
HG22:THR:E:80:33.03:34.53:39.21 CA:ARG:E:86:40.15:43.03:52.81
HG23:THR:E:80:32.56:33.66:37.7 CB:ARG:E:86:41.72:43.33:52.98
C:THR:E:80:32.13:37.11:39.67 CG:ARG:E:86:42.51:42.21:53.77
O:THR:E:80:31.06:37.47:40.19 CD:ARG:E:86:43.75:41.57:53.06
N:VAL:E:81:33.21:36.95:40.36 NE:ARG:E:86:43.18:40.38:52.39
CA:VAL:E:81:33.12:37.09:41.82 CZ:ARG:E:86:43.78:39.29:52.1
CB:VAL:E:81:33.49:35.79:42.55 NH1:ARG:E:86:44.68:38.8:52.95
CG1:VAL:E:81:33.62:36.05:44.1 NH2:ARG:E:86:43.54:38.74:50.9
HG11:VAL:E:81:33.65:35.05:44.58 C:ARG:E:86:39.46:44.02:51.96
HG12:VAL:E:81:34.53:36.56:44.44 O:ARG:E:86:39.76:44.09:50.73
HG13:VAL:E:81:32.7:36.61:44.37 N:LEU:E:87:38.56:44.87:52.54
CG2:VAL:E:81:32.32:34.86:42.34 CA:LEU:E:87:37.88:45.9:51.72
HG21:VAL:E:81:31.28:35.23:42.5 CB:LEU:E:87:36.42:46.11:52.37
HG22:VAL:E:81:32.38:34.41:41.32 CG:LEU:E:87:35.63:44.82:52.42
HG23:VAL:E:81:32.4:33.99:43.04 CD1:LEU:E:87:34.41:44.75:53.37
C:VAL:E:81:34.15:38.13:42.23 CD2:LEU:E:87:35.27:44.36:50.94
O:VAL:E:81:35.25:38.11:41.76 C:LEU:E:87:38.69:47.22:51.71
N:ILE:E:82:33.82:39.01:43.12 O:LEU:E:87:39.64:47.37:52.41
CA:ILE:E:82:34.8:39.76:43.88 N:PHE:E:88:38.26:48.22:50.9
CB:ILE:E:82:34.56:41.23:43.79 CA:PHE:E:88:38.8:49.54:51.02
CB:PHE:E:88:39.59:49.99:49.71 O:LEU:E:93:33.32:49.9:44.54
CG:PHE:E:88:40.13:51.4:49.85 N:VAL:E:94:35.53:50.12:43.99
CD1:PHE:E:88:39.64:52.51:49.11 CA:VAL:E:94:35.68:51.6:43.92
CE1:PHE:E:88:40.08:53.83:49.36 CB:VAL:E:94:36.81:52.08:44.85
CZ:PHE:E:88:41.16:54.05:50.2 CG1:VAL:E:94:37.16:53.58:44.6
CD2:PHE:E:88:41.22:51.66:50.72 HG11:VAL:E:94:36.27:54.22:44.74
CE2:PHE:E:88:41.68:52.99:50.96 HG12:VAL:E:94:37.78:53.92:45.46
C:PHE:E:88:37.7:50.39:51.45 HG13:VAL:E:94:37.65:53.79:43.62
O:PHE:E:88:36.68:50.58:50.81 CG2:VAL:E:94:36.31:51.94:46.3
N:PHE:E:89:37.76:50.79:52.78 HG21:VAL:E:94:37.12:52.11:47.04
CA:PHE:E:89:36.73:51.52:53.48 HG22:VAL:E:94:35.46:52.62:46.51
CB:PHE:E:89:36.79:53.02:53.17 HG23:VAL:E:94:35.89:50.92:46.33
CG:PHE:E:89:36.05:53.9:54.23 C:VAL:E:94:36.06:51.94:42.52
CD1:PHE:E:89:36.43:54.03:55.57 O:VAL:E:94:37.07:51.5:41.94
CE1:PHE:E:89:35.69:54.8:56.46 N:ILE:E:95:35.22:52.71:41.84
CZ:PHE:E:89:34.61:55.64:56.01 CA:ILE:E:95:35.54:53.35:40.57
CD2:PHE:E:89:34.94:54.67:53.76 CB:ILE:E:95:34.59:52.91:39.46
CE2:PHE:E:89:34.26:55.46:54.61 CG2:ILE:E:95:35.19:53.55:38.15
C:PHE:E:89:35.35:50.94:53.34 HG21:ILE:E:95:36.22:53.19:37.99
O:PHE:E:89:34.43:51.66:52.92 HG22:ILE:E:95:34.52:53.34:37.29
N:ASN:E:90:35.21:49.67:53.64 HG23:ILE:E:95:35.36:54.64:38.25
CA:ASN:E:90:34.07:48.77:53.57 CG1:ILE:E:95:34.61:51.41:39.39
CB:ASN:E:90:32.88:48.99:54.59 HG11:ILE:E:95:35.67:51.11:39.23
CG:ASN:E:90:33.34:48.86:56.05 HG12:ILE:E:95:34.27:50.94:40.34
OD1:ASN:E:90:34.49:48.51:56.32 CD:ILE:E:95:33.74:50.86:38.24
ND2:ASN:E:90:32.47:49.27:57.02 C:ILE:E:95:35.44:54.83:40.71
C:ASN:E:90:33.63:48.5:52.12 O:ILE:E:95:34.28:55.36:40.83
O:ASN:E:90:32.95:47.48:51.94 N:PHE:E:96:36.52:55.59:40.69
N:TYR:E:91 :33.95:49.34:51.11 CA:PHE:E:96:36.46:56.93:41.06
CA:TYR:E:91 :33.79:49.05:49.71 CB:PHE:E:96:37.13:56.93:42.43
CB:TYR:E:91 :34.09:50.27:48.74 CG:PHE:E:96:37.14:58.31 :43.09
CG:TYR:E:91 :33.21 :51.46:49.17 CD1 :PHE:E:96:38.28:58.83:43.7
CD1 :TYR:E:91 :31.8:51.49:48.97 CE1 :PHE:E:96:38.27:60.12:44.23
CE1:TYR:E:91 :31.03:52.61 :49.31 CZ:PHE:E:96:37.12:60.94:44.1
CZ:TYR:E:91 :31.73:53.74:49.74 CD2:PHE:E:96:35.88:59.06:43.24
OH:TYR:E:91 :31.09:54.93:50.23 CE2:PHE:E:96:35.96:60.41 :43.53
CD2:TYR:E:91 :33.85:52.67:49.42 C:PHE:E:96:37.08:57.95:40.05
CE2:TYR:E:91:33.07:53.79:49.73 O:PHE:E:96:38.25:57.8:39.65
C:TYR:E:91 :34.65:47.94:49.09 N:GLU:E:97:36.28:58.99:39.69
O:TYR:E:91:35.85:47.78:49.45 CA:GLU:E:97:36.58:59.98:38.78
N:ALA:E:92:34.05:47.17:48.19 CB:GLU:E:97:37.33:61.19:39.52
CA:ALA:E:92:34.86:46.14:47.57 CG:GLU:E:97:36.4:62.15:40.25
CB:ALA:E:92:33.95:44.94:47.44 CD:GLU:E:97:37.04:63.47:40.54
C:ALA:E:92:35.27:46.54:46.11 OE1 :GLU:E:97:38.16:63.56:41.07
O:ALA:E:92:36.25:46.09:45.55 OE2:GLU:E:97:36.38:64.5:40.22
N:LEU:E:93:34.34:47.3:45.52 C:GLU:E:97:37.25:59.6:37.41
CA:LEU:E:93:34.48:47.83:44.19 O:GLU:E:97:38.11:60.32:36.9
CB:LEU:E:93:33.33:47.39:43.25 N:MET:E:98:36.72:58.59:36.78
CG:LEU:E:93:33.37:48.12:41.83 CA:MET:E:98:37.33:58.11 :35.57
CD1 :LEU:E:93:34.76:47.99:41.13 CB:MET:E:98:36.77:56.76:35.34
CD2:LEU:E:93:32.32:47.48:40.9 CG:MET:E:98:37.19:55.77:36.44
C:LEU:E:93:34.46:49.42:44.27 SD:MET:E:98:38.95:55.37:36.31
CE:MET:E:98:38.68:53.95:35.25 N:LEU:E: 104:29.6:50.41:32.25
C:MET:E:98:37.01 :59.09:34.39 CA:LEU:E:104:30.26:49.67:33.36
O:MET:E:98:36:59.81:34.48 CB:LEU:E:104:29.27:49.52:34.53
N:VAL:E:99:37.8:59.11:33.35 CG:LEU:E:104:29.63:48.66:35.81
CA:VAL:E:99:37.67:60.13:32.29 CD1 :LEU:E:104:30.93:49.2:36.52
CB:VAL:E:99:38.93:61.05:32.24 CD2:LEU:E:104:28.35:48.74:36.62
CG1 :VAL:E:99:39.33:61.66:30.87 C:LEU:E: 104:30.86:48.34:32.95
HG11 :VAL:E:99:38.4:61.9:30.31 O:LEU:E: 104:31.99:48.02:33.23
HG12:VAL:E:99:39.85:62.63:30.94 N:GLY:E: 105:30.14:47.52:32.13
HG13:VAL:E:99:39.93:60.95:30.26 CA:GLY:E: 105:30.67:46.42:31.34
CG2:VAL:E:99:38.64:62.06:33.35 C:GLY:E: 105:30.82:45.12:32.03
HG21 :VAL:E:99:38.45:61.53:34.29 O:GLY:E: 105:31.45:44.21 :31.47
HG22:VAL:E:99:39.54:62.7:33.53 N:LEU:E: 106:30.34:45:33.22
HG23:VAL:E:99:37.81:62.75:33.11 CA:LEU:E:106:30.6:43.86:34.08
C:VAL:E:99:37.47:59.45:31.01 CB:LEU:E: 106:30.79:44.26:35.55
O:VAL:E:99:38.34:58.61 :30.74 CG:LEU:E:106:32.09:45.15:35.95
N:HIS:E:100:36.44:59.78:30.16 CD1 :LEU:E:106:31.95:45.74:37.39
CA:HIS:E: 100:35.85:59:29.05 CD2:LEU:E:106:33.35:44.24:35.92
CB:HIS:E: 100:36.77:59.05:27.83 C:LEU:E: 106:29.52:42.8:33.9
ND1:HIS:E:100:36.36:61.26:26.86 O:LEU:E:106:28.73:42.57:34.76
CG:HIS:E: 100:37.21 :60.42:27.55 N:TYR:E: 107:29.41 :42.14:32.71
CE1:HIS:E: 100:37:62.37:26.77 CA:TYR:E: 107:28.22:41.42:32.27
NE2:HIS:E: 100:38.24:62.3:27.28 CB:TYR:E:107:28.02:41.55:30.69
CD2:HIS:E: 100:38.37:61.07:27.73 CG:TYR:E: 107:29.1 :40.72:30.07
C:HIS:E: 100:35.55:57.52:29.41 CD1 :TYR:E: 107:28.68:39.47:29.51
O:HIS:E:100:35.97:56.61 :28.68 CE1 :TYR:E: 107:29.54:38.87:28.64
N:LEU:E:101 :34.82:57.21 :30.47 CZ:TYR:E: 107:30.75:39.47:28.21
CA:LEU:E: 101 :34.39:55.87:30.82 OH:TYR:E: 107:31.57:38.7:27.33
CB:LEU:E: 101 :34.6:55.73:32.33 CD2:TYR:E: 107:30.37:41.23:29.79
CG:LEU:E: 101 :33.98:54.5:32.95 CE2:TYR:E: 107:31.19:40.67:28.79
CD1 :LEU:E: 101:34.14:53.23:32.16 C:TYR:E: 107:28.09:40:32.85
CD2:LEU:E: 101:34.57:54.13:34.36 O:TYR:E: 107:27.05:39.33:32.83
C:LEU:E: 101 :32.9:55.87:30.4 N:ASN:E: 108:29.22:39.58:33.43
O:LEU:E:101 :32.29:56.91 :30.7 CA:ASN:E:108:29.3:38.26:34.08
N:LYS:E: 102:32.35:54.8:29.73 CB:ASN:E: 108:30.51 :37.49:33.48
CA:LYS:E: 102:31.1:54.95:28.98 CG:ASN:E:108:30.15:37.02:32.08
CB:LYS:E: 102:31.31:54.55:27.52 OD1 :ASN:E: 108:29.1:36.37:31.96
CG:LYS:E: 102:32.24:55.48:26.72 ND2:ASN:E: 108:30.93:37.28:31.03
CD:LYS:E: 102:31.71 :56.9:26.44 C:ASN:E:108:29.35:38.39:35.62
CE:LYS:E: 102:30.46:56.98:25.51 O:ASN:E: 108:29.52:37.37:36.31
NZ:LYS:E: 102:30.25:58.39:25 N:LEU:E: 109:29.12:39.52:36.18
C:LYS:E: 102:30.14:53.97:29.54 CA:LEU:E:109:29.23:39.64:37.66
O:LYS:E:102:29.03:54.37:29.85 CB:LEU:E:109:29.56:41.06:38.11
N:GLU:E: 103:30.5:52.65:29.57 CG:LEU:E:109:29.9:41.29:39.63
CA:GLU:E: 103:29.62:51.6:30.05 CD1 :LEU:E:109:31.38:40.93:40.06
CB:GLU:E: 103:29.33:50.74:28.75 CD2:LEU:E:109:29.63:42.69:40.05
CG:GLU:E: 103:28.55:51.54:27.69 C:LEU:E: 109:27.99:39.1:38.32
CD:GLU:E: 103:28.46:50.66:26.5 O:LEU:E: 109:26.91:39.64:38.17
OE1:GLU:E:103:27.62:49.73:26.47 N:MET:E:110:28.2:38.08:39.17
OE2:GLU:E: 103:29.25:51.01 :25.5 CA:MET:E: 110:27.25:37.46:39.92
C:GLU:E: 103:30.25:50.78:31.11 CB:MET:E: 110:27.4:35.86:39.74
O:GLU:E: 103:31.4:50.39:30.95 CG:MET:E: 110:27.35:35.35:38.29
SD:MET:E:110:25.86:35.86:37.49 CB:SER:E:116:29.15:48.76:49.74
CE:MET:E:110:24.69:35:38.5 OG:SER:E:116:30.05:49.39:50.66
C:MET:E:110:27.3:37.91:41.34 C:SER:E:116:30.5:48.2:47.7
O:MET:E:110:26.23:37.95:42.04 O:SER:E:116:31.72:47.97:47.47
N:NLG:E:111:28.51:38.16:41.93 N:VAL:E:117:29.78:48.97:46.84
CA:NLG:E:111:28.48:38.33:43.36 CA:VAL:E:117:30.31:49.5:45.65
CB:NLG:E:111:28.63:36.99:44.09 CB:VAL:E:117:29.34:48.99:44.51
CG:NLG:E:111:28.61:37.13:45.59 CG1:VAL:E:117:29.66:49.62:43.13
OD1:NLG:E:111:29.57:36.9:46.28 HG11:VAL:E:117:30.71:49.47:42.82
ND2:NLG:E:111:27.41:37.5:46.16 HG12:VAL:E:117:29.03:49.22:42.3
C:NLG:E:111:29.64:39.26:43.78 HG13:VAL:E:117:29.35:50.7:43.13
O:NLG:E:111:30.76:39.24:43.3 CG2:VAL:E:117:29.15:47.45:44.41
N:ILE:E:112:29.34:40.12:44.79 HG21:VAL:E:117:30.09:46.89:44.59
CA:ILE:E:112:30.35:40.67:45.62 HG22:VAL:E:117:28.42:47.06:45.14
CB:ILE:E:112:30.43:42.28:45.48 HG23:VAL:E:117:28.72:47.15:43.43
CG2:ILE:E:112:31.63:42.75:46.33 C:VAL:E:117:30.32:50.98:45.79
HG21:ILE:E:112:31.28:43.36:47.18 O:VAL:E:117:29.44:51.49:46.45
HG22:ILE:E:112:32.25:41.9:46.7 N:ARG:E:118:31.34:51.67:45.19
HG23:ILE:E:112:32.33:43.35:45.69 CA:ARG:E:118:31.37:53.13:45
CG1:ILE:E:112:30.59:42.71:44.05 CB:ARG:E:118:32.19:53.77:46.04
HG11:ILE:E:112:31.54:42.25:43.68 CG:ARG:E:118:32.06:55.29:45.98
HG12:ILE:E:112:29.82:42.17:43.47 CD:ARG:E:118:32.71:55.99:47.19
CD:ILE:E:112:30.48:44.24:43.73 NE:ARG:E:118:32.73:57.51:47.24
C:ILE:E:112:30:40.18:47.02 CZ:ARG:E:118:32.37:58.17:48.28
O:ILE:E:112:28.9:40.26:47.54 NH1:ARG:E:118:32.05:57.64:49.43
N:THR:E:113:30.99:39.59:47.64 NH2:ARG:E:118:32.38:59.5:48.23
CA:THR:E:113:31.02:39.1:48.99 C:ARG:E:118:31.75:53.6:43.63
CB:THR:E:113:31.92:37.84:49.18 O:ARG:E:118:32.83:53.3:43.15
OG1:THR:E:113:31.48:36.73:48.41 N:ILE:E:119:30.85:54.36:42.9
CG2:THR:E:113:32.01:37.34:50.62 CA:ILE:E:119:31.07:54.61:41.49
HG21:THR:E:113:31.02:37.26:51.13 CB:ILE:E:119:30.31:53.78:40.53
HG22:THR:E:113:32.56:36.39:50.7 CG2:ILE:E:119:30.63:54.25:39.08
HG23:THR:E:113:32.61:37.97:51.32 HG21:ILE:E:119:30.25:55.29:38.99
C:THR:E:113:31.22:40.16:50.07 HG22:ILE:E:119:31.7:54.24:38.79
O:THR:E:113:32.22:40.85:50.17 HG23:ILE:E:119:30.01:53.6:38.42
N:ARG:E:114:30.05:40.35:50.84 CG1:ILE:E:119:30.69:52.33:40.81
CA:ARG:E:114:29.79:41.25:51.97 HG11:ILE:E:119:31.75:52.2:40.51
CB:ARG:E:114:30.06:40.32:53.25 HG12:ILE:E:119:30.57:52.02:41.87
CG:ARG:E:114:29.25:39.04:53.46 CD:ILE:E:119:29.84:51.26:39.99
CD:ARG:E:114:27.81:39.29:53.66 C:ILE:E:119:30.66:56.05:41.33
NE:ARG:E:114:27.21:38.02:54.2 O:ILE:E:119:29.54:56.42:40.89
CZ:ARG:E:114:25.96:38.03:54.74 N:GLU:E:120:31.59:56.94:41.66
NH1:ARG:E:114:25.12:39.08:54.75 CA:GLU:E:120:31.43:58.36:41.87
NH2:ARG:E:114:25.45:36.92:55.18 CB:GLU:E:120:31.89:58.62:43.2
C:ARG:E:114:30.61:42.56:51.93 CG:GLU:E:120:32.27:60.05:43.61
O:ARG:E:114:31.41:42.86:52.77 CD:GLU:E:120:32.48:60.09:45.12
N:GLY:E:115:30.41:43.44:50.94 OE1:GLU:E:120:33.49:59.51:45.6
CA:GLY:E:115:31.24:44.64:50.73 OE2:GLU:E:120:31.67:60.73:45.79
C:GLY:E:115:30.74:45.45:49.59 C:GLU:E:120:32.2:59.25:40.86
O:GLY:E:115:30.27:44.92:48.59 O:GLU:E:120:33.41:59.14:40.67
N:SER:E:116:30.64:46.76:49.82 N:LYS:E:121:31.37:60.18:40.34
CA:SER:E:116:29.84:47.64:48.98 CA:LYS:E:121:31.78:61.31:39.53
CB :LYS:E: 121 :32.52:62.31 :40.47 CZ:TYR:E:127:23.38:51.49:28.03
CG:LYS:E:121:31.64:63.11:41.53 OH:TYR:E:127:22.85:51.66:26.78
CD:LYS:E: 121 :32.28:64.34:42.2 CD2:TYR:E: 127:23.36:50.42:30.26
CE:LYS:E:121:33.38:63.85:43.15 CE2:TYR:E: 127:22.75:50.72:29.08
NZ:LYS:E:121:34.16:64.9:43.84 C:TYR:E:127:24.85:51.22:34.37
C:LYS:E: 121:32.52:60.96:38.18 O:TYR:E:127:25.36:50.12:34.61
O:LYS:E:121:33.44:61.63:37.77 N:LEU:E:128:24.49:52.14:35.28
N:ASN:E: 122:32.07:59.99:37.42 CA:LEU:E:128:24.74:51.95:36.73
CA:ASN:E: 122:32.82:59.48:36.27 CB :LEU:E: 128:25.34:53.14:37.47
CB:ASN:E:122:32.65:57.98:36.19 CG:LEU:E:128:26.6:53.53:36.71
CG:ASN:E:122:33.25:57.28:37.46 CD1:LEU:E:128:27.1:54.84:37.41
OD1:ASN:E:122:34.43:56.97:37.38 CD2:LEU:E:128:27.64:52.4:36.64
ND2:ASN:E: 122:32.37:57.07:38.47 C:LEU:E:128:23.49:51.67:37.56
C:ASN:E:122:32.19:60.02:35.01 O:LEU:E:128:23.66:51.02:38.58
O:ASN:E:122:31.03:59.72:34.58 N:ALA:E: 129:22.32:52.14:37.04
N:ASN:E:123:33.01:60.78:34.26 CA:ALA:E:129:21.07:51.82:37.71
CA:ASN:E:123:32.7:61.44:32.96 CB:ALA:E:129:20.35:53.06:38.04
CB:ASN:E:123:33.95:62.46:32.56 C:ALA:E: 129:20.22:50.96:36.8
CG:ASN:E:123:33.97:62.98:31.16 O:ALA:E:129:19.17:50.49:37.21
OD1:ASN:E:123:34.66:62.37:30.28 N:THR:E:130:20.84:50.59:35.64
ND2:ASN:E:123:33.12:64.01:30.87 CA:THR:E:130:20.21:49.79:34.66
C:ASN:E:123:32.17:60.56:31.86 CB:THR:E:130:20.8:49.84:33.24
O:ASN:E:123:32.93:59.83:31.26 OG1:THR:E:130:22.2:49.65:33.33
N:GLU:E:124:30.84:60.63:31.56 CG2:THR:E:130:20.69:51.28:32.69
CA:GLU:E:124:30.15:59.81:30.49 HG21:THR:E:130:21.09:51.32:31.65
CB :GLU:E: 124:30.82:59.6:29.13 HG22:THR:E:130:19.61:51.51:32.78
CG:GLU:E:124:31.03:61:28.53 HG23:THR:E:130:21.11:52.07:33.35
CD:GLU:E:124:29.82:61.37:27.64 C:THR:E:130:20.17:48.33:35.06
OE1:GLU:E:124:29.3:62.53:27.84 O:THR:E:130:19.41:47.53:34.44
OE2:GLU:E: 124:29.57:60.62:26.72 N:ILE:E:131:21:47.91:36.07
C:GLU:E:124:29.58:58.43:31.01 CA:ILE:E:131:21.19:46.59:36.63
O:GLU:E:124:29.04:57.65:30.21 CB:ILE:E:131:22.67:46.22:36.79
N:LEU:E:125:29.69:58.07:32.32 CG2:ILE:E: 131:23.29:46.02:35.37
CA:LEU:E:125:29.37:56.71:32.7 HG21:ILE:E:131:23.05:45.04:34.92
CB:LEU:E:125:30.1:56.27:33.92 HG22:ILE:E:131:22.96:46.82:34.69
CG:LEU:E: 125:29.94:54.92:34.61 HG23:ILE:E:131:24.41:46.05:35.34
CD1:LEU:E:125:30.24:53.72:33.66 CG1:ILE:E:131:23.56:47.18:37.66
CD2:LEU:E: 125:30.83:54.72:35.87 HG11:ILE:E:131:22.8:47.55:38.38
C:LEU:E:125:27.84:56.5:32.74 HG12:ILE:E:131:24.33:46.58:38.19
O:LEU:E:125:27.21:57.15:33.59 CD:ILE:E:131:24.32:48.22:36.87
N:CYS:E:126:27.32:55.62:31.84 C:ILE:E:131:20.52:46.33:37.93
CA:CYS:E:126:26.02:55.01:31.91 O:ILE:E:131:20.12:47.27:38.68
CB:CYS:E:126:25.35:55.12:30.49 N:ASP:E:132:20.35:45.03:38.28
SG:CYS:E: 126:24.96:56.89:29.98 CA:ASP:E:132:19.65:44.67:39.47
C:CYS:E:126:26.09:53.59:32.34 CB:ASP:E:132:18.57:43.57:39.35
O:CYS:E:126:27.13:52.96:32.3 CG:ASP:E:132:17.72:43.34:40.58
N:TYR:E:127:24.91:53.08:32.75 OD1:ASP:E:132:17.94:44.07:41.62
CA:TYR:E:127:24.69:51.69:32.96 OD2:ASP:E:132:16.79:42.5:40.56
CB:TYR:E:127:25.23:50.68:31.97 C:ASP:E:132:20.71:44.36:40.56
CG:TYR:E: 127:24.62:50.94:30.57 O:ASP:E:132:21.33:43.3:40.65
CD1:TYR:E:127:25.25:51.76:29.61 N:TRP:E:133:20.88:45.32:41.48
CE1:TYR:E:127:24.65:52.05:28.39 CA:TRP:E:133:21.92:45.2:42.5
CB:TRP:E:133:22.37:46.59:43.05 CB:ASP:E:138:23.83:41.34:51.37
CG:TRP:E:133:23.07:47.52:42.04 CG:ASP:E:138:25.28:41.29:51.78
CD1:TRP:E:133:22.53:48.68:41.53 OD1:ASP:E:138:25.58:41.41:53.01
NE1:TRP:E:133:23.49:49.32:40.76 OD2:ASP:E:138:26.15:40.94:50.9
CE2:TRP:E:133:24.67:48.56:40.82 C:ASP:E:138:23.52:43.75:50.91
CD2:TRP:E:133:24.41:47.41:41.62 O:ASP:E:138:22.67:43.97:51.76
CE3:TRP:E:133:25.43:46.49:41.86 N:SER:E:139:24.45:44.75:50.54
CZ3:TRP:E:133:26.66:46.63:41.11 CA:SER:E:139:24.55:46.03:51.21
CZ2:TRP:E:133:25.96:48.79:40.3 CB:SER:E:139:26.04:46.32:51.66
CH2:TRP:E:133:26.89:47.77:40.43 OG:SER:E:139:26.42:45.43:52.67
C:TRP:E:133:21.46:44.33:43.67 C:SER:E:139:24.22:47.2:50.3
O:TRP:E:133:22.35:43.85:44.39 O:SER:E:139:24.85:48.29:50.34
N:SER:E:134:20.17:43.98:43.81 N:VAL:E:140:23.2:47.01:49.53
CA:SER:E:134:19.73:43.09:44.85 CA:VAL:E:140:22.76:47.84:48.43
CB:SER:E:134:18.14:43.09:44.95 CB:VAL:E:140:21.6:47.18:47.71
OG:SER:E:134:17.5:42.68:43.73 CG1:VAL:E:140:21.17:48.09:46.58
C:SER:E:134:20.12:41.6:44.6 HG11:VAL:E:140:20.56:48.92:47.01
O:SER:E:134:19.95:40.79:45.45 HG12:VAL:E:140:21.99:48.53:45.97
N:ARG:E:135:20.63:41.36:43.41 HG13:VAL:E:140:20.45:47.64:45.87
CA:ARG:E:135:21.15:40.07:43.05 CG2:VAL:E:140:22.02:45.78:47.17
CB:ARG:E:135:20.65:39.82:41.61 HG21:VAL:E:140:22.86:45.83:46.43
CG:ARG:E:135:19.22:40.12:41.36 HG22:VAL:E:140:22.37:45.14:48.02
CD:ARG:E:135:18.28:39.24:42.11 HG23:VAL:E:140:21.14:45.26:46.74
NE:ARG:E:135:16.91:39.64:41.73 C:VAL:E:140:22.57:49.28:48.79
CZ:ARG:E:135:16.02:38.94:40.95 O:VAL:E:140:22.98:50.15:48.03
NH1:ARG:E:135:16.32:37.86:40.34 N:GLU:E:141:22.05:49.57:49.96
NH2:ARG:E:135:14.76:39.3:40.97 CA:GLU:E:141:21.71:50.9:50.43
C:ARG:E:135:22.66:39.99:43.07 CB:GLU:E:141:20.75:50.76:51.62
O:ARG:E:135:23.26:38.94:42.83 CG:GLU:E:141:19.35:50.24:51.14
N:ILE:E:136:23.42:41.08:43.44 CD:GLU:E:141:18.49:49.77:52.28
CA:ILE:E:136:24.82:41.1:43.35 OE1:GLU:E:141:17.4:50.36:52.42
CB:ILE:E:136:25.23:42.44:42.6 OE2:GLU:E:141:18.9:48.91:53.11
CG2:ILE:E:136:26.74:42.65:42.66 C:GLU:E:141:23.02:51.74:50.81
HG21:ILE:E:136:26.9:43.64:42.19 O:GLU:E:141:23.03:52.97:50.88
HG22:ILE:E:136:27.05:42.85:43.72 N:ASP:E:142:24.1:50.94:51.16
HG23:ILE:E:136:27.29:41.84:42.15 CA:ASP:E:142:25.33:51.53:51.57
CG1:ILE:E:136:24.67:42.56:41.11 CB:ASP:E:142:26.09:50.63:52.56
HG11:ILE:E:136:23.57:42.54:41.15 CG:ASP:E:142:25.52:50.71:53.95
HG12:ILE:E:136:24.99:43.51:40.62 OD1:ASP:E:142:25.11:51.78:54.49
CD:ILE:E:136:25.15:41.39:40.29 OD2:ASP:E:142:25.6:49.66:54.58
C:ILE:E:136:25.46:41.12:44.7 C:ASP:E:142:26.3:51.68:50.37
O:ILE:E:136:26.48:40.42:44.89 O:ASP:E:142:27.42:52.13:50.61
N:LEU:E:137:24.83:41.88:45.69 N:ASN:E:143:25.86:51.46:49.14
CA:LEU:E:137:25.32:42.11:46.98 CA:ASN:E:143:26.41:51.91:47.9
CB:LEU:E:137:26.28:43.34:46.97 CB:ASN:E:143:25.7:51.46:46.6
CG:LEU:E:137:25.74:44.77:46.78 CG:ASN:E:143:25.94:50.02:46.38
CD1:LEU:E:137:26.79:45.76:47.32 OD1:ASN:E:143:26.82:49.4:46.97
CD2:LEU:E:137:25.54:45.17:45.35 ND2:ASN:E:143:25.01:49.39:45.57
C:LEU:E:137:24.15:42.42:47.88 C:ASN:E:143:26.37:53.45:47.81
O:LEU:E:137:23.07:42.79:47.45 O:ASN:E:143:25.34:54.08:48.07
N:ASP:E:138:24.4:42.24:49.18 N:HIS:E:144:27.5:54.06:47.32
CA:ASP:E:138:23.51:42.4:50.28 CA:HIS:E:144:27.51:55.43:46.87
CB:HIS:E:144:28.65:56.21:47.66 CB:LYS:E:149:26.33:60.22:31.8
ND1:HIS:E:144:27.71:58.45:48.23 CG:LYS:E: 149:25.65:60.64:30.5
CG:HIS:E: 144:28.61:57.64:47.48 CD:LYS:E: 149:26.58:60.99:29.32
CE1:HIS:E:144:28.1:59.69:47.92 CE:LYS:E:149:25.75:61.55:28.19
NE2:HIS:E: 144:29.19:59.67:47.07 NZ:LYS:E:149:26.6:61.52:27.03
CD2:HIS:E: 144:29.49:58.39:46.78 C:LYS:E: 149:24.27:59.28:32.89
C:HIS:E:144:27.72:55.54:45.35 O:LYS:E:149:23.23:59.59:32.3
O:HIS:E:144:28.88:55.37:44.85 N:ASP:E:150:24.32:58.18:33.66
N:ILE:E:145:26.63:55.76:44.6 CA:ASP:E:150:23.23:57.24:33.96
CA:ILE:E:145:26.75:55.76:43.17 CB:ASP:E:150:23.76:56.22:34.99
CB:ILE:E:145:26.1:54.52:42.41 CG:ASP:E:150:22.86:55.14:35.01
CG2:ILE:E: 145:25.98:54.92:40.94 OD1:ASP:E:150:22.75:54.55:33.9
HG21:ILE:E: 145:25.7:54.05:40.32 OD2: ASP:E: 150:22.18:54.81:36
HG22:ILE:E:145:25.26:55.77:40.84 C:ASP:E:150:22:57.96:34.46
HG23:ILE:E: 145:27.03:55.2:40.68 O:ASP:E:150:20.87:57.57:34.11
CG1:ILE:E:145:26.87:53.17:42.63 N:ASP:E:151:22.24:58.99:35.31
HG11:ILE:E:145:27.91:53.4:42.32 CA:ASP:E:151:21.23:59.71:36.1
HG12:ILE:E:145:26.84:52.94:43.72 CB:ASP:E:151:21.9:60.25:37.42
CD:ILE:E:145:26.28:51.97:41.8 CG:ASP:E:151:21.99:59.19:38.51
C:ILE:E: 145:26.19:57.08:42.66 OD1 : ASP:E: 151 :22.49:59.41 :39.63
O:ILE:E:145:24.95:57.23:42.68 OD2:ASP:E:151:21.52:58.06:38.24
N:VAL:E:146:26.91:58.08:42.25 C:ASP:E:151:20.67:60.94:35.38
CA:VAL:E:146:26.5:59.43:42.16 O:ASP:E:151:19.77:61.6:35.87
CB:VAL:E:146:26.79:60.31:43.41 N:ASN:E:152:21.26:61.29:34.17
CG1:VAL:E:146:26.04:59.79:44.7 CA:ASN:E:152:20.87:62.41:33.29
HG11:VAL:E:146:24.95:59.85:44.52 CB:ASN:E:152:22.08:63.2:32.79
HG12:VAL:E:146:26.31:58.74:44.98 CG:ASN:E:152:22.87:63.97:33.85
HG13:VAL:E:146:26.24:60.38:45.62 OD1:ASN:E:152:24.02:63.71:34.17
CG2: VAL:E: 146:28.33:60.29:43.7 ND2:ASN:E: 152:22.22:64.93:34.53
HG21:VAL:E:146:28.93:59.37:43.58 C:ASN:E:152:20.1:61.87:32.12
HG22:VAL:E:146:28.79:61.02:43.01 O:ASN:E:152:19.7:62.67:31.27
HG23:VAL:E:146:28.49:60.78:44.69 N:GLU:E:153:19.9:60.58:31.96
C:VAL:E:146:27.13:60.22:41.03 CA:GLU:E:153:18.94:59.91:31.1
O:VAL:E:146:28.24:60.14:40.6 CB:GLU:E:153:17.46:60.35:31.26
N:LEU:E:147:26.21:61.11:40.5 CG:GLU:E:153:17.07:60.15:32.68
CA:LEU:E: 147:26.49:62.11 :39.47 CD:GLU:E: 153: 15.72:60.74:32.95
CB:LEU:E:147:27.48:63.15:40.02 OE1:GLU:E:153:14.67:60.05:33.11
CG:LEU:E: 147:26.92:63.92:41.22 OE2:GLU:E:153:15.68:62.03:33.01
CD1:LEU:E: 147:28.13:64.42:42.08 C:GLU:E:153:19.34:59.91:29.65
CD2:LEU:E:147:26.07:65.09:40.72 O:GLU:E:153:18.58:59.56:28.78
C:LEU:E:147:27.03:61.6:38.15 N:GLU:E: 154:20.61:60.37:29.29
O:LEU:E:147:27.68:62.26:37.36 CA:GLU:E:154:21.04:60.53:27.93
N:ASN:E:148:26.75:60.37:37.83 CB:GLU:E:154:22.34:61.4:28.06
CA:ASN:E:148:26.99:59.86:36.51 CG:GLU:E:154:21.97:62.84:28.4
CB:ASN:E:148:27.34:58.4:36.55 CD:GLU:E:154:23.01:63.77:28.06
CG:ASN:E:148:28.38:58.11:37.68 OE1:GLU:E:154:23.38:64.05:26.92
OD1:ASN:E:148:29.6:58.17:37.44 OE2:GLU:E:154:23.66:64.3:29
ND2:ASN:E:148:27.92:57.68:38.85 C:GLU:E:154:21.32:59.28:27.14
C:ASN:E:148:25.91:60.16:35.47 O:GLU:E:154:21.07:59.3:25.96
O:ASN:E:148:24.73:60.38:35.8 N:CYS:E:155:21.67:58.22:27.87
N:LYS:E: 149:26.25:60.06:34.18 CA:CYS:E:155:21.68:56.89:27.32
CA:LYS:E: 149:25.39:60.22:33.02 CB:CYS:E:155:22.68:56.01:28.1
SG:CYS:E:155:24.37:56.57:27.96 HG23:THR:E: 162: 13.14:42.67: 18.62
C:CYS:E:155:20.32:56.19:27.37 C:THR:E:162:12.58:43.43:21.48
O:CYS:E:155:20.17:55.12:26.83 O:THR:E:162:12.81:42.19:21.31
N:GLY:E:156:19.35:56.8:27.98 N:ALA:E: 163: 11.56:43.92:22.23
CA:GLY:E:156:17.93:56.33:27.97 CA:ALA:E: 163: 10.59:43.02:22.82
C:GLY:E:156:17.64:54.89:28.36 CB:ALA:E:163:9.91:43.7:24.08
O:GLY:E:156:16.57:54.37:28.04 C:ALA:E:163:9.64:42.32:21.83
N:ASP:E:157:18.64:54.32:29.04 O:ALA:E:163:9.04:41.28:22.06
CA:ASP:E:157:18.64:52.95:29.53 N:LYS:E: 164:9.42:42.98:20.69
CB:ASP:E:157:17.68:52.77:30.72 CA:LYS:E:164:8.56:42.52:19.59
CG:ASP:E:157:17.96:53.61:31.95 CB:LYS:E:164:8.34:43.65:18.49
OD1:ASP:E:157:19.05:54.2:32.06 CG:LYS:E:164:7.83:45.03:18.95
OD2:ASP:E:157:17.12:53.53:32.91 CD:LYS:E:164:6.33:45.01:19.31
C:ASP:E:157:18.3:51.93:28.41 CE:LYS:E:164:5.7:46.4:19.57
O:ASP:E:157:17.55:50.98:28.65 NZ:LYS:E:164:4.3:46.21:19.95
N:ILE:E:158:18.84:52.18:27.21 C:LYS:E: 164:9.11 :41.27: 19.02
CA:ILE:E:158:18.64:51.23:26.1 O:LYS:E:164:8.38:40.29:18.98
CB:ILE:E:158:18.97:51.87:24.69 N:GLY:E:165:10.4:41.32:18.68
CG2:ILE:E:158:18.8:50.81:23.56 CA:GLY:E: 165: 11.11 : 40.29: 17.95
HG21:ILE:E:158:19.21:51.38:22.71 C:GLY:E:165:11.65:39.23:18.85
HG22:ILE:E:158:19.52:49.96:23.61 O:GLY:E:165:12.19:38.23:18.4
HG23:ILE:E:158:17.82:50.34:23.36 N:LYS:E:166:11.55:39.5:20.17
CG1:ILE:E:158:18.23:53.21:24.41 CA:LYS:E:166:11.95:38.61:21.26
HG11:ILE:E:158:17.15:52.96:24.33 CB:LYS:E:166:11.31:37.25:21.17
HG12:ILE:E:158:18.32:53.97:25.22 CG:LYS:E:166:9.77:37.53:21.2
CD:ILE:E:158:18.7:53.88:23.13 CD:LYS:E:166:8.92:36.23:21.4
C:ILE:E:158:19.47:49.99:26.33 CE:LYS:E:166:7.47:36.72:21.43
O:ILE:E:158:20.7:50.05:26.47 NZ:LYS:E:166:6.49:35.75:22
N:CYS:E:159:18.82:48.82:26.27 C:LYS:E:166:13.4:38.44:21.47
CA:CYS:E:159:19.42:47.54:26.47 O:LYS:E:166:13.92:37.35:21.71
CB:CYS:E:159:18.95:46.86:27.79 N:THR:E:167:14.12:39.58:21.27
SG:CYS:E:159:19.69:47.44:29.33 CA:THR:E:167:15.54:39.65:21.3
C:CYS:E:159:19.19:46.58:25.24 CB:THR:E:167:16.1:40.18:19.98
O:CYS:E:159:18.24:46.9:24.55 OG1:THR:E:167:15.74:41.51:19.71
N:PRO:E: 160: 19.97:45.54:24.93 CG2:THR:E:167:15.55:39.4:18.81
CD:PRO:E:160:21.21:45.22:25.65 HG21:THR:E:167:15.68:38.3:18.87
CA:PRO:E: 160: 19.9:44.72:23.71 HG22:THR:E:167:14.45:39.43:18.75
CB:PRO:E:160:20.65:43.47:24.12 HG23:THR:E:167:15.97:39.73:17.84
CG:PRO:E: 160:21.85:44.16:24.81 C:THR:E: 167: 16.11 :40.47:22.41
C:PRO:E:160:18.53:44.4:23.16 O:THR:E:167:17.33:40.61:22.56
O:PRO:E:160:17.56:44.21:23.9 N:ASN:E:168:15.24:40.94:23.29
N:GLY:E:161:18.31:44.48:21.78 CA:ASN:E:168:15.52:41.62:24.52
CA:GLY:E:161:17.17:43.94:21.12 CB:ASN:E:168:14.34:42.49:24.98
C:GLY:E:161:15.86:44.69:21.11 CG:ASN:E:168:13.1:41.69:25.2
O:GLY:E:161:15.87:45.93:21.18 OD1:ASN:E:168:12.79:40.66:24.67
N:THR:E: 162: 14.87:43.96:20.95 ND2: ASN:E: 168: 12.24:42.22:26.12
CA:THR:E:162:13.5:44.43:20.81 C:ASN:E:168:16.06:40.81:25.71
CB:THR:E:162:13.07:44.76:19.4 O:ASN:E:168:16.03:39.56:25.76
OG1:THR:E:162:11.66:44.86:19.23 N:CYS:E:169:16.7:41.57:26.66
CG2:THR:E:162:13.56:43.66:18.35 CA:CYS:E:169:17.25:41.03:27.81
HG21:THR:E:162:13.16:43.93:17.35 CB :CYS:E: 169: 18.05:42.11 :28.59
HG22:THR:E: 162: 14.67:43.69: 18.29 SG:CYS:E:169:19.52:42.8:27.58
C:CYS:E:169:16.26:40.36:28.81 CB:ASN:E:175:7.84:49.46:43.04
O:CYS:E:169:15.05:40.64:28.79 CG:ASN:E:175:8.63:49.63:44.38
N:PRO:E:170:16.83:39.42:29.72 OD1:ASN:E:175:9.43:48.8:44.74
CD:PRO:E:170:18.14:38.77:29.53 ND2:ASN:E:175:8.29:50.67:45.19
CA:PRO:E:170:16.3:39.12:31.06 C:ASN:E:175:6.69:47.86:41.22
CB:PRO:E:170:17.37:38.45:31.78 O:ASN:E:175:6.24:48.85:40.67
CG:PRO:E:170:18.21:37.78:30.73 N:GLY:E: 176:6.26:46.64:40.88
C:PRO:E:170:15.83:40.27:31.92 CA:GLY:E:176:5.08:46.51:39.98
O:PRO:E:170:16.57:41.25:32.12 C:GLY:E:176:5.44:46.68:38.54
N:ALA:E:171:14.55:40.3:32.45 O:GLY:E:176:4.57:46.5:37.6
CA:ALA:E:171:13.86:41.46:32.97 N:GLN:E: 177:6.76:46.9:38.3
CB:ALA:E:171:12.56:41.73:32.23 CA:GLN:E: 177:7.34:47.04:36.96
C:ALA:E:171:13.66:41.11:34.43 CB:GLN:E:177:8.09:48.39:36.79
O:ALA:E:171:13.21:40.01:34.76 CG:GLN:E:177:7.28:49.67:37.14
N:THR:E:172:13.84:42.1:35.34 CD:GLN:E:177:5.88:49.75:36.48
CA:THR:E:172:13.61:41.76:36.74 OE1:GLN:E:177:5.58:49.32:35.4
CB:THR:E:172:14.85:41.55:37.62 NE2:GLN:E: 177:4.92:50.32:37.27
OG1:THR:E:172:15.57:42.75:37.77 C:GLN:E:177:8.49:46.05:36.7
CG2:THR:E:172:15.89:40.59:37.05 O:GLN:E:177:9.22:45.71:37.6
HG21:THR:E: 172: 16.65:40.28:37.8 N:PHE:E:178:8.6:45.45:35.48
HG22:THR:E: 172: 15.36:39.67:36.73 CA:PHE:E: 178:9.72:44.62:35.05
HG23:THR:E:172:16.45:41.06:36.23 CB:PHE:E:178:9.23:43.39:34.26
C:THR:E:172:12.7:42.87:37.37 CG:PHE:E:178:8.88:42.23:35.14
O:THR:E:172:12.75:44.03:36.93 CZ:PHE:E:178:8.37:40.01:36.82
N:VAL:E:173:11.89:42.49:38.38 CE1:PHE:E:178:7.37:41.05:36.66
CA:VAL:E: 173: 10.82:43.23:39.02 CZ:PHE:E:178:8.37:40.01:36.82
CB:VAL:E:173:9.77:42.3:39.72 CD2:PHE:E:178:9.84:41.24:35.42
CG1:VAL:E:173:10.41:41.71:40.93 CE2:PHE:E:178:9.64:40.18:36.3
HG11:VAL:E:173:9.63:41.2:41.52 C:PHE:E:178:10.68:45.45:34.14
HG12:VAL:E:173:11.23:40.96:40.81 O:PHE:E:178:10.24:46.07:33.15
HG13:VAL:E:173:10.77:42.52:41.62 N:VAL:E: 179: 11.98:45.36:34.47
CG2:VAL:E: 173:8.49:43.08:40.02 CA:VAL:E:179:12.97:46.18:33.83
HG21:VAL:E:173:7.77:42.29:40.33 CB:VAL:E:179:13.37:47.25:34.78
HG22:VAL:E:173:8.72:43.74:40.88 CG1:VAL:E:179:14.5:48.04:34.12
HG23:VAL:E:173:8:43.61:39.17 HG11:VAL:E:179:14.28:48.28:33.05
C:VAL:E:173:11.31:44.32:39.82 HG12:VAL:E:179:14.51:49.08:34.5
O: VAL:E: 173: 12.39:44.22:40.39 HG13:VAL:E:179:15.51:47.59:34.17
N:ILE:E:174:10.54:45.41:39.96 CG2:VAL:E:179:12.11:48.12:35.16
CA:ILE:E: 174: 10.89:46.47:40.88 HG21:VAL:E:179:11.78:48.65:34.25
CB:ILE:E: 174: 12.04:47.32:40.46 HG22: VAL:E: 179: 11.29:47.47:35.52
CG2:ILE:E: 174: 11.91 :47.95:39.07 HG23:VAL:E:179:12.33:48.86:35.96
HG21:ILE:E:174:12.85:48.4:38.69 C: VAL:E: 179: 14.04:45.25:33.25
HG22:ILE:E:174:11.48:47.23:38.34 O:VAL:E:179:14.72:44.51:33.94
HG23:ILE:E:174:11.24:48.84:39.09 N:GLU:E:180:14.2:45.41:31.92
CG1:ILE:E:174:12.33:48.31:41.54 CA:GLU:E:180:15.22:44.73:31.18
HG11:ILE:E:174:13.21:48.86:41.17 CB:GLU:E:180:15.08:44.8:29.68
HG12:ILE:E:174:11.51:49.06:41.66 CG:GLU:E:180:13.99:43.87:29.13
CD:ILE:E: 174: 12.63:47.55:42.85 CD:GLU:E:180:12.61:44.47:29.07
C:ILE:E:174:9.65:47.33:41 OE1:GLU:E:180:11.77:43.82:28.47
O:ILE:E:174:9.18:47.99:40.06 OE2:GLU:E:180:12.37:45.56:29.62
N:ASN:E: 175:9.04:47.32:42.21 C:GLU:E:180:16.66:45.18:31.67
CA:ASN:E: 175:7.74:47.97:42.4 O:GLU:E:180:16.92:46.37:31.79
N: ARG:E: 181 : 17.52:44.24:32.03 CA:SER:E:186:26.74:44.78:24.18
CA:ARG:E:181:18.77:44.46:32.78 CB:SER:E:186:28.33:44.61:24.37
CB:ARG:E:181:18.92:43.63:34.09 OG:SER:E: 186:28.91 :45.71 :23.72
CG:ARG:E:181:17.76:43.76:35.05 C:SER:E:186:26.2:43.48:24.8
CD:ARG:E:181:17.44:45.21:35.48 O:SER:E:186:25.98:42.49:24.1
NE:ARG:E:181:16.46:45.07:36.57 N:HIS:E:187:25.98:43.44:26.13
CZ:ARG:E:181:15.95:46.04:37.34 CA:HIS:E:187:25.82:42.2:26.85
NH1 :ARG:E: 181 : 16.49:47.25:37.38 CB:HIS:E:187:27.21:41.62:27.31
NH2:ARG:E: 181 : 14.99:45.77:38.24 ND1:HIS:E:187:28.09:40.71:25.18
C:ARG:E:181:20.01:44.19:31.93 CG:HIS:E:187:28.26:41.56:26.21
O:ARG:E:181:20.15:43.09:31.34 CE1:HIS:E: 187:29.22:40.76:24.47
N:CYS:E:182:20.87:45.22:31.82 NE2:HIS:E:187:30.13:41.57:25.12
CA:CYS:E:182:22.09:45.11:31.14 CD2:HIS:E: 187:29.49:42.14:26.2
CB:CYS:E:182:21.99:45.43:29.59 C:HIS:E:187:24.79:42.39:27.99
SG:CYS:E:182:21.66:47.18:29.08 O:HIS:E:187:24.64:43.3:28.8
C:CYS:E:182:23.04:46.12:31.74 N:CYS:E:188:23.89:41.43:28.15
O:CYS:E:182:22.64:47.12:32.34 CA:CYS:E:188:22.84:41.52:29.19
N:TRP:E:183:24.37:45.81:31.57 CB:CYS:E:188:21.53:40.8:28.83
CA:TRP:E:183:25.58:46.48:32.16 SG:CYS:E:188:20.71:41.18:27.27
CB:TRP:E:183:26.58:45.5:32.83 C:CYS:E:188:23.33:41.03:30.5
CG:TRP:E:183:25.98:44.5:33.74 O:CYS:E:188:24.3:40.3:30.65
CD1:TRP:E:183:25.39:43.33:33.43 N:GLN:E: 189:22.52:41.4:31.54
NE1:TRP:E: 183:25.23:42.54:34.53 CA:GLN:E:189:22.75:40.78:32.89
CE2:TRP:E:183:25.63:43.27:35.63 CB:GLN:E:189:21.95:41.49:34
CD2:TRP:E:183:26.15:44.45:35.15 CG:GLN:E:189:22.05:40.89:35.41
CE3:TRP:E:183:26.81:45.35:36 CD:GLN:E:189:21.6:41.86:36.6
CZ3:TRP:E:183:26.99:45.02:37.31 OE1:GLN:E:189:21:42.88:36.45
CZ2:TRP:E:183:25.77:42.94:36.97 NE2:GLN:E:189:22.02:41.49:37.8
CH2:TRP:E: 183:26.44:43.87:37.85 C:GLN:E:189:22.53:39.33:32.87
C:TRP:E:183:26.4:47.27:31.12 O:GLN:E:189:21.61:38.8:32.17
O:TRP:E:183:27.49:47.62:31.35 N:LYS:E:190:23.29:38.5:33.68
N:THR:E:184:25.83:47.36:29.96 CA:LYS:E:190:22.96:37.13:33.93
CA:THR:E:184:26.45:47.71:28.67 CB:LYS:E:190:24.21:36.42:34.46
CB :THR:E: 184:27.84:47.1 :28.37 CG:LYS:E:190:25.19:36.21:33.29
OG1:THR:E:184:28.35:47.31:27.04 CD:LYS:E: 190:24.67:35.17:32.28
CG2:THR:E:184:27.77:45.63:28.61 CE:LYS:E:190:25.86:34.64:31.43
HG21 :THR:E: 184:28.68:45.2:28.14 NZ:LYS:E:190:26.46:35.8:30.78
HG22:THR:E: 184:27.7:45.6:29.72 C:LYS:E:190:21.95:37.03:35.05
HG23:THR:E:184:26.85:45.09:28.28 O:LYS:E:190:22.29:37.17:36.2
C:THR:E:184:25.45:47.35:27.6 N:VAL:E: 191:20.69:36.82:34.7
O:THR:E:184:24.57:46.56:27.83 CA:VAL:E:191:19.59:36.74:35.61
N:HIS:E:185:25.51:48.03:26.45 CB:VAL:E:191:18.22:37.11:35.09
CA:HIS:E:185:24.55:47.71:25.37 CG1:VAL:E:191:17.03:36.7:35.94
CB:HIS:E:185:24.46:48.92:24.43 HG11:VAL:E:191:16.87:35.6:35.94
ND1:HIS:E:185:23.7:49.73:22.27 HG12:VAL:E:191:17.19:37.01:36.99
CG:HIS:E:185:23.5:48.83:23.31 HG13:VAL:E:191:16.08:37.08:35.51
CE1:HIS:E:185:22.72:49.55:21.4 CG2:VAL:E:191:18.15:38.65:35.1
NE2:HIS:E:185:21.84:48.6:21.9 HG21:VAL:E:191:17.31:39.03:34.48
CD2:HIS:E: 185:22.34:48.12:23.11 HG22:VAL:E:191:18.29:39.14:36.08
C:HIS:E:185:25.06:46.53:24.63 HG23:VAL:E:191:19.03:38.87:34.46
O:HIS:E:185:24.31:46:23.82 C:VAL:E:191:19.53:35.29:36.19
N:SER:E:186:26.3:46.08:24.79 O:VAL:E:191:19.68:34.28:35.47
N:CYS:E:192:19.37:35.12:37.54 CA:SER:E: 198:24.1 :32.99:45.34
CA:CYS:E:192:19.4:33.74:38.08 CB:SER:E:198:25.04:33.93:46.27
CB:CYS:E:192:20.8:33.4:38.73 OG:SER:E:198:24.31:35.05:46.85
SG:CYS:E:192:21.12:31.64:39.35 C:SER:E:198:24.96:31.99:44.6
C:CYS:E:192:18.35:33.48:39.16 O:SER:E:198:26.04:32.27:44.03
O:CYS:E:192:18.45:34.15:40.23 N:HIS:E: 199:24.43:30.75:44.49
N:PRO:E:193:17.33:32.58:39.03 CA:HIS:E:199:25.17:29.49:44.08
CD:PRO:E:193:16.89:32:37.78 CB:HIS:E:199:24.35:28.16:44.34
CA:PRO:E:193:16.41:32.26:40.15 ND1 :HIS:E: 199:22.89:28.14:46.44
CB:PRO:E:193:15.88:30.86:39.78 CG:HIS:E:199:24.1:28.01:45.8
CG:PRO:E:193:15.68:31.16:38.22 CE1:HIS:E:199:23.11:28.04:47.7
C:PRO:E:193:16.9:32.23:41.59 NE2:HIS:E: 199:24.37:27.69:47.98
O:PRO:E:193:17.9:31.72:42.03 CD2:HIS:E: 199:24.99:27.67:46.75
N:THR:E:194:15.98:32.83:42.44 C:HIS:E:199:25.58:29.47:42.61
CA:THR:E:194:16.35:33.14:43.84 O:HIS:E:199:26.6:28.85:42.24
CB:THR:E:194:15.63:34.33:44.52 N:GLY:E:200:24.81:30.19:41.77
OG1:THR:E:194:14.33:34.05:44.94 CA:GLY:E:200:25.06:30.22:40.32
CG2:THR:E:194:15.44:35.45:43.49 C:GLY:E:200:23.98:29.43:39.54
HG21 :THR:E: 194: 16.44:35.66:43.06 O:GLY:E:200:23.27:28.57:40.07
HG22:THR:E: 194: 14.81 :35.16:42.61 N:CYS:E:201:23.88:29.72:38.28
HG23:THR:E: 194: 14.92:36.32:43.93 CA:CYS:E:201 :22.99:29.24:37.26
C:THR:E:194:16.41:31.91:44.68 CB:CYS:E:201:21.96:30.32:36.81
O:THR:E:194:17.02:31.86:45.76 SG:CYS:E:201:20.48:30.49:37.85
N:ILE:E:195:15.83:30.79:44.25 C:CYS:E:201:23.63:28.5:36.13
CA:ILE:E: 195: 15.95:29.46:44.74 O:CYS:E:201:24.71:28.86:35.58
CB:ILE:E:195:14.9:28.64:44.1 N:THR:E:202:23.02:27.34:35.78
CG2:ILE:E: 195: 13.52:29.33:44.15 CA:THR:E:202:23.72:26.31:34.99
HG21:ILE:E:195:13.44:30.13:43.38 CB:THR:E:202:24.65:25.45:35.83
HG22:ILE:E: 195: 12.66:28.64:44.03 OG1:THR:E:202:25.64:24.88:34.97
HG23:ILE:E:195:13.41:29.7:45.19 CG2:THR:E:202:23.96:24.39:36.61
CG1:ILE:E:195:15.33:28.23:42.67 HG21 :THR:E:202:23.04:24.75:37.12
HG11:ILE:E:195:15.81:29.11:42.19 HG22:THR:E:202:23.63:23.52:35.99
HG12:ILE:E:195:16.11:27.44:42.81 HG23:THR:E:202:24.67:24.02:37.38
CD:ILE:E:195:14.13:27.67:41.81 C:THR:E:202:22.75:25.45:34.29
C:ILE:E:195:17.24:28.81:44.62 O:THR:E:202:21.53:25.48:34.54
O:ILE:E:195:17.51:27.78:45.21 N:ALA:E:203:23.2:24.72:33.22
N:CYS:E:196:18.1:29.36:43.73 CA:ALA:E:203:22.44:24.17:32.07
CA:CYS:E: 196: 19.47:28.99:43.44 CB:ALA:E:203:22.12:22.7:32.38
CB:CYS:E:196:19.82:29.14:41.91 C:ALA:E:203:21.23:24.96:31.66
SG:CYS:E:196:18.85:28.06:40.78 O:ALA:E:203:21.28:26.16:31.48
C:CYS:E:196:20.48:29.89:44.21 N:GLU:E:204:20.14:24.25:31.37
O:CYS:E:196:21.69:29.95:43.94 CA:GLU:E:204:19.01:24.78:30.71
N:LYS:E:197:19.98:30.7:45.21 CB:GLU:E:204:18.05:23.84:29.92
CA:LYS:E:197:20.57:31.74:46.01 CG:GLU:E:204:18.68:23.13:28.76
CB:LYS:E:197:20.98:31.11:47.36 CD:GLU:E:204:19.51:21.9:29.23
CG:LYS:E:197:19.85:30.36:48.09 OE1:GLU:E:204:20.55:21.61:28.63
CD:LYS:E:197:20.18:29.93:49.58 OE2:GLU:E:204:19.2:21.36:30.39
CE:LYS:E: 197: 19.2:28.93:50.07 C:GLU:E:204:18.15:25.59:31.73
NZ:LYS:E:197:17.88:29.47:50.35 O:GLU:E:204:17.63:26.63:31.36
C:LYS:E: 197:21.73:32.42:45.4 N:GLY:E:205:18.13:25.1:32.98
O:LYS:E:197:21.7:33:44.35 CA:GLY:E:205:17.31:25.72:34.02
N:SER:E:198:22.95:32.39:46.05 C:GLY:E:205:17.67:25.34:35.37
O:GLY:E:205: 16.77:25.2:36.2 N:LEU:E:213:30:29.4:44.49
N:LEU:E:206: 18.97:25.12:35.68 CA:LEU:E:213:29.52:30.71 :44.41
CA:LEU:E:206:19.28:24.47:36.96 CB:LEU:E:213:30.41 :31.6:45.28
CB :LEU:E:206:20.11 :23.18:36.73 CG:LEU:E:213:30.25:33.08:45.21
CG:LEU:E:206:19.3:22.09:36.06 CD1 :LEU:E:213:28.84:33.63:45.47
CD1 :LEU:E:206:20.22:20.91 :35.77 CD2:LEU:E:213:31.25:33.74:46.18
CD2:LEU:E:206: 18.19:21.6:37.1 C:LEU:E:213:29.6:31.22:42.98
C:LEU:E:206:20.12:25.36:37.75 O:LEU:E:213:30.54:31.01 :42.18
O:LEU:E:206:20.86:26.22:37.29 N:GLY:E:214:28.5:31.82:42.48
N:CYS:E:207: 19.96:25.22:39.11 CA:GLY:E:214:28.5:32.58:41.21
CA:CYS:E:207:20.97:25.78:40.04 C:GLY:E:214:28.19:31.71:39.95
CB:CYS:E:207:20.42:25.54:41.43 O:GLY:E:214:27.34:32.06:39.13
SG:CYS:E:207:18.82:26.3:41.77 N:NLG:E:215:29:30.66:39.73
C:CYS:E:207:22.28:25.07:40.02 CA:NLG:E:215:29.01:30.01 :38.43
O:CYS:E:207:22.38:23.99:39.48 CB:NLG:E:215:29.6:30.89:37.3
N:CYS:E:208:23.35:25.65:40.6 CG:NLG:E:215:29.34:30.35:35.86
CA:CYS:E:208:24.63:24.99:40.84 OD1:NLG:E:215:28.45:29.51:35.68
CB:CYS:E:208:25.94:25.85:40.79 ND2:NLG:E:215:30.19:30.71 :34.92
SG:CYS:E:208:26.18:26.61 :39.12 C:NLG:E:215:29.56:28.64:38.57
C:CYS:E:208:24.7:24.4:42.3 O:NLG:E:215:30.34:28.33:39.47
O:CYS:E:208:23.85:24.63:43.13 N:CYS:E:216:29.13:27.76:37.67
N:HIS:E:209:25.71 :23.51 :42.64 CA:CYS:E:216:29.64:26.4:37.56
CA:HIS:E:209:25.89:22.8:43.89 CB:CYS:E:216:29.05:25.46:38.59
CB:HIS:E:209:27.04:21.71:43.85 SG:CYS:E:216:27.29:25.15:38.17
ND1 :HIS:E:209:28.17:21.61 :46 C:CYS:E:216:29.67:25.87:36.11
CG:HIS:E:209:27.34:20.94:45.1 O:CYS:E:216:28.95:26.43:35.25
CE1:HIS:E:209:28.57:20.65:46.86 N:SER:E:217:30.6:24.9:35.77
NE2:HIS:E:209:28.14:19.43:46.48 CA:SER:E:217:30.58:24.34:34.39
CD2:HIS:E:209:27.35: 19.59:45.31 CB:SER:E:217:31.98:23.73:33.9
C:HIS:E:209:25.98:23.71 :45.12 OG:SER:E:217:32.99:24.7:33.73
O:HIS:E:209:26.3:24.89:45.04 C:SER:E:217:29.42:23.32:34.24
N:SER:E:210:25.7:23.18:46.39 0:SER:E:217:28.91 :23.21 :33.13
CA:SER:E:210:25.57:24.06:47.54 N:GLN:E:218:29.05:22.57:35.37
CB:SER:E:210:25.03:23.38:48.78 CA:GLN:E:218:28.17:21.42:35.46
OG:SER:E:210:25.72:22.2:49.08 CB:GLN:E:218:28.9:20.11 :35.28
C:SER:E:210:26.9:24.74:48.01 CG:GLN:E:218:29.4:19.64:33.84
O:SER:E:210:26.81 :25.79:48.63 CD:GLN:E:218:28.47: 19.76:32.65
N:GLU:E:211 :28.08:24.25:47.57 OE1 :GLN:E:218:28.83:20.12:31.56
CA:GLU:E:211:29.35:24.92:47.94 NE2:GLN:E:218:27.2: 19.35:32.87
CB:GLU:E:211:30.41 :23.93:48.44 C:GLN:E:218:27.61 :21.38:36.89
CG:GLU:E:211:30.06:23.41:49.92 O:GLN:E:218:28.32:21.61:37.86
CD:GLU:E:211:28.96:22.29:50.07 N:PRO:E:219:26.26:21.19:37.04
OE1:GLU:E:211:29.31:21.13:49.72 CD:PRO:E:219:25.38:20.93:35.88
OE2:GLU:E:211:27.86:22.64:50.5 CA:PRO:E:219:25.61:20.77:38.29
C:GLU:E:211:29.91:25.89:46.84 CB:PRO:E:219:24.13:20.64:37.92
O:GLU:E:211:30.91:26.54:47.07 CG:PRO:E:219:24.16:20.18:36.42
N:CYS:E:212:29.32:25.95:45.66 C:PRO:E:219:26.17: 19.51 :39.02
CA:CYS:E:212:29.59:26.95:44.66 0:PRO:E:219:26.87: 18.72:38.4
CB:CYS:E:212:28.91 :26.69:43.27 N:ASP:E:220:25.77:19.4:40.3
SG:CYS:E:212:29.19:25.15:42.22 CA:ASP:E:220:25.98: 18.12:41.04
C:CYS:E:212:29.26:28.4:44.98 CB:ASP:E:220:25.31 : 16.95:40.33
0:CYS:E:212:28.28:28.61 :45.71 CG:ASP:E:220:23.89: 17.13:39.92
OD1 :ASP:E:220:23.44: 16.32:39.15 HG21 :VAL:E:226:31.93:28.84:37.11
OD2:ASP:E:220:23.24: 18.09:40.42 HG22:VAL:E:226:32.47:30.24:36.32
C:ASP:E:220:27.5: 17.73:41.42 HG23:VAL:E:226:33.54:28.97:36.2
O:ASP:E:220:27.81 : 16.57:41.62 C:VAL:E:226:34.7:29.5:40.35
N:ASP:E:221 :28.39: 18.71:41.48 O:VAL:E:226:35.88:29.82:40.25
CA:ASP:E:221 :29.86: 18.6:41.84 N:ALA:E:227:33.97:29.56:41.44
CB:ASP:E:221:30.61:18:40.49 CA:ALA:E:227:34.3:30.03:42.72
CG:ASP:E:221 :31.97: 17.48:40.85 CB:ALA:E:227:34.09:31.49:42.82
OD1 :ASP:E:221 :32.62: 16.69:40.06 C:ALA:E:227:33.49:29.35:43.82
OD2:ASP:E:221 :32.51 : 17.85:41.93 0:ALA:E:227:32.57:28.62:43.48
C:ASP:E:221 :30.44:19.95:42.34 N:CYS:E:228:33.82:29.49:45.13
O:ASP:E:221 :30.23:20.99:41.66 CA:CYS:E:228:33.47:28.67:46.28
N:PRO:E:222:31.11 :20.02:43.52 CB:CYS:E:228:34.74:28.12:46.97
CD:PRO:E:222:31.15: 18.96:44.49 SG:CYS:E:228:35.67:26.93:45.94
CA:PRO:E:222:31.75:21.26:43.97 C:CYS:E:228:32.68:29.5:47.25
CB:PRO:E:222:31.88:21:45.5 O:CYS:E:228:32.97:30.69:47.42
CG:PRO:E:222:32.02: 19.46:45.64 N:ARG:E:229:31.52:29.02:47.82
C:PRO:E:222:33.06:21.42:43.29 CA:ARG:E:229:30.65:29.51:48.86
0:PRO:E:222:33.73:22.41 :43.53 CB:ARG:E:229:29.69:28.47:49.35
N:THR:E:223:33.47:20.48:42.39 CG:ARG:E:229:28.83:28.81:50.55
CA:THR:E:223:34.74:20.54:41.73 CD:ARG:E:229:27.42:28.23:50.65
CB:THR:E:223:35.21 :19.16:41.32 NE:ARG:E:229:27.46:26.78:50.8
OG1 :THR:E:223:35.02: 18.3:42.41 CZ:ARG:E:229:27.45:26.13:51.98
CG2:THR:E:223:36.72: 19.11 :40.9 NH1 :ARG:E:229:27.86:26.6:53.1
HG21 :THR:E:223:37.34:19.8:41.51 NH2:ARG:E:229:27.29:24.8:52.03
HG22:THR:E:223:37.15:18.09:40.84 C:ARG:E:229:31.5:30.03:50.04
HG23:THR:E:223:36.88:19.62:39.93 0:ARG:E:229:31.35:31.16:50.5
C:THR:E:223:34.62:21.38:40.51 N:ASN:E:230:32.28:29.08:50.63
0:THR:E:223:35.64:21.8:39.96 CA:ASN:E:230:33.22:29.31 :51.77
N:LYS:E:224:33.35:21.66:40.11 CB:ASN:E:230:32.79:28.43:52.97
CA:LYS:E:224:33.24:22.29:38.72 CG:ASN:E:230:31.43:28.9:53.5
CB:LYS:E:224:32.07:21.48:38.02 OD1 :ASN:E:230:30.98:30.02:53.47
CG:LYS:E:224:32.48:20:37.86 ND2:ASN:E:230:30.72:27.95:54.11
CD:LYS:E:224:33.57: 19.64:36.83 C:ASN:E:230:34.63:29.02:51.19
CE:LYS:E:224:33.57: 18.14:36.51 O:ASN:E:230:35.3:29.88:50.7
NZ:LYS:E:224:34.02: 17.31 :37.62 N:PHE:E:231 :35.19:27.76:51.27
C:LYS:E:224:32.8:23.76:38.88 CA:PHE:E:231 :36.63:27.57:51.13
0:LYS:E:224:32.48:24.42:37.9 CB:PHE:E:231:37.28:27.14:52.51
N:CYS:E:225:32.62:24.25:40.12 CG:PHE:E:231 :36.91 :28.02:53.61
CA:CYS:E:225:32.37:25.67:40.47 CD1 :PHE:E:231 :37.4:29.27:53.6
CB:CYS:E:225:32.07:25.85:41.93 CE1 :PHE:E:231:37.08:30.25:54.61
SG:CYS:E:225:31.03:24.59:42.71 CZ:PHE:E:231 :36.16:29.85:55.57
C:CYS:E:225:33.4:26.61:40.11 CD2:PHE:E:231 :36.11 :27.6:54.65
O:CYS:E:225:34.56:26.4:40.4 CE2:PHE:E:231:35.7:28.51 :55.61
N:VAL:E:226:33.09:27.78:39.42 C:PHE:E:231 :36.99:26.62:49.94
CA:VAL:E:226:34.02:28.9:39.09 0:PHE:E:231 :36.39:25.62:49.74
CB:VAL:E:226:33.41 :30.06:38.22 N:TYR:E:232:38.1:26.94:49.24
CG1 :VAL:E:226:34.46:31.08:37.88 CA:TYR:E:232:38.64:26.13:48.23
HG11 :VAL:E:226:34.06:31.83:37.16 CB:TYR:E:232:39:27.07:47.06
HG12:VAL:E:226:34.72:31.63:38.81 CG:TYR:E:232:39.7:26.38:45.98
HG13:VAL:E:226:35.36:30.68:37.38 CD1 :TYR:E:232:39.12:25.37:45.28
CG2:VAL:E:226:32.87:29.42:36.96 CE1 :TYR:E:232:39.79:24.77:44.21
CZ:TYR:E:232:41.08:25.18:43.87 HG22:VAL:E:238:37.38:22.19:52.42
OH:TYR:E:232:41.68:24.55:42.72 HG23:VAL:E:238:36.9:23.7:51.37
CD2:TYR:E:232:41.01 :26.8:45.61 C:VAL:E:238:33.75:23.79:51.72
CE2:TYR:E:232:41.73:26.2:44.53 0:VAL:E:238:34.19:24.96:51.82
C:TYR:E:232:39.84:25.25:48.8 N:GLU:E:239:32.71 :23.41 :52.37
O:TYR:E:232:40.9:25.81:49.2 CA:GLU:E:239:32.05:24.22:53.4
N:LEU:E:233:39.73:23.95:48.82 CB:GLU:E:239:30.81 :23.5:53.87
CA:LEU:E:233:40.59:22.94:49.28 CG:GLU:E:239:30.21:24.12:55.12
CB:LEU:E:233:40.03:22.32:50.6 CD:GLU:E:239:28.81:23.44:55.33
CG:LEU:E:233:40.83:21.14:51.25 OE1:GLU:E:239:27.86:23.85:54.59
CD1 :LEU:E:233:42.18:21.68:51.79 OE2:GLU:E:239:28.67:22.64:56.24
CD2:LEU:E:233:39.99:20.51 :52.34 C:GLU:E:239:32.92:24.61:54.57
C:LEU:E:233:40.85:21.94:48.17 0:GLU:E:239:32.92:25.81 :54.98
0:LEU:E:233:39.99:21.22:47.71 N:THR:E:240:33.78:23.68:55.1
N: ASP:E:234:42.11 :21.77:47.75 CA:THR:E:240:34.69:24.04:56.18
CA:ASP:E:234:42.63:20.8:46.82 CB:THR:E:240:34.21:23.69:57.52
CB:ASP:E:234:42.93: 19.44:47.57 OG1 :THR:E:240:35.05:24.11 :58.59
CG:ASP:E:234:43.93: 19.61 :48.63 CG2:THR:E:240:34:22.18:57.75
OD1 :ASP:E:234:44.09: 18.66:49.46 HG21 :THR:E:240:33.5:21.7:56.88
OD2:ASP:E:234:44.73:20.61 :48.55 HG22:THR:E:240:34.96:21.65:57.97
C:ASP:E:234:41.69:20.4:45.6 HG23:THR:E:240:33.31 :22.17:58.62
0:ASP:E:234:41.32: 19.26:45.35 C:THR:E:240:35.99:23.18:55.98
N:GLY:E:235:41.19:21.38:44.78 O:THR:E:240:35.93:22.17:55.26
CA:GLY:E:235:40.35:21.28:43.6 N:CYS:E:241 :37.17:23.56:56.59
C:GLY:E:235:38.88:21.35:43.7 CA:CYS:E:241 :38.36:22.75:56.41
0:GLY:E:235:38.1 :21.51 :42.76 CB:CYS:E:241:39.52:23.67:55.82
N:ARG:E:236:38.39:21.42:44.88 SG:CYS:E:241 :38.9:24.73:54.54
CA:ARG:E:236:36.98:21.52:45.18 C:CYS:E:241 :38.82:22.09:57.66
CB:ARG:E:236:36.32:20.08:45.45 0:CYS:E:241 :39.42:22.82:58.43
CG:ARG:E:236:36.19:19.57:46.84 N:PRO:E:242:38.61 :20.89:58
CD:ARG:E:236:37.52:19.51 :47.61 CD:PRO:E:242:37.62:20.01 :57.35
NE:ARG:E:236:37.38: 18.61 :48.8 CA:PRO:E:242:39.23:20.26:59.15
CZ:ARG:E:236:38.36: 18.09:49.49 CB:PRO:E:242:38.92: 18.76:58.92
NH1 :ARG:E:236:39.6: 18.52:49.33 CG:PRO:E:242:37.52:18.87:58.29
NH2:ARG:E:236:38.11 : 17.22:50.43 C:PRO:E:242:40.7:20.44:59.39
C:ARG:E:236:36.62:22.49:46.29 O:PRO:E:242:41.36:20.49:58.4
0:ARG:E:236:37.43:22.8:47.16 N:PRO:E:243:41.35:20.4:60.58
N:CYS:E:237:35.39:23.06:46.17 CD:PRO:E:243:40.74:20.76:61.85
CA:CYS:E:237:34.76:23.95:47.19 CA:PRO:E:243:42.75:19.92:60.78
CB:CYS:E:237:33.55:24.8:46.49 CB:PRO:E:243:42.86: 19.61 :62.31
SG:CYS:E:237:34.21:25.73:44.99 CG:PRO:E:243:41.98:20.75:62.8
C:CYS:E:237:34.18:23.14:48.33 C:PRO:E:243:43.19: 18.65:59.96
0:CYS:E:237:33.25:22.32:48.19 0:PRO:E:243:42.33: 17.8:59.8
N:VAL:E:238:34.68:23.39:49.53 N:PRO:E:244:44.41 : 18.41 :59.48
CA:VAL:E:238:34.37:22.73:50.82 CD:PRO:E:244:44.81 :17.3:58.6
CB:VAL:E:238:35.51 :21.95:51.44 CA:PRO:E:244:45.53:19.33:59.65
CG1 :VAL:E:238:36.03:20.92:50.37 CB:PRO:E:244:46.75: 18.34:59.55
HG11 :VAL:E:238:35.19:20.3:49.98 CG:PRO:E:244:46.32:17.47:58.35
HG12:VAL:E:238:36.49:21.45:49.51 C:PRO:E:244:45.6:20.41:58.5
HG13:VAL:E:238:36.76:20.12:50.63 O:PRO:E:244:46.72:20.89:58.26
CG2:VAL:E:238:36.59:22.87:52.04 N:TYR:E:245:44.47:20.84:57.9
HG21 :VAL:E:238:36.26:23.39:52.96 CA:TYR:E:245:44.39:21.77:56.8
CB:TYR:E:245:43.19:21.63:55.75 N:ASP:E:250:41.36:33.86:53.79
CG:TYR:E:245:43.34:20.35:55.14 CA:ASP:E:250:40:33.68:54
CD1 :TYR:E:245:42.26: 19.42:55.26 CB:ASP:E:250:39.36:34.89:54.89
CE1:TYR:E:245:42.34: 18.23:54.51 CG:ASP:E:250:40:35.19:56.13
CZ:TYR:E:245:43.43: 18:53.74 OD1 :ASP:E:250:40.38:34.18:56.77
OH:TYR:E:245:43.5: 16.74:53.17 OD2:ASP:E:250:40.07:36.45:56.5
CD2:TYR:E:245:44.49:20.06:54.39 C:ASP:E:250:39.1 :33.3:52.84
CE2:TYR:E:245:44.47: 18.87:53.64 O:ASP:E:250:38:33.82:52.72
C:TYR:E:245:44.52:23.18:57.38 N:TRP:E:251 :39.56:32.26:52.09
0:TYR:E:245:43.87:23.47:58.38 CA:TRP:E:251:38.72:31.78:50.96
N:TYR:E:246:45.33:24.06:56.76 CB:TRP:E:251 :38.5:32.81 :49.85
CA:TYR:E:246:45.6:25.36:57.32 CG:TRP:E:251:39.55:33.15:48.87
CB:TYR:E:246:47.06:25.63:57.28 CD1 :TRP:E:251 :40.38:34.25:48.93
CG:TYR:E:246:47.68:24.95:58.45 NE1 :TRP:E:251:41.23:34.17:47.85
CD1 :TYR:E:246:48.27:23.66:58.27 CE2:TRP:E:251:41 :33.15:47.08
CE1:TYR:E:246:48.92:22.98:59.35 CD2:TRP:E:251 :39.89:32.48:47.59
CZ:TYR:E:246:48.88:23.53:60.61 CE3:TRP:E:251:39.38:31.39:46.88
OH:TYR:E:246:49.3:22.79:61.79 CZ3:TRP:E:251 :40.01 :31.07:45.67
CD2:TYR:E:246:47.77:25.52:59.7 CZ2:TRP:E:251 :41.65:32.83:45.95
CE2:TYR:E:246:48.24:24.78:60.83 CH2:TRP:E:251 :41.09:31.75:45.2
C:TYR:E:246:44.98:26.44:56.45 C:TRP:E:251:39.36:30.51 :50.4
0:TYR:E:246:45.24:26.58:55.26 0:TRP:E:251 :38.82:29.87:49.52
N:HIS:E:247:44.07:27.24:57 N:ARG:E:252:40.49:30.13:50.99
CA:HIS:E:247:43.33:28.32:56.33 CA:ARG:E:252:41.2:28.92:50.6
CB:HIS:E:247:42.36:28.85:57.36 CB:ARG:E:252:42.48:29.3:49.89
ND1 :HIS:E:247:39.96:28.32:56.71 CG:ARG:E:252:42.28:29.89:48.49
CG:HIS:E:247:41.06:28.15:57.58 CD:ARG:E:252:43.61 :30.1 :47.62
CE1:HIS:E:247:39:27.71:57.23 NE:ARG:E:252:44.12:28.77:47.27
NE2:HIS:E:247:39.37:27.15:58.43 CZ:ARG:E:252:45.06:28.57:46.4
CD2:HIS:E:247:40.77:27.33:58.6 NH1 :ARG:E:252:45.76:29.57:45.84
C:HIS:E:247:44.21 :29.41 :55.74 NH2:ARG:E:252:45.39:27.34:46.16
0:HIS:E:247:45.27:29.79:56.21 C:ARG:E:252:41.63:28.09:51.83
N:PHE:E:248:43.78:29.95:54.57 0:ARG:E:252:41.81:28.62:52.89
CA:PHE:E:248:44.55:30.81 :53.7 N:CYS:E:253:41.98:26.82:51.63
CB:PHE:E:248:45.46:29.93:52.74 CA:CYS:E:253:42.65:26.02:52.64
CG:PHE:E:248:46.19:30.84:51.8 CB:CYS:E:253:41.67:25.16:53.45
CD1 :PHE:E:248:47.07:31.77:52.41 SG:CYS:E:253:40.43:26.15:54.32
CE1:PHE:E:248:47.91 :32.47:51.54 C:CYS:E:253:43.72:25.25:51.83
CZ:PHE:E:248:47.94:32.25:50.18 O:CYS:E:253:43.53:24.76:50.72
CD2:PHE:E:248:46.28:30.5:50.47 N:VAL:E:254:44.88:25.17:52.49
CE2:PHE:E:248:47.18:31.24:49.63 CA:VAL:E:254:46.15:24.7:51.94
C:PHE:E:248:43.65:31.84:53.02 CB:VAL:E:254:46.95:25.82:51.29
O:PHE:E:248:42.63:31.45:52.38 CG1:VAL:E:254:46.17:26.49:50.17
N:GLN:E:249:43.95:33.11 :53.17 HG11 :VAL:E:254:46.86:27.19:49.67
CA:GLN:E:249:43.32:34.16:52.35 HG12:VAL:E:254:45.87:25.68:49.48
CB:GLN:E:249:43.48:33.99:50.82 HG13:VAL:E:254:45.3:27:50.63
CG:GLN:E:249:44.89:34.23:50.26 CG2:VAL:E:254:47.43:26.83:52.39
CD:GLN:E:249:45.6:35.58:50.47 HG21 :VAL:E:254:46.59:27.23:52.99
OE1 :GLN:E:249:45.05:36.53:51.03 HG22:VAL:E:254:48.23:26.34:53
NE2:GLN:E:249:46.83:35.62:50 HG23:VAL:E:254:47.95:27.72:51.99
C:GLN:E:249:41.92:34.55:52.78 C: VAL:E:254:46.91 :23.9:52.99
0:GLN:E:249:41.34:35.5:52.21 0:VAL:E:254:46.51 :23.85:54.14
N:NLG:E:255:47.87:23.06:52.63 CA: ASP:E:261 :56.69:27.69:50.12
CA:NLG:E:255:48.69:22.33:53.62 CB:ASP:E:261 :55.94:27:48.95
CB:NLG:E:255:48.98:20.97:53.03 CG:ASP:E:261 :56.59:25.71 :48.61
CG:NLG:E:255:50.07:20.82:51.96 OD1:ASP:E:261:55.94:24.88:47.93
OD1 :NLG:E:255:50.63:21.8:51.49 OD2:ASP:E:261 :57.76:25.52:49.06
ND2:NLG:E:255:50.26:19.66:51.5 C:ASP:E:261 :56.4:29.16:50.31
C:NLG:E:255:49.85:23.13:54.16 0:ASP:E:261:57.21 :29.99:49.84
0:NLG:E:255:49.98:24.35:53.89 N:LEU:E:262:55.18:29.46:50.88
N:PHE:E:256:50.66:22.51 :55.09 CA:LEU:E:262:54.83:30.87:51.1
CA:PHE:E:256:51.79:23.23:55.72 CB:LEU:E:262:53.4:30.99:51.68
CB:PHE:E:256:52.27:22.55:56.98 CG:LEU:E:262:52.15:30.82:50.72
CG:PHE:E:256:53.62:22.97:57.57 CD1 :LEU:E:262:50.91 :30.56:51.47
CD1 :PHE:E:256:54.54:21.91 :57.88 CD2:LEU:E:262:51.97:31.9:49.67
CE1:PHE:E:256:55.69:22.13:58.53 C:LEU:E:262:55.82:31.61 :51.95
CZ:PHE:E:256:55.97:23.38:59.02 O:LEU:E:262:56.03:32.8:51.87
CD2:PHE:E:256:53.96:24.23:58.21 N:HIS:E:263:56.45:30.88:52.94
CE2:PHE:E:256:55.17:24.51 :58.82 CA:HIS:E:263:57.53:31.39:53.78
C:PHE:E:256:52.97:23.55:54.74 CB:HIS:E:263:57.78:30.49:54.93
0:PHE:E:256:53.54:24.66:54.79 ND1 :HIS:E:263:60.1:30.15:55.83
N:SER:E:257:53.31 :22.58:53.8 CG:HIS:E:263:58.89:30.83:55.82
CA:SER:E:257:54.3:22.6:52.75 CE1 :HIS:E:263:60.69:30.6:56.97
CB:SER:E:257:54.26:21.28:51.9 NE2:HIS:E:263:59.89:31.43:57.67
OG:SER:E:257:55.13:21.31 :50.77 CD2:HIS:E:263:58.69:31.54:56.94
C:SER:E:257:54.04:23.8:51.82 C:HIS:E:263:58.86:31.77:53.1
0:SER:E:257:54.92:24.54:51.35 0:HIS:E:263:59.58:32.73:53.52
N:PHE:E:258:52.73:24.09:51.52 N:HIS:E:264:59.26:30.93:52.12
CA:PHE:E:258:52.33:25.31 :50.8 CA:HIS:E:264:60.42:31.15:51.25
CB:PHE:E:258:50.83:25.22:50.4 CB:HIS:E:264:60.59:29.96:50.28
CG:PHE:E:258:50.31 :26.21 :49.29 ND1 :HIS:E:264:62.97:29.88:49.92
CD1 :PHE:E:258:50.2:27.62:49.45 CG:HIS:E:264:61.74:30.1 :49.31
CE1:PHE:E:258:49.66:28.37:48.44 CE1 :HIS:E:264:63.8:29.98:48.87
CZ:PHE:E:258:49.07:27.83:47.32 NE2:HIS:E:264:63.17:30.29:47.72
CD2:PHE:E:258:49.7:25.68:48.12 CD2:HIS:E:264:61.81 :30.4:48.03
CE2:PHE:E:258:49.05:26.48:47.15 C:HIS:E:264:60.53:32.48:50.5
C:PHE:E:258:52.64:26.54:51.6 O:HIS:E:264:59.53:33.25:50.56
O:PHE:E:258:53.17:27.51 :51.06 N:LYS:E:265:61.58:32.9:49.82
N:CYS:E:259:52.41 :26.52:52.98 CA:LYS:E:265:61.66:34.2:49.12
CA:CYS:E:259:52.78:27.63:53.82 CB:LYS:E:265:63.15:34.43:48.65
CB:CYS:E:259:52.16:27.44:55.26 CG:LYS:E:265:64.38:34.56:49.65
SG:CYS:E:259:51.79:29.05:56.1 CD:LYS:E:265:64.13:35.62:50.69
C:CYS:E:259:54.27:27.83:53.92 CE:LYS:E:265:64.82:36.96:50.27
0:CYS:E:259:54.78:28.96:53.99 NZ:LYS:E:265:64.08:38.15:50.74
N:GLN:E:260:55.12:26.8:54.01 C:LYS:E:265:60.69:34.38:47.96
CA:GLN:E:260:56.51 :26.72:53.84 O:LYS:E:265:60.65:33.47:47.12
CB:GLN:E:260:57.06:25.22:54.15 N:CYS:E:266:60.19:35.6:47.78
CG:GLN:E:260:56.96:24.64:55.55 CA:CYS:E:266:59.47:35.98:46.54
CD:GLN:E:260:58.11 :23.65:55.81 CB:CYS:E:266:58.94:37.41 :46.78
OE1 :GLN:E:260:57.89:22.45:55.57 SG:CYS:E:266:57.76:37.49:48.12
NE2:GLN:E:260:59.32:24.01 :56.33 C:CYS:E:266:60.13:35.95:45.23
C:GLN:E:260:57.02:27.21 :52.51 O:CYS:E:266:61.35:36.22:45.02
O:GLN:E:260:58.02:27.85:52.43 N:LYS:E:267:59.35:35.56:44.18
N:ASP:E:261 :56.39:26.96:51.33 CA:LYS:E:267:59.93:35.54:42.82
CB:LYS:E:267:59.37:34.29:42.08 N:GLY:E:273:55.88:44.87:49.21
CG:LYS:E:267:59.75:32.98:42.76 CA:GLY:E:273:56.54:44.05:50.2
CD:LYS:E:267:59.4:31.83:41.86 C:GLY:E:273:56.42:42.62:50.1
CE:LYS:E:267:60.01:30.45:42.29 0:GLY:E:273:55.53:42.12:49.41
NZ:LYS:E:267:59.58:29.36:41.36 N:CYS:E:274:57.36:41.86:50.67
C:LYS:E:267:59.51 :36.73:42.08 CA:CYS:E:274:57.39:40.43:50.55
O:LYS:E:267:60.19:37.12:41.15 CB:CYS:E:274:58.62:39.89:49.77
N:ASN:E:268:58.3:37.33:42.42 SG:CYS:E:274:58.75:38.13:49.78
CA:ASN:E:268:57.82:38.5:41.88 C:CYS:E:274:57.22:39.91 :51.99
CB:ASN:E:268:56.46:38.71:42.53 O:CYS:E:274:57.91 :40.43:52.91
CG:ASN:E:268:55.68:40.03:42.13 N:HIS:E:275:56.36:38.9:52.19
OD1 :ASN:E:268:56.08:40.72:41.21 CA:HIS:E:275:55.99:38.45:53.55
ND2:ASN:E:268:54.55:40.24:42.84 CB:HIS:E:275:54.51 :38.58:53.83
C:ASN:E:268:58.69:39.73:42.02 ND1 :HIS:E:275:53.76:40.88:54.54
O:ASN:E:268:59.2:40.04:43.08 CG:HIS:E:275:54.04:39.99:53.52
N:SER:E:269:58.98:40.47:40.97 CE1 :HIS:E:275:53:41.89:53.94
CA:SER:E:269:59.9:41.56:41.01 NE2:HIS:E:275:52.98:41.66:52.6
CB:SER:E:269:60.76:41.65:39.69 CD2:HIS:E:275:53.62:40.47:52.35
OG:SER:E:269:59.93:41.72:38.5 C:HIS:E:275:56.38:36.97:53.81
C:SER:E:269:59.31 :42.91 :41.41 0:HIS:E:275:56.24:36.22:52.88
O:SER:E:269:60.04:43.94:41.56 N:GLN:E:276:56.91 :36.68:55.04
N:ARG:E:270:58.02:42.98:41.75 CA:GLN:E:276:57.26:35.34:55.54
CA: ARG:E:270:57.41 :44.26:42.02 CB:GLN:E:276:58.48:35.47:56.46
CB:ARG:E:270:55.88:44.18:41.84 CG:GLN:E:276:59.77:35.74:55.73
CG: ARG:E:270:55.41 :43.92:40.38 CD:GLN:E:276:60.89:35.77:56.8
CD:ARG:E:270:53.95:44.27:40.12 OE1:GLN:E:276:61.25:36.9:57.19
NE:ARG:E:270:53.08:43.39:40.94 NE2:GLN:E:276:61.4:34.59:57.34
CZ:ARG:E:270:52.77:42.12:40.68 C:GLN:E:276:56.1:34.72:56.27
NH1 :ARG:E:270:53.07:41.51:39.54 0:GLN:E:276:55.9:34.96:57.48
NH2:ARG:E:270:52.08:41.46:41.58 N:TYR:E:277:55.3:33.91 :55.58
C:ARG:E:270:57.76:45.04:43.31 CA:TYR:E:277:54.13:33.18:56.1
O:ARG:E:270:58.32:44.43:44.17 CB:TYR:E:277:53.27:32.61 :54.93
N:ARG:E:271 :57.42:46.34:43.35 CG:TYR:E:277:52.53:33.61 :54.06
CA:ARG:E:271 :57.92:47.27:44.37 CD1 :TYR:E:277:53.23:34.48:53.24
CB:ARG:E:271:57.46:48.79:44.15 CE1 :TYR:E:277:52.59:35.42:52.46
CG:ARG:E:271 :58.24:49.42:42.98 CZ:TYR:E:277:51.23:35.53:52.5
CD:ARG:E:271 :59.7:49.79:43.3 OH:TYR:E:277:50.56:36.37:51.6
NE:ARG:E:271 :59.82:50.35:44.7 CD2:TYR:E:277:51.16:33.78:54.12
CZ:ARG:E:271:60.5:49.78:45.62 CE2:TYR:E:277:50.47:34.65:53.29
NH1 :ARG:E:271 :61.4:48.89:45.49 C:TYR:E:277:54.37:32.15:57.12
NH2:ARG:E:271 :60.28:50.18:46.85 O:TYR:E:277:55.21 :31.22:57.01
C:ARG:E:271 :57.43:46.96:45.76 N:VAL:E:278:53.66:32.22:58.23
O: ARG:E:271 :58.07:47.27:46.76 CA:VAL:E:278:53.82:31.45:59.48
N:GLN:E:272:56.13:46.51 :45.83 CB:VAL:E:278:54.28:32.39:60.62
CA:GLN:E:272:55.5:45.85:46.94 CG1 :VAL:E:278:55.85:32.57:60.49
CB :GLN:E:272:54.31 :45.11 :46.42 HG11 :VAL:E:278:56.37:31.67:60.88
CG:GLN:E:272:54.64:43.86:45.64 HG12:VAL:E:278:56.14:32.89:59.47
CD:GLN:E:272:53.47:43.44:44.82 HG13:VAL:E:278:56.03:33.42:61.18
OE1 :GLN:E:272:53.56:43.42:43.57 CG2:VAL:E:278:53.6:33.78:60.63
NE2:GLN:E:272:52.3:43.18:45.38 HG21 :VAL:E:278:52.61:33.79:60.12
C:GLN:E:272:56.27:44.94:47.87 HG22:VAL:E:278:53.31:34.05:61.68
0:GLN:E:272:57.22:44.27:47.5 HG23:VAL:E:278:54.21:34.63:60.26
C:VAL:E:278:52.46:30.92:59.8 CB:CYS:E:284:48.89:29.34:55.65
0:VAL:E:278:51.42:31.1 :59.12 SG:CYS:E:284:50.36:29.83:54.88
N:ILE:E:279:52.38:30.11 :60.89 C:CYS:E:284:48.64:31.68:56.82
CA:ILE:E:279:51.16:29.43:61.33 0:CYS:E:284:47.92:32.14:55.91
CB:ILE:E:279:51.47:27.94:61.61 N:ILE:E:285:49.31 :32.47:57.68
CG2:ILE:E:279:50.18:27.3:62.13 CA:ILE:E:285:49.11 :33.93:57.78
HG21 :ILE:E:279:49.95:27.72:63.13 CB:ILE:E:285:48.47:34.27:59.09
HG22:ILE:E:279:49.28:27.51 :61.52 CG2:ILE:E:285:49.43:33.82:60.21
HG23:ILE:E:279:50.27:26.19:62.06 HG21 :ILE:E:285:50.3:34.5:60.18
CG1:ILE:E:279:52.1:27.2:60.43 HG23:ILE:E:285:49.78:32.78:60.12
HG11 :ILE:E:279:53.13:27.56:60.2 HG23:ILE:E:285:49.78:32.78:60.12
HG12:ILE:E:279:52.3:26.23:60.92 CG1 :ILE:E:285:48.03:35.78:59.19
CD:ILE:E:279:51.11 :27.15:59.21 HG11 :ILE:E:285:48.89:36.46:59.12
C:ILE:E:279:50.67:30.12:62.57 HG12:ILE:E:285:47.42:35.97:58.29
O:ILE:E:279:51.33:30.25:63.61 CD:ILE:E:285:47.18:36.11 :60.44
N:HIS:E:280:49.45:30.64:62.55 C:ILE:E:285:50.38:34.71 :57.5
CA:HIS:E:280:48.76:31.11 :63.75 O:ILE:E:285:51.41 :34.09:57.84
CB:HIS:E:280:48.76:32.62:63.7 N:PRO:E:286:50.49:35.92:56.95
ND1 :HIS:E:280:48.36:33.17:66.18 CD:PRO:E:286:49.43:36.48:56.11
CG:HIS:E:280:47.98:33.28:64.81 CA:PRO:E:286:51.68:36.69:56.74
CE1:HIS:E:280:47.33:33.71 :66.86 CB:PRO:E:286:51.22:38.05:56.23
NE2:HIS:E:280:46.39:34.26:66.11 CG:PRO:E:286:49.96:37.78:55.45
CD2:HIS:E:280:46.72:33.99:64.78 C:PRO:E:286:52.49:36.98:57.97
C:HIS:E:280:47.36:30.65:63.77 O:PRO:E:286:53.71 :37.09:57.84
O:HIS:E:280:46.62:30.92:62.83 N:GLU:E:287:51.86:37.09:59.14
N:ASN:E:281 :46.89:30.04:64.84 CA:GLU:E:287:52.6:37.51 :60.29
CA:ASN:E:281:45.51 :29.57:64.95 CB:GLU:E:287:52.68:39.06:60.46
CB:ASN:E:281 :44.65:30.79:65.4 CG:GLU:E:287:53.48:39.84:59.33
CG:ASN:E:281:43.24:30.33:65.7 CD:GLU:E:287:53.27:41.3:59.46
OD1:ASN:E:281:42.98:29.18:66.01 OE1:GLU:E:287:53.34:41.85:60.56
ND2:ASN:E:281 :42.25:31.29:65.77 OE2:GLU:E:287:53.06:41.94:58.38
C:ASN:E:281:44.88:28.78:63.74 C:GLU:E:287:51.92:37.04:61.55
O:ASN:E:281 :43.79:29.02:63.21 O:GLU:E:287:50.71 :36.94:61.63
N:ASN:E:282:45.66:27.8:63.21 N:CYS:E:288:52.77:36.85:62.57
CA: ASN:E:282:45.21 :26.99:62.1 CA:CYS:E:288:52.35:36.65:63.9
CB:ASN:E:282:43.96:26.07:62.43 CB:CYS:E:288:53.57:36.1:64.72
CG:ASN:E:282:44.36:24.91 :63.35 SG:CYS:E:288:54.3:34.51:64.43
OD1:ASN:E:282:45.47:24.88:63.84 C:CYS:E:288:51.82:37.85:64.47
ND2:ASN:E:282:43.34:24.07:63.54 0:CYS:E:288:52.23:38.95:64.2
C:ASN:E:282:45.06:27.77:60.8 N:PRO:E:289:50.83:37.77:65.26
0:ASN:E:282:44.45:27.23:59.85 CD:PRO:E:289:49.97:36.59:65.6
N:LYS:E:283:45.7:28.86:60.62 CA:PRO:E:289:50.3:38.99:65.93
CA:LYS:E:283:45.9:29.65:59.42 CB:PRO:E:289:49.16:38.5:66.77
CB:LYS:E:283:45.49:31.11 :59.55 CG:PRO:E:289:48.7:37.22:66.08
CG:LYS:E:283:43.98:31.44:59.81 C:PRO:E:289:51.25:39.71 :66.93
CD:LYS:E:283:43.27:30.75:60.87 0:PRO:E:289:52.33:39.27:67.23
CE:LYS:E:283:41.89:31.41 :61.15 N:SER:E:290:50.87:40.93:67.44
NZ:LYS:E:283:41.22:30.81 :62.34 CA:SER:E:290:51.52:41.59:68.55
C:LYS:E:283:47.38:29.7:58.95 CB:SER:E:290:50.64:42.74:69.06
0:LYS:E:283:48.24:29.96:59.75 OG:SER:E:290:50.34:43.66:68.02
N:CYS:E:284:47.49:29.78:57.65 C:SER:E:290:51.84:40.82:69.86
CA:CYS:E:284:48.7:30.16:56.99 O:SER:E:290:50.96:40.27:70.59
N:GLY:E:291 :53.16:40.78:70.21 C:SER:E:297:59.39:24.19:62.25
CA:GLY:E:291:53.67:40.09:71.41 O:SER:E:297:59.23:23.99:61.03
C:GLY:E:291 :53.66:38.57:71.17 N:ASN:E:298:58.39:24.27:63.17
0:GLY:E:291 :53.61:37.77:72.12 CA:ASN:E:298:57:24.06:62.99
N:TYR:E:292:53.64:38.1:69.87 CB:ASN:E:298:56.4:23.38:64.26
CA:TYR:E:292:53.72:36.71 :69.49 CG:ASN:E:298:56.79:24.09:65.52
CB:TYR:E:292:52.43:36.24:68.63 ODl :ASN:E:298:56.25:25.1 :65.9
CG:TYR:E:292:51.24:35.93:69.44 OD1:ASN:E:298:56.25:25.1:65.9
CD1 :TYR:E:292:50.37:37:69.87 C:ASN:E:298:56.3:25.38:62.68
CE1:TYR:E:292:49.06:36.73:70.36 0:ASN:E:298:55.19:25.39:62.15
CZ:TYR:E:292:48.65:35.39:70.48 N:LEU:E:299:56.95:26.54:62.99
OH:TYR:E:292:47.38:35.1 :70.91 CA:LEU:E:299:56.43:27.87:62.73
CD2:TYR:E:292:50.68:34.63:69.47 CB:LEU:E:299:56.33:28.29:61.26
CE2:TYR:E:292:49.44:34.35:70.07 CG:LEU:E:299:57.55:28.16:60.29
C:TYR:E:292:54.88:36.54:68.52 CD1 :LEU:E:299:57.25:28.39:58.76
0:TYR:E:292:55.22:37.41 :67.69 CD2:LEU:E:299:58.82:28.95:60.63
N:THR:E:293:55.59:35.4:68.65 C:LEU:E:299:55.1 :28.24:63.4
CA:THR:E:293:56.78:35.13:67.85 0:LEU:E:299:54.47:29.13:62.88
CB:THR:E:293:58.04:35.81:68.23 N:LEU:E:300:54.7:27.62:64.51
OG1 :THR:E:293:59.09:35.52:67.35 CA:LEU:E:300:53.53:28.08:65.28
CG2:THR:E:293:58.38:35.48:69.74 CB:LEU:E:300:53.08:27.02:66.27
HG21 :THR:E:293:59.11 :36.28:69.98 CG:LEU:E:300:51.76:27.2:67.08
HG22:THR:E:293:57.55:35.54:70.47 CD1 :LEU:E:300:50.61 :27.5:66.08
HG23:THR:E:293:58.96:34.55:69.93 CD2:LEU:E:300:51.51 :25.92:67.84
C:THR:E:293:56.79:33.58:67.45 C:LEU:E:300:53.84:29.39:66.08
0:THR:E:293:56.49:32.7:68.25 O:LEU:E:300:54.69:29.58:66.93
N:MET:E:294:57.03:33.21 :66.17 N:CYS:E:301 :53.04:30.42:65.8
CA:MET:E:294:56.92:31.8:65.79 CA:CYS:E:301 :52.98:31.67:66.51
CB:MET:E:294:56.88:31.51 :64.24 CB:CYS:E:301:52.08:32.72:65.74
CG:MET:E:294:58.17:32.1 :63.56 SG:CYS:E:301 :52.82:33.2:64.2
SD:MET:E:294:58.1 :33.85:63.08 C:CYS:E:301 :52.42:31.41 :67.89
CE:MET:E:294:59.75:34.47:63.65 O:CYS:E:301 :51.26:31.02:68.14
C:MET:E:294:57.96:30.83:66.41 N:THR:E:302:53.27:31.79:68.9
O:MET:E:294:59.16:31.01 :66.59 CA:THR:E:302:53.07:31.52:70.33
N:ASN:E:295:57.43:29.6:66.85 CB:THR:E:302:53.86:30.31 :70.79
CA:ASN:E:295:58.2:28.46:67.29 OG1 :THR:E:302:53.07:29.61 :71.83
CB:ASN:E:295:57.33:27.36:67.96 CG2:THR:E:302:55.24:30.69:71.34
CG:ASN:E:295:57.02:27.63:69.42 HG21 :THR:E:302:55.89:31.16:70.56
OD1 :ASN:E:295:57.69:28.43:70.08 HG22:THR:E:302:55.24:31.48:72.12
ND2:ASN:E:295:55.93:27.08:69.97 HG23:THR:E:302:55.81 :29.81 :71.69
C:ASN:E:295:59.04:27.84:66.07 C:THR:E:302:53.43:32.84:71.06
0:ASN:E:295:58.55:27.7:64.94 O:THR:E:302:54.36:33.53:70.66
N:SER:E:296:60.27:27.56:66.31 N:PRO:E:303:52.72:33.18:72.19
CA:SER:E:296:61.3:26.92:65.44 CD:PRO:E:303:51.45:32.6:72.67
CB:SER:E:296:62.63:26.58:66.21 CA:PRO:E:303:53.09:34.34:73.08
OG:SER:E:296:63.38:27.73:66.65 CB:PRO:E:303:51.99:34.26:74.19
C:SER:E:296:60.82:25.67:64.7 CG:PRO:E:303:50.75:33.67:73.51
O:SER:E:296:60.38:24.72:65.34 C:PRO:E:303:54.49:34.34:73.62
N:SER:E:297:60.97:25.57:63.45 O:PRO:E:303:55.01 :33.26:73.9
CA:SER:E:297:60.84:24.41 :62.68 N:CYS:E:304:55.09:35.51 :73.6
CB:SER:E:297:61.56:23.14:63.24 CA:CYS:E:304:56.45:35.74:74.02
OG:SER:E:297:62.01 :22.21 :62.19 CB:CYS:E:304:57.19:37.04:73.49
SG:CYS:E:304:56.24:38.57:73.77 CD2:PHE:F:705:36.47:56.86:50.11
C:CYS:E:304:56.66:35.47:75.48 CE2:PHE:F:705:36.93:55.73:49.46
O:CYS:E:304:55.78:35.49:76.36 C:PHE:F:705:38.28:60.07:50.32
N:LEU:E:305:57.84:35.01 :75.84 O:PHE:F:705:39.23:59.31 :50.43
CA:LEU:E:305:58.33:34.96:77.24 N:GLU:F:706:38.27:61.06:49.39
CB:LEU:E:305:59.61 :34.14:77.27 CA:GLU:F:706:39.34:61.4:48.46
CG:LEU:E:305:59.59:32.71:76.8 CB:GLU:F:706:38.96:62.55:47.43
CD1 :LEU:E:305:60.91 :32.05:77.04 CG:GLU:F:706:40.08:63.51 :46.97
CD2:LEU:E:305:58.43:31.95:77.43 CD:GLU:F:706:39.51 :64.62:46.04
C:LEU:E:305:58.47:36.33:77.88 OE1 :GLU:F:706:38.36:65.09:46.32
O:LEU:E:305:58.9:37.29:77.31 OE2:GLU:F:706:40.14:64.98:44.99
N:GLY:E:306:58.22:36.4:79.26 C:GLU:F:706:40.59:61.6:49.19
CA:GLY:E:306:58.37:37.55:80.16 O:GLU:F:706:41.65:61.28:48.76
C:GLY:E:306:57.76:38.82:79.76 N:ASP:F:707:40.57:62.44:50.31
O:GLY:E:306:56.85:38.84:78.97 CA:ASP:F:707:41.76:62.88:50.99
N:PRO:E:307:58.07:40.05:80.14 CB:ASP:F: 707:41.46:64.03:52.01
CD:PRO:E:307:58.91 :40.2:81.31 CG:ASP:F:707:40.74:65.15:51.35
CA:PRO:E:307:57.31:41.23:79.81 OD1:ASP:F:707:39.87:65.74:52.07
CB:PRO:E:307:57.56:42.11 :81.04 OD2:ASP:F:707:41 :65.45:50.17
CG:PRO:E:307:58.98:41.74:81.45 C:ASP:F:707:42.43:61.7:51.68
C:PRO:E:307:57.98:41.84:78.56 O:ASP:F:707:43.68:61.58:51.7
O:PRO:E:307:58.45:42.99:78.55 N:TYR:F:708:41.7:60.78:52.26
N:CYS:E:308:58.03:41.05:77.52 CA:TYR:F:708:42.23:59.57:52.83
CA:CYS:E:308:58.35:41.4:76.21 CB:TYR:F:708:41.08:58.83:53.63
CB:CYS:E:308:58.38:40.13:75.3 CG:TYR:F:708:41.32:57.32:53.87
SG:CYS:E:308:56.9:39.2:75.59 CD1 :TYR:F:708:40.42:56.4:53.18
C:CYS:E:308:57.43:42.47:75.67 CE1 :TYR:F:708:40.55:55.04:53.48
O:CYS:E:308:56.21:42.26:75.84 CZ:TYR:F:708:41.54:54.52:54.39
N:PRO:E:309:57.83:43.54:74.98 OH:TYR:F:708:41.63:53.11 :54.59
CD:PRO:E:309:59.16:44.06:75.02 CD2:TYR:F:708:42.27:56.75:54.8
CA:PRO:E:309:56.93:44.42:74.29 CE2:TYR:F:708:42.43:55.37:55.03
CB:PRO:E:309:57.9:45.4:73.61 C:TYR:F:708:42.9:58.56:51.84
CG:PRO:E:309:59.18:45.49:74.46 O:TYR:F:708:43.99:58:52.1
C:PRO:E:309:55.92:43.83:73.3 N:LEU:F:709:42.23:58.47:50.71
O:PRO:E:309:56.33:42.93:72.61 CA:LEU:F:709:42.72:57.91 :49.46
N:LYS:E:310:54.66:44.26:73.23 CB:LEU:F:709:41.51 :57.85:48.46
CA:LYS:E:310:53.62:43.7:72.47 CG:LEU:F:709:41.82:57.34:46.97
CB:LYS:E:310:52.18:44.04:73.01 CD1 :LEU:F:709:42.47:55.95:47.05
CG:LYS:E:310:51.98:43.4:74.34 CD2:LEU:F:709:40.54:57.31 :46.06
CD:LYS:E:310:51.74:41.95:74.22 C:LEU:F:709:44.08:58.41 :48.99
CE:LYS:E:310:51.28:41.46:75.59 O:LEU:F:709:45.11:57.68:48.82
NZ:LYS:E:310:51.27:39.99:75.58 N:HSD:F:710:44.18:59.77:48.95
C:LYS:E:310:53.68:44.22:71 CA:HSD:F:710:45.43:60.49:48.67
OT1 :LYS:E:310:53.99:43.38:70.11 CB:HSD:F:710:45.21 :62.06:48.59
OT2:LYS:E:310:53.47:45.43:70.78 ND1 :HSD:F:710:43.84:63.78:47.34
N:PHE:F:705:36.51 :61.1 :51.81 CG:HSD:F:710:44.55:62.59:47.41
CA:PHE:F:705:37.07:59.86:51.2 CE1 :HSD:F:710:43.39:63.88:46.08
CB:PHE:F:705:35.87:59.28:50.29 NE2:HSD:F:710:43.8:62.85:45.33
CG:PHE:F:705:36.24:58.07:49.5 CD2:HSD:F:710:44.53:62.01 :46.18
CD1 :PHE:F:705:36.59:58.19:48.14 C:HSD:F:710:46.52:60.23:49.72
CE1 :PHE:F:705:37.13:57.1 :47.45 O:HSD:F:710:47.63:59.93:49.37
CZ:PHE:F:705:37.32:55.86:48.1
CA: ASN:F:711 :47.1 :59.73:51.99 HG21 :VAL:F:715:53.82:55.6:50.45
CB:ASN:F:711 :46.33:59.87:53.41 HG22:VAL:F:715:53.2:54.11 :51.09
CG:ASN:F:711 :46.9:61.17:54.13 HG23:VAL:F:715:52.18:55.21 :49.96
OD1:ASN:F:711:47.6:61.99:53.51 C:VAL:F:715:53.72:57.87:52.14
ND2:ASN:F:711 :46.59:61.25:55.41 0:VAL:F:715:54.56:57.75:51.23
C:ASN:F:711 :47.68:58.36:51.73 N:PRO:F:716:54.13:58.48:53.3
0:ASN:F:711 :48.94:58.12:51.74 CD:PRO:F:716:53.38:58.54:54.55
N:VAL:F:712:46.83:57.32:51.55 CA:PRO:F:716:55.52:58.68:53.63
CA:VAL:F:712:47.22:55.9:51.23 CB:PRO:F:716:55.56:59.34:54.99
CB:VAL:F:712:46.07:54.82:51.25 CG:PRO:F:716:54.37:58.73:55.71
CG1 :VAL:F:712:46.47:53.39:50.76 C:PRO:F:716:56.43:57.48:53.54
HG11 :VAL:F:712:46.71 :53.48:49.68 0:PRO:F:716:55.96:56.34:53.71
HG12:VAL:F:712:47.32:53:51.36 N:ARG:F:717:57.75:57.72:53.24
HG13:VAL:F:712:45.66:52.63:50.8 CA:ARG:F:717:58.66:56.71 :52.82
CG2:VAL:F:712:45.43:54.72:52.63 CB:ARG:F:717:59.19:56.96:51.37
HG21 :VAL:F:712:45.01 :55.7:52.98 CG:ARG:F:717:58.07:57.08:50.27
HG22:VAL:F:712:44.48:54.16:52.46 CD:ARG:F:717:57.44:55.7:49.95
HG23:VAL:F:712:46.1:54.24:53.35 NE:ARG:F:717:58.54:54.89:49.47
C:VAL:F:712:48.04:55.8:49.92 CZ:ARG:F:717:58.85:53.6:49.76
O:VAL:F:712:49.03:55.1:49.85 NH1 :ARG:F:717:58.09:52.91 :50.56
N:VAL:F:713:47.59:56.52:48.87 NH2:ARG:F:717:59.9:53:49.31
CA:VAL:F:713:48.42:56.58:47.72 C:ARG:F:717:59.87:56.5:53.73
CB:VAL:F:713:47.69:57.28:46.53 O:ARG:F:717:60.49:57.54:54.16
CG1 :VAL:F:713:48.63:57.33:45.33 N:PRO:F:718:60.27:55.25:54.02
HG11 :VAL:F:713:47.96:57.48:44.45 CD:PRO:F:718:59.49:54.05:53.77
HG12:VAL:F:713:49.43:58.08:45.48 CA:PRO:F:718:61.63:54.88:54.5
HG13:VAL:F:713:49.1:56.31 :45.23 CB:PRO:F:718:61.53:53.43:54.95
CG2:VAL:F:713:46.55:56.32:46.23 CG:PRO:F:718:60.47:52.79:54.17
HG21 :VAL:F:713:46.83:55.27:46.43 C:PRO:F:718:62.54:55.01 :53.29
HG22:VAL:F:713:45.69:56.64:46.85 0:PRO:F:718:62.13:54.82:52.18
HG23:VAL:F:713:46.13:56.49:45.22 N:SER:F:719:63.79:55.5:53.54
C:VAL:F:713:49.81 :57.26:47.93 CA:SER:F:719:64.81 :55.33:52.51
O:VAL:F:713:50.83:56.66:47.55 CB:SER:F:719:65.74:56.52:52.38
N:PHE:F:714:49.88:58.42:48.62 OG:SER:F:719:65.11 :57.83:52.44
CA:PHE:F:714:51.01:59.28:48.77 C:SER:F:719:65.77:54.14:52.77
CB:PHE:F:714:50.67:60.72:48.42 OTl :SER:F:719:66.91 :54.24:53.37
CG:PHE:F:714:49.93:60.85:47.16 OT1:SER:F:719:66.91:54.24:53.37
CD1 :PHE:F:714:50.58:60.51 :45.98 C1 :NAG:X:720:55.48:55.64:31.49
CE1:PHE:F:714:49.93:60.69:44.72 C5:NAG:X:720:53.71 :57.27:31.41
CZ:PHE:F:714:48.73:61.35:44.62 O5:NAG:X:720:55.07:56.97:31.77
CD2:PHE:F:714:48.68:61.5:47.06 C2:NAG:X:720:55.75:55.53:29.99
CE2:PHE:F:714:48.18:61.78:45.79 N:NAG:X:720:56.25:54.25:29.59
C:PHE:F:714:51.74:59.04:50.09 C:NAG:X:720:57.37:53.77:30.04
O:PHE:F:714:52.45:60:50.56 O:NAG:X:720:58.06:54.38:30.88
N:VAL:F:715:51.64:57.84:50.69 CT:NAG:X:720:57.72:52.42:29.41
CA:VAL:F:715:52.29:57.39:51.92
CB:VAL:F:715:52.41:55.91:52.1
CG1:VAL:F:715:50.96:55.35:52.45
HG11:VAL:F:715:50.22:55.97:51.92
HG12:VAL:F:715:50.8:54.26:52.26
HG13:VAL:F:715:50.74:55.45:53.52
CG2:VAL:F:715:52.89:55.14:50.83
C3:NAG:X:720:54.42:55.81:29.34
O3:NAG:X:720:54.5:55.8:27.97
C4:NAG:X:720:53.79:57.19:29.86
O4:NAG:X:720:52.51:57.42:29.22
C6:NAG:X:720:53.28:58.62:31.88
O6:NAG:X:720:54.28:59.61:31.56
C1:NAG:X:721:50.32:33.4:28.65
C5:NAG:X:721:51.84:32.26:27.16
O5:NAG:X:721:51.07:32.26:28.41
C2:NAG:X:721:49.37:33.71:27.52
N:NAG:X:721:48.37:34.9:27.67
C:NAG:X:721:47.05:34.83:28.05
O:NAG:X:721:46.49:33.82:28.43
CT:NAG:X:721:46.21:36.08:27.93
C3:NAG:X:721:50.13:33.76:26.22
O3:NAG:X:721:49.28:33.79:25.08
C4:NAG:X:721:51.07:32.58:25.95
O4:NAG:X:721:51.92:32.68:24.8
C6:NAG:X:721:52.8:31.02:27.02
O6:NAG:X:721:53.72:30.85:28.14
C1:NAG:X:722:27.17:37.44:47.59
C5:NAG:X:722:26.8:35.84:49.36
O5:NAG:X:722:26.81:36.1:47.95
C2:NAG:X:722:26.03:38.33:48.01
N:NAG:X:722:24.82:38.14:47.21
C:NAG:X:722:23.56:38.6:47.46
O:NAG:X:722:23.14:38.99:48.52
CT:NAG:X:722:22.65:38.53:46.3
C3:NAG:X:722:25.9:38.19:49.52
O3:NAG:X:722:27.05:38.74:50.14
C4:NAG:X:722:25.76:36.77:50
O4:NAG:X:722:24.45:36.26:49.67
C6:NAG:X:722:28.17:35.73:50.1
O6:NAG:X:722:29.12:35:49.36
C1:NAG:X:723:30.04:30.45:33.54
C5:NAG:X:723:29.88:28.78:31.92
O5:NAG:X:723:29.93:29.09:33.31
C2:NAG:X:723:28.84:31.18:32.91
N:NAG:X:723:28.72:32.64:33.2
C:NAG:X:723:28.24:33.18:34.36
O:NAG:X:723:27.43:32.55:35.07
CT:NAG:X:723:28.76:34.47:34.82
C3:NAG:X:723:28.65:30.79:31.45
O3:NAG:X:723:27.46:31.4:31.02
C4:NAG:X:723:28.6:29.26:31.29
O4:NAG:X:723:28.53:28.94:29.9
C6:NAG:X:723:30.06:27.33:31.73
O6:NAG:X:723:28.91:26.68:32.21
C1:NAG:X:724:50.9:19.28:50.25
C5:NAG:X:724:48.85:18.78:49.13
O5:NAG:X:724:50:19.55:49.14
C2:NAG:X:724:51.31:17.79:50.23
N:NAG:X:724:52.44:17.35:51.09
C:NAG:X:724:52.44:17.32:52.44
O:NAG:X:724:51.51:17.63:53.18
CT:NAG:X:724:53.77:16.85:53.01
C3:NAG:X:724:50.08:16.91:50.28
O3:NAG:X:724:50.52:15.56:50.25
C4:NAG:X:724:49.16:17.31:49.14
O4:NAG:X:724:47.9:16.59:49.41
C6:NAG:X:724:47.87:19.15:47.97
O6:NAG:X:724:47.57:20.51:48.06
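The coordinate records above are colon-separated. The following is a minimal Python sketch for reading them, assuming (this is an interpretation, not stated in this section) that the seven fields are atom name, residue name, chain identifier, residue number, and x, y, z coordinates. The names `AtomRecord`, `parse_record`, and `centroid` are hypothetical helpers introduced only for illustration.

```python
from typing import List, NamedTuple, Tuple


class AtomRecord(NamedTuple):
    """One coordinate record; the field meanings are assumed, see lead-in."""
    atom: str      # atom name, e.g. "CA"
    residue: str   # residue name, e.g. "VAL" or "NAG"
    chain: str     # chain identifier, e.g. "F"
    number: int    # residue number
    x: float
    y: float
    z: float


def parse_record(line: str) -> AtomRecord:
    """Parse one 'atom:residue:chain:number:x:y:z' record.

    Whitespace around each field is stripped, so records that carry
    stray spaces next to the colons are still accepted.
    """
    fields = [field.strip() for field in line.split(":")]
    if len(fields) != 7:
        raise ValueError(f"expected 7 fields, got {len(fields)}: {line!r}")
    atom, residue, chain, number, x, y, z = fields
    return AtomRecord(atom, residue, chain, int(number),
                      float(x), float(y), float(z))


def centroid(records: List[AtomRecord]) -> Tuple[float, float, float]:
    """Arithmetic mean position of a non-empty list of records."""
    n = len(records)
    return (sum(r.x for r in records) / n,
            sum(r.y for r in records) / n,
            sum(r.z for r in records) / n)


if __name__ == "__main__":
    lines = [
        "CA:VAL:F:715:52.29:57.39:51.92",
        "CB:VAL:F:715:52.41 :55.91:52.1",  # stray space is tolerated
    ]
    records = [parse_record(line) for line in lines]
    print(records[0])
    print(centroid(records))
```

Grouping the parsed records by `(chain, number)` would then give per-residue atom sets, for example to select a subset of the coordinates.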
Claims (94)
1. An insulin analog comprising an A chain peptide and a B chain peptide, wherein the B chain comprises an aromatic or large aliphatic residue at a position corresponding to amino acid number 20 of the B chain of human insulin and/or an aromatic or large aliphatic residue at a position corresponding to amino acid number 15 of the B chain of human insulin, wherein the analog comprises at least one amino acid found in human insulin but lacking in the corresponding position of Conus geographus venom insulin, and wherein the A chain peptide and the B chain peptide are bonded together across at least one pair of cysteine residues.
2. The insulin analog of claim 1, wherein the aromatic or large aliphatic residue at a position corresponding to amino acid number 20 of the B chain of human insulin is selected from the group consisting of tyrosine, phenylalanine, 4-methylphenylalanine, histidine, tryptophan, methionine, cyclopentylalanine and cyclohexylalanine.
3. The insulin analog of claim 1 or claim 2, wherein the B chain is truncated at the C-terminal end when compared to human insulin.
4. The insulin analog of claim 3, wherein the B chain is lacking one or more or all of the nine C-terminal amino acids of human insulin.
5. The insulin analog of claim 3 or claim 4, wherein the B chain is at least lacking PheB24 of human insulin.
6. The insulin analog according to any one of claims 1 to 5, wherein the B chain is at least lacking the human B chain aromatic triplet (amino acids PheB24-PheB25-TyrB26).
7. The insulin analog according to any one of claims 1 to 6, comprising:
an A chain peptide comprising the sequence Gly-XA2-XA3-XA4-XA5-CysA6-CysA7-XA8-XA9-XA10-CysA11-XA12-XA13-XA14-XA15-XA16-XA17-XA18-XA19-CysA20-XA21-XA22-XA23-XA24-XA25-XA26-XA27-XA28-XA29-XA30-XA31-XA32-XA33-XA34,
wherein XA2 = Val or Ile; XA3 = Val or Ala; XA4 = Glu, Asp, Cys or gamma carboxyglutamate; XA5 = Gln, Glu, gamma carboxyglutamate, His or Val; CysA6, CysA7, and CysA11 are independently Cys or selenocysteine; XA8 = Thr, His, Asp, Gln, Tyr, Lys, Ala or Val; XA9 = Ser, Arg, Asn, Gly, His or Lys; XA10 = Ile, Pro, Tyr, Ala, Ser, Val, Phe, His or Thr; XA12 = Ser or Thr; XA13 = Leu, Asn, Val, Arg or Asp; XA14 = Tyr, Ala, Gln, His, Asp or Glu; XA15 = Gln, Glu or Thr; XA16 = Phe, Leu, or Ala; XA17 = Glu, Gln, Lys, Arg, Ile, Met, Thr or Ser; XA18 = Lys, Ser, Thr, Asn, Gln or Glu; XA19 = Tyr or Phe; CysA20 = Cys, selenocysteine, amidated Cys, or amidated selenocysteine; XA21 = Asn, Pro, His, Ser, Gly, Ala, or is absent; XA22 = Pro, Asn, Thr, Leu, Ser or is absent; XA23 = Thr, Leu, Val, Ser or is absent; XA24 = Arg, Thr, Met, Gln, Leu or is absent; XA25 = Glu, Gly or is absent; XA26 = Ser, Leu or is absent; XA27 to XA31 are independently Ser or are absent; XA32 = Ala, Ser or is absent; XA33 = Ala, Val or is absent; and XA34 = Ala or is absent (SEQ ID NO: 1); and
a B chain peptide comprising the sequence XB1-XB2-XB3-XB4-XB5-XB6-XB7-XB8-CysB9-XB10-XB11-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23-XB24-XB25-XB26-XB27-XB28-XB29-XB30-XB31-XB32-XB33-XB34-XB35-XB36-XB37-XB38-XB39,
wherein XB1 = Thr, Asn, Ser or is absent; XB2 = Phe, Ser, Asn, Thr, Gln or is absent; XB3 = Ala, Asp, Gly, Pro, Leu, Phe, or His; XB4 = Ala, Thr, Pro, Asp, Val or Gly; XB5 = Asn, Pro, His, Thr, Arg, Lys, Ser or hydroxyproline; XB6 = Lys, Glu, Asn, Asp, Arg, Gln or Gly; XB7 = His, Tyr, Arg or Ile; XB8 = Arg, Thr, Ile, Ser, Leu, Tyr or Lys; CysB9 = Cys or selenocysteine; XB10 = Gly, Gln or Asp; XB11 = Ser, Leu, Gly or Pro; XB12 = His, Glu, gamma carboxyglutamate, Asp, or Asn; XB13 = Ile, Leu, Asp, Val or Ala; XB14 = Thr, Ala, Pro, Val or Arg; XB15 = Asn, Asp, Ala, Val, Thr, Pro or Glu; XB16 = Ala, Ser, Gln, His, Thr, Tyr, Arg or Gly; XB17 = Thr, Tyr, Pro, Leu or Gly; XB18 = Tyr, Met, Val, Gln, Ile, Asp, Gly, Asn or Leu; XB19 = Leu, Asp, Gln, Gly, Lys, Glu, Arg, Ser or Thr; XB20 = Val, Leu or Lys; CysB21 = Cys, amidated Cys, selenocysteine or amidated selenocysteine; XB22 = Val, Tyr, Phe, His, Gly, Gln, Leu, amidated His, amidated Val or is absent; XB23 = Glu, Asp, Arg, Ser, Gly or is absent; XB24 = Arg, Asp, Val or is absent; XB25 = Gly, Leu, Val or is absent; XB26 = Phe, Val, Ile or is absent; XB27 = Phe, Asn, Pro, Glu or is absent; XB28 = Tyr, Cys, His or is absent; XB29 = Thr, His, Ile, Leu, Ser, Tyr or is absent; XB30 = Pro, Glu, Leu, Ile, Arg or is absent; XB31 = Ile, Lys or is absent; XB32 = Ala, Asp, Ser, Thr, Lys, Leu, Gln or is absent; XB33 = Cys or is absent; XB34 = Glu, Pro, Val or is absent; XB35 = Glu, Gly or is absent; XB36 = Glu, Gly or is absent; XB37 = Glu, Val or is absent; XB38 = Ala, Asp or is absent; and XB39 = Ala or is absent (SEQ ID NO: 2).
8. The insulin analog of claim 7, wherein the B chain peptide comprises the sequence XB1-XB2-XB3-XB4-XB5-XB6-XB7-XB8-CysB9-XB10-XB11-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23-XB24-XB25-XB26-XB27-XB28-XB29-XB30-XB31-XB32-XB33-XB34-XB35-XB36-XB37-XB38-XB39,
wherein XB1 = Thr, Asn, Ser or is absent; XB2 = Phe, Ser, Asn, Thr, Gln or is absent; XB3 = Asp, Gly, Pro, Leu, Phe, or His; XB4 = Thr, Pro, Asp, Val or Gly; XB5 = Asn, Pro, His, Thr, Arg, Ser or hydroxyproline; XB6 = Lys, Glu, Asn, Asp, Arg, Gln or Gly; XB7 = His, Tyr, Arg or Ile; XB8 = Arg, Thr, Ile, Ser, Leu, Tyr or Lys; CysB9 = Cys or selenocysteine; XB10 = Gly, Gln or Asp; XB11 = Ser, Leu, Gly or Pro; XB12 = His, Glu, gamma carboxyglutamate, Asp, or Asn; XB13 = Ile, Leu, Asp, Val or Ala; XB14 = Thr, Ala, Pro, Val or Arg; XB15 = Asn, Asp, Ala, Val, Thr, Pro or Glu; XB16 = Ala, Ser, Gln, His, Tyr, Arg or Gly; XB17 = Tyr or Leu; XB18 = Tyr, Met, Val, Gln, Ile, Asp, Gly, Asn or Leu; XB19 = Leu, Asp, Gln, Gly, Lys, Glu, Arg or Thr; XB20 = Val, Leu or Lys; CysB21 = Cys, amidated Cys, selenocysteine or amidated selenocysteine; XB22 = Tyr, Phe or His; XB23 = Glu, Arg, Ser, Gly or is absent; XB24 = Arg, Asp, Val or is absent; XB25 = Gly, Leu, Val or is absent; XB26 = Phe, Val, Ile or is absent; XB27 = Phe, Asn, Pro, Glu or is absent; XB28 = Tyr, Cys, His or is absent; XB29 = Thr, His, Leu, Tyr or is absent; XB30 = Pro, Glu, Leu, Ile, Arg or is absent; XB31 = Ile, Lys or is absent; XB32 = Thr, Lys, Leu, Gln or is absent; XB33 = Cys or is absent; XB34 = Glu, Pro, Val or is absent; XB35 = Glu, Gly or is absent; XB36 = Glu, Gly or is absent; XB37 = Glu, Val or is absent; XB38 = Ala, Asp or is absent; and XB39 = Ala or is absent (SEQ ID NO: 3).
9. The insulin analog of claim 7 or claim 8, wherein the B chain peptide comprises the sequence XB1-XB2-XB3-XB4-XB5-XB6-XB7-XB8-CysB9-XB10-XB11-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23-XB24-XB25-XB26-XB27-XB28-XB29-XB30-XB31-XB32-XB33-XB34-XB35-XB36-XB37-XB38-XB39,
wherein XB1 = Thr, Asn, Ser or is absent; XB2 = Phe, Ser, Asn, Thr, Gln or is absent; XB3 = Asp, Gly, Pro, Leu, Phe, or His; XB4 = Thr, Pro, Asp, Val or Gly; XB5 = Asn, Pro, His, Thr, Arg, Ser or hydroxyproline; XB6 = Lys, Glu, Asn, Asp, Arg, Gln or Gly; XB7 = His, Tyr, Arg or Ile; XB8 = Arg, Thr, Ile, Ser, Leu, Tyr or Lys; CysB9 = Cys or selenocysteine; XB10 = Gly, Gln or Asp; XB11 = Ser, Leu, Gly or Pro; XB12 = His, Glu, gamma carboxyglutamate, Asp, or Asn; XB13 = Ile, Leu, Asp, Val or Ala; XB14 = Thr, Ala, Pro, Val or Arg; XB15 = Asn, Asp, Ala, Val, Thr, Pro or Glu; XB16 = Ala, Ser, Gln, His, Tyr, Arg or Gly; XB17 = Tyr or Leu; XB18 = Tyr, Met, Val, Gln, Ile, Asp, Gly, Asn or Leu; XB19 = Leu, Asp, Gln, Gly, Lys, Glu, Arg or Thr; XB20 = Val, Leu or Lys; CysB21 = Cys, amidated Cys, selenocysteine or amidated selenocysteine; XB22 = Tyr, Phe or His; XB23 = Glu, Arg, Ser, Gly or is absent; and XB24, XB25, XB26, XB27, XB28, XB29, XB30, XB31, XB32, XB33, XB34, XB35, XB36, XB37, XB38 and XB39 are absent (SEQ ID NO: 4).
10. The insulin analog of claim 7 or claim 8, wherein the B chain peptide comprises the sequence XB1-XB2-XB3-XB4-XB5-XB6-XB7-XB8-CysB9-Gly-Ser-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23-XB24-XB25-XB26-XB27-XB28-XB29-XB30-XB31-XB32-XB33-XB34-XB35-XB36-XB37-XB38-XB39,
where XB1 = Thr, Asn, Ser or is absent; XB2 = Phe, Ser, Asn, Thr, Gln or is absent; XB3 = Asp, Gly, Pro, Leu, Phe, or His; XB4 = Thr, Pro, Asp, Val or Gly; XB5 = Asn, Pro, His, Thr, Arg, Ser or hydroxyproline; XB6 = Lys, Glu, Asn, Asp, Arg, Gln or Gly; XB7 = His, Tyr, Arg or Ile; XB8 = Arg, Thr, Ile, Ser, Leu, Tyr or Lys; CysB9 = Cys or selenocysteine; XB12 = His, Glu, gamma carboxyglutamate, Asp, or Asn; XB13 = Ile, Leu, Asp, Val or Ala; XB14 = Thr, Ala, Pro, Val or Arg; XB15 = Asn, Asp, Ala, Val, Thr, Pro or Glu; XB16 = Ala, Ser, Gln, His, Tyr, Arg or Gly; XB17 = Tyr or Leu; XB18 = Tyr, Met, Val, Gln, Ile, Asp, Gly, Asn or Leu; XB19 = Leu, Asp, Gln, Gly, Lys, Glu, Arg or Thr; XB20 = Val, Leu or Lys; CysB21 = Cys, amidated Cys, selenocysteine or amidated selenocysteine; XB22 = Tyr, Phe or His; XB23 = Glu, Arg, Ser, Gly or is absent; XB24 = Arg, Asp, Val or is absent; XB25 = Gly, Leu, Val or is absent; XB26 = Phe, Val, Ile or is absent; XB27 = Phe, Asn, Pro, Glu or is absent; XB28 = Tyr, Cys, His or is absent; XB29 = Thr, His, Leu, Tyr or is absent; XB30 = Pro, Glu, Leu, Ile, Arg or is absent; XB31 = Ile, Lys or is absent; XB32 = Thr, Lys, Leu, Gln or is absent; XB33 = Cys or is absent; XB34 = Glu, Pro, Val or is absent; XB35 = Glu, Gly or is absent; XB36 = Glu, Gly or is absent; XB37 = Glu, Val or is absent; XB38 = Ala, Asp or is absent; and XB39 = Ala or is absent (SEQ ID NO: 5).
11. The insulin analog of claim 7 or claim 8, wherein the B chain peptide comprises the sequence XB1-XB2-XB3-XB4-XB5-XB6-XB7-XB8-CysB9-Gly-Ser-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23-XB24-XB25-XB26-XB27-XB28-XB29-XB30-XB31-XB32-XB33-XB34-XB35-XB36-XB37-XB38-XB39,
where XB1 = Thr, Asn, Ser or is absent; XB2 = Phe, Ser, Asn, Thr, Gln or is absent; XB3 = Asp, Gly, Pro, Leu, Phe, or His; XB4 = Thr, Pro, Asp, Val or Gly; XB5 = Asn, Pro, His, Thr, Arg, Ser or hydroxyproline; XB6 = Lys, Glu, Asn, Asp, Arg, Gln or Gly; XB7 = His or Tyr; XB8 = Arg, Thr, Ile, Ser, Leu, Tyr or Lys; CysB9 = Cys or selenocysteine; XB12 = His, Glu, gamma carboxyglutamate, Asp, or Asn; XB13 = Ile, Leu, Asp, Val or Ala; XB14 = Thr, Ala, Pro, Val or Arg; XB15 = Asn, Asp, Ala, Val, Thr, Pro or Glu; XB16 = Ala, Ser, Gln, His, Tyr, Arg or Gly; XB17 = Tyr or Leu; XB18 = Tyr, Met, Val, Gln, Ile, Asp, Gly, Asn or Leu; XB19 = Leu, Asp, Gln, Gly, Lys, Glu, Arg or Thr; XB20 = Val or Leu; CysB21 = Cys, amidated Cys, selenocysteine or amidated selenocysteine; XB22 = Tyr, Phe or His; XB23 = Glu, Arg, Ser, Gly or is absent; XB24 = Arg, Asp, Val or is absent; XB25 = Gly, Leu, Val or is absent; XB26 = Phe, Val, Ile or is absent; XB27 = Phe, Asn, Pro, Glu or is absent; XB28 = Tyr, Cys, His or is absent; XB29 = Thr, His, Leu, Tyr or is absent; XB30 = Pro, Glu, Leu, Ile, Arg or is absent; XB31 = Ile, Lys or is absent; XB32 = Thr, Lys, Leu, Gln or is absent; XB33 = Cys or is absent; XB34 = Glu, Pro, Val or is absent; XB35 = Glu, Gly or is absent; XB36 = Glu, Gly or is absent; XB37 = Glu, Val or is absent; XB38 = Ala, Asp or is absent; and XB39 = Ala or is absent (SEQ ID NO: 6).
12. The insulin analog of claim 7, wherein the B chain peptide comprises the sequence XB1-XB2-XB3-XB4-XB5-XB6-XB7-XB8-CysB9-Gly-Ser-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23, wherein XB1 = Thr, Asn or is absent; XB2 = Phe, Ser or is absent; XB3 = Phe or Asp; XB4 = Thr or Val; XB5 = Pro, Asn or hydroxyproline; XB6 = Lys, Asn or Gln; XB7 = His or Tyr; XB8 = Arg, Ile or Leu; CysB9 = Cys or selenocysteine; XB12 = His, Asp, Glu or gamma carboxyglutamate; XB13 = Val, Ile or Leu; XB14 = Thr, Ala, Pro, or Val; XB15 = Glu, Val, Asn or Asp; XB16 = Ser, Gln, Tyr or Ala; XB17 = Tyr or Leu; XB18 = Tyr, Asp, Met or Val; XB19 = Leu, Asp, Gln or Lys; XB20 = Leu or Val; CysB21 = Cys, amidated Cys, selenocysteine or amidated selenocysteine; XB22 = Tyr or Gly; and XB23 = Glu, Arg, Gly or is absent (SEQ ID NO: 7).
13. The insulin analog of claim 12, wherein the B chain peptide comprises the sequence XB1-XB2-XB3-XB4-XB5-XB6-His-XB8-CysB9-Gly-Ser-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23, wherein XB1 = Thr or is absent; XB2 = Phe or is absent; XB3 = Phe or Asp; XB4 = Thr or Val; XB5 = Pro, Asn or hydroxyproline; XB6 = Lys or Gln; XB8 = Arg or Leu; XB12 = His, Glu or gamma carboxyglutamate; XB13 = Ile or Leu; XB14 = Thr, or Val; XB15 = Glu, or Asn; XB16 = Ser or Ala; XB17 = Tyr or Leu, XB18 = Tyr or Met; XB19 = Leu or Asp, XB20 = Leu or Val; XB22 = Tyr; and XB23 = Glu or Arg (SEQ ID NO: 8).
14. The insulin analog according to any one of claims 7 to 12, wherein XB22 is Tyr.
15. The insulin analog according to any one of claims 1 to 14, wherein the A chain peptide comprises the sequence Gly-XA2-XA3-XA4-XA5-CysA6-CysA7-XA8-XA9-XA10-CysA11-XA12-XA13-XA14-XA15-Phe-XA17-XA18-XA19-CysA20-XA21-XA22-XA23-XA24-XA25-XA26-XA27-XA28-XA29-XA30-XA31-XA32-XA33-XA34,
wherein XA2 = Val or Ile; XA3 = Val or Ala; XA4 = Glu, gamma carboxyglutamate or Cys; XA5 = Gln, Glu, gamma carboxyglutamate, His or Val; CysA6, CysA7, and CysA11 are independently Cys or selenocysteine; XA8 = Thr, His, Asp, Gln, Tyr, Lys or Val; XA9 = Ser, Arg, Asn, His or Lys; XA10 = Ile, Pro, Tyr, Ala, Ser, Phe, His or Thr; XA12 = Ser or Thr; XA13 = Leu, Asn, Val or Asp; XA14 = Tyr, Ala, Gln, Asp or Glu; XA15 = Gln, Glu or Thr; or Ala; XA17 = Glu, Lys, Arg, Ile, Met, Thr or Ser; XA18 = Lys, Thr, Asn, Gln or Glu; XA19 = Tyr or Phe; CysA20 = Cys, selenocysteine, amidated Cys, or amidated selenocysteine; XA21 = Asn, Pro, His, Ser, Gly, Ala, or is absent; XA22 = Pro, Asn, Thr, Leu, Ser or is absent; XA23 = Thr, Leu, Val, Ser or is absent; XA24 = Arg, Thr, Met, Gln, Leu or is absent; XA25 = Glu, Gly or is absent; XA26 = Ser, Leu or is absent; XA27 to XA31 are independently Ser or are absent; XA32 = Ala, Ser or is absent; XA33 = Ala, Val or is absent; and XA34 = Ala or is absent (SEQ ID NO: 9).
16. The insulin analog according to any one of claims 1 to 6, comprising:
an A chain peptide comprising a sequence Gly-XA2-Val-XA4-XA5-CysA6-CysA7-XA8-XA9-XA10-CysA11-Ser-XA13-XA14-XA15-XA16-XA17-XA18-Tyr-CysA20-XA21, wherein XA2 is Val or Ile, XA4 is Glu or gamma carboxyglutamate, XA5 is His or Gln, XA8 is His or Thr, XA9 is Arg or Ser, XA10 is Pro or Ile, XA13 is Asn or Leu, XA14 is Ala or Tyr, XA15 is Glu or Gln, XA16 is Phe or Leu, XA17 is Lys or Glu, XA18 is Lys or Asn and XA21 is Asn or absent (SEQ ID NO: 10); and
a B chain peptide comprising the sequence XB1-XB2-XB3-XB4-XB5-XB6-His-XB8-CysB9-Gly-Ser-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-XB22-XB23, wherein XB1 = Thr or is absent; XB2 = Phe or is absent; XB3 = Phe or Asp; XB4 = Thr or Val; XB5 = Pro, Asn or hydroxyproline; XB6 = Lys or Gln; XB8 = Arg or Leu; XB12 = His, Glu or gamma carboxyglutamate; XB13 = Ile or Leu; XB14 = Thr, or Val; XB15 = Glu, or Asn; XB16 = Ser or Ala; XB17 = Tyr or Leu; XB18 = Tyr or Met; XB19 = Leu or Asp, XB20 = Leu or Val; XB22 = Gly or Tyr; and XB23 = Glu or Arg (SEQ ID NO: 11).
17. The insulin analog of claim 15 or claim 16, comprising:
an A chain peptide comprising a sequence Gly-XA2-Val-XA4-XA5-CysA6-CysA7-XA8-XA9-XA10-CysA11-Ser-XA13-XA14-XA15-Phe-XA17-XA18-Tyr-CysA20-XA21, wherein XA2 is Val or Ile, XA4 is Glu or gamma carboxyglutamate, XA5 is His or Gln, XA8 is His or Thr, XA9 is Arg or Ser, XA10 is Pro or Ile, XA13 is Asn or Leu, XA14 is Ala or Tyr, XA15 is Glu or Gln, XA17 is Lys or Glu, XA18 is Lys or Asn and XA21 is Asn or absent (SEQ ID NO: 12); and
a B chain peptide comprising the sequence XB1-XB2-XB3-XB4-XB5-XB6-His-XB8-CysB9-Gly-Ser-XB12-XB13-XB14-XB15-XB16-XB17-XB18-XB19-XB20-CysB21-Tyr-XB23, wherein XB1 = Thr or is absent; XB2 = Phe or is absent; XB3 = Phe or Asp; XB4 = Thr or Val; XB5 = Pro, Asn or hydroxyproline; XB6 = Lys or Gln; XB8 = Arg or Leu; XB12 = His, Glu or gamma carboxyglutamate; XB13 = Ile or Leu; XB14 = Thr, or Val; XB15 = Glu, or Asn; XB16 = Ser or Ala; XB17 = Tyr or Leu; XB18 = Tyr or Met; XB19 = Leu or Asp, XB20 = Leu or Val; and XB23 = Glu or Arg (SEQ ID NO: 13).
18. The insulin analog according to any one of claims 15 to 17, comprising:
an A chain peptide comprising a sequence Gly-Val-Val-XA4-XA5-CysA6-CysA7-XA8-XA9-XA10-CysA11-Ser-XA13-XA14-XA15-Phe-XA17-XA18-Tyr-CysA20-XA21, wherein XA4 is Glu or gamma carboxyglutamate, XA5 is His or Gln, XA8 is His or Thr, XA9 is Arg or Ser, XA10 is Pro or Ile, XA13 is Asn or Leu, XA14 is Ala or Tyr, XA15 is Glu or Gln, XA17 is Lys or Glu, XA18 is Lys or Asn and XA21 is Asn or absent (SEQ ID NO: 14); and
a B chain peptide comprising the sequence XB1-XB2-XB3-XB4-XB5-XB6-His-Arg-CysB9-Gly-Ser-XB12-Ile-XB14-XB15-XB16-XB17-XB18-XB19-Leu-CysB21-Tyr-XB23, wherein XB1 = Thr or is absent; XB2 = Phe or is absent; XB3 = Phe or Asp; XB4 = Thr or Val; XB5 = Pro, Asn or hydroxyproline; XB6 = Lys or Gln; XB12 = His, Glu or gamma carboxyglutamate; XB14 = Thr, or Val; XB15 = Glu, or Asn; XB16 = Ser or Ala; XB17 = Tyr or Leu; XB18 = Tyr or Met; XB19 = Leu or Asp, and XB23 = Glu or Arg (SEQ ID NO: 15).
19. The insulin analog according to any one of claims 1 to 18, wherein one or more or all of the following apply:
i) XA4 is gamma carboxyglutamate,
ii) XB5 is hydroxyproline; and
iii) XB12 is gamma carboxyglutamate.
20. The insulin analog according to any one of claims 1 to 19, comprising:
an A chain peptide comprising the sequence Gly-Ile-Val-Glu-Gln-Cys-Cys-Thr- Ser-Ile-Cys-Ser-Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Asn (SEQ ID NO: 16); and
a B chain peptide comprising the sequence Phe-Val-Asn-Gln-His-Leu-Cys-Gly- Ser-His-Leu-Val-Glu-Ala-Xaa-Tyr-Leu-Val-Cys-Gly-Glu, where Xaa is an aromatic residue or large aliphatic residue (SEQ ID NO: 17).
21. The insulin analog according to any one of claims 1 to 19, comprising:
an A chain peptide comprising the sequence Gly-Ile-Val-Glu-Gln-Cys-Cys-Thr- Ser-Ile-Cys-Ser-Leu-Tyr-Gln-Leu-Glu-Asn-Tyr-Cys-Asn (SEQ ID NO: 18); and
a B chain peptide comprising the sequence Phe-Val-Asn-Gln-His-Leu-Cys-Gly- Ser-His-Leu-Val-Glu-Ala-Leu-Tyr-Leu-Val-Cys-Xaa-Glu, where Xaa is an aromatic residue or large aliphatic residue (SEQ ID NO: 19).
22. The insulin analog according to any one of claims 1 to 21, wherein CysB9 of the B chain peptide is bonded to CysA6 of the A chain peptide.
23. The insulin analog according to any one of claims 1 to 22, wherein CysB21 of the B chain peptide is bonded to CysA20 of the A chain peptide.
24. The insulin analog according to any one of claims 1 to 23, wherein CysA7 is bonded to CysA11.
25. The insulin analog according to any one of claims 1 to 24, wherein the A chain peptide and the B chain peptide are linked together at one pair of their respective terminal ends.
26. The insulin analog according to any one of claims 1 to 25, wherein the A chain peptide and the B chain peptide are linked together at both terminal ends.
27. The insulin analog according to any one of claims 1 to 26 which has an IC50 against the human IR-B receptor of less than 10⁻⁶ M.
28. The insulin analog according to any one of claims 1 to 27, wherein at least 75% of the analog is monomeric in solution.
29. The insulin analog according to any one of claims 1 to 28 which has increased bioavailability when administered to a human when compared to human insulin.
30. The insulin analog of claim 29 which has a peak bioavailability within 0.5 to 3 hours of administration to a human.
31. The insulin analog of claim 29 or claim 30 which has an onset of activity within 10 minutes of administration.
32. The insulin analog of any one of claims 1 to 31, wherein the analog does not bind human IGF-IR or binds human IGF-IR weakly.
33. The insulin analog of claim 32, wherein the analog has an affinity (Kd) for human IGF-IR of weaker than 100 nM.
32. A pharmaceutical composition, comprising the insulin analog of any one of claims 1 to 31 or a pharmaceutically acceptable salt thereof and one or more pharmaceutically acceptable carriers.
33. A method for treating and/or preventing an insulin-related condition, comprising administering a therapeutically effective amount of the insulin analog of any one of claims 1 to 32 to a subject in need thereof.
34. The method of claim 33, wherein the insulin related condition is hyperglycemia, insulin resistance, type-1 diabetes, gestational diabetes or type-2 diabetes.
35. A method for decreasing blood glucose levels, comprising administering a therapeutically effective amount of the insulin analog of any one of claims 1 to 32 to a subject in need thereof.
36. Use of the insulin analog of any one of claims 1 to 32 in the manufacture of a medicament for treating and/or preventing an insulin-related condition in a subject.
37. Use of the insulin analog of any one of claims 1 to 32 in the manufacture of a medicament for decreasing blood glucose levels in a subject.
38. An insulin analog of any one of claims 1 to 32 for use in treating and/or preventing an insulin-related condition in a subject.
39. An insulin analog of any one of claims 1 to 32 for use in decreasing blood glucose levels in a subject.
40. A method of redesigning or modifying a polypeptide which is known to bind to an insulin receptor (IR) comprising performing structure-based evaluation of a structure defined by the atomic coordinates of Appendix I or a subset thereof and redesigning or chemically modifying the polypeptide as a result of the evaluation.
41. The method of claim 40, wherein structure-based evaluation comprises comparison of the structure defined by the atomic coordinates of Appendix I or a subset thereof, with the atomic coordinates of insulin or a subset thereof.
42. The method of claim 40 or claim 41, wherein structure-based evaluation further comprises molecular modelling of a complex formed between the structure defined by the atomic coordinates of Appendix I or a subset thereof with the atomic coordinates of an insulin receptor or a subset thereof.
43. The method of any one of claims 40 to 42, further comprising synthesising or obtaining the redesigned or chemically modified polypeptide and testing for its ability to bind IR.
44. The method according to any one of claims 40 to 42, further comprising synthesising or obtaining the redesigned or chemically modified polypeptide and determining the ability of the redesigned or chemically modified polypeptide to modulate IR activation.
45. The method according to any one of claims 40 to 42, further comprising synthesising or obtaining the redesigned or chemically modified polypeptide and determining the ability of the redesigned or chemically modified polypeptide to lower blood glucose levels.
46. The method of any one of claims 40 to 45, wherein the polypeptide which is known to bind to IR is insulin.
47. The method of claim 46, wherein the insulin is human insulin.
48. A polypeptide which has been redesigned or modified by the method of any one of claims 40 to 47.
49. The polypeptide of claim 48, which is monomeric.
50. An isolated molecule which is an IR agonist, wherein the molecule is identified and/or designed based on the 3D structure of Con-Ins G1 defined by the atomic coordinates of Appendix I or a subset thereof.
51. The molecule of claim 50, wherein the molecule is a peptide, polypeptide or peptidomimetic.
52. The molecule of claim 50 or claim 51 which is monomeric.
53. The molecule according to any one of claims 50 to 52, which has an IC50 against the human IR-B receptor of less than 10⁻⁶ M.
54. A method of identifying a compound which binds IR, the method comprising: i) generating a three-dimensional structure model of a polypeptide having
a) a structure defined by the atomic coordinates of Appendix I or a subset thereof, or
b) a structure having a root mean square deviation less than about 2.0 Å when superimposed on the corresponding backbone atoms of a), and
ii) designing or screening for a compound which potentially binds the IR.
55. The method of claim 54, wherein generating a three-dimensional structure model comprises generating a model of the polypeptide bound to IR or regions thereof.
56. The method of claim 54 or claim 55, further comprising synthesising the compound which potentially binds the IR.
57. The method according to any one of claims 54 to 56, wherein the compound modulates at least one biological activity of IR.
58. The method according to any one of claims 54 to 57, wherein the compound is monomeric.
59. The method of any one of claims 54 to 58 which further comprises testing the compound designed or screened for in ii) for its ability to modulate blood glucose levels.
60. The method according to any one of claims 54 to 59, wherein i) and ii) are performed in silico.
61. A computer-based method of identifying a compound which mimics insulin activity, the method comprising
i) generating a three-dimensional structure model of a polypeptide having
a) a structure defined by the atomic coordinates of Appendix I or a subset thereof, or
b) a structure having a root mean square deviation less than about 2.0 Å when superimposed on the corresponding backbone atoms of a), and
ii) designing or screening for a compound which mimics insulin activity.
62. The method of claim 61, wherein generating a three-dimensional structure model comprises generating a model of the polypeptide bound to IR or regions thereof.
63. The method of claim 61 or claim 62, further comprising synthesising the compound which potentially binds the IR.
64. The method according to any one of claims 61 to 63, wherein the compound modulates at least one biological activity of IR.
65. The method according to any one of claims 61 to 64, wherein the compound is monomeric.
66. The method according to any one of claims 61 to 65, the method further comprising testing the compound designed or screened for in ii) for its ability to modulate blood glucose levels.
67. The method according to any one of claims 61 to 66, wherein i) and ii) are performed in silico.
68. A compound identified using a method according to any one of claims 54 to 67.
69. A crystal of Con-Ins G1 polypeptide having a space group P432 with unit cell dimensions of a = b = c = 74.91 Å with up to about 2% variation in any cell dimension.
70. The structure of Con-Ins G1 polypeptide as defined by the atomic coordinates of Appendix I.
71. The use of the structure of claim 70 as a structural model.
72. The use of the structural model according to claim 71 for identification of insulin analogs.
73. An insulin analog identified by the use of claim 72.
74. A pharmaceutical composition comprising the polypeptide of claim 48 or claim 49, the molecule according to any one of claims 50 to 53, the compound of claim 68, or the insulin analog of claim 72.
75. A peptide comprising an insulin A chain peptide and an insulin B chain peptide, wherein the B chain peptide comprises a substitution at amino acid 10 and amino acid 20.
76. The peptide of claim 75, wherein the substitution at amino acid 20 is G20Y, G20F, or G20P.
77. The peptide of claim 76, wherein the substitution at amino acid 20 is G20P and the peptide further comprises a substitution at amino acid 21, wherein the substitution at amino acid 21 is G21H.
78. The peptide of any one of claims 75 to 77, wherein the substitution at amino acid 10 is H10E, H10D or H10Q.
79. The peptide of any one of claims 75 to 78, further comprising at least one substitution in the A chain peptide.
80. The peptide of claim 79, wherein the at least one substitution in the A chain peptide is T8H, T8Y, T8K, or S9R.
81. The peptide of claim 80, further comprising at least two substitutions in the A chain peptide.
82. The peptide of claim 79, wherein the at least two substitutions in the A chain peptide are two of the substitutions selected from: T8H, T8Y, T8K, and S9R.
83. The peptide of any one of claims 75 to 82, wherein the peptide is a des-octapeptide insulin.
84. The peptide of any one of claims 75 to 83, wherein the B chain peptide comprises the sequence of FVNQHLCGSELVEALYLVCYER (SEQ ID NO: 30).
85. The peptide of any one of claims 75 to 84, wherein the A chain comprises the sequence of GIVEQCCHRICSLYQLENYCN (SEQ ID NO: 39).
86. The peptide of any one of claims 75 to 85, wherein the A chain peptide and B chain peptide are bonded via at least one disulfide bond.
87. The peptide of any one of claims 75 to 86, wherein the peptide is a monomer.
88. The peptide of any one of claims 75 to 87, wherein the insulin A chain peptide is at least 70% identical to wild type human insulin A chain peptide.
89. A pharmaceutical composition comprising the peptide of any one of claims 75 to 88 and a pharmaceutically acceptable carrier.
90. A method of increasing insulin receptor activation in a subject comprising administering a therapeutically effective amount of any one of the peptides of any one of claims 75 to 88 to a subject in need thereof.
91. A method of lowering the blood sugar in a subject comprising administering a therapeutically effective amount of any one of the peptides of any one of claims 75 to 88 to a subject in need thereof.
92. A method of treating type 1 diabetes in a subject comprising administering a therapeutically effective amount of any one of the peptides of any one of claims 75 to 88 to a subject in need thereof.
93. The method of claim 92, wherein the subject has been diagnosed with type 1 diabetes prior to administering the peptide.
94. A therapeutic protein having an A chain peptide bonded to a B chain peptide via at least one disulfide bond, wherein the A chain comprises the sequence of GIVEQCCHRICSLYQLENYCN (SEQ ID NO: 39), and wherein the B chain peptide comprises the sequence of FVNQHLCGSELVEALYLVCYER (SEQ ID NO: 30).
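The substitution notation used in claims 76 to 82 (for example H10E or G20Y) can be checked against the one-letter sequences recited in claims 84, 85 and 94. The following is a minimal Python sketch under stated assumptions: the wild-type human insulin A and B chain sequences are supplied here as reference data (the claims give the human A chain only in three-letter form, in claim 20), the comparison is position-by-position without gaps, and identity is computed as matching positions divided by the wild-type length, which is one common convention and not necessarily the one intended in claim 88. The function names are hypothetical.

```python
# Wild-type human insulin chains in one-letter code (reference data
# supplied for this sketch, not quoted in this form in the claims).
HUMAN_A = "GIVEQCCTSICSLYQLENYCN"
HUMAN_B = "FVNQHLCGSHLVEALYLVCGERGFFYTPKT"

# Sequences recited in claims 84, 85 and 94.
ANALOG_A = "GIVEQCCHRICSLYQLENYCN"   # SEQ ID NO: 39
ANALOG_B = "FVNQHLCGSELVEALYLVCYER"  # SEQ ID NO: 30


def substitutions(wild_type: str, analog: str) -> list:
    """List point substitutions such as 'H10E' (1-based positions).

    Only the ungapped overlap of the two sequences is compared, so a
    C-terminal truncation is not reported as a substitution.
    """
    return [
        f"{w}{pos}{a}"
        for pos, (w, a) in enumerate(zip(wild_type, analog), start=1)
        if w != a
    ]


def percent_identity(wild_type: str, analog: str) -> float:
    """Matching positions as a percentage of the wild-type length."""
    matches = sum(w == a for w, a in zip(wild_type, analog))
    return 100.0 * matches / len(wild_type)


if __name__ == "__main__":
    print(substitutions(HUMAN_B, ANALOG_B))   # ['H10E', 'G20Y']
    print(substitutions(HUMAN_A, ANALOG_A))   # ['T8H', 'S9R']
    print(len(HUMAN_B) - len(ANALOG_B))       # 8 residues shorter
    print(round(percent_identity(HUMAN_A, ANALOG_A), 1))  # 90.5
```

Under these assumptions the recited B chain carries H10E and G20Y and is eight residues shorter than the human B chain, consistent with a des-octapeptide insulin, and the recited A chain carries T8H and S9R while remaining above the 70% identity threshold of claim 88.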
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2021269301A AU2021269301B2 (en) | 2016-07-22 | 2021-11-16 | Insulin Analogs |
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2016902883A AU2016902883A0 (en) | 2016-07-22 | Insulin Analogs | |
AU2016902883 | 2016-07-22 | ||
US201762483118P | 2017-04-07 | 2017-04-07 | |
US62/483,118 | 2017-04-07 | ||
PCT/AU2017/050758 WO2018014091A1 (en) | 2016-07-22 | 2017-07-21 | Insulin analogs |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU2021269301A Division AU2021269301B2 (en) | 2016-07-22 | 2021-11-16 | Insulin Analogs |
Publications (2)
Publication Number | Publication Date |
---|---|
AU2017298565A1 AU2017298565A1 (en) | 2019-02-21 |
AU2017298565B2 true AU2017298565B2 (en) | 2021-08-19 |
Family
ID=60991677
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU2017298565A Active AU2017298565B2 (en) | 2016-07-22 | 2017-07-21 | Insulin analogs |
Country Status (3)
Country | Link |
---|---|
AU (1) | AU2017298565B2 (en) |
CA (1) | CA3030930A1 (en) |
WO (1) | WO2018014091A1 (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109844105A (en) | 2016-07-11 | 2019-06-04 | 得克萨斯州大学系统董事会 | Recombinant polypeptide and its production method comprising selenocysteine |
CN110072884B (en) | 2016-07-22 | 2023-07-28 | 犹他大学研究基金会 | Insulin analogues |
CA3034701A1 (en) | 2016-08-30 | 2018-03-08 | Board Of Regents, The University Of Texas System | Production of seleno-biologics in genomically recoded organisms |
WO2018187568A1 (en) * | 2017-04-07 | 2018-10-11 | University Of Utah Research Foundation | Insulin analogs and methods of using |
EP3892628B1 (en) * | 2018-06-29 | 2022-09-07 | Akston Biosciences Corporation | Ultra-long acting insulin-fc fusion proteins and methods of use |
CN114675019B (en) * | 2022-02-10 | 2022-12-16 | 江苏省人民医院(南京医科大学第一附属医院) | Kit for detecting insulin receptor extracellular domain antibody |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2005095443A1 (en) * | 2004-03-31 | 2005-10-13 | Cardio Incorporated | Drug delivery system using peptide modification |
US20080146492A1 (en) * | 2006-12-13 | 2008-06-19 | Zimmerman Ronald E | Insulin production methods and pro-insulin constructs |
WO2012174480A2 (en) * | 2011-06-17 | 2012-12-20 | Halozyme, Inc. | Continuous subcutaneous insulin infusion methods with a hyaluronan degrading enzyme |
2017
- 2017-07-21: AU application AU2017298565A, published as AU2017298565B2 (status: Active)
- 2017-07-21: WO application PCT/AU2017/050758, published as WO2018014091A1 (status: Unknown)
- 2017-07-21: CA application CA3030930A, published as CA3030930A1 (status: Pending)
Non-Patent Citations (8)
Title |
---|
BAJAJ, M. et al., "Coypu insulin. Primary structure, conformation and biological properties of a hystricomorph rodent insulin", Biochemical Journal, (1986), vol. 238, no. 2, pages 345 - 351 * |
CAS RN 102961-54-6, STN Entry Date 28 June 1986 * |
CAS RN 135317-44-1, STN Entry Date 02 August 1991 * |
CAS RN 1353849-21-4, STN Entry Date 23 January 2012 * |
CAS RN 177150-87-7, STN Entry Date 07 June 1996 * |
CAS RN 53123-87-8, STN Entry Date 16 Nov 1984 * |
CAS RN 556776-12-6, STN Entry Date 29 Jul 2003 * |
SIMON, J. et al., "Evolution of preproinsulin gene in birds", Molecular Phylogenetics and Evolution, (2004), vol. 30, no. 3, pages 755 - 766 * |
Also Published As
Publication number | Publication date |
---|---|
WO2018014091A1 (en) | 2018-01-25 |
CA3030930A1 (en) | 2018-01-25 |
AU2017298565A1 (en) | 2019-02-21 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
AU2017298565B2 (en) | Insulin analogs | |
AU2021269301B2 (en) | Insulin Analogs | |
Pioszak et al. | Molecular recognition of corticotropin-releasing factor by its G-protein-coupled receptor CRFR1 | |
US8377701B2 (en) | Specific ligands to sortilin | |
US7225083B2 (en) | Crystallographic structure of the androgen receptor ligand binding domain | |
EP2411038B1 (en) | Parathyroid hormone peptides and parathyroid hormone-related protein peptides and methods of use | |
EP2901154B1 (en) | Structure of insulin in complex with n- and c-terminal regions of the insulin receptor alpha-chain | |
US20090117662A1 (en) | Mutants of IGF Binding Proteins and Methods of Production of Antagonists Thereof | |
NZ750355B2 (en) | Insulin analogs | |
EP1265927B1 (en) | Crystal | |
Erskine et al. | Structure of the neuronal protein calexcitin suggests a mode of interaction in signalling pathways of learning and memory | |
Wilken | Structure-function relationships of chorionic gonadotropin | |
Watson | Insulin analogues for insulin receptor studies and medical applications | |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| | FGA | Letters patent sealed or granted (standard patent) | |