AU2012200841B2 - Mutant proteins and methods for selecting them - Google Patents
Mutant proteins and methods for selecting them Download PDFInfo
- Publication number
- AU2012200841B2 AU2012200841B2 AU2012200841A AU2012200841A AU2012200841B2 AU 2012200841 B2 AU2012200841 B2 AU 2012200841B2 AU 2012200841 A AU2012200841 A AU 2012200841A AU 2012200841 A AU2012200841 A AU 2012200841A AU 2012200841 B2 AU2012200841 B2 AU 2012200841B2
- Authority
- AU
- Australia
- Prior art keywords
- gpcr
- mutant
- receptor
- amino acid
- parent
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title abstract description 96
- 108010021466 Mutant Proteins Proteins 0.000 title description 12
- 102000008300 Mutant Proteins Human genes 0.000 title description 12
- 102000003688 G-Protein-Coupled Receptors Human genes 0.000 claims abstract description 416
- 108090000045 G-Protein-Coupled Receptors Proteins 0.000 claims abstract description 416
- 239000003446 ligand Substances 0.000 claims abstract description 188
- 230000027455 binding Effects 0.000 claims abstract description 98
- 238000009739 binding Methods 0.000 claims abstract description 94
- 230000001965 increasing effect Effects 0.000 claims abstract description 91
- 108090000623 proteins and genes Proteins 0.000 claims description 89
- 102000004169 proteins and genes Human genes 0.000 claims description 85
- 239000005557 antagonist Substances 0.000 claims description 73
- 239000000556 agonist Substances 0.000 claims description 66
- 239000003599 detergent Substances 0.000 claims description 53
- 238000002425 crystallisation Methods 0.000 claims description 24
- 239000003795 chemical substances by application Substances 0.000 claims description 8
- 102000040430 polynucleotide Human genes 0.000 claims description 8
- 108091033319 polynucleotide Proteins 0.000 claims description 8
- 239000002157 polynucleotide Substances 0.000 claims description 8
- 230000003196 chaotropic effect Effects 0.000 claims description 7
- 238000007876 drug discovery Methods 0.000 claims description 6
- 102000018208 Cannabinoid Receptor Human genes 0.000 claims description 3
- 108050007331 Cannabinoid receptor Proteins 0.000 claims description 3
- 230000014759 maintenance of location Effects 0.000 claims description 2
- 239000007787 solid Substances 0.000 claims description 2
- 238000011948 assay development Methods 0.000 claims 1
- 102000017922 Neurotensin receptor Human genes 0.000 abstract description 60
- 108060003370 Neurotensin receptor Proteins 0.000 abstract description 34
- 102000009346 Adenosine receptors Human genes 0.000 abstract description 28
- 108050000203 Adenosine receptors Proteins 0.000 abstract description 27
- 230000035772 mutation Effects 0.000 description 216
- 102000005962 receptors Human genes 0.000 description 196
- 108020003175 receptors Proteins 0.000 description 196
- 125000000539 amino acid group Chemical group 0.000 description 109
- 235000001014 amino acid Nutrition 0.000 description 91
- 229940024606 amino acid Drugs 0.000 description 84
- 235000018102 proteins Nutrition 0.000 description 84
- 150000001413 amino acids Chemical class 0.000 description 81
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 69
- 125000003275 alpha amino acid group Chemical group 0.000 description 50
- 108010052285 Membrane Proteins Proteins 0.000 description 39
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 37
- 230000003019 stabilising effect Effects 0.000 description 32
- NLEBIOOXCVAHBD-QKMCSOCLSA-N dodecyl beta-D-maltoside Chemical compound O[C@@H]1[C@@H](O)[C@H](OCCCCCCCCCCCC)O[C@H](CO)[C@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 NLEBIOOXCVAHBD-QKMCSOCLSA-N 0.000 description 31
- 102220571318 Cytoplasmic polyadenylated homeobox-like protein 2_Y227A_mutation Human genes 0.000 description 30
- 238000003556 assay Methods 0.000 description 29
- 102200038515 rs2295769 Human genes 0.000 description 29
- 210000004027 cell Anatomy 0.000 description 26
- 102200027764 rs76394784 Human genes 0.000 description 26
- 108060003345 Adrenergic Receptor Proteins 0.000 description 25
- 102000017910 Adrenergic receptor Human genes 0.000 description 25
- 102000018697 Membrane Proteins Human genes 0.000 description 25
- 101100135641 Caenorhabditis elegans par-3 gene Proteins 0.000 description 24
- 230000014509 gene expression Effects 0.000 description 22
- 102000014415 Muscarinic acetylcholine receptor Human genes 0.000 description 21
- 108050003473 Muscarinic acetylcholine receptor Proteins 0.000 description 21
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 20
- NCYCYZXNIZJOKI-IOUUIBBYSA-N 11-cis-retinal Chemical compound O=C/C=C(\C)/C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C NCYCYZXNIZJOKI-IOUUIBBYSA-N 0.000 description 19
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 19
- 101500028886 Rattus norvegicus Neurotensin Proteins 0.000 description 19
- 229960005305 adenosine Drugs 0.000 description 19
- 239000013078 crystal Substances 0.000 description 18
- 150000002632 lipids Chemical class 0.000 description 18
- 108090000765 processed proteins & peptides Proteins 0.000 description 18
- 238000010438 heat treatment Methods 0.000 description 17
- 102000004196 processed proteins & peptides Human genes 0.000 description 17
- 241000588724 Escherichia coli Species 0.000 description 16
- 230000000694 effects Effects 0.000 description 16
- 238000011049 filling Methods 0.000 description 16
- 238000013507 mapping Methods 0.000 description 16
- 239000012528 membrane Substances 0.000 description 16
- 229920001184 polypeptide Polymers 0.000 description 16
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 15
- 102000004330 Rhodopsin Human genes 0.000 description 15
- 108090000820 Rhodopsin Proteins 0.000 description 15
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 13
- 238000002474 experimental method Methods 0.000 description 13
- 238000002703 mutagenesis Methods 0.000 description 13
- 231100000350 mutagenesis Toxicity 0.000 description 13
- 108091006027 G proteins Proteins 0.000 description 12
- 102000030782 GTP binding Human genes 0.000 description 12
- 108091000058 GTP-Binding Proteins 0.000 description 12
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 12
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 11
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 11
- SYELZBGXAIXKHU-UHFFFAOYSA-N dodecyldimethylamine N-oxide Chemical compound CCCCCCCCCCCC[N+](C)(C)[O-] SYELZBGXAIXKHU-UHFFFAOYSA-N 0.000 description 11
- 101500028867 Homo sapiens Neurotensin Proteins 0.000 description 9
- 239000000232 Lipid Bilayer Substances 0.000 description 9
- 239000007983 Tris buffer Substances 0.000 description 9
- 239000012634 fragment Substances 0.000 description 9
- 230000004927 fusion Effects 0.000 description 9
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 9
- -1 inositol phosphates Chemical class 0.000 description 9
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 9
- JADDQZYHOWSFJD-FLNNQWSLSA-N N-ethyl-5'-carboxamidoadenosine Chemical compound O[C@@H]1[C@H](O)[C@@H](C(=O)NCC)O[C@H]1N1C2=NC=NC(N)=C2N=C1 JADDQZYHOWSFJD-FLNNQWSLSA-N 0.000 description 8
- 125000004429 atom Chemical group 0.000 description 8
- WOQQAWHSKSSAGF-WXFJLFHKSA-N decyl beta-D-maltopyranoside Chemical compound O[C@@H]1[C@@H](O)[C@H](OCCCCCCCCCC)O[C@H](CO)[C@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 WOQQAWHSKSSAGF-WXFJLFHKSA-N 0.000 description 8
- 229940125425 inverse agonist Drugs 0.000 description 8
- HEGSGKPQLMEBJL-RKQHYHRCSA-N octyl beta-D-glucopyranoside Chemical compound CCCCCCCCO[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O HEGSGKPQLMEBJL-RKQHYHRCSA-N 0.000 description 8
- 230000000144 pharmacologic effect Effects 0.000 description 8
- PWTBZOIUWZOPFT-UHFFFAOYSA-N 4-[2-[[7-amino-2-(2-furanyl)-[1,2,4]triazolo[1,5-a][1,3,5]triazin-5-yl]amino]ethyl]phenol Chemical compound N=1C2=NC(C=3OC=CC=3)=NN2C(N)=NC=1NCCC1=CC=C(O)C=C1 PWTBZOIUWZOPFT-UHFFFAOYSA-N 0.000 description 7
- 101800001814 Neurotensin Proteins 0.000 description 7
- 102400001103 Neurotensin Human genes 0.000 description 7
- 238000005259 measurement Methods 0.000 description 7
- PCJGZPGTCUMMOT-ISULXFBGSA-N neurotensin Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H]1NC(=O)CC1)C1=CC=C(O)C=C1 PCJGZPGTCUMMOT-ISULXFBGSA-N 0.000 description 7
- 238000000159 protein binding assay Methods 0.000 description 7
- 238000012360 testing method Methods 0.000 description 7
- 101150051188 Adora2a gene Proteins 0.000 description 6
- 238000006243 chemical reaction Methods 0.000 description 6
- 239000003814 drug Substances 0.000 description 6
- 238000011534 incubation Methods 0.000 description 6
- 239000004031 partial agonist Substances 0.000 description 6
- 230000002829 reductive effect Effects 0.000 description 6
- 102200082883 rs33959855 Human genes 0.000 description 6
- 230000006641 stabilisation Effects 0.000 description 6
- 229920000936 Agarose Polymers 0.000 description 5
- 241000039087 Apostates Species 0.000 description 5
- 101001086405 Bos taurus Rhodopsin Proteins 0.000 description 5
- 102220577246 Density-regulated protein_A86Y_mutation Human genes 0.000 description 5
- 102000011107 Diacylglycerol Kinase Human genes 0.000 description 5
- 108010062677 Diacylglycerol Kinase Proteins 0.000 description 5
- 241000238631 Hexapoda Species 0.000 description 5
- 238000012867 alanine scanning Methods 0.000 description 5
- JVHCMYZFGCOCTD-UHFFFAOYSA-N dihydroalprenolol Chemical compound CCCC1=CC=CC=C1OCC(O)CNC(C)C JVHCMYZFGCOCTD-UHFFFAOYSA-N 0.000 description 5
- 229940079593 drug Drugs 0.000 description 5
- 238000004519 manufacturing process Methods 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 239000002287 radioligand Substances 0.000 description 5
- 102200012531 rs111033829 Human genes 0.000 description 5
- 102220086464 rs148634904 Human genes 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 4
- 102000007471 Adenosine A2A receptor Human genes 0.000 description 4
- 108010085277 Adenosine A2A receptor Proteins 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 4
- 108010029485 Protein Isoforms Proteins 0.000 description 4
- 102000001708 Protein Isoforms Human genes 0.000 description 4
- 235000004279 alanine Nutrition 0.000 description 4
- 230000003281 allosteric effect Effects 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 230000008512 biological response Effects 0.000 description 4
- 239000000872 buffer Substances 0.000 description 4
- 210000004899 c-terminal region Anatomy 0.000 description 4
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 4
- 239000003398 denaturant Substances 0.000 description 4
- 238000012856 packing Methods 0.000 description 4
- 102220046035 rs587782586 Human genes 0.000 description 4
- 102220225095 rs751226641 Human genes 0.000 description 4
- 239000000523 sample Substances 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 239000000758 substrate Substances 0.000 description 4
- ZFXYFBGIUFBOJW-UHFFFAOYSA-N theophylline Chemical compound O=C1N(C)C(=O)N(C)C2=C1NC=N2 ZFXYFBGIUFBOJW-UHFFFAOYSA-N 0.000 description 4
- 241000701447 unidentified baculovirus Species 0.000 description 4
- JWZZKOKVBUJMES-UHFFFAOYSA-N (+-)-Isoprenaline Chemical compound CC(C)NCC(O)C1=CC=C(O)C(O)=C1 JWZZKOKVBUJMES-UHFFFAOYSA-N 0.000 description 3
- SFLSHLFXELFNJZ-QMMMGPOBSA-N (-)-norepinephrine Chemical compound NC[C@H](O)C1=CC=C(O)C(O)=C1 SFLSHLFXELFNJZ-QMMMGPOBSA-N 0.000 description 3
- PWTBZOIUWZOPFT-XHHURNKPSA-N 4-[2-[[7-amino-2-(furan-2-yl)-[1,2,4]triazolo[1,5-a][1,3,5]triazin-5-yl]amino]ethyl]-2-tritiophenol Chemical compound C1=C(O)C([3H])=CC(CCNC2=NC3=NC(=NN3C(N)=N2)C=2OC=CC=2)=C1 PWTBZOIUWZOPFT-XHHURNKPSA-N 0.000 description 3
- 108010082845 Bacteriorhodopsins Proteins 0.000 description 3
- 102220513109 Calcium homeostasis modulator protein 4_Y219A_mutation Human genes 0.000 description 3
- 108010078791 Carrier Proteins Proteins 0.000 description 3
- 108020004414 DNA Proteins 0.000 description 3
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 3
- 101000987581 Homo sapiens Perforin-1 Proteins 0.000 description 3
- 102000007207 Muscarinic M1 Receptor Human genes 0.000 description 3
- 108010008406 Muscarinic M1 Receptor Proteins 0.000 description 3
- 102220469280 Pre-mRNA cleavage complex 2 protein Pcf11_K60S_mutation Human genes 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 3
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 239000000654 additive Substances 0.000 description 3
- 230000000996 additive effect Effects 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- 238000002050 diffraction method Methods 0.000 description 3
- 238000009792 diffusion process Methods 0.000 description 3
- 229930182478 glucoside Natural products 0.000 description 3
- 150000008131 glucosides Chemical class 0.000 description 3
- 229910052739 hydrogen Inorganic materials 0.000 description 3
- 239000001257 hydrogen Substances 0.000 description 3
- 230000006872 improvement Effects 0.000 description 3
- 229960001317 isoprenaline Drugs 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 210000004962 mammalian cell Anatomy 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 229960002748 norepinephrine Drugs 0.000 description 3
- SFLSHLFXELFNJZ-UHFFFAOYSA-N norepinephrine Natural products NCC(O)C1=CC=C(O)C(O)=C1 SFLSHLFXELFNJZ-UHFFFAOYSA-N 0.000 description 3
- 102000039446 nucleic acids Human genes 0.000 description 3
- 108020004707 nucleic acids Proteins 0.000 description 3
- 150000007523 nucleic acids Chemical class 0.000 description 3
- 230000037361 pathway Effects 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 108020001580 protein domains Proteins 0.000 description 3
- 238000003653 radioligand binding assay Methods 0.000 description 3
- 102220011020 rs397507515 Human genes 0.000 description 3
- 238000010187 selection method Methods 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 239000013598 vector Substances 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- RIRGCFBBHQEQQH-SSFGXONLSA-N (-)-n6-(2-phenylisopropyl)adenosine Chemical compound C([C@@H](C)NC=1C=2N=CN(C=2N=CN=1)[C@H]1[C@@H]([C@H](O)[C@@H](CO)O1)O)C1=CC=CC=C1 RIRGCFBBHQEQQH-SSFGXONLSA-N 0.000 description 2
- AQHHHDLHHXJYJD-AWEZNQCLSA-N (2s)-1-naphthalen-1-yloxy-3-(propan-2-ylamino)propan-2-ol Chemical compound C1=CC=C2C(OC[C@@H](O)CNC(C)C)=CC=CC2=C1 AQHHHDLHHXJYJD-AWEZNQCLSA-N 0.000 description 2
- UCTWMZQNUQWSLP-VIFPVBQESA-N (R)-adrenaline Chemical compound CNC[C@H](O)C1=CC=C(O)C(O)=C1 UCTWMZQNUQWSLP-VIFPVBQESA-N 0.000 description 2
- UMCMPZBLKLEWAF-BCTGSCMUSA-N 3-[(3-cholamidopropyl)dimethylammonio]propane-1-sulfonate Chemical compound C([C@H]1C[C@H]2O)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(=O)NCCC[N+](C)(C)CCCS([O-])(=O)=O)C)[C@@]2(C)[C@@H](O)C1 UMCMPZBLKLEWAF-BCTGSCMUSA-N 0.000 description 2
- 102000003916 Arrestin Human genes 0.000 description 2
- 108090000328 Arrestin Proteins 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 108010077805 Bacterial Proteins Proteins 0.000 description 2
- 102100021935 C-C motif chemokine 26 Human genes 0.000 description 2
- 108091006146 Channels Proteins 0.000 description 2
- 108020004635 Complementary DNA Proteins 0.000 description 2
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 2
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 2
- 108700022150 Designed Ankyrin Repeat Proteins Proteins 0.000 description 2
- 102000016359 Fibronectins Human genes 0.000 description 2
- 108010067306 Fibronectins Proteins 0.000 description 2
- 101000783751 Homo sapiens Adenosine receptor A2a Proteins 0.000 description 2
- 101000783756 Homo sapiens Adenosine receptor A2b Proteins 0.000 description 2
- 101000783645 Homo sapiens Adenosine receptor A3 Proteins 0.000 description 2
- 101000959437 Homo sapiens Beta-2 adrenergic receptor Proteins 0.000 description 2
- 101000897493 Homo sapiens C-C motif chemokine 26 Proteins 0.000 description 2
- 101000596892 Homo sapiens Neurotrimin Proteins 0.000 description 2
- 102000004310 Ion Channels Human genes 0.000 description 2
- 108090000862 Ion Channels Proteins 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- 102100033986 Neurotensin receptor type 1 Human genes 0.000 description 2
- 102100035107 Neurotrimin Human genes 0.000 description 2
- 108091093037 Peptide nucleic acid Proteins 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- 102000001253 Protein Kinase Human genes 0.000 description 2
- 101000591386 Rattus norvegicus Neurotensin receptor type 1 Proteins 0.000 description 2
- 108091005682 Receptor kinases Proteins 0.000 description 2
- 150000007513 acids Chemical class 0.000 description 2
- PAZJSJFMUHDSTF-UHFFFAOYSA-N alprenolol Chemical compound CC(C)NCC(O)COC1=CC=CC=C1CC=C PAZJSJFMUHDSTF-UHFFFAOYSA-N 0.000 description 2
- 229960002213 alprenolol Drugs 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 239000003125 aqueous solvent Substances 0.000 description 2
- 238000000149 argon plasma sintering Methods 0.000 description 2
- 238000002820 assay format Methods 0.000 description 2
- 108010079452 beta Adrenergic Receptors Proteins 0.000 description 2
- 102000012740 beta Adrenergic Receptors Human genes 0.000 description 2
- 239000012148 binding buffer Substances 0.000 description 2
- 102200010673 c.385A>G Human genes 0.000 description 2
- 235000012000 cholesterol Nutrition 0.000 description 2
- WLNARFZDISHUGS-MIXBDBMTSA-N cholesteryl hemisuccinate Chemical compound C1C=C2C[C@@H](OC(=O)CCC(O)=O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 WLNARFZDISHUGS-MIXBDBMTSA-N 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000009509 drug development Methods 0.000 description 2
- 239000012636 effector Substances 0.000 description 2
- 210000003743 erythrocyte Anatomy 0.000 description 2
- 238000007710 freezing Methods 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- KWIUHFFTVRNATP-UHFFFAOYSA-N glycine betaine Chemical compound C[N+](C)(C)CC([O-])=O KWIUHFFTVRNATP-UHFFFAOYSA-N 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- 239000010931 gold Substances 0.000 description 2
- 102000055905 human ADORA2A Human genes 0.000 description 2
- 102000055126 human ADORA2B Human genes 0.000 description 2
- 102000046278 human ADORA3 Human genes 0.000 description 2
- 230000005660 hydrophilic surface Effects 0.000 description 2
- 230000003834 intracellular effect Effects 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 238000012417 linear regression Methods 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 238000002887 multiple sequence alignment Methods 0.000 description 2
- 230000007935 neutral effect Effects 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 2
- 238000002203 pretreatment Methods 0.000 description 2
- 230000002797 proteolythic effect Effects 0.000 description 2
- 238000010791 quenching Methods 0.000 description 2
- 230000000171 quenching effect Effects 0.000 description 2
- 238000003127 radioimmunoassay Methods 0.000 description 2
- 238000002708 random mutagenesis Methods 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 102220266111 rs1555187033 Human genes 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- 150000003384 small molecules Chemical class 0.000 description 2
- 238000004611 spectroscopical analysis Methods 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 230000008685 targeting Effects 0.000 description 2
- 229960000278 theophylline Drugs 0.000 description 2
- ZZJYIKPMDIWRSN-TZBSWOFLSA-N (+)-butaclamol Chemical compound C12=CC=CC=C2CCC2=CC=CC3=C2[C@@H]1CN1CC[C@@](C(C)(C)C)(O)C[C@@H]13 ZZJYIKPMDIWRSN-TZBSWOFLSA-N 0.000 description 1
- HXYCHJFUBNTKQR-RNFRBKRXSA-N (2R,3R)-heptane-1,2,3-triol Chemical compound CCCC[C@@H](O)[C@H](O)CO HXYCHJFUBNTKQR-RNFRBKRXSA-N 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- 102000004899 14-3-3 Proteins Human genes 0.000 description 1
- 101710112812 14-3-3 protein Proteins 0.000 description 1
- ZOOGRGPOEVQQDX-UUOKFMHZSA-N 3',5'-cyclic GMP Chemical compound C([C@H]1O2)OP(O)(=O)O[C@H]1[C@@H](O)[C@@H]2N1C(N=C(NC2=O)N)=C2N=C1 ZOOGRGPOEVQQDX-UUOKFMHZSA-N 0.000 description 1
- IROWCYIEJAOFOW-MERQFXBCSA-N 4-[(1r)-1-hydroxy-2-(propan-2-ylamino)ethyl]benzene-1,2-diol;hydrochloride Chemical compound Cl.CC(C)NC[C@H](O)C1=CC=C(O)C(O)=C1 IROWCYIEJAOFOW-MERQFXBCSA-N 0.000 description 1
- WNPNNLQNNJQYFA-YIDNRZKSSA-N 4-[(1r)-2-amino-1-hydroxyethyl]benzene-1,2-diol;(2r,3r)-2,3-dihydroxybutanedioic acid Chemical class OC(=O)[C@H](O)[C@@H](O)C(O)=O.NC[C@H](O)C1=CC=C(O)C(O)=C1 WNPNNLQNNJQYFA-YIDNRZKSSA-N 0.000 description 1
- HKJKONMZMPUGHJ-UHFFFAOYSA-N 4-amino-5-hydroxy-3-[(4-nitrophenyl)diazenyl]-6-phenyldiazenylnaphthalene-2,7-disulfonic acid Chemical compound OS(=O)(=O)C1=CC2=CC(S(O)(=O)=O)=C(N=NC=3C=CC=CC=3)C(O)=C2C(N)=C1N=NC1=CC=C([N+]([O-])=O)C=C1 HKJKONMZMPUGHJ-UHFFFAOYSA-N 0.000 description 1
- 101150111620 AQP1 gene Proteins 0.000 description 1
- 102000000506 Adenosine A2 Receptors Human genes 0.000 description 1
- 108010041368 Adenosine A2 Receptors Proteins 0.000 description 1
- UCTWMZQNUQWSLP-UHFFFAOYSA-N Adrenaline Natural products CNCC(O)C1=CC=C(O)C(O)=C1 UCTWMZQNUQWSLP-UHFFFAOYSA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 241000024188 Andala Species 0.000 description 1
- 108010049777 Ankyrins Proteins 0.000 description 1
- 102000008102 Ankyrins Human genes 0.000 description 1
- 108091023037 Aptamer Proteins 0.000 description 1
- 102000010637 Aquaporins Human genes 0.000 description 1
- 108010063290 Aquaporins Proteins 0.000 description 1
- 108010014223 Armadillo Domain Proteins Proteins 0.000 description 1
- 102000008081 Arrestins Human genes 0.000 description 1
- 108010074613 Arrestins Proteins 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- 101100228200 Caenorhabditis elegans gly-5 gene Proteins 0.000 description 1
- 108010078311 Calcitonin Gene-Related Peptide Receptors Proteins 0.000 description 1
- 102000014468 Calcitonin Gene-Related Peptide Receptors Human genes 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- YZCKVEUIGOORGS-OUBTZVSYSA-N Deuterium Chemical group [2H] YZCKVEUIGOORGS-OUBTZVSYSA-N 0.000 description 1
- QRLVDLBMBULFAL-UHFFFAOYSA-N Digitonin Natural products CC1CCC2(OC1)OC3C(O)C4C5CCC6CC(OC7OC(CO)C(OC8OC(CO)C(O)C(OC9OCC(O)C(O)C9OC%10OC(CO)C(O)C(OC%11OC(CO)C(O)C(O)C%11O)C%10O)C8O)C(O)C7O)C(O)CC6(C)C5CCC4(C)C3C2C QRLVDLBMBULFAL-UHFFFAOYSA-N 0.000 description 1
- 102000004980 Dopamine D2 Receptors Human genes 0.000 description 1
- 108090001111 Dopamine D2 Receptors Proteins 0.000 description 1
- 101000925646 Enterobacteria phage T4 Endolysin Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 102100038195 Exonuclease mut-7 homolog Human genes 0.000 description 1
- 108700025474 F 372 Proteins 0.000 description 1
- 102100031812 Fibulin-1 Human genes 0.000 description 1
- 101710170731 Fibulin-1 Proteins 0.000 description 1
- 102000008924 GPCR kinases Human genes 0.000 description 1
- 108050000878 GPCR kinases Proteins 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- 108010027915 Glutamate Receptors Proteins 0.000 description 1
- 102000018899 Glutamate Receptors Human genes 0.000 description 1
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical compound OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 description 1
- SQUHHTBVTRBESD-UHFFFAOYSA-N Hexa-Ac-myo-Inositol Natural products CC(=O)OC1C(OC(C)=O)C(OC(C)=O)C(OC(C)=O)C(OC(C)=O)C1OC(C)=O SQUHHTBVTRBESD-UHFFFAOYSA-N 0.000 description 1
- 102000010029 Homer Scaffolding Proteins Human genes 0.000 description 1
- 108010077223 Homer Scaffolding Proteins Proteins 0.000 description 1
- 101001033280 Homo sapiens Cytokine receptor common subunit beta Proteins 0.000 description 1
- 101000958030 Homo sapiens Exonuclease mut-7 homolog Proteins 0.000 description 1
- 101000613565 Homo sapiens PRKC apoptosis WT1 regulator protein Proteins 0.000 description 1
- 101001135199 Homo sapiens Partitioning defective 3 homolog Proteins 0.000 description 1
- 101001098557 Homo sapiens Proteinase-activated receptor 3 Proteins 0.000 description 1
- 101001113471 Homo sapiens Proteinase-activated receptor 4 Proteins 0.000 description 1
- 101100189057 Homo sapiens SLC10A3 gene Proteins 0.000 description 1
- 101000670960 Homo sapiens V-type proton ATPase subunit E 1 Proteins 0.000 description 1
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- HQVHOQAKMCMIIM-FQEVSTJZSA-N LSM-15495 Chemical compound C([C@H]1COC=2C=CC=C3C(C(=O)C=4C5=CC=CC=C5C=CC=4)=C(N1C3=2)C)N1CCOCC1 HQVHOQAKMCMIIM-FQEVSTJZSA-N 0.000 description 1
- 102220531100 Left-right determination factor 1_D322A_mutation Human genes 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- 102000019298 Lipocalin Human genes 0.000 description 1
- 108050006654 Lipocalin Proteins 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 241000203353 Methanococcus Species 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- UQOFGTXDASPNLL-XHNCKOQMSA-N Muscarine Chemical compound C[C@@H]1O[C@H](C[N+](C)(C)C)C[C@H]1O UQOFGTXDASPNLL-XHNCKOQMSA-N 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- 102000004868 N-Methyl-D-Aspartate Receptors Human genes 0.000 description 1
- 108090001041 N-Methyl-D-Aspartate Receptors Proteins 0.000 description 1
- 102000034570 NR1 subfamily Human genes 0.000 description 1
- 102100034437 Neurabin-2 Human genes 0.000 description 1
- 108091014434 Neurabin-2 Proteins 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 102000015636 Oligopeptides Human genes 0.000 description 1
- 108010038807 Oligopeptides Proteins 0.000 description 1
- 102220575423 Opioid growth factor receptor-like protein 1_V57A_mutation Human genes 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 102000000470 PDZ domains Human genes 0.000 description 1
- 108050008994 PDZ domains Proteins 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102100027184 Periplakin Human genes 0.000 description 1
- 101710202907 Periplakin Proteins 0.000 description 1
- 102220509039 Platelet-activating factor acetylhydrolase IB subunit beta_L72A_mutation Human genes 0.000 description 1
- 229920002582 Polyethylene Glycol 600 Polymers 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 1
- 102100037133 Proteinase-activated receptor 3 Human genes 0.000 description 1
- 102100023710 Proteinase-activated receptor 4 Human genes 0.000 description 1
- 102220469369 Putative uncharacterized protein URB1-AS1_A69L_mutation Human genes 0.000 description 1
- 102000008944 RGS Proteins Human genes 0.000 description 1
- 108010074020 RGS Proteins Proteins 0.000 description 1
- 101100163076 Rattus norvegicus Aqp4 gene Proteins 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 241000235346 Schizosaccharomyces Species 0.000 description 1
- 102100028927 Secretin receptor Human genes 0.000 description 1
- 241000710961 Semliki Forest virus Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 244000300264 Spinacia oleracea Species 0.000 description 1
- 235000009337 Spinacia oleracea Nutrition 0.000 description 1
- 241000256251 Spodoptera frugiperda Species 0.000 description 1
- 101000677856 Stenotrophomonas maltophilia (strain K279a) Actin-binding protein Smlt3054 Proteins 0.000 description 1
- 244000057717 Streptococcus lactis Species 0.000 description 1
- 235000014897 Streptococcus lactis Nutrition 0.000 description 1
- 102220499982 Target of EGR1 protein 1_F79A_mutation Human genes 0.000 description 1
- 102000003790 Thrombin receptors Human genes 0.000 description 1
- 108090000166 Thrombin receptors Proteins 0.000 description 1
- ATJFFYVFTNAWJD-UHFFFAOYSA-N Tin Chemical compound [Sn] ATJFFYVFTNAWJD-UHFFFAOYSA-N 0.000 description 1
- 108020004566 Transfer RNA Proteins 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 102220532456 Vitamin D3 receptor_H75A_mutation Human genes 0.000 description 1
- 238000002441 X-ray diffraction Methods 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 229940102884 adrenalin Drugs 0.000 description 1
- 230000001800 adrenalinergic effect Effects 0.000 description 1
- 108010042113 alanine receptor Proteins 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- 229940125516 allosteric modulator Drugs 0.000 description 1
- 125000000266 alpha-aminoacyl group Chemical group 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 238000010009 beating Methods 0.000 description 1
- 229960003237 betaine Drugs 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 238000004166 bioassay Methods 0.000 description 1
- 238000010170 biological method Methods 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 108010007701 calcyon Proteins 0.000 description 1
- 238000005515 capillary zone electrophoresis Methods 0.000 description 1
- BQXQGZPYHWWCEB-UHFFFAOYSA-N carazolol Chemical compound N1C2=CC=CC=C2C2=C1C=CC=C2OCC(O)CNC(C)C BQXQGZPYHWWCEB-UHFFFAOYSA-N 0.000 description 1
- 229960004634 carazolol Drugs 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 230000003833 cell viability Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 238000002983 circular dichroism Methods 0.000 description 1
- 238000001142 circular dichroism spectrum Methods 0.000 description 1
- 238000012411 cloning technique Methods 0.000 description 1
- 238000002288 cocrystallisation Methods 0.000 description 1
- 238000007398 colorimetric assay Methods 0.000 description 1
- 230000001447 compensatory effect Effects 0.000 description 1
- 230000009137 competitive binding Effects 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 239000000287 crude extract Substances 0.000 description 1
- 230000008025 crystallization Effects 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 229910052805 deuterium Inorganic materials 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 239000010432 diamond Substances 0.000 description 1
- UVYVLBIGDKGWPX-KUAJCENISA-N digitonin Chemical compound O([C@@H]1[C@@H]([C@]2(CC[C@@H]3[C@@]4(C)C[C@@H](O)[C@H](O[C@H]5[C@@H]([C@@H](O)[C@@H](O[C@H]6[C@@H]([C@@H](O[C@H]7[C@@H]([C@@H](O)[C@H](O)CO7)O)[C@H](O)[C@@H](CO)O6)O[C@H]6[C@@H]([C@@H](O[C@H]7[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O7)O)[C@@H](O)[C@@H](CO)O6)O)[C@@H](CO)O5)O)C[C@@H]4CC[C@H]3[C@@H]2[C@@H]1O)C)[C@@H]1C)[C@]11CC[C@@H](C)CO1 UVYVLBIGDKGWPX-KUAJCENISA-N 0.000 description 1
- UVYVLBIGDKGWPX-UHFFFAOYSA-N digitonine Natural products CC1C(C2(CCC3C4(C)CC(O)C(OC5C(C(O)C(OC6C(C(OC7C(C(O)C(O)CO7)O)C(O)C(CO)O6)OC6C(C(OC7C(C(O)C(O)C(CO)O7)O)C(O)C(CO)O6)O)C(CO)O5)O)CC4CCC3C2C2O)C)C2OC11CCC(C)CO1 UVYVLBIGDKGWPX-UHFFFAOYSA-N 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- 229940042399 direct acting antivirals protease inhibitors Drugs 0.000 description 1
- 230000007783 downstream signaling Effects 0.000 description 1
- 238000007877 drug screening Methods 0.000 description 1
- 239000003596 drug target Substances 0.000 description 1
- 238000002003 electron diffraction Methods 0.000 description 1
- WVLHGCRWEHCIOT-UHFFFAOYSA-N eltoprazine Chemical compound C1CNCCN1C1=CC=CC2=C1OCCO2 WVLHGCRWEHCIOT-UHFFFAOYSA-N 0.000 description 1
- 229950006047 eltoprazine Drugs 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 229940088598 enzyme Drugs 0.000 description 1
- 238000011067 equilibration Methods 0.000 description 1
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 238000002866 fluorescence resonance energy transfer Methods 0.000 description 1
- 238000002189 fluorescence spectrum Methods 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 239000003205 fragrance Substances 0.000 description 1
- 230000008014 freezing Effects 0.000 description 1
- 238000002825 functional assay Methods 0.000 description 1
- 229960003692 gamma aminobutyric acid Drugs 0.000 description 1
- BTCSSZJGUNDROE-UHFFFAOYSA-N gamma-aminobutyric acid Chemical compound NCCCC(O)=O BTCSSZJGUNDROE-UHFFFAOYSA-N 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000012268 genome sequencing Methods 0.000 description 1
- 229930195712 glutamate Natural products 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 229960002897 heparin Drugs 0.000 description 1
- 229920000669 heparin Polymers 0.000 description 1
- 235000014304 histidine Nutrition 0.000 description 1
- 125000000487 histidyl group Chemical class [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 1
- 102000055647 human CSF2RB Human genes 0.000 description 1
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 238000002169 hydrotherapy Methods 0.000 description 1
- 230000001771 impaired effect Effects 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 229960000367 inositol Drugs 0.000 description 1
- 239000002198 insoluble material Substances 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 108010024941 iodothyronine deiodinase type II Proteins 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 125000001909 leucine group Chemical group [H]N(*)C(C(*)=O)C([H])([H])C(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 102000006240 membrane receptors Human genes 0.000 description 1
- 108020004084 membrane receptors Proteins 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000000302 molecular modelling Methods 0.000 description 1
- 230000003551 muscarinic effect Effects 0.000 description 1
- CGVLVOOFCGWBCS-RGDJUOJXSA-N n-octyl β-d-thioglucopyranoside Chemical compound CCCCCCCCS[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O CGVLVOOFCGWBCS-RGDJUOJXSA-N 0.000 description 1
- 239000002107 nanodisc Substances 0.000 description 1
- 229940126662 negative allosteric modulator Drugs 0.000 description 1
- 230000003472 neutralizing effect Effects 0.000 description 1
- 239000012454 non-polar solvent Substances 0.000 description 1
- 230000009871 nonspecific binding Effects 0.000 description 1
- IXQGCWUGDFDQMF-UHFFFAOYSA-N o-Hydroxyethylbenzene Natural products CCC1=CC=CC=C1O IXQGCWUGDFDQMF-UHFFFAOYSA-N 0.000 description 1
- FATBGEAMYMYZAF-KTKRTIGZSA-N oleamide Chemical compound CCCCCCCC\C=C/CCCCCCCC(N)=O FATBGEAMYMYZAF-KTKRTIGZSA-N 0.000 description 1
- FATBGEAMYMYZAF-UHFFFAOYSA-N oleicacidamide-heptaglycolether Natural products CCCCCCCCC=CCCCCCCCC(N)=O FATBGEAMYMYZAF-UHFFFAOYSA-N 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 230000026792 palmitoylation Effects 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 238000002497 phase alternated rotation of magnetisation Methods 0.000 description 1
- 235000021317 phosphate Nutrition 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 238000013081 phylogenetic analysis Methods 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 229920002523 polyethylene Glycol 1000 Polymers 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 229920005862 polyol Polymers 0.000 description 1
- 150000003077 polyols Chemical class 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 229960004604 propranolol hydrochloride Drugs 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 239000002469 receptor inverse agonist Substances 0.000 description 1
- 102000027426 receptor tyrosine kinases Human genes 0.000 description 1
- 108091008598 receptor tyrosine kinases Proteins 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 238000002702 ribosome display Methods 0.000 description 1
- 102200002568 rs116843064 Human genes 0.000 description 1
- 102220005206 rs33974936 Human genes 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000013077 scoring method Methods 0.000 description 1
- CDAISMWEOUEBRE-UHFFFAOYSA-N scyllo-inosotol Natural products OC1C(O)C(O)C(O)C(O)C1O CDAISMWEOUEBRE-UHFFFAOYSA-N 0.000 description 1
- 108700027603 secretin receptor Proteins 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 238000007811 spectroscopic assay Methods 0.000 description 1
- DKGZKTPJOSAWFA-UHFFFAOYSA-N spiperone Chemical compound C1=CC(F)=CC=C1C(=O)CCCN1CCC2(C(NCN2C=2C=CC=CC=2)=O)CC1 DKGZKTPJOSAWFA-UHFFFAOYSA-N 0.000 description 1
- 229950001675 spiperone Drugs 0.000 description 1
- 238000012430 stability testing Methods 0.000 description 1
- 210000004722 stifle Anatomy 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 238000002198 surface plasmon resonance spectroscopy Methods 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000010257 thawing Methods 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 230000036964 tight binding Effects 0.000 description 1
- 238000003146 transient transfection Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 239000013638 trimer Substances 0.000 description 1
- 230000024883 vasodilation Effects 0.000 description 1
- 238000011179 visual inspection Methods 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Landscapes
- Peptides Or Proteins (AREA)
Abstract
A method for selecting a G-protein coupled receptor (GPCR) with increased stability, the method comprising (a) providing one or more mutants of a parent GPCR, (b) selecting a ligand, the ligand being one which binds to the parent GPCR when the GPCR is residing in a particular conformation, 5 (c) determining whether the or each mutant GPCR has increased stability with respect to binding the selected ligand compared to the stability of the parent GPCR with respect to binding that ligand, and (d) selecting those mutants that have an increased stability compared to the parent GPCR with respect to binding the selected ligand. Mutants of p-adrenergic receptor, adenosine receptor and neurotensin receptor are also disclosed.
Description
P/001011I Regulaton 3.2 AUSTRALIA Patents Act 1990 COMPLETE SPECIFICATION STANDARD PATENT Invention Title: Mutant proteins and methods for selecting them The following statement is a full description of this invention, including the best method of performing it known to us: IA MUTANT PROTEINS AND METHODS FOR SELECTING THEM The present invention relates to mutant G protein coupled receptors (GPCRs) and methods for selecting those with increased stability. In particular, it relates to the selection and preparation of mutant GPCRs which have increased stability under a particular condition compared to their respective parent proteins. Such proteins a-e more likely to be crystallisable, and hence amenable to structure determination, than the parent proteins. They are also useful for drug discovery and development studies. Over the past 20 years the rate of determination of membrane protein structures has gradually increased, but most success has been in crystallising membrane proteins from bacteria rather than from eukaryotes [1]. Bacterial membrane proteins have been easier to overexpress using standard techniques in Escherichia coli than eukaryotic membrane proteins [2,3] and the bacterial proteins are sometimes far more stable in detergent, detergent-stability being an essential prerequisite to purification and crystallisation. Genome sequencing projects have also allowed the cloning and expression of many bomologues of a specific transporter or ion channel, which also greatly improves the chances of success during crystallisation. However, out of the 120 different membrane protein structures that have been solved to date, there are only seven structures of mammalian integral membrane proteins (http*J/blanco.biomol.uci.edui); five of these membrane proteins were purified from natural sources and are stable in detergent solutions. Apart from the difficulties in overexpressing eukaryotic membrane proteins, they often have poor stability in detergent solutions, which severely restricts the range of crystallisation conditions that can be explored without their immediate denaturation or precipitation. Ideally, membrane proteins should be stable for many days in any given detergent solution, but the detergents that are best suited to growing diffraction-quality crystals tend to be the most destabilising detergents ie those with short aliphatic chains and small or charged head groups. It is also the structures of humanmembrane proteins that we would like to solve, because these are required to help the development of therapeutic agents by the pharmaceutical industry; often there are substantial differences in the pharmacology of receptors, channels and transporters from different mammals, whilst yeast and bacterial genomes may not include any homologous proteins. There is thus an overwhelming need to develop a generic strategy that will allow 5 the production of detergent-stable eukaryotic integral membrane proteins for crystallisation and structure determination and potentially for other purposes such as drug screening, bioassay and biosensor applications. Membrane proteins have evolved to be sufficiently stable in the membrane to 10 ensure cell viability, but they have not evolved to be stable in detergent solution, suggesting that membrane proteins could be artificially evolved and detergent stable mutants isolated [4]. This was subsequently demonstrated for two bacterial proteins, diacylglycerol kinase (DGK) [5,6] and bacteriorhodopsin [7]. Random mutagenesis of DGK identified specific point mutations that increased 15 thermostability and, when combined, the effect was additive so that the optimally stable mutant had a half-life of 35 minutes at 80*C compared with a half-life of 6 minutes at 55"C for the native protein [6]. It was shown that the trimer of the detergent-resistant DGK mutant had become stable in SDS and it is thus likely that stabilisation of the oligomeric state played a significant role in 0 thennostabilisation. Although the aim of the mutagenesis was to produce a membrane protein suitable for crystallisation, the structure of DGK has yet to be determined and there have been no reports of successful crystallization. A further study on bacteriorhodopsin by cysteine-scanning mutagenesis along helix B demonstrated that it was not possible to predict which amino acid residues would i lead to thennostability upon mutation nor, when studied in the context of the structure, was it clear why thermostabilisation had occurred [7]. GPCRs constitute a very large family of proteins that control many physiological processes and are the targets of many effective drugs. Thus, they are of considerable pharmacological importance. A list of GPCRs is -given in Foord et al (2005) Pharmacol.Rev. 57, 279-288, which is incorporated herein by reference. GPCRs are generally unstable when isolated, and despite considerable efforts, it 3 has not been possible to crystallise any except bovine rhodopsin, which naturally is exceptionally stable. GPCRs are druggable targets, and reference is made particularly to Overington et 5 al (2006) Nature Rev. Drug Discovery 5, 993-996 which indicates that over a. quarter of present drugs have a GPCR as a target GPCRs are thought to exist in multiple distinct confonnations which are associated with different pharmacological classes of ligand such as agonists and 10 antagonists, and to cycle between these conformations in order to function (Xenakin T. (1997)Ann NYAcadSci 812, 116-125). It will be appreciated that the methods of the invention do not include a method as described in D'Antona et al., including binding of [H]CP55940 to a 15 constitutively inactive mutant human cannabinoid receptor I (T21 0A) in which the Thr residue at position 210 is replaced with an Ala residue. The listing or discussion of an apparently prior-published document in this specification should not necessarily be taken as an acknowledgement that the to document is part of the state of the art or is common general knowledge. We have realised that there are two serious problems associated with trying to crystallise GPCRs, namely their lack of stability in detergent and the fact that they exist in multiple conformations. In order to function GPCRs have evolved to 5 cycle through at least two distinct conformations, the agonist-bound form and the antagonist-bound form, and changes between these two conformations can occur spontaneously in the absence of ligand. It is thus likely that any purified receptors populate a mixture of conformations. Just adding ligands to GPCRs during crystallisation trials has not resulted in their structure determination. To improve the likelihood of crystallisation, we therefore selected mutations that improved the .stability of the GPCR and, in addition, preferentially locked the receptor in a specific biologically relevant conformation.
4 We decided to see whether stabilisation of a GPCR in a particular, biologically relevant conformation was possible and whether the effect was sufficiently great that it would significantly improve the chances of obtaining diffraction-quality crystals. In Example 1, the p1-adrenergic receptor (PAR) from turkey erythrocytes [8] was chosen as a test subject for this study for a number of reasons. The PAR is a G protein-coupled receptor (GPCR) that has well developed pharmacology with many ligands commercially available and in a radiolabelled form. In addition, overexpression of PAR has been particularly successful using the baculovirus expression system and it can be purified in milligram quantities in a functional form [9]. In Example 2, a human adenosine receptor was used, and in Example 3, a rat neurotensin receptor was used. As used herein, except where the context requires-otherwise, the term "comprise" and variations of the term, such as "comprising", "comprises" and "comprised", are not intended to exclude other additives, components, integers or steps. Reference to any prior art in the specification is not, and should not be taken as, an acknowledgment, or any form of suggestion, that this prior art forms part of the common general knowledge in Australia or any other jurisdiction or that this prior art could reasonably be expected to be ascertained, understood and regarded as relevant by a person skilled in the art. Method for selecting mutant GPCRs with increased stability A first aspect of the invention provides a method for selecting a mutant G-protein coupled receptor (GPCR) with increased stability, the method comprising (a) providing one or more mutants of a parent GPCR, (b) selecting a ligand, the ligand being one which binds to the parent GPCR when the GPCR is residing in a particular conformation, 4A (c) determining whether the or each mutant GPCR has increased stability with respect to binding the selected ligand compared to the stability of the parent GPCR with respect to binding that ligand, and (d) selecting those mutants that have an increased stability compared to the parent GPCR with respect to binding of the selected ligand. In one embodiment, the invention provides a method for selecting a G-protein coupled receptor (GPCR) with increased stability, the method comprising (a) providing one or more mutants of a parent GPCR, (b) selecting a ligand, the ligand being one which binds to the parent GPCR when the GPCR is residing in a particular conformation, (c) determining whether the or each mutant GPCR has increased conformational stability with respect to binding the selected ligand compared to the conformational stability of the parent GPCR with respect to binding that ligand, and (d) selecting those mutants that have an increased conformational stability compared to the parent GPCR with respect to binding the selected ligand; wherein the particular conformation in which the GPCR resides in step (c) corresponds to the class of ligand selected in step (b).
Ss The inventors have appreciated that, in order to improve the likelihood of crystallisation of a GPCR in a biologically relevant form (which is therefore phannacologically useful), it is desirable not only to increase the stability of the protein, but also for the protein to have this increased stability when in a s particular conformation. The conformation is determined by a selected ligand, and is a biologically relevant conformation in particular a pharmacologically relevant conformation. Thus, the method of the invention may be considered to be a method for selecting mutants of a GPCR which have increased stability of a particular conformation, for example they may have increased conformational 10 thermostability. - The method may be used to create stable, conformationally locked GPCRs by mutagenesis. The selected mutant GPCRs are effectively purer forms of the parent molecules in that a much higher proportion of them occupies a particular conformational state. The deliberate selection of a chosen receptor conformation resolved from other conformations by use of a ligand (or ligands) 15 that bind preferentially to this conformation is therefore an important feature of the invention. The method may also be considered to be a method for selecting utant GPCaNwhich are more tractable to crystallisation. Thus the invention includes a method for selecting a mutant G-protein coupled 20 receptor (GPCR) with increased stability, the method comprising (a) providing one or more mutants of a parent GPCR, (b) selecting a ligand, the ligand being one which binds to the parent 25 GPCR when the GPCR is residing in a particular conformation, (c) determining whether the or each mutant GPCR when residing in the particular conformation has increased stability with respect to binding the selected ligand compared to the stability of the parent 30 GPCR when residing in the same particular conformation with respect to binding that ligand, and 6 (d) selecting those mutants that have an increased stability compared to the parent GPCR with respect to binding of the selected ligand. In a review of the druggable genome by Hopkins & Groom (2002) Nature Rev. 5 Drug Discovery 1, 727-730, Table 1 contains a list of protein families many of which are GPCRs. Overington et al (2006) Nature Rev. Drug Discovery 5, 993 996 provides more details of drug targets, and Figure 1 indicates that more than a quarter of current drugs target GPCRs. There are 52 GCR targets for orally available drugs out of a total of 186 total targets in this category. 10 Suitable GPCRs for use in the practice of the invention include, but are not limited to A-adrenergic receptor, adenosine receptor, in particular adenosine A2a receptor, and neurotensin receptor (NTR). Other suitable GPCRs are well kno-wn in the art and include those listed in Hopkins & Groom supra. In addition, the 15 International Union of Pharmacology produce a list of GPCRs (Foord et al (2005) Pharmacol. Rev. 57, 279-288, incorporated herein by reference and this list is periodically updated at http://www.iuphar db.org/GPCR/ReceptorFamiliesForward). It will be noted that GPCRs are divided into different classes, principally based on their amino acid sequence m0 similarities. They are also divided into families by reference to the natural ligands to which they bind. All GPCRs are.included in the scope of the invention. The amino acid sequences (and the nucleotide sequences of the cDNAs which encode them) of many GPCRs are readily available, for example by reference to 5 GenBank. In particular, Foord et al supra gives the human gene symbols and human, mouse and rat gene IDs from Entrez Gene (http://www.ncbi.nlm.nih. gov/entrez). It should be noted, also, that because the sequence of the human genome is substantially complete, the amino acid sequences of human GPCRs can be deduced therefrom. Although the GP CR may be derived from any source, it is particularly preferred if it is from a eukaryotic source. It is particularly preferred if it is derived from a 7 vertebrate source such as a mammal or a bird. It is particularly preferred if the GPCR is derived from rat, mouse, rabbit or dog or non-human primate or man, or from chicken or turkey. For the avoidance of doubt, we include within the meaning of "derived from" that a cDNA or gene was originally obtained using 5 genetic material from the source, but that the protein may be expressed in any host cell subsequently. Thus, it will be plain that a eularyotic GPCR (such as an avian or mammalian GPCR) may be expressed in a prokaryotic host cell, such as E. coli, but be considered to be avian- or mammalian-derived, as the case may be. jo In some instances, the GPCR may be composed of more than one different subunit. For example, the calcitonin gene-related peptide receptor requires the binding of a single transmembrane helix protein (RAMPI) to acquire its physiological ligand binding characteristics. Effector, accessory, auxiliary or GPCR-interacting proteins which combine with the GPCR to form or modulate a 15 functional complex are well known in the art and include, for example, receptor kinases, G-proteins and arrestins (Bockaert et al (2004) Curr Opinion Drug Discov and Dev 7, 649-65 7). The mutants of the parent GPCR may be produced in any suitable way and io provided in any suitable form. Thus, for example, a series of specific mutants of the parent protein may be made in which each amino acid residue in all or a part of the parent protein is independently changed to another amino acid residue. For example, it may be convenient to make mutations in those parts of the protein which are predicted to be membrane spanning. The three-dimensional structure s of rhodopsin is known (Li et al (2004) JMot Biol 343, 1409-1438; Palczewski et al (2000) Science 289, 739-745), and it is possible to model certain GPCRs using this stmcture. Thus, conveniently, parts of the GPCR to mutate may be based on modelling. Similarly, computer programs are available which model transmembrane regions of GPCRs based on hydrophobicity (Kyle & Dolittle (1982) J Mol. Biol 157, 105-132), and use can be made of such models when selecting parts of the protein to mutate. Conventional site-directed mutagenesis may be employed, or polymerase chain reaction-based procedures well known in 8 the art may be used. It is possible, but less desirable, to use ribosome display methods in the selection of the mutant protein. Typically, each selected amino acid is replaced by Ala (ie Ala-scanning 5 mutagenesis), although it may be replaced by any other amino acid. If the selected amino acid is Ala, it may conveniently be replaced by Leu. Alternatively, the amino acid may be replaced by Gly (ie Gly-scanning mutagenesis), which may allow a closer packing of neighbouring helices that may lock the protein in a particular conformation. If the selected amino acid is Gly, it 10 may conveniently be replaced by Ala. Although the amino acid used to replace the given amino acid at a particular position is typically a naturally occurring amino acid, typically an "encodeable" amino acid, it may be a non-natural amino acid (in which case the protein is 15 typically made by chemical synthesis or by use of non-natural amino-acyl tRNAs). An "encodeable" amino acid is one which is incorporated into a polypeptide by translation of mRNA. It is also possible to create non-natural amino acids or introduce non-peptide linkages at a given position by covalent chemical modification, for example by post-translational treatment of the protein 20 or semisynthesis. These post-translational modifications may be natural, such as phosphorylation, glycosylation or palmitoylation, or synthetic or biosynthetic. Alternatively, the mutants may be produced by a random mutagenesis procedure, which may be of the whole protein or of a selected portion thereof. Random 25 mutagenesis procedures are well known in the art. Conveniently, the mutant GPCR has one replaced amino acid compared to the parent protein (ie it is mutated at one amino acid position). In this way, the contribution to stability of a single amino acid replacement may be assessed. 3M However, the mutant GPCR assayed for stability may have more than one replaced amino acid compared to the parent protein, such as 2 or 3 or 4 or 5 or 6 replacements.
9 As is discussed in more detail below, combinations of mutations may be made based on the results of the selection method. It has been found that in some specific cases combining mutations in a single mutant protein leads to a further 5 increase in stability. Thus, it will be appreciated that the method of the invention can be used in an iterative way by, for example, carrying it out to identify single mutations which increase stability, combining those mutations in a single mutant GPCRs which is the GPCR then provided in part (a) of the method. Thus, multiply-mutated mutant proteins can be selected using the method. 10 The parent GPCR need not be the naturally occurring protein. Conveniently, it may be an engineered version which is capable of expression in a suitable host organism, such as Escherichia coli. For example, as described in Example 1, a convenient engineered version of the turkey p-adrenergic receptor is one which is 15 truncated and lacks residues 1-33 of the amino acid sequence (ie PAR 3 4 2 4 ). The parent GPCR may be a truncated form of the naturally occurring protein (truncated at either or both ends), or it may be a fusion, either to the naturally occurring protein or to a fragment thereof Alternatively or additionally, the parent GPCR, compared to a naturally-occurring GPCR, may be modified in 20 order to improve, for example, solubility, proteolytic stability (eg by truncation, deletion of loops, mutation of glycosylation sites or mutation of reactive amino acid side chains such as cysteine). In any event, the parent GPCR is a protein that is able to bind to the selected ligand which ligand is one which is known to bind the naturally occurring GPCR. Conveniently, the parent GPCR is one which, on !5 addition of an appropriate ligand, can affect any one or more of the downstream activities which are commonly known to be affected by G-protein activation. However, it will be appreciated that the stability of the mutant is to be compared to a parent in order to be able to assess an increase in stability. 0 A ligand is selected, the ligand being one which binds to the parent GPCR when residing in a particular conformation. Typically, the ligand will bind to one 10 confornation of the parent GPCR (and may cause the GPCR to adopt this conformation), but does not bind as strongly to another conformation that the GPCR may be able to adopt. Thus, the presence of the ligand may be considered to encourage the GPCR to adopt the particular conformation. Thus, the method 5 may be considered to be a way of selecting mutant GPCRs which are trapped in a conformation of biological relevance (eg ligand bound state), and which are more stable with respect to that conformation. Preferably the particular conformation in which the GPCR resides in step (c) 10 corresponds to the class of ligand selected in step (b). Preferably the selected ligand is from the agonist class of ligands and the particular conformation is an agonist conformation, or the selected ligand is from the antagonist class of ligands and the particular conformation is an antagonist 15 confonnation. Preferably the selected ligand is from the agonist class of ligands and the particular conformation in which the GPCR resides in step (c) is the agonist conformation. 20 Preferably, the selected ligand binding affnity for the mutant receptor should be equal to or greater than that for the wild type receptor, mutants that exhibit significantly reduced binding to the selected ligand are typically rejected. By "ligand" we include any molecule which binds to the GPCR and which causes 25 the GPCR to reside in a particular conformation. The ligand preferably is one which causes more than half of the GPCR molecules overall to be in a particular conformation. Many suitable ligands are known. 30 Typically, the ligand is a full agonist and is able to bind to the GPCR and is capable of eliciting a full (100%) biological response, measured for example by 11 G-protein coupling, downstream signalling events or a physiological output such as vasodilation. Thus, typically, the biological response is GDP/GTP exchange in a G-protein, 'followed by stimulation of the linked effector pathway. The measurement, typically, is GDP/GTP exchange or a change in the level of the end 5 product of the pathway (eg cAMP, cGMP or inositol phosphates). The ligand may also be a partial agonist and is able to bind to the GPCR and is capable of eliciting a partial (<100%) biological response. The ligand may also be an inverse agonist, which is a molecule which binds to a 10 receptor and reduces its basal (ie unstimulated by agonist) activity sometimes even to zero. The ligand may also be an antagonist, which is a molecule which binds to a receptor and blocks binding of an agonist, so preventing a biological response. 15 Inverse agonists and partial agonists may under certain assay conditions be antagonists. The above ligands may be orthosteric, by which we include the meaning that they combine with the same site as the endogenous agonist; or they may be allosteric 20 or allotopic, by which we include the meaning that they combine with a site distinct from the orthosteric site. The above ligands may be syntopic, by which we include the meaning that they interact with other ligand(s) at the same or an overlapping site. They may be reversible or irreversible. 25 In relation to antagonists, they may be surmountable, by which we include the meaning that the maximum effect of agonist is not reduced by either pre-treatment or simultaneous treatment with antagonist; or they may be insurmountable, by which we include the meaning that the maximum effect of agonist is reduced by either pre-treatment or simultaneous treatment with antagonist; or they may be 30 neutral, by which we include the meaning the antagonist is one without inverse agonist or partial agonist activity. Antagonists typically are also inverse agonists.
12 Ligands for use in the invention may also be allosteric modulators such as positive allosteric modulators, potentiators, negative allosteric modulators and inhibitors. They may have activity as agonists or inverse agonists in their own right or they may only have activity in the presence of an agonist or inverse s agonist in which case they are used in combination with such molecules in order to bind to the GPCR. Neubig et al (2003) Pharmacol. Rev. 55, 597-606, incorporated herein by reference, describes various classes of ligands. 10 Preferably, the above-mentioned ligands are small organic or inorganic moieties, but they may be peptides or polypeptides. Typically, when the ligand is a small organic or organic moiety, it has a Mr of from 50 to 2000, such as from 100 to 1000, for example from 100 to 500. 15 Typically, the ligand binds to the GPCR with a Kd of from mM to pM, such as in the range of from FM (micromolar) to nM. Generally, the ligands with the lowest Kd are prefeied. 20 Small organic molecule ligands are well known in the art, for example see the Examples below. Other small molecule ligands include 5HT which is a full agonist at the 5HTIA receptor; eltoprazine which is a partial agonist at the 5HTIA receptor (see Newman-Tancredi et al (1997) Neurophaiacology 36, 451 459); (+)-butaclamol and spiperone are dopamine D2 receptor inverse agonists 25 (see Roberts & Strange (2005) Br. J Pharnacol. 145, 34-42); and WIN55212-3 is a neutral antagonist of CB2 (Savinainen et al (2005) Br. J. Pharmacol. 145, 636 645). The ligand may be a peptidomnaimetic, a nucleic acid, a peptide nucleic acid (PNA) 0 or an aptamer. It may be an ion such as Na 4 or Zn 2 +, a lipid such as oleamide, or a carbohydrate such as heparin.
13 The ligand may be a polypeptide which binds to the GPCR. Such polypeptides (by which we include oligopeptides) are typically from Mr 500 to Mr 50,000, but may be larger.' The polypeptide may be a naturally occurring GPCR-interacting protein or other protein which interacts with the GPCR, or a derivative or 5 fragment thereof, provided that it binds selectively to the GPCR in a particular conformation. GPCR-interacting proteins include those associated with signalling and those associated with trafficking, which often act via PDZ domains in the C terminal portion of the GPCR 10 Polypeptides which are known to bind certain GPCRs include any of a G protein, an arrestin, a RGS protein, G protein receptor kinase, a RAvfP, a 14-3-3 protein, a NSF, a periplakin, a spinophilin, a GPCR kinase, a receptor tyrosine kinase, an ion channel or subunit thereof, an ankyrin and a Shanks or Homer protein. Other polypeptides include NMDA receptor subunits NR1 or NR2a, calcyon, or a is fibronectin domain framework. The polypeptide may be one which binds to an extracellular domain of a GPCR, such as fibulin-1. The polypeptide may be another GPCR, which binds to the selected GPCR in a hetero-oligomer. A review of protein-protein interactions at GPCRs is found in Milligan & White (2001) Trends Pharmacol. Sci. 22, 513-518, or in Bockaert et al (2004) Curr. Opinion 20 'DrugDiscov. Deiv 7,-649-657 incorporated herein by reference. The polypeptide ligand may conveniently be an antibody which binds to the GPCR. By the term "antibody" we include naturally-occurring antibodies, monoclonal antibodies and fragments thereof. We also include engineered 25 antibodies and molecules which are antibody-like in their binding characteristics, including single chain Fv (scFv) molecules and domain antibodies (dAbs). Mention is also made of camelid antibodies and engineered camelid antibodies. Such molecules which bind GPCRs are known in the art and in any event can be made using well known technology. Suitable antibodies include ones presently 30 used in radioimmunoassay (RIAs) for GPCRs since they tend to recognise confonnational epitopes.
14 The polypeptide may also be a binding protein based on a modular framework, such as ankyrin repeat proteins, armadillo repeat proteins, leucine rich proteins, tetratriopeptide repeat proteins or Designed Ankyrin Repeat Proteins (DARPins) or proteins based on lipocalin or fibronectin domains or Affilin scaffolds based on s either human gamma crystalline or human ubiquitin. In one embodiment of the invention, the ligand is covalently joined to the GPCR, such as a G-protein or arrestin fusion protein. Some GPCRs (for example thrombin receptor) are cleaved N-terminally by a protease and the new N io terminus binds to the agonist site. Thus, such GPCRs are natural GPCR-ligand fusions. It will be appreciated that the use of antibodies, or other "universal" binding polypeptides (such as G-proteins which are known to couple with many different 15 GPCRs) may be particularly advantageous in the use of the method on "orphan" GPCRs for which the natural ligand, and small molecule ligands, are not known. Once the ligand has been selected, it is then determined whether the or each mutant GPCR has increased stability with respect to binding the selected ligand 20 compared to the parent GPCR with respect to binding that ligand. It will be appreciated that this step (c) is one in which it is determined whether the or each mutant GPCR has an increased stability (compared to its parent) for the particular conformation which is determined by the selected ligand. Thus, the mutant GPCR has increased stability with respect to binding the selected ligand as 25 measured by ligand binding or whilst binding the selected ligand. As is discussed below, it is particularly preferred if the increased stability is assessed whilst binding the selected ligand. The increased stability is conveniently measured by an extended lifetime of the 3o mutant under the imposed conditions which may lead to instability (such as heat, harsh detergent conditions, chaotropic agents and so on). Destabilisation under the imposed condition is typically determined by measuring denaturation or loss 15 of structure. As is discussed below, this may manifest itself by loss of ligand binding ability or loss of secondary or tertiary structure indicators. As is described with respect to Figure 12 below (which depicts a particular, 5 preferred embodiment), there are different assay formats which may be used to determine stability of the mutant GPCR. In one embodiment the mutant GPCR may be brought into contact with a ligand before being subjected to a procedure in which the stability of the mutant is 10 determined (the mutant GPCR and ligand remaining in contact during the test period). Thus, for example, when the method is being used to select for mutant GPCRs which in one conformation bind to a ligand and which have improved thermostablity, the receptor is contacted with the ligand before being heated, and then the amount of ligand bound to the receptor following heating may be used to 15 express thermostability compared to the parent receptor. This provides a measure of the amount of the GPCR which retains ligand binding capacity following exposure to the denaturing conditions (eg heat), which in turn is an indicator of stability. 20 In an alternative (but less preferred) embodiment, the mutant GPCR is subjected to a procedure in which the stability of the mutant is determined before being contacted with the ligand. Thus, for example, when the method is being used to select for mutant membrane receptors which in one conformation bind to a ligand and which have improved thermostability, the receptor is heated first, before !5 being contacted with the ligand, and then the amount of ligand bound to the receptor may be used to express thermostability. Again, this provides a measure of the amount of the GPCR which retains ligand binding capacity following exposure to the denaturing conditions. 0 In both embodiments, it will be appreciated that the comparison of stability of the mutant is made by reference to the parent molecule under the same conditions.
16 It will be appreciated that in both of these embodiments, the mutants that are selected are ones which have increased stability when residing in the particular conformation compared to the parent protein. s The preferred route may be dependent upon the specific GPCR, and will be dependent upon the number of conformations accessible to the protein in the absence of ligand. In the embodiment described in Figure 12, it is preferred if the ligand is present during the beating step because this increases the probability that the desired conformation is selected. 10 From the above, it will be appreciated that the invention includes a method for selecting a mutant GPCR with increased themostability, the method comprising (a) providing one or more mutants of a parent GPCR, (b) selecting an antagonist or an agonist which binds the parent GPCR, (c) determining whether the or each 15 mutant has increased thermostability when in the presence of the said antagonist or agonist by measuring the ability of the mutant GPCR to bind the selected said antagonist or agonist at a particular temperature and after a particular time compared to the parent GPCR and (d) selecting those mutant GPCRs that bind more of the selected said antagonist or agonist at the particular temperature and 20 after the particular time than the parent GPCR under the same conditions. In step (c), a fixed period of time at the particular temperature is typically used in measuring the ability of the mutant GPCR to bind the selected said antagonist or agonist. In step (c), typically a temperature and a time is chosen at which binding of the selected said antagonist or agonist by the parent GPCR is reduced by 50% 25 during the fixed period of time at that temperature (which is indicative that 50% of the receptor is inactivated; "quasi" Tm). Conveniently, when the ligand is used to assay the GPCR (ie used to determine if it is in a non-denatured state), the ligand is detectably labelled, eg radiolabelled or 3D fluorescently labelled. In another embodiment, ligand binding can be assessed by measuring the amount of unbound ligand using a secondary detection system, for example an antibody or other high affinity binding partner covalently linked to a 17 detectable moiety, for example an enzyme which may be used in a colorimetric assay (such as alkaline phosphatase or horseradish peroxidase). FRET methodology may also be used. It will be appreciated that the ligand used to assay the mutant GPCR in determining its stability need not be the same ligand as 5 selected in step (b) of the method. Although it is convenient to measure the stability of the parent and mutant GPCR by using the ability to bind a ligand as an indicator of the presence of a non denatured protein, other methods are known in the art. For example, changes in 10 fluorescence spectra can be a sensitive indicator of unfolding, either by use of intrinsic tryptophan fluorescence or the use of extrinsic fluorescent probes such as 1-anilino-8-napthaleneulfonate (ANS), for example as implemented in the ThermofluorTM method (Mezzasalma et al, J Biomol Screening, 2007, Apr;12(3):418-428). Proteolytic stability, deuterium/hydrogen exchange 15 measured by mass spectrometry, blue native gels, capillary zone electrophoresis, circular dichroism (CD) spectra and light scattering may also be used to measure unfolding by loss of signals associated with secondary or tertiary structure. However, all these methods require the protein to be purified in reasonable quantities before they can be used (eg high pmol/nmol quantities), whereas the 20 method described in the Examples makes use of pmol amounts of essentially unpurified GPCR. In a preferred embodiment, in step (b) two or more ligands of the same class are selected, the presence of each causing the GPCR to reside in the same particular 25 conformation. Thus, in this embodiment one or more ligands (whether natural or non-natural) of the same class (eg full agonist or partial agonist or antagonist or inverse agonist) may be used. Including multiple ligands of the same class in this process, whether in series or in parallel, ninimises the theoretical risk of inadvertently engineering and selecting multiply mutated receptor conformations 30 substantially different to the parent, for example in their binding site, but still able, due to compensatory changes, to bind ligand. The following steps may be used to mitigate this risk: 18 1. Select a chemically distinct set (eg n=2-5) of ligands, in a common pharmacological class as evidenced by for example a binding or functional or spectroscopic assay. These ligands should be thought to bind to a common. spatial 5 region of the receptor, as evidenced for example by competitive binding studies using wild type and/or mutated receptors, and/or by molecular modelling, although they will not necessarily express a common pharmacophore. 2. Make single or multiple receptor mutants intended to increase stability, ]o and assay for tight binding using the full set of ligands. The assays can be parallelised, multiplexed or run in series. 3. Confirm authenticity of stabilised receptor mutant by measurement for example of the binding isotherm for each ligand, and by measurement of the 15 stability shift with ligand (the window should typically be narrowed compared to wild type). In order to guard against changes in apparent affinity caused by perturbations to the binding site upon mutation, preferably ligands of the same pharmacological class, but different chemical class, should be used to profile the receptor. These 20 should typically show similar shifts in affinity (mutant versus parent, e.g. wild type) in spite of having different molecular recognition properties. Binding experiments should preferably be done using labelled ligand within the same pharmacological class. 25 Nonetheless it should be recognised that confonnational substrates may exist that are specific to chemical classes of ligand within the same pharmacological class, and these may be specifically stabilised in the procedure depending on the chemical class of the selected ligand. 30 Typically the selected ligand binds to the mutant GPCR with a similar potency to its binding to the parent GPCR. Typically, the Kd values for the particular ligand binding the mutant GPCR and the parent GPCR are within 5-10 fold of each 19 other, such as within 2-3 fold. Typically, the binding of the ligand to the mutant GPCR compared to the parent GPCR would be not more than 5 times weaker and not more than 10 times stronger. 5 Typically, mutant receptors which have been stabilised in the selected conformation should bind the selected ligand with approximately equal affinity (that is to say typically within 2-3 fold) or greater affinity than does the parent receptor. For agonist-conformation mutants, the mutants typically bind the agonists with the same or higher affinity than the parent GPCR and typically bind 10 antagonists with the same or lower affnity than the parent GPCR. Similarly for antagonist-conformation mutants, the mutants typically bind the antagonists with the same or higher affinity than the parent GPCR and typically bind agonists with the same or lower affinity than the parent GPCR. 15 Mutants that exhibit a significant reduction (typically greater than 2-3 fold) in affinity for the selecting ligand are typically rejected. Typically, the rank order of binding of a set of ligands of the same class are comparable, although there may be one or two reversals in the order, or there may 20 be an out-her from the set. In a further embodiment, two or more ligands that bind simultaneously to the receptor in the same conformation may be used, for example an allosteric modulator and orthosteric agonist. 25 For the avoidance of doubt, and as is evident from the Examples, it is not necessary to use multiple ligands for the method to be effective. In a further embodiment, it may be advantageous to select those mutant GPCRs 30 which, while still being able to bind the selected ligand, are not able to bind, or bind less strongly than the parent GPCR, a second selected ligand which is in a different class to the first ligand. Thus, for example, the mutant GPCR may be 20 one that is selected on the basis that it has increased stability with respect to binding a selected antagonist, but the mutant GPCR so selected is further tested to determine whether it binds to a full agonist (or binds less strongly to a full agonist than its parent GPCR). Mutants are selected which do not bind (or have reduced 5 binding of) the full agonist. In this way, further selection is made of a GPCR which is locked into one particular conformation. It will be appreciated that the selected ligand (with respect to part (b) of the method) and the further (second) ligand as discussed above, may be any pair of 10 ligand classes, for example: antagonist and full agonist; full agonist and antagonist; antagonist and inverse agonist; inverse agonist and antagonist; inverse agonist and full agonist; full agonist and inverse agonist; and so on. It is preferred that the mutant receptor binds the further (second) ligand with an 15 affinity which is less than 50% of the affinity the parent receptor has for the same further (second) ligand, more preferably less than 10% and still more preferably less than 1% or 0.1% or 0.01% of affinity for the parent receptor. Thus, the Kd for the interaction of the second ligand with mutant receptor is higher than for the parent receptor. As is shown in Example 1, the mutant 0-adrenergic receptor 20 pAR-m23 (which was selected by the method of the invention using an antagonist) binds an agonist 3 orders of magnitude more weakly than its parent (ie Kd is 1000 x higher). Similarly, in Example 2, the mutant adenosine A2a receptor Rant2l binds agonist 2-4 orders of magnitude more weakly than its parent. 25 This type of counter selection is useful because it can be used to direct the mutagenesis procedure more specifically (and therefore more rapidly and more efficiently) along a pathway towards a pure conformation as defined by the ligand. 30 Preferably, the mutant-GPCR is provided in.a suitable solubilised forn in which it maintains structural integrity and is in a functional form (eg is able to bind ligand). An appropriate solubilising system, such as a suitable detergent (or other 21 amphipathic agent) and buffer system is used, which may be chosen by the person skilled in the art to be effective for the particular protein. Typical detergents which may be used include, for example, dodecylmaltoside (DDM) or CHAPS or octylglucoside (OG) or many others. It may be convenient to include other s compounds such as cholesterol hemisuccinate or cholesterol itself or heptane-1,2,3-triol. The presence of glycerol or proline or betaine may be useful. It is important that the GPCR, once solubilised from the membrane in which it resides, must be sufficiently stable to be assayed. For some GPCRs, DDM will be sufficient, but glycerol or other polyols may be added to increase stability for 10 assay purposes, if desired. Further stability for assay purposes may be achieved, for example, by solubilising in a mixture of DDM, CHAPS and cholesterol hemisuccinate, optionally in the presence of glycerol. For particularly unstable GPCRs, it may be desirable to solubilise them using digitonin or amphipols or other polymers which can solubilise GPCRs directly from the membrane, in the 5 absence of traditional detergents and maintain stability typically by allowing a significant number of lipids to remain associated with the GPCR. Nanodiscs may also be used for solubilising extremely unstable membrane proteins in a functional form. Typically, the mutant GPCR is provided in a crude extract (eg of the membrane fraction from the host cell in which it has been expressed, such as E. coli). It may be provided in a form in which the mutant protein typically comprises at least 75%, more typically at least 80% or 85% or 90% or 95% or 98% or 99% of the protein present in the sample. Of course, it is typically solubilised as discussed above, and so the mutant GPCR is usually associated with detergent molecules and/or lipid molecules. A mutant GPCR may be selected which has increased stability to any denaturant or denaturing condition such as to any one or more of heat, a detergent, a chaotropic agent or an extreme of pH.
22 In relation to an increased stability to heat (ie thermostability), this can readily be - determined by measuring ligand binding or by using spectroscopic methods such as fluorescence, CD or light scattering at a particular temperature. Typically, when the GPCR binds to a ligand, the ability of the GPCR to bind that ligand at a 5 particular temperature may be used to determine thermostability of the mutant. It may be convenient to determine a "quasi T." ie the temperature at which 50% of the receptor is inactivated under stated conditions after incubation for a given period of time (eg 30 minutes). Mutant GPCRs of higher thermostability have an increased quasi Tin compared to their parents. 10 In relation to an increased stability to a detergent or to a chaotrope, typically the GPCR is incubated for a defimed time in the presence of a test detergent or a test chaotropic agent and the stability is determined using, for example, ligand binding or a spectroscopic method as discussed above. 15 In relation to an extreme of pH, a typical test pH would be chosen (eg in the range 4.5 to 5.5 (low pH) or in the range 8.5 to 9.5 (high pH). Because relatively harsh detergents are used during crystallisation procedures, it 20 is preferred that the mutant GPCR is stable in the presence of such detergents. The order of "harshness" of certain detergents is DDM, Cu I-+ CIO -. C 9
-+C
8 maltoside or glucoside, lauryldimethylamine oxide (LDAO) and SDS. It is particularly preferred if the mutant GPCR is more stable to any of C 9 maltoside or glucoside, C 8 maltoside or glucoside, LDAO and SDS, and so it is preferred that 25 these detergents are used for stability testing. Because of its ease of determination, it is preferred that thermostability is determined, and those mutants which have an increased thermostability compared to the parent protein with respect to the selected condition are chosen. It will be 30 appreciated that heat is acting as the denaturant, and this can readily be removed by cooling the sample, for example by placing on ice. It is believed that thermostability may also be a guide to the stability to other denaturants or 23 denaturing conditions. Thus, increased thermostability is likely to translate into stability in denaturing detergents, especially those that are more denaturing than DDM, eg those detergents with a smaller head group and a shorter alkyl chain and/or with a charged head group. We have found that a thermostable GPCR is 5- also more stable towards harsh detergents. When an extreme of pH is used as the denaturing condition, it will be appreciated that this can be removed quickly by adding a neutralising agent. Similarly, when a chaotrope is used as a denaturant, the denaturing effect can be removed by 10 diluting the sample below the concentration in which the chaotrope exerts its chaotropic effect In a particular embodiment of the invention, the GPCR is p-adrenergic receptor (for example from turkey) and the ligand is dihydroalprenolol (DHA), an 15 antagonist. In a further preferred embodiment of the invention, the GPCR is the adenosine A 2 , receptor (A 2 ,R) (for example, from man) and the ligand is ZM 241385 (4-[2-[[7 amino-2-(2-furyl) [1,2,4] -triazolo[2,3-a,][1,3,S]triazin-5-yl]amino]ethyl]phenol), o an antagonist or NECA (5'-N-ethylcarboxamido adenosine), an agonist. In a stifl further preferred embodiment, the GPCR is the neurotensin receptor (NTR) (for example, from rat) and the ligand is neurotensin, an agonist. 5 A second aspect of the invention provides a method for preparing a mutant GP CR, the method comprising (a) carrying out the method of the first aspect of the invention, (b) identifying the position or positions of the mutated amino acid residue or residues in the mutant GPCR or GPCRs which has been selected for increased stability, and 24 (c) synthesising a mutant GPCR.which contains a mutation at one or more of the positions identified. 5 As can be seen in the Examples, surprisingly, changes to a single amino acid within the GPCR may increase the stability of the protein compared to the parent protein with respect to a particular condition in which the protein resides in a particular conformation. Thus, in one embodiment of the method of the second aspect of the invention, a single amino acid residue of the parent protein is io changed in the mutant protein. Typically, the amino acid residue is changed to the amino acid residue found in the mutant tested in the method of the first aspect of the invention. However, it may be replaced by any other amino acid residue, such as any naturally-occurring amino acid residue (in particular, a "codeable" amino acid residue) or a non-natural amino acid. Generally, for convenience, the 15 amino acid residue is replaced with one of the 19 other codeable amino acids. Preferably, it is the replaced amino acid residue which is present in the mutant selected in the first aspect of the invention. Also as can be seen in the Examples, a further increase in stability may be 20 obtained by replacing more than one of the amino acids of the parent protein. Typically, each of the amino acids replaced is one which has been identified using the method of the first aspect of the invention. Typically, each amino acid identified is replaced by the amino acid present in the mutant protein although, as noted above, it may be replaced with any other amino acid. 25 Typically, the mutant GPCR contains, compared to the parent protein, from 1 to 10 replaced amino acids, preferably from 1 to 8, typically from 2 to 6 such as 2, 3, 4, 5 or 6 replaced amino acids. 30 It will be appreciated that the multiple mutants may be subject to the selection method of the first aspect of the invention. In other words, multiple mutants may be provided in step (a) of the method of the first aspect of the invention. It will be 25 appreciated that by the first and/or second aspect of the invention multiply mutagenised GPCRs may be made, whose conformation has been selected to create a very stable multiple point mutant protein. 5 The mutant GPCRs may be prepared by any suitable method. Conveniently, the mutant protein is encoded by a suitable nucleic acid molecule and expressed in a suitable host cell. Suitable nucleic acid molecules encoding the mutant GPCR may be made using standard cloning techniques, site-directed mutagenesis and PCR as is well lown in the art. Suitable expression systems include constitutive 10 or inducible expression systems in bacteria or yeasts, virus expression systems such as baculovirus, semliki forest virus and lentiviruses, or transient transfection in insect or mammalian cells. Suitable host cells include E. coli, Lactococcus lactis, Saccharomyces cerevisiae, Schizosaccharomyces ponbe, Pichia pastoris, Spodoptera frugiperda and Trichoplusiani cells. Suitable animal host cells 15 include HEK 293, COS, S2, CHO, NSO, DT40 and so on. It is known that some GPCRs require specific lipids (eg cholesterol) to function. In that case, it is desirable to select a host cell which contains the lipid. Additionally or alternatively the lipid may be added during isolation and purification of the mutant protein. It will be appreciated that these expression systems and host cells 20 may also be used in the provision of the mutant GPCR in part (a) of the method of the first aspect of the invention. Molecular biological methods for cloning and engineering genes and cDNAs, for mutating DNA, and for expressing polypeptides from polynucleotides in host 25 cells are well known in the art, as exemplified in "Molecular cloning, a laboratory manual", third edition, Sambrook, J. & Russell, D.W. (eds), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, incorporated herein by reference. In a further embodiment of the first or second aspect of the invention it is 30 detennined whether the selected or prepared mutant GPCR is able to couple to a G protein. It is also preferred if it is determined whether the selected or prepared mutant GPCR is able to bind a plurality of ligands of the same class as the 26 selecting ligand with a comparable spread and/or rank order of affinity as the parent GPCR. A third aspect of the invention provides a mutant GPCR prepared by the method 5 of the second aspect of the invention. The invention includes mutant GPCRs with increased stability compared to their parent GPCRs, particularly those with increased thermostability. 10 The invention includes an isolated mutant G-protein coupled receptor (GPCR) with increased conformational stability compared to its parent GPCR, which mutant GPCR has an extended lifetime of a particular conformation relative to the same conformation of its parent GPCR under denaturing conditions, the extended lifetime being manifest by retention of ligand binding ability; wherein the mutant 15 GPCR is not a human cannabinoid receptor. Mutant 1-adrenergic receptor p-adrenergic receptors are well known in the art. They share sequence homology 20 to each other and bind to adrenalin. A fourth aspect of the invention provides a mutant p-adrenergic receptor which, when compared to the corresponding wild-type p-adrenergic receptor, has a different amino acid at a position which corresponds to any one or more of the 25 following positions according to the numbering of the turkey p-adrenergic receptor as set out in Figure 9: Ile 55, Gly 67, Arg 68, Val 89, Met 90, Gly 98, Ile 129, Ser 151, Val 160, Gln 194, Gly 197, Leu 221, Tyr 227, Arg 229, Val 230, Ala 234, Ala 282, Asp 322, Phe 327, Ala 334, Phe 338. 30 The mutant p-adrenergic receptor may be a mutant of any p-adrenergic receptor provided that it is mutated at one or more of the amino acid positions as stated by reference to the given turkey p-adrenergic receptor amino acid sequence.
26A It is particularly preferred if the mutant GPCR is one which has at least 20% amino acid sequence identity when compared to the given turkey p-adrenergic receptor sequence, as determined using MacVector and CLUSTALW (Thompson et al (1994) Nucl. Acids Res. 22, 4673-4680). More preferably, the mutant 5 receptor has at least 30% or at least 40% or at least 50% amino acid sequence identity. There is generally a higher degree of amino acid sequence identity 27 which is conserved around the orthosteric ("active") site to which the natural ligand binds. As is described in Example 1 and Figure 1 below, individual replacement of the 5 following amino acid residues in the parent turkey P-adrenergic sequence (as shown in Figure 9) lead to an increase in thermostability: Ile 55, Gly 67, Arg 68, Val 89, Met 90, Gly 98, Be 129, Ser 151, Val 160, Gln 194, Gly 197, Leu 221, Tyr 227, Arg 229, Val 230, Ala 234, Ala 282, Asp 322, Phe 327, Ala 334, Phe 338. 10 Thus, the invention includes mutant turkey p-adrenergic receptors in which, compared to its parent, one or more of these amino acid residues have been replaced by another amino acid residue. The invention also includes mutant -adrenergic receptors from other sources in which one or more corresponding 15 amino acids in the parent receptor are replaced by another amino acid residue. For the avoidance of doubt, the parent may be a p-adrenergic receptor which has a naturally-occurring sequence, or it may be a truncated fonn or it may be a fusion, either to the naturally occurring protein or to a fragment thereof; or it may contain mutations compared to the naturally-occurring sequenced provided that it retains 20 ligand-binding ability. By "corresponding amino acid residue" we include the meaning of the amino acid residue in another P-adrenergic receptor which aligns to the given amino acid residue in turkey p-adrenergic receptor when the turkey P-adrenergic receptor and 25 the other P-adrenergic receptor are compared using MacVector and CLUSTALW. Figure 9 shows an alignment between turkey -adrenergic receptor and human P1, P2 and P3 P-adrenergic receptors. 30 It can be seen thai le 72 of human Pl corresponds to Be 55 of turkey p-adrenergic receptor; fle 47 of human P2 corresponds to le 55 of turkey 28 P-adrenergic receptor; and Thr5l of human P3 corresponds to le 55 of turkey 0-adrenergic receptor. Other corresponding amino acid residues in human p1, p2 and P3 can readily be identified by reference to Figure 9. 5 It is preferred that the particular amino acid is replaced with an Ala. However, when the particular amino acid residue is an Ala, it is preferred that it is replaced with a Len (for example, see turkey 0-adrenergic Ala 234, Ala 282 and Ala 334 in Figure 1). o It is preferred if the mutant p-adrenergic receptor has a different amino acid compared to its parent at more than one amino acid position since this is likely to give greater stability. Particularly preferred human P31 receptor mutants are those in which one or more of the following amino acid residues are replaced with another amino acid residue: K85, M107, Y244, A316, F361 and F372. 5 Typically, the given amino acid residue is replaced with Ala or Val or Met or Leu or lie (unless they are already that residue). Mutant human P1 receptors which have combinations of 3 or 4 or 5 or 6 mutations as described above are prepared. .0 Particularly preferred human P2 receptor mutants are those in which one or more of the following amino acids are replaced with another amino acid residue: K60, M82, Y219, C265, L310 and F321. Typically, the given amino acid residue is replaced with Ala or Val or Met or Leu or Ile (unless they are already that 5 residue). Mutant human 02 receptors which have combinations of 3 or 4 or 5 or 6 mutations as described above are preferred. Figure 26 shows the effect on thermostability when six thermostabilising o mutations in 01-m23 (R68S, M90V, Y227A, A282L, F327A, F338M) were transferred directly to the human -p2 receptor (equivalent mutations K60S, M82V, Y219A, C265L, L310A, F321M), making human P2-m23. The Tmns for human 29 02 and 02-m23 were 29 0 C and 41*C respectively, thus exemplifying the transferability of thermostabilising mutations from one receptor to another receptor. Accordingly, a particularly preferred human $2 receptor mutant is one which comprises the mutations K60S, M82V, Y219A, C265L, L31OA, F321M. 5 Particularly preferred human f3 receptor mutants are those in which one or more of the following amino acids are replaced with another amino acid residue: W64, M86, Y224, P284, A330 and F341. Typically, the given amino acid residue is replaced with Ala or Val or Met or Leu or fle (unless they are already that 10 residue). Mutant human 03 receptors which have combinations of 3 or 4 or 5 or 6 mutations as described above are preferred. 15 Particularly preferred combinations of mutations are described in detail in Tables 1 and 2 in Example 1, and the invention includes the mutant turkey p-adrenergic receptors, and also includes mutant p-adrenergic receptors where amino acids in corresponding position have been replaced by another amino acid, typically the same amino acid as indicated in Tables 1 and 2 in Example 1. 20 Particularly preferred mutants are those which contain mutations in the amino acids which correspond to the given amino acid residue by reference to turkey p adrenergic receptor: (R68S, Y227A, A282L, A334L) (see m6-10 in Table 2 below); (M90V, Y227A, F338M) (see m7-7 in Table 2 below); (R68S, M90V, 25 V230A, F327A, A334L) (see mlO-8 in Table 2 below); and (R68S, M90V, Y227A, A282L, F327A, F338M) (see m23 in Table 2 below). Mutant adenosine receptor 30 Adenosine receptors are well known in the art. They share sequence homology to each other and bind to adenosine.
30 A fifth aspect of the invention provides a mutant adenosine receptor which, when compared to the corresponding wild-type adenosine, has a different amino acid at a position which corresponds to any one or more of the following positions according to the numbering of the human adenosine A2a receptor as set out in 5 Figure 10: Gly 114, Gly 118, Leu 167, Ala 184, Arg 199, Ala 203, Leu 208, Gln 210, Ser 213, Glu 219, Arg 220, Ser 223, Thr 224, Gln 226, Lys 227, His 230, Leu 241, Pro 260, Ser 263, Leu 267, Leu 272, Thr 279, Asn 284, Gln 311, Pro 313, Lys 315, Ala 54, Val 57, His 75, Thr 88, Gly 114, Gly 118, Thr 119, Lys 122, Gly 123, Pro 149, Glu 151, Gly 152, Ala 203, Ala 204, Ala 231, Leu 235, 10 Va1 239. The mutant adenosine receptor may be a mutant of any adenosine receptor provided that it is mutated at one or more of the amino acid positions as stated by reference to the given human adenosine Au receptor amino acid sequence. 15 It is particularly preferred if the mutant GPCR is one which has at least 20% amino acid sequence identity when compared to the given human adenosine A2. receptor sequence, as determined using MacVector and CLUSTALW. Preferably, the mutant GPCR has at least 30% or at least 40% or at least 50% or at least 60% 20 sequence identity. Typically, there is a higher degree of sequence conservation at the adenosine binding site. As is described in Example 2 below, individual replacement of the following amino acid residues in the human adenosine A2, receptor sequence (as shown in 25 Figure 10) lead to an increase in thermostability when measured with the agonist 5'-N-ethylcarboxamidoadenosine (NECA): Gly 114, Gly 118, Leu 167, Ala 184, Arg 199, Ala 203, Leu 208, Gln 210, Ser 213, Glu 219, Arg 220, Ser 223, Thbr 224, Gln 226, Lys 227, His 230, Leu 241, io Pro 260, Ser 263, Leu 267, Leu 272, Thr 279, Asn 284, Gln 311, Pro 313, Lys 315.
31 Replacement of the following amino acid residues in the human A,. receptor sequence (as shown in Figure 10) lead to an increase in thermostability when measured with the antagonist ZM 241385 (4-[2-[[7-amino-2-(2-fury1) [1,2,4] triazolo[2,3-a][1,3,5]triazin-5-yl]amino]ethylphenol): 5 Ala 54, Val 57, His 75, Thr 88, Gly 114, Gly 118, Thr 119, Lys 122, Gly 123, Pro 149, Glu 151, Gly 152, Ala 203, Ala 204, Ala 231, Leu 235, Val 239. Thus, the invention includes mutant human adenosine A 2 , receptors in which, .0 compared to its parent, one or more of these amino acid residues have been replaced by another amino acid residue. The invention also includes mutant adenosine receptors from other sources in which one or more corresponding amino acids in the parent receptor are replaced by another amino acid residue. For the avoidance of doubt, the parent may be an adenosine receptor which has a 5 naturally-occurring sequence, or it may be a truncated form or it may be a fusion, either to the naturally-occurring protein or to a fragment thereof, or it may contain mutations compared to the naturally--occurring sequence, provided that it retains ligand-binding ability. o By "corresponding amino acid residue" we include the meaning of the amino acid residue in another adenosine receptor which aligns to the given amino acid residue in human adenosine A 2 . receptor when the human adenosine A 2 receptor and the other adenosine receptor are compared using MacVector and CLUSTALW. Figure 10 shows an alignment between human adenosine A 2 8 receptor and three other human adenosine receptors (A2b, A3 and A1). It can be seen that, for example, Ser 115 in the Agb receptor (indicated as AA2BR) corresponds to Gly 11.4 in the A 2 , receptor. Similarly, it can be seen that Ala 60 in the A 3 receptor (indicated as AA3R) corresponds to Ala 54 in the A 2 , receptor, 32 and so on. Other corresponding amino acid residues in human adenosine receptors Aab, A 3 and A, can readily be identified by reference to Figure 10. It is preferred that the particular amino acid in the parent is replaced with an Ala. 5 However, when the particular amino acid residue in the parent is an Ala, it is preferred that it is replaced with a Leu. It is preferred that the mutant adenosine receptor has a different amino acid compared to its parent at more than one amino acid position. Particularly 10 preferred human adenosine A2b receptors are those in which one or more of the following amino acid residues are replaced with another amino acid residue: A55, T89, R123, L236 and V240. Typically, the given amino acid residue is replaced with Ala or Val or Met or Leu or Ile (unless they are already that residue). 15 Mutant human adenosine A2b receptors which have combinations of 3 or 4 or 5 mutations as described above are preferred. Particularly preferred human adenosine A3 receptors are those in which one or 20 more of the following amino acid residues are replaced with another amino acid residue: A60, T94, W128, L232 and L236. Typically, the given amino acid residue is replaced with Ala or Val or Met or Leu or Ie (unless they are already that residue). 25 Mutant human adenosine A3 receptors which have combinations of 3 or 4 or 5 mutations as described above are preferred. Particular preferred human adenosine Al receptors are those in which one or more of the following residues are replaced: A57, T91, A125, L236, and L240. 30 Typically, the given amino acid residue is replaced with Ala or Val or Met or Leu or Ile (unless they are already that residue).
Particularly preferred combinations of mutations are described in detail in Example 2. The invention includes these mutant human adenosine Ala receptors, and also includes other mutant adenosine receptors where amino acids in corresponding positions have been replaced by another amino acid, typically the. 5 same amino acid as indicated in Example 2. Particularly preferred adenosine receptor mutants are those which contain mutations in the amino acids which correspond to the given amino residue by reference to human adenosine A2a receptor: (A54L, K122A, L235A) (Rant 17); 10 (A54L, T88A, V239A, A204L) (Rant 19); and (A54L, T88A, V239A, K122A) (Rant 21). Mutant neurotensin receptor 15 Neurotensin receptors are known in the art. They share sequence homology and bind neurotensin. A sixth aspect of the invention provides a mutant neurotensin receptor which, when compared to the corresponding wild-type neurotensin receptor, has a 20 different amino acid at a position which corresponds to any one or more of the following positions according to the numbering of the rat neurotensin receptor as set out in Figure 11: Ala 69, Leu 72, Ala 73, Ala 86, Ala 90, Ser 100, His 103, Ser 108, Leu 109, Leu 111, Asp 113, Ile 116, Ala 120, Asp 139, Phe 147, Ala 155, Val 165, Glu 166, Lys 176, Ala 177, Thr 179, Met 181, Ser 182, Arg 183, 25 Phe 189, Leu 205, Thr 207, Gly 209, Gly 215, Val 229, Met 250, fle 253, Leu 256, Ile 260, Asn 262, Val 268, Asn 270, Thr 279, Met 293, Thr 294, Gly 306, Leu 308, Val 309, Leu 310, Val 313, Phe 342, Asp 345, Tyr 349, Tyr 351, Ala 356, Phe 358, Val 360, Ser 362, Asn 370, Ser 373, Phe 380, Ala 385, Cys 386, Pro 389, Gly 390, Trp 391, Arg 392, His 393, Arg 395, Lys 397, Pro 399. 30 It is particularly preferred if the mutant GPCR is one which has at least 20% amino acid sequence identity when compared to the given rat neurotensin receptor 34 sequence, as determined using MacVector and CLUSTALW. Preferably, the mutant GPCR has at least 30% or at least 40% or at least 50% amino acid sequence identity. 5 The mutant neurotensin receptor may be a mutant of any neurotensin receptor provided that it is mutated at one or more of the amino acid positions as stated by reference to the given rat neurotensin receptor amino acid sequence. As is described in Example 3 below, individual replacement of the following 1o amino acid residues in the rat neurotensin receptor sequence (as shown in Figures 11 and 28) lead to an increase in thermostability when considered with respect to the absence of neurotensin. Leu 72, Ala 86, Ala 90, Ser 100, His 103, Ser 108, Leu 109, Leu 111, Asp 113, Ile 116, Ala 120, Asp 139, Phe 147, Ala 155, Lys 176, Thr 179, Met 181, Ser 182, Phe 189, Leu 205, Thr 207, Gly 209, Gly 215, 15 Leu 256, Asn 262, Val 268, Met 293, Asp 345, Tyr 349, Tyr 351, Ala 356, Phe 358, Ser 362, Ala 385, Cys 386, Trp 391, Arg 392, His 393, Lys 397, Pro 399. As is described in Example 3 below, individual replacement of the following amino acid residues in the rat neurotensin receptor sequence (as shown in Figures 20 11 and 28) lead to an increase in thermostability when considered with respect to the presence of neurotensin. Ala 69, Ala 73, Ala 86, Ala 90, His 103, Val 165, Glu 166, Ala 177, Arg 183, - ' Val 229, Met 250, Ile 253, Ie 260, Thr 279, Thr 294, Gly 306, Leu 308, Val 309, Leu 310, Val 313, Phe 342, Phe 358, Val 360, Ser 362, Asn 370, Ser 373, Phe 380, Ala 385, Pro 389, Gly 390, Arg 395. 25 Thus, the invention includes mutant rat neurotensin receptor in which, compared to its parent, one or more of these amino acid residues have been replaced by another amino acid residue. The. invention also includes mutant neurotensin receptors from other sources in which one or more corresponding amino acids in 30 the parent receptor are replaced by another amino acid residue. For the avoidance of doubt the parent may be a neurotensin receptor which has a naturally-occurring sequence, or it may be a truncated form or it may be a fusion, either to the 35 naturally-occurring protein or to a fragment thereof, or it may contain mutations compared to the naturally-occurring sequence, providing that it retains ligand binding ability. 5 By "corresponding amino acid residue" we include the meaning of the amino acid residue in another neurotensin receptor which aligns to the given amino acid residue in rat neurotensin receptor when the rat neurotensin receptor and the other neurotensin receptor are compared using MacVector and CLUSTALW. 10 Figure 11 shows an alignment between rat neurotensin receptor and two human neurotensin receptors 1 and 2. It can be seen, for example, that Ala 85 of the human neurotensin receptor 1 corresponds to Ala 86 of the rat neurotensin receptor, that Phe 353 of the human neurotensin receptor 1 corresponds to Phe 358 of the rat neurotensin receptor, and so on. Other corresponding amino acid 15 residue in the human neurotensin receptors I and 2 can readily be identified by reference to Figure 11. It is preferred that the particular amino acid in the parent is replaced with an Ala. However, when the particular amino acid residue in the parent is an Ala, it is 20 preferred that it is replaced with a Len. It is preferred that the mutant neurotensin receptor has a different amino acid compared to its parent at more than one amino- acid position. Particularly preferred human neurotensin receptors (NTR1) are those in which one or more of 25 the following amino acid residues are replaced with another amino acid residue: Ala 85, His 102, Ile 259, Phe 337 and Phe 353. Typically, the given amino acid residues is replaced with Ala or Val or Met or Leu or Ile (unless they are already that residue). 30 Mutant human neurotensin receptors (NTRI) which have combinations of 3 or 4 or 5 mutations as described above are preferred.
36 Particularly preferred human neurotensin receptors (NTR2) are those in which one or more of the following amino acid residues are replaced with another amino acid residue: V54, R69; T229, P331 and F347. Typically, the given amino acid residue is replaced with Ala or Val or Met or Leu or Ie (unless they are already 5 that residue). Mutant human neurotensin receptors (NTR2) which have combinations of 3 or 4 or 5 mutations as described above are preferred. Particularly preferred combinations of mutations are described in detail in Example 3. The invention includes these mutant rat neurotensin receptors, and 10 also includes other mutant neurotensin receptors where amino acids in corresponding positions have been replaced by another amino acid, typically the same amino acid as indicated in Example 3. Particularly preferred neurotensin receptor mutants are those which contain 15 mutations in the amino acid residues which correspond to the given amino acid residue by reference to the rat neurotensin receptor: (F358A, A86L, I260A, F342A) (Nag7m); (F358A, H103A, 1260A, F342A).(Nag7n). Mutant muscarinic receptor 20 Muscarinic receptors are known in the art. They share sequence homology and bind muscarine. A seventh aspect of the invention provides a mutant muscarinic receptor which, 25 when compared to the corresponding wild-type muscarinic receptor, has a different amino acid at a position which corresponds to any one or more of the following positions according to the numbering of the human muscarinic receptor M1 as set out in Figure 17: Leu 65, Met 145, Leu 399, Ile 383 and Met 384. 30 It is particularly preferred if the mutant GPCR is one which has at least 20% amino acid sequence identity when compared to the given human muscarinic receptor sequence, as determined using MacVector and CLUSTALW. Preferably, 37 the mutant GPCR has at least 30% or at least 40% or at least 50% amino acid sequence identity. The mutant muscarinic receptor may be a mutant of any muscarinic receptor 5 provided that it is mutated at one or more of the amino acid positions as stated by reference to the given muscarinic receptor amino acid sequence. Thus, the invention includes a mutant human muscarinic receptor in which, compared to its parent, one or more of these amino acid residues have been 10 replaced by another amino acid residue. The invention also includes mutant muscarinic receptors from other sources in which one or more corresponding amino acids in the parent receptor are replaced by another amino acid residue. For the avoidance of doubt the parent may be a muscarinic receptor which has a naturally-occurring sequence, or it may be a truncated form or it may be a fusion, 15 either to the naturally-occurring protein or to a fragment thereof, or it may contain mutations compared to the naturally-occurring sequence, providing that it retains ligand-binding ability. By "corresponding amino acid residue" we include the meaning of the amino acid 20 residue in another muscarinic receptor which aligns to the given amino acid residue in human muscarinic receptor when the human muscarinic receptor and the other muscarinic receptor are compared using MacVector and CLUSTALW. It is preferred that the particular amino acid is replaced with an Ala. However, 25 when the particular amino acid residue is an Ala, it is preferred that it is replaced with a Leu. As shown in Examples 1-3 and described above, we have identified thermostabilising mutations scattered widely throughout the sequences of the 30 turkey betal adrenergic receptor, human adenosine receptor, rat neurotensin receptor and human muscarinic receptor. Figure 17 provides an alignment of these sequences with the sequence of the human beta-2AR such that when the 38 thermostabilising mutations are positioned onto the sequences then, in 11 instances out of a total of 70, two sequences contain mutations at the same position (denoted in Figure 17 with a star). Thus it will be appreciated that once one or more stabilising mutations have been identified in one GPCR, a further 5 GPCR with increased stability can be generated by aligning the amino acid sequences of the GPCRs and making the .same one or more mutations at the corresponding position or positions. This concept is clearly exemplified in Figure 26 wherein the six thermostabilising mutations in turkey $1-m23 were transferred directly to the human $2 receptor. The resultant mutant, 02-m23, had a Tm .12"i 10 higher than that of the human 02 receptor. ( Accordingly, an eighth aspect of the invention provides a method for producing a mutant GPCR with increased stability relative to its parent GPCR, the method comprising: 15 (i) identifying in the amino acid sequence of one or more mutants of a first parent GPCR with increased stability relative to the first parent GPCR, the position or positions at which the one or more mutants have at least one different amino acid residue compared to 20 the first parent GPCR, and (ii) making one or more mutations in the amino acid sequence that defines a second GPCR at the corresponding position or positions, to provide one or more mutants of a second parent GPCR with 25 increased stability relative to the second parent GPCR. The one or more mutants of a first parent GPCR may be selected or prepared according to the methods of the first or second aspects of the invention. Accordingly, it will be appreciated that the one or more mutants of a first parent 30 GPCR may be any of the mutants of the third, fourth, fifth, sixth or .seventh. aspects of the invention. Hence, the method of the eighth aspect of the invention may be used to create stable, conformationally locked GPCRs by mutagenesis.
39 For example, following the selection of mutant GPCRs which have increased stability in a particular conformation, the stabilising mutation can be identified and the amino acid at a corresponding position in a second GPCR replaced to produce a mutant GPCR with increased stability in a particular conformation 5 relative to a second parent GPCR. For the avoidance of doubt the first parent GPCR may be a GPCR which has a naturally-occurring sequence, or it may be a truncated form or it may be a fusion, either to the naturally-occurring protein or to a fragment thereof or it may contain 10 mutations compared to the naturally-occurring sequence, providing that it retains ligand-binding ability. Typically, identifying the position or positions at which the one or more mutants have at least one different amino acid residue compared to the first parent GPCR 15 involves aligning their amino acid sequences with that of the parent GPCR, for example using the Clustal W program (Thompson et al., 1994). By "corresponding position or positions", we include the meaning of the position in the amino acid sequence of a second GPCR which aligns to the position in the 20 amino acid sequence of the first GPCR, when the first and second GPCRs are compared by alignment, for example by using MacVector and Clustal W. For example, as shown in the alignment in Figure 17, the six stabilising mutations in turkey $1-m23, R68S, M90V, Y227A, A282L, F327A and F338M, are at positions which correspond to residues K60, M82, Y219, C265, L310 and F321 25 respectively in the human P2 receptor. Having identified the corresponding position or positions in the amino acid sequence of a second GPCR, the amino acids at those positions are replaced with another amino acid. Typically, the amino acids are replaced with the same amino 30 acids which replaced the amino acids at the corresponding positions in the mutant of the first parent GPCR (unless they are already that residue). For example, at position 68 in turkey p1-m23 (R68S), an arginine residue was replaced with a 40 serine residue. Therefore, at the corresponding position in the human p2 receptor, position 60 (K60), the lysine residue is preferably replaced with a serine residue. Mutations can be made in an amino acid sequence, for example, as described 5 above and using techniques well-established in the art. It will be appreciated that the second GPCR may be any other GPCR. For example, stabilising mutations in a GPCR from one species may be transferred to a second GPCR from another species. Similarly, stabilising mutations in one 10 particular GPCR isoform may be transferred to a second GPCR which is a different isoform. Preferably, the second parent GPCR is of the same GPCR class or family as the first parent GPCR. Phylogenetic analyses have divided GPCRs into three main classes based on protein sequence similarity, i.e., classes 1, 2, and 3 whose prototypes are rhodopsin, the secretin receptor, and the metabotropic 15 glutamate receptors, respectively (Foord et al (2005) Pharmacol. Rev. 57, 279 288). Thus, the second GPCR may be a GPCR which is of the same GPCR class as the first parent GPCR. Similarly, GPCRs have been divided into families by reference to natural ligands such as glutamate and GABA. Thus, the second GPCR may be of the same GPCR family as the first parent GPCR. A list of 20 GPCR classes and families has been produced by the International Union of Pharmacology (Foord et al (2005) Pharmacol. Rev. 57, 279-288) and this list is periodically updated at http://www.iuphar db.org/GPCR/ReceptorFamiliesForward. 25 It will be appreciated that the second parent GPCR must be able to be aligned with the first parent GPCR such that the corresponding positions of the mutations in the first GPCR can be determined in the second GPCR. Thus typically, the second parent GPCR has at least 20% sequence identity to the first parent GPCR and more preferably at least 30%, 40%, 50%, 60%, 70%, 80% or 90% sequence 30 identity to the first parent GPCR. However, some GPCRs have low sequence identity (e.g. family B and C GPCRs) and at the same time are very similar in structure. Thus the 20% sequence identity threshold is not absolute.
41 The inventors have reasoned that the identification of structural motifs in which the one or more mutations in a mutant GPCR with increased stability reside, will be useful in producing further mutant GPCRs with increased stability. 5 Accordingly, a ninth aspect of the invention provides a method for producing a mutant G-protein coupled receptor (GPCR) with increased stability relative to its parent GPCR, the method comprising: 10- (i) providing one or more mutants of a first parent GPCR with increased stability relative to the first parent GPCR (ii) identifying in a structural membrane protein model the structural motif or motifs in which the one or more mutants 15 have at least one different amino acid residue compared to the first parent GPCR, and (iii) making one or more mutations in the amino acid sequence that defines a corresponding structural motif or motifs in a second 20 parent GPCR, to provide one or more mutants of a second parent GPCR with increased stability relative to the second parent GPCR. Mapping stabilising mutations onto one or more known structural models can be 25 used to identify particular structural motifs in which such stabilising mutations reside. We have mapped stabilising mutations of the p1-adrenergic receptor onto structural models of the p2-adrenergic receptor (Rasmussen et al (2007) Nature 450, 383-387; Cherezov et al (2007) Science 318:1258-65; Rosenbaum et al (2007) Science 318:1266-1273) in order to identify such motifs. For example, 30 Table (vi) lists the turkey 1-adrenergic receptor mutations which we have mapped onto the human 2-adrenergic receptor and describes the corresponding structural motifs in which they reside. As discussed in Example 4, mapping of the 42 Y227A mutation (equivalent to Y219 in the human $2 receptor) onto the human p 2 -adrenergic receptor reveals its position at the interface between helices such that the mutation may improve packing at the helical interface (see Figures 15, 16 and 23). Similarly, mapping of the M90V mutation (equivalent to M82 in the 5 human P2 receptor) onto the human 2 -adrenergic receptor reveals it to be in helix 2 at a point where the helix is kinked (see Figures 15, 16 and 20). Other mutations were found to reside in further structural motifs including transmembrane helix surfaces pointing into the lipid bilayer, hydrophobic-hydrophilic boundary regions, protein binding pockets and loop regions (see Table (vi) and Figures 18 10 19,21-22 and 24-25). Such structural motifs, by virtue of them containing stabilising mutations, are important in determining protein stability. Therefore, targeting mutations to these motifs will facilitate the generation of stabilised mutant GPCRs. Indeed, there 15 were several instances where more than one mutation mapped to the same structural motif. For example, the Y227A, V230A and A234L mutations in the turkey p1 adrenergic receptor mapped to the same helical interface, the V89L and M90V mutations mapped to the same helical kink and the F327A and A334L mutations mapped to the same helical surface pointing towards the lipid bilayer 20 (Table (vi)). Thus, when one stabilising mutation has been identified, the determination of the structural motif in which that mutation is located will enable the identification of further stabilising mutations. In an embodiment of the ninth aspect of the invention, the one or more mutants of 25 a first parent GPCR are selected or prepared according to the methods of the first, second or eighth aspects of the invention. Accordingly, it will be appreciated that the one or more mutants of a first parent GPCR may be any of the mutants of the third, fourth, fifth, sixth oi- seventh aspects of the invention. Hence, the method of the ninth aspect of the invention may also be used to create stable, 30 conformationally locked GPCRs by mutagenesis. For example, following the selection of mutant GPCRs which have increased stability in a particular conformation, the structural motifs in which such stabilising mutations reside can 43 be identified. Making one or more mutations in the amino acid sequence that defines the corresponding structural motif in another GPCR can then be used to produce a mutant GPCR with increased..stability in a particular conformation relative to its parent GPCR. We have performed a multiple sequence alignment of the human beta-2AR, rat NTR1, turkey beta-i AR, human Adenosine A2aR and human muscarinic M1 receptor amino acid sequences (Figure 17) which shows that, when the thermostabilising mutations identified (see Examples 1-3) are positioned on the 10 sequences then, in 11 instances out of a total of 70, two sequences contain mutations at the same position (denoted in Figure 17 with a star). Without wishing to be bound by any theory, the inventors believe that thermostabilising mutations at these positions should be of enhanced transferability for mapping onto a structural membrane protein model. Thus in one embodiment, the mutant 15 of the first parent GPCR is a mutant human beta-2AR, rat NTR1, turkey beta-1 AR, human Adenosine A2aR or human muscarinic Ml receptor which, when compared to its corresponding parent receptor, has a different amino acid at a position which corresponds to any one or more of the following positions according to the numbering of the human beta2 AR as set out in Figure 17: Ala 20 59, Val 81, Ser 143, Lys 147, Val 152, Glu 180, Val 222, Ala 226, Ala 271, Leu 275 and Val 317. In order to identify the structural motif or motifs, the stabilising mutations are mapped onto a known structure of a membrane protein. 25 By "membrane protein" we mean a protein that is attached to or associated with a membrane of a cell or organelle. Preferably, the membrane protein is an integral membrane protein that is permanently integrated into the membrane and can only be removed using detergents, non-polar solvents or denaturing agents that 30 physically disrupt the lipid bilayer.
44 The structural model of a membrane protein may be any suitable structural model. For example, the model may be a known crystal structure. Examples of GPCR crystal structures include bovine rhodopsin (Palczewski, K. et al., Science 289, 739-745. (2000)) and human P2 adrenergic receptor (Rasmussen et al, Nature 450, 5 383-7 (2007); Cberezov et al (2007) Science 318:1258-65; Rosenbaum et al (2007) Science 318:1266-1273). -The coordinates for the human p2 adrenergic receptor structure can be found in the RCSB Protein Data Bank under accession codes: 2rhl, 2r4r and 2r4s. Alternatively, the structural model may be a computer generated model based upon homology or using de novo structure prediction 10 methods (Qian et at Nature (2007) 450: 259-64). It will be appreciated that stabilising mutations of a given mutant GPCR can be mapped onto a structural model of any membrane protein which has sufficient structural similarity to the GPCR. In particular, the domain of the membrane is protein must have sufficient structural similarity to the GPCR domain in which the stabilising mutation resides, for a given mutation to be transferable. A protein domain is typically defined as a discretely folded assembly of secondary structure elements which may stand alone as a single protein or be part 20 of a larger protein in combination with other domains. It is commonly a functional evolutionary unit. GPCRs are essentially single domain proteins excluding those with large N tenninal domains. Therefore, typically, the structural model is of a membrane 25 protein which comprises at least one domain that has sufficient structural similarity to the GPCR. Structural similarity can be determined indirectly by the analysis of sequence identity, or directly by comparison of structures. 30 With regard to sequence identity, the amino acid sequence encoding the GPCR domain in which the mutant has at least one different amino acid residue 45 compared to the first parent GPCR, is aligned with an amino acid sequence encoding a domain of a membrane protein for which a structural model is available. It will be appreciated that one or more of these sequences may contain an inserted sequence or N-terminal or C-terminal extensions which are additional 5 to the core conserved domain. For optimal alignment, such sequences are removed so as not to skew the analysis. Membrane proteins with sufficient sequence identity across the domain in question may then be used as the structural model for mapping mutations. It has been shown for soluble protein domains that their 3D structure is broadly conserved above 20% sequence identity and well 10 conserved above 30% identity, with the level of structural conservation increasing as sequence identity increases up to 100% (Ginalski,K. Curr Op Struc Biol (2006) 16,172-177). Thus, it is preferred if the structural membrane protein model is a model of a membrane protein which contains a domain that shares at least 20% sequence identity with the mutant GPCR domain containing the at least one 15 different amino acid residue compared to the first parent GPCR, and more preferably at least 30%, 40%, 50%, 60%, 70%, 80% or 90% sequence identity, and yet more preferably at least 95% or 99% sequence identity. Sequence identity may be measured by the use of algorithms such as BLAST or 20 PSI-BLAST (Altschul et al, NAR (1997), 25, 3389-3402) or methods based on Hidden Markov Models (Eddy S et al, J Comput Biol (1995) Spring 2 (1) 9-23). Typically, the percent sequence identity between two polypeptides may be determined using any suitable computer program, for example the GAP program of the University of Wisconsin Genetic Computing Group and it will be appreciated 25 that percent identity is calculated in relation to polypeptides whose sequence has been aligned optimally. The alignment may alternatively be carried out using the Clustal W program (Thompson et al., 1994). The parameters used may be as follows: Fast pairwise alignment parameters: K-tuple(word) size; 1, window size; 5, gap penalty; 3, number of top diagonals; 5. Scoring method: x percent. Multiple 30 alignment parameters: gap open penalty; 10, gap extension penalty; 0.05. Scoring matrix: BLOSUM.
46 In addition to sequence identity, structural similarity can be determined directly by comparison of structural models. Structural models may be used to detect regions of structural sinilarity not evident from sequence analysis alone, and which may or may not be contiguous in the sequence. For example, family B and C GPCRs 5 are thought to share similar structures; however, their sequence identity is very low. Similarly, the water transporting aquaporins spinach SoPip2, E. coli AqpZ , Methanococcus AqpM, rat Aqp4, human Aqp1 and sheep Aqp0 share low sequence identity but all have similar structures. 10 Structural models of high fidelity may be constructed for proteins of unknown structure using standard software packages such as MODELLER (Sali A and Blundell T, J Mol Biol (1993) 234(3) 779-815), wherein the structure is modelled on a known structure of a homologous protein. Such modelling improves with increasing sequence identity. Typically, the sequence identity between the 15 sequence of unknown structure and a sequence of known 3D structure is more than 30% (Ginalski,K. Curr Op Struc Biol (2006) 16, 172-177). In addition, de novo structure prediction methods based on sequence alone may be used to model proteins of unknown structure (Qian et al, (2007) Nature 450:259-64). Once structures have been experimentally determined or derived by modelling, regions 20 of structural similarity may be detected by direct comparison of two or more 3D structures. They may, for example, comprise secondary structure elements of a particular architecture and topology which can be detected by the use of software such as DALI (Holm, L and Sander, C (1996) Science 273, 595-603). They may comprise local arrangements of amino acid side chains and the polypeptide 25 backbone, or specific sits of atoms or groups of atoms in a particular spatial . arrangement, which may for example also be detected by the use of graph theoretical representations (ArtymiukP et al, (2005) 1 Amer Soc Info Sci Tech 56 (5) 518-528). In this approach, the atoms or groups of atoms within the proteins or regions of proteins to be compared are typically represented as the nodes of a 30 graph, with the edges of the graph describing the angles and distances between the nodes. Common pattems in these graphs indicate common structural motifs. This approach may be extended to include any descriptor of atoms or groups of atoms, 47 such as hydrogen bond donor or acceptor, hydrophobicity, shape, charge or aromaticity; for example proteins may be spatially mapped according to such descriptors using GRID and this representation used as a basis for similarity searching (Baroni et a] (2007) J Chem Inf Mod 47, 279-294). Descriptions of the 5 methods, availability of software, and guidelines for user-defined selection of parameters, thresholds and tolerances are described in the references given above. In a preferred embodiment, the structural membrane protein model is a structural GPCR model. It will be appreciated that the structural model of a GPCR may be a 10 model of the first parent GPCR. For example, stabilising mutations within a mutant GPCR having increased stability can be directly mapped onto the first parent GPCR structure and the structural motifs in which such mutations are located, identified. Where the structure of the first parent GPCR is unknown, structural models of other GPCRs may be used. For example, stabilising 15 mutations in a GPCR from one species may be mapped onto a known structural model of the same GPCR from another species. Similarly, stabilising mutations in one particular GPCR isoform may be mapped onto a known structural model of another GPCR isoform. Moreover, stabilising mutations from one GPCR may be mapped onto a GPCR of the same class or family. A list of GPCR classes and 20 families has been produced by the International Union of Pharmacology (Foord et al (2005) Pharmacol. .Rev. 57, 279-288) and this list is periodically updated at http://www.iuphar-db.org/GPCR/ReceptorFamiliesForward. As described above, it will be appreciated that the structural model may be of any 25 GPCR provided it has sufficient structural similarity across the domain in which the mutant GPCR has at least one different amino acid compared to the first parent GPCR. Thus, it is preferred if the GPCR shares at least 20% sequence identity with the mutant of the first parent GPCR across the protein domain containing the at least one different amino acid residue compared to the first parent GPCR, and 30 more preferably at least 30%, 40%, 50%, 60%, 70%, 80% or 90% sequence identity, and yet more preferably at least 95% or 99% sequence identity. However, the inventors recognise that the 20% sequence identity threshold is not 48 absolute. GPCRs with less than 20% sequence identity to the first parent GPCR may also serve as a structural model to which stabilising mutations are transferred, wherein the low sequence identity is counterbalanced by other similarities,. including, for example, the presence of the same sequence motifs, binding to the s same G-protein or having the same function, or having substantially the same hydropathy plots compared to the first parent GPCR. Mapping of stabilising mutations onto the structural model can be done using any suitable method known in the art. For example, typically, the amino acid 10 sequence of the GPCR for which the structural model is available is aligned with the amino acid sequence of the mutant of the first parent GPCR. The position or positions of the at least one different amino acid residue in the mutant GPCR relative to the first parent GPCR can then be located in the amino acid sequence of the GPCR for which a structural model is available. 15 By 'structural motif' we include the meaning of a three dimensional description of the location in a GPCR structural model of a thermostabilising mutation. For example, the structural motif may be any secondary or tertiary structural motif within the GPCR. By 'tertiary structural motif' we include any descriptor of 20 atoms or groups of atoms, such as hydrogen bond donor or acceptor, hydrophobicity, shape, charge or aromaticity. For example, proteins may be spatially mapped according to such descriptors using GRID and this representation used as a basis for defining a structural motif (Baroni et al (2007) J Chem Inf Mod 47, 279-294). 25 Table (vi) lists the structural motifs in which the turkey P1 adrenergic receptor stabilising mutations were found to reside. As seen from the table, the mutations are positioned in a number of distinct localities. Three mutations are in loop regions that are predicted to be accessible to aqueous solvent. Eight mutations are 30 in the transmembrane a-helices and point into the lipid bilayer; three of these mutations are near the end of the helices and may be considered to be at the hydrophobic-hydrophilic boundary layer. Eight mutations are found at the 49 interfaces between transmembrane a-helices, three of which are either within a kinked or distorted region of the helix and another two mutations occur in one helix but are adjacent to bne or more other helices which contain a kink adjacent in space to the mutated residue. These latter mutations could affect the packing of s the amino acids within the kinked region, which could result in thermostabilisation. Another mutation is in a substrate binding pocket. Accordingly, in one embodiment, the structural motif is any of a helical interface, a helix kink, a helix opposite a helix kink, a helix surface pointing into the lipid 10 bilayer, a helix surface pointing into the lipid bilayer at the hydrophobic hydrophilic boundary layer, a loop region or a protein binding pocket. Identifying a structural motif in which a stabilising mutation resides suggests the importance of that motif in protein stability. Therefore, making one or more 15 mutations in the amino acid sequence that defines a corresponding structural motif or motifs in a second parent GPCR, should provide one or more mutants of a second parent GPCR with increased stability relative to the second parent GPCR. The amino acid sequence which defines a structural motif is the primary amino 20 acid sequence of the amino acid residues which combine in the secondary or tertiary structure of the protein to form the structural motif. It will be appreciated that such a primary amino acid sequence may comprise contiguous or non contiguous amino acid residues. Thus, identifying the amino acid sequence which defines the structural motif will involve determining the residues involved and 25 subsequently defining the sequence. Mutations can be made in an amino acid sequence, for example as described above and using techniques well-established in the art. By "corresponding structural motif or motifs", we mean the analogous structural ;o motif or motifs identified in the structural model which are present in the second parent GPCR. For example, if a helical interface was identified, the corresponding helical interface in the second parent GPCR would be the interface 50 between the belices which are analogous to the helices present in the structural model. If a helical kink was identified, the corresponding helical kink would be the kink in the helix which is analogous to the kinked helix present in the structural model, An analogous structural motif or motifs in the second parent s GPCR can be identified by searching for similar amino acid sequences in the sequence of the second parent GPCR which define the motif or motifs in the structural model, for example, by sequence alignment. Moreover, computer based algorithms are widely available in the art that can be used to predict the presence of protein motifs based on an amino acid sequence. Thus, based upon the relative 10 position of a particular motif within the amino acid sequence and its position ( relative to other motifs, an analogous structural motif can readily be identified. It will be appreciated that if a structural model of the second parent GPCR is available, the analogous structural motif or motifs can be directly mapped onto the structure of the protein. Typically, the amino acid sequence defining the 15 analogous structural motif has at least 20% sequence identity with the sequence defining the motif in the structural model, more preferably at 16ast 30%, 40%, 50%, 60%, 70%, 80% and 90% sequence identity and yet more preferably 95% and 99% sequence identity. 20 In one embodiment, the second parent GPCR is the first parent GPCR. For the avoidance of doubt, the second parent GPCR may have the naturally-occurring sequence of the first parent GPCR, or it may be a truncated form or it may be a fusion, either to the naturally occurring protein or to a fragment thereof, or it may contain mutations compared to the naturally-occurring sequence, providing that it 25 retains ligand-binding. In an alternative embodiment, the second parent GPCR is not the first parent GPCR. For example, a mutant of a first parent GPCR may have been identifed that has increased stability but it is desired to generate a mutant of a different 30 GPCR with increased stability. Preferably, the second parent GPCR is of the same GPCR class or family as the first parent GPCR as described above. However, it will be appreciated that the second parent GPCR may be any known 51 GPCR provided that it shares sufficient structural similarity with the first parent GPCR, such that it contains a corresponding structural motif in which the stabilising mutation of the mutant of the first parent GPCR resides. Thus typically, the second parent GPCR has at least 20% sequence identity to the first 5 parent GPCR and more preferably at least 30%, 40%, 50%, 60%, 70%, 80% or 90% sequence identity. However, as mentioned above, some GPCRs have low sequence identity (e.g. family B and C GPCRs) but are similar in structure. Thus the 20% sequence identity threshold is not absolute. 10 Since there are potentially thousands of mutations that can be screened in a GPCR for increased stability, it is advantageous to target particular mutations which are known to be important in conferring stability. Therefore, it will be appreciated that the methods of the eighth and ninth aspects of the invention may be used in a method of selecting mutant GPCRs with increased stability. In particular, is carrying out the methods of the eighth or ninth aspects of the invention can be used to target mutations to particular amino acid residues or to amino acid sequences which define structural motifs important in determining stability. Accordingly, in one embodiment the methods of the eighth or ninth aspects further 20 comprise: (I) selecting a ligand, the ligand being one which binds to the second parent GPCR when the GPCR is residing in a particular conformation 25 (II) determining whether the or each mutant of the second parent GPCR when residing in a particular conformation has increased stability with respect to binding the selected ligand compared to the stability of the second parent GPCR when residing in the same particular conformation with respect to binding that ligand, and 30 (III) selecting those mutants that have an increased stability compared to the second parent GPCR with respect to binding the selected ligand.
52 It will be noted that steps (I), (II) and (III) correspond to steps (b), (c) and (d) of the method of the first aspect of the invention described above. Accordingly, preferences for the ligand and methods of assessing stability are as defined abQve 5 with respect to the method of the first aspect of the invention. A tenth aspect of the invention provides a mutant GPCR with increased stability relative to its parent GPCR produced by the method of the tenth aspect of the invention. 10 In one embodiment, the mutant GPCR of the tenth aspect of the invention is a mutant GPCR which has, compared to its parent receptor, at least one different amino acid at a position which corresponds to any one or more of the following positions: (i) according to the numbering of the turkey -adrenergic receptor as set 15 out in Figure 9: Ie 55, Gly 67, Arg 68, Val 89, Met 90, Gly 67, Ala 184, Arg 199, Ala 203, Leu 208, GIn 210, Ser 213, Glu 219, Arg 220, Ser 223, Thr 224, GIn 226, Lys 227, His 230, Leu 241, Pro 260, Ser 263, Leu 267, Leu 272, Thr 279, Asn 284, Gln 311, Pro 313, Lys 315, (iii) according to the numbering of the rat neurotensin receptor as set out in Figure 11: Ala 69, Leu 72, Ala 73, Ala 86, Ala 20 90, Ser 100, His 103, Ser 108, Leu 109, Leu 111, Asp 113, Ile 116, Ala 120, Asp 139, Phe 147, Ala 155, Val 165, Glu 166, Lys 176, Ala 177, Thr 179, Met 181, Ser 182, Arg 183, Phe 189, Leu 205, Thr 207, Gly 209, Gly 215, Val 229, Met 250, Ie 253, Leu 256, Ile 260, Asn 262, Val 268, Asn 270, Thr 279, Met 293, Thr 294, Gly 306, Leu 308, Val 309, Leu 310, Val 313, Phe 342, Asp 345, Tyr 349, 25 Tyr 351, Ala 356, Phe 358, Val 360, Ser 362, Asn 37.0, Ser.373, Phe 380, Ala 385, Cys 386, Pro 389, Gly 390, Trp 391, Arg 392, His 393, Arg 395, Lys 397, Pro 399, and (iv) according to the numbering of the muscarinic receptor as set out in Figure 17: Leu 65, Met 145, Leu 399, fle 383 and Met 384. 30 Alignment of the turkey p1 AR, human adenosine receptor, rat neurotensin receptor and human muscarinic receptor amino acid sequences in Figure 17, shows that in 11 instances out of 70, two sequences contain mutations art the same 53 position, namely at the following positions according to the numbering of the human beta. AR as set out in Figure 17: Ala 59, Val 81, Ser 143, Lys 147, Val 152, Glu 180, Val 222, Ala 226, Ala 271, Leu 275 and Val 317. Therefore, in a preferred embodiment, the miitant GPCR of the tenth aspect of the invention is 5 one which has, compared to its parent receptor, a different amino acid at any one of these positions. In one embodiment the mutant GPCR of the tenth aspect of the invention is a mutant p-adrenergic receptor. For example, the mutant p-adrenergic receptor may 10 have at least one different amino acid residue in a structural motif in which the mutant receptor compared to its parent receptor has a different amino acid at a position which corresponds to any of the following positions according to the numbering of the turkey 0-adrenergic receptor as set out in Figure 9: Ile 55, Gly 67, Arg 68, Val 89, Met 90, Gly 98, Ile 129, Ser 151, Val 160, GIn 194, Gly 197, 15 Leu 221, Tyr 227, Arg 229, Val 230, Ala 234, Ala 282, Asp 322, Phe 327, Ala 334, Phe 338. In one embodiment the mutant GPCR of the tenth aspect of the invention is a mutant adenosine receptor. For example, the mutant adenosine receptor may have 20 at least one different amino acid residue in a structural motif in which the mutant receptor compared to its parent receptor has a different amino acid at a position which corresponds to any of the following positions according to the numbering of the human adenosine A 2 . receptor as set out in Figure 10: Gly 114, Gly 118, Leu 167, Ala 184, Arg 199, Ala 203, Leu 208, GIn 210, Ser 213, Glu 219, Arg 220, 25 Ser 223, Thr 224, GIn 226, Lys 227, His 230, Leu 241, Pro 260, Ser 263, Leu 267, Leu272, Thr 279, Asn 284, Gln 311, Pro 313, Lys 315. In one embodiment the mutant GPCR of the tenth aspect of the invention is a mutant neurotensin receptor. For example, the mutant neurotensin receptor may 30 have at least one different amino acid residue in a structural motif in which the mutant receptor compared to its parent receptor has a different amino acid at a position which corresponds to any of the following positions according to the 54 numbering of the rat neurotensin receptor as set out in Figure 11: Ala 69, Leu 72, Ala 73, Ala 86,Ala 90, Ser 100, His 103, Ser 108, Leu 109, Leu 111, Asp 113, le 116, Ala 120, Asp 139, Phe 147, Ala 15.5, Val 165, Glu 166, Lys 176, Ala 177, Thr 179, Met 181, Ser. 182, Arg 183, Phe 189, Leu 205, Thr 207, Gly 209, Gly 5 215, Val 229, Met 250, le 253, Leu 256, Ile 260, Asn 262, Val 268, Asn 270, Thr 279, Met 293, Thr 294, Gly 306, Leu 308, Val 309, Leu 310, Val 313, Phe 342, Asp 345, Tyr 349, Tyr 351, Ala 356, Phe 358, Val 360, Ser 362, Asn 370, Ser 373, Phe 380, Ala 385, Cys 386, Pro 389, Gly 390, Trp 391, Arg 392, His 393, Arg 395, Lys 397, Pro 399. 10 In one embodiment the mutant GPCR of the tenth aspect of the invention is a mutant muscarinic receptor. For example, the mutant muscarinic receptor may have at least one different amino acid residue in a structural motif in which the mutant receptor compared to its parent receptor has a different amino acid at a is position which corresponds to any of the following positions according to the numbering of the human muscarinic receptor as set out in Figure 17: Leu 65, Met 145, Leu 399, Ile 383 and Met 384. It is preferred that the mutant GPCRs of the invention have increased stability to 20 any one of heat, a detergent, a chaotropic agent and an extreme of pH. It is preferred if the mutant GPCRs of the invention have increased thermostability. 25 It. is preferred that the mutant GPCRs of the invention, including the mutant P-adrenergic, adenosine and neurotensin receptors, have an increased thermostability compared to its parent when in the presence or absence of a ligand thereto. Typically, the ligand is an antagonist, a full agonist, a partial agonist or an inverse agonist, whether orthosteric or allosteric. As discussed above, the 30 ligand may be apolypeptide, such as an antibody.
55 It is preferred that the mutant GPCRs of the invention, for example a mutant p--adrenergic receptor or a mutant adenosine receptor or a mutant neurotensin receptor is at least 2*C more stable than its parent preferably at least 5*C more stable, more preferably at least 8*C more stable and even more preferably at least 5 10*C or 15*C or 20*C more stable than its parent. Typically, thermostability of the parent and mutant receptors are measured under the same conditions. Typically, thermostability is assayed under a condition in which the GPCR resides in a particular conformation. Typically, this selected condition is the presence of a ligand which binds the GPCR 10 It is preferred that the mutant GPCRs of the invention, when solubilised and purified in a suitable detergent has a similar thermostability to bovine rhodopsin purified in dodecyl maltoside. It is particularly preferred that the mutant GPCR retains at least 50% of its ligand binding activity after heating at 40 0 C for 30 15 minutes. It is further preferred that the mutant GPCR retains at least 50% of its ligand binding activity after heating at 55*C for 30 minutes. The mutant GPCRs disclosed herein are useful for crystallisation studies and are useful in drug discovery programmes. They may be used in biophysical 20 measurements of receptor/ligand kinetic and thermodynamic parameters eg by surface plasmon resonance or fluorescence based techniques. They may be used in ligand binding screens, and may be coupled to solid surfaces for use in high throughput screens or as biosensor chips. Biosensor chips containing the mutant GPCRs may be used to detect molecules, especially biomolecules. 25 The invention also includes a polynucleotide which encodes a mutant GPCR of the invention. In particular, polynucleotides are -included which encode the mutant p-adrenergic receptor or the mutant adenosine receptors or the mutant neurotensin receptors of the invention. The polynucleotide may be DNA or it 30 may be RNA. Typically, it is comprised in a vector, such as a vector which can be used to express the said mutant GPCR. Suitable vectors are ones which 56 propagate in and/or allow the expression in bacterial or mammalian or insect cells. The invention also includes host cells, such as bacterial or eukaryotic cells, which 5 contain a polynucleotide which encodes the mutant GPCR. Suitable cells include E. coli cells, yeast cells, mammalian cells and insect cells. The invention will now be described in more detail with respect to the following Figures and Examples wherein: 10 Figure 1 Amino acid changes in PAR that lead to thermostability. Stability quotient indicates the % remaining binding activity of the mutants after heating the sample for 30 min at 32"C. All values are normalized to PAR 3 a 24 (50%, showed as a discontinuous line) to remove any experimental variability between 15 assays. Bars show the stability for each mutant. The letters on the x-axis indicate the amino acid present in the mutant. The original amino acid and its position in
PAR
3 44 2 4 is indicated below. Bars corresponding to the same amino acid in
PAR
3 4 42 4 are in the same colour with arrows indicating the best mutations. Errors were calculated from duplicate measurements; the best mutants were subsequently 20 re-assayed to determine the Tm for each individual mutation and to give an accurate rank order of stability for each mutant (see Example 1). Figure 2 Side chains in rhodopsin that are at equivalent positions to the thermostable mutations in QAR34.424. The equivalent amino acid residues in 25 rhodopsin to the amino acid residues mutated in PAR 3 4- 2 4 were located in the rhodopsin structure, based upon an alignment among rhodopsin, p1 adrenergic receptor, neurotensin receptor, and adenosine A2a receptor (data not shown). Side chains in the same transmembrane helix are shown as space filling models in the same colour. The name and position of the amino acid residues are those in 30 rhodopsin.
57 Figure 3 Evolution of thermostability in PAR. Starting from IAR-ml0-8, combinations of mutations were rearranged systematically to fmd the optimum combination- of mutations (see also Table 2). 5 Figure 4 Stability of PAR-m23 and pAR344 2 4 in the apo-state or containing the bound antagonist [H]-DHA. To determine Tm in the absence of ligand (apo-state, discontinuous lines), detergent-solubilised receptors were incubated for 30 minutes at the temperatures indicated before carrying out the binding assay. For the Tm determination of the antagonist-bound form (continuous lines), 10 detergent-solubilised receptors were pre-incubated with [ 3 H]-DHA, followed by incubation at the temperatures indicated. 3AR-m23 (circles), and PAR344 24 (squares). Data points are from duplicates measurements in a representative experiment. 15 Figure 5 Competition binding of agonists to PAR-m23 and PAR 3 4 4 2 4 . Binding assays were performed on receptors partially purified in DDM; PAR-n23 (triangles) and pAR34 2 4 (squares). [ 3 H]-DHA was used at a concentration three times greater than the KD of partially purified receptor (see Methods). [ 3 H]-DHA binding was competed with increasing concentrations of the agonists, 20 norepinephrine (a) and isoprenaline (b), or with an antagonist, alprenolol (c), LogECso and corresponding ECso values for the different ligands were calculated by nonlinear regression using GraphPad Prism software and the error for logEC 5 os were lower than 10%. The EC 5 os for ligand binding to PAR 34
-
4 2 4 and fAR-m23 are: norepinephrine, pAR 3 4 24 1.5 pM, $AR-mn23 3.7 mM; isoprenaline, PAR 3 4.4 2 4 2-5 330 nM, 3AR-m23 20 pM; alprenolol, PAR 78 nM, pAR-rn23 112 nM. Figure 6 Stability of PAR-m23 and 0AR 3 4 2 4 in five different detergents. Samples of pAR 3 4 2 4 (a), and PAR-m23 (b) solubilized in DDM were partially purified on Ni-NTA agarose columns allowing the exchange into various different io detergents: DDM (squares), DM (triangles), OG (inverted triangles), LDAO (diamonds) and NG (circles). PAR is so unstable in OGT, NG and LDAO that it was not possible to measure any activity after purification at 6*C. Assays were 58 carried out as described in the Methods and the Tm is shown at the intersection between the curves and the discontinuous line. Results are from duplicate measurements in a representative experiment performed in parallel. (c) Photomicrograph of a crystal of #AR--23 mutant, which showed good order by 5 X-ray diffraction. Figure 7 Curve of thermostability of 0AR344 24 (Tm). Binding assays were performed using [ 3 H)-dihydroalprenolol (DHA) as radioligand as described under "Methods". Samples were heated for 30 minutes at different temperatures before 10 the assay. Tm represents the temperature at which the binding decreased to the 50%, value showed as a discontinuous line. Data points are from duplicates of one single experiment. This experiment has been repeated several times with similar results. 15 Figure 8 Saturation binding assays of membranes of PAR 3 4 24 and 3AR-m23. Binding assays were performed as described in "Methods" using
[
3 H]-dihydroalprenolol (DHA) as radioligand; PAR 3 4 .4 2 4 (a) and PAR-m23 (b). Scatchard plots are shown as insets along with the corresponding values for B. and KD. Data points are from duplicates of two independent experiments for each !o protein. Data were analyzed by nonlinear regression using Prism software (GraphPad). Figure 9 Alignment of the turkey #-adrenergic receptor with human P1, P2 and 03 receptors. 5 Figure 10 Alignment of human adenosine receptors. Figure 11 Alignment of neurotensin receptors. > Figure 12 Flow chart showing the two different assay formats of ligand (+) and ligand (-) used to determine receptor thermostablity.
59 Figure 13 Pharmacological profile of thermostable mutant adenosine A2a receptor, Rant21. Saturation binding of (A) antagonist and. (B) agonist to solubilised receptors. (C-F) Inhibition-of [ 3 H]ZM241385 binding by increasing concentrations of antagonists (C) XAC and (D) Theophylline, and agonists (E) 5 NECA and (F) R-PIA; binding of [ 3 H]ZM241385 (10 nM) in the absence of unlabelled ligand was set to 100%. Each solubilised receptor was incubated with ligands for one hour on ice in binding buffer (50mM Tris pH7.5 and 0.025% DDM) containing 400 mM NaCl (A, C-F). Data shown are from two independent experiments with each data point measured in triplicate. KD and Ki values are to given in Table (iii). Figure 14 Thermostable mutants show a decreased dependence on lipids (A) and an increased survival at higher concentration of DDM (B) upon heating compared to the wild-type receptor. Receptors were solubilised in 1% DDM (diluted in 50 5 mM Tris pH7.5 and 400 mM NaCl) and immobilised on Ni-NTA agarose for the IMAC step. Exchange of buffer containing the appropriate concentration of DDM and/or lipids was performed during washes and elution from the Ni-NTA beads. Figure 15 Mapping of the M90V, Y227A, A282L and F338M m23 mutations in turkey betal adrenergic receptor onto homologous residues (M82, Y219, C265 and A321 respectively) in the human beta2 adrenergic receptor structure (Rasmussen et al (2007) Nature 15;383-387; pdb accession codes 2R4R and 2R4S) reveals their position at a helical interface and helical kink respectively. Amino acid residues in equivalent positions to the thermostabilising mutations in the turkey p1 adrenergic receptor are shown as labelled space filling models. Figure 16 Mapping of m23 mutations in turkey betal adrenergic receptor onto homologous residues in the human beta2 adrenergic receptor structure (Cherezov et al (2007) Science, 318:1258-65; pdb accession code 2RH1). The Ccitrace of the 02AR is shown with the fusion moiety (T4 lysozyme) removed. The six mutations in 3AR-m23 (R68S, M90V, Y227A, A282L, F327A, F338M) are equivalent to amino acid residues K60, M82, Y219, C265, L310, F321 in the 60 human p2AR. Lys60 is on the intracellular end of Helix I and points into the lipid-water interface. MetS2 is near the middle of Helix .2 and points into the ligand binding pocket; the nearest distance between the substrate carazolol and the Met side chain is 5.7 A. Tyr219 is towards the intracellular end of helix 5 and is 5 at the helix5-helix 6 interface. Cys265 is at the end of the loop region between helices 5 and 6 and points away from the transmembrane regions. Leu3 10 and Phe321 are both in helix 7 and both point out into the lipid bilayer. Figure 17 Multiple sequence alignment of human beta-2AR, rat NTRI, turkey 10 beta-1 AR, human Adenosine A2aR and human muscarinic M1 receptors. In each sequence, thermostabilising mutations are marked with a box. Mutations occurring in two or more sequences are denoted with a star. Figure 18 Mapping of turkey betalAR mutation 155A (human beta2AR 147) onto 15 human beta2AR structure (pdb accession code 2RHl). Mutation is at the interface between 3 helices (HI, H2 kink, H7 kink). Left: side view; right: top view. Figure 19 Mapping of turkey betalAR V89L mutation (human beta2AR V81) onto human beta2AR structure (pdb accession code 2RH1). Mutation is in the 20 kink in helix 2. The helices are numbered and the bound antagonist is shown as a space filling model. Amino acid residues in equivalent positions to the thermostabilising mutations in the turkey $1 adrenergic receptor are shown as space filling models and are arrowed for clarity. Left: side view; right: top view. 25 Figure 20 Mapping of turkey betalAR M90V mutation (human beta2AR M82) onto human beta2AR structure (pdb accession code 2RHI). Mutation is in kink-in 'helix 2 oriented towards the binding pocket. The helices are numbered and the bound antagonist is shown as a space filing model. Amino acid residues in equivalent positions to the thermostabilising mutations in the turkey 1 adrenergic 30 receptor are shown as space filling models and are arrowed for clarity. Left: side view; right: top view.
61 Figure 21 Mapping of turkey betalAR Il29V mutation (human beta2AR 1121) onto human beta2AR structure (pdb accession code 2RH1). Mutation is opposite a kink in helix S. The helices are numbered and the bound antagonist is shown as a space filling model. Amino acid residues in equivalent positions to the 5 thermostabilising mutations in the turkey p1 adrenergic receptor are shown as space filling models and are arrowed for clarity. Left: side view; right: bottom view. Figure 22 Mapping of turkey betalAR F338M mutation (human beta2AR F321) 10 onto human beta2AR structure (pdb accession code 2RH1). Mutation is in kink in helix 7. The helices are numbered and the bound antagonist is shown as a space filling model. Amino acid residues in equivalent positions to the thermostabilising mutations in the turkey Pl adrenergic receptor are shown as space filling models and are arrowed for clarity. Left: side view; right: top view. 15 Figure 23 Mapping of turkey betalAR Y227A mutation (human beta2AR Y219) onto human beta2AR structure (pdb accession code 2RH1). Mutation is at helix helix interface. The helices are numbered and the bound antagonist is shown as a space filling model. Amino acid residues in equivalent positions to the !o thermostabilising mutations in the turkey Pl adrenergic receptor are shown as space filling models and are arrowed for clarity. Left: side view; right: bottom view. Figure 24 Mapping of turkey betalAR A282L mutation (human beta2AR C265) 5 onto human beta2AR structure (pdb accession code 2RH1). Mutation is in loop region. The helices are numbered and the bound antagonist is shown as a space filling model. Amino acid residues in equivalent positions to the thermostabilising mutations in the turkey $1 adrenergic receptor are shown as space filling models and are arrowed for clarity. Left: side view; right: top view. Figure 25 Mapping of turkey betalAR 'R68S mutation (human beta2AR K60) onto human beta2AR structure (pdb accession code 2RH1). Mutation is at the 62 lipid-water boundary, pointing into the solvent. The helices are numbered and the bound antagonist is shown as a space filling model. Amino acid residues in equivalent positions to the thermostabilising mutations in the turkey 031 adrenergic receptor are shown as space filling models and are arrowed for clarity. Left: side 5 view; right: angled top view: Figure 26 Comparison of the thermostabilities of three p adrenergic receptors (turkey pl (z), human Pl (V) and human p2 (e)) and two thermostabilised receptors (turkey P1-m23 (A) and human p2-m23 (+)). The six thermostabilising 10 mutations in P1-m23 (R68S, M90V, Y227A, A282L, F327A, F338M) were all transferred directly to the human P2 receptor (K60S, M82V, Y219A, C265L, L310A, F321M) making 02-m23, based upon the alignment in Figure 9. The resulting mutants were transiently expressed in mammalian cells, solubilised in 0.1% dodecylmaltoside and assayed for thenmostability in the minus-ligand format 15 (heating the apo-state, quenching on ice, adding 3H-DHA). The apparent Tins for turkey P1 and p2-mn23 were 23*C and 4500 respectively, giving a ATm of 22 0 C as seen previously in E. coli expressed receptor. The Tins for human P2 and 2-m.23 were 29*C and 41*C respectively, showing that the apo receptor was stabilised by 12*C. This exemplifies the principle of the transferability of thermostabilising 20 mutations from one receptor to another receptor, which in this case are 59% identical. The human p1 receptor (Tm-12*C) is much less stable than the turkey 31 receptor. Figure 27 Percentage identity of the turkey P1 adrenergic receptor, human 25 adenosine receptor and rat neurotensin receptor to human p adrenergic receptors, human adenosine receptors and human neurotensin receptors, respectively. Figure 28 Alignment of neurotensin receptors 30 Example 1: Conformational stabilisation of the 5-adrenergic receptor in detergent-resistant form 63 Summary There are over 500 non-odorant G protein-coupled receptors (GPCRs) encoded by the human genome, many of which are predicted to be potential therapeutic 5 targets, but there is only one structure available, that of bovine rhodopsin, to represent the whole of the family. There are many reasons for the lack of progress in GPCR structure determination, but we hypothesise that improving the detergent-stability of these receptors and simultaneously locking them into one preferred conformation will greatly improve the chances of crystallisation. A 10 generic strategy for the isolation of detergent-solubilised thermostable mutants of a GPCR, the p-adrenergic receptor, was developed based upon alanine scanning mutagenesis followed by an assay for receptor stability. Out of 318 mutants tested, 15 showed a measurable increase in stability. After optimisation of the amino acid residue at the site of each initial mutation, an optimally stable receptor is was constructed by combining specific mutations. The most stable mutant receptor, 3AR-m23, contained 6 point mutations that led to a Tm 21*C higher than the native protein and, in the presence of bound antagonist, PARm23 was as stable as bovine rhodopsin. In addition, PAR-m23 was significantly more stable in a wide range of detergents ideal for crystallisation and was preferentially in an 20 antagonist conformation in the absence of ligand. Results Selection of single mutations that increase the thermostability of the P1 25 adrenergic receptor PAR from turkey erythrocytes is an ideal target for structural studies because it is well characterised and is expressed at high-levels in insect cells using the baculovirus expression system[10,11]. The best overexpression of PAR is 30 obtained using a truncated version of the receptor containing residues 34-424
(PAR
3 4 24 ) [9] and this was used as the.starting point for this work. Alanine scanning mutagenesis was used to define amino residues in pAR 34
-
424 that, when 64 mutated, altered the thermostability of the receptor; if an alanine was present in the sequence it was mutated to a leucine residue. A total of 318 mutations. were made -to amino acid residues 37-369, a region that encompasses all seven transmembrane domains and 23 amino acid residues at the C terminus; mutations 5 at 15 amino residues were not obtained due to strong secondary structure in the DNA template. After sequencing each mutant to ensure the presence of only the desired mutation, the receptors were functionally expressed in E. coli and assayed for stability. 10 The assay for thennostability was performed on unpurified detergent-solubilised receptors by heating the receptors at 32*C for 30 minutes, quenching the reaction on ice and then performing a radioligand binding assay, using the antagonist
[
3 H}-dihydroalprenolol, to determine the number of remaining functional AR39 4 24 molecules compared to the unheated control. Heating the unmutated 15 @AR 3 4 4 24 at 32*C for 30 min before the assay reduced binding to approximately 50% of the unheated control (Fig. 7); all the data for the mutants were normalised by including the unmutated pAR 34
-
4 24 as a control in every assay performed. In the first round of screening, eighteen mutants showed an apparent increase in stability, maintaining more than 75% of antagonist binding after heating and being 20 expressed in E. coli to at least 50% of the native pAR 3 44 24 levels. In view of the possibility of increasing further the stability of these mutants, each of the 18 residues was mutated to 2-5 alternative amino acid residues of varying size or charge (Fig. 1). Out of these 18 mutants, 12 were not improved by further changes, 5 had better thermostability if another amino acid was present and one 25 mutation from the first screen turned out to be a false positive. In addition, three residues that were not stabilised upon mutation to alanine (V89, S151, L221) were mutated to a range of other amino acid residues; the two positions that when mutated to alanine did not affect thermostability, were also unaffected by other changes. In contrast, V89 showed less thermostability when mutated to alanine, 30 but thermostability increased when it was mutated to Leu. Thus the initial alanine scanning successfully gave two-thirds of the best amino acid residues of those tested for any given position.
65 The position and environment predicted for each of the 16 amino residues that gave the best increases in thermostability when mutated were determined by aligning the PAR sequence with that of rhodopsin whose structure is known 5 (Fig. 2). Fourteen of these residues were predicted to be present in transmembrane a-helices, with five of the residues predicted to be lipid-facing, 4 being deeply buried and the remainder were predicted to be at the interfaces between the helices. Some of these residues would be expected to interact with each other in the PAR structure, such as the consecutive amino acids G67 and R68 10 (V63 and Q64 in rhodopsin), or the amino acids within the cluster Y227, R229, V230 and A234 in helix 5 (Y223, Q225, L226 and V230 in rhodopsin). Other amino acid residues that could interact in PAR were Q194A in external loop 2 and D322A in external loop 3 (G182 and P285 in rhodopsin, respectively). 15 The increase in stability that each individual mutation gave to PAR 3
"
24 was determined by measuring the Tm for each mutant (results not shown); Tm in this context is the temperature that gave a 50% decrease in functional binding after heating the receptor for 30 minutes. Each mutation increased the Tm of pAR 3 4.4 24 by 1-3*C, with the exception of M90A and Y227A that increased the Tm by 8 0 C. 20 66 Combining mutations to make an optimally stable receptor Initially, mutations that improved thennostability that were adjacent to one another in the primary amino sequence of PAR were combined. Constructions 5 containing the mutations G67A and R68S, or different combinations of the mutations at the end of helix 5 (Y227A, R229Q, V230A and A234L) were expressed and assayed; the Tm values (results not shown) were only 1-3*C higher than the Tm for PAR 3 44 24 and one mutant was actually slightly less stable, suggesting that combining mutations that are adjacent to one another in the 10 primary amino acid sequence does not greatly improve thermo stability. Subsequently, mutations predicted to be distant from one another in the structure were combined. PCR reactions were performed using various mixes of primers to combine up to 5 different mutations in a random manner and then tested for thermostability (Table 1). The best of these combinations increased the Tm more 15 than 100C compared to the Tm of PAR 3 4 24 . In some cases, there was a clear additive effect upon the Tm with the sequential incorporation of individual mutations. This is seen in a series of 3 mutants, m4-1, m4-7 and m4-2, with the addition of V230A to m4-1 increasing the Tm by 2*C and the additional mutation D332A in m4-7 increasing the Tm a further 3*C. Mutants that contained Y227A 20 and M90A all showed an increase in Tm of 10*C or more. Just these two mutations together increased the Tm of PAR 3 44 24 by 13*C (m7-5), however, the total antagonist binding was less than 50% of 3AR 34 4 24 suggesting impaired expression of this mutant. The addition of F338M to m7-5 did not increase the thermostability, but it increased levels of functional expression in E. coli. 25 Table 1 Combinations of mutations by PCR. 10 PCR reactions were performed combining different pairs of primers that contained the selected mutations. Successful PCR reactions are shown in the table. The stability of these new mutants was assayed as described in Figure 7 and the Tm calculated. The results 30 are shown as the mean t S.E. from duplicates.
67 PCR Receptor Mutations Tm ( 0 C) pAR34424 31.7+0.1 4 m4-1 G67A, G98A 35.5+0.9 m4-2 G67A, G98A, V230A, D322A 40.9*0.9 m4-6 G98A, D322A 35.010.2 m4-7 G67A, G98A, V230A 38.0 1.2 6 m6-1 Y227A, A234L, A282L, A334L 41.6±0.9 m6-4 R68S, Y227A, A234L, A282L 41.6±0.1 m6-5 R68S, A234L, A282L, A334L 41.9±0.5 m6-9 R68S, Y227A, A234L, A282L, A334L 43.7±0.4 m6-10 R68S, Y227A, A282L, A334L 47.4±1.1 m6-11 R68S, A282L, A334L 39.1±0.5 7 m7-1 M90V, A282L,F338M 43.0 0.8 m7-2 M90V, A282L 38.9±0.6 m7-5 M90V, Y227A 45.2±1.0 m7-6 M90V, I129V 40.0±0.6 m7-7 M90V, Y227A, F338M 45.2±2.0 10 m10-4 R68S, M90V, V230A, A334L 46.9±1.0 m10-8 R68S, M90V, V230A, F327A, A334L 47.3±1.4 The most thermostable mutants obtained, which were still expressed at high levels in E. coli, were m6-10, m7-7 and ml0-8. These mutants contained collectively a total of 10 different mutations, with 8 mutations occurring in at least two of the 5 mutants. A second round of mutagenesis was performed using m10-8 as the template and adding or replacing mutations present in m6-1 0 and m7-7 (Fig. 3); some of these mutations were very close in the primary amino acid sequence of PAR and therefore were not additive as noted above, but many mutations improved the Tm further (Table 2). For example, exchanging two mutations in 10 m10-8, to create m18, raised the Tm to 49.6"C and adding A282L to make m23 increased the Tm a further 3 0 C to 52.8*C. This produced the most thermostable
PAR
34 4 2 4 mutant so far and will be referred to as PAR-m23. Table 2 Improvement of best combination of mutations. These new mutants is were obtained mixing the changes present in mutants m6-1 0, m7-7 and ml 0-8 by PCR. The stability of these new mutants was assayed as described in Figure 7 and the Tm calculated. The results are shown as the mean + S.E. from duplicates.
68 Mutations TM ("C) m17 R68S M90V Y227A V230A - F327A A334L - 48.2*1.4 m18 R68S M90V Y227A V230A A282L F327A - F338M 49.6=0/9 m19 R68S M90V Y227A - A282L F327A - F338M 49.0+0.8 m20 R68S M90V - - - F327A A334L - 48.4+0.7 m21 R68S M90V Y227A - - F327A A334L - 47.0±1.3 m22 R68S M90V Y227A F327A A334L - 47.440.5 m23 R68S M90V Y227A - A282L F327A - F338M 52.811.4 The thermostability assays used to develop PAR3"2 4 mutants were performed by heating the receptor in the absence of the antagonist, but it is well known that bound ligand stabilises receptors. Therefore, stability assays for PAR 3 4.4 2 4 and 5 pAR--m23 were repeated with antagonist bound to the receptors during the heating step (Fig. 4). As expected, the Tm of the receptor that contained bound antagonist during the incubation was higher than that for the receptor without antagonist. For pAR 34424 the Tm was 6*C higher with bound antagonist and for PAR-m23 the Tm increased 2"C to 55*C; the smaller increase in thermostability observed for 10 pAR-m23 when antagonist binds suggests that the receptor is already in a more stable conformation similar to the antagonist bound state than PAR 3 44 2 4 (see also below). The Tm of pAR-m23 with antagonist bound is very similar to the Tm of dark-state rhodopsin in dodecylnaltoside (DDM)[12], whose structure has been solved by two independent laboratories[13,14]. This suggested that QAR-m23 is 15 sufficiently stable for crystallisation. Characterization of PAR-m23 The three characteristic activities measured for $AR-m23 and pAR 34 42 4 to identify 20 the effect of the six mutations were the affinity of antagonist binding, the relative efficacies of agonist binding and the ability of 3AR-m23 to couple to G proteins. Saturation binding experiments to membranes using the antagonist.
[
3 HJ-dihydroalprenolol (Figure 8) showed that the affinity of binding to pAR-m23 (Kr 6.5 L 0.2 nM, n=2) was slightly lower than for QAR 3 4 42 4 (KD 2.8 +0.1 nM, 69 n=2), suggesting that there are no large perturbations in the structure of PARmn23 in the antagonist-bound conformation. This is consistent with the observation that none of the mutations in pAR-m23 correspond with amino acids believed to be implicated in ligand binding. In contrast to antagonist binding, the efficacy of 5 agonist binding by pAR-m23 is 3 orders of magnitude weaker than for pAR 3 4
.
4 2 4 (Fig. 5). The potency of the agonist isoprenaline is consistently lower in PAR m23 and QAR 3 4 4 24 than for the native agonist norepinephrine, indicating that the agonist-bound conformation for the two receptors is likely to be similar. However, the large decrease in agonist efficacy in pAR-m23 compared to pAR 3 4 . 10 424- indicates that the 6 mutations in PAR-m23 have locked the receptor preferentially in an antagonist-bound conformation. From a crystallisation perspective, this is an added bonus to thermostabilisation, because it is essential to have a conformationally homogeneous protein population for the production of diffraction-quality crystals. 15 All of the thermostability assays used to derive pAR-m23 were performed on receptors solubilised in DDM. The aim of the thermostabilisation process was to produce a receptor that is ideal for crystallography, which means being stable in a variety of different detergents and not just DDM. We therefore tested the stability 20 of pAR-m23 and PAR in a variety of different detergents, concentrating on small detergents that are preferentially used in crystallising integral membrane proteins. Membranes prepared from E. coli expressing PAR-m23 or QAR 3 4 24 were solubilised in DDM, bound to Ni-NTA agarose then washed with either DDM, decylmaltoside (DM), octylglucoside (00), lauryldimethylamine oxide (LDAO) 25 or nonylglucoside (NG). Stability assays were performed on the receptors in each of the different detergents (Fig 6). 'pAR34.4 was only stable in DDM and DM, with no active receptors eluting from the resin washed with OG, NG or LDAO. In contrast, functional PAR-m23 was still present in all detergents and the Tm could be determined.. As expected, the smaller detergents were considerably more 30 denaturing than either DDM (Tm 52"C) or DM (Tm 48"C), with Tns of 25*C (NG), 23 0 C (LDAO) and 17 0 C (OG). The difference in Tm between QAR-m23 and pAR34-424 is about 20*C, irrespective of whether the receptors were solubilised 70 in either DDM or DM; it is therefore not surprising that no active pAR 3 4 2 4 could be found in even NG, because the predicted Tm would be about 5*C, thus resulting'in rapid inactivation of the receptor under the conditions used for purification. The selection strategy used for the generation of pAR-m23 was 5 chosen deliberately to be based upon thermostability, because it is far simpler to apply than selecting for stability in detergents of increasing harshness. However, it is clear that increasing the thermostability of PAR 3 4-4 24 also resulted in increasing tolerance to small detergents ideal for crystallising integral membrane proteins. 10 Crystallisation of mutant GPCR Earlier attempts to crystallise several different constructs of turkey beta-adrenegic receptor failed. Despite experimenting with a variety of conditions, using both the 15 native sequence and several truncated and loop-deleted constructs, over many years, no crystals were obtained. However, once the stabilising mutations from PAR-m23 were transferred into the constructs, several different crystals were obtained in different detergents and 20 different conditions. The crystals that have been most studied so far were obtained using the purified beta-36 construct (amino acid residues 34-367 of the turkey beta receptor containing the following changes: point mutations Cll6L and C358A; the 6 25 thermostabilising point mutations in m23; replacement of amino acid residues 244-278 with the sequence ASKRK; a C terminal His6 tag) expressed in insect cells using the baculovirus expression system, after transferring the receptor into the detergent octyl-thioglucoside. The precipitant used was PEG600 or PEG1000 and the crystals obtained are elongated plates. 30 Experiments have also been carried out to see whether, once the crystallisation conditions had been defined using the stabilised receptor, it was possible to get 71 crystals using the original non-stablised construct It was possible that similar or perhaps very -small crystals could have been obtained, but, in fact, the "wild type" (i.e. the starting .structure from which the mutagenesis began) never gave any crystals. 5 The crystals are plate-shaped with space group C2 and diffract well, though the cell dimensions do vary depending on the freezing conditions used. In general, once a GPCR has been stabilised it may be subjected to a variety of 10 well-known techniques for structure determination. The most common technique for crystallising membrane proteins is by vapour diffusion (20, 21), usually using initially a few thousand crystallisation conditions set up using commercial robotic devices (22). However, sometimes the crystals formed by vapour diffusion are small and disordered, so additional techniques may then be employed. One 15 technique involves the co-crystallisation (by vapour diffusion) of the membrane protein with antibodies that bind specifically to conformational epitopes on the proteins' surface (23, 24); this increases the hydrophilic surface of the protein and can form strong crystal contacts. A second alternative is to use a different crystallisation matrix that is commonly called either lipidic cubic phase or lipidic 20 mesophase (25, 26), which has also been developed into a robotic platform (27). This has proven very successful for producing high quality crystals of proteins with only small hydrophilic surfaces e.g. bacteriorhodopsin (28). Membrane protein structures can also be determined to high-resolution by electron crystallography (29). 25 72 The evolution of PAR-m.23 from PAR4 24 by a combination of alanine scanning mutagenesis and the selection of thermostable mutants has resulted in a GPCR that is ideal for crystallography. The Tm for pAR-m23 is 21 *C higher than for 5 OAR 34 4 2 4 and, in the presence of antagonist, OAR-m23 has a similar stability to rhodopsin. The increased Tm of pAR-m23 has resulted in an increased stability in a variety of small detergents that inactivate sAR 3 44 24 . In addition, the selection strategy employed resulted in a receptor that is preferentially in the antagonist bound conformation, which will also improve the chances of obtaining crystals, 10 because the population of receptor conformations will be more homogeneous than for wild type pAR 3 44 2 4 . Thus we have achieved a process of confornational stabilisation in a single selection procedure. It is not at all clear why the particular mutations we have introduced lead to the is thermostabilisation of the receptor. Equivalent positions in rhodopsin suggest that the amino acid residues mutated could be pointing into the lipid bilayer, into the centre of the receptor or at the interfaces between these two environments. Given the difficulties in trying to understand the complexities of the thermostabilisation of soluble proteins[15], it seems unlikely that membrane proteins will be any 20 easier to comprehend; we found that there was no particular patten in the amino acid residues in PAR that, when mutated, led to thermostability. However, since nearly 5% of the mutants produced were more stable than the native receptor, alanine scanning mutagenesis represents an efficient strategy to rapidly identify thermostable mutants. 25 The procedure we have used to generate QAR-m23 is equally applicable to any membrane protein that has a convenient assay for detecting activity in the detergent solubilized form. While we have selected for stability as a function of temperature as the most convenient primary parameter, the procedure can easily 30 be extended to test.primarily for stability, for example, in a harsh detergent, an extreme of pH or in the presence of chaotropic salts. Conformational stabilisation of a variety of human receptors, channels and transporters will make them far 73 more amenable to crystallography and will also allow the improvement in resolution of membrane proteins that have already been crystallised. It is to be hoped that coformational stabilisation will allow membrane protein crystallisation to become a far more tractable problem with a greater probability of 5 rapid success than is currently the case. This should allow routine crystallisation of human membrane proteins in the pharmaceutical industry, resulting in valuable structural insights into drug development. METHODS -10 Materials. The truncated 31 adrenergic receptor from turkey (PAR34-4)[9] was kindly provided by Dr Tony Wame (MRC Laboratory of Molecular Biology, Cambridge, UK). This PAR construct encoding residues 34-424 contains the mutation Cll6L to improve expression[1], and a C-terminal tag of 10 histidines 15 for purification. 1-[4,6-propyl- 3 H]-dihydroalprenolol ([ 3 H]-DHA) was supplied by Amersham Bioscience, (+) L-norepinephrine bitartrate salt, (-) isoprenaline hydrochloride, (-) alprenolol tartrate salt and s-propranolol hydrochloride were from Sigma. 20 Mutagenesis of PAR. The PAR cDNA was ligated into pRGIII to allow the functional expression of PAR in E. coli as a MalE fusion protein[16]. Mutants were generated by PCR using the expression plasmid as template using the QuikChange II methodology (Stratagene). PCR.reactions were transformed into XL1O-Gold ultracompetent cells (Stratagene) and individual clones were fully 25 sequenced to check that only the desired mutation was present. Different mutations were combined randomly by PCR by including all the pairs of primers that introduced the following mutations: Mut4, G67A, G068A, V230A, D322A and F327A; Mut6, R068S, Y227A, A234L, A282L and A334L; Mut7, M90V, I129V, Y227A, A282L and F338M; MutlO, R68S, M90V, V230A, F327A and 30 A334L. The PCR mixes were transformed and the clones sequenced to determine exactly which mutations were introduced.
74 Protein expression and membrane preparations. Expression of PAR and the mutants-was performed.in XLIO cells (Stratagene). Cultures of 50 ml of 2xTY medium containing ampicillin (100 ±g/mnl) were grown at 37*C with shaking until
OD
600 =3 and then induced with 0.4 mM IPTG. Induced cultures were incubated s at 25*C for 4h and then cells were harvested by centrifugation at 13,000 xg for I min (aliquots of 2 ml) and stored at -20 0 C. For the assays, cells were broken by freeze-thaw (five cycles), resuspended in 500 pW of buffer [20mM Tris pH 8, 0.4 M NaCl, 1mM EDTA and protease inhibitors (CompleteTM, Roche)]. After an incubation for 1h at 4*C with 100 pig/ml lysozyme and DNase I (Sigma), samples 10 were solubilized with 2% DDM on ice for 30 minutes. Insoluble material was removed by centrifugation (15,000xg, 2 min, 4"C) and the supernatant was used directly in radioligand binding assays. For large-scale membrane preparations, 2L and 6L of E. coli culture of PAR and 15 Mut23, respectively, were grown as described above. Cells were harvested by centrifugation at 5,000 xg for 20 min, frozen in liquid nitrogen and stored at 800C. Pellets were resuspended in 10 ml of 20 mM Tris pH 7.5 containing lx protease inhibitor cocktail (CompleteJm EDTA-free, Roche); 1 mg DNase I (Sigma) was added and the final volume was made to 100 ml. Cells were broken 20 by a French press (2 passages, 20,000 psi), and centrifuged at 12,000 xg for 45 min at 4*C to remove cell debris. The supernatant (membranes) was centifuged at 200,000 xg for 30 min at 4*C; the membrane pellet was resuspended in 15 ml of 20 mM Tris pH 7.5 and stored in 1 ml aliquots at -80*C after flash-freezing in liquid nitrogen. The protein concentration was determined by the amido black 25 method[17]. These samples were used in radioligand binding assays after thawing and being solubilized in 2% DDM as above. For competition assays, as well as testing different detergents, DDM-solubilized PAR was partially purified with Ni-NTA agarose (Qiagen). 200 pl of Ni-NTA 30 agarose was added to 2 ml of solubilized samples (10 mg/ml of membrane protein) in 20mM Tris pH 8, 0.4 M NaCl, 20 mM imidazole pH 8 and incubated for 1 h at 40C. After incubation, samples were centrifuged at 13,000 'x g for 75 30 sec and washed twice with 250 p1 of buffer (20mM Tris pH 8, 0.4 M NaCI, 20 mM imidazole) containing detergent (either 0.1% DDM, 0.1% DM, 0.1% LDAO, 0.3% NG or 0.7% OG). 5 Receptors were elated in 2 x 100 pl of buffer (0.4 M NaCI, ImM EDTA, 250 mM imidazole pH 8, plus the relevant detergent). The KD for [ 3 H]-DHA binding to semipurified PAR 3 4 2 4 and pAR-m23 was, respectively 3.7 nM and 12.5 nM and the final concentration of [ 3 H]-DHA used in the competition assays was 3 times the KD ie 12 nM for pAR 3 4- 24 and 40 nM for pAR-m23. 10 Radioligand binding and thermostability assays. Single point binding assays contained 20mM Tris pH 8, 0.4 M NaCL, 1mM EDTA, 0.1% DDM (or corresponding detergent) with 50 nM [ 3 H]-DHA and 20-100 p±g membrane protein in a final volume of 120 pl; equilibration was for 1 h at 4*C. Thermostability was 15 assessed by incubating the binding assay mix, with or without [ 3 H1-DHA at the specified temperature for 30 minutes; reactions were placed on ice and [ 3 H1-DHA added as necessary and equilibrated for a farther hour. Receptor-bound and free radioligand were separated by gel filtration as described previously[1 8]. Non specific binding was determined in the presence of 1 pM of s-propranolol. 20 Saturation curves were obtained using a range of [ 3 H]-DHA concentration from 0.4 nM to 100 nM. Competition assays were performed using a concentration of [3H]-DHA of 12 nM for QAR 34 4 2 4 and 40 nM for QAR-m23 (ie three times the KD) and various concentrations of unlabeled ligands (0-100 mM). Radioactivity was counted on a Beckman LS6000 liquid scintillation counter and data were analyzed 25 by nonlinear regression using Prism software (GraphPad). Location of pAR-m23 thermostable mutations in rhodopsin structure. The pdb file for the rhodopsin structure, accession code 1GZM[14], was downloaded from the Protein Data Bank website (www.pdb.org) and displayed in the program 30 PyMOLXIIHybrid (DeLano Scientific). The equivalent amino acid residues in rhodopsin for the thermostable mutations in PAR were' located in the rhodopsin structure based upon an alignment among the four GPCRs with which we are most 76 familiar, namely rhodopsin, P1 adrenergic receptor, neurotensin receptor and adenosine A,, receptor 19]. Example 2: Mutants of the adenosine A, receptor (A R) with increased 5 thermostability 1. 315 site-directed mutants made between residues 2-316 of A 2 R. 2. All of these mutants have been assayed for thermostability using an assay 10 measuring agonist and antagonist binding ater the heating step (Ligand(-) format as described in Figure 12). a. 26 mutants showed improved thermostability when measured with 3 H-NECA (agonist): G114 A, G118A, L167A, A]84L, R199A, A203L, L208A, Q210A, S213A, E219A, R220A, S223A, T224A, Q226A, 15 K227A, H230A, L241A, P260A, S263A, L267A, L272A, T279A, N284A, Q311A, P313A, K315A. b. 18 mutants showed improved thermostability when assayed with 3 H-ZM241385 (antagonist): A54L, V57A, H75A, T88A, G114A, G118A, Tl19A, K122A, G123A, P149A, E151A, G152A, A203L, A204L, A231L, 20 L235A, V239A. 3. Mutations have been combined to generate mutants in a putative antagonist conformation. Wildtype A 2 .R has a Tm of 31*C with ZM241385 bound. a. Rant17 A54L+K122A+L235A Tm 48*C (ZM241385 bound) 25 b. Rant19 A54L,T8SA,V239A+A204L Tm 47"C (ZM241385 bound) c. Rant2l A54LT88AV239A+K122A Tm 49'C (ZM241385 bound) 4. Mutations from the agonist screen have been combined, but have led to only a very low level of improvement in Tm of+2"C. 30 Table (i). List of A2aR stabilising mutations. Mutants were expressed in B. coli, solubilised in 2% DDM + 10% glycerol and tested for ligand-binding, using the 77 agonist [ 3 H]-NECA (on the right) and the antagonist [ 3 HJ-ZM241385 (left). Concentrations of radioligands were 6-10-fold above their KD measured for the wild-type receptor. Expression of active receptor was evaluated by ligand binding at 4"C. Stability was assayed by heating the solubilised receptor in its apo-state at 5 30"C for 30 minutes and then measuring residual binding activity. Under these conditions, wild-type activity decays to 50% (S.D.=15%). Data obtained for expression and stability were normalised to wild-type values. Mutations included in subsequent rounds of mutagenesis were those whose expression was > 30-40% and stability?: 130-140% compared to the wild-type. Bold lines indicate cluster of t0 mutations. Agonist Antagonist Mutation Expression Stability Mutation Expression Stability (%) (%) (%) (%) wt 100 100 wt 100 100 SO90A 151 151 A054L 90 140 GI14A 62 143 V057A 44 144 G118A 71 151 H075A 82 152 L167A 41 174 TOSSA 67 230 A184L 140 150 GI14A 73 153 R199A 73 202 G118A 84 148 A203L 42 172 T119A 90 148 L208A 276 215 K122A 52 153 Q210A 46 155 G123A 90 158 S213A 40 140 P149A 54 215 E219A 96 221 E151A 63 173 R220A 84 250 G152A 70 156 S223A 57 146 A203L 111 132 T224A 142 276 A204L 40 181 Q226A 119 217 A231L 90 148 K227A 87 2.22 L235A 85 140 H230A 57 154 V239A 91 134 L241A 139 156 P260A 70 169 S263A 60 158 L267A 40 187 L272A 34 157 T279A 125 158 78 N284A 64 151 Q311A 49 164 P313A 44 148 K315A 64 186 Table (ii). Stability of best combinations. Receptors were solubilised in 1% DDM (no glycerol). A melting profile was obtained by heating the solubilised receptor at different temperatures in absence (apo-state) or presence of ligand 5 (ligand-occupied state). Data shown are representative of at least three independent experiments. S.D. is < 1"C. Tm ("C) Tm ("C) agonist agonist antagonist antagonist Wt 21 29 wt 31 32 Rag 1 26 34 Rant 5 (A184LR199A/L272A) 2(A54L/T88A/V239A) 42 46 Rag 23 Rant 21 (Rag 1+F79A /L208A) 22 38 (Rant 5+K122A) 10 Table (ii). Summary of results for competition assays of detergent-solubilised wild-type A2aR and thermo-stable mutant Rant 21. Values are representative of two independent experiments. Each data point was assayed in triplicate and plotted as mean d SD. Each solubilised receptor was incubated with ligands for one hour on ice in binding buffer (50 mM Tris pH 7.5 and 0.025% DDM) 15 containing 400 mM NaCl. Binding of [3H]ZM241385 (10 nM) in the absence of unlabeled ligand was set to 100%. Data shown are from two independent experiments with each data point measured in triplicate. Incubation of samples with ligands was for 1 hour on ice with CHJZM241385 at a concentration of 10 niM. Ki values were calculated according to the Cheng and Prusoff equation using 20 the non-linear regression equation of the software Prism, applying a K1 for [H]ZM241385 of 12 nM for the wild-type and 15 nM for Rant 21. Rant 21 did not bind NECA sufficiently for an accurate K; determination (hence indicated as >1 x 10-). The affinity of Rant2l for agonist binding is weakened 232 fold for R PIA and at least by 1900 fold for NECA. 25 79 Competitor wa2 wt Rant 21 XAC 2.3 x 10- 2.3 x 10 Theophylline 1.5 x 10- 0.9 x 10 3 NECA 7.0 x 10- >1 x ' R-PIA 1.6 x 105 3.6 x 10' 5 Table (iv). Summary of results for saturation assays of detergent-solubilised wild-type A2aR and thermo-stable mutants. Values are representative of three independent experiments. Each data point was assayed in triplicate and plotted as mean ± SD. Data were fitted to the Michaelis-Menten equation using the non linear regression equation of the software Prism. 10 Kn (nM) Receptor [ 3 H]NECA [-HJZM241385 (agonist) (antagonist) wt 32+1 12 3 Rag 1 26+0.4 26+0.5 Rag23 21 1 62±1 Rant21 >450 15+3 15 80 Table (v). Summary of stability of wild-type and mutant receptors in different detergents. Solubilisation of receptors and detergent exchange was performed during-the IMAC step. S.D. is < 1C. It was not possible to determine the Tm for some receptor-detergent combinations, because the receptor was too 5 unstable (t). 10 Tm (*C) Agonist-binding Antagonist binding wt Rag 23 wt Rant 21 0.01% DDM 27 34 25 39 0.1% DM 23 29 10 28 0.3% NM 22 28 <4 25 0.3% NG t t t 22 0.6% OG <9 16 t 23 0.003% LDAO 28 38 32 42 0.006% FC12 37 39 43 49 15 Example 3: Mutants of the neurotensin receptor (NTR) with increased thermostability 1. 340 site-directed mutants have been made between residues 61-400 of 20 NTR.
81 2. Initially, all of these mutants were assayed for thermostability using an assay measuring 3 H-neurotensin (agonist) binding after the heating step. 24 mutations led to a small but significant increase in thermostability: A356L, H103A, D345A, A86L, A385L, Y349A, C386A, K397A, H393A, 5 1116A, F358A, S108A, MI81A, R392A, D113A, G209A, L205A, L72A, A120L, P399A, Y351A, V268A, T207A, A155L, S362A, F189A, N262A, L109A, W391A, T179A, S182A, M293A, L256A, F147A, D139A, S100A, K176A, L111A, A90L, N270A. jo 3. Mutants tested for thermostability by heating in the absence of the agonist were re-tested using a slightly different assay where the mutants were heated in the presence of 3 H-neurotensin (Ligand(+) format in Figure 12). Mutants with improved thermostability are: A69L, A73L, A86L, A90L, H103A, V165A, E166A, G215A, V229A, M250A, I253A, A177L, 15 R183A, I260A, T279A, T294A, G306A, L308A, V309A, L310A, V313A, F342A, F358A, V360A, S362A, N370A, S373A, F380A, A385L, P389A, G390A, R395A. 4. There are also mutants that have a significantly enhanced expression level 20 compared to the wildtype receptor and could be used to boost preceptor production levels for crystallisation: A86L, H103A, F358A, S362A, N370A A385L, G390A. All of these also have increased thermostability. 5. Preferred combinations are 25 a.Nag7m F358A+A86L+1260A+F342A Tm 51"C (neurotensin bound) b.Nag7n F358A+H103A+I260A+F342A Tm 51*C (neurotensin bound) 30 Wildtype NTR h'as a Tm of 35*C with neurotensin bound.
82 Example 4: Identification of structural motifs in which stabilising GPCR mutations reside. The structure of the p2 adrenergic receptor has been determined (20, 21), which is 5 59% identical to the turkey p1 receptor, but with a distinctly different pharmacological profile (22, 23). In order to determine the structural motifs in which the stabilising mutations of the turkey p1 receptor reside, we mapped the mutations onto the human D2 structure (21). io The beta adrenergic receptors were first aligned using ClustalW in the MacVector package; thermostabilising mutations in turkey p1 were highlighted along with the corresponding residue in the human p2 sequence. The human p2 model (pdb accession code 2RHl) was visualised in Pymol and the desired amino acids were shown as space filling models by standard procedures known in the art. The 15 structural motifs in which the stabilising mutations were located, were determined by visual inspection Table (vi) lists the equivalent positions in the p2 receptor corresponding to the thermostabilising mutations in pAR-m23 and the structural motifs in which they 20 reside. As seen from Table (vi), the mutations are positioned in a number of distinct localities. Three mutations are in loop regions that are predicted to be accessible to aqueous solvent (loop). Eight mutations are in the transmembrane cc-helices 25 and point into the lipid bilayer (lipid); three of these mutations are near the end of the helices and may be considered to be at the hydrophilic boundary layer (lipid boundary). Eight mutations are found at the interfaces between transmembrane a helices (helix-helix interface), three of which are either within a kinked or distorted region of the helix (kink) and another two mutations occur in one helix 30 but are adjacent to one or more other helices which contain a kink adjacent ,in space to the mutated residue (opposite kink). These latter mutations could affect 83 the packing of the amino acids within the kinked region, which could result in thermostabilisation. Another mutation is in a substrate binding pocket.(pocket). Turkey f1 Human P2 Description Helix 1 155A 147 3-helix kink interface Fig 18 Helix 1 G67A A59 lipid boundary Helix 1 R68S K60 lipid boundary Fig 25 Helix 2 V89L V81 kink Fig 19 Helix 2 M90V M82 kink Fig 20 Helix 2 G98A G90 pocket Helix 3 1129V 1121 opposite kink Fig 21 S151E S143 loop Helix 4 Vl60A V152 lipid Q194A A186 loop Helix 5 L221V V213 lipid Helix 5 Y227A Y219 helix-helix interface Fig 23 Helix 5 R229Q R221 lipid Helix 5 V230A V222 helix-helix interface Helix 5 A234L A226 helix-helix interface Helix 6 A282L C265 loop Fig 24 D322A K305 lipid boundary Helix 7 F327A L310 lipid Helix 7 A334L V317 lipid Helix 7 F338M P321 kink Fig 22 5 Table (vi) Position in the human P2 structure of the amino acid residues equivalent to the thermostabilising mutations found in the turkey Pl receptor and the structural motifs in which they reside. Such structural motifs, by virtue of them containing stabilising mutations, are 10 important in determining protein stability. Therefore, targeting mutations to these 84 motifs will -facilitite the generation of stabilised mutant GPCRs. Indeed, there were several instances where more than one mutation mapped to the same structural motif. For example, the Y227A, V230A and A234L mutations in the turkey p1 adrenergic receptor all mapped to the same helical interface, the V89L 5 and M90V mutations mapped to the same helical kink and the F327A and A334L mutations mapped to the same helical surface pointing towards the lipid bilayer (Table (vi)). Thus, when one stabilising mutation has been identified, the determination of the structural motif in which that mutation is located will enable the identification of further stabilising mutations. 10 ( ) References 1. S. H. White (2004) Protein Sci 13, 1948-1949. 2. C. G. Tate (2001) FEBSLett 504, 94-98. 15 3. R. Grisshammer, C. G. Tate (1995) Q Rev Biophys 28, 315-422. 4. J. U. Bowie (2001) Curr Opin Struci Biol 11, 397-402. 5. F. W. Lau, S. Nauli, Y. Zhou, J. U. Bowie (1999) JMol Biol 290, 559-564. 6. Y. Zhou, J. U. Bowie (2000) JBiol Chem 275, 6975-6979. 7. S. Faham, D. Yang, E. Bare, S. Yohannan, J. P. Whitelegge, J. U. Bowie 20 (2004) JMo/Biol 335, 297-305. 8. Y. Yarden, H. Rodriguez, S. K. Wong, D. R. Brandt, D. C. May, J. Bumier, R. N. Harkins, E. Y. Chen, J. Ramachandran, A. U1hich, et at (1986) Proc. Nat. Acad. Sci. USA 83, 6795-6799. 9. T. Warne, J. Chirnside, G. F. Schertler (2003) Biochim Biophys Acta 1610, 25 133-140. 10. E. M. Parker, E. M. Ross (1991) JBiol Chem 266, 9987-9996. 11. E. M. Parker, K. Kameyama, T. Higashijima, E. M. Ross (1991) J Biol Chem 266, 519-527. 12. W. J. Degrip (1982) Methods in Enzymology 81, 256-265. 30 13. IC. Palczewski, T. Kumasaka, T. Hori, C. A. Behnke, H. Motoshima, B. A. Fox, . Le Trong, D. C. Teller, T. Okada, R. E. Stenkamp, et al (2000) Science 289, 739-745.
85 14. J. Li, P. C. Edwards, M. Burghammer, C. Villa, G. F. Schertler (2004) J Mo/Bio/343, 1409-1438. 15. R. Jaenicke, G. Bohm (1998) Current Opinion in Structural Biology 8, 738-748. 5 16. J. Tucker, R. Grisshammer (1996) Biochem J317 ( Pt 3), 891-899. 17. W. Schaffner, C. Weissmann (1973) Anal. Biochem. 56, 502-514. 18. C. G. Tate (1998) Methods Enzymol 296, 443-455. 19. H. M. Weiss, R. Grisshammer (2002) Eur JBiochem 269, 82-92. 20. Rasmussen, S. G., Choi, H. J., Rosenbaum, D. M., Kobilka, T. S., Thian, 10 F. S., Edwards, P. C., Burghammer, M., Ratnala, V. R., Sanishvili, R., Fischetti, R. F., Schertler, 0. F., Weis, W. I. and Kobilka, B. K. (2007) Nature 15, 383-387. 21. Cherezov, V., Rosenbaum, D. M., Hanson, M. A., Rasmussen, S. G., Thian, F. S., Kobilka, T. S., Choi, H. J., Kuhn, P., Weis, W. I., Kobilka, B. 15 K. and Stevens, R. C. (2007) Science 318:1258-1265. 22. Minneman, K. P., Weiland, G. A. and Molinoff, P. B. (1980) Mol Pharmacol 17:1-7. 23. Parker, E. M., Swigart, P., Nunnally, M. H., Perkins, J. P. and Ross, E. M. (1995) JBiol Chem 270:6482-6487. 20
Claims (16)
1. An isolated mutant G-protein coupled receptor (GPCR) with increased conformational stability compared to its parent GPCR, which mutant GPCR has an extended lifetime of a particular conformation relative to the same 5 conformation of its parent GPCR under denaturing conditions, the extended lifetime being manifest by retention of ligand binding ability; wherein the mutant GPCR is not a human cannabinoid receptor.
2. A mutant GPCR according to Claim 1, wherein the mutant GPCR has an extended lifetime under conditions selected from any one or more of heat, a 10 detergent, a chaotropic agent and an extreme of pH.
3. A mutant GPCR according to Claim 1 or 2, wherein the mutant GPCR has increased conformational thermostability.
4. A mutant GPCR according to Claim 3, wherein the mutant GPCR is at least 1C more stable than its parent. 15
5. A mutant GPCR according to any one of Claims 1 to 4 with increased conformational stability of an agonist or antagonist conformation.
6. A mutant GPCR according to any of Claims 1 to 5, wherein the mutant GPCR is a Class 1 GPCR, or wherein the mutant GPCR is a Class 2 GPCR, or wherein the mutant GPCR is a Class 3 GPCR. 20
7. An isolated polynucleotide encoding a mutant GPCR according to any of Claims I to 6.
8. A host cell comprising a polynucleotide according to Claim 7.
9. A mutant GPCR according to any of Claims 1 to 6, which is in a solubilised form or which is substantially free of other proteins or which is immobilised 25 to a solid support.
10. Use of a mutant GPCR according to any of Claims 1 to 6 for crystallisation; or for drug discovery such as in a ligand binding screen or in assay development; or as a biosensor. 87
11. A crystallised form of a mutant GPCR according to any of Claims 1 to 6, and 9.
12. An isolated mutant G-protein coupled receptor (GPCR) according to Claim 1, substantially as hereinbefore described. 5
13. An isolated polynucleotide according to Claim 7, substantially as hereinbefore described.
14. A host cell according to Claim 8, substantially as hereinbefore described.
15. Use according to Claim 10, substantially as hereinbefore described.
16. A crystallised form of a mutant GPCR according to Claim 11, substantially 10 as hereinbefore described.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2012200841A AU2012200841B2 (en) | 2007-03-22 | 2012-02-14 | Mutant proteins and methods for selecting them |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB0705450.5 | 2007-03-22 | ||
GB0724052.6 | 2007-12-08 | ||
AU2008228085A AU2008228085B2 (en) | 2007-03-22 | 2008-03-20 | Mutant G-protein coupled receptors and methods for selecting them |
AU2012200841A AU2012200841B2 (en) | 2007-03-22 | 2012-02-14 | Mutant proteins and methods for selecting them |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU2008228085A Division AU2008228085B2 (en) | 2007-03-22 | 2008-03-20 | Mutant G-protein coupled receptors and methods for selecting them |
Publications (2)
Publication Number | Publication Date |
---|---|
AU2012200841A1 AU2012200841A1 (en) | 2012-03-08 |
AU2012200841B2 true AU2012200841B2 (en) | 2013-11-28 |
Family
ID=49640573
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU2012200841A Active AU2012200841B2 (en) | 2007-03-22 | 2012-02-14 | Mutant proteins and methods for selecting them |
Country Status (1)
Country | Link |
---|---|
AU (1) | AU2012200841B2 (en) |
-
2012
- 2012-02-14 AU AU2012200841A patent/AU2012200841B2/en active Active
Non-Patent Citations (3)
Title |
---|
D'Antona A et al, Biochemistry, 2006, 45(17):5606-5617 * |
D'Antona A et al, Brain Research, 2006, 1108(1):1-11 * |
Zhang Y et al, PLoS Computational Biology, 2006, 2(2):e13 * |
Also Published As
Publication number | Publication date |
---|---|
AU2012200841A1 (en) | 2012-03-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11673938B2 (en) | Mutant G-protein coupled receptors and methods for selecting them | |
EP2220229B1 (en) | Mutant g-protein coupled receptors and methods for selecting them | |
AU2012200841B2 (en) | Mutant proteins and methods for selecting them | |
GB2456904A (en) | Mutant neurotensin receptors with improved stability |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FGA | Letters patent sealed or granted (standard patent) |