WO1993024628A9 - SEQUENCE OF HUMAN DOPAMINE TRANSPORTER cDNA - Google Patents
SEQUENCE OF HUMAN DOPAMINE TRANSPORTER cDNAInfo
- Publication number
- WO1993024628A9 WO1993024628A9 PCT/US1993/005179 US9305179W WO9324628A9 WO 1993024628 A9 WO1993024628 A9 WO 1993024628A9 US 9305179 W US9305179 W US 9305179W WO 9324628 A9 WO9324628 A9 WO 9324628A9
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- cdna
- cells
- dopamine
- leu
- protein
- Prior art date
Links
- 229920002676 Complementary DNA Polymers 0.000 title claims abstract description 72
- 239000002299 complementary DNA Substances 0.000 title claims abstract description 65
- 241000282414 Homo sapiens Species 0.000 title claims abstract description 60
- 102000006441 Dopamine Plasma Membrane Transport Proteins Human genes 0.000 title claims abstract description 47
- 108010044266 Dopamine Plasma Membrane Transport Proteins Proteins 0.000 title claims abstract description 47
- 230000014509 gene expression Effects 0.000 claims abstract description 32
- 201000010099 disease Diseases 0.000 claims abstract description 27
- VYFYYTLLBUKUHU-UHFFFAOYSA-N dopamine Chemical compound NCCC1=CC=C(O)C(O)=C1 VYFYYTLLBUKUHU-UHFFFAOYSA-N 0.000 claims description 78
- 210000004027 cells Anatomy 0.000 claims description 72
- 102000004169 proteins and genes Human genes 0.000 claims description 61
- 108090000623 proteins and genes Proteins 0.000 claims description 61
- 239000003814 drug Substances 0.000 claims description 40
- 229960003638 dopamine Drugs 0.000 claims description 39
- ZPUCINDJVBIVPJ-LJISPDSOSA-N cocaine Chemical compound O([C@H]1C[C@@H]2CC[C@@H](N2C)[C@H]1C(=O)OC)C(=O)C1=CC=CC=C1 ZPUCINDJVBIVPJ-LJISPDSOSA-N 0.000 claims description 29
- 229960003920 cocaine Drugs 0.000 claims description 23
- ZPUCINDJVBIVPJ-BARDWOONSA-N cocaine Natural products O([C@@H]1C[C@H]2CC[C@H](N2C)[C@@H]1C(=O)OC)C(=O)C1=CC=CC=C1 ZPUCINDJVBIVPJ-BARDWOONSA-N 0.000 claims description 23
- 229920003013 deoxyribonucleic acid Polymers 0.000 claims description 23
- 201000009032 substance abuse Diseases 0.000 claims description 20
- 230000027455 binding Effects 0.000 claims description 16
- 230000002068 genetic Effects 0.000 claims description 15
- 108020004999 Messenger RNA Proteins 0.000 claims description 14
- 229920002106 messenger RNA Polymers 0.000 claims description 14
- 238000000034 method Methods 0.000 claims description 14
- 239000012528 membrane Substances 0.000 claims description 13
- 239000000523 sample Substances 0.000 claims description 13
- 229920001850 Nucleic acid sequence Polymers 0.000 claims description 11
- 230000000694 effects Effects 0.000 claims description 11
- 231100000736 substance abuse Toxicity 0.000 claims description 8
- 150000001875 compounds Chemical class 0.000 claims description 7
- 239000003550 marker Substances 0.000 claims description 7
- FMGYKKMPNATWHP-UHFFFAOYSA-N Cyperquat Chemical compound C1=C[N+](C)=CC=C1C1=CC=CC=C1 FMGYKKMPNATWHP-UHFFFAOYSA-N 0.000 claims description 6
- 230000002159 abnormal effect Effects 0.000 claims description 6
- 210000001175 Cerebrospinal Fluid Anatomy 0.000 claims description 5
- 210000004002 Dopaminergic cell Anatomy 0.000 claims description 5
- 206010061536 Parkinson's disease Diseases 0.000 claims description 5
- 238000002105 Southern blotting Methods 0.000 claims description 4
- 208000000323 Tourette Syndrome Diseases 0.000 claims description 4
- 206010044126 Tourette's disease Diseases 0.000 claims description 4
- 230000002401 inhibitory effect Effects 0.000 claims description 4
- 230000003042 antagnostic Effects 0.000 claims description 3
- 239000005557 antagonist Substances 0.000 claims description 3
- 230000035897 transcription Effects 0.000 claims description 3
- 238000001415 gene therapy Methods 0.000 claims description 2
- 238000003752 polymerase chain reaction Methods 0.000 claims description 2
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 2
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 2
- 238000002360 preparation method Methods 0.000 claims 6
- 210000003527 eukaryotic cell Anatomy 0.000 claims 4
- 230000002265 prevention Effects 0.000 claims 4
- 230000001988 toxicity Effects 0.000 claims 3
- 231100000419 toxicity Toxicity 0.000 claims 3
- 238000000338 in vitro Methods 0.000 claims 2
- 230000010807 negative regulation of binding Effects 0.000 claims 2
- 229920000023 polynucleotide Polymers 0.000 claims 2
- 239000002157 polynucleotide Substances 0.000 claims 2
- 108020004511 Recombinant DNA Proteins 0.000 claims 1
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 238000010276 construction Methods 0.000 claims 1
- 238000005259 measurement Methods 0.000 claims 1
- 210000001082 somatic cell Anatomy 0.000 claims 1
- 238000007894 restriction fragment length polymorphism technique Methods 0.000 abstract description 38
- 230000003542 behavioural Effects 0.000 abstract description 6
- 238000010192 crystallographic characterization Methods 0.000 abstract description 2
- 238000009826 distribution Methods 0.000 abstract description 2
- 238000010367 cloning Methods 0.000 abstract 1
- 235000018102 proteins Nutrition 0.000 description 51
- 239000000126 substance Substances 0.000 description 41
- 229940079593 drugs Drugs 0.000 description 36
- 241000700159 Rattus Species 0.000 description 29
- 150000001413 amino acids Chemical group 0.000 description 21
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 21
- 229940014598 TAC Drugs 0.000 description 20
- 230000003899 glycosylation Effects 0.000 description 16
- 238000006206 glycosylation reaction Methods 0.000 description 16
- 238000009396 hybridization Methods 0.000 description 16
- 108010037850 glycylvaline Proteins 0.000 description 15
- 241000880493 Leptailurus serval Species 0.000 description 14
- STKYPAFSDFAEPH-LURJTMIESA-N gly-val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CN STKYPAFSDFAEPH-LURJTMIESA-N 0.000 description 12
- 108010051242 phenylalanylserine Proteins 0.000 description 12
- 239000000047 product Substances 0.000 description 12
- ROHDXJUFQVRDAV-UWVGGRQHSA-N Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ROHDXJUFQVRDAV-UWVGGRQHSA-N 0.000 description 11
- 235000001014 amino acid Nutrition 0.000 description 11
- 238000004458 analytical method Methods 0.000 description 10
- 108010013835 arginine glutamate Proteins 0.000 description 10
- 108020005345 3' Untranslated Regions Proteins 0.000 description 9
- 229920000272 Oligonucleotide Polymers 0.000 description 9
- 229920000401 Three prime untranslated region Polymers 0.000 description 9
- 210000004556 Brain Anatomy 0.000 description 8
- HFKJBCPRWWGPEY-BQBZGAKWSA-N L-arginyl-L-glutamic acid Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HFKJBCPRWWGPEY-BQBZGAKWSA-N 0.000 description 8
- 108010066427 N-valyltryptophan Proteins 0.000 description 8
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 8
- 230000006399 behavior Effects 0.000 description 8
- 230000003291 dopaminomimetic Effects 0.000 description 8
- 230000035622 drinking Effects 0.000 description 8
- 235000021271 drinking Nutrition 0.000 description 8
- 230000004634 feeding behavior Effects 0.000 description 8
- 108010049041 glutamylalanine Proteins 0.000 description 8
- 108010081551 glycylphenylalanine Proteins 0.000 description 8
- 108010057821 leucylproline Proteins 0.000 description 8
- 238000000746 purification Methods 0.000 description 8
- 230000032258 transport Effects 0.000 description 8
- BUQICHWNXBIBOG-LMVFSUKVSA-N Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)N BUQICHWNXBIBOG-LMVFSUKVSA-N 0.000 description 7
- 208000007848 Alcoholism Diseases 0.000 description 7
- XZKQVQKUZMAADP-IMJSIDKUSA-N Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(O)=O XZKQVQKUZMAADP-IMJSIDKUSA-N 0.000 description 7
- ILVGMCVCQBJPSH-WDSKDSINSA-N Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO ILVGMCVCQBJPSH-WDSKDSINSA-N 0.000 description 7
- 201000007930 alcohol dependence Diseases 0.000 description 7
- 108090001123 antibodies Proteins 0.000 description 7
- 102000004965 antibodies Human genes 0.000 description 7
- 238000003745 diagnosis Methods 0.000 description 7
- 230000029087 digestion Effects 0.000 description 7
- 238000011160 research Methods 0.000 description 7
- 102220174578 rs731236 Human genes 0.000 description 7
- IOUPEELXVYPCPG-UHFFFAOYSA-N val-gly Chemical compound CC(C)C(N)C(=O)NCC(O)=O IOUPEELXVYPCPG-UHFFFAOYSA-N 0.000 description 7
- JZDHUJAFXGNDSB-WHFBIAKZSA-N Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O JZDHUJAFXGNDSB-WHFBIAKZSA-N 0.000 description 6
- YBAFDPFAUTYYRW-YUMQZZPRSA-N Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O YBAFDPFAUTYYRW-YUMQZZPRSA-N 0.000 description 6
- 108010079364 N-glycylalanine Proteins 0.000 description 6
- NFDYGNFETJVMSE-BQBZGAKWSA-N Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CO NFDYGNFETJVMSE-BQBZGAKWSA-N 0.000 description 6
- 108010045030 monoclonal antibodies Proteins 0.000 description 6
- 102000005614 monoclonal antibodies Human genes 0.000 description 6
- 108020004707 nucleic acids Proteins 0.000 description 6
- 150000007523 nucleic acids Chemical class 0.000 description 6
- 235000000346 sugar Nutrition 0.000 description 6
- LZDNBBYBDGBADK-KBPBESRZSA-N (2S)-2-[[(2S)-2-amino-3-methylbutanoyl]amino]-3-(1H-indol-3-yl)propanoic acid Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-KBPBESRZSA-N 0.000 description 5
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 5
- 210000000133 Brain Stem Anatomy 0.000 description 5
- 210000000349 Chromosomes Anatomy 0.000 description 5
- YXQDRIRSAHTJKM-IMJSIDKUSA-N Cys-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(O)=O YXQDRIRSAHTJKM-IMJSIDKUSA-N 0.000 description 5
- 206010013654 Drug abuse Diseases 0.000 description 5
- 241000588724 Escherichia coli Species 0.000 description 5
- SCCPDJAQCXWPTF-VKHMYHEASA-N Gly-Asp Chemical compound NCC(=O)N[C@H](C(O)=O)CC(O)=O SCCPDJAQCXWPTF-VKHMYHEASA-N 0.000 description 5
- JBCLFWXMTIKCCB-VIFPVBQESA-N Gly-Phe Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-VIFPVBQESA-N 0.000 description 5
- HIZYETOZLYFUFF-BQBZGAKWSA-N Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(O)=O HIZYETOZLYFUFF-BQBZGAKWSA-N 0.000 description 5
- LESXFEZIFXFIQR-LURJTMIESA-N Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(O)=O LESXFEZIFXFIQR-LURJTMIESA-N 0.000 description 5
- XWOBNBRUDDUEEY-UWVGGRQHSA-N Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XWOBNBRUDDUEEY-UWVGGRQHSA-N 0.000 description 5
- 229960002715 Nicotine Drugs 0.000 description 5
- SNICXCGAKADSCV-JTQLQIEISA-N Nicotine Chemical compound CN1CCC[C@H]1C1=CC=CN=C1 SNICXCGAKADSCV-JTQLQIEISA-N 0.000 description 5
- NYQBYASWHVRESG-MIMYLULJSA-N Phe-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 NYQBYASWHVRESG-MIMYLULJSA-N 0.000 description 5
- FSXRLASFHBWESK-HOTGVXAUSA-N Phe-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 FSXRLASFHBWESK-HOTGVXAUSA-N 0.000 description 5
- 108010047495 alanylglycine Proteins 0.000 description 5
- 230000001476 alcoholic Effects 0.000 description 5
- CKLJMWTZIZZHCS-REOHCLBHSA-N aspartic acid group Chemical group N[C@@H](CC(=O)O)C(=O)O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 5
- 235000019504 cigarettes Nutrition 0.000 description 5
- VPZXBVLAVMBEQI-VKHMYHEASA-N gly ala Chemical compound OC(=O)[C@H](C)NC(=O)CN VPZXBVLAVMBEQI-VKHMYHEASA-N 0.000 description 5
- 108010050848 glycylleucine Proteins 0.000 description 5
- 210000004962 mammalian cells Anatomy 0.000 description 5
- 238000004519 manufacturing process Methods 0.000 description 5
- 229930015196 nicotine Natural products 0.000 description 5
- 229920001184 polypeptide Polymers 0.000 description 5
- 108010079317 prolyl-tyrosine Proteins 0.000 description 5
- 102000005962 receptors Human genes 0.000 description 5
- 108020003175 receptors Proteins 0.000 description 5
- 108010026333 seryl-proline Proteins 0.000 description 5
- CCUAQNUWXLYFRA-IMJSIDKUSA-N Ala-Asn Chemical compound C[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC(N)=O CCUAQNUWXLYFRA-IMJSIDKUSA-N 0.000 description 4
- RDIKFPRVLJLMER-BQBZGAKWSA-N Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)N RDIKFPRVLJLMER-BQBZGAKWSA-N 0.000 description 4
- BNODVYXZAAXSHW-UHFFFAOYSA-N Arginyl-Histidine Chemical compound NC(=N)NCCCC(N)C(=O)NC(C(O)=O)CC1=CN=CN1 BNODVYXZAAXSHW-UHFFFAOYSA-N 0.000 description 4
- SONUFGRSSMFHFN-IMJSIDKUSA-N Asn-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O SONUFGRSSMFHFN-IMJSIDKUSA-N 0.000 description 4
- PSZNHSNIGMJYOZ-WDSKDSINSA-N Asp-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PSZNHSNIGMJYOZ-WDSKDSINSA-N 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 4
- 240000000218 Cannabis sativa Species 0.000 description 4
- 101700040453 DRD2 Proteins 0.000 description 4
- DKEXFJVMVGETOO-LURJTMIESA-N Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CN DKEXFJVMVGETOO-LURJTMIESA-N 0.000 description 4
- OLIFSFOFKGKIRH-WUJLRWPWSA-N Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CN OLIFSFOFKGKIRH-WUJLRWPWSA-N 0.000 description 4
- KFKWRHQBZQICHA-STQMWFEESA-N Leu-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 4
- XGDCYUQSFDQISZ-BQBZGAKWSA-N Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O XGDCYUQSFDQISZ-BQBZGAKWSA-N 0.000 description 4
- LRKCBIUDWAXNEG-CSMHCCOUSA-N Leu-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRKCBIUDWAXNEG-CSMHCCOUSA-N 0.000 description 4
- WEDDFMCSUNNZJR-WDSKDSINSA-N Met-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(O)=O WEDDFMCSUNNZJR-WDSKDSINSA-N 0.000 description 4
- WEQJQNWXCSUVMA-RYUDHWBXSA-N Phe-Pro Chemical compound C([C@H]([NH3+])C(=O)N1[C@@H](CCC1)C([O-])=O)C1=CC=CC=C1 WEQJQNWXCSUVMA-RYUDHWBXSA-N 0.000 description 4
- JQOHKCDMINQZRV-WDSKDSINSA-N Pro-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 JQOHKCDMINQZRV-WDSKDSINSA-N 0.000 description 4
- BQBCIBCLXBKYHW-CSMHCCOUSA-N Thr-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])[C@@H](C)O BQBCIBCLXBKYHW-CSMHCCOUSA-N 0.000 description 4
- GXDLGHLJTHMDII-WISUUJSJSA-N Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(O)=O GXDLGHLJTHMDII-WISUUJSJSA-N 0.000 description 4
- AOLHUMAVONBBEZ-STQMWFEESA-N Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AOLHUMAVONBBEZ-STQMWFEESA-N 0.000 description 4
- OBTCMSPFOITUIJ-FSPLSTOPSA-N Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O OBTCMSPFOITUIJ-FSPLSTOPSA-N 0.000 description 4
- JKHXYJKMNSSFFL-IUCAKERBSA-N Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN JKHXYJKMNSSFFL-IUCAKERBSA-N 0.000 description 4
- GVRKWABULJAONN-UHFFFAOYSA-N Valyl-Threonine Chemical compound CC(C)C(N)C(=O)NC(C(C)O)C(O)=O GVRKWABULJAONN-UHFFFAOYSA-N 0.000 description 4
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 4
- 230000001580 bacterial Effects 0.000 description 4
- 238000004166 bioassay Methods 0.000 description 4
- 108010069495 cysteinyltyrosine Proteins 0.000 description 4
- KCXVZYZYPLLWCC-UHFFFAOYSA-N edta Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 4
- ZHNUHDYFZUAESO-UHFFFAOYSA-N formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 4
- 235000012765 hemp Nutrition 0.000 description 4
- 238000002955 isolation Methods 0.000 description 4
- 108010009298 lysylglutamic acid Proteins 0.000 description 4
- 235000012766 marijuana Nutrition 0.000 description 4
- 210000002569 neurons Anatomy 0.000 description 4
- 238000004450 types of analysis Methods 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- 230000003612 virological Effects 0.000 description 4
- KYPMKDGKAYQCHO-RYUDHWBXSA-N (2S)-2-[[(2S)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]-4-methylsulfanylbutanoic acid Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KYPMKDGKAYQCHO-RYUDHWBXSA-N 0.000 description 3
- ILDSIMPXNFWKLH-KATARQTJSA-N (2S)-2-[[(2S,3R)-2-[[(2S)-2-amino-4-methylpentanoyl]amino]-3-hydroxybutanoyl]amino]-3-hydroxypropanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 3
- ZJZNLRVCZWUONM-JXUBOQSCSA-N (2S)-2-[[(2S,3R)-2-[[(2S)-2-amino-4-methylpentanoyl]amino]-3-hydroxybutanoyl]amino]propanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 3
- QXRNAOYBCYVZCD-BQBZGAKWSA-N (2S)-6-amino-2-[[(2S)-2-aminopropanoyl]amino]hexanoic acid Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN QXRNAOYBCYVZCD-BQBZGAKWSA-N 0.000 description 3
- RFCVXVPWSPOMFJ-UHFFFAOYSA-N 2-[(2-azaniumyl-3-phenylpropanoyl)amino]-4-methylpentanoate Chemical compound CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 RFCVXVPWSPOMFJ-UHFFFAOYSA-N 0.000 description 3
- CXISPYVYMQWFLE-VKHMYHEASA-N Ala-Gly Chemical compound C[C@H]([NH3+])C(=O)NCC([O-])=O CXISPYVYMQWFLE-VKHMYHEASA-N 0.000 description 3
- 206010001584 Alcohol abuse Diseases 0.000 description 3
- FRYULLIZUDQONW-IMJSIDKUSA-N Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O FRYULLIZUDQONW-IMJSIDKUSA-N 0.000 description 3
- 229920000453 Consensus sequence Polymers 0.000 description 3
- 102100014971 DRD2 Human genes 0.000 description 3
- 102000037169 GABA transporters Human genes 0.000 description 3
- 108091006208 GABA transporters Proteins 0.000 description 3
- HTOOKGDPMXSJSY-STQMWFEESA-N His-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 HTOOKGDPMXSJSY-STQMWFEESA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- RNKSNIBMTUYWSH-YFKPBYRVSA-N L-prolylglycine Chemical compound [O-]C(=O)CNC(=O)[C@@H]1CCC[NH2+]1 RNKSNIBMTUYWSH-YFKPBYRVSA-N 0.000 description 3
- VTJUNIYRYIAIHF-IUCAKERBSA-N Leu-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O VTJUNIYRYIAIHF-IUCAKERBSA-N 0.000 description 3
- 210000000265 Leukocytes Anatomy 0.000 description 3
- 102000008300 Mutant Proteins Human genes 0.000 description 3
- 108010021466 Mutant Proteins Proteins 0.000 description 3
- 101700045263 NRPD2 Proteins 0.000 description 3
- 208000003715 Parkinsonian Disorders Diseases 0.000 description 3
- OIDKVWTWGDWMHY-RYUDHWBXSA-N Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 OIDKVWTWGDWMHY-RYUDHWBXSA-N 0.000 description 3
- PPQRSMGDOHLTBE-UWVGGRQHSA-N Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PPQRSMGDOHLTBE-UWVGGRQHSA-N 0.000 description 3
- 210000003523 Substantia Nigra Anatomy 0.000 description 3
- VPZKQTYZIVOJDV-LMVFSUKVSA-N Thr-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(O)=O VPZKQTYZIVOJDV-LMVFSUKVSA-N 0.000 description 3
- 201000003082 alcohol use disease Diseases 0.000 description 3
- 229960000070 antineoplastic Monoclonal antibodies Drugs 0.000 description 3
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 3
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 3
- 230000030833 cell death Effects 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 230000001419 dependent Effects 0.000 description 3
- 108020001507 fusion proteins Proteins 0.000 description 3
- 102000037240 fusion proteins Human genes 0.000 description 3
- 108010089804 glycyl-threonine Proteins 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 3
- 108010056582 methionylglutamic acid Proteins 0.000 description 3
- 229960000060 monoclonal antibodies Drugs 0.000 description 3
- 108010029020 prolylglycine Proteins 0.000 description 3
- 210000001519 tissues Anatomy 0.000 description 3
- VNYDHJARLHNEGA-RYUDHWBXSA-N (2S)-1-[(2S)-2-azaniumyl-3-(4-hydroxyphenyl)propanoyl]pyrrolidine-2-carboxylate Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 VNYDHJARLHNEGA-RYUDHWBXSA-N 0.000 description 2
- POTCZYQVVNXUIG-BQBZGAKWSA-N (2S)-1-[2-[[(2S)-2-amino-3-carboxypropanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 2
- YFBBUHJJUXXZOF-UWVGGRQHSA-N (2S)-1-[2-[[(2S)-2-azaniumyl-4-methylpentanoyl]amino]acetyl]pyrrolidine-2-carboxylate Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 2
- UCNNZELZXFXXJQ-BZSNNMDCSA-N (2S)-2-[[(2S)-2-[[(2S)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 2
- MPZWMIIOPAPAKE-UHFFFAOYSA-N 4-amino-5-[[1-carboxy-4-(diaminomethylideneamino)butyl]amino]-5-oxopentanoic acid Chemical compound OC(=O)CCC(N)C(=O)NC(C(O)=O)CCCN=C(N)N MPZWMIIOPAPAKE-UHFFFAOYSA-N 0.000 description 2
- 108020003589 5' Untranslated Regions Proteins 0.000 description 2
- FSHURBQASBLAPO-WDSKDSINSA-N Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)N FSHURBQASBLAPO-WDSKDSINSA-N 0.000 description 2
- WPWUFUBLGADILS-WDSKDSINSA-N Ala-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WPWUFUBLGADILS-WDSKDSINSA-N 0.000 description 2
- WVRUNFYJIHNFKD-WDSKDSINSA-N Arg-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N WVRUNFYJIHNFKD-WDSKDSINSA-N 0.000 description 2
- SIFXMYAHXJGAFC-WDSKDSINSA-N Arg-Asp Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O SIFXMYAHXJGAFC-WDSKDSINSA-N 0.000 description 2
- RJUHZPRQRQLCFL-IMJSIDKUSA-N Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O RJUHZPRQRQLCFL-IMJSIDKUSA-N 0.000 description 2
- HZYFHQOWCFUSOV-IMJSIDKUSA-N Asn-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O HZYFHQOWCFUSOV-IMJSIDKUSA-N 0.000 description 2
- GADKFYNESXNRLC-WDSKDSINSA-N Asn-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O GADKFYNESXNRLC-WDSKDSINSA-N 0.000 description 2
- KWBQPGIYEZKDEG-FSPLSTOPSA-N Asn-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O KWBQPGIYEZKDEG-FSPLSTOPSA-N 0.000 description 2
- DVUFTQLHHHJEMK-IMJSIDKUSA-N Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O DVUFTQLHHHJEMK-IMJSIDKUSA-N 0.000 description 2
- HSPSXROIMXIJQW-BQBZGAKWSA-N Asp-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 HSPSXROIMXIJQW-BQBZGAKWSA-N 0.000 description 2
- DWBZEJHQQIURML-IMJSIDKUSA-N Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O DWBZEJHQQIURML-IMJSIDKUSA-N 0.000 description 2
- ZARXTZFGQZBYFO-JQWIXIFHSA-N Asp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)N)C(O)=O)=CNC2=C1 ZARXTZFGQZBYFO-JQWIXIFHSA-N 0.000 description 2
- 229960001230 Asparagine Drugs 0.000 description 2
- TWXZVVXRRRRSLT-UHFFFAOYSA-N Asparaginyl-Cysteine Chemical compound NC(=O)CC(N)C(=O)NC(CS)C(O)=O TWXZVVXRRRRSLT-UHFFFAOYSA-N 0.000 description 2
- 210000004369 Blood Anatomy 0.000 description 2
- 229920001405 Coding region Polymers 0.000 description 2
- ZSRSLWKGWFFVCM-WDSKDSINSA-N Cys-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O ZSRSLWKGWFFVCM-WDSKDSINSA-N 0.000 description 2
- NXTYATMDWQYLGJ-UHFFFAOYSA-N Cysteinyl-Leucine Chemical compound CC(C)CC(C(O)=O)NC(=O)C(N)CS NXTYATMDWQYLGJ-UHFFFAOYSA-N 0.000 description 2
- 102100016784 DDC Human genes 0.000 description 2
- 108010054576 Deoxyribonuclease EcoRI Proteins 0.000 description 2
- 108010008044 Dopa Decarboxylase Proteins 0.000 description 2
- 108050004812 Dopamine receptor family Proteins 0.000 description 2
- 102000015554 Dopamine receptor family Human genes 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- BTCSSZJGUNDROE-UHFFFAOYSA-N GABA Chemical compound NCCCC(O)=O BTCSSZJGUNDROE-UHFFFAOYSA-N 0.000 description 2
- UQHGAYSULGRWRG-WHFBIAKZSA-N Glu-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(O)=O UQHGAYSULGRWRG-WHFBIAKZSA-N 0.000 description 2
- JSIQVRIXMINMTA-ZDLURKLDSA-N Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O JSIQVRIXMINMTA-ZDLURKLDSA-N 0.000 description 2
- IKAIKUBBJHFNBZ-LURJTMIESA-N Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CN IKAIKUBBJHFNBZ-LURJTMIESA-N 0.000 description 2
- PFMUCCYYAAFKTH-YFKPBYRVSA-N Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CN PFMUCCYYAAFKTH-YFKPBYRVSA-N 0.000 description 2
- AJHCSUXXECOXOY-NSHDSACASA-N Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-NSHDSACASA-N 0.000 description 2
- MAJYPBAJPNUFPV-UHFFFAOYSA-N Histidinyl-Cysteine Chemical compound SCC(C(O)=O)NC(=O)C(N)CC1=CN=CN1 MAJYPBAJPNUFPV-UHFFFAOYSA-N 0.000 description 2
- 210000004408 Hybridomas Anatomy 0.000 description 2
- 210000001503 Joints Anatomy 0.000 description 2
- VYZAGTDAHUIRQA-WHFBIAKZSA-N L-alanyl-L-glutamic acid Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O VYZAGTDAHUIRQA-WHFBIAKZSA-N 0.000 description 2
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 2
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- 229920001320 Leader sequence (mRNA) Polymers 0.000 description 2
- SENJXOPIZNYLHU-IUCAKERBSA-N Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-IUCAKERBSA-N 0.000 description 2
- OTXBNHIUIHNGAO-UWVGGRQHSA-N Leu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN OTXBNHIUIHNGAO-UWVGGRQHSA-N 0.000 description 2
- BQVUABVGYYSDCJ-ZFWWWQNUSA-N Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-ZFWWWQNUSA-N 0.000 description 2
- 210000004698 Lymphocytes Anatomy 0.000 description 2
- HGNRJCINZYHNOU-LURJTMIESA-N Lys-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(O)=O HGNRJCINZYHNOU-LURJTMIESA-N 0.000 description 2
- ATIPDCIQTUXABX-UWVGGRQHSA-N Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ATIPDCIQTUXABX-UWVGGRQHSA-N 0.000 description 2
- NVGBPTNZLWRQSY-UWVGGRQHSA-N Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN NVGBPTNZLWRQSY-UWVGGRQHSA-N 0.000 description 2
- YSZNURNVYFUEHC-BQBZGAKWSA-N Lys-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(O)=O YSZNURNVYFUEHC-BQBZGAKWSA-N 0.000 description 2
- YQAIUOWPSUOINN-IUCAKERBSA-N Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN YQAIUOWPSUOINN-IUCAKERBSA-N 0.000 description 2
- IGRMTQMIDNDFAA-UHFFFAOYSA-N Lysyl-Histidine Chemical compound NCCCCC(N)C(=O)NC(C(O)=O)CC1=CN=CN1 IGRMTQMIDNDFAA-UHFFFAOYSA-N 0.000 description 2
- QXOHLNCNYLGICT-YFKPBYRVSA-N Met-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(O)=O QXOHLNCNYLGICT-YFKPBYRVSA-N 0.000 description 2
- PBOUVYGPDSARIS-IUCAKERBSA-N Met-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(C)C PBOUVYGPDSARIS-IUCAKERBSA-N 0.000 description 2
- DZMGFGQBRYWJOR-YUMQZZPRSA-N Met-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O DZMGFGQBRYWJOR-YUMQZZPRSA-N 0.000 description 2
- 229920002393 Microsatellite Polymers 0.000 description 2
- 206010029350 Neurotoxicity Diseases 0.000 description 2
- 231100000618 Neurotoxin Toxicity 0.000 description 2
- 238000010222 PCR analysis Methods 0.000 description 2
- 206010034010 Parkinsonism Diseases 0.000 description 2
- MIDZLCFIAINOQN-WPRPVWTQSA-N Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 MIDZLCFIAINOQN-WPRPVWTQSA-N 0.000 description 2
- BXNGIHFNNNSEOS-UWVGGRQHSA-N Phe-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 BXNGIHFNNNSEOS-UWVGGRQHSA-N 0.000 description 2
- GLUBLISJVJFHQS-VIFPVBQESA-N Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 GLUBLISJVJFHQS-VIFPVBQESA-N 0.000 description 2
- KNPVDQMEHSCAGX-UHFFFAOYSA-N Phenylalanyl-Cysteine Chemical compound SCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 KNPVDQMEHSCAGX-UHFFFAOYSA-N 0.000 description 2
- HMNSRTLZAJHSIK-YUMQZZPRSA-N Pro-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 HMNSRTLZAJHSIK-YUMQZZPRSA-N 0.000 description 2
- RVQDZELMXZRSSI-IUCAKERBSA-N Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 RVQDZELMXZRSSI-IUCAKERBSA-N 0.000 description 2
- 229920000970 Repeated sequence (DNA) Polymers 0.000 description 2
- SSJMZMUVNKEENT-IMJSIDKUSA-N Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CO SSJMZMUVNKEENT-IMJSIDKUSA-N 0.000 description 2
- YZMPDHTZJJCGEI-BQBZGAKWSA-N Ser-His Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 YZMPDHTZJJCGEI-BQBZGAKWSA-N 0.000 description 2
- WBAXJMCUFIXCNI-WDSKDSINSA-N Ser-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WBAXJMCUFIXCNI-WDSKDSINSA-N 0.000 description 2
- LDEBVRIURYMKQS-UHFFFAOYSA-N Serinyl-Threonine Chemical compound CC(O)C(C(O)=O)NC(=O)C(N)CO LDEBVRIURYMKQS-UHFFFAOYSA-N 0.000 description 2
- LZLREEUGSYITMX-UHFFFAOYSA-N Serinyl-Tryptophan Chemical compound C1=CC=C2C(CC(NC(=O)C(CO)N)C(O)=O)=CNC2=C1 LZLREEUGSYITMX-UHFFFAOYSA-N 0.000 description 2
- 210000002966 Serum Anatomy 0.000 description 2
- NHUHCSRWZMLRLA-UHFFFAOYSA-N Sulfizole Chemical compound CC1=NOC(NS(=O)(=O)C=2C=CC(N)=CC=2)=C1C NHUHCSRWZMLRLA-UHFFFAOYSA-N 0.000 description 2
- APIDTRXFGYOLLH-VQVTYTSYSA-N Thr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O APIDTRXFGYOLLH-VQVTYTSYSA-N 0.000 description 2
- IQHUITKNHOKGFC-MIMYLULJSA-N Thr-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IQHUITKNHOKGFC-MIMYLULJSA-N 0.000 description 2
- QOLYAJSZHIJCTO-VQVTYTSYSA-N Thr-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O QOLYAJSZHIJCTO-VQVTYTSYSA-N 0.000 description 2
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 2
- UQTNIFUCMBFWEJ-UHFFFAOYSA-N Threoninyl-Asparagine Chemical compound CC(O)C(N)C(=O)NC(C(O)=O)CC(N)=O UQTNIFUCMBFWEJ-UHFFFAOYSA-N 0.000 description 2
- 206010044221 Toxic encephalopathy Diseases 0.000 description 2
- UYKREHOKELZSPB-JTQLQIEISA-N Trp-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(O)=O)=CNC2=C1 UYKREHOKELZSPB-JTQLQIEISA-N 0.000 description 2
- LYMVXFSTACVOLP-ZFWWWQNUSA-N Trp-Leu Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C([O-])=O)=CNC2=C1 LYMVXFSTACVOLP-ZFWWWQNUSA-N 0.000 description 2
- CGWAPUBOXJWXMS-HOTGVXAUSA-N Tyr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 CGWAPUBOXJWXMS-HOTGVXAUSA-N 0.000 description 2
- OYOQKMOWUDVWCR-RYUDHWBXSA-N Tyr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OYOQKMOWUDVWCR-RYUDHWBXSA-N 0.000 description 2
- 102000031061 Tyrosine 3-monooxygenases Human genes 0.000 description 2
- 108091000118 Tyrosine 3-monooxygenases Proteins 0.000 description 2
- 210000002700 Urine Anatomy 0.000 description 2
- UPJONISHZRADBH-XPUUQOCRSA-N Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UPJONISHZRADBH-XPUUQOCRSA-N 0.000 description 2
- GJNDXQBALKCYSZ-RYUDHWBXSA-N Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 GJNDXQBALKCYSZ-RYUDHWBXSA-N 0.000 description 2
- GIAZPLMMQOERPN-YUMQZZPRSA-N Val-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O GIAZPLMMQOERPN-YUMQZZPRSA-N 0.000 description 2
- WPSXZFTVLIAPCN-UHFFFAOYSA-N Valyl-Cysteine Chemical compound CC(C)C(N)C(=O)NC(CS)C(O)=O WPSXZFTVLIAPCN-UHFFFAOYSA-N 0.000 description 2
- 238000007792 addition Methods 0.000 description 2
- 239000011543 agarose gel Substances 0.000 description 2
- 108010011559 alanylphenylalanine Proteins 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- 108010070783 alanyltyrosine Proteins 0.000 description 2
- 229960003692 aminobutyric acid Drugs 0.000 description 2
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 2
- 235000009582 asparagine Nutrition 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 239000002131 composite material Substances 0.000 description 2
- 230000000875 corresponding Effects 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 230000028436 dopamine uptake Effects 0.000 description 2
- 230000002708 enhancing Effects 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 229940013945 gamma-Aminobutyric Acid Drugs 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 125000000267 glycino group Chemical group [H]N([*])C([H])([H])C(=O)O[H] 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 238000003018 immunoassay Methods 0.000 description 2
- 230000002163 immunogen Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 230000002887 neurotoxic Effects 0.000 description 2
- 231100000228 neurotoxicity Toxicity 0.000 description 2
- 239000002581 neurotoxin Substances 0.000 description 2
- 239000002773 nucleotide Substances 0.000 description 2
- 125000003729 nucleotide group Chemical group 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 108091008117 polyclonal antibodies Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 229940001470 psychoactive drugs Drugs 0.000 description 2
- 239000004089 psychotropic agent Substances 0.000 description 2
- 230000001105 regulatory Effects 0.000 description 2
- 230000002787 reinforcement Effects 0.000 description 2
- 230000001177 retroviral Effects 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 150000008163 sugars Chemical class 0.000 description 2
- 230000001225 therapeutic Effects 0.000 description 2
- 230000001131 transforming Effects 0.000 description 2
- 230000001052 transient Effects 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- 108010038745 tryptophylglycine Proteins 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 108010078580 tyrosylleucine Proteins 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- SITLTJHOQZFJGG-XPUUQOCRSA-N α-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SITLTJHOQZFJGG-XPUUQOCRSA-N 0.000 description 2
- SFLSHLFXELFNJZ-QMMMGPOBSA-N (-)-norepinephrine Chemical compound NC[C@H](O)C1=CC=C(O)C(O)=C1 SFLSHLFXELFNJZ-QMMMGPOBSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N (2S)-2-[[(2S)-2-[[(2S)-2-amino-4-methylpentanoyl]amino]-3-methylbutanoyl]amino]-3-hydroxypropanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N (2S)-2-[[(2S)-2-[[(2S)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]amino]-3-hydroxypropanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N (2S)-2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]propanoate Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- 229920000160 (ribonucleotides)n+m Polymers 0.000 description 1
- UMCMPZBLKLEWAF-BCTGSCMUSA-N 3-[(3-cholamidopropyl)dimethylammonio]propane-1-sulfonate Chemical compound C([C@H]1C[C@H]2O)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(=O)NCCC[N+](C)(C)CCCS([O-])(=O)=O)C)[C@@]2(C)[C@@H](O)C1 UMCMPZBLKLEWAF-BCTGSCMUSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N 4-amino-5-[(1-carboxy-2-phenylethyl)amino]-5-oxopentanoic acid Chemical compound OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 229940005513 ANTIDEPRESSANTS Drugs 0.000 description 1
- XZWXFWBHYRFLEF-FSPLSTOPSA-N Ala-His Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 XZWXFWBHYRFLEF-FSPLSTOPSA-N 0.000 description 1
- OMNVYXHOSHNURL-WPRPVWTQSA-N Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OMNVYXHOSHNURL-WPRPVWTQSA-N 0.000 description 1
- WUGMRIBZSVSJNP-UFBFGSQYSA-N Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UFBFGSQYSA-N 0.000 description 1
- 210000000628 Antibody-Producing Cells Anatomy 0.000 description 1
- CKAJHWFHHFSCDT-WHFBIAKZSA-N Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O CKAJHWFHHFSCDT-WHFBIAKZSA-N 0.000 description 1
- JHFNSBBHKSZXKB-VKHMYHEASA-N Asp-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(O)=O JHFNSBBHKSZXKB-VKHMYHEASA-N 0.000 description 1
- YZQCXOFQZKCETR-UWVGGRQHSA-N Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YZQCXOFQZKCETR-UWVGGRQHSA-N 0.000 description 1
- QJMCHPGWFZZRID-UHFFFAOYSA-N Asparaginyl-Lysine Chemical compound NCCCCC(C(O)=O)NC(=O)C(N)CC(N)=O QJMCHPGWFZZRID-UHFFFAOYSA-N 0.000 description 1
- 229960005261 Aspartic Acid Drugs 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000219495 Betulaceae Species 0.000 description 1
- 241000557626 Corvus corax Species 0.000 description 1
- 102000012605 Cystic Fibrosis Transmembrane Conductance Regulator Human genes 0.000 description 1
- 108010079245 Cystic Fibrosis Transmembrane Conductance Regulator Proteins 0.000 description 1
- 102000004594 DNA Polymerase I Human genes 0.000 description 1
- 108010017826 DNA Polymerase I Proteins 0.000 description 1
- 238000007400 DNA extraction Methods 0.000 description 1
- 101700011961 DPOM Proteins 0.000 description 1
- 206010061818 Disease progression Diseases 0.000 description 1
- 206010013663 Drug dependence Diseases 0.000 description 1
- 108010013369 EC 3.4.21.9 Proteins 0.000 description 1
- 108010067770 Endopeptidase K Proteins 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N Ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229920001272 Exogenous DNA Polymers 0.000 description 1
- 208000001914 Fragile X Syndrome Diseases 0.000 description 1
- 108009000484 Fragile X Syndrome Proteins 0.000 description 1
- BBBXWRGITSUJPB-YUMQZZPRSA-N Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O BBBXWRGITSUJPB-YUMQZZPRSA-N 0.000 description 1
- XBGGUPMXALFZOT-VIFPVBQESA-N Gly-Tyr Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-VIFPVBQESA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 102000005744 Glycoside Hydrolases Human genes 0.000 description 1
- 108010031186 Glycoside Hydrolases Proteins 0.000 description 1
- 208000005721 HIV Infections Diseases 0.000 description 1
- 230000036499 Half live Effects 0.000 description 1
- 229940120060 Heroin Drugs 0.000 description 1
- GVGLGOZIDCSQPN-PVHGPHFFSA-N Heroin Chemical compound O([C@H]1[C@H](C=C[C@H]23)OC(C)=O)C4=C5[C@@]12CCN(C)[C@@H]3CC5=CC=C4OC(C)=O GVGLGOZIDCSQPN-PVHGPHFFSA-N 0.000 description 1
- LNCFUHAPNTYMJB-IUCAKERBSA-N His-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 LNCFUHAPNTYMJB-IUCAKERBSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 1
- 210000003000 Inclusion Bodies Anatomy 0.000 description 1
- SFLSHLFXELFNJZ-MRVPVSSYSA-N L-Noradrenaline Natural products NC[C@@H](O)C1=CC=C(O)C(O)=C1 SFLSHLFXELFNJZ-MRVPVSSYSA-N 0.000 description 1
- 125000000510 L-tryptophano group Chemical group [H]C1=C([H])C([H])=C2N([H])C([H])=C(C([H])([H])[C@@]([H])(C(O[H])=O)N([H])[*])C2=C1[H] 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- CIOWSLJGLSUOME-BQBZGAKWSA-N Lys-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O CIOWSLJGLSUOME-BQBZGAKWSA-N 0.000 description 1
- UGTZHPSKYRIGRJ-YUMQZZPRSA-N Lys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UGTZHPSKYRIGRJ-YUMQZZPRSA-N 0.000 description 1
- QCZYYEFXOBKCNQ-STQMWFEESA-N Lys-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCZYYEFXOBKCNQ-STQMWFEESA-N 0.000 description 1
- 101710029649 MDV043 Proteins 0.000 description 1
- 206010061284 Mental disease Diseases 0.000 description 1
- JHKXZYLNVJRAAJ-WDSKDSINSA-N Met-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(O)=O JHKXZYLNVJRAAJ-WDSKDSINSA-N 0.000 description 1
- KAKJTZWHIUWTTD-VQVTYTSYSA-N Met-Thr Chemical compound CSCC[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)O)C([O-])=O KAKJTZWHIUWTTD-VQVTYTSYSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 206010068871 Myotonic dystrophy Diseases 0.000 description 1
- CJWXCNXHAIFFMH-AVZHFPDBSA-N N-[(2S,3R,4S,5S,6R)-2-[(2R,3R,4S,5R)-2-acetamido-4,5,6-trihydroxy-1-oxohexan-3-yl]oxy-3,5-dihydroxy-6-methyloxan-4-yl]acetamide Chemical compound C[C@H]1O[C@@H](O[C@@H]([C@@H](O)[C@H](O)CO)[C@@H](NC(C)=O)C=O)[C@H](O)[C@@H](NC(C)=O)[C@@H]1O CJWXCNXHAIFFMH-AVZHFPDBSA-N 0.000 description 1
- 210000001577 Neostriatum Anatomy 0.000 description 1
- 208000009025 Nervous System Disease Diseases 0.000 description 1
- 102000005348 Neuraminidase Human genes 0.000 description 1
- 108010006232 Neuraminidase Proteins 0.000 description 1
- 206010029305 Neurological disorder Diseases 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 229960002748 Norepinephrine Drugs 0.000 description 1
- 102000008092 Norepinephrine Plasma Membrane Transport Proteins Human genes 0.000 description 1
- 108010049586 Norepinephrine Plasma Membrane Transport Proteins Proteins 0.000 description 1
- 210000004940 Nucleus Anatomy 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- 101700061424 POLB Proteins 0.000 description 1
- 238000001358 Pearson's chi-squared test Methods 0.000 description 1
- 229920004694 Pedigree® Polymers 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- 102000000447 Peptide-N4-(N-acetyl-beta-glucosaminyl) Asparagine Amidase Human genes 0.000 description 1
- 108010055817 Peptide-N4-(N-acetyl-beta-glucosaminyl) Asparagine Amidase Proteins 0.000 description 1
- IEHDJWSAXBGJIP-RYUDHWBXSA-N Phe-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 IEHDJWSAXBGJIP-RYUDHWBXSA-N 0.000 description 1
- 239000004743 Polypropylene Substances 0.000 description 1
- 210000000063 Presynaptic Terminals Anatomy 0.000 description 1
- FELJDCNGZFDUNR-WDSKDSINSA-N Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FELJDCNGZFDUNR-WDSKDSINSA-N 0.000 description 1
- RWCOTTLHDJWHRS-YUMQZZPRSA-N Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RWCOTTLHDJWHRS-YUMQZZPRSA-N 0.000 description 1
- UEKYKRQIAQHOOZ-KBPBESRZSA-N Pro-Trp Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)[O-])C(=O)[C@@H]1CCC[NH2+]1 UEKYKRQIAQHOOZ-KBPBESRZSA-N 0.000 description 1
- HXNYBZQLBWIADP-UHFFFAOYSA-N Prolyl-Cysteine Chemical compound OC(=O)C(CS)NC(=O)C1CCCN1 HXNYBZQLBWIADP-UHFFFAOYSA-N 0.000 description 1
- BEPSGCXDIVACBU-UHFFFAOYSA-N Prolyl-Histidine Chemical compound C1CCNC1C(=O)NC(C(=O)O)CC1=CN=CN1 BEPSGCXDIVACBU-UHFFFAOYSA-N 0.000 description 1
- 208000008425 Protein Deficiency Diseases 0.000 description 1
- 206010037175 Psychiatric disease Diseases 0.000 description 1
- 206010061920 Psychotic disease Diseases 0.000 description 1
- 101700054624 RF1 Proteins 0.000 description 1
- 108090000244 Rat Proteins Proteins 0.000 description 1
- 102100001821 SLC6A3 Human genes 0.000 description 1
- 108060007762 SLC6A3 Proteins 0.000 description 1
- 229920002684 Sepharose Polymers 0.000 description 1
- WOUIMBGNEUWXQG-VKHMYHEASA-N Ser-Gly Chemical compound OC[C@H](N)C(=O)NCC(O)=O WOUIMBGNEUWXQG-VKHMYHEASA-N 0.000 description 1
- 241000580858 Simian-Human immunodeficiency virus Species 0.000 description 1
- 210000000952 Spleen Anatomy 0.000 description 1
- 206010042297 Subacute sclerosing panencephalitis Diseases 0.000 description 1
- 102100009508 TMPRSS15 Human genes 0.000 description 1
- BIYXEUAFGLTAEM-WUJLRWPWSA-N Thr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(O)=O BIYXEUAFGLTAEM-WUJLRWPWSA-N 0.000 description 1
- CKHWEVXPLJBEOZ-UHFFFAOYSA-N Threoninyl-Valine Chemical compound CC(C)C(C(O)=O)NC(=O)C(N)C(C)O CKHWEVXPLJBEOZ-UHFFFAOYSA-N 0.000 description 1
- 231100000765 Toxin Toxicity 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- DZHDVYLBNKMLMB-ZFWWWQNUSA-N Trp-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 DZHDVYLBNKMLMB-ZFWWWQNUSA-N 0.000 description 1
- IMMPMHKLUUZKAZ-WMZOPIPTSA-N Trp-Phe Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 IMMPMHKLUUZKAZ-WMZOPIPTSA-N 0.000 description 1
- NLKUJNGEGZDXGO-XVKPBYJWSA-N Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NLKUJNGEGZDXGO-XVKPBYJWSA-N 0.000 description 1
- HPYDSVWYXXKHRD-VIFPVBQESA-N Tyr-Gly Chemical compound [O-]C(=O)CNC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 HPYDSVWYXXKHRD-VIFPVBQESA-N 0.000 description 1
- BMPPMAOOKQJYIP-WMZOPIPTSA-N Tyr-Trp Chemical compound C([C@H]([NH3+])C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C([O-])=O)C1=CC=C(O)C=C1 BMPPMAOOKQJYIP-WMZOPIPTSA-N 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- STTYIMSDIYISRG-WDSKDSINSA-N Val-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(O)=O STTYIMSDIYISRG-WDSKDSINSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 238000003641 Yates's correction for continuity Methods 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 238000001261 affinity purification Methods 0.000 description 1
- 239000000556 agonist Substances 0.000 description 1
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 239000012491 analyte Substances 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 230000001430 anti-depressive Effects 0.000 description 1
- 230000000692 anti-sense Effects 0.000 description 1
- 239000000935 antidepressant agent Substances 0.000 description 1
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 125000000613 asparagine group Chemical group N[C@@H](CC(N)=O)C(=O)* 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 230000003190 augmentative Effects 0.000 description 1
- 108091007235 betacoronavirus-specific marker domain Proteins 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 235000018185 birch Nutrition 0.000 description 1
- 235000018212 birch Nutrition 0.000 description 1
- UIIMBOGNXHQVGW-UHFFFAOYSA-M buffer Substances [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 description 1
- 150000001720 carbohydrates Chemical group 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000005859 cell recognition Effects 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 108091006028 chimera Proteins 0.000 description 1
- 230000002759 chromosomal Effects 0.000 description 1
- 230000001684 chronic Effects 0.000 description 1
- 238000003759 clinical diagnosis Methods 0.000 description 1
- 108010063333 cocaine receptor Proteins 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000000295 complement Effects 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- 201000003883 cystic fibrosis Diseases 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 230000003013 cytotoxicity Effects 0.000 description 1
- 231100000135 cytotoxicity Toxicity 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 230000004059 degradation Effects 0.000 description 1
- 239000007857 degradation product Substances 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000001627 detrimental Effects 0.000 description 1
- 229960002069 diamorphine Drugs 0.000 description 1
- 239000003210 dopamine receptor blocking agent Substances 0.000 description 1
- 239000003136 dopamine receptor stimulating agent Substances 0.000 description 1
- 230000002825 dopamine reuptake Effects 0.000 description 1
- 238000007877 drug screening Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 230000000534 elicitor Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 239000012149 elution buffer Substances 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 239000012520 frozen sample Substances 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 150000004676 glycans Polymers 0.000 description 1
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine zwitterion Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- 238000001631 haemodialysis Methods 0.000 description 1
- 239000000380 hallucinogen Substances 0.000 description 1
- 230000000322 hemodialysis Effects 0.000 description 1
- 210000001357 hemopoietic progenitor cell Anatomy 0.000 description 1
- 229940094991 herring sperm DNA Drugs 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 230000002209 hydrophobic Effects 0.000 description 1
- 239000002117 illicit drug Substances 0.000 description 1
- 210000002865 immune cell Anatomy 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 230000003053 immunization Effects 0.000 description 1
- 238000002649 immunization Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000001524 infective Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000003834 intracellular Effects 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 238000009593 lumbar puncture Methods 0.000 description 1
- 230000002934 lysing Effects 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000006011 modification reaction Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 231100000219 mutagenic Toxicity 0.000 description 1
- 230000003505 mutagenic Effects 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 230000001575 pathological Effects 0.000 description 1
- 239000011886 peripheral blood Substances 0.000 description 1
- 239000008194 pharmaceutical composition Substances 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- -1 polypropylene Polymers 0.000 description 1
- 229920001155 polypropylene Polymers 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 150000004804 polysaccharides Polymers 0.000 description 1
- 238000011176 pooling Methods 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000003518 presynaptic Effects 0.000 description 1
- QLROSWPKSBORFJ-BQBZGAKWSA-N pro glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 QLROSWPKSBORFJ-BQBZGAKWSA-N 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- KCXFHTAICRTXLI-UHFFFAOYSA-M propane-1-sulfonate Chemical compound CCCS([O-])(=O)=O KCXFHTAICRTXLI-UHFFFAOYSA-M 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 230000002797 proteolythic Effects 0.000 description 1
- 239000003368 psychostimulant agent Substances 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 230000003014 reinforcing Effects 0.000 description 1
- 230000003252 repetitive Effects 0.000 description 1
- 108091007521 restriction endonucleases Proteins 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000004062 sedimentation Methods 0.000 description 1
- 125000003616 serine group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 1
- 125000005629 sialic acid group Chemical group 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- KEAYESYHFKHZAL-UHFFFAOYSA-N sodium Chemical compound [Na] KEAYESYHFKHZAL-UHFFFAOYSA-N 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 230000000087 stabilizing Effects 0.000 description 1
- 201000006152 substance dependence Diseases 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 230000005062 synaptic transmission Effects 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000002194 synthesizing Effects 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 235000019505 tobacco product Nutrition 0.000 description 1
- 230000002588 toxic Effects 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 230000002936 tranquilizing Effects 0.000 description 1
- 239000003204 tranquilizing agent Substances 0.000 description 1
- 238000002054 transplantation Methods 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 230000036325 urinary excretion Effects 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Definitions
- the invention relates to a cloned cDNA which encodes the human dopamine transporter protein.
- the 10 cloned cDNA provides a means of expressing human dopamine transorter protein in a variety of contexts and also provides a means of diagnosing and treating diseases presenting abnormal expression of dopamine transporter protein.
- the dopamine transporter that acts to take released dopamine back up into presynaptic terminals has been implicated in several human disorders.
- Cocaine binds to the dopamine transporter and blocks dopamine reuptake in a fashion that correlates well with cocaine reward and reinforcement (M.C. Ritz et al., Science 237, 1219 (1987)).
- Neurotoxins that cause Parkinsonian syndromes are concentrated in dopaminergic neurons by this transporter (S.H. Snyder and R.J. D'Amato, Neurology 36, 250 (1986); G. Uhl, Eur. J. Neurol. 30, 21 (1990)).
- Binding to the dopamine transporter is altered in brains of patients with Tourette's syndrome (H.S. Singer et al., Ann. Neurol. 30, 558 (1991)). These clinical links enhance interest in the structure and function of the human dopamine transporter (HUDAT) . Vulnerability to these disorders may have genetic components (E.J. Devor and CR. Cloninger, Annu. Rev. Genet. 23, 19 (1989) ; D. Pauls and J. Leckman, New Eng. J. Med. 315, 993 (1986); R. Pickens et al., Arch. Gen. Psychiatry 48, 19 (1991)); thus identification of linkage markers for the human DAT is also of interest.
- Dopamine transporters act to terminate dopaminergic neurotransmission by sodium- and chloride- dependent reaccumulation of dopamine into pre-synaptic neurons (L.L. Iversen, in Handbook of Psychopharmacology, L.L. Iversen, S.J. Iversen, & S.H. Snyder, Eds. (Plenum, New York, 1976), pp. 381-442; M.J. Kuhar and M.A. Zarbin, J. Neurochem. 31, 251 (1978); A.S. Horn, Prog. Neurobiol. 34, 387 (1990)).
- Cocaine and related drugs bind to these transporters in a fashion that correlates well with their behavioral reinforcing and psychomotor stimulant properties; these transporters are thus the principal brain "cocaine receptors" related to drug abuse (M.C. Ritz, R.J. Lamb, S.R. Goldberg, M.J. Kuhar, Science 237,1219 (1987); J. Bergman, B. K. Madras, S. E. Johnson, R. 0. Spealman, J. Pharmacol. Exp. Ther. 251,
- the transporters accumulate neurotoxins with structural features resembling dopamine; their ability to concentrate the parkinsonism-inducing toxin MPP + (l-methyl-4-phenylpyridinium) is key to this agent's selective dopaminergic neurotoxicity (S.H.
- dopamine transporter protein is an 80 kDa glycoprotein, but have not yet yielded protein sequence data (D.E. Grigoriadis, A.A.
- pcHUDAT cDNA
- RFLPs restriction fragment length polymorphisms
- VNTRs Variable Number Tandem Repeats
- the cDNA encoding the human dopamine transporter protein provides a means for diagnosing and treating disorders that arise by expression of abnormal amounts of or dysfunctional dopamine transporter molecules in a human being. It is one object of the invention to produce a cDNA that encodes the human dopamine transporter protein, a product of dopaminergic neurons that binds dopamine, cocaine and cocaine analogs and will transport dopamine and MPP+ into mammalian cells expressing it on their surface. It is a further object of the invention to utilize the cDNA to produce cell lines that express human DAT on their surface and to provide a method for the screening of compounds that influence the binding and/or transport of dopamine or cocaine or functional analogs thereof to (into) the cells.
- a third object of the invention is to provide diagnostic means for assessing HUDAT expression in patients by DNA- or antibody- based tests and for assessing the onset or progression of disease by assay of HUDAT degradation.
- the invention is embodied in cell lines, created by stable transformation of cells by a vector encoding the dopamine transporter protein, expressing the dopamine transporter protein on their surface.
- Another aspect of the invention relates to a method of using such lines to screen pharmaceutical compositions for their ability to inhibit the binding of dopamine, cocaine or analogs of these compounds to the transporter protein. Such a screening can also be accomplished by use of cells transiently expressing dopamine transporter cDNA.
- the invention also relates to diagnostic applications of the dopamine transporter cDNA and anti-human DAT antibodies and to therapeutic applications of the HUDAT cDNA.
- Figure 1 shows the nucleotide sequence of the pcHUDAT cDNA encoding the human dopamine transporter protein; the sequence is a composite derived from the sequence of clones pHCDAT2, pHCDAT3 and pHCDAT7.
- Figure 2 shows the sequences of the repeat elements in the 3' untranslated portion of the pcHUDAT cDNA. Also shown is the consensus sequence of the repeats.
- Figure 3 shows a comparison of the amino acid sequence of the human dopamine transporter (Hdat) protein with the amino acid sequence of the rat DAT (Datl) and also with the sequences of the human norepinephrine transporter (Hnat) and of the human gamma-amino-butyric acid transporter (Hgabat) .
- Figure 4A shows a representative RFLP analysis of human genomic DNA from nine unrelated individuals digested with Taql and hybridized with the insert portion of the pHCDAT7 plasmid.
- Figure 4B shows the same DNA, but hybridized with the Taq492 probe, which corresponds to nucleotides 301-793 of the pcHUDAT sequence.
- heterologous proteins in E. coli is often utilized as a means of obtaining large quantities of a polypeptide.
- the product is an unglycosylated protein, which may be made as insoluble "inclusion bodies" in the bacterial cells.
- some proteins can be secreted into the perplasmic space by fusion to a leader sequence that directs the secretion of the translation product.
- Other useful fusion sequences are those which allow affinity purification of the product, such as the pGEX system (Pharmacia) , which allows purification by use of a glutathione- Sepharose column.
- the promoter to be employed is dependent upon the particular protein to be expressed. Some proteins are not detrimental to the physiology of the bacteria and may be expressed using a high-level constitutive promoter. Others are somewhat toxic and so are best expressed from an inducible promoter which keeps synthesis of the heterologous protein repressed until growth of the culture is complete. The promoter is then switched on and the heterologous protein is produced at a high level.
- bacterial expression includes the use of terminator seqences in the transcription unit and the use of sequences in the 5'- untranslated portion of the mRNA to abolish secondary structure which might impede translation. Also the choice of bacterial strain can be important. Some heterologous proteins are susceptible to proteolytic degradation and so are best expressed in strains of bacteria which lack proteolytic functions. Also, strains of bacteria other than E. coli are often useful as hosts for expression systems. The best-developed alternative currently being Bacillus strains. Expression of proteins in bacteria is well- reviewed in "Current Protocols in Molecular Biology", which is published with quarterly updates by Wiley Interscience. Expression of "foreign" proteins in mammalian cells can be accomplished in two general fashions.
- Transient expression refers to the creation of a pool of transfected cells which harbor plasmids that are not stably maintained in the cell and so are gradually diluted out of the population. Transient expression is by nature a short term method. For reproducible expression of a heterologous protein, stable expression systems are preferable.
- the current state of this art includes a variety of vector systems; both integrative and autonomous vectors are available. Inducible expression of heterologous proteins in mammalian cells is difficult to achieve at the current time. Some systems have been described, but they are not yet in general use. More commonly used are vectors bearing moderate to high- level constitutive promoters. Plasmid vectors are relatively easy to use.
- Retroviral vectors which rely upon packaging into infective viral particles and integration into the host cell chromosome are more difficult to use, due to the extra steps involved in creating the recombinant viruses and cell lines which secrete them, but have the advantage that they effectively introduce exogenous DNA into human cell lines.
- Vaccinia virus vector systems are also in widespread use.
- Other viral vectors are under development for gene therapy systems, including adenovirus-derived vectors.
- cDNA libraries prepared from "substantia nigra” and “brainstem” dissections containing cells known to express the transporter were screened with hybridization probes prepared from the rat cDNA, pDATl (S. Shimada et al.. Science 254, 576 (1991)). Sequences from the 3' untranslated region of the rat cDNA were not used because of the presence of CA dinucleotide repeats.
- Human brain stem and substantia nigra cDNA libraries (Stratagene, La Jolla, CA) were plated and blotted onto duplicate replica nitrocellulose (Schleicher and Schuell, Keene, NH) filters, which were incubated for 1 hour at 37°C with proteinase K (50 ⁇ g/ ⁇ l in 2 x SSPE/0.1%SDS) to reduce filter background, washed in 5 x SSC/0.5% SDS/lmM EDTA, prehybridized and hybridized at 42°C, and washed at 54°C in 0.4 x SSC/0.5% SDS.
- proteinase K 50 ⁇ g/ ⁇ l in 2 x SSPE/0.1%SDS
- the hybridization probe was a 2300 bp Eco RI fragment of the rat dopamine transporter cDNA6 (S. Shimada et al.. Science 254, 576 (1991)) [ 32 P] labeled by random priming (Prime It kit, Boehringer Mannheim) , and hybridized at approximately 10 6 cpm/ml. Positively-hybridizing cDNA clones were purified from the brainstem library, autoexcized according to protocols provided by the manufacturer (Stratagene) , and termed pHCDAT2, pHCDAT3, and pHCDAT7. Sequencing was performed on an Applied Biosystems automated sequencer as described (S. Shimada et al., Science 254, 576 (1991)) . Sequence analysis was performed using the GCG software package (J. Devereaux, et al., Nucleic Acids Res. 12, 387 (1984)).
- Screening of more than 2 x 10 6 plaques from the substantia nigra library produced no positives.
- Screening 1 x 10 6 plaques from the brainstem library yielded 11 positively-hybridizing plaques, three of which were identified as human DAT clones by sequence analysis. These clones were identified as representing the 5'-half (pHCDAT2, bases 1-1733), the 3'-half (pHCDAT3, bases 1679-3919), and an internal portion (pHCDAT7, bases 653-1434) of the human DAT cDNA whose reconstructed full-length sequence is shown in Figure 1.
- this cDNA resembles the structure of the rat cDNA DAT1, with a modest 5' untranslated region and a long 3' untranslated region. Both 5' and 3' untranslated regions are longer than those of the rat cDNA pDATl, however, making the length of the predicted human mRNA greater than the 3.7 kb observed for the rat mRNA (S. Shimada et al.. Science 254, 576 (1991)).
- a striking difference between rat and human cDNAs is found in the 3' untranslated region where the human cDNA displays 10 copies of a 40 bp repetitive element that are arrayed in head-to-tail fashion and are absent from the rat cDNA (Figs 1,2).
- sequence of each element is more than 90% identical to the consensus sequence listed at the bottom of Figure 2, although the seventh repeat displays a 5 base pair insertion from its 24th to 28th nucleotides.
- the consensus element found here is 68% G+C No exact match is found in searches of the EMBL/genbank data base, release 70. However, sequences conferring up to 70% nucleic acid identity over up to 37 of these bases are found in viral sequences, especially with herpesvirus sequences (e.g. locus HS1US) .
- the open reading frame predicted by the HUDAT cDNA encodes 620 amino acids, identical in size to the rat DATl cDNA except for an additional amino acid (199) not found in the rat sequence (Fig. 3) .
- This open reading frame predicts amino acid sequences that are 94% identical to those encoded by the rat dopamine transporter cDNA (S. Shimada et al., Science 254, 576 (1991)).
- This high degree of conservation, and the weaker identities with the human norepinephrine and GABA transporter cDNA H. Nelson et al., FEBS Lett. 269, 181-184 (1990); T. Pacholczyk et al., Nature 350, 350-354 (1991)) (Fig. 3), identifies this as the human homolog of the rat DATl.
- the amino acid sequence predicted by the HUDAT cDNA reveals interesting differences from the rat cDNA. It lacks one of the 4 consensus sites for N-linked glycosylation noted in the rat DATl cDNA (Fig 3, + symbol) . Three adjacent amino acids distinguish the human from the rat proteins at this locus; no other portion of the molecule differs by this extent.
- N-linked glycosylation sites indicated in the rat transporter contain classic asparagine - X - serine/threonine sequences. Three of these sites are conserved among the rat and human sequences, but a middle glycosylation site, potentially the most distant from the embedding membrane, is absent in the human transporter. The amino acids surrounding this site provide the largest area of amino acid sequence divergence between the rat and human transporters. If glycosylation is evenly distributed among the different potential sites for N-linked glycosylation, these observations would predict that the human dopamine transporter might display less glycosylation than the rat, and that its molecular weight might be correspondingly smaller.
- the rat sequence does demonstrate 25 copies of a small dinucleotide CA repeat from bases 2476 to 2525 of the 3' untranslated region of its mRNA; CA repeats are absent from the human cDNA (S. Shimada et al., Science 254, 576-578 (1991)).
- the sequence of the longer hDAT repeated element is not found the rat cDNA, nor in searches of other sequences found in databanks. The significance of the partial matches in viral genomes is unclear. These repeated elements might alter mRNA properties, perhaps including secondary structure and/or half-life, in ways that could contribute to the regulation of this gene's expression. Search of this sequence using the stemloop program yields more than 150 possible loops with as many as 18 stabilizing hydrogen bonds. Conceivably, population variants in the number of these repeats could also contribute to heterogeneity in DAT function.
- Example 2 Restriction Fragment Length Polymorphism (RFLP) analysis DNA was obtained from leukocytes, digested with Tagl, and analyzed by Southern blotting using pHCDAT7 as the initial hybridization probe. Simpler patterns were also obtained using two other hybridization probes. Taq 120 corresponds to bases 668 to 787 of the HDAT (see below) , and was generated by hybridizing 65 and 72 base oligonucleotides of opposite sense and extending the product using large fragment of DNA polymerase I and [ 32 P]-dCTP or by random priming of these two hybridized oligonucleotides, as described (A. Feinberg and B. Vogelstein, Analyt. Biochem. 132, 6-9 (1982); S.
- RFLP Restriction Fragment Length Polymorphism
- Hybridization survived washes of up to 68°C, consistent with specificity. A simpler pattern was revealed when hybridization was performed with the Taq 120 hybridization probe or with the cDNA hybridization probe Taq 492 (Fig. 4B) . Two hybridizing bands of 7 and 5.6 kilobases were observed and termed Al and A2. Taq I Al and A2 RFLP frequencies are presented the Table. Of 272 chromosomes from 136 individuals examined 36% showed the Al form, 64% showed the A2 form. There was a significant racial dimorphism in these distributions such that 26% of Caucasians, but 42% of blacks displayed the Al RFLP ( ⁇ 2 7.45, p ⁇ 0.01).
- the rich patterns of Taq I RFLPs identified with this cDNA sequence could relate to the fact that the clone itself contains three sites for Taq I cleavage. Further studies are thus likely to detect other polymorphisms, because extreme variability of bands in the initial Taq I restriction digestions has already been documented.
- the tandem repeat in the 3'region of this gene also provides a Variable Number Tandem Repeat (VNTR) .
- VNTR Variable Number Tandem Repeat
- the means for examining the distribution of alleles of the VNTR is set forth at the end of Example 3 below.
- the hybridization probes that we have described provide useful markers for linkage analysis that would help to exclude the regions around the dopamine transporter gene from involvement in familial disorders.
- Human dopamine systems are involved in a number of human disorders, with specific implication of involvement of transporter mechanisms in psychostimulant abuse, Parkinsonism, and Tourette's syndrome (E.J. Devor and CR. Cloninger, Annu. Rev. Genet. 23, 19-36 (1989); D. Pauls and J. Leckman, New Eng. J. Med. 315, 993-997 (1986); R. Pickens et al.. Arch. Gen. Psychiatry 48, 19-28 (1991) ; M.C. Ritz et al..
- RFLP Taql restriction fragment length polymorphism
- Subject Recruitment 288 Caucasian substance-using and control subjects were recruited from three sources; 21% were female. 224 drug-using and control volunteers consenting to research protocols at the Addiction Research Center (ARC) in Baltimore, Maryland were studied.
- the ARC is the major federal drug abuse research facility that recruits through advertisement and word of mouth for participation in treatment and non-treatment studies.
- a third group of users consisted of 52 HIV seronegative participants in an ongoing east Baltimore study of HIV infections in intravenous drug users (D. Vlahov et al., Am J Epid. 132, 847-856 (1990)). Each subject was individually interviewed to elicit information characterizing substance use. 192 users and 56 controls were assessed according to a quantity-frequency approach. 137 users met criteria for DSM-III-R psychoactive substance use disorders. 97 users received both assessments. Written informed consent was obtained from all subjects.
- DSM-III-R Diagnoses Trained interviewers administered the Diagnostic Interview Schedule Version III Revised (DIS-III-R, L.N. Robins et al..NIMH Diagnostic Interview Schedule Version III Revised (Version 11/7/89) . Department of Psychiatry, Washington University School of Medicine, St. Louis, MO.) to provide lifetime DSM-III-R diagnoses of psychoactive substance use disorders including nicotine and alcohol.
- DUS Drug Use Survey
- Structured Clinical Interview for DSM-III-R R.L. Spitzer et al. , Structured clinical interview for DSM-III-R - patient version (with psychotic screen) - SCID-P (W/Psvchotic Screen) - 5/1/89) .
- Cigarettes 0 never smoked cigarettes
- Marijuana 0 never used marijuana
- Amphetamines, 1 fewer than 1 use per week
- Hallucinogens, 3 7 or more uses/week (more than
- DNA Extraction and Analysis Blood was obtained in EDTA-containing evacuated sterile tubes from each subject and stored at 4°C and/or frozen at -70°C in polypropylene tubes. DNA was extracted from non-frozen samples after initial isolation of nuclei and from frozen blood by selective white blood cell sedimentation followed by standard extraction methods (J. Sambrook et al., eds. "Molecular cloning: a laboratory manual” (2nd edition) . Cold Spring Harbor (NY) Laboratory Press; 1989). 5-10 ⁇ g of this DNA was digested with Taql as recommended by the manufacturer, or with 20-fold excess of this enzyme for several individuals displaying A3 alleles. DNA fragments were electrophoresed using 0.8% agarose gels containing ethidium bromide at 1-2 volts per centimeter for 16 hours, transferred to nylon membranes, and immobilized by UV crosslinking.
- Hybridization was performed for 16-24 hours at 42°C in 50% formamide, 5xSSC, 50 mM NaP0 4 (pH 6.8), 1% SDS, lmM EDTA, 2.5 x Denhardt's solution, 200 ⁇ g/ml herring sperm DNA, and 4X10 6 cpm/ml of radiolabelled DNA (see below) . Washing for 20 minutes in 2XSSC at room temperature was followed by two 30 minute washes in 0.4 x SSC/0.5%SDS at 55°C Washed blots were exposed to Kodak XAR film 1-6 days with an intensifying screen at -70°C. Band sizes were compared to ⁇ DNA molecular weight standards, and with patterns previously defined (K.
- Hybridization probes A 1.7 kb BamHI fragment of the human genomic clone encoding the dopamine D 2 receptor ( ⁇ hD2Gl) was subcloned into the BamHI site on bluescript SK+ to produce phD2-9, which was used to detect Al, A2, and A3 patterns in the Southern analyses, as described (K. Blum et al., JAMA 263, 2055-2060 (1990) ; A.M. Bolos et al. , JAMA 264, 3156-3160 (1990)) (Dr Bruce O'Hara et al, unpublished data) . ⁇ hD2G2 was used to detect the Taql "B" patterns. DNAs were radiolabelled using random priming and 32 P-CTP to specific activities of approximately 10 9 cpm/ ⁇ g (A. Feinberg and B Vogelstein, Anal Biochem. 137, 266-267 (1984)).
- Tagl A and B RFLPs were assigned with 100% agreement between two independent raters.
- Taql A and B RFLP frequencies for substance-using and control subjects are presented in Table II.
- Table III presents Taql A and B genotypes (homozygotes and heterozygotes) for DUS-assessed controls and users.
- enominator total number of subjects with a given genotype (eg, Al/Al) .
- the RFLPs studied here are the result of polymorphic Tagl restriction sites in which "A" RFLPs are located ca. 9 kb 3' to the final exon of the D 2 receptor gene (0. Civelli, personal communication) and "B" RFLPs are located near the first coding exon (X.Y. Hauge et al., Genomics 10, 527-530 (1991)). These polymorphisms could have functional relevance if base pair differences directly influenced the gene's regulation.
- VNTR variable number tandem repeat
- Taql RFLP the Taql RFLP described in example 2
- examination of VNTR markers is preferred, as such markers have a larger number of alleles and hence are "more informative", i.e. VNTR markers identify more subtypes than a regular "site-no site” RFLP marker.
- the same methodology described above for the study of the D2 dopamine receptor gene can be employed. As shown above, particular attention must be paid to the diagnostic criteria for identifying the abuse behavior if the results are to be meaningful.
- DNA is obtained from leukocytes from research volunteers as described above.
- Genomic DNA 40 ng
- Genomic DNA 40 ng
- AmpliTaq DNA Polymerase (1.25 U)
- polymerase chain reaction with denaturing for 1 min at 93°C, and annealing/extension for 1 min at 72°C in buffer supplied by the manufacturer (Perkin- Elmer) .
- Oligonucleotides T3-5LONG (5'- TGTGGTGTAGGGAACGGCCTGAG-3', SEQ. ID. NO. 4)
- T7-3aLONG 5'-CTTCCTGGAGGTCACGGCTCAAGG-3', SEQ. ID. NO. 5'-CTTCCTGGAGGTCACGGCTCAAGG-3', SEQ. ID. NO. 5) are used at 0.5 uM final concentration.
- Reaction products are separated by 5% polyacrylamide gel electrophoresis, and product sizes estimated by comparison to molecular weight standards (BRL) .
- BTL molecular weight standards
- 242 of the 254 chromosomes examined displayed either 9 or 10 copies of the 40 basepair repeat. Two chromosomes showed three copies, two showed 5 copies, three showed 7, four showed 8 and one showed 11 copies of the VNTR. Among individuals with 9 and/or 10 copies per chromosome there were racial differences in copy number frequencies. Whites displayed 30% and Blacks displayed 20% of the 9-copy variant.
- the 3' VNTR marker defined by 9 versus 10 copies of the 40 basepair repeat displayed no significant linkage disequilibrium with the more 5' Taql RFLP ( ⁇ 2 values were 5.51 and 4.62 for White and Black subjects, respectively, with 8 degrees of freedom, p > 0.1.)
- Example 4 (predictive) Expression of HUDAT protein in Escherichia coli and purification of the bacterially expressed protein
- HUDAT protein expression in E. coli any of several expression systems can be utilized to obtain HUDAT protein expression in E. coli.
- the plasmid vector pFLAG system International Biotechnologies, Inc., New Haven, CT
- the open reading frame midportion of the HUDAT cDNA is obtained by digestion of the pHCDAT7 plasmid with EcoRI and purification of the insert fragment encoding the HUDAT protein by electrophoresis and elution from an agarose gel by standard techniques.
- Oligonucleotides having the sequences 5'-GGGTCTAGACG-3' and 5'- AATTCGTCTAGACCC-3' are annealed to form an adaptor and the adaptor is ligated to the ends of the insert DNA.
- the ligation product is digested with Xbal and cloned into the Xbal restriction site of the pFLAG vector (International Biotechnologies, Inc.).
- the appropriate E. coli host is transformed and colonies containing the HUDAT cDNA may be screened by colony hybridization using the pcHUDAT as probe.
- fusion protein is obtained in pure form by use of the monoclonal antibody affinity column as described by the manufacturer of the system, except that the elution buffer is modified by the addition of 0.5% CHAPS (3-[ (3-Cholamidopropyl)- dimethylammonio] 1-propane-sulfonate) .
- CHAPS 3-[ (3-Cholamidopropyl)- dimethylammonio] 1-propane-sulfonate
- Authentic DAT protein lacking the FLAG octapeptide is obtained by enterokinase cleavage of the fusion protein as described by the supplier of the FLAG system.
- Example 5 Purification of DAT from tissues or from transformed mammalian cells.
- protein isolated from transformed bacterial cells lacks post-translational modifications, such as sugar additions, that occur in mammalian cells, the purification of the protein from tranformed COS cells is discussed.
- COS cells transformed as described in are subjected to a purification protocol as described for the purification of the GABA transporter (Radian, et al., J. Biol. Chem. 261, 15437-15441 (1987) with the modification that binding of labelled CFT is used to assay for the presence of DAT in the sample rather than labelled gamma- amino butyric acid.
- the protocol is modified as required to allow the isolation of DAT as a distinct protein by techniques known to a practitioner of the art.
- mRNA obtained from tissue biopsy from a patient is converted subjected to quantitative reverse-transcript PCR (for example, see A. M. Wang, et al. PNAS USA 86:9717 (1989)) utilizing as primers oligonucleotides derived from the cDNA sequence of pcHUDAT.
- Variance in the expression level can be ascertained by comparison of product yield with a normal control.
- Abnormal mRNA structures can be diagnosed by observation of a product band of a length different from the normal control.
- Point mutants can be observed by use of primers and conditions appropriate for detection of the mismatch between the mutant and normal alleles. For example, the "reverse dot blot" procedure for screening the expression of several mutant alleles in a single experiment, which has been described for the CFTR gene, mutants of which cause cystic fibrosis (Erlich, H.A. , et al Science 252:1643 (1991).
- the HUDAT mRNA also contains a variable number tandem repeat element in the 3' untranslated portion of the mRNA which can be amplified for examination of an association between specific VNTR alleles and substance abuse behavior or diseases associated with expression of particular HUDAT alleles (See example 3) .
- Example 7 Use of dopamine transporter expression to incorporate as part of overexpression of a panel of dopaminergic genes to reconstruct a dopaminergic cell line for therapy in human diseases resulting from defective dopamine transporter expression.
- cDNAs for the human dopamine transporter, and for tyrosine hydroxylase and aromatic ammino acid decarboxylase (DOPA decarboxylase) are transfected into cell types including COS cells as described above.
- Cells are cotransfected with the neomycin resistance marker, selected by growth in G418, and then tested for their ability to synthesize and accumulate dopamine.
- Individual subclones may be able to take up dopamine, without the ability to synthesize it. However, individual subclones are also likely to integrate several of the plasmids.
- Example 8 (predictive) Production of variant sequences in HUDAT protein and testing of their biological function
- Site directed mutagenesis using olgonucleotides is used to introduce specific single- and multiple-base changes into the HUDAT cDNA that change specific amino acids in the HUDAT protein.
- the ability of mutant transporters to take up [ ] dopamine, [ 3 H] MPP+,and to bind [ 3 H] cocaine and cocaine analogues (especially [ 3 H] CFT) is tested as described previously (S. Shimada et al., Science 254, 576 (1991)).
- the Amersham mutagenesis system (version 2.1, technical bulletin code RPN1523) can be used.
- the largest difference in the structure of the proteins predicted by the human and the rat dopamine transporter dDNA sequences is the absence of one of the four consensus sites for N-linked glycosylation of the protein (See figure 3) .
- Site directed mutagaenesis can be performed as described for Example 8 introduce into the human DAT cDNA the asparagine residues to which N- linked sugars are attached and the remaining amino acids which constitute the glycosylation signal for that site that are found in the rat, but not the human cDNA.
- mutant proteins can be evaluated by photo-affinity labelling of the protein and analysis by SDS-PAGE. Digestion of the protein with various glycosidases can be performed to assess the degree to which the pattern of glycosylation has been altered, as described by Lew et al. (R. Lew et al.. Brain Research 539, 239 (1991)).
- More detailed information regarding the sugar structure can be obtained by exoglycosidase digestion experiments. For example, the presence of sialic acid residues in the polysaccharide can be detected by digestion with neuraminidase.
- the influence of the polysaccarhide structure on function of the protein is then assessed by testing the properties of the the transporter using either stably transfected cells expressing the mutant protein, or by using cells transiently expressing the mutant transporter on their surface.
- the means for carrying out such functional studies are described by Shimada et al. (S. Shimada et al., Science 254, 576 (1991)).
- Example 10 Cell lines expressing HUDAT protein on the cell surface can be used to screen candidate compounds for efficacy as dopamine (or cocaine or functional analogs thereof) agonists or antagonists by evaluating the influence of the candidate compound upon the binding of dopamine (or cocaine or functional analogs thereof) to the surface of such cells.
- Another assay for dopamine agonist or antagonist activity is to measure the cytotoxicity to such cells of MPP* to such cells in the presence and absence of the candidate compound.
- Such assays are described using cells expressing the rat DAT cDNA in Shimada et al. (S. Shimada et al., Science 254, 576 (1991)) and can be applied as well to cells expressing the human DAT cDNA.
- HUDAT protein obtained as described above or synthetic polypeptides of amino acid sequence derived from the HUDAT sequence are used as immunogens in an appropriate animal.
- the serum is obtained from the immunized animal and either utilized directly or the antibody may be purified from the serum by any commonly utilized techniques.
- Polyclonal antibody directed only toward HUDAT can be isolated by use of an affinity column derivatized with the immunogen utilized to raise the antibody, again using techniques familiar to one knowledgable in the art.
- Monoclonal antibodies to HUDAT or to particular epitopes of HUDAT may be produced by immunization of an appropriate animal with HUDAT protein obtained as above or with peptides of amino acid sequence derived from the HUDAT amino acid sequence. Hybridoma cultures are then established from spleen cells as described by Jaffe and McMahon-Pratt (Jaffe, CL. and MacMahon-Pratt, D. J. Immunol. 131, 1987-1993 (1983)). Alternatively, peripheral blood lymphocytes may be isolated and immortalized by transformation with Epstein-Barr virus.
- the death of dopaminergic neurons in the brain of a patient should result in the accumulation in the cerebrospinal fluid, which bathes these cells, of membrane debris as a product of lysis of the dead cells.
- Other pathologic conditions, short of cell death that result in the release of DAT protein, or degraded peptide fragments of HUDAT protein into the surrounding medium can also be imagined.
- the cerebospinal fluid can be sampled by lumbar puncture of a patient.
- the presence of degradation products of HUDAT protein is detected by immunoassay, using as the primary antibody at least one of the products obtained as described above.
- Elevated levels of HUDAT protein detected in the cerebrospinal fluid compared with the range seen in normal controls is indicative of Parkinsons's disease or drug-induced neurotoxicity.
- disease progression can be monitored by the assessment of HUDAT levels in serial samples from the same patient.
- ORGANISM Homo sapiens
- F TISSUE TYPE: brainstem
- GAATTCCCGC TCTCGGCGCC AGGACTCGCG TGCAAAGCCC AGGCCCGGGC GGCCAGACCA 60 AGAGGGAAGA AGCACAGAAT TCCTCAACTC CCAGTGTGCC C ATG AGT AAG AGC 113
- CAGTCTGTTC AGAGGCATTG GAGGATGGGG GTCCTGGTAT 2591
- CAGCTCAGGC TACTGCCACT CAGGCAGCCT GTGGGTCCTT GTGGTGTAGG GAACGGCCTG 2711
Abstract
The cloning and characterization of a human dopamine transporter (HUDAT) cDNA is described. RFLP analysis is used to determine the distribution of HUDAT alleles in two ethnic backgrounds. The means by which the association between HUDAT alleles and behavioral disorders which have altered HUDAT expression as a basis for their etiology is discussed. Methods for evaluating the expression of HUDAT are described.
Description
SEQUENCE OF HUMAN DOPAMINE TRANSPORTER cDNA
RELATED APPLICATIONS
This application is a Continuation-In-Part of U.S. Patent Application Serial Number 07/762,132, filed September 20, 1991, which is hereby incorporated in its 5 entirety by reference.
BACKGROUND OF THE INVENTION Field of the Invention
The invention relates to a cloned cDNA which encodes the human dopamine transporter protein. The 10 cloned cDNA provides a means of expressing human dopamine transorter protein in a variety of contexts and also provides a means of diagnosing and treating diseases presenting abnormal expression of dopamine transporter protein.
15 Description of the Related Art
Throughout this application, reference is made to articles of the scientific literature and the like.
The entire content of such citations is hereby incorporated by such reference. 20 The dopamine transporter that acts to take released dopamine back up into presynaptic terminals
has been implicated in several human disorders. Cocaine binds to the dopamine transporter and blocks dopamine reuptake in a fashion that correlates well with cocaine reward and reinforcement (M.C. Ritz et al., Science 237, 1219 (1987)). Neurotoxins that cause Parkinsonian syndromes are concentrated in dopaminergic neurons by this transporter (S.H. Snyder and R.J. D'Amato, Neurology 36, 250 (1986); G. Uhl, Eur. J. Neurol. 30, 21 (1990)). Binding to the dopamine transporter is altered in brains of patients with Tourette's syndrome (H.S. Singer et al., Ann. Neurol. 30, 558 (1991)). These clinical links enhance interest in the structure and function of the human dopamine transporter (HUDAT) . Vulnerability to these disorders may have genetic components (E.J. Devor and CR. Cloninger, Annu. Rev. Genet. 23, 19 (1989) ; D. Pauls and J. Leckman, New Eng. J. Med. 315, 993 (1986); R. Pickens et al., Arch. Gen. Psychiatry 48, 19 (1991)); thus identification of linkage markers for the human DAT is also of interest. Dopamine transporters act to terminate dopaminergic neurotransmission by sodium- and chloride- dependent reaccumulation of dopamine into pre-synaptic neurons (L.L. Iversen, in Handbook of Psychopharmacology, L.L. Iversen, S.J. Iversen, & S.H. Snyder, Eds. (Plenum, New York, 1976), pp. 381-442; M.J. Kuhar and M.A. Zarbin, J. Neurochem. 31, 251 (1978); A.S. Horn, Prog. Neurobiol. 34, 387 (1990)).
Cocaine and related drugs bind to these transporters in a fashion that correlates well with their behavioral reinforcing and psychomotor stimulant properties; these transporters are thus the principal brain "cocaine receptors" related to drug abuse (M.C. Ritz, R.J. Lamb, S.R. Goldberg, M.J. Kuhar, Science 237,1219 (1987); J. Bergman, B. K. Madras, S. E.
Johnson, R. 0. Spealman, J. Pharmacol. Exp. Ther. 251,
150 (1989)). The transporters accumulate neurotoxins with structural features resembling dopamine; their ability to concentrate the parkinsonism-inducing toxin MPP+ (l-methyl-4-phenylpyridinium) is key to this agent's selective dopaminergic neurotoxicity (S.H.
Snyder, and R. J. D'Amato, Neurology 36(2), 250 (1986);
S. B. Ross, Trend. Pharmacol. Sci. 8, 227 (1987)).
Studies of the dopamine transporter protein suggest that it is an 80 kDa glycoprotein, but have not yet yielded protein sequence data (D.E. Grigoriadis, A.A.
Wilson, R. Lew, J.S. Sharkey & M.J. Kuhar, J. Neurosci.
9, 2664 (1989)). Binding of cocaine analogs such as
[3H]CFT to membranes prepared from dopamine-rich brain regions reveals two sites with differing affinities (F.
Javory-Agid, and S.Z. Langer, Naunyn-Schmiedeberg's
Arch. Pharmacol. 329, 227 (1985) ; J.W. Boja, and M.J.
Kuhar, Eur. J. Pharmacol. 173, 215 (1989); B.K. Madras et al., Mol. Pharmacol. 36, 518 (1989); M.J. Kuhar et al., Eur. J. Neurol. 30(1), 15 (1990); M.C Ritz, E.J.
Cone, M.J. Kuhar, Life Sci. 46, 635 (1990).; D.O.
Calligaro, and M.E. Eldefrawi, J. Pharmacol. Exp. Ther.
243, 61 (1987); B.K. Madras et al., J. Pharmacol. Exp.
Ther. 251(1), 131 (1989); M.C. Ritz et al., J. Neurochem. 55, 1556 (1990)).
Recent elucidation of cDNAs encoding dopamine transporters from experimental animals (B. Gros et al.,
FEBS Lett. 295, 149 (1992); J.E. Kilty et al.. Science
254, 578 (1991); S. Shimada et al.. Science 254, 576 (1991); T.B. Usdin et al., Proc. Natl. Acad. Sci. USA
88, 11168 (1991) provides hybridization probes useful for isolation of their human cognate.
Summary of the Invention
Described herein is a cDNA (pcHUDAT) , which encodes the human dopamine transporter protein (HUDAT) . Also described are unique features of the nucleotide sequence of the pcHUDAT predicted for its encoded mRNA and protein, restriction fragment length polymorphisms (RFLPs) and Variable Number Tandem Repeats (VNTRs) identified by this cDNA and estimates of race-specific population frequencies of these RFLPs and VNTRs. By virtue of its representation of the human dopamine transporter sequence, the pcHUDAT is advantageous over those clones isolated from other species in that better results in applications having a human context would be expected. The cDNA encoding the human dopamine transporter protein (HUDAT) provides a means for diagnosing and treating disorders that arise by expression of abnormal amounts of or dysfunctional dopamine transporter molecules in a human being. It is one object of the invention to produce a cDNA that encodes the human dopamine transporter protein, a product of dopaminergic neurons that binds dopamine, cocaine and cocaine analogs and will transport dopamine and MPP+ into mammalian cells expressing it on their surface. It is a further object of the invention to utilize the cDNA to produce cell lines that express human DAT on their surface and to provide a method for the screening of compounds that influence the binding and/or transport of dopamine or cocaine or functional analogs thereof to (into) the cells. Such cell lines may also find therapeutic application for treatment of diseases caused by depletion of cell populations which normally provide for uptake of dopamine.
A third object of the invention is to provide diagnostic means for assessing HUDAT expression in patients by DNA- or antibody- based tests and for assessing the onset or progression of disease by assay of HUDAT degradation.
These and other objects are accomplished by providing a cDNA encoding the dopamine transporter protein and a purified polypeptide conferring upon cells the phenotype of dopamine uptake from the surrounding extracellular medium. Further, the invention is embodied in cell lines, created by stable transformation of cells by a vector encoding the dopamine transporter protein, expressing the dopamine transporter protein on their surface. Another aspect of the invention relates to a method of using such lines to screen pharmaceutical compositions for their ability to inhibit the binding of dopamine, cocaine or analogs of these compounds to the transporter protein. Such a screening can also be accomplished by use of cells transiently expressing dopamine transporter cDNA.
The invention also relates to diagnostic applications of the dopamine transporter cDNA and anti-human DAT antibodies and to therapeutic applications of the HUDAT cDNA.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 (SEQ. ID. NO. 1) shows the nucleotide sequence of the pcHUDAT cDNA encoding the human dopamine transporter protein; the sequence is a composite derived from the sequence of clones pHCDAT2, pHCDAT3 and pHCDAT7.
Figure 2 shows the sequences of the repeat elements in the 3' untranslated portion of the pcHUDAT cDNA. Also shown is the consensus sequence of the
repeats.
Figure 3 shows a comparison of the amino acid sequence of the human dopamine transporter (Hdat) protein with the amino acid sequence of the rat DAT (Datl) and also with the sequences of the human norepinephrine transporter (Hnat) and of the human gamma-amino-butyric acid transporter (Hgabat) .
Figure 4A shows a representative RFLP analysis of human genomic DNA from nine unrelated individuals digested with Taql and hybridized with the insert portion of the pHCDAT7 plasmid. Figure 4B shows the same DNA, but hybridized with the Taq492 probe, which corresponds to nucleotides 301-793 of the pcHUDAT sequence.
DETAILED DESCRIPTION OF THE INVENTION
For many of the applications described in the examples below subfragments or variants of the HUDAT protein disclosed in the present application wherein the original amino acid sequence is modified or changed by insertion, addition, substitution, inversion or deletion of one or more amino acids are useful so far as they retain the essential binding or transport specificity for dopamine, cocaine, or functional analogs thereof. Thus, such variants of the HUDAT are considered to fall within the scope of the present invention. Such variants are easily produced by mutagenic techniques well developed in the art of genetic engineering.
Expression of heterologous proteins in E. coli is often utilized as a means of obtaining large quantities of a polypeptide. The product is an unglycosylated protein, which may be made as insoluble "inclusion bodies" in the bacterial cells. Alternatively, some
proteins can be secreted into the perplasmic space by fusion to a leader sequence that directs the secretion of the translation product. Other useful fusion sequences are those which allow affinity purification of the product, such as the pGEX system (Pharmacia) , which allows purification by use of a glutathione- Sepharose column.
The promoter to be employed is dependent upon the particular protein to be expressed. Some proteins are not detrimental to the physiology of the bacteria and may be expressed using a high-level constitutive promoter. Others are somewhat toxic and so are best expressed from an inducible promoter which keeps synthesis of the heterologous protein repressed until growth of the culture is complete. The promoter is then switched on and the heterologous protein is produced at a high level.
Other considerations in bacterial expression include the use of terminator seqences in the transcription unit and the use of sequences in the 5'- untranslated portion of the mRNA to abolish secondary structure which might impede translation. Also the choice of bacterial strain can be important. Some heterologous proteins are susceptible to proteolytic degradation and so are best expressed in strains of bacteria which lack proteolytic functions. Also, strains of bacteria other than E. coli are often useful as hosts for expression systems. The best-developed alternative currently being Bacillus strains. Expression of proteins in bacteria is well- reviewed in "Current Protocols in Molecular Biology", which is published with quarterly updates by Wiley Interscience.
Expression of "foreign" proteins in mammalian cells can be accomplished in two general fashions. Transient expression refers to the creation of a pool of transfected cells which harbor plasmids that are not stably maintained in the cell and so are gradually diluted out of the population. Transient expression is by nature a short term method. For reproducible expression of a heterologous protein, stable expression systems are preferable. The current state of this art includes a variety of vector systems; both integrative and autonomous vectors are available. Inducible expression of heterologous proteins in mammalian cells is difficult to achieve at the current time. Some systems have been described, but they are not yet in general use. More commonly used are vectors bearing moderate to high- level constitutive promoters. Plasmid vectors are relatively easy to use. Retroviral vectors, which rely upon packaging into infective viral particles and integration into the host cell chromosome are more difficult to use, due to the extra steps involved in creating the recombinant viruses and cell lines which secrete them, but have the advantage that they effectively introduce exogenous DNA into human cell lines. Vaccinia virus vector systems are also in widespread use. Other viral vectors are under development for gene therapy systems, including adenovirus-derived vectors.
The preferred embodiments of the invention are described by means of the following examples. These examples are intended to be illustrative, rather than limiting in scope. It is understood that variations in the materials and techniques described below will be apparent to those skilled in art and such are to be
considered to fall within the scope and spirit of the instant application.
Example 1 Isolation and sequencing of cDNA encoding human dopamine transporter
To isolate human cDNAs for the dopamine transporter, cDNA libraries prepared from "substantia nigra" and "brainstem" dissections containing cells known to express the transporter were screened with hybridization probes prepared from the rat cDNA, pDATl (S. Shimada et al.. Science 254, 576 (1991)). Sequences from the 3' untranslated region of the rat cDNA were not used because of the presence of CA dinucleotide repeats. Human brain stem and substantia nigra cDNA libraries (Stratagene, La Jolla, CA) were plated and blotted onto duplicate replica nitrocellulose (Schleicher and Schuell, Keene, NH) filters, which were incubated for 1 hour at 37°C with proteinase K (50 μg/μl in 2 x SSPE/0.1%SDS) to reduce filter background, washed in 5 x SSC/0.5% SDS/lmM EDTA, prehybridized and hybridized at 42°C, and washed at 54°C in 0.4 x SSC/0.5% SDS. The hybridization probe was a 2300 bp Eco RI fragment of the rat dopamine transporter cDNA6 (S. Shimada et al.. Science 254, 576 (1991)) [32P] labeled by random priming (Prime It kit, Boehringer Mannheim) , and hybridized at approximately 106 cpm/ml. Positively-hybridizing cDNA clones were purified from the brainstem library, autoexcized according to protocols provided by the manufacturer (Stratagene) , and termed pHCDAT2, pHCDAT3, and pHCDAT7. Sequencing was performed on an Applied Biosystems automated sequencer as described (S. Shimada et al., Science 254, 576 (1991)) . Sequence analysis was performed using the
GCG software package (J. Devereaux, et al., Nucleic Acids Res. 12, 387 (1984)).
Screening of more than 2 x 106 plaques from the substantia nigra library produced no positives. Screening 1 x 106 plaques from the brainstem library yielded 11 positively-hybridizing plaques, three of which were identified as human DAT clones by sequence analysis. These clones were identified as representing the 5'-half (pHCDAT2, bases 1-1733), the 3'-half (pHCDAT3, bases 1679-3919), and an internal portion (pHCDAT7, bases 653-1434) of the human DAT cDNA whose reconstructed full-length sequence is shown in Figure 1. The structure of this cDNA resembles the structure of the rat cDNA DAT1, with a modest 5' untranslated region and a long 3' untranslated region. Both 5' and 3' untranslated regions are longer than those of the rat cDNA pDATl, however, making the length of the predicted human mRNA greater than the 3.7 kb observed for the rat mRNA (S. Shimada et al.. Science 254, 576 (1991)). A striking difference between rat and human cDNAs is found in the 3' untranslated region where the human cDNA displays 10 copies of a 40 bp repetitive element that are arrayed in head-to-tail fashion and are absent from the rat cDNA (Figs 1,2). These elements are highly stereotyped. The sequence of each element is more than 90% identical to the consensus sequence listed at the bottom of Figure 2, although the seventh repeat displays a 5 base pair insertion from its 24th to 28th nucleotides. The consensus element found here is 68% G+C No exact match is found in searches of the EMBL/genbank data base, release 70. However, sequences conferring up to 70% nucleic acid identity over up to 37 of these bases are found in
viral sequences, especially with herpesvirus sequences (e.g. locus HS1US) .
The open reading frame predicted by the HUDAT cDNA encodes 620 amino acids, identical in size to the rat DATl cDNA except for an additional amino acid (199) not found in the rat sequence (Fig. 3) . This open reading frame predicts amino acid sequences that are 94% identical to those encoded by the rat dopamine transporter cDNA (S. Shimada et al., Science 254, 576 (1991)). This high degree of conservation, and the weaker identities with the human norepinephrine and GABA transporter cDNA (H. Nelson et al., FEBS Lett. 269, 181-184 (1990); T. Pacholczyk et al., Nature 350, 350-354 (1991)) (Fig. 3), identifies this as the human homolog of the rat DATl.
The amino acid sequence predicted by the HUDAT cDNA reveals interesting differences from the rat cDNA. It lacks one of the 4 consensus sites for N-linked glycosylation noted in the rat DATl cDNA (Fig 3, + symbol) . Three adjacent amino acids distinguish the human from the rat proteins at this locus; no other portion of the molecule differs by this extent.
Human DAT amino acids predicted to lie within hydrophobic, putative transmembrane domains show 97% amino acid identity between the rat and human transporter cDNAs. This conservation is higher than the 87% conservation in regions thought not to span the membrane, and is consistent with the high conservation in these regions among different sodium dependent transporter family members. The most striking difference between the rat and human transporters occurs in the putative second extracellular domain, at which each of the transporters cloned to date displays consensus sites for N-linked glycosylation. The
glycosylation of the rat dopamine transporter has been defined in biochemical studies that suggest 20 to 30 kd of the molecular weight of the mature protein may consist of sugar (R. Lew et al.. Brain Research 539, 239 (1991) . Four potential N-linked glycosylation sites indicated in the rat transporter contain classic asparagine - X - serine/threonine sequences. Three of these sites are conserved among the rat and human sequences, but a middle glycosylation site, potentially the most distant from the embedding membrane, is absent in the human transporter. The amino acids surrounding this site provide the largest area of amino acid sequence divergence between the rat and human transporters. If glycosylation is evenly distributed among the different potential sites for N-linked glycosylation, these observations would predict that the human dopamine transporter might display less glycosylation than the rat, and that its molecular weight might be correspondingly smaller. The function of the glycosylation has not been identified to date; changes in ligand recognition, membrane targeting of the molecule, or even in cell/ cell recognition might conceivably result from these differences in glycosylation. The repeated motifs in the 3' untranslated regions of these cDNAs are another interesting difference from the rat sequence. Smaller polymorphic repeated elements have gained recent attention due to their implication in the fragile X syndrome and myotonic dystrophy (J.D. Brook, et al., Cell 68, 799-808 (1992) ; Y. Fu et al., Cell 67, 1047-1058 (1991) ; V.A. McKusick, Mendelian Inheritance in Man. 9th edn. , Johns Hopkins University Press, Baltimore, 1990, 2028 p.). The rat sequence does demonstrate 25 copies of a small
dinucleotide CA repeat from bases 2476 to 2525 of the 3' untranslated region of its mRNA; CA repeats are absent from the human cDNA (S. Shimada et al., Science 254, 576-578 (1991)). The sequence of the longer hDAT repeated element is not found the rat cDNA, nor in searches of other sequences found in databanks. The significance of the partial matches in viral genomes is unclear. These repeated elements might alter mRNA properties, perhaps including secondary structure and/or half-life, in ways that could contribute to the regulation of this gene's expression. Search of this sequence using the stemloop program yields more than 150 possible loops with as many as 18 stabilizing hydrogen bonds. Conceivably, population variants in the number of these repeats could also contribute to heterogeneity in DAT function.
Example 2 Restriction Fragment Length Polymorphism (RFLP) analysis DNA was obtained from leukocytes, digested with Tagl, and analyzed by Southern blotting using pHCDAT7 as the initial hybridization probe. Simpler patterns were also obtained using two other hybridization probes. Taq 120 corresponds to bases 668 to 787 of the HDAT (see below) , and was generated by hybridizing 65 and 72 base oligonucleotides of opposite sense and extending the product using large fragment of DNA polymerase I and [32P]-dCTP or by random priming of these two hybridized oligonucleotides, as described (A. Feinberg and B. Vogelstein, Analyt. Biochem. 132, 6-9 (1982); S. Shimada et al., Science 254, 576-578 (1991)). Identical results were also obtained using a random-priming labeled 492 base pair cDNA fragment (Taq
492) corresponding to bases 301 to 793 of this sequence. Probes were hybridized to filters containing DNA from unrelated individuals at 42°C in hybridization solution containing 50% formamide as described. Identical results were obtained with final washes at 68°C in 0.2 X SSC/0.2% SDS or at 54°C in 0.4 X SSC/0.5% SDS. Patterns from these Southern blots were analyzed by two independent observers.
Digestion of DNA from 20 unrelated individuals with nine different restriction endonucleases revealed Southern blot patterns in each case that were consistent with the presence of a single gene. There were no clear interindividual differences in Southern blot restriction patterns using radiolabeled pHCDAT 7 after digestion with Alu I, Bam HI, Eco RI, Hae III, Hind III, Msp I, and Rsa I. Three other enzymes, Pst I, Hinf I and Tag I revealed polymorphisms. We focused on the polymorphisms identified by Taq I. When probed with radiolabeled pHCDAT 7, more than six bands were obtained from Taq I restricted DNA, many of which showed polymorphic patterns (Fig 4A) . Hybridization survived washes of up to 68°C, consistent with specificity. A simpler pattern was revealed when hybridization was performed with the Taq 120 hybridization probe or with the cDNA hybridization probe Taq 492 (Fig. 4B) . Two hybridizing bands of 7 and 5.6 kilobases were observed and termed Al and A2. Taq I Al and A2 RFLP frequencies are presented the Table. Of 272 chromosomes from 136 individuals examined 36% showed the Al form, 64% showed the A2 form. There was a significant racial dimorphism in these distributions such that 26% of Caucasians, but 42% of blacks displayed the Al RFLP (χ2=7.45, p< 0.01).
The rich patterns of Taq I RFLPs identified with this cDNA sequence could relate to the fact that the clone itself contains three sites for Taq I cleavage. Further studies are thus likely to detect other polymorphisms, because extreme variability of bands in the initial Taq I restriction digestions has already been documented.
The tandem repeat in the 3'region of this gene also provides a Variable Number Tandem Repeat (VNTR) . The means for examining the distribution of alleles of the VNTR is set forth at the end of Example 3 below.
The hybridization probes that we have described provide useful markers for linkage analysis that would help to exclude the regions around the dopamine transporter gene from involvement in familial disorders. Human dopamine systems are involved in a number of human disorders, with specific implication of involvement of transporter mechanisms in psychostimulant abuse, Parkinsonism, and Tourette's syndrome (E.J. Devor and CR. Cloninger, Annu. Rev. Genet. 23, 19-36 (1989); D. Pauls and J. Leckman, New Eng. J. Med. 315, 993-997 (1986); R. Pickens et al.. Arch. Gen. Psychiatry 48, 19-28 (1991) ; M.C. Ritz et al.. Science 237, 1219-1223 (1987); S. Shimada et al., Science 254, 576-578 (1991); H.S. Singer et al., Ann. Neurol. 30, 558-562 (1991); S.H. Snyder S.H. and R.J. D'Amato, Neurology 36, 250-258 (1986); G. Uhl, Eur. J. Neurol. 30, 21-30 (1990)). The human dopamine transporter cDNAs and RFLP information described here should provide useful tools to study its possible role in these and other human disorders.
Example 3 (Predictive) A genetic component of substance abuse behavior identified bv RFLP analysis of the human DAT gene
Abuse of substances, including drugs and alcohol, is currently viewed as arising from a combination of biological, psychological, and social factors (J.S. Searles, J Abnorm Psychol. 97,153-167 (1988) ; E.J. Devor and CR. Cloninger, Annu Rev Genet. 23, 19-36 (1989) ; K.R. Merikangas, Psychological Medicine 20, 11-22 (1990)). Genetic contributions to susceptibility to alcoholism are supported by family, twin, and adoption studies. (D.S. Goodwin, Arch Gen Psychiatry 36, 57-61 (1979); CR. Cloninger et al., Arch Gen Psychiatry 38, 861-868 (1981) ; CR. Cloninger, Science 236, 410-416 (1987)). A genetic component of vulnerability to drug abuse has also been suggested in both twin and adoption studies (R.J. Cadoret et al., Arch Gen Psychiatry 43, 1131-1136 (1987); R.W. Pickens et al., Arch Gen Psychiatry 48, 19-28 (1991)). A number of substances which share the potential for abuse by humans also share the ability to enhance dopamine activity in mesolimbic/mesocortical circuits thought to be important for behavioral reward and reinforcement (A.S. Lippa et al., Pharmacol Biochem Behav. 1, 23-28 (1973) ; G. Di Chiara and A. Imperato, Proc Natl Acad Sci USA 85, 5274-5278 (1988); R.A. Wise and P.P. Rompre, Annu Rev Psychol. 40, 191-225 (1989)). Cocaine's ability to inhibit re-uptake of dopamine, for example, points strongly toward a possible direct action for this highly-reinforcing drug in these dopaminergic circuits (M.C. Ritz et al., Science 237, 1219-1223 (1987); D.E. Grigoriadis et al., J Neurosci. 9, 2664-2670 (1989)) .
Blum, Noble and co-workers first reported that the "Al" Taql restriction fragment length polymorphism (RFLP) of the human dopamine D2 receptor gene (DRD2, D.K. Grandy et al.. Am J Hum Genet. 45, 778-785 (1989)) was associated with alcoholism (K. Blum et al., JAMA 263, 2055-2060 (1990)); 69% of alcoholics displayed this RFLP compared to 20% of non-alcoholics. 42% of 504 Caucasian alcoholic individuals reported in literature to date display this RFLP, while only 27% of 461 Caucasian "control" individuals are Al positive (G.R. Uhl et al.. Arch Gen Psychiatry 49, 157-160 (1992); E. Turner et al., Biol Psychiatry 31, 285-290 (1991) . These data come from eight previous studies, five of which find significant associations between RFLP and alcoholism, and provide evidence for a significant association between gene markers and behavior.
Examination of gene marker/behavior associations in drug abusers raises several methodological concerns. Relatively few individuals who abuse drugs abstain from alcohol, and many individuals who use drugs often self-administer multiple substances (D.R. Wesson et al., eds. Polvdrug Abuse: The Results of a National Collaborative Study. New York, NY: Academic Press, Inc.; 1978). Drug-using populations may also differ from one another and from the general population in racial, ethnic and other features that might be associated with altered distributions of the alleles for different genes (M.R. Gillmore et al., Am J Drug Alcohol Abuse 16, 185-206 (1990)). Also, some clinical assessments may not focus on the heritable features of the disorder (R.W. Pickens et al., Arch Gen Psychiatry 48, 19-28 (1991)).
A study of D2 dopamine receptor gene markers in polysubstance users and control subjects provides a useful model for investigating the association between alleles of the DAT gene and substance abuse behaviors or other behavioral disorders such as Tourette's syndrome. We have investigated the 3' Tagl Al RFLP examined in previous studies of alcoholics, and a more 5' Taql RFLP ("B") located closer to regulatory and structural/coding regions of the gene (X.Y. Hauge et al., Genomics 10, 527-530 (1991)). Only Caucasian individuals were included in this study because of evidence for different distributions of Taql A and B markers in white and black individuals (Dr Bruce O'Hara et al, unpublished data) . Substance users were identified according to two approaches. One group of users met criteria for lifetime DSM-III-R (Diagnostic and Statistical Manual of Mental Disorders. Revised Third Edition. Washington, DC: American Psychiatric Association; 1987) psychoactive substance use disorder(s) . A second group of users was identified based on their peak lifetime use of psychoactive substances. This quantity-frequency approach was chosen because of evidence that heavy use of alcohol may display significant heritability in males and females (R.W. Pickens et al.. Arch Gen Psychiatry 48, 19-28 (1991)). Control subjects were free of significant lifetime substance use.
i) Subject Recruitment: 288 Caucasian substance-using and control subjects were recruited from three sources; 21% were female. 224 drug-using and control volunteers consenting to research protocols at the Addiction Research Center (ARC) in Baltimore, Maryland were studied. The ARC is the major federal drug abuse
research facility that recruits through advertisement and word of mouth for participation in treatment and non-treatment studies. 12 volunteers from a chronic hemodialysis unit on the same campus, both users and controls, augmented this sample. A third group of users consisted of 52 HIV seronegative participants in an ongoing east Baltimore study of HIV infections in intravenous drug users (D. Vlahov et al., Am J Epid. 132, 847-856 (1990)). Each subject was individually interviewed to elicit information characterizing substance use. 192 users and 56 controls were assessed according to a quantity-frequency approach. 137 users met criteria for DSM-III-R psychoactive substance use disorders. 97 users received both assessments. Written informed consent was obtained from all subjects.
Quantity-Freguency Approach: Trained interviewers assessed subjects with the Drug Use Survey (DUS) interview (see below) in a confidential setting. The amount, frequency, and/or dollar cost at the time of lifetime peak use were recorded for each of 15 different psychoactive drugs or drug classes used more than five times. Blinded ratings of lifetime peak use of each individual substance were made on a four-point scale: 0=absent, l=minimal, 2=moderate, or 3=heavy use as indicated in Table I. A composite "Total Use" index was constructed from the pooled ratings of use of all individual substances as follows: "0" = up to minimal use of alcohol, marijuana, or nicotine and no use of other drugs; "1" = moderate use of alcohol or nicotine and/or minimal use of other drugs; "2" = heavy use of alcohol or nicotine, moderate use of marijuana, and/or up to moderate use of other drugs; "3" = heavy use of any illicit drug. Thus, neither heavy use of alcohol
or nicotine was sufficient to confer a rating of heavy total drug use. Control subjects were identified as those individuals with Total Use scores of 0 or 1; substance abusers were individuals with Total Use scores of 2 or 3.
DSM-III-R Diagnoses: Trained interviewers administered the Diagnostic Interview Schedule Version III Revised (DIS-III-R, L.N. Robins et al..NIMH Diagnostic Interview Schedule Version III Revised (Version 11/7/89) . Department of Psychiatry, Washington University School of Medicine, St. Louis, MO.) to provide lifetime DSM-III-R diagnoses of psychoactive substance use disorders including nicotine and alcohol. Reliability and Validity of Drug Use Information: Drug Use Survey (DUS) ratings were evaluated in subjects who were: (a) assessed with the DUS on two different occasions at 3 to 13 months apart (n=31) , (b) tested for lifetime DSM-III-R psychoactive substance use disorders by the DIS-III-R23 (n=18) and the Structured Clinical Interview for DSM-III-R (R.L. Spitzer et al. , Structured clinical interview for DSM-III-R - patient version (with psychotic screen) - SCID-P (W/Psvchotic Screen) - 5/1/89) . Biometrics Research Department, New York State Psychiatric Institute, New York, New York) (SCID; n=17) , and (c) checked for urinary excretion of psychoactive drugs and metabolites on the day of the DUS (n=56) . For the 18 DIS-III-R-assessed subjects and the 17 SCID-assessed subjects, DUS ratings were completed without knowledge of psychiatric assessment information. Genotypes were not available for 17 subjects assessed with the SCID and were not included in the genetic analyses.
Table I. Drug Use Survey - Rating Criteria
Substance
Cigarettes 0 — never smoked cigarettes
1 = 1 to 15 cigarettes per day
2 = 16 to 25 cigarettes per day
3 — more than 25 cigarettes per day
Alcohol 0 = never used alcohol
1 = up to 4 drinks per drinking occasion, fewer than 10 drinking occasions/month = up to 4 drinks per drinking occasion, more than 10 drinking occasions/month, OR, more than 4 drinks per drinking occasion, but fewer than 10 drinking occasions/month
= 5 or more drinks per drinking occasion, more than 10 drinking occasions/month
= never used heroin/other opiates
= used 1 time/week or less than
= 2 to 6 times/week , spending $30 to
= daily use, typically spending >
= never used cocaine
2 == 2 to 4 grams per week (more than
$150/week but less than $300/week)
3 = more than 4 grams per week, usually up to 7 to 10 grams/week; (more than
$300/week, usually much higher; daily use common)
Marijuana 0 = never used marijuana
1 = up to one joint/day
2 = 2 to 3 joints per day
3 = 4 or more joints per day
Minor
Tranquilizers, 0 - never used substance
Amphetamines, 1 = fewer than 1 use per week
Barbiturates, 2 = 1 to 6 uses per week (4 to 24 uses/month)
Hallucinogens, 3 = 7 or more uses/week (more than
24 Inhalants, PCP, uses/month)
Antidepressants,
Other Tobacco products,
Other Substances
ii) DNA Extraction and Analysis: Blood was obtained in EDTA-containing evacuated sterile tubes from each subject and stored at 4°C and/or frozen at -70°C in polypropylene tubes. DNA was extracted from non-frozen samples after initial isolation of nuclei and from frozen blood by selective white blood cell sedimentation followed by standard extraction methods (J. Sambrook et al., eds. "Molecular cloning: a laboratory manual" (2nd edition) . Cold Spring Harbor (NY) Laboratory Press; 1989). 5-10 μg of this DNA was digested with Taql as recommended by the manufacturer, or with 20-fold excess of this enzyme for several individuals displaying A3 alleles. DNA fragments were electrophoresed using 0.8% agarose gels containing ethidium bromide at 1-2 volts per centimeter for 16 hours, transferred to nylon membranes, and immobilized by UV crosslinking.
Hybridization was performed for 16-24 hours at 42°C in 50% formamide, 5xSSC, 50 mM NaP04 (pH 6.8), 1% SDS, lmM EDTA, 2.5 x Denhardt's solution, 200 μg/ml herring sperm DNA, and 4X106 cpm/ml of radiolabelled DNA (see below) . Washing for 20 minutes in 2XSSC at room temperature was followed by two 30 minute washes in 0.4 x SSC/0.5%SDS at 55°C Washed blots were exposed to Kodak XAR film 1-6 days with an intensifying screen at -70°C. Band sizes were compared to λ DNA molecular weight standards, and with patterns previously defined (K. Blum et al., JAMA 263, 2055-2060 (1990); A.M. Bolos et al., JAMA 264, 3156-3160 (1990). After Taql A RFLP status was determined, 32P decay allowed re- hybridization of the same blots with hybridization probe for Taql B ascertainment. When background levels of radiation were not reached, filters were incubated at 65°C for 30 min in 2 mM TRIS (pH 8) , 1 mM EDTA, and
0.1% SDS to remove residual hybridized probe. RFLP status was assigned by two independent raters unaware of the clinical status of the subjects.
iii) Hybridization probes: A 1.7 kb BamHI fragment of the human genomic clone encoding the dopamine D2 receptor (λhD2Gl) was subcloned into the BamHI site on bluescript SK+ to produce phD2-9, which was used to detect Al, A2, and A3 patterns in the Southern analyses, as described (K. Blum et al., JAMA 263, 2055-2060 (1990) ; A.M. Bolos et al. , JAMA 264, 3156-3160 (1990)) (Dr Bruce O'Hara et al, unpublished data) . λhD2G2 was used to detect the Taql "B" patterns. DNAs were radiolabelled using random priming and 32P-CTP to specific activities of approximately 109 cpm/μg (A. Feinberg and B Vogelstein, Anal Biochem. 137, 266-267 (1984)).
iv) Analyses:
a) Association analyses: A two-tailed Pearson chi square test (with Yates' correction for continuity) was used to evaluate the association between Al RFLP presence and substance use/abuse; the same analysis were repeated for the Bl RFLP. Association was first tested contrasting controls and substance users meeting criteria for any lifetime DSM-III-R substance dependence disorder. Next, controls were contrasted with substance users who had been assessed with the DUS. Data for both groups of substance-using subjects were pooled and compared to RFLP frequencies for controls, b) Comparisons with other data: Pooled Taql Al RFLP data from ARC users was compared with
values obtained for Caucasian controls in other studies. c) Subtracting heavy alcohol users: DUS-assessed users free of heavy alcohol use were compared with controls to test whether the associations observed might be attributed solely to alcohol.
Tagl A and B RFLPs were assigned with 100% agreement between two independent raters.
Substance use assessment by means of the Drug Use Survey showed several features suggesting validity and reliability. For 31 subjects whose DUS was elicited twice, interrater reliability correlations for severity ratings ranged from 0.83 to 1.00 (median = 0.94), while test-retest reliability correlations for individual drugs ranged from 0.53 to 0.94 (median = 0.78) . For 35 subjects with DIS-III-R or SCID assessments and independent DUS ratings, analysis of the correspondence between a positive lifetime DSM-III-R Substance Use diagnosis and moderate to heavy substance use on the DUS yielded a kappa value of 0.68 (91% agreement). Finally, drugs tested as positive in urine drug screening were reported used 84% of the time (n= 56) in the DUS assessment.
Taql A and B RFLP frequencies for substance-using and control subjects are presented in Table II. For the Taql Bl RFLP, a significant association was found comparing users with at least one lifetime DIS-III-R Substance Use Disorder diagnosis and DUS-assessed controls (χ2 = 6.74, p < 0.01). For the Taql Al RFLP, analysis of the same groups revealed a significant association (χ2 = 3.98, p < 0.05). Comparison of DUS- assessed users to DUS-assessed controls revealed a significant association for the Taql Bl RFLP (χ2 = 5.45,
p < 0.02) and a trend towards significant association for the Taql Al RFLP (χ2 = 3.14, p < 0.08). Table III presents Taql A and B genotypes (homozygotes and heterozygotes) for DUS-assessed controls and users.
Table II. D2 Dopamine Receptor Gene RFLPs in Users and Controls
Group Al Present (%) Bl Present (%)
I. POLYSUBSTANCE USERS a) DSM-III-R Substance Use Diagnosis b) DUS* Heavy Use (Total Use=3 c) DUS Moderate Use (Total Use=2) d) DUS Total Use 2 & 3 Combined Combined Users©
III. PREVIOUSLY REPORTED CONTROLS
Blum et al , 15 (1990) * Blum et al ,36 (1991) * Comings et al ,35 (1991)* Parsian et al,37 (1991) * Todd et al ,+ ( 1991) * Bolos et al , 26 (1990) Comings et al ,35 (1991) Gelernter et al ,3° (1991) CEPH-Comings et al ,35 (1991) Grandy et al , 14 (1989) O'Hara et al,++ (1992)
subtotal 27.1 (152/560)
Combined Controls: Prior and Current Reports total ( )
*DUS = Drug Use Survey aThe subject total here (n=232 users) reflects 40 users who received only the DIS-III-R + 82 users who received the DIS-III-R and the DUS + lio users who received only the DUS. Four subjects who received both the DIS-III-R and the DUS did not meet criteria for a DSM- III-R diagnosis but were classified as users (in the DUS and "Combined Users" comparisons) on the basis of their DUS ratings. 'Screened to exclude alcohol or drug abuse. +Cited in Cloninger et al., JAMA 266, 1833 (1991). ++Unpublished data.
No significant differences in RFLP frequencies were found between DUS-assessed substance users and DIS-III-R-assessed users (for Taql Al, χ2 = 0.005, ns; for Taql Bl, χ2 = 0.331, ns) . We thus reanalyzed the data by pooling substance users assessed in both fashions for comparison with controls. Analysis of Taql Bl data (χ2 = 6.31, p < 0.02) and Taql Al data (χ2 = 4.46, p < 0.04) again revealed significant associations.
Comparisons of Taql Al RFLP frequencies in ARC users and controls from all other published studies revealed significant associations when the controls were assessed (χ2 = 15.41, p < 0.001), unassessed (χ2 = 9.31, p < 0.003), or pooled (χ2 = 14.80, p < 0.001).
To examine whether the effects noted could be attributed chiefly to the extent of alcohol intake, heavy alcohol users achieving DUS alcohol ratings of 3 were eliminated from the user group and reanalysis performed. Omitting heavy alcohol users did not significantly alter the elevated frequencies of the Taql Bl RFLP found in the user group. 32.3% of all polysubstance users and 32.9% of the 73 polysubstance users free of heavy alcohol use displayed the Taql Bl marker.
The hypothesis that individual differences in substance abuse may be due, in part, to different dopamine D2 receptor alleles marked by Taq I RFLPs at the gene's 3' and 5' ends arises from initial work in alcoholics (K. Blum et al., JAMA 263, 2055-2060 (1990)). The hypothesis is
strengthened by a compelling biological rationale for interactions between abused drugs and brain dopamine systems (G. Di Chiara and A. Imperato, Proc Natl Acad Sci USA 85, 5274-5278 (1988) ; R.A. Wise and P.P. Ro pre, Annu Rev Psychol. 40, 191-225 (1989)). In the current example, significant associations with heavy substance use or abuse were found consistently for the Taql Bl RFLP and less consistently for the Taql Al RFLP. These findings provide preliminary evidence that a more 5' Taql RFLP (Bl) may represent a better marker for a DRD2 gene variant possibly predisposing carriers to heavy substance use or abuse. Selection of drug using and control populations provides opportunities for different approaches that could influence the results obtained. Many substance abusers use multiple psychoactive substances ( D.R. Wesson et al., eds. Polvdruq Abuse: The Results of a National Collaborative Study. New York, NY: Academic Press, Inc.; 1978); 71% of the 192 DUS-assessed users in the current study reported moderate to heavy use of three or more different substances. To reflect this fact, we first studied subjects who frequently use multiple drugs, attempted to characterize each drug used by each subject, and analyzed data on the basis of overall lifetime peak use. This approach might provide a weaker test of linkage if only a single abused substance displayed such genetic association. For example, if only alcohol abuse contributed to the
associations noted here, we might anticipate a weaker association between Bl RFLPs and substance abuse if individuals with heavy alcohol consumption were eliminated from our sample. In fact, elimination of heavy alcohol users (DUS rating=3) resulted in no decrease in the differences between the remaining DUS-assessed drug-abusing and control individuals for the Taql B RFLP. The characterization of these subjects also raises important issues of assessment type, validity and reliability. Errors in clinical assessment would weaken tests of the allelic association hypothesis. In addition, studying behaviors that could contribute to features of clinical diagnosis but might not reflect the behavioral impact of a DRD2 gene variant could yield false-negative results.
We originally began work with the DUS, an interview-based assessment of substance use that enabled approximate quantification of peak lifetime use for several types of substances and appeared to provide an assessment of a basic feature of substance abuse: level of substance consumption. Psychiatric genetic work using classical methods suggests that heavy substance use can show substantial genetic determinants (R.W. Pickens et al.. Arch Gen Psychiatry 48, 19-28 (1991) ; CR. Cloninger and T. Reich, In: Kety SS, Rowland LP, Sidman RL, Matthysse SW, eds. Genetics of neurological and psychiatric disorders. New York, NY: Raven Press: 1983; pp.
145-166) . Reliability and validity of the quantity-frequency approach to subjects' drug use were supported by the correlations between drug use assessments made on two occasions, assessments made with multiple instruments, and correlations with results of urine drug screens. However, several individuals who reported heavy use of various drugs did not fulfill criteria for DSM-III-R diagnoses of dependence or abuse on SCID or DIS-III-R assessments of the same drugs (S.S. Smith et al., "Validation of an instrument for quantifying drug use self-report: The ARC Drug Use Scale". Presented at the 53rd Annual Scientific Meeting, The Committee on Problems of Drug Dependence, June 16-20, 1991, Palm Beach, FL) .
We also evaluated subjects by determining lifetime psychiatric diagnoses of psychoactive substance use disorders using a structured psychiatric interview, the DIS-III-R, which can also demonstrate reliability and validity (J.E. Helzer et al.. Arch Gen Psychiatry 42, 657-666 (1985); N. Oskooilar et al., DIS Newsletter 8, 9- 10 (1991)). Comparison of Taql A and B RFLP frequencies in substance-using subjects failed to indicate significant differences between the quantity- frequency and psychiatric diagnosis approaches. These results suggested that we could combine subjects meeting criteria for DSM-III-R Substance Use diagnsoses with subjects reporting moderate to heavy drug use. It is still conceivable,
however, that behavioral effects of a gene might be differentially reflected in quantity/frequency or in disease/disorder approaches to defining the affected group. The RFLPs studied here are the result of polymorphic Tagl restriction sites in which "A" RFLPs are located ca. 9 kb 3' to the final exon of the D2 receptor gene (0. Civelli, personal communication) and "B" RFLPs are located near the first coding exon (X.Y. Hauge et al., Genomics 10, 527-530 (1991)). These polymorphisms could have functional relevance if base pair differences directly influenced the gene's regulation. Alternatively, they could provide markers for structural or regulatory changes in other regions of the gene if these other changes and the Taql variations were maintained together by linkage disequilibrium resulting in specific haplotypes (G.R. Uhl et al., Arch Gen Psychiatry 49, 157-160 (1992)). This linkage disequilibrium does exist (X.Y. Hauge et al., Genomics 10, 527- 530 (1991) ; Dr. Bruce O'Hara et al, unpublished data) . In our data, for example, the expected frequency of the A2/A2-B2/B2 haplotype would be 43% based on the frequencies of the A2 and B2 allelic markers. However, the observed frequency of this haplotype was 61% (χ2 = 16.33, p < 0.0001) (76% for controls and 57% for users, χ2 = 5.15, p < 0.03), indicating substantial linkage disequilibrium.
The lack of strong association between D2 receptor gene RFLPs and substance use evident in
this study is consistent with estimates of the heritable components of alcoholism and drug abuse (E.J. Devor and CR. Cloninger, Annu Rev Genet. 23, 19-36 (1989).; R.J. Cadoret et al.. Arch Gen Psychiatry 43, 1131-1136 (1987)). One recent study of concordance rates for alcoholism in twin populations suggests that between 20 and 30% of the vulnerability to abuse or dependence on this substance may be genetic in origin (R.W. Pickens et al.. Arch Gen Psychiatry 48, 19-28 (1991)). Attempts to link familial alcohol susceptibility to specific chromosomal markers and patterns of inheritance in families have not been consistent with a single genetic locus (S.B Gilligan et al., Genet Epidemiol. 4, 395-414 (1987); C.E. Aston and S.Y. Hill, Am J Hum Genet. 46, 879-887 (1990)) . The strong association between a single gene RFLP and alcoholism found by Blum et al. (K. Blum et al., JAMA 263, 2055-2060 (1990)) would thus fit poorly with this extent of heritability. The large environmental influences on expression of alcoholism, and their study of unrelated individuals rather than defined pedigrees also make the strength of their findings surprising. To investigate the association between the DAT gene and substance abuse behaviors, one can make use of the variable number tandem repeat (VNTR) at the 3'-end of the mRNA described in example 1. Alternatively the Taql RFLP described in example 2 could be utilized. In general, examination of VNTR markers is preferred, as such markers have a larger number of alleles and hence
are "more informative", i.e. VNTR markers identify more subtypes than a regular "site-no site" RFLP marker. The same methodology described above for the study of the D2 dopamine receptor gene can be employed. As shown above, particular attention must be paid to the diagnostic criteria for identifying the abuse behavior if the results are to be meaningful.
To assess frequencies of the VNTR, DNA is obtained from leukocytes from research volunteers as described above. Genomic DNA (40 ng) is subjected to 35 cycles of amplification using AmpliTaq DNA Polymerase (1.25 U) and polymerase chain reaction with denaturing for 1 min at 93°C, and annealing/extension for 1 min at 72°C in buffer supplied by the manufacturer (Perkin- Elmer) . Oligonucleotides T3-5LONG (5'- TGTGGTGTAGGGAACGGCCTGAG-3', SEQ. ID. NO. 4) and T7-3aLONG (5'-CTTCCTGGAGGTCACGGCTCAAGG-3', SEQ. ID. NO. 5) are used at 0.5 uM final concentration. Reaction products are separated by 5% polyacrylamide gel electrophoresis, and product sizes estimated by comparison to molecular weight standards (BRL) . 242 of the 254 chromosomes examined displayed either 9 or 10 copies of the 40 basepair repeat. Two chromosomes showed three copies, two showed 5 copies, three showed 7, four showed 8 and one showed 11 copies of the VNTR. Among individuals with 9 and/or 10 copies per chromosome there were racial differences in copy number frequencies. Whites displayed 30% and
Blacks displayed 20% of the 9-copy variant. The 3' VNTR marker defined by 9 versus 10 copies of the 40 basepair repeat displayed no significant linkage disequilibrium with the more 5' Taql RFLP (χ2 values were 5.51 and 4.62 for White and Black subjects, respectively, with 8 degrees of freedom, p > 0.1.)
Example 4 (predictive) Expression of HUDAT protein in Escherichia coli and purification of the bacterially expressed protein
Any of several expression systems can be utilized to obtain HUDAT protein expression in E. coli. For example, the plasmid vector pFLAG system (International Biotechnologies, Inc., New Haven, CT) produces the polypeptide of interest attached to a short protein sequence that allows purification of the fusion protein by use of a monoclonal antibody directed against a hydrophilic, and thus surface localized, octapeptide. The open reading frame midportion of the HUDAT cDNA is obtained by digestion of the pHCDAT7 plasmid with EcoRI and purification of the insert fragment encoding the HUDAT protein by electrophoresis and elution from an agarose gel by standard techniques. Oligonucleotides having the sequences 5'-GGGTCTAGACG-3' and 5'- AATTCGTCTAGACCC-3' are annealed to form an adaptor and the adaptor is ligated to the ends of the insert DNA. The ligation product is digested with Xbal and cloned into the Xbal restriction
site of the pFLAG vector (International Biotechnologies, Inc.). The appropriate E. coli host is transformed and colonies containing the HUDAT cDNA may be screened by colony hybridization using the pcHUDAT as probe. Positive clones are grown as large-scale cultures and the fusion protein is obtained in pure form by use of the monoclonal antibody affinity column as described by the manufacturer of the system, except that the elution buffer is modified by the addition of 0.5% CHAPS (3-[ (3-Cholamidopropyl)- dimethylammonio] 1-propane-sulfonate) . Authentic DAT protein lacking the FLAG octapeptide is obtained by enterokinase cleavage of the fusion protein as described by the supplier of the FLAG system.
Example 5 (predictive) Purification of DAT from tissues or from transformed mammalian cells. As protein isolated from transformed bacterial cells lacks post-translational modifications, such as sugar additions, that occur in mammalian cells, the purification of the protein from tranformed COS cells is discussed. COS cells transformed as described in (S. Shimada et al., Science 254, 576 (1991)) are subjected to a purification protocol as described for the purification of the GABA transporter (Radian, et al., J. Biol. Chem. 261, 15437-15441 (1987) with the modification that binding of labelled CFT is used to assay for the presence of
DAT in the sample rather than labelled gamma- amino butyric acid. The protocol is modified as required to allow the isolation of DAT as a distinct protein by techniques known to a practitioner of the art.
Example 6 (predictive) Diagnosis of deficiency, mutant or overexpression of dopamine transporter by PCR
mRNA obtained from tissue biopsy from a patient is converted subjected to quantitative reverse-transcript PCR (for example, see A. M. Wang, et al. PNAS USA 86:9717 (1989)) utilizing as primers oligonucleotides derived from the cDNA sequence of pcHUDAT. Use of the 5' 19-mer, GCTCCGTGGACTCATGTCTTC, bases 118 through 139 of Fig. 1 (SEQ. I.D. NO. 1) as the upstream primer and CACCTTGAGCCAGTGGCGG, the reverse complement of bases 1942 to 1960 of Fig. 1 (SEQ. I.D. NO. 1) as the downstream primer allows examination of the character of the protein coding region of the HUDAT mRNA. Variance in the expression level can be ascertained by comparison of product yield with a normal control. Abnormal mRNA structures can be diagnosed by observation of a product band of a length different from the normal control. Point mutants can be observed by use of primers and conditions appropriate for detection of the mismatch between the mutant and normal alleles. For example, the "reverse dot blot" procedure for screening the expression of several mutant
alleles in a single experiment, which has been described for the CFTR gene, mutants of which cause cystic fibrosis (Erlich, H.A. , et al Science 252:1643 (1991). The HUDAT mRNA also contains a variable number tandem repeat element in the 3' untranslated portion of the mRNA which can be amplified for examination of an association between specific VNTR alleles and substance abuse behavior or diseases associated with expression of particular HUDAT alleles (See example 3) .
Example 7 (predictive) Use of dopamine transporter expression to incorporate as part of overexpression of a panel of dopaminergic genes to reconstruct a dopaminergic cell line for therapy in human diseases resulting from defective dopamine transporter expression.
cDNAs for the human dopamine transporter, and for tyrosine hydroxylase and aromatic ammino acid decarboxylase (DOPA decarboxylase) are transfected into cell types including COS cells as described above. Cells are cotransfected with the neomycin resistance marker, selected by growth in G418, and then tested for their ability to synthesize and accumulate dopamine. Individual subclones may be able to take up dopamine, without the ability to synthesize it. However, individual subclones are also likely to
integrate several of the plasmids. If the plasmids cannot be introduced serially or together in this direction, serial edition of tyrosine hydroxylase and DOPA decarboxylase to stable cell lines already expressing the dopamine transporter stably should be employed (see above) . The ability of cells to incorporate tritiated tyrosine into tritiated dopamine is tested via HPLC analysis and radiochemical detection as described (Uhl et al.. Molecular Brain Research, 1991) , their ability to take up tritiated dopamine is performed as described in the same reference.
These same procedures are used in transfecting cells obtained from an individual with a disease state caused by defects in dopamine transporter expression, either in the amount expressed or due to expression of a defective protein, so that stable immortalized cell lines expressing human dopamine transporter could be constructed with immunologic identity to the patient. Means of controlling the replication of these cells by encapsulating them in a matrix that is not porous to cell bodies, but able to be permeated by cell processes, or by use of inhibitory growth factors, can also be employed. A third strategy, temperature sensitive cell mutants that would not divide under physiologic temperatures (e.g. temperature sensitive COS cells variants) could be used to be able to express the dopaminergic cDNA stably, in a fashion that would produce dopaminergic cells.
Each of these cell types are potential candidates for use in transplantation into striatum in individuals with striatal dopamine depletion in Parkinson's disease. Alternatively, genes could be incorporated with retroviral vectors as well- known for practitioners of the art.
Example 8 (predictive) Production of variant sequences in HUDAT protein and testing of their biological function Site directed mutagenesis using olgonucleotides is used to introduce specific single- and multiple-base changes into the HUDAT cDNA that change specific amino acids in the HUDAT protein. The ability of mutant transporters to take up [ ] dopamine, [3H] MPP+,and to bind [3H] cocaine and cocaine analogues (especially [3H] CFT) is tested as described previously (S. Shimada et al., Science 254, 576 (1991)). The Amersham mutagenesis system (version 2.1, technical bulletin code RPN1523) can be used. Initial studies of mutants of the aspartic acid residue in transmembrane domain 1, and the serine residues in transmembrane domain 7 of the rat DAT protein have revealed substantial effects on dopamine transport, and more modest effects on cocaine binding. These results document that the residues key to dopamine transport are not identical to those crucial to cocaine binding; the first transmembrane residue change of aspartic acid (residue 79) to glycine reduces
cocaine binding by 10%, but reduces dopamine transport by over 95%. Mutations in the second extracellular domain in glycosylation sites help elucidate the role of glycosylation in the functions of this molecule (See also Example 9) . Selective removal of the N and C terminal intracellular and second extracellular loop, and production of chimeric molecules with replacement of these regions with the corresponding regions of the GABA transporter further confirm the molecular features of DAT that are essential for dopamine transport and cocaine binding and allow development of agents dissociating the two processes.
Example 9 (predictive)
Alteration of carbohydrate structure in the extracellular domain of the HUDAT protein
As noted in example 1, the largest difference in the structure of the proteins predicted by the human and the rat dopamine transporter dDNA sequences is the absence of one of the four consensus sites for N-linked glycosylation of the protein (See figure 3) . By virtue of their location in the same domain of the protein expected to most influence substrate binding, that is in the extracellular portion of the protein, it is of interest to investigate the contribution of the sugars to substrate binding. Site directed mutagaenesis can be performed as described for Example 8 introduce into the human DAT cDNA the asparagine residues to which N-
linked sugars are attached and the remaining amino acids which constitute the glycosylation signal for that site that are found in the rat, but not the human cDNA. The result of expression of such mutant proteins can be evaluated by photo-affinity labelling of the protein and analysis by SDS-PAGE. Digestion of the protein with various glycosidases can be performed to assess the degree to which the pattern of glycosylation has been altered, as described by Lew et al. (R. Lew et al.. Brain Research 539, 239 (1991)). For instance, compararison of the wild-type and mutant protein, both untreated and digested with N-glycanase, would should show similar sized proteins for the digested protein, but a larger protein for the untreated mutant, compared to the untreated wild-type protein if the introduction of the asparagine glycosylation signal resulted in successful incorporation of sugar into the protein at that site. More detailed information regarding the sugar structure can be obtained by exoglycosidase digestion experiments. For example, the presence of sialic acid residues in the polysaccharide can be detected by digestion with neuraminidase. The influence of the polysaccarhide structure on function of the protein is then assessed by testing the properties of the the transporter using either stably transfected cells expressing the mutant protein, or by using cells transiently expressing the mutant transporter on their surface. The means for carrying out such
functional studies are described by Shimada et al. (S. Shimada et al., Science 254, 576 (1991)).
Example 10 (predictive) Cell lines expressing HUDAT protein on the cell surface can be used to screen candidate compounds for efficacy as dopamine (or cocaine or functional analogs thereof) agonists or antagonists by evaluating the influence of the candidate compound upon the binding of dopamine (or cocaine or functional analogs thereof) to the surface of such cells. Another assay for dopamine agonist or antagonist activity is to measure the cytotoxicity to such cells of MPP* to such cells in the presence and absence of the candidate compound. Such assays are described using cells expressing the rat DAT cDNA in Shimada et al. (S. Shimada et al., Science 254, 576 (1991)) and can be applied as well to cells expressing the human DAT cDNA.
Example 11 (predictive)
Production of antibodies to HUDAT and use of same in a diagnostic test for dopaminergic cell death.
A. Production of polyclonal antibodies.
HUDAT protein obtained as described above or synthetic polypeptides of amino acid sequence derived from the HUDAT sequence are used as immunogens in an appropriate animal. The serum is obtained from the immunized animal and either
utilized directly or the antibody may be purified from the serum by any commonly utilized techniques. Polyclonal antibody directed only toward HUDAT can be isolated by use of an affinity column derivatized with the immunogen utilized to raise the antibody, again using techniques familiar to one knowledgable in the art.
B. Production of monoclonal antibodies to HUDAT Monoclonal antibodies to HUDAT or to particular epitopes of HUDAT may be produced by immunization of an appropriate animal with HUDAT protein obtained as above or with peptides of amino acid sequence derived from the HUDAT amino acid sequence. Hybridoma cultures are then established from spleen cells as described by Jaffe and McMahon-Pratt (Jaffe, CL. and MacMahon-Pratt, D. J. Immunol. 131, 1987-1993 (1983)). Alternatively, peripheral blood lymphocytes may be isolated and immortalized by transformation with Epstein-Barr virus. These cells produce monoclonal antibodies, but if desired, hybridomas can then be made from the transformed lymphocytes (Yamaguchi, H. et al. Proc. Natl. Acad. Sci. 84, 2416-2420 (1987)). Cell lines producing anti-HUDAT antibodies are identified by commonly employed screening techniques. Monoclonal antibody is then purified by well known techniques from the supernatants of large-scale cultures of the antibody producing cells.
C. Diagnosis of dopaminergic cell death in vivo by immunoassay of cerebrospinal fluid of a patient using anti-HUDAT antibodies.
The death of dopaminergic neurons in the brain of a patient should result in the accumulation in the cerebrospinal fluid, which bathes these cells, of membrane debris as a product of lysis of the dead cells. Other pathologic conditions, short of cell death that result in the release of DAT protein, or degraded peptide fragments of HUDAT protein into the surrounding medium can also be imagined. The cerebospinal fluid can be sampled by lumbar puncture of a patient. The presence of degradation products of HUDAT protein is detected by immunoassay, using as the primary antibody at least one of the products obtained as described above. Elevated levels of HUDAT protein detected in the cerebrospinal fluid, compared with the range seen in normal controls is indicative of Parkinsons's disease or drug-induced neurotoxicity. Alternatively, disease progression can be monitored by the assessment of HUDAT levels in serial samples from the same patient.
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: Uhl, George R.
Vandenbergh, David Persico, Antonio
(ii) TITLE OF INVENTION: Sequence of Human Dopamine Transporter
(iii) NUMBER OF SEQUENCES: 5
(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Birch, Stewart, Kolasch & Birch
(B) STREET: 301 N. Washington St.
(C) CITY: Falls Church
(D) STATE: Virginia
(E) COUNTRY: USA
(F) ZIP: 22046-3487
(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: Patentln Release #1.0, Version #1.25
(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(C) CLASSIFICATION:
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Murphy Jr., Gerald M.
(B) REGISTRATION NUMBER: 28,977
(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 703-241-1300
(B) TELEFAX: 703-241-2848
(C) TELEX: 248345
(2) INFORMATION FOR SEQ ID NO:l:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3919 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Homo sapiens (F) TISSUE TYPE: brainstem
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 102..1961
(D) OTHER INFORMATION: /function-- "dopamine transport" /product= "HUDAT polypeptide"
(ix) FEATURE:
(A) NAME/KEY: misc_RNA
(B) LOCATION: 2724..3117
(D) OTHER INFORMATION: /function= "unknown" /label= VNTR_region
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
GAATTCCCGC TCTCGGCGCC AGGACTCGCG TGCAAAGCCC AGGCCCGGGC GGCCAGACCA 60 AGAGGGAAGA AGCACAGAAT TCCTCAACTC CCAGTGTGCC C ATG AGT AAG AGC 113
Met Ser Lys Ser
1
AAA TGC TCC GTG GGA CTC ATG TCT TCC GTG GTG GCC CCG GCT AAG GAG 161 Lys Cys Ser Val Gly Leu Met Ser Ser Val Val Ala Pro Ala Lys Glu 5 10 15 20
CCC AAT GCC GTG GGC CCG AAG GAG GTG GAG CTC ATC CTT GTC AAG GAG 209 Pro Asn Ala Val Gly Pro Lys Glu Val Glu Leu He Leu Val Lys Glu 25 30 35
CAG AAC GGA GTG CAG CTC ACC AGC TCC ACC CTC ACC AAC CCG CGG CAG 257 Gin Asn Gly Val Gin Leu Thr Ser Ser Thr Leu Thr Asn Pro Arg Gin 40 45 50
AGC CCC GTG GAG GCC CAG GAT CGG GAG ACC TGG GGC AAG AAG ATC GAC 305 Ser Pro Val Glu Ala Gin Asp Arg Glu Thr Trp Gly Lys Lys He Asp 55 60 65
TTT CTC CTG TCC GTC ATT GGC TTT GCT GTG GAC CTG GCC AAC GTC TGG 353 Phe Leu Leu Ser Val He Gly Phe Ala Val Asp Leu Ala Asn Val Trp 70 75 80
CGG TTC CCC TAC CTG TGC TAC AAA AAT GGT GGC GGT GCC TTC CTG GTC 401 Arg Phe Pro Tyr Leu Cys Tyr Lys Asn Gly Gly Gly Ala Phe Leu Val 85 90 95 100
CCC TAC CTG CTC TTC ATG GTC ATT GCT GGG ATG CCA CTT TTC TAC ATG 449 Pro Tyr Leu Leu Phe Met Val He Ala Gly Met Pro Leu Phe Tyr Met 105 110 115
GAG CTG GCC CTC GGC CAG TTC AAC AGG GAA GGG GCC GCT GGT GTC TGG 497 Glu Leu Ala Leu Gly Gin Phe Asn Arg Glu Gly Ala Ala Gly Val Trp 120 125 130
AAG ATC TGC CCC ATA CTG AAA GGT GTG GGC TTC ACG GTC ATC CTC ATC 545
Lys He Cys Pro He Leu Lys Gly Val Gly Phe Thr Val He Leu He 135 140 145
TCA CTG TAT GTC GGC TTC TTC TAC AAC GTC ATC ATC GCC TGG GCG CTG 593
Ser Leu Tyr Val Gly Phe Phe Tyr Asn Val He He Ala Trp Ala Leu
150 155 160
CAC TAT CTC TTC TCC TCC TTC ACC ACG GAG CTC CCC TGG ATC CAC TGC 641
His Tyr Leu Phe Ser Ser Phe Thr Thr Glu Leu Pro Trp He His Cys
165 170 175 180
AAC AAC TCC TGG AAC AGC CCC AAC TGC TCG GAT GCC CAT CCT GGT GAC 689
Asn Asn Ser Trp Asn Ser Pro Asn Cys Ser Asp Ala His Pro Gly Asp 185 190 195
TCC AGT GGA GAC AGC TCG GGC CTC AAC GAC ACT TTT GGG ACC ACA CCT 737
Ser Ser Gly Asp Ser Ser Gly Leu Asn Asp Thr Phe Gly Thr Thr Pro 200 205 210
GCT GCC GAG TAC TTT GAA CGT GGC GTG CTG CAC CTC CAC CAG AGC CAT 785
Ala Ala Glu Tyr Phe Glu Arg Gly Val Leu His Leu His Gin Ser His 215 220 225
GGC ATC GAC GAC CTG GGG CCT CCG CGG TGG CAG CTC ACA GCC TGC CTG 833
Gly He Asp Asp Leu Gly Pro Pro Arg Trp Gin Leu Thr Ala Cys Leu
230 235 240
GTG CTG GTC ATC GTG CTG CTC TAC TTC AGC CTC TGG AAG GGC GTG AAG 881
Val Leu Val He Val Leu Leu Tyr Phe Ser Leu Trp Lys Gly Val Lys
245 250 255 260
ACC TCA GGG AAG GTG GTA TGG ATC ACA GCC ACC ATG CCA TAC GTG GTC 929
Thr Ser Gly Lys Val Val Trp He Thr Ala Thr Met Pro Tyr Val Val 265 270 275
CTC ACT GCC CTG CTC CTG CGT GGG GTC ACC CTC CCT GGA GCC ATA GAC 977 Leu Thr Ala Leu Leu Leu Arg Gly Val Thr Leu Pro Gly Ala He Asp 280 285 290
GGC ATC AGA GCA TAC CTG AGC GTT GAC TTC TAC CGG CTC TGC GAG GCG 1025 Gly He Arg Ala Tyr Leu Ser Val Asp Phe Tyr Arg Leu Cys Glu Ala 295 300 305
TCT GTT TGG ATT GAC GCG GCC ACC CAG GTG TGC TTC TCC CTG GGC GTG 1073 Ser Val Trp He Asp Ala Ala Thr Gin Val Cys Phe Ser Leu Gly Val 310 315 320
GGG TTC GGG GTG CTG ATC GCC TTC TCC AGC TAC AAC AAG TTC ACC AAC 1121 Gly Phe Gly Val Leu He Ala Phe Ser Ser Tyr Asn Lys Phe Thr Asn 325 330 335 340
AAC TGC TAC AGG GAC GCG ATT GTC ACC ACC TCC ATC AAC TCC CTG ACG 1169 Asn Cys Tyr Arg Asp Ala He Val Thr Thr Ser He Asn Ser Leu Thr 345 350 355
AGC TTC TCC TCC GGC TTC GTC GTC TTC TCC TTC CTG GGG TAC ATG GCA 1217 Ser Phe Ser Ser Gly Phe Val Val Phe Ser Phe Leu Gly Tyr Met Ala 360 365 370
CAG AAG CAC AGT GTG CCC ATC GGG GAC GTG GCC AAG GAC GGG CCA GGG 1265 Gin Lys His Ser Val Pro He Gly Asp Val Ala Lys Asp Gly Pro Gly 375 380 385
CTG ATC TTC ATC ATC TAC CCG GAA GCC ATC GCC ACG CTC CCT CTG TCC 1313 Leu He Phe He He Tyr Pro Glu Ala He Ala Thr Leu Pro Leu Ser 390 395 400
TCA GCC TGG GCC GTG GTC TTC TTC ATC ATG CTG CTC ACC CTG GGT ATC 1361 Ser Ala Trp Ala Val Val Phe Phe He Met Leu Leu Thr Leu Gly He 405 410 415 420
GAC AGC GCC ATG GGT GGT ATG GAG TCA GTG ATC ACC GGG CTC ATC GAT 1409
Asp Ser Ala Met Gly Gly Met Glu Ser Val He Thr Gly Leu He Asp
425 430 435
GAG TTC CAG CTG CTG CAC AGA CAC CGT GAG CTC TTC ACG CTC TTC ATC 1457
Glu Phe Gin Leu Leu His Arg His Arg Glu Leu Phe Thr Leu Phe He 440 445 450
GTC CTG GCG ACC TTC CTC CTG TCC CTG TTC TGC GTC ACC AAC GGT GGC 1505
Val Leu Ala Thr Phe Leu Leu Ser Leu Phe Cys Val Thr Asn Gly Gly 455 460 465
ATC TAC GTC TTC ACG CTC CTG GAC CAT TTT GCA GCC GGC ACG TCC ATC 1553
He Tyr Val Phe Thr Leu Leu Asp His Phe Ala Ala Gly Thr Ser He 470 475 480
CTC TTT GGA GTG CTC ATC GAA GCC ATC GGA GTG GCC TGG TTC TAT GGT 1601
Leu Phe Gly Val Leu He Glu Ala He Gly Val Ala Trp Phe Tyr Gly
485 490 495 500
GTT GGG CAG TTC AGC GAC GAC ATC CAG CAG ATG ACC GGG CAG CGG CCC 1649
Val Gly Gin Phe Ser Asp Asp He Gin Gin Met Thr Gly Gin Arg Pro
505 510 515
AGC CTG TAC TGG CGG CTG TGC TGG AAG CTG GTC AGC CCC TGC TTT CTC 1697
Ser Leu Tyr Trp Arg Leu Cys Trp Lys Leu Val Ser Pro Cys Phe Leu 520 525 530
CTG TTC GTG GTC GTG GTC AGC ATT GTG ACC TTC AGA CCC CCC CAC TAC 1745
Leu Phe Val Val Val Val Ser He Val Thr Phe Arg Pro Pro His Tyr 535 540 545
GGA GCC TAC ATC TTC CCC GAC TGG GCC AAC GCG CTG GGC TGG GTC ATC 1793
Gly Ala Tyr He Phe Pro Asp Trp Ala Asn Ala Leu Gly Trp Val He 550 555 560
GCC ACA TCC TCC ATG GCC ATG GTG CCC ATC TAT GCG GCC TAC AAG TTC 1841 Ala Thr Ser Ser Met Ala Met Val Pro He Tyr Ala Ala Tyr Lys Phe 565 570 575 580
TGC AGC CTG CCT GGG TCC TTT CGA GAG AAA CTG GCC TAC GCC ATT GCA 1889 Cys Ser Leu Pro Gly Ser Phe Arg Glu Lys Leu Ala Tyr Ala He Ala 585 590 595
CCC GAG AAG GAC CGT GAG CTG GTG GAC AGA GGG GAG GTG CGC CAG TTC 1937 Pro Glu Lys Asp Arg Glu Leu Val Asp Arg Gly Glu Val Arg Gin Phe 600 605 610
ACG CTC CGC CAC TGG CTC AAG GTG TAGAGGGAGC AGAGACGAAG ACCCCAGGAA 1991 Thr Leu Arg His Trp Leu Lys Val 615 620
GTCATCCTGC AATGGGAGAG ACACGAACAA ACCAAGGAAA TCTAAGTTTC GAGAGAAAGG 2051
AGGGCAACTT CTACTCTTCA ACCTCTACTG AAAACACAAA CAACAAAGCA GAAGACTCCT 2111
CTCTTCTGAC TGTTTACACC TTTCCGTGCC GGGAGCGCAC CTCGCCGTGT CTTGTGTTGC 2171
TGTAATAACG ACGTAGATCT GTGCAGCGAG GTCCACCCCG TTGTTGTCCC TGCAGGGCAG 2231
AAAAACGTCT AACTTCATGC TGTCTGTGTG AGGCTCCCTC CCTCCCTGCT CCCTGCTCCC 2291
GGCTCTGAGG CTGCCCCAGG GGCACTGTGT TCTCAGGCGG GGATCACGAT CCTTGTAGAC 2351
GCACCTGCTG AGAATCCCCG TGCTCACAGT AGCTTCCTAG ACCATTTACT TTGCCCATAT 2411
TAAAAAGCCA AGTGTCCTGC TTGGTTTAGC TGTGCAGAAG GTGAAATGGA GGAAACCACA 2471
AATTCATGCA AAGTCCTTTC CCGATGCGTG GCTCCCAGCA GAGGCCGTAA ATTGAGCGTT 2531
CAGTTGACAC ATTGCACACA CAGTCTGTTC AGAGGCATTG GAGGATGGGG GTCCTGGTAT 2591
GTCTCACCAG GAAATTCTGT TTATGTTCTT GCAGCAGAGA GAAATAAAAC TCCTTGAAAC 2651
CAGCTCAGGC TACTGCCACT CAGGCAGCCT GTGGGTCCTT GTGGTGTAGG GAACGGCCTG 2711
AGAGGAGCGT GTCCTATCCC CGGACGCATG CAGGGCCCCC ACAGGAGCGT GTCCTATCCC 2771
CGGACGCATG CAGGGCCCCC ACAGGAGCAT GTCCTATCCC TGGACGCATG CAGGGCCCCC 2831
ACAGGAGCGT GTACTACCCC AGAACGCATG CAGGGCCCCC ACAGGAGCGT GTACTACCCC 2891
AGGACGCATG CAGGGCCCCC ACTGGAGCGT GTACTACCCC AGGACGCATG CAGGGCCCCC 2951
ACAGGAGCGT GTCCTATCCC CGGACCGGAC GCATGCAGGG CCCCCACAGG AGCGTGTACT 3011
ACCCCAGGAC GCATGCAGGG CCCCCACAGG AGCGTGTACT ACCCCAGGAT GCATGCAGGG 3071
CCCCCACAGG AGCGTGTACT ACCCCAGGAC GCATGCAGGG CCCCCATGCA GGCAGCCTGC 3131
AGACCAACAC TCTGCCTGGC CTTGAGCCGT GACCTCCAGG AAGGGACCCC ACTGGAATTT 3191
TATTTCTCTC AGGTGCGTGC CACATCAATA ACAACAGTTT TTATGTTTGC GAATGGCTTT 3251
TTAAAATCAT ATTTACCTGT GAATCAAAAC AAATTCAAGA ATGCAGTATC CGCGAGCCTG 3311
CTTGCTGATA TTGCAGTTTT TGTTTACAAG AATAATTAGC AATACTGAGT GAAGGATGTT 3371
GGCCAAAAGC TGCTTTCCAT GGCACACTGC CCTCTGCCAC TGACAGGAAA GTGGATGCCA 3431
TAGTTTGAAT TCATGCCTCA AGTCGGTGGG CCTGCCTACG TGCTGCCCGA GGGCAGGGGC 3491
CGTGCAGGGC CAGTCATGGC TGTCCCCTGC AAGTGGACGT GGGCTCCAGG GACTGGAGTG 3551
TAATGCTCGG TGGGAGCCGT CAGCCTGTGA ACTGCCAGGC AGCTGCAGTT AGCACAGAGG 3611
ATGGCTTCCC CATTGCCTTC TGGGGAGGGA CACAGAGGAC GGCTTCCCCA TCGCCTTCTG 3671
GCCGCTGCAG TCAGCACAGA GAGCGGCTTC CCCATTGCCT TCTGGGGAGG GACACAGAGG 3731
ACAGTTTCCC CATCGCCTTC TGGTTGTTGA AGACAGCACA GAGAGCGGCT TCCCCATCGC 3791
CTTCTGGGGA GGGGCTCCGT GTAGCAACCC AGGTGTTGTC CGTGTCTGTT GACCAATCTC 3851
TATTCAGCAT CGTGTGGGTC CCTAAGCACA ATAAAAGACA TCCACAATGG AAAAAAAAAA 3911
AGGAATTC 3919
(2) INFORMATION FOR SEQ ID NO:2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 620 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:
Met Ser Lys Ser Lys Cys Ser Val Gly Leu Met Ser Ser Val Val Ala 1 5 10 15
Pro Ala Lys Glu Pro Asn Ala Val Gly Pro Lys Glu Val Glu Leu He 20 25 30
Leu Val Lys Glu Gin Asn Gly Val Gin Leu Thr Ser Ser Thr Leu Thr 35 40 45
Asn Pro Arg Gin Ser Pro Val Glu Ala Gin Asp Arg Glu Thr Trp Gly 50 55 60
Lys Lys He Asp Phe Leu Leu Ser Val He Gly Phe Ala Val Asp Leu 65 70 75 80
Ala Asn Val Trp Arg Phe Pro Tyr Leu Cys Tyr Lys Asn Gly Gly Gly 85 90 95
Ala Phe Leu Val Pro Tyr Leu Leu Phe Met Val He Ala Gly Met Pro 100 105 110
Leu Phe Tyr Met Glu Leu Ala Leu Gly Gin Phe Asn Arg Glu Gly Ala 115 120 125
Ala Gly Val Trp Lys He Cys Pro He Leu Lys Gly Val Gly Phe Thr 130 135 140
Val He Leu He Ser Leu Tyr Val Gly Phe Phe Tyr Asn Val He He 145 150 155 160
Ala Trp Ala Leu His Tyr Leu Phe Ser Ser Phe Thr Thr Glu Leu Pro 165 170 175
Trp He His Cys Asn Asn Ser Trp Asn Ser Pro Asn Cys Ser Asp Ala 180 185 190
His Pro Gly Asp Ser Ser Gly Asp Ser Ser Gly Leu Asn Asp Thr Phe 195 200 205
Gly Thr Thr Pro Ala Ala Glu Tyr Phe Glu Arg Gly Val Leu His Leu 210 215 220
His Gin Ser His Gly He Asp Asp Leu Gly Pro Pro Arg Trp Gin Leu 225 230 235 240
Thr Ala Cys Leu Val Leu Val He Val Leu Leu Tyr Phe Ser Leu Trp 245 250 255
Lys Gly Val Lys Thr Ser Gly Lys Val Val Trp He Thr Ala Thr Met 260 265 270
Pro Tyr Val Val Leu Thr Ala Leu Leu Leu Arg Gly Val Thr Leu Pro 275 280 285
Gly Ala He Asp Gly He Arg Ala Tyr Leu Ser Val Asp Phe Tyr Arg 290 295 300
Leu Cys Glu Ala Ser Val Trp He Asp Ala Ala Thr Gin Val Cys Phe 305 310 315 320
Ser Leu Gly Val Gly Phe Gly Val Leu He Ala Phe Ser Ser Tyr Asn 325 330 335
Lys Phe Thr Asn Asn Cys Tyr Arg Asp Ala He Val Thr Thr Ser He 340 345 350
Asn Ser Leu Thr Ser Phe Ser Ser Gly Phe Val Val Phe Ser Phe Leu 355 360 365
Gly Tyr Met Ala Gin Lys His Ser Val Pro He Gly Asp Val Ala Lys 370 375 380
Asp Gly Pro Gly Leu He Phe He He Tyr Pro Glu Ala He Ala Thr 385 390 395 400
Leu Pro Leu Ser Ser Ala Trp Ala Val Val Phe Phe He Met Leu Leu 405 410 415
Thr Leu Gly He Asp Ser Ala Met Gly Gly Met Glu Ser Val He Thr 420 425 430
Gly Leu He Asp Glu Phe Gin Leu Leu His Arg His Arg Glu Leu Phe 435 440 445
Thr Leu Phe He Val Leu Ala Thr Phe Leu Leu Ser Leu Phe Cys Val 450 455 460
Thr Asn Gly Gly He Tyr Val Phe Thr Leu Leu Asp His Phe Ala Ala 465 470 475 480
Gly Thr Ser He Leu Phe Gly Val Leu He Glu Ala He Gly Val Ala 485 490 495
Trp Phe Tyr Gly Val Gly Gin Phe Ser Asp Asp He Gin Gin Met Thr 500 505 510
Gly Gin Arg Pro Ser Leu Tyr Trp Arg Leu Cys Trp Lys Leu Val Ser 515 520 525
Pro Cys Phe Leu Leu Phe Val Val Val Val Ser He Val Thr Phe Arg 530 535 540
Pro Pro His Tyr Gly Ala Tyr He Phe Pro Asp Trp Ala Asn Ala Leu 545 550 555 560
Gly Trp Val He Ala Thr Ser Ser Met Ala Met Val Pro He Tyr Ala 565 570 575
Ala Tyr Lys Phe Cys Ser Leu Pro Gly Ser Phe Arg Glu Lys Leu Ala 580 585 590
Tyr Ala He Ala Pro Glu Lys Asp Arg Glu Leu Val Asp Arg Gly Glu 595 600 605
Val Arg Gin Phe Thr Leu Arg His Trp Leu Lys Val 610 615 620
(2) INFORMATION FOR SEQ ID NO:3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA
(iii) HYPOTHETICAL: YES
(iv) AN I-SENSE: NO
(ix) FEATURE:
(A) NAME/KEY: -
(B) LOCATION: 1..40
(D) OTHER INFORMATION: /label= consensus
/note= "consensus sequence of VNTR element in 3' untranslated region of HUDAT cDNA"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3 :
AGGAGCGTGT ACTATCCCAG GACGCATGCA GGGCCCCCAC 40
(2) INFORMATION FOR SEQ ID NO:4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (synthetic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(ix) FEATURE:
(A) NAME/KEY: -
(B) LOCATION: 1..23
(D) OTHER INFORMATION: /label= oligonucleotide
/note= "synthetic oligonucleotide T3-5LONG, upstream primer for PCR analysis of VNTR region of in 3' untranslated region of HUDAT gene "
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:
TGTGGTGTAG GGAACGGCCT GAG 23
(2) INFORMATION FOR SEQ ID NO:5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (synthetic)
(iii) HYPOTHETICAL: NO
(iv) AN I-SENSE: YES
(ix) FEATURE:
(A) NAME/KEY: -
(B) LOCATION: 1..24
(D) OTHER INFORMATION: /label= oligonucleotide
/note= "synthetic oligonucleotide, T7-3aLONG; downstream primer for PCR analysis of VNTR region of 3' untranslated region of HUDAT gene"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:
CTTCCTGGAG GTCACGGCTC AAGG 24
Claims
1. An isolated, purified protein comprising the amino acid sequence described for Hdat in Figure 3 of the specification (seq. I.D. NO. 2) , and variants or portions thereof which retain the activity of said protein, said activity comprising the selective binding of dopamine, cocaine or functional analogs thereof.
2. An isolated, purified protein as recited in claim 1, wherein said biochemical activity further comprises transport of dopamine, cocaine or functional analogs thereof from one side of a membrane to the other.
3. An isolated cDNA comprising a nucleotide sequence which encodes the protein of claim 1.
4. An isolated cDNA comprising a nucleotide sequence which encodes the protein of claim 2.
5. An isolated cDNA comprising a nucleotide sequence as set forth in Figure 1 (SEQ. ID. NO. 1) of the Specification.
6. A plasmid DNA comprising the cDNA of claim 3, and a DNA sequence that provides for replication in a prokaryotic host cell and DNA sequences that provide for transcription of the cDNA in vitro, such that the resulting mRNA can be isolated and translated upon introduction into a eukaryotic cell type.
7. A plasmid DNA comprising the cDNA of claim , and a DNA sequence that provides for replication in a prokaryotic host cell and DNA sequences that provide for transcription of the cDNA in vitro, such that the resulting mRNA can be isolated and translated upon introduction into a eukaryotic cell type.
8. A eukaryotic cell line derived from a cell type that does not normally express dopamine transport activity at its surface that has been made to transport dopamine or to bind CFT by the introduction of the DNA of claim 2 into its genome.
9. A eukaryotic cell line derived from a cell type that does not normally express dopamine transport activity at its surface that has been made to transport dopamine or to bind CFT by the introduction of the DNA of claim 4 into its genome.
10. The cell line of claim 9 wherein the cell type is COS cells.
11. A method for screening cocaine antagonists which comprises measuring the inhibition of binding of cocaine, or of functionally equivalent cocaine analogs, to the surface of cells or to membrane preparations obtained from said cells, wherein said cells express the cDNA of claim 3.
12. A method for screening cocaine antagonists which comprises measuring the inhibition of cell toxicity due to MPP+ import into cells expressing the protein of claim 2 or transport across reconstituted membrane preparations obtained from such cells.
13. A method for screening therapeutic agents effective in the prevention or treatment of Parkinson's disease which comprises measuring the inhibition of cell toxicity due to MPP+ import into cells expressing the protein of claim 2, or measuring MPP+ transport across reconstituted membrane preparations obtained such cells.
14. A method for screening therapeutic agents effective in the prevention or treatment of Parkinson's disease which comprises measuring the inhibition of binding of cocaine, or of functioanlly equivalent cocaine analogs, to the surface of cells or to membrane preparations obtained from said cells, wherein said cells express the cDNA of claim 3.
15. A method for the screening of therapeutic agents effective in the prevention or treatment of diseases caused by abnormal dopamine transport which comprises measuring either the inhibition or facilitation of binding of cocaine, or of functionally equivalent cocaine analogs, to the cell surface of cells of the line or to membrane preparations obtained from the cell line of claim 9, by candidate compounds.
16. A method for the screening of therapeutic agents effective in the prevention or treatment of diseases caused by abnormal dopamine transport which comprises measuring the cell toxicity due to PP+ import into cells of the line or transport across reconstituted membrane preparations obtained from the cell line of claim 9, by candidate compounds .
17. A method for the treatment of Parkinson's disease, Tourette's syndrome or other disease caused by abnormal HUDAT expression by a gene therapy technique comprising transplanting into a patient somatic cells which have been transformed with a recombinant DNA construction expressing the cDNA of claim 4, thus obtaining expression of said cDNA in said patient.
18. A method for diagnosing genetic variants in dopamine transporter protein utilizing a Southern blot based testing method which detects genetic rearrangements or deletions in the DAT gene, which comprises using the cDNA of claim 3 as a probe.
19. A method for diagnosing genetic variants in dopamine transporter protein as in claim 18, wherein the cDNA probe comprises DNA having the nucleotide sequence of Figure 1 (SEQ. ID. NO. 1) or a portion thereof.
20. A method for screening an individual for a genetic marker associated with substance abuse behavior which comprises analyzing the alleles of HUDAT present in the genotype of said individual.
21. The method of claim 20, wherein said alleles are variable number tandem repeat alleles.
22. A method for diagnosing variant expression of dopamine transporter utilizing a testing method based on the polymerase chain reaction which comprises using polynucleotide primers derived from the cDNA of claim 3.
23. A method for diagnosing variant dopamine transporter expression as in claim 22, wherein the polynucleotide primers are derived from the nucleotide sequence shown in Figure 1 (SEQ. ID. NO. 1) .
24. A method for detecting changes in the rate of release of human dopamine transporter from dopaminergic cells into cerebrospinal fluid that comprises measurement of peptides of the human dopamine transporter released into cerebrospinal fluid.
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA002136087A CA2136087C (en) | 1992-06-01 | 1993-06-01 | Sequence of human dopamine transporter cdna |
EP93916415A EP0644934A1 (en) | 1992-06-01 | 1993-06-01 | SEQUENCE OF HUMAN DOPAMINE TRANSPORTER cDNA |
AU45975/93A AU686564B2 (en) | 1992-06-01 | 1993-06-01 | Sequence of human dopamine transporter cDNA |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US88972392A | 1992-06-01 | 1992-06-01 | |
US07/889,723 | 1992-06-01 |
Publications (3)
Publication Number | Publication Date |
---|---|
WO1993024628A2 WO1993024628A2 (en) | 1993-12-09 |
WO1993024628A3 WO1993024628A3 (en) | 1994-02-03 |
WO1993024628A9 true WO1993024628A9 (en) | 1994-05-26 |
Family
ID=25395672
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1993/005179 WO1993024628A2 (en) | 1992-06-01 | 1993-06-01 | SEQUENCE OF HUMAN DOPAMINE TRANSPORTER cDNA |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP0644934A1 (en) |
AU (1) | AU686564B2 (en) |
CA (1) | CA2136087C (en) |
WO (1) | WO1993024628A2 (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1997012513A1 (en) * | 1995-10-02 | 1997-04-10 | Duke University | Dopamine transporter knockout mice |
US20020192821A1 (en) * | 2001-05-22 | 2002-12-19 | Active Pass Pharmaceuticals, Inc. | Increased functional activity and/or expression of ABC transporters protects against the loss of dopamine neurons associated with Parkinson's disease |
GB2381525A (en) * | 2001-11-01 | 2003-05-07 | Tcs Cellworks Ltd | Regulating gene expression |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5312734A (en) * | 1991-09-20 | 1994-05-17 | The United States Of America As Represented By The Department Of Health And Human Services | CDNA encoding a dopamine transporter |
WO1993008262A1 (en) * | 1991-10-25 | 1993-04-29 | The State Of Oregon Acting By And Through The State Board Of Higher Education On Behalf Of Oregon Health Sciences University | A cDNA CLONE ENCODING AN EXPRESSIBLE DOPAMINE TRANSPORTER |
-
1993
- 1993-06-01 AU AU45975/93A patent/AU686564B2/en not_active Ceased
- 1993-06-01 CA CA002136087A patent/CA2136087C/en not_active Expired - Lifetime
- 1993-06-01 WO PCT/US1993/005179 patent/WO1993024628A2/en not_active Application Discontinuation
- 1993-06-01 EP EP93916415A patent/EP0644934A1/en not_active Withdrawn
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP7292836B6 (en) | MECP2E1 gene | |
US5891628A (en) | Identification of polycystic kidney disease gene, diagnostics and treatment | |
Paoloni-Giacobino et al. | Cloning of a novel human neural cell adhesion molecule gene (NCAM2) that maps to chromosome region 21q21 and is potentially involved in Down syndrome | |
WO1995034573A9 (en) | Identification of polycystic kidney disease gene, diagnostics and treatment | |
US6429011B1 (en) | Neuronal apoptosis inhibitor protein gene sequence and mutations causative of spinal muscular atrophy | |
US11155598B2 (en) | GPR156 variants and uses thereof | |
US5756307A (en) | Sequence of human dopamine transporter cDNA | |
AU668106B2 (en) | cDNA encoding a dopamine transporter and protein encoded thereby | |
AU686564B2 (en) | Sequence of human dopamine transporter cDNA | |
WO1993024628A9 (en) | SEQUENCE OF HUMAN DOPAMINE TRANSPORTER cDNA | |
EP0970243B1 (en) | Diagnosis and treatment of glaucoma | |
EP1476553A2 (en) | Gene for peripheral arterial occlusive disease | |
AU2003201728A1 (en) | Gene for peripheral arterial occlusive disease | |
CA2534382C (en) | Epm2b gene mutations associated with lafora's disease | |
WO2000021985A2 (en) | Genes encoding olfactory receptors and biallelic markers thereof | |
WO2004042358A2 (en) | HUMAN TYPE II DIABETES GENE-SLIT-3 LOCATED ON CHROMOSOME 5q35 | |
JPH0892289A (en) | Protein related to human machado joseph disease, cdna and gene coding for the same protein, vector containing the same dna or gene, host cell transformed with the same manifestation vector, method for diagnosing machado joseph disease and therapeutic agent therefor | |
EP0678028A1 (en) | Dna encoding a human betaine/gaba transporter and uses thereof | |
JP2005525079A (en) | ALS2 gene and amyotrophic lateral sclerosis type 2 | |
WO2000018787A1 (en) | Gene mutated in wolfram syndrome | |
Webb | Positional cloning of the psrt mutations on mouse chromosome 7 | |
US20030105001A1 (en) | Pro-apoptotic proteins and DNA molecules encoding them | |
WO2004074302A2 (en) | Autosomal recessive polycystic kidney disease nucleic acids and peptides | |
CA2345915A1 (en) | Gpi-anchored small leucine-rich proteoglycan gene | |
WO1997001573A2 (en) | Early onset alzheimer's disease gene and gene products |