WO1997006257A1 - Cellular co-factor for hiv rev and htlv rex - Google Patents
Cellular co-factor for hiv rev and htlv rex Download PDFInfo
- Publication number
- WO1997006257A1 WO1997006257A1 PCT/US1996/012986 US9612986W WO9706257A1 WO 1997006257 A1 WO1997006257 A1 WO 1997006257A1 US 9612986 W US9612986 W US 9612986W WO 9706257 A1 WO9706257 A1 WO 9706257A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- rev
- dna
- protein
- proteins
- hiv
- Prior art date
Links
- 230000001413 cellular effect Effects 0.000 title claims abstract description 37
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 122
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 105
- 108020004414 DNA Proteins 0.000 claims abstract description 81
- 238000000034 method Methods 0.000 claims abstract description 39
- 101900102211 Human T-cell leukemia virus 1 Protein Rex Proteins 0.000 claims abstract description 17
- 210000004027 cell Anatomy 0.000 claims description 95
- 230000004913 activation Effects 0.000 claims description 62
- 101710150344 Protein Rev Proteins 0.000 claims description 48
- 108020001507 fusion proteins Proteins 0.000 claims description 46
- 102000037865 fusion proteins Human genes 0.000 claims description 46
- 230000027455 binding Effects 0.000 claims description 45
- 239000013598 vector Substances 0.000 claims description 22
- 102100039556 Galectin-4 Human genes 0.000 claims description 21
- 108010068250 Herpes Simplex Virus Protein Vmw65 Proteins 0.000 claims description 20
- 241000713666 Lentivirus Species 0.000 claims description 20
- 241000598436 Human T-cell lymphotropic virus Species 0.000 claims description 19
- 238000001727 in vivo Methods 0.000 claims description 19
- 239000013612 plasmid Substances 0.000 claims description 19
- 150000001875 compounds Chemical class 0.000 claims description 18
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 claims description 16
- 238000013518 transcription Methods 0.000 claims description 16
- 230000035897 transcription Effects 0.000 claims description 16
- 241000700605 Viruses Species 0.000 claims description 15
- 108020004511 Recombinant DNA Proteins 0.000 claims description 10
- 239000000523 sample Substances 0.000 claims description 9
- 239000011780 sodium chloride Substances 0.000 claims description 8
- 108020005187 Oligonucleotide Probes Proteins 0.000 claims description 7
- 210000004962 mammalian cell Anatomy 0.000 claims description 7
- 239000002751 oligonucleotide probe Substances 0.000 claims description 7
- 239000001509 sodium citrate Substances 0.000 claims description 7
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 claims description 7
- 238000012360 testing method Methods 0.000 claims description 7
- 230000004568 DNA-binding Effects 0.000 claims description 6
- 108010001515 Galectin 4 Proteins 0.000 claims description 6
- 108700008625 Reporter Genes Proteins 0.000 claims description 6
- 230000015572 biosynthetic process Effects 0.000 claims description 6
- 230000002068 genetic effect Effects 0.000 claims description 6
- 238000000338 in vitro Methods 0.000 claims description 6
- 239000002773 nucleotide Substances 0.000 claims description 6
- 125000003729 nucleotide group Chemical group 0.000 claims description 6
- 210000005253 yeast cell Anatomy 0.000 claims description 6
- 241000238631 Hexapoda Species 0.000 claims description 5
- 238000012216 screening Methods 0.000 claims description 4
- 241000701447 unidentified baculovirus Species 0.000 claims description 4
- 108020004705 Codon Proteins 0.000 claims description 3
- 230000002285 radioactive effect Effects 0.000 claims 1
- 108091006104 gene-regulatory proteins Proteins 0.000 abstract description 5
- 102000034356 gene-regulatory proteins Human genes 0.000 abstract description 5
- 230000001124 posttranscriptional effect Effects 0.000 abstract description 3
- 108700020121 Human Immunodeficiency Virus-1 rev Proteins 0.000 abstract 1
- 235000018102 proteins Nutrition 0.000 description 85
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 description 32
- 150000001413 amino acids Chemical class 0.000 description 31
- 101710150336 Protein Rex Proteins 0.000 description 30
- 239000013613 expression plasmid Substances 0.000 description 28
- 235000001014 amino acid Nutrition 0.000 description 25
- 230000014509 gene expression Effects 0.000 description 23
- 230000000694 effects Effects 0.000 description 21
- 230000006870 function Effects 0.000 description 19
- 102000005720 Glutathione transferase Human genes 0.000 description 18
- 108010070675 Glutathione transferase Proteins 0.000 description 18
- 230000035772 mutation Effects 0.000 description 17
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 16
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 15
- 101000608765 Homo sapiens Galectin-4 Proteins 0.000 description 15
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 15
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 15
- 230000003993 interaction Effects 0.000 description 15
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 14
- 241000713730 Equine infectious anemia virus Species 0.000 description 12
- 241000283973 Oryctolagus cuniculus Species 0.000 description 12
- 101710149951 Protein Tat Proteins 0.000 description 12
- 238000004458 analytical method Methods 0.000 description 12
- 241000282414 Homo sapiens Species 0.000 description 11
- 241000725303 Human immunodeficiency virus Species 0.000 description 11
- 238000003556 assay Methods 0.000 description 10
- 241000588724 Escherichia coli Species 0.000 description 9
- 230000004570 RNA-binding Effects 0.000 description 9
- 239000012634 fragment Substances 0.000 description 9
- 108020004999 messenger RNA Proteins 0.000 description 9
- 241000714260 Human T-lymphotropic virus 1 Species 0.000 description 8
- 239000013604 expression vector Substances 0.000 description 8
- 230000014616 translation Effects 0.000 description 8
- 108091028043 Nucleic acid sequence Proteins 0.000 description 7
- 210000004940 nucleus Anatomy 0.000 description 7
- 238000013519 translation Methods 0.000 description 7
- 241000714266 Bovine leukemia virus Species 0.000 description 6
- 108010081734 Ribonucleoproteins Proteins 0.000 description 6
- 102000004389 Ribonucleoproteins Human genes 0.000 description 6
- 108010087924 alanylproline Proteins 0.000 description 6
- 238000004113 cell culture Methods 0.000 description 6
- 239000000284 extract Substances 0.000 description 6
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 6
- 230000010076 replication Effects 0.000 description 6
- 210000001519 tissue Anatomy 0.000 description 6
- 238000001890 transfection Methods 0.000 description 6
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 5
- 241000880493 Leptailurus serval Species 0.000 description 5
- 108700026244 Open Reading Frames Proteins 0.000 description 5
- 238000002474 experimental method Methods 0.000 description 5
- 210000005260 human cell Anatomy 0.000 description 5
- 230000023603 positive regulation of transcription initiation, DNA-dependent Effects 0.000 description 5
- 241000894007 species Species 0.000 description 5
- 238000001262 western blot Methods 0.000 description 5
- 229920000936 Agarose Polymers 0.000 description 4
- 108090000565 Capsid Proteins Proteins 0.000 description 4
- 102100023321 Ceruloplasmin Human genes 0.000 description 4
- 241001559589 Cullen Species 0.000 description 4
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 4
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 4
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 4
- 241001465754 Metazoa Species 0.000 description 4
- 241000699666 Mus <mouse, genus> Species 0.000 description 4
- 241000288906 Primates Species 0.000 description 4
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 4
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 4
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 4
- 239000011324 bead Substances 0.000 description 4
- 108010005774 beta-Galactosidase Proteins 0.000 description 4
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 4
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 4
- 230000012010 growth Effects 0.000 description 4
- 239000013642 negative control Substances 0.000 description 4
- 150000007523 nucleic acids Chemical group 0.000 description 4
- 230000002018 overexpression Effects 0.000 description 4
- 239000002953 phosphate buffered saline Substances 0.000 description 4
- 238000003752 polymerase chain reaction Methods 0.000 description 4
- 230000007115 recruitment Effects 0.000 description 4
- 108010061238 threonyl-glycine Proteins 0.000 description 4
- 230000002103 transcriptional effect Effects 0.000 description 4
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 208000031886 HIV Infections Diseases 0.000 description 3
- 241000714259 Human T-lymphotropic virus 2 Species 0.000 description 3
- 241000713340 Human immunodeficiency virus 2 Species 0.000 description 3
- WSXKXSBOJXEZDV-DLOVCJGASA-N Phe-Ala-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@H](C)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 WSXKXSBOJXEZDV-DLOVCJGASA-N 0.000 description 3
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 3
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 3
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 3
- 238000001042 affinity chromatography Methods 0.000 description 3
- 108010047495 alanylglycine Proteins 0.000 description 3
- WQZGKKKJIJFFOK-FPRJBGLDSA-N beta-D-galactose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-FPRJBGLDSA-N 0.000 description 3
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 3
- 238000012761 co-transfection Methods 0.000 description 3
- 230000000875 corresponding effect Effects 0.000 description 3
- 238000010790 dilution Methods 0.000 description 3
- 239000012895 dilution Substances 0.000 description 3
- 230000004927 fusion Effects 0.000 description 3
- 108010089804 glycyl-threonine Proteins 0.000 description 3
- 230000002401 inhibitory effect Effects 0.000 description 3
- 230000000670 limiting effect Effects 0.000 description 3
- 230000004807 localization Effects 0.000 description 3
- 239000006166 lysate Substances 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 108090000765 processed proteins & peptides Proteins 0.000 description 3
- 108010031719 prolyl-serine Proteins 0.000 description 3
- 108010048818 seryl-histidine Proteins 0.000 description 3
- 108010071207 serylmethionine Proteins 0.000 description 3
- 238000010396 two-hybrid screening Methods 0.000 description 3
- 230000003612 virological effect Effects 0.000 description 3
- 238000005406 washing Methods 0.000 description 3
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 2
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 2
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 2
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 2
- VJIQPOJMISSUPO-BVSLBCMMSA-N Arg-Trp-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VJIQPOJMISSUPO-BVSLBCMMSA-N 0.000 description 2
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 2
- DQTIWTULBGLJBL-DCAQKATOSA-N Asn-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N DQTIWTULBGLJBL-DCAQKATOSA-N 0.000 description 2
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 2
- VITDJIPIJZAVGC-VEVYYDQMSA-N Asn-Met-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VITDJIPIJZAVGC-VEVYYDQMSA-N 0.000 description 2
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 2
- 241000271566 Aves Species 0.000 description 2
- 241000283707 Capra Species 0.000 description 2
- 101710132601 Capsid protein Proteins 0.000 description 2
- 101710094648 Coat protein Proteins 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- LHJDLVVQRJIURS-SRVKXCTJSA-N Cys-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N LHJDLVVQRJIURS-SRVKXCTJSA-N 0.000 description 2
- 108010016626 Dipeptides Proteins 0.000 description 2
- 241000283073 Equus caballus Species 0.000 description 2
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 2
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 2
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 2
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 2
- RTSQPLLOYSGMKM-DSYPUSFNSA-N Ile-Trp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N RTSQPLLOYSGMKM-DSYPUSFNSA-N 0.000 description 2
- 108060003951 Immunoglobulin Proteins 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 2
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 2
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 2
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 2
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 2
- 101710125418 Major capsid protein Proteins 0.000 description 2
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 2
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 101710141454 Nucleoprotein Proteins 0.000 description 2
- 241000286209 Phasianidae Species 0.000 description 2
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 2
- YRKFKTQRVBJYLT-CQDKDKBSSA-N Phe-Ala-His Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 YRKFKTQRVBJYLT-CQDKDKBSSA-N 0.000 description 2
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 2
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 2
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 2
- 101710182846 Polyhedrin Proteins 0.000 description 2
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 2
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 2
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 2
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 2
- 101710083689 Probable capsid protein Proteins 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 2
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 2
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- 210000001744 T-lymphocyte Anatomy 0.000 description 2
- 241000255588 Tephritidae Species 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 2
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 2
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 2
- WFAUDCSNCWJJAA-KXNHARMFSA-N Thr-Lys-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(O)=O WFAUDCSNCWJJAA-KXNHARMFSA-N 0.000 description 2
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 2
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 2
- 108010022394 Threonine synthase Proteins 0.000 description 2
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 2
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 2
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 2
- 208000010094 Visna Diseases 0.000 description 2
- 239000007864 aqueous solution Substances 0.000 description 2
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 230000004186 co-expression Effects 0.000 description 2
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 102000004419 dihydrofolate reductase Human genes 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 238000010185 immunofluorescence analysis Methods 0.000 description 2
- 102000018358 immunoglobulin Human genes 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- 239000000178 monomer Substances 0.000 description 2
- 108020004707 nucleic acids Proteins 0.000 description 2
- 102000039446 nucleic acids Human genes 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N phenylalanine group Chemical group N[C@@H](CC1=CC=CC=C1)C(=O)O COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 230000001566 pro-viral effect Effects 0.000 description 2
- 102000004196 processed proteins & peptides Human genes 0.000 description 2
- 210000001995 reticulocyte Anatomy 0.000 description 2
- 230000001177 retroviral effect Effects 0.000 description 2
- 108700004030 rev Genes Proteins 0.000 description 2
- 101150098213 rev gene Proteins 0.000 description 2
- 125000003607 serino group Chemical class [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 230000009870 specific binding Effects 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 2
- 238000003146 transient transfection Methods 0.000 description 2
- 230000032258 transport Effects 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 2
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 description 1
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 1
- OSJPPGNTCRNQQC-UWTATZPHSA-N 3-phospho-D-glyceric acid Chemical compound OC(=O)[C@H](O)COP(O)(O)=O OSJPPGNTCRNQQC-UWTATZPHSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- SHYYAQLDNVHPFT-DLOVCJGASA-N Ala-Asn-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SHYYAQLDNVHPFT-DLOVCJGASA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- FAJIYNONGXEXAI-CQDKDKBSSA-N Ala-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 FAJIYNONGXEXAI-CQDKDKBSSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 1
- 241000269350 Anura Species 0.000 description 1
- 102100024365 Arf-GAP domain and FG repeat-containing protein 1 Human genes 0.000 description 1
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 1
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- CDGHMJJJHYKMPA-DLOVCJGASA-N Asn-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)N)N CDGHMJJJHYKMPA-DLOVCJGASA-N 0.000 description 1
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 1
- DOURAOODTFJRIC-CIUDSAMLSA-N Asn-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N DOURAOODTFJRIC-CIUDSAMLSA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 1
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- 241000951889 Autographa californica multiple nucleopolyhedrovirus Species 0.000 description 1
- 241000304886 Bacilli Species 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- ZPCVINYAFSETFH-UTCJRWHESA-N C=C/C=C\C1C=CC=C1 Chemical compound C=C/C=C\C1C=CC=C1 ZPCVINYAFSETFH-UTCJRWHESA-N 0.000 description 1
- 101100512078 Caenorhabditis elegans lys-1 gene Proteins 0.000 description 1
- 241000713756 Caprine arthritis encephalitis virus Species 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- TULNGKSILXCZQT-IMJSIDKUSA-N Cys-Asp Chemical compound SC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O TULNGKSILXCZQT-IMJSIDKUSA-N 0.000 description 1
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 1
- WTXCNOPZMQRTNN-BWBBJGPYSA-N Cys-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)O WTXCNOPZMQRTNN-BWBBJGPYSA-N 0.000 description 1
- 241000701022 Cytomegalovirus Species 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 108090000204 Dipeptidase 1 Proteins 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 238000008157 ELISA kit Methods 0.000 description 1
- 241001302584 Escherichia coli str. K-12 substr. W3110 Species 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108091060211 Expressed sequence tag Proteins 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 241000255890 Galleria Species 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- BBBXWRGITSUJPB-YUMQZZPRSA-N Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O BBBXWRGITSUJPB-YUMQZZPRSA-N 0.000 description 1
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- 102000053187 Glucuronidase Human genes 0.000 description 1
- 108010060309 Glucuronidase Proteins 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- DKEXFJVMVGETOO-LURJTMIESA-N Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CN DKEXFJVMVGETOO-LURJTMIESA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 1
- RLAOTFTXBFQJDV-KKUMJFAQSA-N His-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CN=CN1 RLAOTFTXBFQJDV-KKUMJFAQSA-N 0.000 description 1
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101000833314 Homo sapiens Arf-GAP domain and FG repeat-containing protein 1 Proteins 0.000 description 1
- 108700020123 Human Immunodeficiency Virus-2 rev Proteins 0.000 description 1
- 108090000144 Human Proteins Proteins 0.000 description 1
- 102000003839 Human Proteins Human genes 0.000 description 1
- 241000701109 Human adenovirus 2 Species 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- QQVXERGIFIRCGW-NAKRPEOUSA-N Ile-Ser-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N QQVXERGIFIRCGW-NAKRPEOUSA-N 0.000 description 1
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 1
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- LUYURUYVNYGKGM-RCWTZXSCSA-N Met-Pro-Thr Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUYURUYVNYGKGM-RCWTZXSCSA-N 0.000 description 1
- FDGAMQVRGORBDV-GUBZILKMSA-N Met-Ser-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCSC FDGAMQVRGORBDV-GUBZILKMSA-N 0.000 description 1
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 108020004485 Nonsense Codon Proteins 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 102000007999 Nuclear Proteins Human genes 0.000 description 1
- 108010089610 Nuclear Proteins Proteins 0.000 description 1
- 108700020497 Nucleopolyhedrovirus polyhedrin Proteins 0.000 description 1
- 241000713695 Ovine/caprine lentivirus group Species 0.000 description 1
- 108010087702 Penicillinase Proteins 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 1
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 1
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 1
- GLUBLISJVJFHQS-VIFPVBQESA-N Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 GLUBLISJVJFHQS-VIFPVBQESA-N 0.000 description 1
- RFCVXVPWSPOMFJ-STQMWFEESA-N Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RFCVXVPWSPOMFJ-STQMWFEESA-N 0.000 description 1
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 108091036407 Polyadenylation Proteins 0.000 description 1
- FELJDCNGZFDUNR-WDSKDSINSA-N Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FELJDCNGZFDUNR-WDSKDSINSA-N 0.000 description 1
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 1
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 1
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- SVXXJYJCRNKDDE-AVGNSLFASA-N Pro-Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CN=CN1 SVXXJYJCRNKDDE-AVGNSLFASA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 1
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 108091034057 RNA (poly(A)) Proteins 0.000 description 1
- 108020005067 RNA Splice Sites Proteins 0.000 description 1
- 241001598112 Rachiplusia ou MNPV Species 0.000 description 1
- 101100041796 Rattus norvegicus Slc6a4 gene Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 108091027981 Response element Proteins 0.000 description 1
- 101001059240 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) Site-specific recombinase Flp Proteins 0.000 description 1
- 229920002684 Sepharose Polymers 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 1
- YZMPDHTZJJCGEI-BQBZGAKWSA-N Ser-His Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 YZMPDHTZJJCGEI-BQBZGAKWSA-N 0.000 description 1
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 1
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 1
- XZKQVQKUZMAADP-IMJSIDKUSA-N Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(O)=O XZKQVQKUZMAADP-IMJSIDKUSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 1
- 241000256251 Spodoptera frugiperda Species 0.000 description 1
- 208000000389 T-cell leukemia Diseases 0.000 description 1
- 208000028530 T-cell lymphoblastic leukemia/lymphoma Diseases 0.000 description 1
- 101710137500 T7 RNA polymerase Proteins 0.000 description 1
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- XIULAFZYEKSGAJ-IXOXFDKPSA-N Thr-Leu-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XIULAFZYEKSGAJ-IXOXFDKPSA-N 0.000 description 1
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 241000255993 Trichoplusia ni Species 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 1
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 1
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- IRLYZKKNBFPQBW-XGEHTFHBSA-N Val-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N)O IRLYZKKNBFPQBW-XGEHTFHBSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- 108020000999 Viral RNA Proteins 0.000 description 1
- 108010087302 Viral Structural Proteins Proteins 0.000 description 1
- 241000269370 Xenopus <genus> Species 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 238000001261 affinity purification Methods 0.000 description 1
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- ZVDPYSVOZFINEE-BQBZGAKWSA-N alpha-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O ZVDPYSVOZFINEE-BQBZGAKWSA-N 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 206010003246 arthritis Diseases 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 230000002238 attenuated effect Effects 0.000 description 1
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 102000005936 beta-Galactosidase Human genes 0.000 description 1
- 102000006635 beta-lactamase Human genes 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 230000009918 complex formation Effects 0.000 description 1
- 230000001268 conjugating effect Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 1
- UFJPAQSLHAGEBL-RRKCRQDMSA-N dITP Chemical compound O1[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C[C@@H]1N1C(N=CNC2=O)=C2N=C1 UFJPAQSLHAGEBL-RRKCRQDMSA-N 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 238000010494 dissociation reaction Methods 0.000 description 1
- 230000005593 dissociations Effects 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 238000001378 electrochemiluminescence detection Methods 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 230000002414 glycolytic effect Effects 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 210000004408 hybridoma Anatomy 0.000 description 1
- 238000003018 immunoassay Methods 0.000 description 1
- 238000010166 immunofluorescence Methods 0.000 description 1
- 238000001114 immunoprecipitation Methods 0.000 description 1
- 238000012296 in situ hybridization assay Methods 0.000 description 1
- 238000005462 in vivo assay Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 239000007927 intramuscular injection Substances 0.000 description 1
- 238000010255 intramuscular injection Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 239000007928 intraperitoneal injection Substances 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 101150066555 lacZ gene Proteins 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 108700010826 lentivirus proteins Proteins 0.000 description 1
- 125000001909 leucine group Chemical group [H]N(*)C(C(*)=O)C([H])([H])C(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 239000012160 loading buffer Substances 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 238000003159 mammalian two-hybrid assay Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 230000030147 nuclear export Effects 0.000 description 1
- 230000006849 nucleocytoplasmic transport Effects 0.000 description 1
- 210000000287 oocyte Anatomy 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 238000012856 packing Methods 0.000 description 1
- 238000007911 parenteral administration Methods 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 229950009506 penicillinase Drugs 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000037452 priming Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 230000006916 protein interaction Effects 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- 230000012743 protein tagging Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 238000009738 saturating Methods 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 235000004400 serine Nutrition 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 239000000377 silicon dioxide Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000007929 subcutaneous injection Substances 0.000 description 1
- 238000010254 subcutaneous injection Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 108091035539 telomere Proteins 0.000 description 1
- 102000055501 telomere Human genes 0.000 description 1
- 238000011200 topical administration Methods 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 108091006106 transcriptional activators Proteins 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000011426 transformation method Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 230000029812 viral genome replication Effects 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
- 238000001086 yeast two-hybrid system Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
- C07K14/4701—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
- C07K14/4702—Regulators; Modulating activity
- C07K14/4705—Regulators; Modulating activity stimulating, promoting or activating activity
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2740/00—Reverse transcribing RNA viruses
- C12N2740/00011—Details
- C12N2740/10011—Retroviridae
- C12N2740/16011—Human Immunodeficiency Virus, HIV
- C12N2740/16311—Human Immunodeficiency Virus, HIV concerning HIV regulatory proteins
- C12N2740/16322—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
Definitions
- the present invention relates to cellular co ⁇ factors for the HIV-l Rev and the HTLV-I Rex post- transcriptional regulator proteins, DNAs encoding the same, and methods of use thereof.
- the HIV-l Rev protein is required for the nucleocytoplasmic transport, and hence translation, of a class of incompletely spliced HIV-l mRNAs that encode the viral structural proteins (B. Cullen, Microbiol . Rev . 56, 375-394 (1992)). In the absence of Rev, these late viral RNAs remain sequestered in the nucleus until they are either spliced or degraded (M. Emerman et al. , Cell 57, 1155-1165 (1989); B. Felber et al., Proc . Natl . Acad . Sci . USA 86, 1495-1499 (1989); M. Malim et al., Nature 338, 254-257 (1989)).
- Rev function is therefore essential for the production of progeny virions by HIV-l infected cells (M. Feinberg et al., Cell 46, 807-817 (1986); J. Sodroski et al., Nature 321, 412-417 (1986)).
- Rev mediated nuclear R ⁇ A export requires the direct interaction of Rev with a cis-acting, -234 nucleotide (nt) R ⁇ A stem-loop structure termed the Rev Response Element or RRE (T. Daly et al., Wature 342, 816-819 (1989); Mali et al., supra ; M. Malim et al., Cell 60, 675-683 (1990)).
- Rev first binds to a high-affinity site within the RRE as a monomer (D. Bartel et al.. Cell 67, 529-536 (1991); L. Tiley et al., Proc . Natl . Acad . Sci . USA 89, 758-762 (1992)) . Subsequently, additional Rev monomers assemble.onto the RRE in an ordered process mediated by both protein-protein and protein-RNA interactions. This multimerization event is critical for Rev function via the RRE (M. Malim and B. Cullen, Cell 65, 241-248 (1991); S. Iwai et al., Nucl. Acids Res . 20, 6465-6472 (1992); D. McDonald et al., J . Virol . 66, 7232-7238 (1992)).
- Rev activation domain a short, ⁇ 10 aa leucine-rich sequence, termed the Rev activation domain, that is dispensable for RRE binding but nevertheless essential for Rev function
- M. Malim et al., J . Virol . 65, 4248-4254 (1991) ; L. Venkatesh and G. Chinnadurai, Virology 178, 327-330 (1990) At least two lines of evidence suggest that the activation domain represents a binding site for an essential cellular Rev co-factor.
- this short sequence element is the only part of Rev whose integrity is essential for Rev function via a heterologous RNA target site (J. McDonald et al., supra) .
- a fusion protein consisting of Rev linked to the MS2 bacteriophage coat protein can induce the nuclear export of an RNA containing multiple copies of the MS2 coat protein RNA binding site. Mutations known to block the activity of the Rev RNA binding and/or multimerization sequence prevent the function of this fusion protein via the RRE yet fail to inhibit function via the MS2 RNA operator. In contrast, mutation of the Rev activation domain abrogates function via either RNA target (J. McDonald et al., supra) . These data therefore imply that the primary role of-the Rev RNA binding/multimerization domain is to facilitate the assembly of a Rev/RRE ribonucleoprotein complex that then recruits the appropriate cellular co-factor to the Rev activation motif.
- a second line of evidence indicating that the activation domain is a Rev co-factor binding domain derives from the observation that mutant Rev proteins containing an intact RNA binding/multimerization domain, but lacking a functional activation domain, exhibit a potent dominant negative phenotype in;vivo (Malim et al., supra ; Venkatesh and Chinnadurai, supra) . It has been proposed that these Rev mutants participate with wild-type Rev in the formation of the Rev/RRE ribonucleoprotein complex but then block the function of the wild-type Rev by interfering with the co-operative recruitment of a cellular co-factor.
- Rev is functional in cells derived from a wide range of eukaryotes, including primates, mice, birds, frogs, fruit flies and even yeast, implies that this unidentified cellular co-factor must have been evolutionarily conserved (M. Ivey-Hoyle and M. Rosenberg, Mol . Cell . Biol . 10, 6152-6159 (1990); Malim et al., supra ; U. Fischer et al., EMBO J . 13, 4105-4112 (1994); F. Stutz and M. Rosbash, EMBO J . 13, 4096-4104 (1994)).
- HIV-l Rev is the most intensely studied retroviral RNA transport factor
- other members of the lentivirinae also encode Rev proteins
- the effectively unrelated T-cell leukemia viruses including Human T-cell leukemia virus type I (HTLV-I)
- Rex an equivalent regulatory protein termed Rex (Cullen, supra)
- Rex an equivalent regulatory protein termed Rex (Cullen, supra)
- Rex Rex protein
- a first aspect of the invention is isolated DNA encoding a cellular co-factor for HIV Rev selected from the group consisting of:
- DNA which encodes a protein of SEQ ID NO:2 (e.g., DNA of SEQ ID NO:l);
- a second aspect of the invention is a recombinant DNA comprising vector DNA and a DNA as given above.
- a third aspect of the invention is a host cell containing a recombinant DNA as given above.
- a fourth aspect of the invention is an oligonucleotide probe capable of selectively binding to a DNA as given above (e.g., a probe that is capable of serving as a PCR extension primer; a probe labelled with a detectable group) .
- a fifth aspect of the invention is an isolated protein comprising a cellular co-factor for HIV Rev, which protein is coded for by a DNA as given above.
- a sixth aspect of the present invention is an antibody which specifically binds to a cellular co-factor for HIV Rev.
- Figure 1 shows the primary sequence of the minimal activation domains of selected Rev and Rex proteins.
- the upper portion of the figure is a schematic representation of the Rev transactivation region.
- the Rev transactivation region contains an RNA binding domain, indicated by cross-hatching, and an activation domain, which is shown in double cross-hatching.
- Amino acid sequences of the activation domains of Rev and Rex proteins from several lentiviruses are shown in the lower portion of the figure. All amino acid sequences are shown from the carboxyl to the amino terminus, going from left to right. The numbers on the left and right ends of each sequence indicate the positions of the first and last amino acids, respectively, in the activation domain. Labels on the right hand side of the sequences designate the lentivirus source of the sequences.
- the displayed sequences correspond to the activation domains of HIV-l Rev, HIV-2 Rev, VMV (Visna Maedi Virus) Rev, CAEV (Caprine Arthritis Encephalitis Virus) Rev, HTLV-I (Human T-cell Leukemia virus type I) Rex and EIAV (Equine Infectious Anemia Virus) Rev.
- the four leucine residues in the Rev activation domain and their putative homologues in the other lentivirus proteins are indicated by boxes.
- Figure 2 shows that functioning of Rev activation domain mutants in vivo is closely correlated with their ability to bind proteins of the invention (sometimes referred to herein as "Rab", for the Rev/Rex Activation Domain Binding protein) .
- the amino acid sequence shown corresponds to amino acids 69 to 90 of the HIV-l Rev protein. Missense mutants are designated by MIO, M15 to M25, M27 and M29. The amino acid substitutions in each missense mutant are indicated by a line above or below the altered amino acids, which is labeled with the name of the mutant. All mutants substituting two amino acid (MIO, M15-M18, M20-M22) have Asp-Leu in place of the wild-type sequence. Single amino acid mutations feature a substituted Asp (M19) or Ala (M27 and M29) residue while three amino acid mutations contain the inserted sequence Glu-Asp-Leu (M23 and M25) or Lys-Asp-Leu (M24) .
- Rev mutant fusion protein fusion protein
- WT wild-type Rev
- Rev activity center column
- ++ >50% WT activity
- + ⁇ 50% WT activity
- - no activity
- Rab binding of Rev mutants was assessed by expressing each Rev mutant in the yeast indicator strain GGY1::171 as a GAL4 fusion protein and determining the level of 3-gal activity induced by co-expression of the VP16-Rab fusion (right column) . Binding of each Rev mutant to Rab is expressed relative to the wild-type GAL4/Rev fusion protein, which is set at 1.00.
- Figure 3 shows the predicted primary amino acid sequence of the human Rab protein. Numbers running down the left side of the figure indicate amino acid position in the sequence. Phenylalanine residues, including the dipeptide motif "FG,” as well as runs of serines are indicated by boxes or underlining, respectively.
- Figure 4 shows a Western blot analysis of Rab protein expression in several species.
- the blot shown in the right panel was performed in the presence of soluble Rab protein to assess signal specificity.
- the source of the protein extract in each lane is designated across the top of the panels.
- the relative mobility of marker proteins of the indicated size, in kilodaltons, is given to the left of the panels.
- Figure 5 shows Rab interaction with the activation domains of multiple Rev and Rex proteins in the mammalian nucleus using a two-hybrid analysis in COS cells.
- the vertical axis indicates the VP16 fusion proteins tested (HIV Tat, HIV Rev, HLTV Rex, VMV Rev, and EIAV Rev) .
- wild-type (WT) and missense mutants (MIO, M32, and ⁇ AD) were evaluated.
- the horizontal axis indicates fold trans-activation of CAT enzyme activity detected in a COS cell culture co-transfected with the indicated VP16 fusion protein expression plasmids above that observed with pBC12/GAL4-Rab and the pG5B/CAT indicator plasmids alone.
- Data shown are representative of multiple independent transfection experiments that were each internally controlled by co-transfection of a j3-gal expression plasmid.
- FIG. 6 shows that Rab binds to the Rev:RRE ribonucleoprotein complex in vivo .
- Panel A is a schematic representation of an in vivo mono-hybrid assay for Rab binding to the Rev:RRE ribonucleoprotein complex.
- the Tat/Rab fusion protein is indicated by boxes.
- the Rev protein is shown bound to the SLIIB RNA target sequence. Arrows indicate that Tat will transactivate the HIV-l LTR only if Rab recruits the TAT/Rab fusion protein to the SLIIB-bound Rev protein.
- Panel B shows activation of the SLIIB/CAT indicator construct in HeLa cells upon co-transfection of the expression plasmids designated on the vertical axis. The horizontal axis indicates the fold transactivation of HIV-l LTR driven pSLIIB/CAT expression when cells were co-transfected with the indicated expression plasmids.
- Figure 7 shows the effects of overexpressing Rab on HIV-l Rev and HTLV-I Rex function in COS cells.
- the vertical axis indicates that COS cells were transfected with the Rev-defective HIV-l provirus expression plasmid HIV ⁇ REV, either alone or in the presence of the HIV-l Rev expression plasmid pcRev or the HTLV-I Rex expression plasmid pcRex.
- Co-transfections with pcrev and pcrex were done in the presence of an ⁇ 20-fold molar excess of a plasmid expressing either the full-length Rab protein or expressing CAT or /3-gal as negative control proteins.
- the horizontal axis shows the concentrations of HIV-l P24 capsid protein secreted into the culture medium. Data are representative of three independent experiments.
- Amino acid sequences disclosed herein are presented in the amino to carboxy direction, from left to right. The amino and carboxy groups are not presented in the sequence. Nucleotide sequences are presented herein by single strand only, in the 5' to 3' direction, from left to right. Nucleotides and amino acids are represented herein in the manner recommended by the IUPAC-IUB Biochemical Nomenclature Commission, or (for amino acids) by three letter code, in accordance with 37 CFR ⁇ 1.822 and established usage. See, e.g., Patentin User Manual, 99-102 (Nov. 1990) (U.S. Patent and Trademark Office, Office of the Assistant Commissioner for Patents, Washington, D.C. 20231); U.S. Patent No.
- DNAs of the present invention that code for cellular cofactors of HIV-l REV and/or HTLV-I Rex may be of any species of origin, including mouse, rat, rabbit, cat, and human, but preferably code for proteins of mammalian origin.
- DNA sequences which hybridize to a DNA encoding a protein having the sequence given in SEQ ID NO:2 e.g., a DNA having the nucleotide sequence given in SEQ ID NO:l
- Conditions which will permit other DNA sequences which code for expression of such a protein to hybridize to said sequence can be determined in a routine manner.
- hybridization of such sequences may be carried out under conditions of reduced stringency or even stringent conditions (e.g., conditions represented by a wash stringency of 0.3 Molar NaCl, 0.03 M sodium citrate, 0.1% SDS at 60°C or even 70°C to DNA encoding the rat serotonin transporter disclosed herein in a standard in situ hybridization assay.
- stringent conditions e.g., conditions represented by a wash stringency of 0.3 Molar NaCl, 0.03 M sodium citrate, 0.1% SDS at 60°C or even 70°C to DNA encoding the rat serotonin transporter disclosed herein in a standard in situ hybridization assay.
- sequences which code for a protein of the invention and hybridize to the DNA encoding the protein of the invention disclosed herein as SEQ ID NO:2 will be at least 75% homologous, 85% homologous, or even 95% homologous or more with the sequence of the DNA encoding the protein of SEQ ID NO:2.
- DNA sequences which code for polypeptides coded of the protein given in SEQ ID NO:2, or sequences which hybridize to the DNA encoding the same and code for a protein of the invention, but which differ in codon sequence from these due to the degeneracy of the genetic code are also an aspect of this invention.
- Proteins of the invention are, in general, cellular co-factors for HIV-l Rev and/or HTLV-l Rex post-transcriptional regulator proteins, and specifically bind to HIV-l Rev and/or HTLV-l Rex at the same binding site bound by the protein of SEQ ID NO:2, and may have the same biological activity thereof.
- a vector is a replicable DNA construct.
- Vectors are used herein either to amplify DNA encoding proteins of the invention as given herein and/or to express DNA which encodes proteins of the invention as given herein.
- An expression vector is a replicable DNA construct in which a DNA sequence encoding a protein of the invention is operably linked to suitable control sequences capable of effecting the expression of the receptor in a suitable host. The need for such control sequences will vary depending upon the host selected and the transformation method chosen. Generally, control sequences include a transcriptional promoter, an optional operator sequence to control transcription, a sequence encoding suitable mRNA ribosomal binding sites, and sequences which control the termination of transcription and translation.
- a plificatiQn vectors do not require expression control domains. All that is needed is the ability to replicate in a host, usually conferred by an origin of replication, and a selection gene to facilitate recognition of transformants.
- Vectors comprise plasmids, viruses (e.g., adenovirus, cytomegalovirus) , phage, and integratable DNA fragments (i.e., fragments integratable into the host genome by recombination) .
- the vector replicates and functions independently of the host genome, or may, in some instances, integrate into the genome itself.
- Expression vectors should contain a promoter and RNA binding sites which are operably linked to the gene to be expressed and are operable in the host organism. DNA regions are operably linked or operably associated when they are functionally related to each other. For example, a promoter is operably linked to a coding sequence if it controls the transcription of the sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to permit translation.
- Transformed host cells are cells which have been transformed or transfected with vectors containing a DNA sequence as disclosed herein constructed using recombinant DNA techniques. Transformed host cells ordinarily express the receptor, but host cells transformed for purposes of cloning or amplifying the receptor DNA do not need to express the receptor.
- Suitable host cells include prokaryote, yeast or higher eukaryotic cells such as mammalian cells and insect cells.
- Cells derived from multicellular organisms are a particularly suitable host for recombinant protein synthesis, and mammalian cells are particularly preferred. Propagation of such cells in cell culture has become a routine procedure (Tissue Culture, Academic Press, Kruse and Patterson, editors (1973)).
- useful host cell lines are VERO and HeLa cells, Chinese hamster ovary (CHO) cell lines, and WI138, BHK, COS-7, CV, and MDCK cell lines.
- Expression vectors for such cells ordinarily include (if necessary) an origin of replication, a promoter located upstream from the DNA encoding the protein of the invention to be expressed and operatively associated therewith, along with a ribosome binding site, an RNA splice site (if intron-containing genomic DNA is used) , a polyadenylation site, and a transcriptional termination sequence.
- the transcriptional and translational control sequences in expression vectors to be used in transforming ordinarily include (if necessary) an origin of replication, a promoter located upstream from the DNA encoding the protein of the invention to be expressed and operatively associated therewith, along with a ribosome binding site, an RNA splice site (if intron-containing genomic DNA is used) , a polyadenylation site, and a transcriptional termination sequence.
- 'vertebrate cells are often provided by viral sources.
- viral sources For example, commonly used promoters are derived from polyoma,
- Adenovirus 2 and Simian Virus 40 (SV40) . See, e.g.. U.S. Patent No. 4,599,308.
- SV40 Simian Virus 40
- An origin of replication may be provided either by construction of the vector to include an exogenous origin, such as may be derived from SV 40 or other viral (e.g. Polyoma, Adenovirus, VSV, or BPV) source, or may be provided by the host cell chromosomal replication mechanism. If the vector is integrated into the host cell chromosome, the latter is often sufficient. Rather than using vectors which contain viral origins of replication, one can transform mammalian cells by the method of cotransformation with a selectable marker and the receptor DNA. Examples of suitable selectable markers are dihydrofolate reductase (DHFR) or thymidine kinase. This method is further described in U.S. Pat. No. 4,399,216.
- DHFR dihydrofolate reductase
- thymidine kinase thymidine kinase
- baculovirus expression vector e.g., vectors derived from Autographa californica MNPV, Trichoplusia ni MNPV, Rachiplusia ou MNPV, or Galleria ou MNPV
- baculovirus expression vector e.g., vectors derived from Autographa californica MNPV, Trichoplusia ni MNPV, Rachiplusia ou MNPV, or Galleria ou MNPV
- a baculovirus expression vector comprises a baculovirus genome containing the gene to be expressed inserted into the polyhedrin gene at a position ranging from the polyhedrin transcriptional start signal to the ATG start site and under the transcriptional control of a baculovirus polyhedrin promoter.
- Prokaryote host cells include gram negative or gram positive organisms, for example Escherichia coli (E. coli) or Bacilli. Higher eukaryotic cells include established cell lines of mammalian origin as described below. Exemplary host cells are E. coli W3110 (ATCC 27,325), E. COli B, E. COU X1776 (ATCC 31,537), E. coli 294 (ATCC 31,446). A broad variety of suitable prokaryotic and microbial vectors are available. E. coli is typically transformed using pBR322.
- Promoters most commonly used in recombinant microbial expression vectors include the beta- lactamase (penicillinase) and lactose promoter systems (Chang et al., Nature 275. 615 (1978); and Goeddel et al. , Nature 281, 544 (1979)), a tryptophan (trp) promoter system (Goeddel et al., Nucleic Acids Res. 8., 4057 (1980) and EPO App. Publ. No. 36,776) and the tac promoter (H. De Boer et al., Proc. Natl. Acad. Sci. USA 80. 21 (1983)).
- the promoter and Shine-Dalgarno sequence are operably linked to the DNA encoding the protein of the invention, i.e., they are positioned so as to promote transcription of messenger RNA from the DNA.
- Eukaryotic microbes such as yeast cultures may also be transformed with vectors carrying the isolated DNA's disclosed herein. see, e.g.. U.S. Patent No. 4,745,057. Saccharomyces cerevisiae is the most commonly used among lower eukaryotic host microorganisms, although a number of other strains are commonly available.
- Yeast vectors may contain an origin of replication from the 2 micron yeast plasmid or an autonomously replicating sequence (ARS) , a promoter, DNA encoding the receptor as given herein, sequences for polyadenylation and transcription termination, and a selection gene.
- An exemplary plasmid is YRp7, (Stinchcomb et al., Nature 282.
- Suitable promoting sequences in yeast vectors include the promoters for metallothionein, 3-phosphoglycerate kinase (Hitzeman et al., J. Biol. Chem. 255, 2073 (1980) or other glycolytic enzymes (Hess et al., J. Adv. Enzyme Reg. 1_, 149 (1968); and Holland et al., Biochemistry 17, 4900 (1978)). Suitable vectors and promoters for use in yeast expression are further described in R. Hitzeman et al., EPO Publn. No. 73,657. Proteins of the present invention may be isolated and/or purified from natural sources or recombinant sources as given above in accordance with conventional techniques, optionally employing antibodies that specifically bind the proteins as given below.
- detectable groups can be employed to label antibodies and probes as disclosed herein, and the term "labelled" is used herein to refer to the conjugating or covalent bonding of any suitable detectable group, including enzymes (e.g., horseradish peroxidase, ⁇ - glucuronidase, alkaline phosphatase, and ⁇ -D- galactosidase) , fluorescent labels (e.g., fluorescein, luciferase), and radiolabels (e.g., 1 C, 13l I, 3 H, 32 P, and 35 S) to the compound being labelled.
- enzymes e.g., horseradish peroxidase, ⁇ - glucuronidase, alkaline phosphatase, and ⁇ -D- galactosidase
- fluorescent labels e.g., fluorescein, luciferase
- radiolabels e.g., 1 C, 13l I, 3 H, 32 P, and 35 S
- Oligonucleotide probes of the instant invention may be of any suitable length, depending on the specific application thereof.
- such probes may, in general, be 6, 8 or 12 nucleotides in length to 16, 20, or 30 nucleotides in length or more.
- Such probes are useful for identifying and making DNAs encoding proteins of the invention.
- Antibodies which specifically bind to the proteins of the invention may be polyclonal or monoclonal in origin, but are preferably of monoclonal origin. Such antibodies are useful for the affinity purification of the proteins of the invention, and for the identification; and assay of the proteins in human tissue samples.
- the antibodies may be of any suitable species, such as rat, rabbit, or horse, but are generally of mammalian origin.
- the antibodies may be of any suitable immunoglobulin, such as IgG and IgM.
- antibody Fragments of antibodies which retain the ability to specifically bind the proteins of the invention, such as F(ab') 2/ F(ab'), and Fab fragments, are intended to be encompassed by the term "antibody” herein.
- the antibodies may be chimeric, as described by M. Walker et al., Molecular Immunol . 26, 403 (1989) .
- Antibodies may be immobilized on a solid support of the type used as a packing in an affinity chromatography column, such as sepharose, silica, or glass beads, in accordance with known techniques.
- Monoclonal antibodies which bind to the proteins of the invention are made by culturing a cell or cell line capable of producing the antibody under conditions suitable for the production of the antibody (e.g., by maintaining the cell line in HAT media) , and then collecting the antibody from the culture (e.g., by precipitation, ion exchange chromatography, affinity chromatography, or the like) .
- the antibodies may be generated in a hybridoma cell line in the widely used procedure described by G. Kohler and C. Milstein, Nature 256, 495 (1975) , or may be generated with a recombinant vector in a suitable host cell such as Escherichia coli in the manner described by W. Huse et al., Generation of a Large Combinatorial Library of the Immunoglobulin Repertoire in Phage Lambda, Science 246, 1275 (1989).
- DNAs of the invention are useful for making proteins of the invention, as described above.
- Proteins of the invention have a variety of uses. For example, since proteins of the invention specifically bind to HIV-l Rev and HTLV-I Rex (and other Lentivirus and HTLV family Rev's), they are useful in combination therewith as a member of a specific binding pair. Specific binding pairs are useful for a variety of purposes, such as in various immunoassay systems.
- DNAs and proteins of the invention are useful in methods of screening for compounds that affect the binding of a Lentivirus Rev protein or an HTLV family Rev protein with the cellular co-factor thereof.
- Such compounds are useful in treating or inhibiting the growth of the corresponding virus (e.g., HIV-l, HIV-2, HTLV-I, HTLV-II, bovine leukemia virus (or "BLV") .
- such methods comprise providing together (e.g., in vitro in an aqueous solution; in vivo in a cell) a first protein and a second protein so that the first and second protein form a complex thereof, wherein the first protein is selected from the group consisting of Lentivirus Rev proteins and HTLV family Rev proteins (e.g., Rev proteins or Rev protein counterparts from HIV-l, HIV-2, HTLV-I, HTLV-II, bovine leukemia virus (or "BLV”), and equine infectious anemia virus) , and wherein the second protein is encoded by a DNA encoding a co-factor of the invention as given above.
- Lentivirus Rev proteins and HTLV family Rev proteins e.g., Rev proteins or Rev protein counterparts from HIV-l, HIV-2, HTLV-I, HTLV-II, bovine leukemia virus (or "BLV"), and equine infectious anemia virus
- a test compound is combined with the first and second protein (e.g., by adding it to the aqueous solution or adding to the media containing the cells) and the influence of the test compound on the formation of the complex is detected (for example, by detecting the association or dissociation constant of the complex, the stability of the complex, etc.).
- Compounds that adversely affect complex formation are candidate compounds for treating or inhibiting the growth of the lentivirus or HTLV family virus.
- any suitable host cell may be employed as described above (e.g., yeast cells; insect cells). Also disclosed herein are recombinant cell useful for carrying out screening methods as given above.
- such cells are made and used in accordance with the two-hybrid interaction trap method. See, e.g., Fields and Song, Nature 340, 245-246 (1989); Fridell et al., Virology 209, 347-357 (1995)) .
- Such cells comprise a first DNA that expresses a first fusion protein comprising a DNA binding domain and a first binding partner; a second DNA that expresses a second fusion protein comprising a second binding partner and a transcription activation domain; and a third DNA comprising a binding site operatively associated with a reporter gene, wherein the DNA binding domain specifically binds to the binding site, so that the reporter gene is expressed when the first and second fusion proteins bind to one another (and wherein the reporter gene is not significantly expressed when the first and second binding proteins are not bound to one another, so that the binding of the two may be detected through the activation of the reporter gene) .
- the first binding partner is either
- any suitable host cell as given above may be employed, with yeast cells particularly preferred.
- Any suitable binding site and corresponding binding domain specifically bound thereby may be employed, with one example being a Gal4 binding site and a Gal4 binding domain.
- Any suitable transcription activation domain may be employed, including, but not limited to, the Gal4 transcription activation domain, the VP16 transcription activation domain, and the TAT transcription activation domain.
- Such methods comprise introducing into a cell infected with the virus (e.g., by administering to an animal infected with the virus in need of such treatment) an amount of a compound that inhibits the formation of a complex between a lentivirus Rev protein or an HTLV family Rev protein with the cellular co-factor thereof effective to inhibit the growth of the virus.
- Such compounds are identified by the methods described above.
- Animals that may be treated by the method of the invention include both human and animal subjects (e.g., horse, bovine, or any other animal infected by the corresponding virus) .
- Administration of the compound may be carried out by any suitable means, including parenteral administration (e.g., intraveneous, intraperitoneal, intramuscular, and subcutaneous injection) , topical administration, and oral administration.
- parenteral administration e.g., intraveneous, intraperitoneal, intramuscular, and subcutaneous injection
- topical administration e.g., topical administration
- oral administration e.g., topical administration
- the compounds may optionally be encapsulated into a liposome to facilitate their transport into cells. Dosage of the active compound may be determined by routine experimentation in animal models in accordance with known techniques. The present invention is explained in greater detail in the following non-limiting examples.
- yeast expression plasmid pGAL4-Rev encodes a fusion protein consisting of the GAL4 DNA binding domain linked to the full-length HIV-l Rev protein (R. Fridell et al.. Virology 209, 347-357 (1995)).
- a series of 14 missense mutations, in or near the HIV-l Rev activation domain, were generated in the PGAL4-Rev context by replacement of the wild-type Rev CDNA sequence (Xbal to EcoRI) with polymerase chain reaction (PCR) generated CDNA fragments bearing the previously described mutations (M.
- the plasmid PGAL4-Rex was constructed by substituting amino acids 2 to 189 of HTLV-I Rex in place of the Rev sequence present in PGAL4-Rev.
- the Y190 yeast indicator strain The preparation of the oligo-dT primed, pVP16 based CEM CDNA library and the methodology used to screen for interacting proteins have been described (J. Harper et al., Cell 75, 805-816 (1993);
- molecular clones were sequenced using the dideoxy chain termination method and the sequenase version 2.0 sequencing kit (United States Biochemical) .
- the Rab CDNA insert was sequenced by the same method using both dGTP and dITP.
- the pGEX-4T plasmid was used to express a fusion protein consisting of GST linked to amino acids 101 to 562 of the Rab open reading frame (ORF) ( Figure 3) in the BL21
- the agarose coupled MBP-Rab fusion protein was used to affinity purify Rab-specific antibodies from the serum of the GST-Rab injected rabbits using buffers and procedures detailed in the Amino Link Immobilization Kit (Pierce) .
- the resultant Rab-specific rabbit antiserum was! concentrated and dialyzed against phosphate buffered saline (PBS) prior to use.
- PBS phosphate buffered saline
- the tissue culture cell lines used to make protein extracts were HeLa (human) , C127 (mouse) , QC13 (quail) and Schneider 2 (Drosophila) .
- Frog protein extracts were prepared from Xenopus oocytes. Cells were washed with PBS and then resuspended in 100 11 of PBS. After addition of 100 11 of 2X Laemmli gel loading buffer containing 2-mercaptoethanol, the samples were sonicated, boiled and centrifuged to remove debris. Soluble proteins were separated by 12% SDS-PAGE, transferred to a nitrocellulose filter and then incubated with a 1:20,000 dilution of the affinity purified rabbit Rab antiserum. After vigorous washing, bound antibodies were detected using a horseradish peroxidase-conjugated goat anti-rabbit antiserum and enhanced chemiluminescence (Amersham) .
- Immunofluorescence analysis was performed essentially as previously described (M. Malim et al.. Cell 58, 205-214 (1989A)) .
- CV-1 cells were fixed and permeabilized prior to incubation with a 1:1000 dilution of the rabbit anti-Rab antiserum. After extensive washing, the fixed cells were incubated with a 1:100 dilution of an affinity purified,rhodamine-conjugated goat anti-rabbit antiserum (Boehringer Mannheim) and examined using a Zeiss Axioskop immunofluorescence microscope.
- both the GST/Rev and the GST/M10 fusion proteins contain the full-length Rev ORF, in the former case containing an intact activation domain and in the latter the defective, MIO mutant form of the activation domain (M. Malim et al., Cell 58, 205-214 (1989A)).
- both Rev fusion proteins also contain the previously described M6 missense mutation of the Rev RNA binding domain (M. Malim et al., Cell 58, 205-214 (1989A) ) .
- This mutation which affects amino acids 41 to 44 of Rev, is located well outside the Rev activation domain ( Figure 1) . However, inclusion of this mutation markedly increases the yield of full-length Rev protein upon expression in E . coli (data not shown) .
- a DNA fragment encoding the Rab ORF (amino acids 1 to 562, Ncol to Xhol) was cloned into the pGEM3ZF(+) expression plasmid and 35S-labeled Rab protein prepared in a 200 11 rabbit reticulocyte lysate coupled transcription-translation reaction (Promega) using T7 RNA polymerase and 35S-methionine/35S-cysteine.
- the lysate was then diluted 1:5 with chromatography buffer (CB) (10 Mm Hepes Ph 7.5, 0.1 M NaCl, ImM EDTA, 1 Mm DTT, 10% glycerol, 2mg/ml BSA, 0.5% NP40 and 0.25 Mm PMSF) and equal amounts loaded onto the immobilized GST, GST/Rev and GST/M10 affinity columns. These were then washed with 10 column volumes of CB before bound proteins were eluted with 4 column volumes of 0.1 M glycine, Ph 2.8. Eluted proteins were concentrated and dialyzed against CB prior to analysis by 12% SDS/PAGE.
- CB chromatography buffer
- the introduced mutations are: HIV Rev M10, LG (78,79) to DL; HIV Rev M32, L 78, 81 and 83 all to A; RexDAD, LSLD (90-93) to GGGG; VMV RevDAD, LE (114, 115) to DL; EIAV RevDAD, L49 and 154 to A.
- COS cell cultures (100 mm) were transfected with 1 lg
- the PSLIIB indicator construct and plasmids expressing wild-type (pcrev) and mutant (pM10, pM32) forms of Rev have been described (M. Malim et al., J . Virol . 65, 4248-4254 (1991); L. Tiley et al., Genes Dev . 6, 2077-2087 (1992A) ) .
- the pcTat/Rab plasmid expresses a fusion protein consisting of the full-length Tat protein linked to the first amino acid of the Rab ORF indicated in Figure 3.
- HeLa cultures 35 mm were transfected with 1 lg of the PSLIIB/CAT reporter plasmid, 0.5 lg each of the Tat/Rab fusion protein and Rev protein expression plasmid and 1 lg of carrier DNA using the calcium phosphate procedure.
- the parental PBC12/CMV expression plasmid was used as a negative control.
- Cultures were harvested at -48 hrs after transfection and CAT activity quantified (R. Fridell et al., Virology 209, 347-357 (1995)). HIV-l virus rescue HIV-l provirus rescue assays were performed in COS cell cultures essentially as described (M. Malim et al., J. Virol . 65, 4248-4254 (1991)).
- the Rev- provirus expression plasmid pHIVDRev and the pcrev, pcrex, PBC12/CMV/SEAP, PBC12/CMV/CAT and PBC12/CMV//3gal expression plasmids have been described (L. Rimsky et al., Nature 335, 738-740 (1988); M. Malim et al., J. Virol . 65, 4248-4254 (1991)).
- the Rab expression plasmid PBC12/CMV/Rab contains the 562 amino acid Rab ORF indicated in Figure 3 (Ncol to Xhol) cloned into the expression plasmid PBC12/CMV. Transfections were performed as described in Figure 7.
- the screened library consisted of the VP16 transcription activation domain fused to CDNA sequences derived from the human CEM T-cell line. This screen led to the identification of 3 clones that encoded VP16 fusion proteins that specifically interacted with the GAL4/Rev bait protein, and 10 clones expressing VP16 fusion proteins that specifically bound GAL4/Rex. None of the three Rev specific clones were able to interact with GAL4/Rex while only one of the ten Rex specific clones could bind to GAL4/Rev. Further experiments demonstrated that this interaction was entirely blocked by the introduction of point mutations into the activation domains of either Rev or Rex (data not shown) .
- Rev mutants was cloned into a yeast GAL4 fusion protein expression plasmid and individually tested for their ability to interact with the VP16/Rab fusion protein in the yeast cell nucleus, as assessed by the level of activation of an integrated lacZ indicator gene.
- Each of these mutant GAL4/Rev fusion proteins was equivalently stable in yeast, as measured by western blot analysis (data not shown) .
- Rev mutants that entirely lack effector domain function, including two mutants (M27 and M29) that bear only single amino acid changes in Rev, all proved entirely unable to interact with the Rab fusion protein.
- all of the Rev mutants previously shown (M. Malim et al., J. Virol . 65, 4248-4254 (1991)) to exhibit substantially wild-type Rev activity in vivo also induced substantial levels of j3-gal activity, ranging from a minimum of one quarter to a maximum of -3 times the level seen with wild-type Rev. Strikingly, the three Rev mutants (M18, M20 and M24) previously shown (M. Malim et al., J. Virol .
- the Rab CDNA clone contains a 2584 base pair insert flanked 3' by a stretch of A residues, consistent with priming at the mRNA poly(A) tail. Starting at the first in frame methionine residue, this CDNA contains an open reading frame (ORF) of 562 amino acids which would be predicted to encode a protein of -58 kD ( Figure 3) . Computer analysis of available sequence data bases failed to identify any proteins displaying significant homology to the predicted Rab ORF. Similarly, we were also unable to identify any nucleic acid sequences that displayed significant homology to Rab except for two short, unidentified "expressed sequence tags.” Rab is therefore a novel human gene.
- Rab protein sequence is the high concentration of phenylalanine residues, including ten in the form of the dipeptide motif "FG," as well as several runs of serine residues, found concentrated towards the carboxyl-terminus of Rab. The potential significance of these is discussed below.
- the Rab CDNA insert was used to probe a northern blot of MRNA derived from the human T-cell line CEM (the origin of the CDNA clone) and the human cell line HeLa. In both cases, a prominent band of -2.8 kilobases (kb) was observed (data not shown) . Both cell lines also expressed a hybridizing band of -4.6 kb, although this was faint in the CEM cells.
- Rev and Rex are functional in a wide range of animal cells (M. Ivey-Hoyle and M. Rosenberg, Mol . Cell . Biol . 10, 6152-6159 (1990); M. Malim and B, Cullen, Cell 65, 241-248 (1991); U. Fischer et al., EMBO J. 13, 4105-4112 (1994)). Therefore, it is predicted that the cellular co-factor for Rev and Rex should be conserved across species boundaries. To test this hypothesis, we performed a western analysis on cellular protein extracts of mammalian, avian, amphibian and invertebrate origin using an affinity purified polyclonal rabbit anti-Rab antiserum (Figure 4) .
- Tat RNA sequence specific transcriptional activator to provide an indirect measure of the assembly of a protein complex onto an RNA target sequence ( Figure 6A) .
- Tat activates gene expression from the HIV-l LTR after binding a cis-acting RNA target sequence, termed TAR, that forms the first 59 nt of all HIV-l transcripts (B. Cullen, Microbiol . Rev. 56, 375-394 (1992)).
- RNA stem-loop is replaced with the RRE-derived stem-loop IIB (SLIIB) minimal RNA target sequence for HIV-l Rev
- SLIIB RRE-derived stem-loop IIB
- trans-activation of the HIV-l LTR can only be observed if Tat is fused to an HIV-l Rev protein bearing an intact RNA binding domain (L. Tiley et al., Proc . Natl . Acad . Sci . USA 89, 758-762 (1992B) ) .
- the assay delinea.ted in Fig. 6A takes this approach one step further.
- the TAR element is again replaced by RRE SLIIB in an indicator construct in which the HIV-l LTR is linked to CAT (PSLIIB/CAT) .
- Rev is expressed in its wild-type form while Tat is expressed as a Rab fusion protein. Only if Rab can induce the efficient recruitment of this Tat/Rab fusion protein to the RRE-bound Rev protein will Tat be brought to the HIV-l LTR promoter element and, hence, be able to activate HIV-l LTR driven CAT expression (Figure 6A) .
- Rab is the authentic Rev co-factor
- overexpression of Rab might promote the recruitment of Rab to the RRE RNA target by sub-optimal levels of Rev protein, leading to an increase in Rev activity.
- the HTLV-I Rex protein which can also act via the HIV-l RRE element (L. Rimsky et al., Nature 335, 738-740 (1988)), might also display enhanced activity in the presence of Rab if this protein is indeed the authentic Rev/Rex co-factor.
- TTC CCC AAA TCC TCC AGT GCT GAT TTT GGA ACC TTC AAT ACT TCC CAG 1152 Phe Pro Lys Ser Ser Ser Ala Asp Phe Gly Thr Phe Asn Thr Ser Gin 290 295 300
- GGA TTC GGC ACT CCT GCT CCC 1680 Gly Thr Ala Ser Met Ser Met Pro Thr Gly Phe Gly Thr Pro Ala Pro 465 470 475
- GGTCACATTA CATCTCTCCA CCTCTTGCAC TGTTGTCTTG TTTCACTGAT CTTAGCTTTA 2029
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Gastroenterology & Hepatology (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Zoology (AREA)
- Toxicology (AREA)
- Virology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
Abstract
DNAs encoding cellular co-factors for the HIV-1 Rev and the HTLV-I Rex post-transcriptional regulator proteins are disclosed. Also disclosed are proteins produced from the DNAs and methods of using the same.
Description
Cellular Co-Factor for HIV Rev and HTLV Rex
Field ofthe Invention
The present invention relates to cellular co¬ factors for the HIV-l Rev and the HTLV-I Rex post- transcriptional regulator proteins, DNAs encoding the same, and methods of use thereof.
Background ofthe Invention
The HIV-l Rev protein is required for the nucleocytoplasmic transport, and hence translation, of a class of incompletely spliced HIV-l mRNAs that encode the viral structural proteins (B. Cullen, Microbiol . Rev . 56, 375-394 (1992)). In the absence of Rev, these late viral RNAs remain sequestered in the nucleus until they are either spliced or degraded (M. Emerman et al. , Cell 57, 1155-1165 (1989); B. Felber et al., Proc . Natl . Acad . Sci . USA 86, 1495-1499 (1989); M. Malim et al., Nature 338, 254-257 (1989)). Rev function is therefore essential for the production of progeny virions by HIV-l infected cells (M. Feinberg et al., Cell 46, 807-817 (1986); J. Sodroski et al., Nature 321, 412-417 (1986)). Rev mediated nuclear RΝA export requires the direct interaction of Rev with a cis-acting, -234 nucleotide (nt) RΝA stem-loop structure termed the Rev Response Element or RRE (T. Daly et al., Wature 342, 816-819 (1989); Mali et al., supra ; M. Malim et al., Cell 60, 675-683 (1990)). It has been proposed that Rev first binds to a high-affinity site within the RRE as a monomer (D. Bartel et al.. Cell 67, 529-536 (1991); L. Tiley et
al., Proc . Natl . Acad . Sci . USA 89, 758-762 (1992)) . Subsequently, additional Rev monomers assemble.onto the RRE in an ordered process mediated by both protein-protein and protein-RNA interactions. This multimerization event is critical for Rev function via the RRE (M. Malim and B. Cullen, Cell 65, 241-248 (1991); S. Iwai et al., Nucl. Acids Res . 20, 6465-6472 (1992); D. McDonald et al., J . Virol . 66, 7232-7238 (1992)).
Mutational analysis of the 116 amino acid (aa) Rev transactivator has identified two functional domains. A more amino terminal basic domain acts as both a nuclear/nucleolar localization signal and as a sequence specific RNA binding domain (M. Malim et al., Cell 58, 205-214 (1989A); A. Cochrane et al., J . Virol . 64, 881-885 (1990); J. Kje s et al., EMBO J. 11, 1119-1129 (1992)). This domain is flanked by sequences that are essential for efficient Rev multimerization (Malim and Cullen, supra) . Towards the carboxy-terminus of Rev lies a short, ~10 aa leucine-rich sequence, termed the Rev activation domain, that is dispensable for RRE binding but nevertheless essential for Rev function (M. Malim et al., J . Virol . 65, 4248-4254 (1991) ; L. Venkatesh and G. Chinnadurai, Virology 178, 327-330 (1990)). At least two lines of evidence suggest that the activation domain represents a binding site for an essential cellular Rev co-factor. Most importantly, this short sequence element is the only part of Rev whose integrity is essential for Rev function via a heterologous RNA target site (J. McDonald et al., supra) . Thus, it has been demonstrated that a fusion protein consisting of Rev linked to the MS2 bacteriophage coat protein can induce the nuclear export of an RNA containing multiple copies of the MS2 coat protein RNA binding site. Mutations known to block the activity of the Rev RNA binding and/or multimerization sequence prevent the function of this fusion protein via the RRE yet fail to inhibit function via the MS2 RNA operator. In contrast, mutation of the Rev activation domain abrogates function
via either RNA target (J. McDonald et al., supra) . These data therefore imply that the primary role of-the Rev RNA binding/multimerization domain is to facilitate the assembly of a Rev/RRE ribonucleoprotein complex that then recruits the appropriate cellular co-factor to the Rev activation motif.
A second line of evidence indicating that the activation domain is a Rev co-factor binding domain derives from the observation that mutant Rev proteins containing an intact RNA binding/multimerization domain, but lacking a functional activation domain, exhibit a potent dominant negative phenotype in;vivo (Malim et al., supra ; Venkatesh and Chinnadurai, supra) . It has been proposed that these Rev mutants participate with wild-type Rev in the formation of the Rev/RRE ribonucleoprotein complex but then block the function of the wild-type Rev by interfering with the co-operative recruitment of a cellular co-factor. The observation that Rev is functional in cells derived from a wide range of eukaryotes, including primates, mice, birds, frogs, fruit flies and even yeast, implies that this unidentified cellular co-factor must have been evolutionarily conserved (M. Ivey-Hoyle and M. Rosenberg, Mol . Cell . Biol . 10, 6152-6159 (1990); Malim et al., supra ; U. Fischer et al., EMBO J . 13, 4105-4112 (1994); F. Stutz and M. Rosbash, EMBO J . 13, 4096-4104 (1994)).
While HIV-l Rev is the most intensely studied retroviral RNA transport factor, other members of the lentivirinae also encode Rev proteins, while the effectively unrelated T-cell leukemia viruses, including Human T-cell leukemia virus type I (HTLV-I) , encode an equivalent regulatory protein termed Rex (Cullen, supra) . For many of these other Rev and Rex proteins, it has proven possible to mutationally define short protein motifs that can functionally substitute for the activation domain of HIV-l Rev (T. Hope et al., J . Virol . 65, 6001-6007 (1991); L. Tiley et al., J. Virol . 65, 3877-3881 (1991)). In the case of the other primate lentiviruses, as well as the more
divergent ovine/caprine lentiviruses Visna Maedi Virus (VMV) and Caprine Arthritis Encephelitis Virus- (CAEV) , the Rev activation domain has been found to be similar in size and composition to that present in HIV-l Rev (Malim et al. , supra ; Tiley et al., supra). However, both the Rev protein encoded by the lentivirus Equine Infectious Anemia Virus (EIAV) and the Rex protein of HTLV-I, share no significant sequence homology with HIV-l Rev and contain larger, divergent activation domains (Hope et al., supra ; I. Weichselbraun et al. J. Virol . 66, 2583-2587 (1992); R. Fridell et al., Rev . J. Virol . 67, 7317-7323 (1993); V. Mancuso et al., J. Virol . 68, 1998-2001 (1994)). In the case of HTLV-I Rex, this -15 amino acid sequence can be at least partially aligned with the relevant HIV-l Rev sequence. In contrast, the -18 aa activation domain of EIAV Rev displays no evident sequence similarity to the -10 aa HIV-l Rev activation motif, even though it can effectively substitute for the latter in mediating HIV-l Rev function.
Summary ofthe Invention
A first aspect of the invention is isolated DNA encoding a cellular co-factor for HIV Rev selected from the group consisting of:
(a) DNA which encodes a protein of SEQ ID NO:2 (e.g., DNA of SEQ ID NO:l);
(b) DNA which hybridizes to DNA of (a) above under stringent conditions represented by a wash stringency of 0.3 Molar NaCl, 0.03 M sodium citrate, 0.1% SDS at 60°C, and which encodes a cellular co-factor for HIV Rev; and (c) DNA differing from the DNAs of (a) and (b) above in codon sequence due to the degeneracy of the genetic code, and which encodes a cellular co-factor for HIV Rev.
A second aspect of the invention is a recombinant DNA comprising vector DNA and a DNA as given above.
A third aspect of the invention is a host cell containing a recombinant DNA as given above.
A fourth aspect of the invention is an oligonucleotide probe capable of selectively binding to a DNA as given above (e.g., a probe that is capable of serving as a PCR extension primer; a probe labelled with a detectable group) .
A fifth aspect of the invention is an isolated protein comprising a cellular co-factor for HIV Rev, which protein is coded for by a DNA as given above.
A sixth aspect of the present invention is an antibody which specifically binds to a cellular co-factor for HIV Rev.
The foregoing and other objects and aspects of the present invention are explained in detail in the drawings herein and the specification set forth below.
BriefDescription ofthe Drawings
Figure 1 shows the primary sequence of the minimal activation domains of selected Rev and Rex proteins. The upper portion of the figure is a schematic representation of the Rev transactivation region. The Rev transactivation region contains an RNA binding domain, indicated by cross-hatching, and an activation domain, which is shown in double cross-hatching. Amino acid sequences of the activation domains of Rev and Rex proteins from several lentiviruses are shown in the lower portion of the figure. All amino acid sequences are shown from the carboxyl to the amino terminus, going from left to right. The numbers on the left and right ends of each sequence indicate the positions of the first and last amino acids, respectively, in the activation domain. Labels on the right hand side of the sequences designate the lentivirus source of the sequences. Going from top to bottom, the displayed sequences correspond to the activation domains of HIV-l Rev, HIV-2 Rev, VMV (Visna Maedi Virus) Rev, CAEV (Caprine Arthritis Encephalitis Virus) Rev, HTLV-I (Human
T-cell Leukemia virus type I) Rex and EIAV (Equine Infectious Anemia Virus) Rev. The four leucine residues in the Rev activation domain and their putative homologues in the other lentivirus proteins are indicated by boxes. Figure 2 shows that functioning of Rev activation domain mutants in vivo is closely correlated with their ability to bind proteins of the invention (sometimes referred to herein as "Rab", for the Rev/Rex Activation Domain Binding protein) . The amino acid sequence shown corresponds to amino acids 69 to 90 of the HIV-l Rev protein. Missense mutants are designated by MIO, M15 to M25, M27 and M29. The amino acid substitutions in each missense mutant are indicated by a line above or below the altered amino acids, which is labeled with the name of the mutant. All mutants substituting two amino acid (MIO, M15-M18, M20-M22) have Asp-Leu in place of the wild-type sequence. Single amino acid mutations feature a substituted Asp (M19) or Ala (M27 and M29) residue while three amino acid mutations contain the inserted sequence Glu-Asp-Leu (M23 and M25) or Lys-Asp-Leu (M24) . Below the amino acid sequence is a listing of each Rev mutant fusion protein and the wild-type Rev (WT) (left column) . Rev activity (center column) for each mutant was compared with the WT and is indicated by ++ (>50% WT activity) , + (<50% WT activity) , or - (no activity) . Rab binding of Rev mutants was assessed by expressing each Rev mutant in the yeast indicator strain GGY1::171 as a GAL4 fusion protein and determining the level of 3-gal activity induced by co-expression of the VP16-Rab fusion (right column) . Binding of each Rev mutant to Rab is expressed relative to the wild-type GAL4/Rev fusion protein, which is set at 1.00.
Figure 3 (SEQ ID NO:l) shows the predicted primary amino acid sequence of the human Rab protein. Numbers running down the left side of the figure indicate amino acid position in the sequence. Phenylalanine residues, including the dipeptide motif "FG," as well as
runs of serines are indicated by boxes or underlining, respectively.
Figure 4 shows a Western blot analysis of Rab protein expression in several species. The blot shown in the right panel was performed in the presence of soluble Rab protein to assess signal specificity. The source of the protein extract in each lane is designated across the top of the panels. Left panel: lane 1—human, lane 2— mouse, lane 3—quail, lane 4—frog, and lane 5—fly. Right panel: lane 6—human, lane 7—frog, and lane 8—fly. The relative mobility of marker proteins of the indicated size, in kilodaltons, is given to the left of the panels.
Figure 5 shows Rab interaction with the activation domains of multiple Rev and Rex proteins in the mammalian nucleus using a two-hybrid analysis in COS cells. The vertical axis indicates the VP16 fusion proteins tested (HIV Tat, HIV Rev, HLTV Rex, VMV Rev, and EIAV Rev) . For each fusion protein, wild-type (WT) and missense mutants (MIO, M32, and ΔAD) were evaluated. The horizontal axis indicates fold trans-activation of CAT enzyme activity detected in a COS cell culture co-transfected with the indicated VP16 fusion protein expression plasmids above that observed with pBC12/GAL4-Rab and the pG5B/CAT indicator plasmids alone. Data shown are representative of multiple independent transfection experiments that were each internally controlled by co-transfection of a j3-gal expression plasmid.
Figure 6 shows that Rab binds to the Rev:RRE ribonucleoprotein complex in vivo . Panel A is a schematic representation of an in vivo mono-hybrid assay for Rab binding to the Rev:RRE ribonucleoprotein complex. The Tat/Rab fusion protein is indicated by boxes. The Rev protein is shown bound to the SLIIB RNA target sequence. Arrows indicate that Tat will transactivate the HIV-l LTR only if Rab recruits the TAT/Rab fusion protein to the SLIIB-bound Rev protein. Panel B shows activation of the SLIIB/CAT indicator construct in HeLa cells upon
co-transfection of the expression plasmids designated on the vertical axis. The horizontal axis indicates the fold transactivation of HIV-l LTR driven pSLIIB/CAT expression when cells were co-transfected with the indicated expression plasmids.
Figure 7 shows the effects of overexpressing Rab on HIV-l Rev and HTLV-I Rex function in COS cells. The vertical axis indicates that COS cells were transfected with the Rev-defective HIV-l provirus expression plasmid HIVΔREV, either alone or in the presence of the HIV-l Rev expression plasmid pcRev or the HTLV-I Rex expression plasmid pcRex. Co-transfections with pcrev and pcrex were done in the presence of an ~20-fold molar excess of a plasmid expressing either the full-length Rab protein or expressing CAT or /3-gal as negative control proteins. The horizontal axis shows the concentrations of HIV-l P24 capsid protein secreted into the culture medium. Data are representative of three independent experiments.
Detailed Description ofthe Invention Amino acid sequences disclosed herein are presented in the amino to carboxy direction, from left to right. The amino and carboxy groups are not presented in the sequence. Nucleotide sequences are presented herein by single strand only, in the 5' to 3' direction, from left to right. Nucleotides and amino acids are represented herein in the manner recommended by the IUPAC-IUB Biochemical Nomenclature Commission, or (for amino acids) by three letter code, in accordance with 37 CFR §1.822 and established usage. See, e.g., Patentin User Manual, 99-102 (Nov. 1990) (U.S. Patent and Trademark Office, Office of the Assistant Commissioner for Patents, Washington, D.C. 20231); U.S. Patent No. 4,871,670 to Hudson et al. at Col. 3 lines 20-43 (applicants specifically intend that the disclosure of this and all other patent references cited herein be incorporated herein by reference) .
A. DNAs ENCODING CELLULAR CO-FACTORS OF HIV REV AND HTLV REX.
DNAs of the present invention that code for cellular cofactors of HIV-l REV and/or HTLV-I Rex may be of any species of origin, including mouse, rat, rabbit, cat, and human, but preferably code for proteins of mammalian origin. Thus, DNA sequences which hybridize to a DNA encoding a protein having the sequence given in SEQ ID NO:2 (e.g., a DNA having the nucleotide sequence given in SEQ ID NO:l) and which code for expression of a cellular co-factor of HIV-l REV and/or HTLV-I Rex are also an aspect of this invention. Conditions which will permit other DNA sequences which code for expression of such a protein to hybridize to said sequence can be determined in a routine manner. For example, hybridization of such sequences may be carried out under conditions of reduced stringency or even stringent conditions (e.g., conditions represented by a wash stringency of 0.3 Molar NaCl, 0.03 M sodium citrate, 0.1% SDS at 60°C or even 70°C to DNA encoding the rat serotonin transporter disclosed herein in a standard in situ hybridization assay. See J. Sambrook et al.. Molecular Cloning, A Laboratory Manual , 2d Ed. (Cold Spring Harbor Laboratory 1989) . In general, sequences which code for a protein of the invention and hybridize to the DNA encoding the protein of the invention disclosed herein as SEQ ID NO:2 will be at least 75% homologous, 85% homologous, or even 95% homologous or more with the sequence of the DNA encoding the protein of SEQ ID NO:2. Further, DNA sequences which code for polypeptides coded of the protein given in SEQ ID NO:2, or sequences which hybridize to the DNA encoding the same and code for a protein of the invention, but which differ in codon sequence from these due to the degeneracy of the genetic code, are also an aspect of this invention. The degeneracy of the genetic code, which allows different nucleic acid sequences to code for the same protein or peptide, is well known in the literature. See, e.g., U.S. Patent No.
4,757,006 to Toole et al. at Col. 2, Table 1. DNAs of the invention may be naturally occuring or synthetic in origin. Those skilled in the art will appreciate that minor alterations can be introduced into the DNAs without departing from the instant invention. Proteins of the invention are, in general, cellular co-factors for HIV-l Rev and/or HTLV-l Rex post-transcriptional regulator proteins, and specifically bind to HIV-l Rev and/or HTLV-l Rex at the same binding site bound by the protein of SEQ ID NO:2, and may have the same biological activity thereof.
B. GENETIC; ENGINEERING TECHNIQUES The production of cloned genes, recombinant DNA, vectors, transformed host cells, proteins and protein fragments by genetic engineering is well known. See, e.g.. U.S. Patent No. 4,761,371 to Bell et al. at Col. 6 line 3 to Col. 9 line 65; U.S. Patent No. 4,877,729 to Clark et al. at Col. 4 line 38 to Col. 7 line 6; U.S. Patent No. 4,912,038 to Schilling at Col. 3 line 26 to Col. 14 line 12; and U.S. Patent No. 4,879,224 to Wallner at Col. 6 line 8 to Col. 8 line 59.
A vector is a replicable DNA construct. Vectors are used herein either to amplify DNA encoding proteins of the invention as given herein and/or to express DNA which encodes proteins of the invention as given herein. An expression vector is a replicable DNA construct in which a DNA sequence encoding a protein of the invention is operably linked to suitable control sequences capable of effecting the expression of the receptor in a suitable host. The need for such control sequences will vary depending upon the host selected and the transformation method chosen. Generally, control sequences include a transcriptional promoter, an optional operator sequence to control transcription, a sequence encoding suitable mRNA ribosomal binding sites, and sequences which control the termination of transcription and translation.
A plificatiQn vectors do not require expression control domains. All that is needed is the ability to replicate in a host, usually conferred by an origin of replication, and a selection gene to facilitate recognition of transformants.
Vectors comprise plasmids, viruses (e.g., adenovirus, cytomegalovirus) , phage, and integratable DNA fragments (i.e., fragments integratable into the host genome by recombination) . The vector replicates and functions independently of the host genome, or may, in some instances, integrate into the genome itself. Expression vectors should contain a promoter and RNA binding sites which are operably linked to the gene to be expressed and are operable in the host organism. DNA regions are operably linked or operably associated when they are functionally related to each other. For example, a promoter is operably linked to a coding sequence if it controls the transcription of the sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to permit translation.
Transformed host cells are cells which have been transformed or transfected with vectors containing a DNA sequence as disclosed herein constructed using recombinant DNA techniques. Transformed host cells ordinarily express the receptor, but host cells transformed for purposes of cloning or amplifying the receptor DNA do not need to express the receptor.
Suitable host cells include prokaryote, yeast or higher eukaryotic cells such as mammalian cells and insect cells. Cells derived from multicellular organisms are a particularly suitable host for recombinant protein synthesis, and mammalian cells are particularly preferred. Propagation of such cells in cell culture has become a routine procedure (Tissue Culture, Academic Press, Kruse and Patterson, editors (1973)). Examples of useful host cell lines are VERO and HeLa cells, Chinese hamster ovary
(CHO) cell lines, and WI138, BHK, COS-7, CV, and MDCK cell lines. Expression vectors for such cells- ordinarily include (if necessary) an origin of replication, a promoter located upstream from the DNA encoding the protein of the invention to be expressed and operatively associated therewith, along with a ribosome binding site, an RNA splice site (if intron-containing genomic DNA is used) , a polyadenylation site, and a transcriptional termination sequence. The transcriptional and translational control sequences in expression vectors to be used in transforming
'vertebrate cells are often provided by viral sources. For example, commonly used promoters are derived from polyoma,
Adenovirus 2, and Simian Virus 40 (SV40) . See, e.g.. U.S. Patent No. 4,599,308.
An origin of replication may be provided either by construction of the vector to include an exogenous origin, such as may be derived from SV 40 or other viral (e.g. Polyoma, Adenovirus, VSV, or BPV) source, or may be provided by the host cell chromosomal replication mechanism. If the vector is integrated into the host cell chromosome, the latter is often sufficient. Rather than using vectors which contain viral origins of replication, one can transform mammalian cells by the method of cotransformation with a selectable marker and the receptor DNA. Examples of suitable selectable markers are dihydrofolate reductase (DHFR) or thymidine kinase. This method is further described in U.S. Pat. No. 4,399,216.
Other methods suitable for adaptation to the synthesis of the proteins of the invention in recombinant vertebrate cell culture include those described in M-J. Gething et al.. Nature 293, 620 (1981); N. Mantei et al., Nature 281. 40; A. Levinson et al., EPO Application Nos. 117,060A and 117,058A. Host cells such as insect cells (e.g., cultured
Spodoptera frugiperda cells) and expression vectors such as the baculovirus expression vector (e.g., vectors derived
from Autographa californica MNPV, Trichoplusia ni MNPV, Rachiplusia ou MNPV, or Galleria ou MNPV) may be employed in carrying out the present invention, as described in U.S. Patents Nos. 4,745,051 and 4,879,236 to Smith et al. In general, a baculovirus expression vector comprises a baculovirus genome containing the gene to be expressed inserted into the polyhedrin gene at a position ranging from the polyhedrin transcriptional start signal to the ATG start site and under the transcriptional control of a baculovirus polyhedrin promoter.
Prokaryote host cells include gram negative or gram positive organisms, for example Escherichia coli (E. coli) or Bacilli. Higher eukaryotic cells include established cell lines of mammalian origin as described below. Exemplary host cells are E. coli W3110 (ATCC 27,325), E. COli B, E. COU X1776 (ATCC 31,537), E. coli 294 (ATCC 31,446). A broad variety of suitable prokaryotic and microbial vectors are available. E. coli is typically transformed using pBR322. Promoters most commonly used in recombinant microbial expression vectors include the beta- lactamase (penicillinase) and lactose promoter systems (Chang et al., Nature 275. 615 (1978); and Goeddel et al. , Nature 281, 544 (1979)), a tryptophan (trp) promoter system (Goeddel et al., Nucleic Acids Res. 8., 4057 (1980) and EPO App. Publ. No. 36,776) and the tac promoter (H. De Boer et al., Proc. Natl. Acad. Sci. USA 80. 21 (1983)). The promoter and Shine-Dalgarno sequence (for prokaryotic host expression) are operably linked to the DNA encoding the protein of the invention, i.e., they are positioned so as to promote transcription of messenger RNA from the DNA.
Eukaryotic microbes such as yeast cultures may also be transformed with vectors carrying the isolated DNA's disclosed herein. see, e.g.. U.S. Patent No. 4,745,057. Saccharomyces cerevisiae is the most commonly used among lower eukaryotic host microorganisms, although a number of other strains are commonly available. Yeast vectors may contain an origin of replication from the 2
micron yeast plasmid or an autonomously replicating sequence (ARS) , a promoter, DNA encoding the receptor as given herein, sequences for polyadenylation and transcription termination, and a selection gene. An exemplary plasmid is YRp7, (Stinchcomb et al., Nature 282. 39 (1979); Kingsman et al., Gene 2, 141 (1979); Tschemper et al., Gene 10. 157 (1980)). Suitable promoting sequences in yeast vectors include the promoters for metallothionein, 3-phosphoglycerate kinase (Hitzeman et al., J. Biol. Chem. 255, 2073 (1980) or other glycolytic enzymes (Hess et al., J. Adv. Enzyme Reg. 1_, 149 (1968); and Holland et al., Biochemistry 17, 4900 (1978)). Suitable vectors and promoters for use in yeast expression are further described in R. Hitzeman et al., EPO Publn. No. 73,657. Proteins of the present invention may be isolated and/or purified from natural sources or recombinant sources as given above in accordance with conventional techniques, optionally employing antibodies that specifically bind the proteins as given below.
C. ANTIBODIES AND OLIGONUCLEOTIDE PROBES.
A variety of detectable groups can be employed to label antibodies and probes as disclosed herein, and the term "labelled" is used herein to refer to the conjugating or covalent bonding of any suitable detectable group, including enzymes (e.g., horseradish peroxidase, β- glucuronidase, alkaline phosphatase, and β-D- galactosidase) , fluorescent labels (e.g., fluorescein, luciferase), and radiolabels (e.g., 1C, 13lI, 3H, 32P, and 35S) to the compound being labelled. Techniques for labelling various compounds, including proteins, peptides, and antibodies, are well known. See, e.g., Morrison, Methods in Enzymology 32b, 103 (1974); Syvanen et al., J . Biol . Chem . 284, 3762 (1973); Bolton and Hunter, Biochem . J . 133, 529 (1973). Oligonucleotide probes of the instant invention may be of any suitable length, depending on the specific
application thereof. For example, such probes may, in general, be 6, 8 or 12 nucleotides in length to 16, 20, or 30 nucleotides in length or more. Such probes are useful for identifying and making DNAs encoding proteins of the invention.
Antibodies which specifically bind to the proteins of the invention (i.e., antibodies which bind to a single antigenic site or epitope on the transporter) may be polyclonal or monoclonal in origin, but are preferably of monoclonal origin. Such antibodies are useful for the affinity purification of the proteins of the invention, and for the identification; and assay of the proteins in human tissue samples. The antibodies may be of any suitable species, such as rat, rabbit, or horse, but are generally of mammalian origin. The antibodies may be of any suitable immunoglobulin, such as IgG and IgM. Fragments of antibodies which retain the ability to specifically bind the proteins of the invention, such as F(ab')2/ F(ab'), and Fab fragments, are intended to be encompassed by the term "antibody" herein. The antibodies may be chimeric, as described by M. Walker et al., Molecular Immunol . 26, 403 (1989) . Antibodies may be immobilized on a solid support of the type used as a packing in an affinity chromatography column, such as sepharose, silica, or glass beads, in accordance with known techniques.
Monoclonal antibodies which bind to the proteins of the invention are made by culturing a cell or cell line capable of producing the antibody under conditions suitable for the production of the antibody (e.g., by maintaining the cell line in HAT media) , and then collecting the antibody from the culture (e.g., by precipitation, ion exchange chromatography, affinity chromatography, or the like) . The antibodies may be generated in a hybridoma cell line in the widely used procedure described by G. Kohler and C. Milstein, Nature 256, 495 (1975) , or may be generated with a recombinant vector in a suitable host cell such as Escherichia coli in the manner described by W. Huse
et al., Generation of a Large Combinatorial Library of the Immunoglobulin Repertoire in Phage Lambda, Science 246, 1275 (1989).
DNAs of the invention are useful for making proteins of the invention, as described above. Proteins of the invention have a variety of uses. For example, since proteins of the invention specifically bind to HIV-l Rev and HTLV-I Rex (and other Lentivirus and HTLV family Rev's), they are useful in combination therewith as a member of a specific binding pair. Specific binding pairs are useful for a variety of purposes, such as in various immunoassay systems.
In addition, DNAs and proteins of the invention are useful in methods of screening for compounds that affect the binding of a Lentivirus Rev protein or an HTLV family Rev protein with the cellular co-factor thereof. Such compounds are useful in treating or inhibiting the growth of the corresponding virus (e.g., HIV-l, HIV-2, HTLV-I, HTLV-II, bovine leukemia virus (or "BLV") . In general, such methods comprise providing together (e.g., in vitro in an aqueous solution; in vivo in a cell) a first protein and a second protein so that the first and second protein form a complex thereof, wherein the first protein is selected from the group consisting of Lentivirus Rev proteins and HTLV family Rev proteins (e.g., Rev proteins or Rev protein counterparts from HIV-l, HIV-2, HTLV-I, HTLV-II, bovine leukemia virus (or "BLV"), and equine infectious anemia virus) , and wherein the second protein is encoded by a DNA encoding a co-factor of the invention as given above. Then, a test compound is combined with the first and second protein (e.g., by adding it to the aqueous solution or adding to the media containing the cells) and the influence of the test compound on the formation of the complex is detected (for example, by detecting the association or dissociation constant of the complex, the stability of the complex, etc.). Compounds that adversely affect complex formation are candidate compounds for
treating or inhibiting the growth of the lentivirus or HTLV family virus. Where the assay is carried out in vivo , any suitable host cell may be employed as described above (e.g., yeast cells; insect cells). Also disclosed herein are recombinant cell useful for carrying out screening methods as given above. In general, such cells are made and used in accordance with the two-hybrid interaction trap method. See, e.g., Fields and Song, Nature 340, 245-246 (1989); Fridell et al., Virology 209, 347-357 (1995)) . Such cells comprise a first DNA that expresses a first fusion protein comprising a DNA binding domain and a first binding partner; a second DNA that expresses a second fusion protein comprising a second binding partner and a transcription activation domain; and a third DNA comprising a binding site operatively associated with a reporter gene, wherein the DNA binding domain specifically binds to the binding site, so that the reporter gene is expressed when the first and second fusion proteins bind to one another (and wherein the reporter gene is not significantly expressed when the first and second binding proteins are not bound to one another, so that the binding of the two may be detected through the activation of the reporter gene) . The first binding partner is either
(a) selected from the group consisting of Lentivirus Rev proteins and HTLV family Rev proteins, as given above, or
(b) a co-factor thereof as given above, and the second binding partner is the other of (a) or (b) . Any suitable host cell as given above may be employed, with yeast cells particularly preferred. Any suitable binding site and corresponding binding domain specifically bound thereby may be employed, with one example being a Gal4 binding site and a Gal4 binding domain. Any suitable transcription activation domain may be employed, including, but not limited to, the Gal4 transcription activation domain, the VP16 transcription activation domain, and the TAT transcription activation domain.
Also disclosed are methods of treating or inhibiting the growth (e.g., by interfering- with viral replication) of Lentiviruses or HTLV family viruses (e.g., HIV-l, HIV-2, HTLV-I, HTLV-II, bovine leukemia virus (or "BLV") . Such methods comprise introducing into a cell infected with the virus (e.g., by administering to an animal infected with the virus in need of such treatment) an amount of a compound that inhibits the formation of a complex between a lentivirus Rev protein or an HTLV family Rev protein with the cellular co-factor thereof effective to inhibit the growth of the virus. Such compounds are identified by the methods described above. Animals that may be treated by the method of the invention include both human and animal subjects (e.g., horse, bovine, or any other animal infected by the corresponding virus) . Administration of the compound may be carried out by any suitable means, including parenteral administration (e.g., intraveneous, intraperitoneal, intramuscular, and subcutaneous injection) , topical administration, and oral administration. Where desired, the compounds may optionally be encapsulated into a liposome to facilitate their transport into cells. Dosage of the active compound may be determined by routine experimentation in animal models in accordance with known techniques. The present invention is explained in greater detail in the following non-limiting examples.
EXAMPLES
EXPERIMENTAL PROCEDURES
Yeast two-hvbrid interaction experiments The yeast expression plasmid pGAL4-Rev encodes a fusion protein consisting of the GAL4 DNA binding domain linked to the full-length HIV-l Rev protein (R. Fridell et al.. Virology 209, 347-357 (1995)). A series of 14 missense mutations, in or near the HIV-l Rev activation domain, were generated in the PGAL4-Rev context by replacement of the wild-type Rev CDNA sequence (Xbal to
EcoRI) with polymerase chain reaction (PCR) generated CDNA fragments bearing the previously described mutations (M.
Malim et al., J . Virol . 65, 4248-4254 (1991)) listed in
Figure 2. The plasmid PGAL4-Rex was constructed by substituting amino acids 2 to 189 of HTLV-I Rex in place of the Rev sequence present in PGAL4-Rev.
The Y190 yeast indicator strain, the preparation of the oligo-dT primed, pVP16 based CEM CDNA library and the methodology used to screen for interacting proteins have been described (J. Harper et al., Cell 75, 805-816 (1993);
R. Fridell et al., Virology 209, 347-357 (1995)).
Following rescue, library plasmids expressing relevant VP16 fusion proteins were rescreened by transformation into the yeast indicator strain GGY1::171 (G. Gill and M. Ptashne, Cell 51, 121-126 (1987) together with the panel of GAL4 fusion protein expression plasmids described in Figure 2.
After 3 days of selection on culture plates, double transformants were transferred to selective liquid medium.
The next day, equal cell equivalents were analyzed for levels of β-gal expression (R. Fridell et al., Virology
209, 347-357 (1995)).
Where appropriate, molecular clones were sequenced using the dideoxy chain termination method and the sequenase version 2.0 sequencing kit (United States Biochemical) . The Rab CDNA insert was sequenced by the same method using both dGTP and dITP.
RNA and protein expression analysis
Northern analysis of poly(A)+ RNA derived from the human cell lines CEM and HeLa was performed as previously described (M. Malim et al., Cell 60, 675-683 (1990)) using a random primed, α32P-DCTP labeled probe prepared from an internal 853 bp BamHI to Accl Rab CDNA fragment.
The pGEX-4T plasmid was used to express a fusion protein consisting of GST linked to amino acids 101 to 562 of the Rab open reading frame (ORF) (Figure 3) in the BL21
(Ion-) strain of E . coli . Expression was induced by addition of 0.1 Mm IPTG and the resultant GST-Rab fusion
protein extracted, purified and used to immunize rabbits. A second fusion protein, consisting of MBP linked to amino acids 101 to 326 of Rab, was expressed in E. coli using the pMAL-C2 expression plasmid, purified using a commercial kit (MBP protein fusion and purification system, New England Biolabs) and then coupled to cyanogen bromide-activated agarose beads (Pierce) . The agarose coupled MBP-Rab fusion protein was used to affinity purify Rab-specific antibodies from the serum of the GST-Rab injected rabbits using buffers and procedures detailed in the Amino Link Immobilization Kit (Pierce) . The resultant Rab-specific rabbit antiserum was! concentrated and dialyzed against phosphate buffered saline (PBS) prior to use.
The tissue culture cell lines used to make protein extracts were HeLa (human) , C127 (mouse) , QC13 (quail) and Schneider 2 (Drosophila) . Frog protein extracts were prepared from Xenopus oocytes. Cells were washed with PBS and then resuspended in 100 11 of PBS. After addition of 100 11 of 2X Laemmli gel loading buffer containing 2-mercaptoethanol, the samples were sonicated, boiled and centrifuged to remove debris. Soluble proteins were separated by 12% SDS-PAGE, transferred to a nitrocellulose filter and then incubated with a 1:20,000 dilution of the affinity purified rabbit Rab antiserum. After vigorous washing, bound antibodies were detected using a horseradish peroxidase-conjugated goat anti-rabbit antiserum and enhanced chemiluminescence (Amersham) .
Immunofluorescence analysis was performed essentially as previously described (M. Malim et al.. Cell 58, 205-214 (1989A)) . CV-1 cells were fixed and permeabilized prior to incubation with a 1:1000 dilution of the rabbit anti-Rab antiserum. After extensive washing, the fixed cells were incubated with a 1:100 dilution of an affinity purified,rhodamine-conjugated goat anti-rabbit antiserum (Boehringer Mannheim) and examined using a Zeiss Axioskop immunofluorescence microscope. GST affinity chromatography
Both the GST/Rev and the GST/M10 fusion proteins contain the full-length Rev ORF, in the former case containing an intact activation domain and in the latter the defective, MIO mutant form of the activation domain (M. Malim et al., Cell 58, 205-214 (1989A)). In addition, both Rev fusion proteins also contain the previously described M6 missense mutation of the Rev RNA binding domain (M. Malim et al., Cell 58, 205-214 (1989A) ) . This mutation, which affects amino acids 41 to 44 of Rev, is located well outside the Rev activation domain (Figure 1) . However, inclusion of this mutation markedly increases the yield of full-length Rev protein upon expression in E . coli (data not shown) . Bacterially expressed GST, GST/Rev and GST/M10 proteins were purified as previously described (M. Malim et al., J . Virol . 65, 4248-4254 (1991)). Equivalent levels were then coupled to cyanogen bromide-activated agarose beads (Pierce) and used to prepare columns with a 0.5 ml bed volume.
A DNA fragment encoding the Rab ORF (amino acids 1 to 562, Ncol to Xhol) was cloned into the pGEM3ZF(+) expression plasmid and 35S-labeled Rab protein prepared in a 200 11 rabbit reticulocyte lysate coupled transcription-translation reaction (Promega) using T7 RNA polymerase and 35S-methionine/35S-cysteine. The lysate was then diluted 1:5 with chromatography buffer (CB) (10 Mm Hepes Ph 7.5, 0.1 M NaCl, ImM EDTA, 1 Mm DTT, 10% glycerol, 2mg/ml BSA, 0.5% NP40 and 0.25 Mm PMSF) and equal amounts loaded onto the immobilized GST, GST/Rev and GST/M10 affinity columns. These were then washed with 10 column volumes of CB before bound proteins were eluted with 4 column volumes of 0.1 M glycine, Ph 2.8. Eluted proteins were concentrated and dialyzed against CB prior to analysis by 12% SDS/PAGE. Mammalian two-hybrid assay All mammalian expression plasmids were constructed in the context of plasmid PBC12/CMV (M. Malim et al., J. Virol . 65, 4248-4254 (1991)) with the exception of the
pG5B/CAT indicator construct, which has been described (Bogerd and Greene, 1993) . PBC12/GAL4-Rab expresses the GAL4 DNA binding domain (amino acids 1-147) linked to the full-length Rab CDNA sequence. pRev/VP16 and pRex/VP16 have been described (H. Bogerd and W. Greene, J. Virol . 67, 2496-2502 (1993)) and express full-length HIV-l Rev and HTLV-l Rex linked to the VP16 transcription activation domain. Variants of Rev (MIO, M32) and Rex (DAD) bearing defective activation domains were substituted into this same expression plasmid by excision of the wild-type rev gene by cleavage with Sad and Bglll and replacement with a PCR generated Sad; to Bell DNA fragment encoding the appropriate Rev or Rex mutants. Similarly, PCR-generated Sad to Bell DNA fragments encoding wild-type and mutant (DAD) forms of VMV Rev and EIAV Rev were also substituted in place of HIV-l Rev to generate the relevant VP16 fusion protein expression plasmids. All Rev and Rex activation domain mutants have been previously described and characterized (T. Hope et al., J. Virol . 65, 6001-6007 (1991); M. Malim et al., J. Virol . 65, 4248-4254 (1991); L. Tiley et al., J. Virol . 65, 3877-3881 (1991); R. Fridell et al., Rev. J. Virol . 67, 7317-7323 (1993)). The introduced mutations are: HIV Rev M10, LG (78,79) to DL; HIV Rev M32, L 78, 81 and 83 all to A; RexDAD, LSLD (90-93) to GGGG; VMV RevDAD, LE (114, 115) to DL; EIAV RevDAD, L49 and 154 to A.
COS cell cultures (100 mm) were transfected with 1 lg
PG5B/CAT, 500 ng PBC12/GAL4-Rab, 5 lg of a VP16 fusion protein expression plasmid and 500 ng of the internal control indicator plasmid PBC12/CMV/0gal using the DEAE-dextran procedure. The parental PBC12/CMV expression plasmid served as a negative control. At -48 hrs post-transfection, cell extracts were prepared and CAT and 0-gal expression quantified (R. Fridell et al., Virology 209, 347-357 (1995)). Mammalian mono-hybrid assay
The PSLIIB indicator construct and plasmids expressing wild-type (pcrev) and mutant (pM10, pM32) forms of Rev have
been described (M. Malim et al., J . Virol . 65, 4248-4254 (1991); L. Tiley et al., Genes Dev . 6, 2077-2087 (1992A) ) . The pcTat/Rab plasmid expresses a fusion protein consisting of the full-length Tat protein linked to the first amino acid of the Rab ORF indicated in Figure 3. HeLa cultures (35 mm) were transfected with 1 lg of the PSLIIB/CAT reporter plasmid, 0.5 lg each of the Tat/Rab fusion protein and Rev protein expression plasmid and 1 lg of carrier DNA using the calcium phosphate procedure. The parental PBC12/CMV expression plasmid was used as a negative control. Cultures were harvested at -48 hrs after transfection and CAT activity quantified (R. Fridell et al., Virology 209, 347-357 (1995)). HIV-l virus rescue HIV-l provirus rescue assays were performed in COS cell cultures essentially as described (M. Malim et al., J. Virol . 65, 4248-4254 (1991)). The Rev- provirus expression plasmid pHIVDRev and the pcrev, pcrex, PBC12/CMV/SEAP, PBC12/CMV/CAT and PBC12/CMV//3gal expression plasmids have been described (L. Rimsky et al., Nature 335, 738-740 (1988); M. Malim et al., J. Virol . 65, 4248-4254 (1991)). The Rab expression plasmid PBC12/CMV/Rab contains the 562 amino acid Rab ORF indicated in Figure 3 (Ncol to Xhol) cloned into the expression plasmid PBC12/CMV. Transfections were performed as described in Figure 7. Secreted p24 levels were quantified using a commercial ELISA kit (DuPont) while SEAP activity was determined as described (J. Berger et al., Gene 66, 1-10 (1988)). RESULTS Using the yeast two-hybrid protein interaction trap (S. Fields and O-K Song, Nature 340, 245-246 (1989); R. Fridell et al., Virology 209, 347-357 (1995)) we sought to identify a human CDNA that encoded a protein able to interact with not only HIV-l Rev, but also HTLV-I Rex, in an activation domain dependent manner. The screened library consisted of the VP16 transcription activation domain fused to CDNA sequences derived from the human CEM
T-cell line. This screen led to the identification of 3 clones that encoded VP16 fusion proteins that specifically interacted with the GAL4/Rev bait protein, and 10 clones expressing VP16 fusion proteins that specifically bound GAL4/Rex. None of the three Rev specific clones were able to interact with GAL4/Rex while only one of the ten Rex specific clones could bind to GAL4/Rev. Further experiments demonstrated that this interaction was entirely blocked by the introduction of point mutations into the activation domains of either Rev or Rex (data not shown) . Based on this initial observation, we designated this human protein as the Rev/Rex Activation Domain Binding or Rab protein. Rab binds the Rev activation domain specifically To more fully define the protein sequence specificity of the interaction of Rev with Rab, we took advantage of a previously described (M. Malim et al., J. Virol . 65, 4248-4254 (1991)) set of scanning missense mutations that precisely map the HIV-l Rev activation domain (Figure 2) . These mutations each affect from one to three amino acids either within or adjacent to the activation domain and give rise to wild-type, intermediate or negative phenotypes when assayed for Rev function in mammalian cells. This complete set of Rev mutants was cloned into a yeast GAL4 fusion protein expression plasmid and individually tested for their ability to interact with the VP16/Rab fusion protein in the yeast cell nucleus, as assessed by the level of activation of an integrated lacZ indicator gene. Each of these mutant GAL4/Rev fusion proteins was equivalently stable in yeast, as measured by western blot analysis (data not shown) .
As shown in Figure 2, five Rev mutants that entirely lack effector domain function, including two mutants (M27 and M29) that bear only single amino acid changes in Rev, all proved entirely unable to interact with the Rab fusion protein. In contrast, all of the Rev mutants previously shown (M. Malim et al., J. Virol . 65, 4248-4254 (1991)) to
exhibit substantially wild-type Rev activity in vivo also induced substantial levels of j3-gal activity, ranging from a minimum of one quarter to a maximum of -3 times the level seen with wild-type Rev. Strikingly, the three Rev mutants (M18, M20 and M24) previously shown (M. Malim et al., J. Virol . 65, 4248-4254 (1991)) to exhibit only partial (<50%) Rev function in vivo also proved to be significantly attenuated in their ability to interact with Rab (<16% of wild-type) . It is therefore apparent that there is a very close concordance between the ability of Rev activation domain mutants to function in the recruitment of the Rev co-factor in mammaliani cells and the ability of these same Rev activation domain mutants to interact with the Rab protein in the yeast cell nucleus. Sequence of the Rab gene product
The Rab CDNA clone contains a 2584 base pair insert flanked 3' by a stretch of A residues, consistent with priming at the mRNA poly(A) tail. Starting at the first in frame methionine residue, this CDNA contains an open reading frame (ORF) of 562 amino acids which would be predicted to encode a protein of -58 kD (Figure 3) . Computer analysis of available sequence data bases failed to identify any proteins displaying significant homology to the predicted Rab ORF. Similarly, we were also unable to identify any nucleic acid sequences that displayed significant homology to Rab except for two short, unidentified "expressed sequence tags." Rab is therefore a novel human gene. An unusual aspect of the Rab protein sequence is the high concentration of phenylalanine residues, including ten in the form of the dipeptide motif "FG," as well as several runs of serine residues, found concentrated towards the carboxyl-terminus of Rab. The potential significance of these is discussed below. Analysis of Rab MRNA and protein expression The Rab CDNA insert was used to probe a northern blot of MRNA derived from the human T-cell line CEM (the origin of the CDNA clone) and the human cell line HeLa. In both
cases, a prominent band of -2.8 kilobases (kb) was observed (data not shown) . Both cell lines also expressed a hybridizing band of -4.6 kb, although this was faint in the CEM cells. Similarly, northern analysis of Rab MRNA expression in a range of human tissues also identified a major 2.8 kb and a minor 4.6 kb Rab transcript in all tissues examined (data not shown) . Based on this analysis, it therefore appeared that this cloned Rab CDNA is close to the full-length of the major species of Rab MRNA expressed in human cell lines and tissues.
An interesting characteristic of both Rev and Rex is that these retroviral regulatory proteins are functional in a wide range of animal cells (M. Ivey-Hoyle and M. Rosenberg, Mol . Cell . Biol . 10, 6152-6159 (1990); M. Malim and B, Cullen, Cell 65, 241-248 (1991); U. Fischer et al., EMBO J. 13, 4105-4112 (1994)). Therefore, it is predicted that the cellular co-factor for Rev and Rex should be conserved across species boundaries. To test this hypothesis, we performed a western analysis on cellular protein extracts of mammalian, avian, amphibian and invertebrate origin using an affinity purified polyclonal rabbit anti-Rab antiserum (Figure 4) . In humans, this procedure identified a predominant band of ~60 Kd (lane 1) and bands of identical mobility were also detected in cellular extracts of mouse, quail and frog origin (lanes 2 to 4) . In the fruit fly, a slightly smaller cross-reactive band was observed (lane 5) . These bands all represented authentic Rab related proteins, in that reactivity could be specifically blocked by addition of a purified, soluble Maltose Binding Protein (MBP)-Rab fusion 3 protein to the western blot during the antibody binding step (lanes 6 to 8) .
Indirect immunofluorescence analysis of the localization of endogenous Rab protein in the primate cell line CVI, using the Rab-specific rabbit antiserum, revealed that Rab is concentrated in the nucleus and nucleolus of these cells in a pattern that is similar to that observed
previously with HIV-l Rev and HTLV-I Rex (H. Siomi et al., Cell 55, 197-209 (1988); B. Felber et al., Proc . Natl . Acad . Sci . USA 86, 1495-1499 (1989)). No specific signal was detected when a matched preimmune rabbit serum was used (data not shown) . While Western analysis of nuclear and cytopasmic protein fractions derived from HeLa cells further confirmed the nuclear concentration of Rab, this latter analysis also detected significant levels of cytoplasmic Rab protein (data not shown) . It therefore appears that Rab is primarily, but not exclusively, nuclear in localization.
Although we have been unable to express the full-length Rab protein in a soluble form in bacteria, we are able to express limited amounts of soluble Rab protein in vitro by translation of Rab MRNA in a rabbit reticulocyte lysate. Translation of the Rab ORF indicated in Figure 3 gave rise to a protein that migrated on polyacrylamide gels with the same relative molecular mass as the major Rab-antibody reactive species detected in human cells in vivo (data not shown) . Therefore, the Rab ORF shown in Figure 3 is full-length.
We next used 3SS-labeled full-length Rab protein generated by in vitro translation to ask whether Rab would bind the Rev activation domain specifically in vitro . For this purpose, we loaded equal levels of the 35S-labeled Rab protein onto columns containing either Glutathione-S-transferase (GST) , or fusion proteins consisting of GST linked to Rev (GST/Rev) or the MIO activation domain Rev mutant (GST/M10) , conjugated to agarose beads. After extensive washing, remaining bound proteins were eluted and subjected to SDS-PAGE. The labeled Rab protein was not retained by the GST or GST/M10 columns but did bind to the GST/Rev column (data not shown) . These data therefore demonstrate that the specific interaction of Rab with the Rev activation domain can be recapitulated in vitro and also strongly suggest that this interaction is direct.
Rab binds the activation domains of diverse Rev and Rex proteins in vivo
We next wished to ask whether the interaction of Rab with Rev could be demonstrated in mammalian cells and whether other Rev and Rex proteins, particularly including the highly divergent EIAV Rev (Figure 1) , would also bind to Rab specifically. To address these issues, we performed a mammalian version of the two-hybrid analysis using transient transfection of the primate cell line COS (Figure 5) . The pG5B/CAT indicator plasmid used in this analysis contains 5 GAL4 sites located 5' to a minimal promoter element directing expression of the chloramphenicol acetyl transferase (CAT) gene (H. Bogerd and W. Greene, J. Virol . 67, 2496-2502 (1993)). In this assay, the identities of the bait and prey proteins were reversed such that the Rab protein was now expressed as a GAL4/Rab fusion while the full-length HIV-l, VMV and EIAV Rev proteins and the HTLV-I Rex protein were each expressed as VP16 fusion proteins. For each Rev and Rex protein, plasmids bearing a deleterious missense mutation in the mutationally defined activation domain were also generated (Figure 5) . All VP16 fusion proteins were expressed at comparable levels after transfection into COS cells (data not shown) .
Transient transfection of COS cell cultures with the pG5B/CAT indicator plasmid, along with the PBC12/GAL4-Rab expression plasmid, gave only a low level of CAT enzyme activity. While this level was not significantly enhanced by co-expression of a Tat/VP16 fusion protein, a similar Rev/VP16 fusion protein produced a marked, -18 fold induction in the level of CAT activity (Figure 5) . In contrast, two Rev/VP16 fusion proteins bearing either the M10 or the M32 missense mutation of the Rev activation domain (M. Malim and B, Cullen, Cell 65, 241-248 (199Ϊ)) each failed to enhance expression of the CAT indicator gene.
Similar evidence for an in vivo interaction with Rab was also obtained using VP16 fusion proteins containing
either the HTLV-I Rex protein or the VMV Rev or more highly divergent EIAV Rev protein (Figure 5) . In • each case, introduction of a missense mutation (DAD) known to block activation domain function markedly inhibited the in vivo interaction with GAL4/Rab. We therefore conclude that Rab can interact with a range of highly divergent Rev and Rex proteins in the mammalian cell nucleus and that the integrity of the mutationally defined activation motifs is, in each case, critical to this interaction. It should be noted that the endogenously expressed Rab protein (Figure 4) would be predicted to act as an efficient competitive inhibitor of the in vivo interaction between GAL4/Rab and these various Rev or Rex fusion proteins. It is therefore likely that the data presented in Figure 5 significantly underestimate the efficiency of this in vivo interaction. Rab binds to the Rev activation domain when Rev is bound to the RRE
An important predicted property of the cellular co-factor for HIV-l Rev is that it should be able to efficiently interact with the Rev activation domain when Rev is assembled onto the RRE RNA target. To test whether Rab satisfied this prediction, we devised an in vivo genetic assay that uses the HIV-l Tat RNA sequence specific transcriptional activator to provide an indirect measure of the assembly of a protein complex onto an RNA target sequence (Figure 6A) . Normally, Tat activates gene expression from the HIV-l LTR after binding a cis-acting RNA target sequence, termed TAR, that forms the first 59 nt of all HIV-l transcripts (B. Cullen, Microbiol . Rev. 56, 375-394 (1992)). We have previously shown that if the TAR RNA stem-loop is replaced with the RRE-derived stem-loop IIB (SLIIB) minimal RNA target sequence for HIV-l Rev, then trans-activation of the HIV-l LTR can only be observed if Tat is fused to an HIV-l Rev protein bearing an intact RNA binding domain (L. Tiley et al., Proc . Natl . Acad . Sci . USA 89, 758-762 (1992B) ) .
The assay delinea.ted in Fig. 6A takes this approach one step further. Here, the TAR element is again replaced by RRE SLIIB in an indicator construct in which the HIV-l LTR is linked to CAT (PSLIIB/CAT) . However, in this case Rev is expressed in its wild-type form while Tat is expressed as a Rab fusion protein. Only if Rab can induce the efficient recruitment of this Tat/Rab fusion protein to the RRE-bound Rev protein will Tat be brought to the HIV-l LTR promoter element and, hence, be able to activate HIV-l LTR driven CAT expression (Figure 6A) .
Results obtained using this mono-hybrid assay on the homologous RRE RNA target are presented in Figure 6B. As can be readily seen, neither HIV-l Rev alone, nor the Tat/Rab fusion protein alone, was able to induce a level of CAT expression that exceeded the basal level seen in HeLa cells transfected with the PSLIIB/CAT indicator construct alone. However, the simultaneous expression of both Rev and Tat/Rab in the nuclei of these human cells induced a readily detectable -19 fold induction in CAT protein expression. Assembly of Rab onto the Rev:RRE ribonucleoprotein complex was, as predicted, fully dependent on the integrity of the Rev activation domain as shown by the lack of activity of the Rev MIO and M32 activation domain mutants in this assay (Figure 6B) . Importantly, these two mutations have no effect on the ability of Rev to bind the RRE in vivo (L. Tiley et al., Genes Dev. 6, 2077-2087 (1992A) ) .
Overexpression of Rab can enhance Rev and Rex function in vivo The data presented in Figure 6 show that Rab can bind to the Rev activation domain when Rev is part of the Rev:RRE ribonucleoprotein complex. Therefore, Rab can effectively compete with the endogenous Rev co-factor for binding to the Rev activation domain. This finding suggested that an in vivo assay for Rev function performed under conditions where the level of Rev expression was sub-optimal should be able to uncover a phenotype for Rab
even in the face of a significant endogenous level of Rab expression. In particular, we hypothesized that', if Rab is not the authentic Rev co-factor, then overexpression of Rab should competitively inhibit Rev function. In contrast, if Rab is the authentic Rev co-factor, then overexpression of Rab might promote the recruitment of Rab to the RRE RNA target by sub-optimal levels of Rev protein, leading to an increase in Rev activity. Similarly, the HTLV-I Rex protein, which can also act via the HIV-l RRE element (L. Rimsky et al., Nature 335, 738-740 (1988)), might also display enhanced activity in the presence of Rab if this protein is indeed the authentic Rev/Rex co-factor.
To test this hypothesis, we first introduced a premature termination codon into the rev gene present in a full-length HIV-l proviral clone. We then transfected the resultant HIVDRev provirus into COS cells along with different levels of the HIV-l Rev expression plasmid pcrev or the HTLV-I Rex expression plasmid pcrex (L. Rimsky et al., Nature 335, 738-740 (1988)) and determined the level of each of these plasmids that induced -10% of the level of P24 capsid protein production that was seen with a saturating level of Rev. We next transfected COS cells with the HIVDRev proviral clone, along with this limiting level of either pcrev or pcrex, together with an -20 fold molar excess of a PBC12/CMV based plasmid (M. Malim et al., J. Virol . 65, 4248-4254 (1991)) expressing the full-length Rab protein or expressing either CAT or β-gal as negative control proteins. Based on immunoprecipitation analysis, we estimate that cells transfected with the Rab expression plasmid express 5 to 10 fold higher levels of Rab than normal (data not shown) . Transfection efficiency was monitored by co-transfecting a constant level of an expression plasmid encoding the secreted alkaline phosphatase (SEAP) indicator gene (J. Berger et al.. Gene 66, 1-10 (1988)). After -88 hours, the supernatant media were removed from the transfected cultures and levels of P24 capsid protein and SEAP activity determined. As shown
in Figure 7, we consistently observed a 3-6 fold increase in the level of HIV-l P24 capsid protein secreted by the culture overexpressing the Rab protein when compared to the cultures expressing either CAT or β-gal. In contrast, Rab overexpression had no significant effect on the level of supernatant SEAP activity. These data therefore strongly indicate that Rab is an important human cellular co-factor for HIV-l Rev and HTLV-I Rex.
The foregoing examples are intended to be illustrative of the present invention, and are not to be taken as limiting thereof. The invention is defined by the following claims, with equivalents of the claims to be included therein.
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: Cullen, Bryan R. Bogerd, Hal P.
(ii) TITLE OF INVENTION: Cellular Co-Factor for HIV Rev and HTLV Rex
(iii) NUMBER OF SEQUENCES: 2
(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Bell, Seltzer, Park and Gibson
(B) STREET: Post office Drawer 34009
(C) CITY: Charlotte
(D) STATE: North Carolina
(E) COUNTRY: USA
(F) ZIP: 28234
(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: Patentin Release #1.0, Version #1.30
(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(C) CLASSIFICATION:
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Sibley, Kenneth D.
(B) REGISTRATION NUMBER: 31,665
(C) REFERENCE/DOCKET NUMBER: 5405-118
(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 919-420-2200
(B) TELEFAX: 919-881-3175
(C) TELEX: 575102
(2) INFORMATION FOR SEQ ID NO:l:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2583 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 244..1929
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l:
TGGCGGCGGC GGCGGCGGTT GTCCCGGCTG TGCCGGTTGG TGTGGCCCGT CAGCCCGCGT 60
ACCACAGCGC CCGGGCCGCG TCGAGCCCAG TACAGCCAAG CCGCTGCGGC CGGGTCCGGC 120
GCGGGCGGCG CGCGCAGACG GAGGGCGGCG GCCGCGGCCA GGGCGGCCCG TGGGACCGCG 180
GGCCCCCGGC GCAGCGCTGC CCGGCTCCCG GCCCTGCCGG CCTCCTCCCT TGGCGCCGCG 240
GCC ATG GCG GCC AGC GCG AAG CGG AAG CAG GAG GAG AAG CAC CTG AAG 288 Met Ala Ala Ser Ala Lys ArgtLys Gin Glu Glu Lys His Leu Lys 1 5 10 15
ATG CTG CGG GAC ATG ACC GGC CTC CCG CAC AAC CGA AAG TGC TTC GAC 336 Met Leu Arg Asp Met Thr Gly Leu Pro His Asn Arg Lys Cys Phe Asp 20 25 30
TGC GAC CAG CGC GGC CCC ACC TAC GTT AAC ATG ACG GTC GGC TCC TTC 384 Cys Asp Gin Arg Gly Pro Thr Tyr Val Asn Met Thr Val Gly Ser Phe 35 40 45
GTG TGT ACC TCC TGC TCC GGC AGC CTG CGA GGA TTA AAT CCA CCA CAC 432 Val Cys Thr Ser Cys Ser Gly Ser Leu Arg Gly Leu Asn Pro Pro His 50 55 60
AGG GTG AAA TCT ATC TCC ATG ACA ACA TTC ACA CAA CAG GAA ATT GAA 480 Arg Val Lys Ser Ile Ser Met Thr Thr Phe Thr Gin Gin Glu Ile Glu 65 70 75
TTC TTA CAA AAA CAT GGA AAT GAA JGTC TGT AAA CAG ATT TGG CTA GGA 528 Phe Leu Gin Lys His Gly Asn Glu Val Cys Lys Gin Ile Trp Leu Gly 80 85 90 95
TTA TTT GAT GAT AGA TCT TCA GCA ATT CCA GAC TTC AGG GAT CCA CAA 576 Leu Phe Asp Asp Arg Ser Ser Ala Ile Pro Asp Phe Arg Asp Pro Gin 100 105 110
AAA GTG AAA GAG TTT CTA CAA GAA AAG TAT GAA AAG AAA AGA TGG TAT 624 Lys Val Lys Glu Phe Leu Gin Glu Lys Tyr Glu Lys Lys Arg Trp Tyr 115 120 125
GTC CCG CCA GAA CAA GCC AAA GTC GTG GCA TCA GTT CAT GCA TCT ATT 672 Val Pro Pro Glu Gin Ala Lys Val Val Ala Ser Val His Ala Ser Ile 130 135 140
TCA GGG TCC TCT GCC AGT AGC ACA AGC AGC ACA CCT GAG GTC AAA CCA 720 Ser Gly Ser Ser Ala Ser Ser Thr Ser Ser Thr Pro Glu Val Lys Pro 145 150 155
CTG AAA TCT CTT TTA GGG GAT TCT GCA CCA ACA CTG CAC TTA AAT AAG 768 Leu Lys Ser Leu Leu Gly Asp Ser Ala Pro Thr Leu His Leu Asn Lys 160 165 170 175
GGC ACA CCT AGT CAG TCC CCA GTT GTA GGT CGT TCT CAA GGG CAG CAG 816 Gly Thr Pro Ser Gin Ser Pro Val al Gly Arg Ser Gin Gly Gin Gin 180 185 190
CAG GAG AAG AAG CAA TTT GAC CTT TTA AGT GAT CTC GGC TCA GAC ATC 864 Gin Glu Lys Lys Gin Phe Asp Leu Leu Ser Asp Leu Gly Ser Asp Ile 195 200 205
TTT GCT GCT CCA GCT CCT CAG TCA ACA GCT ACA GCC AAT TTT GCT AAC 912 Phe Ala Ala Pro Ala Pro Gin Ser Thr Ala Thr Ala Asn Phe Ala Asn 210 215 220
TTT GCA CAT TTC AAC AGT CAT GCA GCT CAG AAT TCT GCA AAT GCA GAT 960 Phe Ala His Phe Asn Ser His Ala Ala Gin Asn Ser Ala Asn Ala Asp 225 230 235
TTT GCA AAC TTT GAT GCA TTT GGA CAG TCT AGT GGT TCG AGT AAT TTT 1008 Phe Ala Asn Phe Asp Ala Phe Gly Gin Ser Ser Gly Ser Ser Asn Phe 240 245 250 255
GGA GGT TTC CCC ACA GCA AGT CAC TCT CCT TTT CAG CCC CAA ACT ACA 1056 Gly Gly Phe Pro Thr Ala Ser His Ser Pro Phe Gin Pro Gin Thr Thr 260 265 270
GGT GGA AGT GCT GCA TCA GTA AAT GCT AAT TTT GCT CAT TTT GAT AAC 1104 Gly Gly Ser Ala Ala Ser Val Asn Ala Asn Phe Ala His Phe Asp Asn 275 280 285
TTC CCC AAA TCC TCC AGT GCT GAT TTT GGA ACC TTC AAT ACT TCC CAG 1152 Phe Pro Lys Ser Ser Ser Ala Asp Phe Gly Thr Phe Asn Thr Ser Gin 290 295 300
AGT CAT CAA ACA GCA TCA GCT GTT AGT AAA GTT TCA ACG AAC AAA GCT 1200 Ser His Gin Thr Ala Ser Ala Val Ser Lys Val Ser Thr Asn Lys Ala 305 310 315
GGT TTA CAG ACT GCA GAC AAA TAT GCA GCA CTT GCT AAT TTA GAC AAT 1248 Gly Leu Gin Thr Ala Asp Lys Tyr Ala Ala Leu Ala Asn Leu Asp Asn 320 325 330 335
ATC TTC AGT GCC GGG CAA GGT GGT GAT CAG GGA AGT GGC TTT GGG ACC 1296 Ile Phe Ser Ala Gly Gin Gly Gly Asp Gin Gly Ser Gly Phe Gly Thr 340 345 350
ACA GGT AAA GCT CCT GTT GGT TCT GTG GTT TCA GTT CCC AGT CAG TCA 1344 Thr Gly Lys Ala Pro Val Gly Ser Val Val Ser Val Pro Ser Gin Ser 355 360 365
AGT GCA TCT TCA GAC AAG TAT GCA GCT CTG GCA GAA CTA GAC AGC GTT 1392 Ser Ala Ser Ser Asp Lys Tyr Ala Ala Leu Ala Glu Leu Asp Ser Val 370 375 380
TTC AGT TCT GCA GCC ACC TCC AGT AAT GCG TAT ACT TCC ACA AGT AAT 1440 Phe Ser Ser Ala Ala Thr Ser Ser Asn Ala Tyr Thr Ser Thr Ser Asn 385 390 395
GCT AGC AGC AAT GTT TTT GGA ACA GTG CCA GTG GTT GCT TCT GCA CAG 1488 Ala Ser Ser Asn Val Phe Gly Thr Val Pro Val Val Ala Ser Ala Gin 400 405 410 415
ACA CAG CCT GCT TCA TCA AGT GTG CCT GCT CCA TTT GGA GCT ACG CCT 1536 Thr Gin Pro Ala Ser Ser Ser Val Pro Ala Pro Phe Gly Ala Thr Pro 420 425 430
TCC ACA AAT CCA TTT GTT GCT GCT GCT GGT CCT TCT GTG GCA TCT TCT 1584 Ser Thr Asn Pro Phe Val Ala Ala Ala Gly Pro Ser Val Ala Ser Ser 435 440 445
ACA AAC CCA TTT CAG ACC AAT GCC AGA GGA GCA ACA GCG GCA ACC TTT 1632 Thr Asn Pro Phe Gin Thr Asn Ala Arg Gly Ala Thr Ala Ala Thr Phe 450 455 460
GGC ACT GCA TCC ATG AGC ATG CCC ACG GGA TTC GGC ACT CCT GCT CCC 1680 Gly Thr Ala Ser Met Ser Met Pro Thr Gly Phe Gly Thr Pro Ala Pro 465 470 475
TAC AGT CTT CCC ACC AGC TTT AGT GGC AGC TTT CAG CAG CCT GCC TTT 1728 Tyr Ser Leu Pro Thr Ser Phe Ser Gly Ser Phe Gin Gin Pro Ala Phe 480 485 490 495
CCA GCC CAA GCA GCT TTC CCT CAA CAG ACA GCT TTT TCT CAA CAG CCC 1776 Pro Ala Gin Ala Ala Phe Pro Gin Gin Thr Ala Phe Ser Gin Gin Pro 500 505 510
AAT GGT GCA GGT TTT GCA GCA TTT GGA CAA ACA AAG CCA GTA GTA ACC 1824 Asn Gly Ala Gly Phe Ala Ala Phe Gly Gin Thr Lys Pro Val Val Thr 515 520 525
CCT TTT GGT CAA GTT GCA GCT GCT iGGA GTA TCT AGT AAT CCT TTT ATG 1872 Pro Phe Gly Gin Val Ala Ala Ala Gly Val Ser Ser Asn Pro Phe Met 530 535 540
ACT GGT GCA CCA ACA GGA CAA TTT CCA ACA GGA AGC TCA TCA ACC AAT 1920 Thr Gly Ala Pro Thr Gly Gin Phe Pro Thr Gly Ser Ser Ser Thr Asn 545 550 555
CCT TTC TTA TAGCCTTATA TAGACAATTT ACTGGAACGA ACTTTTATGT 1969
Pro Phe Leu
560
GGTCACATTA CATCTCTCCA CCTCTTGCAC TGTTGTCTTG TTTCACTGAT CTTAGCTTTA 2029
AACACAAGAG AAGTCTTTAA AAAGCCTGCA TTGTGTATTA AACACCAGGT AATATGTGCA 2089
AAACAGAGGG CTCCAGTAAC ACCTTCTAAC CTGTGAATTG GCAGAAAAGG GTAGCGGTAT 2149
CATGTATATT AAAATTGGCT AATATTAAGT TATTGCAGAT ACCACATTCA TTATGCTGCA 2209
GTACTGTACA TATTTTTCTT AGAAATTAGC TATTTGTGCA TATCAGTATT TGTAACTTTA 2269
ACACATTGTT ATGTGAGAAA TGTTACTGGG GAAATAGATC AGCCACTTTT AAGGTGCTGT 2329
CATATATCTT TGGAATGAAT GACCTAAAAT CATTTTAACC ATGCTACTGG AAAGTAACAG 2389
AGTCAAAATT GGAAGGTTTT ATTCATTCTT GAATTTTTCC TTTCTAAAGA GCTCTTCTAT 2449
TTATACATGC CTAAATTCTT TTAAAATGTA GAGGGATACC TGTCTGCATA ATAAAGCTGA 2509
TCATGTTTTG CTACAGTTTG CAGGTGAAAA AAAATAAATA TTATAAAATA AAAAAAAAAA 2569
AAAAAAAAAA AAAA 2583
(2) INFORMATION FOR SEQ ID NO:2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 562 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:
Met Ala Ala Ser Ala Lys Arg Lys Gin Glu Glu Lys His Leu Lys Met
1 5 10 15
Leu Arg Asp Met Thr Gly Leu Pro His Asn Arg Lys Cys Phe Asp Cys 20 25 30
Asp Gin Arg Gly Pro Thr Tyr Val Asn Met Thr Val Gly Ser Phe Val 35 40 45
Cys Thr Ser Cys Ser Gly Ser Leu Arg Gly Leu Asn Pro Pro His Arg 50 55 60
Val Lys Ser Ile Ser Met Thr Thr Phe Thr Gin Gin Glu Ile Glu Phe 65 70 75 80
Leu Gin Lys His Gly Asn Glu Val Cys Lys Gin Ile Trp Leu Gly Leu 85 90 95
Phe Asp Asp Arg Ser Ser Ala Ile Pro Asp Phe Arg Asp Pro Gin Lys 100 105 110
Val Lys Glu Phe Leu Gin Glu Lys JTyr Glu Lys Lys Arg Trp Tyr Val 115 120 125
Pro Pro Glu Gin Ala Lys Val Val Ala Ser Val His Ala Ser Ile Ser 130 135 140
Gly Ser Ser Ala Ser Ser Thr Ser Ser Thr Pro Glu Val Lys Pro Leu 145 150 155 160
Lys Ser Leu Leu Gly Asp Ser Ala Pro Thr Leu His Leu Asn Lys Gly 165 170 175
Thr Pro Ser Gin Ser Pro Val Val Gly Arg Ser Gin Gly Gin Gin Gin 180 185 190
Glu Lys Lys Gin Phe Asp Leu Leu Ser Asp Leu Gly Ser Asp Ile Phe 195 200 205
Ala Ala Pro Ala Pro Gin Ser Thr Ala Thr Ala Aβn Phe Ala Asn Phe 210 215 220
Ala His Phe Asn Ser His Ala Ala Gin Asn Ser Ala Asn Ala Asp Phe 225 230 235 240
Ala Asn Phe Asp Ala Phe Gly Gin Ser Ser Gly Ser Ser Asn Phe Gly 245 250 255
Gly Phe Pro Thr Ala Ser His Ser Pro Phe Gin Pro Gin Thr Thr Gly 260 265 270
Gly Ser Ala Ala Ser Val Asn Ala Asn Phe Ala His Phe Asp Asn Phe 275 280 285
Pro Lys Ser Ser Ser Ala Asp Phe Gly Thr Phe Asn Thr Ser Gin Ser 290 295 300
His Gin Thr Ala Ser Ala Val Ser Lys Val Ser Thr Asn Lys Ala Gly 305 310 315 320
Leu Gin Thr Ala Asp Lys Tyr Ala Ala Leu Ala Asn Leu Asp Aβn Ile 325 330 335
Phe Ser Ala Gly Gin Gly Gly Asp Gin Gly Ser Gly Phe Gly Thr Thr 340 345 350
Gly Lys Ala Pro Val Gly Ser Val Val Ser Val Pro Ser Gin Ser Ser 355 360 365
Ala Ser Ser Asp Lys Tyr Ala Ala Leu Ala Glu Leu Asp Ser Val Phe 370 375 380
Ser Ser Ala Ala Thr Ser Ser Asn Ala Tyr Thr Ser Thr Ser Asn Ala 385 390 395 400
Ser Ser Asn Val Phe Gly Thr Val Pro Val Val Ala Ser Ala Gin Thr 405 410 415
Gin Pro Ala Ser Ser Ser Val Pro Ala Pro Phe Gly Ala Thr Pro Ser 420 425 430
Thr Asn Pro Phe Val Ala Ala Ala Gly Pro Ser Val Ala Ser Ser Thr 435 440 445
Asn Pro Phe Gin Thr Asn Ala Arg Gly Ala Thr Ala Ala Thr Phe Gly 450 455 460
Thr Ala Ser Met Ser Met Pro Thr Gly Phe Gly Thr Pro Ala Pro Tyr 465 470 475 480
Ser Leu Pro Thr Ser Phe Ser Gly Ser Phe Gin Gin Pro Ala Phe Pro 485 490 495
Ala Gin Ala Ala Phe Pro Gin Gin Thr Ala Phe Ser Gin Gin Pro Asn 500 505 510
Gly Ala Gly Phe Ala Ala Phe Gly Gin Thr Lys Pro Val Val Thr Pro 515 520 525
Phe Gly Gin Val Ala Ala Ala Gly Val Ser Ser Asn Pro Phe Met Thr
530 535 540
Gly Ala Pro Thr Gly Gin Phe Pro Thr Gly Ser Ser Ser Thr Asn Pro 545 550 555 560
Phe Leu
Claims
1. Isolated DNA encoding a cellular co-factor for HIV Rev selected from the group consisting of:
(a) DNA which encodes a protein of SEQ ID NO:2;
(b) DNA which hybridizes to DNA of (a) above under stringent conditions represented by a wash stringency of 0.3 Molar NaCl, 0.03 M sodium citrate, 0.1% SDS at 60°C, and which encodes a cellular co-factor for HIV Rev; and
(c) DNA differing from the DNAs of (a) and (b) above in codon sequence due to the degeneracy of the genetic code, and which encodes a cellular co-factor for HIV Rev.
2. Isolated DNA according to claim 1 which encodes a protein of SEQ ID NO:2.
3. Isolated DNA according to claim 1 having the nucleotide sequence of SEQ ID NO:l.
4. A recombinant DNA comprising vector DNA and a DNA according to claim 1.
5. A recombinant DNA according to claim 4, wherein said vector DNA comprises a plasmid.
6. A recombinant DNA according to claim 4, wherein said vector DNA comprises a virus.
7. A recombinant DNA according to claim 4, wherein said vector DNA comprises a baculovirus.
8. A host cell containing a recombinant DNA of claim 4.
9. A host cell containing a recombinant DNA of claim 4 and capable of expressing the encoded cellular co¬ factor for HIV Rev.
10. A host cell according to claim 9, wherein said host cell is a mammalian cell.
11. A host cell according to claim 9, wherein said host cell is an insect cell.
12. An oligonucleotide probe capable of selectively binding to a DNA of claim 1.
13. An oligonucleotide probe according to claim 12, which probe is capable of serving as a PCR extension primer.
14. An oligonucleotide probe according to claim 12, which probe is labelled with a detectable group.
15. An oligonucleotide probe according to claim 14, which detectable group is a radioactive atom.
16. An isolated protein comprising a cellular co-factor for HIV Rev, which protein is coded for by a DNA according to claim 1.
17. An antibody which specifically binds to a protein of claim 16.
18. An antibody according to claim 17 which is labelled with a detectable group.
19. An antibody according to claim 18, which antibody comprises a monoclonal antibody.
20. A method of screening for compounds that affect the binding of a Lentivirus Rev protein or an HTLV family Rev protein with the cellular co-factor thereof, comprising: providing together a first protein and a second protein so that said first and second protein form a complex thereof, wherein said first protein is selected from the group consisting of Lentivirus Rev proteins and HTLV family Rev proteins, and wherein said second protein is encoded by a DNA selected from the group consisting of (a) DNA which encodes a protein of SEQ ID NO:2; and (b) DNA which hybridizes to DNA of (a) above under stringent conditions represented by a wash stringency of 0.3 Molar NaCl, 0.03 M sodium citrate, 0.1% SDS at 60°C, and which encodes a cellular co-factor for HIV Rev; then combining a test compound with said first and second protein; and then detecting the influence of said test compound on the formation of said complex.
21. A method according to claim 20, wherein said method is carried out in vitro .
22. A method according to claim 20, wherein said method is carried out in vivo .
23. A method according to claim 20, wherein said method is carried out in vivo in a yeast cell.
24. A method according to claim 20, wherein said method is carried out in vivo in an insect cell.
25. A recombinant cell useful for screening for compounds that affect the binding of a Lentivirus Rev protein or an HTLV family Rev protein to the cellular co¬ factor thereof, comprising: a first DNA that expresses a first fusion protein comprising a DNA binding domain and a first binding partner; a second DNA that expresses a second fusion protein comprising a second binding partner and a transcription activation domain; and a third DNA comprising a binding site operatively associated with a reporter gene, wherein said DNA binding domain specifically binds to said binding site, so that said reporter gene is expressed when said first and second fusion proteins bind to one another; wherein said first binding partner is either (a) selected from the group consisting of Lentivirus Rev proteins and HTLV family Rev proteins, or (b) encoded by a DNA selected from the group consisting of (i) DNA which encodes a protein of SEQ ID NO:2 and (ii) DNA which hybridizes to DNA of (i) above under stringent conditions represented by a wash stringency of 0.3 Molar NaCl, 0.03 M sodium citrate, 0.1% SDS at 60°C, and which encodes a cellular co-factor for HIV Rev; and said second binding partner is the other of (a) or (b) above.
26. A recombinant cell according to claim 25, wherein said cell is a yeast cell.
27. A recombinant cell according to claim 25, wherein said binding site is a Gal4 binding site and said binding domain is a Gal4 binding domain.
28. A recombinant cell according to claim 25, wherein said transcription activation domain is selected from the group consisting of the Gal4 transcription activation domain, the VP16 transcription activation domain, and the TAT transcription activation domain.
29. A recombinant cell according to claim 25, wherein said first binding partner is selected from the group consisting of Lentivirus Rev proteins and HTLV family Rev proteins, and said second binding partner is encoded by a DNA selected from the group consisting of (i) DNA which encodes a protein of SEQ ID NO:2 and (ii) DNA which hybridizes to DNA of (i) above under stringent conditions represented by a wash stringency of 0.3 Molar NaCl, 0.03 M sodium citrate, 0.1% SDS at 60°C, and which encodes a cellular co-factor for HIV Rev.
30. A recombinant cell according to claim 25, wherein said first binding partner is encoded by a DNA selected from the group consisting of (i) DNA which encodes a protein of SEQ ID NO:2 and (ii) DNA which hybridizes to DNA of (i) above under stringent conditions represented by a wash stringency of 0.3 Molar NaCl, 0.03 M sodium citrate, 0.1% SDS at 60°C, and which encodes a cellular co-factor for HIV Rev; and wherein said second binding partner is selected from the group consisting of Lentivirus Rev proteins and HTLV family proteins.
31. A recombinant cell according to claim 25, wherein said binding partner is selected from the group consisting of Lentivirus Rev proteins and HTLV family Rev proteins is HIV-l Rev or HTLV-I Rex.
32. A method of treating a lentivirus or HTLV family viruses, comprising introducing into a cell infected with the virus a compound that inhibits the formation of a complex between a lentivirus Rev protein or an HTLV family Rev protein with the cellular co-factor thereof.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US51357095A | 1995-08-10 | 1995-08-10 | |
US08/513,570 | 1995-08-10 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO1997006257A1 true WO1997006257A1 (en) | 1997-02-20 |
WO1997006257A9 WO1997006257A9 (en) | 1997-07-10 |
Family
ID=24043831
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1996/012986 WO1997006257A1 (en) | 1995-08-10 | 1996-08-09 | Cellular co-factor for hiv rev and htlv rex |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO1997006257A1 (en) |
-
1996
- 1996-08-09 WO PCT/US1996/012986 patent/WO1997006257A1/en active Application Filing
Non-Patent Citations (5)
Title |
---|
BOGERD H.P. ET AL.: "Identification of a human co-factor for the rev/rex class of retroviral regulatory proteins.", AIDS RESEARCH AND HUMAN RETROVIRUSES, vol. 11, no. S1, 1 August 1995 (1995-08-01), pages s86, XP002021067 * |
BOGERD H.P. ET AL.: "Identification of a novel cellular cofactor for the Rev/Rex class of retroviral regulatory proteins.", CELL, vol. 82, 11 August 1995 (1995-08-11), pages 485 - 494, XP002021069 * |
FRIDELL R.A. ET AL.: "Identification of a novel human zinc finger protein that specifically interacts with the activation domain of lentiviral tat proteins.", VIROLOGY, 1 June 1995 (1995-06-01), pages 347 - 357, XP002021167 * |
FRITZ C.C. ET AL.: "A human nucleoporin-like protein that specifically interacts with HIV Rev.", NATURE, vol. 376, 10 August 1995 (1995-08-10), pages 530 - 533, XP002021068 * |
RUHL M. ET AL.: "Eukaryotic initiation factor 5a is a cellular target of the human immunodeficiency virus type 1 rev activation domain mediating trans-activation.", JOURNAL OF CELL BIOLOGY, vol. 123, 1993, pages 1309 - 1320, XP000615292 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US5677143A (en) | Cellular nucleic acid binding protein and uses thereof in regulating gene expression and in the treatment of aids | |
Kamine et al. | Identification of a cellular protein that specifically interacts with the essential cysteine region of the HIV-1 Tat transactivator | |
US5786464A (en) | Overexpression of mammalian and viral proteins | |
US5795737A (en) | High level expression of proteins | |
McMillan et al. | HIV-1 Tat directly interacts with the interferon-induced, double-stranded RNA-dependent kinase, PKR | |
Green et al. | Mutational analysis of HIV-1 Tat minimal domain peptides: identification of trans-dominant mutants that suppress HIV-LTR-driven gene expression | |
RU2233329C2 (en) | Synthetic gene encoding green fluorescent protein, method for preparing gene, expressing vector, method for preparing cell culture | |
Yu et al. | Molecular cloning and characterization of a cellular protein that interacts with the human immunodeficiency virus type 1 Tat transactivator and encodes a strong transcriptional activation domain | |
US5688511A (en) | Cellular protein TDP-43 and regulation of HIV-1 gene expression | |
AU733100B2 (en) | A transcriptional coactivator that interacts with tat protein and regulates its binding to tar RNA | |
Zhao et al. | Biochemical mechanism of HIV-1 Vpr function. Oligomerization mediated by the N-terminal domain. | |
Wang et al. | Characterization of a leucine-zipper-like domain in Vpr protein of human immunodeficiency virus type 1 | |
JP3159671B2 (en) | Multivalent repressor of gene function | |
WO1996023899A1 (en) | Methods of selecting a random peptide that binds to a target protein | |
CA2370098C (en) | Human beta-trcp protein | |
WO1995031904A1 (en) | Methods for the identification of hiv anti-viral agents capable of abrogating integrase interactor protein binding | |
Rhim et al. | Exon2 of HIV-2 Tat contributes to transactivation of the HIV-2 LTR by increasing binding affinity to HIV-2 TAR RNA | |
US5468624A (en) | Cell lysis activity of a modified fragment of the glucocorticoid receptor | |
Desai-Yajnik et al. | Interactions of thyroid hormone receptor with the human immunodeficiency virus type 1 (HIV-1) long terminal repeat and the HIV-1 Tat transactivator | |
Chumakov et al. | Localization of the c-ets-2 transactivation domain | |
Xu et al. | A novel hnRNP specifically interacts with HIV-1 RRE RNA | |
WO1997006257A1 (en) | Cellular co-factor for hiv rev and htlv rex | |
WO1997006257A9 (en) | Cellular co-factor for hiv rev and htlv rex | |
Derse et al. | Mutagenesis of EIAV TAT reveals structural features essential for transcriptional activation and TAR element recognition | |
EP0871711A1 (en) | Compositions and methods for site-directed integration into dna |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): CA JP |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE |
|
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
COP | Corrected version of pamphlet |
Free format text: PAGES 1-39,DESCRIPTION,AND PAGES 40-44,CLAIMS,REPLACED BY NEW PAGES BEARING THE SAME NUMBER;PAGES 1/7-7/7,DRAWINGS,REPLACED BY NEW PAGES 1/4-4/4;DUE TO LATE TRANSMITTAL BY THE RECEIVING OFFICE |
|
122 | Ep: pct application non-entry in european phase | ||
NENP | Non-entry into the national phase |
Ref country code: CA |