WO2014070227A1 - Methods and reagents for identifying proximate proteins - Google Patents
Methods and reagents for identifying proximate proteins Download PDFInfo
- Publication number
- WO2014070227A1 WO2014070227A1 PCT/US2013/030261 US2013030261W WO2014070227A1 WO 2014070227 A1 WO2014070227 A1 WO 2014070227A1 US 2013030261 W US2013030261 W US 2013030261W WO 2014070227 A1 WO2014070227 A1 WO 2014070227A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- group
- proteins
- absent
- protein
- cells
- Prior art date
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 232
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 217
- 238000000034 method Methods 0.000 title claims abstract description 70
- 239000003153 chemical reaction reagent Substances 0.000 title abstract description 9
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N biotin Natural products N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 claims abstract description 139
- 108020001507 fusion proteins Proteins 0.000 claims abstract description 75
- 102000037865 fusion proteins Human genes 0.000 claims abstract description 74
- 229960002685 biotin Drugs 0.000 claims abstract description 72
- 239000011616 biotin Substances 0.000 claims abstract description 72
- 235000020958 biotin Nutrition 0.000 claims abstract description 71
- 108090000364 Ligases Proteins 0.000 claims abstract description 8
- 102000003960 Ligases Human genes 0.000 claims abstract description 8
- 229910052720 vanadium Inorganic materials 0.000 claims description 92
- 108050003866 Bifunctional ligase/repressor BirA Proteins 0.000 claims description 87
- 102100033743 Biotin-[acetyl-CoA-carboxylase] ligase Human genes 0.000 claims description 87
- 229910052717 sulfur Inorganic materials 0.000 claims description 62
- 229910052740 iodine Inorganic materials 0.000 claims description 54
- 229910052757 nitrogen Inorganic materials 0.000 claims description 53
- 150000007523 nucleic acids Chemical class 0.000 claims description 50
- 229910052731 fluorine Inorganic materials 0.000 claims description 47
- 230000014509 gene expression Effects 0.000 claims description 47
- 229910052727 yttrium Inorganic materials 0.000 claims description 46
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 45
- 108020004707 nucleic acids Proteins 0.000 claims description 42
- 102000039446 nucleic acids Human genes 0.000 claims description 42
- 229910052700 potassium Inorganic materials 0.000 claims description 40
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 40
- 239000013604 expression vector Substances 0.000 claims description 33
- 229920001184 polypeptide Polymers 0.000 claims description 33
- 229910052799 carbon Inorganic materials 0.000 claims description 32
- 229910052698 phosphorus Inorganic materials 0.000 claims description 29
- 229910052739 hydrogen Inorganic materials 0.000 claims description 18
- 230000001413 cellular effect Effects 0.000 claims description 14
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 claims description 8
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 claims description 8
- 239000001963 growth medium Substances 0.000 claims description 7
- 230000009261 transgenic effect Effects 0.000 claims description 6
- 238000012258 culturing Methods 0.000 claims description 5
- 241000124008 Mammalia Species 0.000 claims description 4
- 239000013611 chromosomal DNA Substances 0.000 claims description 4
- 239000013598 vector Substances 0.000 claims description 4
- 101100515517 Arabidopsis thaliana XI-I gene Proteins 0.000 claims description 3
- 125000003275 alpha amino acid group Chemical group 0.000 claims 8
- 239000002253 acid Substances 0.000 claims 3
- 210000004962 mammalian cell Anatomy 0.000 abstract description 12
- 238000001727 in vivo Methods 0.000 abstract description 10
- 235000018102 proteins Nutrition 0.000 description 203
- 210000004027 cell Anatomy 0.000 description 170
- 108010021099 Lamin Type A Proteins 0.000 description 68
- 102000008201 Lamin Type A Human genes 0.000 description 68
- 230000006287 biotinylation Effects 0.000 description 47
- 238000007413 biotinylation Methods 0.000 description 45
- 210000000633 nuclear envelope Anatomy 0.000 description 34
- 108091006004 biotinylated proteins Proteins 0.000 description 33
- 239000011324 bead Substances 0.000 description 31
- 238000004458 analytical method Methods 0.000 description 28
- 238000004949 mass spectrometry Methods 0.000 description 27
- 150000001413 amino acids Chemical group 0.000 description 26
- 239000012528 membrane Substances 0.000 description 23
- 230000003993 interaction Effects 0.000 description 20
- 230000006870 function Effects 0.000 description 18
- 210000002353 nuclear lamina Anatomy 0.000 description 18
- 102100026937 Soluble lamin-associated protein of 75 kDa Human genes 0.000 description 17
- 239000000523 sample Substances 0.000 description 17
- 239000011534 wash buffer Substances 0.000 description 17
- 108010090804 Streptavidin Proteins 0.000 description 16
- 235000019731 tricalcium phosphate Nutrition 0.000 description 16
- 210000005053 lamin Anatomy 0.000 description 15
- 238000013459 approach Methods 0.000 description 14
- 238000003119 immunoblot Methods 0.000 description 14
- 108010047294 Lamins Proteins 0.000 description 13
- 239000000872 buffer Substances 0.000 description 13
- 238000002474 experimental method Methods 0.000 description 13
- 230000004927 fusion Effects 0.000 description 13
- 238000001890 transfection Methods 0.000 description 13
- 229920004890 Triton X-100 Polymers 0.000 description 12
- 230000000903 blocking effect Effects 0.000 description 12
- 230000001939 inductive effect Effects 0.000 description 12
- 230000008569 process Effects 0.000 description 12
- 102000006835 Lamins Human genes 0.000 description 11
- 239000013504 Triton X-100 Substances 0.000 description 11
- 230000001419 dependent effect Effects 0.000 description 11
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 11
- 238000001262 western blot Methods 0.000 description 11
- 108020004414 DNA Proteins 0.000 description 10
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 10
- 239000007983 Tris buffer Substances 0.000 description 10
- 230000000694 effects Effects 0.000 description 10
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 10
- 210000004492 nuclear pore Anatomy 0.000 description 10
- 239000006228 supernatant Substances 0.000 description 10
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 10
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 9
- 102100034239 Emerin Human genes 0.000 description 9
- 101000911601 Homo sapiens Soluble lamin-associated protein of 75 kDa Proteins 0.000 description 9
- 230000035772 mutation Effects 0.000 description 9
- 238000010791 quenching Methods 0.000 description 9
- 239000000243 solution Substances 0.000 description 9
- 238000012546 transfer Methods 0.000 description 9
- 108010052285 Membrane Proteins Proteins 0.000 description 8
- 235000001014 amino acid Nutrition 0.000 description 8
- 229940024606 amino acid Drugs 0.000 description 8
- 239000000470 constituent Substances 0.000 description 8
- 238000005516 engineering process Methods 0.000 description 8
- 239000012634 fragment Substances 0.000 description 8
- 238000010166 immunofluorescence Methods 0.000 description 8
- 239000012139 lysis buffer Substances 0.000 description 8
- 239000000203 mixture Substances 0.000 description 8
- 230000004048 modification Effects 0.000 description 8
- 238000012986 modification Methods 0.000 description 8
- 230000006916 protein interaction Effects 0.000 description 8
- 238000000527 sonication Methods 0.000 description 8
- 238000003146 transient transfection Methods 0.000 description 8
- 102100037151 Barrier-to-autointegration factor Human genes 0.000 description 7
- 230000004568 DNA-binding Effects 0.000 description 7
- 230000008901 benefit Effects 0.000 description 7
- 230000027455 binding Effects 0.000 description 7
- KXGVEGMKQFWNSR-LLQZFEROSA-N deoxycholic acid Chemical compound C([C@H]1CC2)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(O)=O)C)[C@@]2(C)[C@@H](O)C1 KXGVEGMKQFWNSR-LLQZFEROSA-N 0.000 description 7
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 7
- 238000000799 fluorescence microscopy Methods 0.000 description 7
- 238000002372 labelling Methods 0.000 description 7
- 239000002609 medium Substances 0.000 description 7
- 239000013612 plasmid Substances 0.000 description 7
- 230000000171 quenching effect Effects 0.000 description 7
- HCHFRAXBELVCGG-JYFOCSDGSA-N (2z,3z)-2,3-bis[(4-methoxyphenyl)methylidene]butanedinitrile Chemical compound C1=CC(OC)=CC=C1\C=C(/C#N)\C(\C#N)=C\C1=CC=C(OC)C=C1 HCHFRAXBELVCGG-JYFOCSDGSA-N 0.000 description 6
- HCHFRAXBELVCGG-UHFFFAOYSA-N Emerin Natural products C1=CC(OC)=CC=C1C=C(C#N)C(C#N)=CC1=CC=C(OC)C=C1 HCHFRAXBELVCGG-UHFFFAOYSA-N 0.000 description 6
- 108010056197 emerin Proteins 0.000 description 6
- 239000013613 expression plasmid Substances 0.000 description 6
- 238000001228 spectrum Methods 0.000 description 6
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 6
- SGKRLCUYIXIAHR-AKNGSSGZSA-N (4s,4ar,5s,5ar,6r,12ar)-4-(dimethylamino)-1,5,10,11,12a-pentahydroxy-6-methyl-3,12-dioxo-4a,5,5a,6-tetrahydro-4h-tetracene-2-carboxamide Chemical compound C1=CC=C2[C@H](C)[C@@H]([C@H](O)[C@@H]3[C@](C(O)=C(C(N)=O)C(=O)[C@H]3N(C)C)(O)C3=O)C3=C(O)C2=C1O SGKRLCUYIXIAHR-AKNGSSGZSA-N 0.000 description 5
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 5
- 108010077544 Chromatin Proteins 0.000 description 5
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 5
- 101000801088 Homo sapiens Transmembrane protein 201 Proteins 0.000 description 5
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 5
- 108010029485 Protein Isoforms Proteins 0.000 description 5
- 102000001708 Protein Isoforms Human genes 0.000 description 5
- 102100033708 Transmembrane protein 201 Human genes 0.000 description 5
- 210000004899 c-terminal region Anatomy 0.000 description 5
- 210000003483 chromatin Anatomy 0.000 description 5
- 230000001086 cytosolic effect Effects 0.000 description 5
- 229960003964 deoxycholic acid Drugs 0.000 description 5
- KXGVEGMKQFWNSR-UHFFFAOYSA-N deoxycholic acid Natural products C1CC2CC(O)CCC2(C)C2C1C1CCC(C(CCC(O)=O)C)C1(C)C(O)C2 KXGVEGMKQFWNSR-UHFFFAOYSA-N 0.000 description 5
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 5
- 229960003722 doxycycline Drugs 0.000 description 5
- 238000001378 electrochemiluminescence detection Methods 0.000 description 5
- 210000003527 eukaryotic cell Anatomy 0.000 description 5
- 235000019253 formic acid Nutrition 0.000 description 5
- 235000018977 lysine Nutrition 0.000 description 5
- 238000007885 magnetic separation Methods 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- 230000008685 targeting Effects 0.000 description 5
- 238000013518 transcription Methods 0.000 description 5
- 230000035897 transcription Effects 0.000 description 5
- 230000001052 transient effect Effects 0.000 description 5
- 230000003612 virological effect Effects 0.000 description 5
- 108091026890 Coding region Proteins 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- 108010033040 Histones Proteins 0.000 description 4
- 102000006947 Histones Human genes 0.000 description 4
- MHAJPDPJQMAIIY-UHFFFAOYSA-N Hydrogen peroxide Chemical compound OO MHAJPDPJQMAIIY-UHFFFAOYSA-N 0.000 description 4
- 102100023981 Lamina-associated polypeptide 2, isoform alpha Human genes 0.000 description 4
- 239000004472 Lysine Substances 0.000 description 4
- -1 M 565 S Inorganic materials 0.000 description 4
- 108091028043 Nucleic acid sequence Proteins 0.000 description 4
- PXIPVTKHYLBLMZ-UHFFFAOYSA-N Sodium azide Chemical compound [Na+].[N-]=[N+]=[N-] PXIPVTKHYLBLMZ-UHFFFAOYSA-N 0.000 description 4
- 101710159648 Uncharacterized protein Proteins 0.000 description 4
- 230000033228 biological regulation Effects 0.000 description 4
- 230000006037 cell lysis Effects 0.000 description 4
- 230000002759 chromosomal effect Effects 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 229910052805 deuterium Inorganic materials 0.000 description 4
- 238000009826 distribution Methods 0.000 description 4
- 238000010820 immunofluorescence microscopy Methods 0.000 description 4
- 230000010354 integration Effects 0.000 description 4
- KWGKDLIKAYFUFQ-UHFFFAOYSA-M lithium chloride Chemical compound [Li+].[Cl-] KWGKDLIKAYFUFQ-UHFFFAOYSA-M 0.000 description 4
- 230000005291 magnetic effect Effects 0.000 description 4
- 238000010369 molecular cloning Methods 0.000 description 4
- 210000004940 nucleus Anatomy 0.000 description 4
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 4
- 108091033319 polynucleotide Proteins 0.000 description 4
- 102000040430 polynucleotide Human genes 0.000 description 4
- 239000002157 polynucleotide Substances 0.000 description 4
- 150000003141 primary amines Chemical class 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 4
- 230000001105 regulatory effect Effects 0.000 description 4
- 238000000926 separation method Methods 0.000 description 4
- 238000002864 sequence alignment Methods 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 230000009469 supplementation Effects 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 229910052721 tungsten Inorganic materials 0.000 description 4
- 238000010200 validation analysis Methods 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 3
- 102100039164 Acetyl-CoA carboxylase 1 Human genes 0.000 description 3
- 108010018763 Biotin carboxylase Proteins 0.000 description 3
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 3
- 230000033616 DNA repair Effects 0.000 description 3
- 102100034690 Delta(14)-sterol reductase LBR Human genes 0.000 description 3
- 201000009344 Emery-Dreifuss muscular dystrophy Diseases 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
- 101001092912 Haloferax volcanii (strain ATCC 29605 / DSM 3757 / JCM 8879 / NBRC 14742 / NCIMB 2012 / VKM B-1768 / DS2) Small archaeal modifier protein 1 Proteins 0.000 description 3
- 101001059713 Homo sapiens Inner nuclear membrane protein Man1 Proteins 0.000 description 3
- 101000662686 Homo sapiens Torsin-1A Proteins 0.000 description 3
- 102100028799 Inner nuclear membrane protein Man1 Human genes 0.000 description 3
- 102000011782 Keratins Human genes 0.000 description 3
- 108010076876 Keratins Proteins 0.000 description 3
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 3
- 101710097668 Leucine aminopeptidase 2 Proteins 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 101150001569 Nup50 gene Proteins 0.000 description 3
- 102100026531 Prelamin-A/C Human genes 0.000 description 3
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 3
- 239000012722 SDS sample buffer Substances 0.000 description 3
- 102100037454 Torsin-1A Human genes 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- 238000012512 characterization method Methods 0.000 description 3
- 238000010382 chemical cross-linking Methods 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 230000009089 cytolysis Effects 0.000 description 3
- 210000000805 cytoplasm Anatomy 0.000 description 3
- 239000003599 detergent Substances 0.000 description 3
- 230000029087 digestion Effects 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 210000001671 embryonic stem cell Anatomy 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 108010022838 lamin B receptor Proteins 0.000 description 3
- 230000004807 localization Effects 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 230000002093 peripheral effect Effects 0.000 description 3
- 230000008823 permeabilization Effects 0.000 description 3
- 230000004481 post-translational protein modification Effects 0.000 description 3
- 239000002243 precursor Substances 0.000 description 3
- 101710082686 probable leucine aminopeptidase 2 Proteins 0.000 description 3
- 230000004850 protein–protein interaction Effects 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 238000010561 standard procedure Methods 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 238000004885 tandem mass spectrometry Methods 0.000 description 3
- 239000003104 tissue culture media Substances 0.000 description 3
- 229910052722 tritium Inorganic materials 0.000 description 3
- 238000001086 yeast two-hybrid system Methods 0.000 description 3
- UUUHXMGGBIUAPW-UHFFFAOYSA-N 1-[1-[2-[[5-amino-2-[[1-[5-(diaminomethylideneamino)-2-[[1-[3-(1h-indol-3-yl)-2-[(5-oxopyrrolidine-2-carbonyl)amino]propanoyl]pyrrolidine-2-carbonyl]amino]pentanoyl]pyrrolidine-2-carbonyl]amino]-5-oxopentanoyl]amino]-3-methylpentanoyl]pyrrolidine-2-carbon Chemical compound C1CCC(C(=O)N2C(CCC2)C(O)=O)N1C(=O)C(C(C)CC)NC(=O)C(CCC(N)=O)NC(=O)C1CCCN1C(=O)C(CCCN=C(N)N)NC(=O)C1CCCN1C(=O)C(CC=1C2=CC=CC=C2NC=1)NC(=O)C1CCC(=O)N1 UUUHXMGGBIUAPW-UHFFFAOYSA-N 0.000 description 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- 102000007469 Actins Human genes 0.000 description 2
- 108010085238 Actins Proteins 0.000 description 2
- ATRRKUHOCOJYRX-UHFFFAOYSA-N Ammonium bicarbonate Chemical compound [NH4+].OC([O-])=O ATRRKUHOCOJYRX-UHFFFAOYSA-N 0.000 description 2
- 229910000013 Ammonium bicarbonate Inorganic materials 0.000 description 2
- 102100030988 Angiotensin-converting enzyme Human genes 0.000 description 2
- 241000893512 Aquifex aeolicus Species 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 241000283707 Capra Species 0.000 description 2
- 108010031746 Dam methyltransferase Proteins 0.000 description 2
- QRLVDLBMBULFAL-UHFFFAOYSA-N Digitonin Natural products CC1CCC2(OC1)OC3C(O)C4C5CCC6CC(OC7OC(CO)C(OC8OC(CO)C(O)C(OC9OCC(O)C(O)C9OC%10OC(CO)C(O)C(OC%11OC(CO)C(O)C(O)C%11O)C%10O)C8O)C(O)C7O)C(O)CC6(C)C5CCC4(C)C3C2C QRLVDLBMBULFAL-UHFFFAOYSA-N 0.000 description 2
- UPEZCKBFRMILAV-JNEQICEOSA-N Ecdysone Natural products O=C1[C@H]2[C@@](C)([C@@H]3C([C@@]4(O)[C@@](C)([C@H]([C@H]([C@@H](O)CCC(O)(C)C)C)CC4)CC3)=C1)C[C@H](O)[C@H](O)C2 UPEZCKBFRMILAV-JNEQICEOSA-N 0.000 description 2
- 108090000217 Frataxin Proteins 0.000 description 2
- 102000003869 Frataxin Human genes 0.000 description 2
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- 101150077556 LMNA gene Proteins 0.000 description 2
- 102000018697 Membrane Proteins Human genes 0.000 description 2
- 101710202335 Nesprin-1 Proteins 0.000 description 2
- 102100023306 Nesprin-1 Human genes 0.000 description 2
- 101710202339 Nesprin-2 Proteins 0.000 description 2
- 102100023305 Nesprin-2 Human genes 0.000 description 2
- 102000007999 Nuclear Proteins Human genes 0.000 description 2
- 108010089610 Nuclear Proteins Proteins 0.000 description 2
- 108090000163 Nuclear pore complex proteins Proteins 0.000 description 2
- 102000003789 Nuclear pore complex proteins Human genes 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- 229930040373 Paraformaldehyde Natural products 0.000 description 2
- 108090000882 Peptidyl-Dipeptidase A Proteins 0.000 description 2
- 239000004098 Tetracycline Substances 0.000 description 2
- 108091023040 Transcription factor Proteins 0.000 description 2
- 102000040945 Transcription factor Human genes 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 238000001261 affinity purification Methods 0.000 description 2
- UPEZCKBFRMILAV-UHFFFAOYSA-N alpha-Ecdysone Natural products C1C(O)C(O)CC2(C)C(CCC3(C(C(C(O)CCC(C)(C)O)C)CCC33O)C)C3=CC(=O)C21 UPEZCKBFRMILAV-UHFFFAOYSA-N 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 235000012538 ammonium bicarbonate Nutrition 0.000 description 2
- 239000001099 ammonium carbonate Substances 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 230000006399 behavior Effects 0.000 description 2
- 239000012148 binding buffer Substances 0.000 description 2
- 229940098773 bovine serum albumin Drugs 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 108091006090 chromatin-associated proteins Proteins 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 108091036078 conserved sequence Proteins 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 238000011109 contamination Methods 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 235000018417 cysteine Nutrition 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 229940009976 deoxycholate Drugs 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- UVYVLBIGDKGWPX-KUAJCENISA-N digitonin Chemical compound O([C@@H]1[C@@H]([C@]2(CC[C@@H]3[C@@]4(C)C[C@@H](O)[C@H](O[C@H]5[C@@H]([C@@H](O)[C@@H](O[C@H]6[C@@H]([C@@H](O[C@H]7[C@@H]([C@@H](O)[C@H](O)CO7)O)[C@H](O)[C@@H](CO)O6)O[C@H]6[C@@H]([C@@H](O[C@H]7[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O7)O)[C@@H](O)[C@@H](CO)O6)O)[C@@H](CO)O5)O)C[C@@H]4CC[C@H]3[C@@H]2[C@@H]1O)C)[C@@H]1C)[C@]11CC[C@@H](C)CO1 UVYVLBIGDKGWPX-KUAJCENISA-N 0.000 description 2
- UVYVLBIGDKGWPX-UHFFFAOYSA-N digitonine Natural products CC1C(C2(CCC3C4(C)CC(O)C(OC5C(C(O)C(OC6C(C(OC7C(C(O)C(O)CO7)O)C(O)C(CO)O6)OC6C(C(OC7C(C(O)C(O)C(CO)O7)O)C(O)C(CO)O6)O)C(CO)O5)O)CC4CCC3C2C2O)C)C2OC11CCC(C)CO1 UVYVLBIGDKGWPX-UHFFFAOYSA-N 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- UPEZCKBFRMILAV-JMZLNJERSA-N ecdysone Chemical compound C1[C@@H](O)[C@@H](O)C[C@]2(C)[C@@H](CC[C@@]3([C@@H]([C@@H]([C@H](O)CCC(C)(C)O)C)CC[C@]33O)C)C3=CC(=O)[C@@H]21 UPEZCKBFRMILAV-JMZLNJERSA-N 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- 238000003306 harvesting Methods 0.000 description 2
- 108010025649 holocarboxylase synthetases Proteins 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 239000000411 inducer Substances 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 230000016507 interphase Effects 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 208000026585 laminopathy Diseases 0.000 description 2
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 238000000386 microscopy Methods 0.000 description 2
- 230000005012 migration Effects 0.000 description 2
- 238000013508 migration Methods 0.000 description 2
- 230000002438 mitochondrial effect Effects 0.000 description 2
- 230000011278 mitosis Effects 0.000 description 2
- 230000037230 mobility Effects 0.000 description 2
- 238000005319 nano flow HPLC Methods 0.000 description 2
- 230000008520 organization Effects 0.000 description 2
- 229910052760 oxygen Inorganic materials 0.000 description 2
- 229920002866 paraformaldehyde Polymers 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 239000003531 protein hydrolysate Substances 0.000 description 2
- 238000000575 proteomic method Methods 0.000 description 2
- 229950010131 puromycin Drugs 0.000 description 2
- 238000003259 recombinant expression Methods 0.000 description 2
- 230000014493 regulation of gene expression Effects 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 239000012723 sample buffer Substances 0.000 description 2
- 229920006395 saturated elastomer Polymers 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 238000005063 solubilization Methods 0.000 description 2
- 230000007928 solubilization Effects 0.000 description 2
- 210000001082 somatic cell Anatomy 0.000 description 2
- 125000006850 spacer group Chemical group 0.000 description 2
- 230000010473 stable expression Effects 0.000 description 2
- 239000007858 starting material Substances 0.000 description 2
- 150000003431 steroids Chemical class 0.000 description 2
- 230000000153 supplemental effect Effects 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 230000002123 temporal effect Effects 0.000 description 2
- 229960002180 tetracycline Drugs 0.000 description 2
- 229930101283 tetracycline Natural products 0.000 description 2
- 235000019364 tetracycline Nutrition 0.000 description 2
- 150000003522 tetracyclines Chemical class 0.000 description 2
- 239000013603 viral vector Substances 0.000 description 2
- FWBHETKCLVMNFS-UHFFFAOYSA-N 4',6-Diamino-2-phenylindol Chemical compound C1=CC(C(=N)N)=CC=C1C1=CC2=CC=C(C(N)=N)C=C2N1 FWBHETKCLVMNFS-UHFFFAOYSA-N 0.000 description 1
- MXCVHSXCXPHOLP-UHFFFAOYSA-N 4-oxo-6-propylchromene-2-carboxylic acid Chemical compound O1C(C(O)=O)=CC(=O)C2=CC(CCC)=CC=C21 MXCVHSXCXPHOLP-UHFFFAOYSA-N 0.000 description 1
- ODHCTXKNWHHXJC-VKHMYHEASA-N 5-oxo-L-proline Chemical compound OC(=O)[C@@H]1CCC(=O)N1 ODHCTXKNWHHXJC-VKHMYHEASA-N 0.000 description 1
- 108010016219 Acetyl-CoA carboxylase Proteins 0.000 description 1
- 102000000452 Acetyl-CoA carboxylase Human genes 0.000 description 1
- 102100021641 Acetyl-CoA carboxylase 2 Human genes 0.000 description 1
- 239000012103 Alexa Fluor 488 Substances 0.000 description 1
- 239000012109 Alexa Fluor 568 Substances 0.000 description 1
- 239000012099 Alexa Fluor family Substances 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 102100038105 Arpin Human genes 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 108010078286 Ataxins Proteins 0.000 description 1
- 102000014461 Ataxins Human genes 0.000 description 1
- 108090001008 Avidin Proteins 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 101710118343 Barrier-to-autointegration factor Proteins 0.000 description 1
- 101710201279 Biotin carboxyl carrier protein Proteins 0.000 description 1
- 240000008100 Brassica rapa Species 0.000 description 1
- 108091006146 Channels Proteins 0.000 description 1
- 241000251556 Chordata Species 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 238000011537 Coomassie blue staining Methods 0.000 description 1
- 241000699800 Cricetinae Species 0.000 description 1
- 101100301096 Drosophila melanogaster Nup358 gene Proteins 0.000 description 1
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 1
- IMROMDMJAWUWLK-UHFFFAOYSA-N Ethenol Chemical compound OC=C IMROMDMJAWUWLK-UHFFFAOYSA-N 0.000 description 1
- 102000013366 Filamin Human genes 0.000 description 1
- 108060002900 Filamin Proteins 0.000 description 1
- 102100026561 Filamin-A Human genes 0.000 description 1
- 206010017577 Gait disturbance Diseases 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 101000963424 Homo sapiens Acetyl-CoA carboxylase 1 Proteins 0.000 description 1
- 101000677540 Homo sapiens Acetyl-CoA carboxylase 2 Proteins 0.000 description 1
- 101000884441 Homo sapiens Arpin Proteins 0.000 description 1
- 101000913549 Homo sapiens Filamin-A Proteins 0.000 description 1
- 101001047746 Homo sapiens Lamina-associated polypeptide 2, isoform alpha Proteins 0.000 description 1
- 101001047731 Homo sapiens Lamina-associated polypeptide 2, isoforms beta/gamma Proteins 0.000 description 1
- 101001014213 Homo sapiens Morphogenetic neuropeptide Proteins 0.000 description 1
- 101000988394 Homo sapiens PDZ and LIM domain protein 5 Proteins 0.000 description 1
- 101001003584 Homo sapiens Prelamin-A/C Proteins 0.000 description 1
- 101000880385 Homo sapiens STING ER exit protein Proteins 0.000 description 1
- 101000800495 Homo sapiens Telomere length and silencing protein 1 homolog Proteins 0.000 description 1
- 101000655171 Homo sapiens Transmembrane protein 230 Proteins 0.000 description 1
- 108090000144 Human Proteins Proteins 0.000 description 1
- 102000003839 Human Proteins Human genes 0.000 description 1
- 102000012411 Intermediate Filament Proteins Human genes 0.000 description 1
- 108010061998 Intermediate Filament Proteins Proteins 0.000 description 1
- 201000008645 Joubert syndrome Diseases 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 241000283986 Lepus Species 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 101710097665 Leucine aminopeptidase 1 Proteins 0.000 description 1
- 102100033292 Leucine-rich repeat-containing protein 7 Human genes 0.000 description 1
- 206010049287 Lipodystrophy acquired Diseases 0.000 description 1
- 239000012097 Lipofectamine 2000 Substances 0.000 description 1
- 229920002858 MOWIOL ® 4-88 Polymers 0.000 description 1
- 102100025169 Max-binding protein MNT Human genes 0.000 description 1
- 108010090054 Membrane Glycoproteins Proteins 0.000 description 1
- 102000012750 Membrane Glycoproteins Human genes 0.000 description 1
- 108010006519 Molecular Chaperones Proteins 0.000 description 1
- 102000005431 Molecular Chaperones Human genes 0.000 description 1
- 108010021466 Mutant Proteins Proteins 0.000 description 1
- 102000008300 Mutant Proteins Human genes 0.000 description 1
- 101710150912 Myc protein Proteins 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 101150118334 Nup214 gene Proteins 0.000 description 1
- 102100029181 PDZ and LIM domain protein 5 Human genes 0.000 description 1
- 239000004185 Penicillin G procaine Substances 0.000 description 1
- 108010064218 Poly (ADP-Ribose) Polymerase-1 Proteins 0.000 description 1
- 102100023712 Poly [ADP-ribose] polymerase 1 Human genes 0.000 description 1
- 239000004743 Polypropylene Substances 0.000 description 1
- 206010063493 Premature ageing Diseases 0.000 description 1
- 208000032038 Premature aging Diseases 0.000 description 1
- 101710082688 Probable leucine aminopeptidase 1 Proteins 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 101710186027 Propionyl-CoA carboxylase alpha chain Proteins 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 108010026552 Proteome Proteins 0.000 description 1
- 108010053763 Pyruvate Carboxylase Proteins 0.000 description 1
- 102100039895 Pyruvate carboxylase, mitochondrial Human genes 0.000 description 1
- 108700008625 Reporter Genes Proteins 0.000 description 1
- 102000002278 Ribosomal Proteins Human genes 0.000 description 1
- 108010000605 Ribosomal Proteins Proteins 0.000 description 1
- 102100037658 STING ER exit protein Human genes 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 101100127690 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) FAA2 gene Proteins 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000710960 Sindbis virus Species 0.000 description 1
- 102100033113 Telomere length and silencing protein 1 homolog Human genes 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 102100033033 Transmembrane protein 230 Human genes 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 101150095565 WRKY62 gene Proteins 0.000 description 1
- 208000015953 X-linked Emery-Dreifuss muscular dystrophy Diseases 0.000 description 1
- 201000001520 X-linked Emery-Dreifuss muscular dystrophy 1 Diseases 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 238000013019 agitation Methods 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 238000000429 assembly Methods 0.000 description 1
- 230000000712 assembly Effects 0.000 description 1
- 208000026913 autosomal dominant Emery-Dreifuss muscular dystrophy Diseases 0.000 description 1
- 201000001553 autosomal dominant Emery-Dreifuss muscular dystrophy 2 Diseases 0.000 description 1
- 230000004888 barrier function Effects 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 238000002306 biochemical method Methods 0.000 description 1
- 230000031018 biological processes and functions Effects 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 108010032108 biotin repressor Proteins 0.000 description 1
- 150000001615 biotins Chemical class 0.000 description 1
- 239000012888 bovine serum Substances 0.000 description 1
- GEHJBWKLJVFKPS-UHFFFAOYSA-N bromochloroacetic acid Chemical compound OC(=O)C(Cl)Br GEHJBWKLJVFKPS-UHFFFAOYSA-N 0.000 description 1
- 238000010366 cell biology technique Methods 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 239000006143 cell culture medium Substances 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 230000030570 cellular localization Effects 0.000 description 1
- SMNPLAKEGAEPJD-UHFFFAOYSA-N chembl34922 Chemical compound Cl.Cl.Cl.C1CN(C)CCN1C1=CC=C(NC(=N2)C=3C=C4N=C(NC4=CC=3)C=3C=CC(O)=CC=3)C2=C1 SMNPLAKEGAEPJD-UHFFFAOYSA-N 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 208000031214 ciliopathy Diseases 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000000749 co-immunoprecipitation Methods 0.000 description 1
- 230000005757 colony formation Effects 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- ALEXXDVDDISNDU-JZYPGELDSA-N cortisol 21-acetate Chemical compound C1CC2=CC(=O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@@](C(=O)COC(=O)C)(O)[C@@]1(C)C[C@@H]2O ALEXXDVDDISNDU-JZYPGELDSA-N 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 150000001945 cysteines Chemical class 0.000 description 1
- 230000003436 cytoskeletal effect Effects 0.000 description 1
- 230000006240 deamidation Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 238000009792 diffusion process Methods 0.000 description 1
- 238000006471 dimerization reaction Methods 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 230000001973 epigenetic effect Effects 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 239000012894 fetal calf serum Substances 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 239000012520 frozen sample Substances 0.000 description 1
- 239000000446 fuel Substances 0.000 description 1
- 210000004907 gland Anatomy 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 230000002163 immunogen Effects 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- PGLTVOMIXTUURA-UHFFFAOYSA-N iodoacetamide Chemical compound NC(=O)CI PGLTVOMIXTUURA-UHFFFAOYSA-N 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- 210000003292 kidney cell Anatomy 0.000 description 1
- 108010007628 lamina-associated polypeptide 2 Proteins 0.000 description 1
- 208000006132 lipodystrophy Diseases 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 238000001294 liquid chromatography-tandem mass spectrometry Methods 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 150000002669 lysines Chemical class 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 108010071806 methylcrotonoyl-CoA carboxylase Proteins 0.000 description 1
- 235000013336 milk Nutrition 0.000 description 1
- 210000004080 milk Anatomy 0.000 description 1
- 239000008267 milk Substances 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- 201000006938 muscular dystrophy Diseases 0.000 description 1
- 210000004897 n-terminal region Anatomy 0.000 description 1
- 238000007857 nested PCR Methods 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 230000009871 nonspecific binding Effects 0.000 description 1
- 230000012223 nuclear import Effects 0.000 description 1
- 230000006849 nucleocytoplasmic transport Effects 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 238000013021 overheating Methods 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 230000005298 paramagnetic effect Effects 0.000 description 1
- 238000005192 partition Methods 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 102000013415 peroxidase activity proteins Human genes 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 238000005375 photometry Methods 0.000 description 1
- 229920001155 polypropylene Polymers 0.000 description 1
- 208000030087 premature aging syndrome Diseases 0.000 description 1
- 230000013823 prenylation Effects 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 238000002331 protein detection Methods 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 239000008213 purified water Substances 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 102220204833 rs1057522953 Human genes 0.000 description 1
- 238000010079 rubber tapping Methods 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 239000011550 stock solution Substances 0.000 description 1
- 230000004960 subcellular localization Effects 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 230000036962 time dependent Effects 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 108091006107 transcriptional repressors Proteins 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- 108091005703 transmembrane proteins Proteins 0.000 description 1
- 102000035160 transmembrane proteins Human genes 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- WVLBCYQITXONBZ-UHFFFAOYSA-N trimethyl phosphate Chemical compound COP(=O)(OC)OC WVLBCYQITXONBZ-UHFFFAOYSA-N 0.000 description 1
- GPRLSGONYQIRFK-MNYXATJNSA-N triton Chemical compound [3H+] GPRLSGONYQIRFK-MNYXATJNSA-N 0.000 description 1
- 238000001665 trituration Methods 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
- G01N33/6803—General methods of protein analysis not limited to specific proteins or families of proteins
- G01N33/6845—Methods of identifying protein-protein interactions in protein mixtures
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2333/00—Assays involving biological materials from specific organisms or of a specific nature
- G01N2333/435—Assays involving biological materials from specific organisms or of a specific nature from animals; from humans
- G01N2333/78—Connective tissue peptides, e.g. collagen, elastin, laminin, fibronectin, vitronectin, cold insoluble globulin [CIG]
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2800/00—Detection or diagnosis of diseases
- G01N2800/28—Neurological disorders
- G01N2800/2878—Muscular dystrophy
Definitions
- the present invention provides methods for identifying in vivo proximate proteins, comprising (a) culturing recombinant cells comprising a recombinant nucleic acid capable of directing expression of a fusion protein comprising (i) a heterologous promiscuous biotin protein ligase (BPL), or functional equivalent thereof, and (ii) a bait polypeptide, wherein the culturing is carried out under conditions suitable for expression of the fusion protein in the eukaryotic cells; and (b) identifying biotinylated proteins, wherein the biotinylated proteins are proteins present in the cells proximate to the bait polypeptide.
- the BPL, or functional equivalent thereof comprises a protein with an amino acid sequence of general formula 1;
- I is selected from the group consisting of A, L, I, T, V, or is absent;
- X2 is selected from the group consisting of C, L, V, H, A, or is absent;
- X3 is selected from the group consisting of I, V, and L;
- X4 is selected from the group consisting of A and G;
- X5 is selected from the group consisting of E, D, R, T, N, V, and A;
- X6 is selected from the group consisting of Y, R_ K, E, and I;
- X8 selected from the group consisting of Q, T, N, F, V, and S;
- X9 is selected from the group consisting of A, K, N, S, Q, and E;
- X10 is G
- XI I is selected from the group consisting of R and K;
- X12 is G
- X13 is any residue other than R
- X14 is selected from the group consisting of R, L, W, G, and S;
- X15 is selected from the group consisting of G, Q, P, and ;
- X16 is selected from the group consisting of R and N;
- X17 is selected from the group consisting of , Q, V, E, T, M f and A,
- X18 is W
- X19 is selected from the group consisting of F, L, Y, E, I, and V;
- X20 is selected from the group consisting of S, M, and N;
- X21 is selected from the group consisting of P, Q, and D;
- X22 is selected from the group consisting of F, E, A, , Y, and V;
- X23 is selected from the group consisting of G and A
- X24 is selected from the group consisting of V and C, or is absent;
- X25 is selected from the group consisting of C and A, or is absent;
- X26 is selected from the group consisting of A and L, or is absent;
- X27 is selected from the group consisting of A, G, and S, or is absent;
- X28 is selected from the group consisting of N, G, Q, T, and C;
- X29 is selected from the group consisting of L, I, A, and F;
- X30 is selected from the group consisting of Y, M, A, V, and L;
- X31 is selected from the group consisting of L, G f V, I, and F;
- X32 is selected from the group consisting of S, T, and F.
- the present invention provides recombinant nucleic acids, comprising (a) a first nucleic acid domain encoding a promiscuous biotin protein ligase (BPL); and (b) a second nucleic acid domain encoding a bait polypeptide.
- BPL promiscuous biotin protein ligase
- the first nucleic acid domain encodes a protein comprising an amino acid sequence of general formula 1 :
- XI is selected from the group consisting of A, L, I, T, V, or is absent;
- X2 is selected from the group consisting of C, L, V, H, A, or is absent;
- X3 is selected from the group consisting of I, V, and L;
- X4 is selected from the group consisting of A and G;
- X5 is selected from the group consisting of E, D, R, T, N, V, and A;
- X6 is selected from the group consisting of Y, R, K, E, and I;
- X8 selected from the group consisting of Q, T, N, F, V, and S;
- X9 is selected from the group consisting of A, , N, S, Q, and E;
- XI I is selected from the group consisting of R and K,
- X12 is G
- R 14 is selected from the group consisting of R, L, W, G, and S;
- X15 is selected from the group consisting of G, Q, P, and K;
- X16 is selected from the group consisting of R and N;
- X17 is selected from the group consisting of , Q, V, E, T, M, and A;
- X19 is selected from the group consisting of F, L, Y, E, I, and V;
- X20 is selected from the group consisting of S, M, and N;
- X21 is selected from the group consisting of P, Q, and D;
- X22 is selected from the group consisting of F, E, A, K, Y, and V;
- X23 is selected from the group consisting of G and A
- X24 is selected from the group consisting of V and C, or is absent;
- X25 is selected from the group consisting of C and A, or is absent;
- X26 is selected from the group consisting of A and L, or is absent;
- X27 is selected from the group consisting of A, G, and S, or is absent;
- X28 is selected from the group consisting of N, G, Q, T, and C;
- X29 is selected from the group consisting of L, I, A, and F;
- X30 is selected from the group consisting of Y, M, A, V, and L;
- X31 is selected from the group consisting of L, G, V, I, and F;
- X32 is selected from the group consisting of S, T, and F.
- the present invention provides recombinant expression vectors comprising a recombinant nucleic acid of the invention.
- the invention provides recombinant host cells comprising a recombinant expression vector of the invention.
- the invention provides transgenic, non- human organism comprising a recombinant host cell of the invention.
- the present invention provides recombinant fusion proteins, comprising (a) a first domain encoding a biotin protein ligase (BPL); and (b) a second domain encoding a bait polypeptide.
- BPL biotin protein ligase
- the first domain comprises an amino acid sequence of general formula 1 :
- XI is selected from the group consisting of A, L, I, T, V, or is absent,
- X2 is selected from the group consisting of C, L, V, H, A, or is absent;
- X3 is selected from the group consisting of I, V, and L;
- X4 is selected from the group consisting of A and G;
- X5 is selected from the group consisting of E, D, R, T, N, V, and A;
- X6 is selected from the group consisting of Y, R, K, E, and I;
- X8 selected from the group consisting of Q, T, N, F, V, and S;
- X9 is selected from the group consisting of A, K, N, S, Q, and E;
- X10 is G; XI 1 is selected from the group consisting of R and K.;
- X12 is G
- X 13 is any residue other than R
- X14 is selected from the group consisting of R, L, W, G, and S;
- X 15 is selected from the group consisting of G, Q, P, and ;
- X16 is selected from the group consisting of R and N;
- X17 is selected from the group consisting of , Q, V, E, T, M, and A;
- X18 is W
- X19 is selected from the group consisting of F, L, Y, E, I, and V;
- X20 is selected from the group consisting of S, M, and N;
- X21 is selected from the group consisting of P, Q, and D;
- X22 is selected from the group consisting of F, E, A, K, Y, and V;
- X23 is selected from the group consisting of G and A
- X24 is selected from the group consisting of V and C, or is absent;
- X25 is selected from the group consisting of C and A, or is absent;
- X26 is selected from the group consisting of A and L, or is absent;
- X27 is selected from the group consisting of A, G, and S, or is absent;
- X28 is selected from the group consisting of N, G, Q, T, and C;
- X29 is selected from the group consisting of L, I, A, and F;
- X30 is selected from the group consisting of Y, M, A, V, and L;
- X31 is selected from the group consisting of L, G, V, I, and F;
- X32 is selected from the group consisting of S, T, and F.
- FIG. 1 Model for application of BioID method
- BirA* promiscuously biotinylates endogenous proteins in mammalian cells.
- HeLa cells were analyzed 24 h after transient transfection with myc-BirA- WT or myc-BirA* (Rl 18G). After transfection, cells were cultured either with or without supplemental biotin (50 ⁇ ).
- supplemental biotin 50 ⁇ .
- LaA HEK293 cells inducibly expressing myc-BirA*LaA, or parental controls, were analyzed 24 h after induction with or without excess biotin.
- the LaA fusion protein (asterisk) is detected with anti-myc.
- levels of endogenously biotinylated proteins are unaffected by the supplemental biotin; however, the biotinylation of endogenous proteins by myc-BirA*LaA is dramatically enhanced in the presence of excess biotin.
- the myc-BirA*LaA is detected at the nuclear rim and to a lesser extent in the nucleoplasm. Biotinylated proteins colocalize with the LaA fusion protein.
- SLAP75 is a novel peripheral membrane constituent of the NE identified with BioIDTM-LaA.
- F AM 169A/SL AP75 is a novel peripheral membrane constituent of the NE its subcellular localization was analyzed by immunofluorescence microcopy, (a) As detected with anti-SLAP75, the endogenous protein is co-localized with LaA at the NE of HEK293 cells, (b) Although endogenous SLAP75 is not detected in HeLa cells (not depicted), transiently expressed HA-SLAP75 detected with anti-FAM169A/SLAP75 is localized to the NE, labeled with anti-Nu l53. DNA was labeled with Hoechst . Bar, 10 ⁇ .
- FIG. 7 Comparison of Gl- and G2-BioIDTM-LaA by immuno- and streptavidin-blot analysis.
- G2 2 nd generation
- G2 1*
- BioK 1*
- Protein lysates from these cells prepared 24 hours after the addition of excess biotin were separated by SDS-PAGE and analyzed by immuno- and streptavidin-blotting.
- a similar level and pattern of biotinylation of endogenous proteins was observed with Gl- and G2-BioIDTM-LaA .
- FIG. 8a through 8c Exemplary Biotin Protein Ligase Amino Acid Sequences.
- the term "about” means +/- 5% of the relevant unit of measurement.
- amino acid residues are abbreviated as follows: alanine (Ala; A), asparagine (Asn; N), aspartic acid (Asp; D), arginine (Arg; R), cysteine (Cys; C), glutamic acid (Glu; E), glutamine (Gin; Q), glycine (Gly; G), histidine (His; H), isoleucine (He; I), leucine (Leu; L), lysine (Lys; ), methionine (Met; M), phenylalanine (Phc; F), proline (Pro; P), serine (Scr; S), threonine (Thr; T), tryptophan (Trp; W), tyrosine (Tyr; Y), and valine (Val; V).
- the present invention provides methods for identifying in vivo proximate proteins, comprising
- culturing recombinant cells comprising recombinant nucleic acids capable of directing expression of a fusion protein comprising (i) a heterologous promiscuous biotin protein ligase (BPL), or functional equivalent thereof, and (ii) a bait polypeptide, wherein the culturing is carried out under conditions suitable for expression of the fusion protein in the eukaryotic cells; and
- BPL heterologous promiscuous biotin protein ligase
- biotinylated proteins are proteins present in the cells proximate to the bait polypeptide.
- BioIDTM for proximity-dependent biotin identification
- this approach is based on fusion of a promiscuous biotin protein ligase to a targeting protein (the "bait polypeptide”).
- BioIDTM features proximity-dependent biotinylation of proteins that are near- neighbors of the fusion protein.
- the examples presented herein demonstrate that the methods and reagents of the invention are generally applicable method to screen for both interacting and neighboring proteins in their native cellular environment.
- BioIDTM is effective when applied to insoluble and membrane associated proteins, two classes of proteins that may be refractory to screening with conventional approaches. Additional advantages include the ability to identify weak and/or transient interactions, to screen for interactions in a relatively natural cellular setting and temporal inducibility of biotin labeling
- the proteins present in the cells proximate to the bait polypeptide are candidate proteins for interacting with the bait polypeptide.
- proximate means within at least about 25 nm or closer.
- biotinylated proteins are proteins present in the cells within at least about 20 nm of the bait polypeptide.
- biotinylated proteins are proteins present in the cells within at least about 19, 18, 17, 16, 15, 14, 13, 12, 11, 10 nm of the bait polypeptide, or closer.
- the methods can be used with any recombinant cell type that capable of expressing the fusion protein.
- the cells may be prokaryotic or eukaryotic. Non- limiting embodiments include bacterial cells, trypanosomal cells, protozoan cells, fungal cells, and mammalian cells.
- heterologous means that the BPL is not naturally expressed in the cell (i.e., the gene encoding the promiscuous biotin protein ligase is recombinantly introduced in the cell).
- the BPL is "promiscuous,” in that it is not selective for its endogenous substrate, and thus shows improved proximity-dependent biotinylation of proteins.
- the promiscuous BPL may be a naturally occurring BPL f agment; in another embodiment, the promiscuous biotin protein ligase is a mutated biotin protein ligase, or a fragment thereof.
- Any suitable promiscuous BPL can be used in the methods of the invention. Virtually every organism has a BPL that has been identified and its amino acid sequence demonstrated. Any such promiscuous BPL fragment, or BPL mutation can be used in the methods of the invention, so long as it is not endogenous to the recombinant cell.
- the BPL is a fragment of a wild-type BPL, including but not limited to a fragment of any of the BPLs in Figure 9, or functional equivalent thereof.
- the functional equivalent can be any modification to the promiscuous BPL that retains BPL activity (i.e.: insertions, deletions, point mutations, etc.)
- the BPL is deleted for all or part of its DNA binding domain.
- the DNA binding domain is not conserved beyond some prokaryotes and is fragmented in some BPLs.
- the E. eeli BPL (SEQ ID NO: 1) is found between residues 22-46.
- “deleted for” includes both BPLs that naturally lack all or a part of their DNA binding domain, and BPLs that are engineered to delete all or a part of their DNA binding domain.
- the BPL may be full length but with one or more mutations, such as mutations to increase "promiscuity" of the BPL. Exemplary such embodiments are shown in Figure 8, which possess an R to G mutation to increase promiscuity (SEQ ID NOS: 1-4, 6-12).
- One such embodiment is the E. colt BirA protein (SEQ ID NO: 1).
- the BPL comprises a protein with an amino acid sequence of general formula 1 :
- I is selected from the group consisting of A, L, ⁇ , T, V, or is absent;
- X2 is selected from the group consisting of C, L, V, H, A, or is absent;
- X3 is selected from the group consisting of I, V, and L;
- X4 is selected from the group consisting of A and G;
- X5 is selected from the group consisting of E, D, R, T, N, V, and A;
- X6 is selected from the group consisting of Y, R, K, E, and I;
- X8 selected from the group consisting of Q, T, N, F, V, and S;
- X9 is selected from the group consisting of A, K, N, S, Q, and E;
- X10 is G
- XI 1 is selected from the group consisting of R and K.
- X12 is G
- X13 is any amino acid other than R
- X14 is selected from the group consisting of R, L, W, G, and S;
- X15 is selected from the group consisting of G, Q, P, and K;
- X16 is selected from the group consisting of R and N;
- X17 is selected from the group consisting of , Q, V, E, T, M, and A;
- X18 is W
- X19 is selected from the group consisting of F, L, Y, E, I, and V;
- X20 is selected from the group consisting of S, M, and N;
- X21 is selected from the group consisting of P, Q, and D;
- X22 is selected from the group consisting of F, E, A, K, Y, and V;
- X23 is selected from the group consisting of G and A
- X24 is selected from the group consisting of V and C, or is absent;
- X25 is selected from the group consisting of C and A, or is absent;
- X26 is selected from the group consisting of A and L, or is absent;
- X27 is selected from the group consisting of A, G, and S, or is absent,
- X28 is selected from the group consisting of N, G, Q, T, and C;
- X29 is selected from the group consisting of L, I, A, and F;
- X30 is selected from the group consisting of Y, M, A, V, and L;
- X31 is selected from the group consisting of L, G, V, I, and F;
- X32 is selected from the group consisting of S, T, and F.
- Proteins according to general formula 1 include the BPL active site based on a number of known BPLs (See Figure 9— sequence alignment; residues 511 to 542). In many BPLs, the XI 3 position is R; mutating this to any other amino acid helps to make the resulting BPL more promiscuous. In an exemplary embodiment, X13 is G. As will be understood by those of skill in the art, there are many BPLs that in which the active site may include residues different from those recited in general formula I. Thus, this embodiment is an exemplary embodiment and is not intended to be limiting on the scope of the invention.
- X4 is A
- X8 is T or S
- X13 is G
- X1 is L, I, or V
- X20 is S
- X23 is G
- X2 is L or I
- X30 is A. V, or L;
- X31 is L, G, V, or I;
- X32 is S.
- the BPL comprises or consists of an amino acid sequence according to SEQ ID NO: 14, as represented in Table 1 below. Numbering in the Table is based on the numbering in the sequence alignment shown in Figure 9.
- Proteins with an amino acid sequence according to SEQ ID NO: 14 include the BPL active site and additional residues required for BPL activity based on a number of known BPLs (See Figure 9; residues 506 to 796). As will be understood by those of skill in the art, there arc many BPLs that in which the amino acid sequence required for activity may include residues different from those recited in SEQ ID NO: 14.
- this embodiment is an exemplary embodiment and is not intended to be limiting on the scope of the invention.
- Residues X1-X32 of general formula 1 are shown in Table 1 at residues 511- 542.
- the BPL for use in the invention comprises any residue other than R, including but not limited to a G residue at position 523 (XI 3 in general formula 1 ).
- the invariant R residue in wild type BPLs at position X13 is modified, which results in enhanced promiscuity of the resulting BPL
- the E. coli BirA* protein of SEQ ID NO: 1 or the A. eolicus BPL protein mutant of SEQ ID NO: 2, or functional equivalents thereof are used
- the bait polypeptide maybe any bait polypeptide deemed suitable for a given use.
- the bait polypeptide can be anything that will directly or indirectly target the protein to specific cellular regions. It can be a full length protein or any fragment that imparts some kind of targeting or binding property.
- the bait polypeptide may comprise lamin-A (P02545; SEQ ID NO: 18), frataxin (Q16595; SEQ ID NO: 19), sun2 (Q9UH99; SEQ ID NO: 20), Nupl07 (P57740; SEQ ID NO: 21), torsin-A (014656; SEQ ID NO: 22), or fragments or homologues thereof.
- the BPL and the bait polypeptide may be immediately adjacent in the fusion protein, or may be separated by any number of additional amino acids as desirable for a given purpose. Including such a "linker" region between the BPL and the bait polypeptide may, for example, extend the range of biotinylation and/or provide the enzyme with increased accessibility to label adjacent proteins.
- the BPL protein may be anywhere within the fusion protein, even internal between other domains N- terminal and C-terminal to the promiscuous BPL.
- the recombinant cells are capable of expressing the fusion protein; thus, the recombinant cells comprise nucleic acids encoding the fusion protein, including but not limited to recombinant nucleic acids and vectors as described below.
- the fusion protein-encoding nucleic acids in the recombinant cells may be present in an expression vector, such as a plasmid or viral-based vector. Any expression vector suitable for an intended use can be used in the present invention. Such expression vectors can be of any type known in the art, including but not limited to plasmid and viral-based expression vectors.
- expression vectors for use in transfccting prokaryotic and eukaryotic cells is also well known in the art, and thus can be accomplished via standard techniques.
- the expression vector must be rcplicablc in the host organisms either as an episome or by integration into host chromosomal DNA.
- the expression vector comprises a plasmid.
- the invention is intended to include other expression vectors that serve equivalent functions, such as viral vectors.
- Specifics of the expression vector will depend on the ultimate desired use. Designing appropriate expression vectors for an intended use is well within the level of those of skill in the art based on the teachings herein.
- the fusion protein-encoding nucleic acids in the expression vector are under control of a promoter capable of directing expression of the encoded fusion protein in the recombinant cell.
- a promoter capable of directing expression of the encoded fusion protein in the recombinant cell.
- Any suitable promoter may be used that can direct expression (i.e.: is "operatively linked") of the encoded proteins.
- the term "promoter” includes any nucleic acid sequence sufficient to direct expression of the encoded protein(s), including inducible promoters, repressible promoters and constitutive promoters. If inducible, there are sequences present which mediate regulation of protein expression so that the polynucleotide is transcribed only when an inducer molecule is present.
- the expression vector may comprise any other control sequences as may be suitable for an intended use.
- the control sequences need not be contiguous with the nucleic acid sequences, so long as they function to direct the expression thereof.
- intervening untranslated yet transcribed sequences can be present between a promoter sequence and the nucleic acid sequences and the promoter sequence can still be considered "operably linked" to the coding sequence.
- Other such control sequences include, but are not limited to, polyadenylation signals, enhancers, termination signals, and ribosome binding sites.
- the promoter sequence used to drive expression of the fusion protein may be constitutive (driven by any of a variety of promoters, including but not limited to, CMV, SV40, RSV, actin, EF) or inducible (driven by any of a number of inducible promoters including, but not limited to, tetracycline, ecdysone, steroid-responsive, etc.).
- the fusion protein-encoding nucleic acids may be integrated into cellular chromosomal DNA
- the fusion protein-encoding nucleic acids may comprise a "knock-in" of the fusion protein- encoding nucleic acids into an endogenous gene.
- the fusion protein-encoding nucleic acid may be knocked-in to any suitable cellular chromosomal gene.
- the knock-in comprises a mouse knock-in to any suitable mouse cellular chromosomal gene.
- the method comprises supplementation of the culture medium with biotin.
- biotin is added to the culture medium, it can be added at any suitable concentration, including but not limited to those described in the examples that follow.
- biotin can be added at con concentrations ranging from about 1 um to about 100 urn, or at about 50 um.
- Suitable culture conditions will depend on the cell type used and all other specifics of a given assay, as will be understood by those of skill in the art.
- the recombinant cells are cultured in the presence of biotin (endogenous or exogenously added) for between about 0.5 and about 96 hours.
- the recombinant cells are cultured in the presence of biotin (endogenous or exogenously added) for between about 1 and about 96 hours, about 2 and 96 hours, about 6 and 96 hours, about 12 and 96 hours, about 24 and 96 hours, about 0.5 and 72 hours, about 0.5 and 48 hours, about 0.5 and 24 hours, about 1 and 96 hours, about 1 and 72 hours, about 1 and 48 hours, about 1 and 24 hours, about 2 and 96 hours, about 2 and 72 hours, about 2 and 48 hours, about 2 and 24 hours, about 6 and 96 hours, about 6 and 72 hours, about 6 and 48 hours, about 6 and 24 hours, about 12 and 96 hours, about 12 and 72 hours, about 12 and 48 hours, about 12 and 24 hours, about 24 and 96 hours, about 24 and 72 hours, or about 24 and 48 hours.
- biotin endogenous or exogenously added
- identifying the biotinylated proteins comprises isolating the biotinylated polypeptides. This embodiment is optional, as one can use any technique for identifying the biotinylated proteins, including but not limited to by mass spectrometry analysis of crude cell lysates. In embodiments where the biotinylated proteins are isolated, any suitable method can be used, including but not limited to affinity purification, mass spectrometry, and Western blotting.
- the method comprises the following;
- each experimental condition eg. control, fusion protein
- plate at least two wells of cells, one with glass coverslips and the other without.
- ECL enhanced chemiluminescence
- the method comprises the following: Begin with four 10 cm dishes for each experimental condition (cells expressing fusion protein or control cells).
- Incubate cells for suitable time period such as 18-24 hours.
- Triton X- 100 such as a final concentration -25% and mix.
- the present invention provides recombinant nucleic acids, comprising:
- BPL biotin protein ligase
- the BPL may be any BPL as deemed appropriate for a given use, including but not limited to any of those disclosed herein.
- the BPL comprises a protein with an amino acid sequence of general formula 1 :
- XI is selected from the group consisting of A, L, I, T, V, or is absent;
- X2 is selected from the group consisting of C, L, V, H, A, or is absent;
- X3 is selected from the group consisting of I, V, and L;
- X4 is selected from the group consisting of A and G;
- X5 is selected from the group consisting of E, D, R, T, N, V, and A;
- X6 is selected from the group consisting of Y, R, K, E, and I;
- X8 selected from the group consisting of Q, T, N, F, V, and S;
- X9 is selected from the group consisting of A, K, N, S, Q, and E; X10 is G;
- XI 1 is selected from the group consisting of R and K;
- X12 is G
- X13 is any residue other than R
- X14 is selected from the group consisting of R, L, W, G, and S;
- X15 is selected from the group consisting of G, Q, P, and K;
- X16 is selected from the group consisting of R and N;
- X17 is selected from the group consisting of K, Q, V, E, T, M, and A;
- X18 is W
- X19 is selected from the group consisting of F, L, Y, E, I, and V;
- X20 is selected from the group consisting of S, M, and N;
- X21 is selected from the group consisting of P, Q, and D;
- X22 is selected from the group consisting of F, E, A, , Y, and V;
- X23 is selected from the group consisting of G and A
- X24 is selected from the group consisting of V and C, or is absent;
- X25 is selected from the group consisting of C and A, or is absent;
- X26 is selected from the group consisting of A and L, or is absent;
- X27 is selected from the group consisting of A, G, and S, or is absent;
- X28 is selected from the group consisting of N, G, Q, T, and C;
- X29 is selected from the group consisting of L, I, A, and F;
- X30 is selected from the group consisting of Y, M, A, V, and L;
- X31 is selected from the group consisting of L, G, V, I, and F;
- X32 is selected from the group consisting of S, T, and F.
- Proteins according to general formula 1 include the BPL active site based on a number of known BPLs (Sec Figure — sequence alignment, residues 511 to 542). As will be understood by those of skill in the art, there are many BPLs that in which the active site may include residues different from those recited in general formula I. Thus, this embodiment is an exemplary embodiment and is not intended to be limiting on the scope of the invention.
- X4 is A
- X8 is T or S
- X1 is L, I, or V
- X20 is S
- X23 is G
- X29 is L or I
- X30 is A. V, or L;
- X31 is L, G, V, or I;
- X32 is S.
- X13 is G.
- the invariant R residue in wild type BPLs at position XI 3 is modified to any other amino acid residue, including but not limited to G, which results in enhanced promiscuity of the resulting BPL.
- the BPL comprises or consists of an amino acid sequence according to SEQ ID NO: 14, as represented in Table 1 below. Numbering in the Table is based on the numbering in the sequence alignment shown in Figure 9.
- Proteins with an amino acid sequence according to SEQ ID NO: 14 include the BPL active site and additional residues required for BPL activity based on a number of known BPLs (See Figure 9; residues 506 to 796). As will be understood by those of skill in the art, there are many BPLs that in which the amino acid sequence required for activity may include residues different from those recited in SEQ ID NO: 14.
- this embodiment is an exemplary embodiment and is not intended to be limiting on the scope of the invention.
- the BPL for use in the invention comprises any residue other than R, including but not limited to a G residue at position 523 (XI 3 in general formula 1).
- the BPL comprises or consists of a BPL comprising or consisting of an amino acid sequence selected from the group consisting of SEQ ID NO:l-4 and 6-12. These amino acid sequences are also shown in Figure 8a through
- the E. coli BirA* protein of SEQ ID NO: 1 or the A. aeolicus BPL mutant protein of SEQ ID NO: 2, or functional equivalents thereof are used.
- the nucleic acid encoding the bait polypeptide may be any bait polypeptide deemed suitable for a given use.
- the bait polypeptide may comprise lamin-A (SEQ ID NO: 18), f ataxin (SEQ ID NO: 19), sun2 (SEQ ID NO: 20), Nup 107 (SEQ ID NO: 21), torsin-A (SEQ ID NO: 22), or fragments or homologues thereof.
- the coding regions for the BPL and the bait polypeptide may be immediately adjacent in the fusion protein, or may be separated by any number of additional nucleic acids as desirable for a given purpose. Including such a "linker" region between the BPL and the bait polypeptide coding regions may, for example, extend the range of biotinylation and/or provide the encoded fusion protein with increased accessibility to label adjacent proteins.
- the recombinant nucleic acid may be of any type, including DNA and RNA, and may be single stranded or double stranded. It will be understood by those of skill in the art that the first domain and the second domain can be present in any order in the recombinant nucleic acid (i.e. : the first domain may be 5 ' or 3 ' to the second domain); similarly the BPL protein may be anywhere within the fusion protein, even internal between other domains N-terminal and C-terminal to the BPL.
- the recombinant nucleic acid may further comprise "spacer" nucleotides between the first domain and the second domain; such spacer sequences may be of any length suitable for a given purpose, as will be understood by those of skill in the art.
- the fusion protein-encoding nucleic acids in the recombinant cells may be present in an expression vector, such as a plasmid or viral- based vector.
- an expression vector such as a plasmid or viral- based vector.
- Any expression vector suitable for an intended use can be used in the immunogenic compositions of the present invention.
- Such expression vectors can be of any type known in the art, including but not limited to plasmid and viral-based expression vectors.
- the construction of expression vectors for use in transfecting prokar otic or eukaryotic cells is also well known in the art, and thus can be accomplished via standard techniques. (See, for example, Sambrook, Fritsch, and Maniatis, in; Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, 1 89; Gene Transfer and Expression Protocols, pp.
- the expression vector must be replicable in the host organisms either as an episome ⁇ by integration into host chromosomal DNA.
- the expression vector comprises a plasmid.
- the invention is intended to include other expression vectors that serve equivalent functions, such as viral vectors. Specifics of the expression vector will depend on the ultimate desired use. Designing appropriate expression vectors for an intended use is well within the level of those of skill in the art based on the teachings herein.
- the first domain and the second domain are operatively linked to a promoter.
- Any suitable promoter may be used that can direct expression (i.e.: is "operatively linked") of the encoded fusion protein.
- the term "promoter” includes any nucleic acid sequence sufficient to direct expression of the encoded protein(s), including inducible promoters, repressible promoters and constitutive promoters. If inducible, there are sequences present which mediate regulation of protein expression so that the polynucleotide is transcribed only when an inducer molecule is present. Such cis-active sequences for regulated expression of an associated polynucleotide in response to environmental signals are well known to the art.
- the expression vector may comprise any other control sequences as may be suitable for an intended use.
- the control sequences need not be contiguous with the nucleic acid sequences, so long as they function to direct the expression thereof. Thus, for example, intervening untranslated yet transcribed sequences can be present between a promoter sequence and the nucleic acid sequences and the promoter sequence can still be considered "operably linked" to the coding sequence.
- Other such control sequences include, but are not limited to, polyadenylation signals, enhancers, termination signals, and ribosome binding sites.
- the promoter sequence used to drive expression of the fusion protein may be constitutive (driven by any of a variety of promoters, including but not limited to, CMV, SV40, RSV, actin, EF) or inducible (driven by any of a number of inducible promoters including, but not limited to, tetracycline, ecdysone, steroid-responsive, etc.).
- the present invention provides recombinant fusion proteins expressed from the recombinant nucleic acids of the invention.
- the fusion proteins comprises a BPL according to any embodiment or combination of the embodiments disclosed herein and a bait polypeptide, including but not limited to lamin-A, frataxin, sun2, Nupl07, torsin-A, or portions thereof.
- the isolated polypeptides can be used, for example, in contacting candidate proteins in an extracellular environment to test verify candidate protein interactions with the bait polypeptide.
- the present invention provides recombinant host cells comprising an expression vector of the present invention.
- the cells may be prokaryotic or eukaryotic. Non-limiting embodiments include bacterial cells, trypanosomal cells, protozoan cells, fungal cells, vertebrate cells, and mammalian cells (such as mouse cells).
- the expression vectors may be transiently or stably present in the host cell.
- the recombinant host cells of the invention are capable of expressing the fusion protein, and thus can be used in the methods of the invention.
- the recombinant host cell comprises a recombinant nucleic acid of any embodiment or combination of embodiments of the present invention knocked-in to an endogenous host cell gene.
- a "knock-in” refers to insertion of the recombinant nucleic acid encoding the fusion protein at a specific chromosomal locus (i.e., a targeted insertion).
- the fusion protein-encoding nucleic acid may be knocked-in to any suitable cellular chromosomal gene.
- the knock-in is generated in a cell (such as an embryonic stem cell) of a non-human organism, such as a mammal including but not limited to a mouse. Knock-in procedures are well defined in a variety of organisms where embryonic stem cells are easily obtained and manipulated.
- the present invention provides a transgenic, non- human organism comprising a recombinant host cell of the present invention.
- the organism is a mammal.
- the organism is a mouse.
- recombinant host cells of the present invention comprise, embryonic stem cells which are then used to produce the transgenic non- human organisms of the present invention using standard techniques in the art.
- biotin protein ligase is fused to a protein of interest, and then introduced into mammalian (or other) cells where it will biotinylatc vicinal proteins upon supplementation of the culture medium with biotin. Biotinylated proteins can then be selectively isolated and identified by conventional methods, most notably mass spectrometry.
- BioIDTM a strategy for identifying candidate proteins that are proximate to and/or interact with human lamin A (LaA), a well-characterized component of the nuclear envelope (NE), a specialized extension of the endoplasmic reticulum that surrounds the nuclear contents during interphase.
- LaA is an intermediate filament protein and member of the A-type lamin family that is encoded by the LMNA gene (Gerace and Huber, 2012). Together with B-type lamins, the A-type lamins are constituents of the nuclear lamina, a filamentous protein meshwork that is intimately associated with the inner nuclear membrane (INM), the membranous portion of the NE that faces the interior of the nucleus. This association is mediated, at least in part, by multiple interactions with integral INM proteins.
- INM inner nuclear membrane
- nuclear pore complexes large multi-protein channels that span the nuclear membranes and which mediate nucleocytoplasmic trafficking of macromolecules, are anchored to the nuclear lamina (Aaronson and Blobel, 1975; Dwyer and Blobel, 1976). Although the bulk of the A- and B-type lamins are localized to the nuclear lamina, a nucleoplasmic population is thought to function in various aspects of nuclear metabolism, including transcription and replication (Mon ⁇ et al., 2000; Goldman et al., 2002).
- the nuclear lamina In mammalian somatic cells, the nuclear lamina is roughly 15-20 tun thick and is considered to represent an important structural element of the NE (Gerace and Huber, 2012). Indeed, the role of the nuclear lamina as a determinant of both NE and global nuclear architecture has been highlighted by findings that mutations in the LMNA gene are linked to multiple human diseases including muscular dystrophy, lipodystrophy, and premature aging syndromes ⁇ Worman et al., 2009; Worman, 2012). Many of these disorders, known as laminopathies, are associated with oftentimes gross perturbations in nuclear and NE organization. To better understand the etiology of the laminopathies, much effort has been focused on identifying lamin- interacting proteins.
- BioIDTM to identify LaA-proximal proteins has revealed a number of abundant candidates among which are known inteiactors of LaA. These include integral proteins of the INM as well as NPC components. Less abundant candidates fall into functional categories that include transcription, chromatin regulation, RNA processing, and DNA repair. An uncharacterized protein was also among the more prominent candidates revealed by LaA BiolDTM. We demonstrate that this protein, which we have named SLAP75, is a novel constituent of the NE that appears to be expressed in a cell type-specific fashion. Taken together, these findings demonstrate that BiolDTM is an effective method to screen for proximate and interacting proteins. This relatively simple and rapid technique has broad applicability to monitor protein behavior in live cells, providing a number of advantages over existing methods.
- BirA is a 35-kD DNA-binding biotin protein ligase in Escherichia coli that regulates the biotinylation of a subunit of acetyl-CoA carboxylase and acts as a transcriptional repressor for the biotin biosynthetic operon (Chapman-Smith and
- BirA has been harnessed for experimental applications, including use in eukaryotic cells.
- the BirA acceptor-peptide system takes advantage of the extreme specificity of BirA in biotinylating its substrate peptide (Beckett et al., 1999).
- a biotin acceptor tag BAT
- BAT biotin acceptor tag
- This leads to the biotinylation of the BAT sequence permitting one-step high affinity (Kj 10 ⁇ H M; Green, 1 63) avidin streptavidin-mediated purification of the tagged protein.
- biotinylation is a rare modification, in mammalian cells it is restricted primarily to only a few carboxylases (Chapman-Smith and Cronan, 1 99); BAT-independent binding is minimal.
- Biotinylation by BirA is a two-step process. The first of these combines biotin and ATP to form biotinoyl-5'-AMP (bioAMP; Lane et al., 1964). This activated biotin is held within the BirA active site until it reacts with a specific lysine residue of the BAT sequence in the second step.
- the problem with BirA lies with its stringent selectivity for its endogenous substrate. What we desired was a far more promiscuous biotin ligase. This requirement led us to certain BirA mutants that prematurely release the highly reactive yet labile bioAMP (Kwon and Beckett, 2000; Streaker and Beckett, 2006).
- BirA* Ras 18G, hereafter called BirA*
- BirA-WT Kwon and Beckett
- BirA* expression results in promiscuous protein biotinylation because free bioAMP will readily react with primary amines. More significantly however, it has been demonstrated in vitro that BirA* will promiscuously biotinylate proteins in a proximity-dependent fashion (Choi-Rhee et al., 2004; Cronan, 2005).
- BirA* would promiscuously biotinylate proteins in live mammalian cells.
- HEK293 cells that stably and inducibly express myc-BirA*LaA (Fig. 1 b).
- myc-BirA*LaA localizes predominantly to the nuclear envelope, similar to both endogenous LaA and LaA harboring an N-terminal GFP tag, a modification that does not appear to alter the function of LaA (Broers et al., 1999; Shumaker et al. 5 2006).
- streptavidin-HRP As is the case with myc-BirA* alone, the presence of 50 ⁇ biotin in the culture medium strongly stimulates biotinylation of a wide range of endogenous proteins, in addition to myc-BirA *LaA itself (Fig. 3 a).
- Microscopy using fluorescent streptavidin reveals that the bulk of these biotinylated proteins must reside at the NE and colocalize with myc-BirA*LaA (Fig. 3 b).
- proteins in the vicinity of myc-BirA*LaA are preferentially biotinylated. It should be noted that not only does the intracellular localization of these biotinylated proteins differ between myc-BirA* (predominantly nucleoplasmic) and myc-BirA*LaA (predominantly at the NE), but their electrophoretic mobilities and hence identities also differ as revealed by Western blot analysis (Figs. 2 a and 3 a). These results suggest that BirA* can be targeted to a specific cellular location and will biotinylate endogenous proteins in a proximity-dependent manner.
- Biotinylated proteins were captured with streptavidin immobilized on paramagnetic beads, rigorously washed, and bound proteins analyzed by mass spectrometry. Proteins unique to the BiolDTM-LaA (myc-BirA*LaA) pull-down, and not detected with identical pull-downs from control cells, were categorized based on localization and function (Fig. 3 e). The relative abundance of the identified proteins within each category is given as a percentage of the total. The bulk of the proteins identified by BioIDTM-LaA are known NE components, including a number of INM proteins.
- LAP2 lamina-associated polypeptide 2
- LAPl lamina-associated polypeptide 1
- LAPl TORIAIP
- LAPl has a documented association with LaA (Foisner and Gerace, 1993), as does LAP2a, a soluble LAP2 isoform that also appeared prominently in the dataset (Dechat et al., 2000).
- EMD emerin
- MAN1 LMD3; Mansharamani and Wilson, 2005
- BioIDTM-LaA An additional ⁇ protein detected in our screen was SAMP1 (TMEM201).
- SAMP1 was originally identified in a proteomic analysis of rat liver nuclear membrane proteins (Schirmer et al., 2003).
- a recent study suggests that SAMP1 is part of a protein network that includes A-type lamins and L1NC complexes. The latter are evolutionarily conserved protein assemblies that span the NE and couple nucleoskeletal and cytoskeletal structures (Burke and Roux, 2009).
- BioIDTM- LaA candidates Several additional classes of proteins were represented among the BioIDTM- LaA candidates, albeit at lower levels. These included proteins associated with DNA repair, transcription, chromatin regulation, and RNA-processing. Proteins considered to be components of a nucleoskeleton were also detected, the most abundant of which was filamin A (FLNA; Castano et al., 2010).
- FAM169A An uncharacterized protein of 75 kD, FAM169A (KIAA0888), featured prominently in the BioIDTM-LaA dataset. FAM169A has no predicted transmembrane domain and lacks any sequence motifs that might provide clues to its function.
- FAM169A is a novel NE constituent
- Fig. 4 a clearly shows that FAM169A is concentrated at the NE. Differential permeabilization of HE 293 cells with digitonin versus Triton X-100 indicates that FAM169A resides on the nuclear face of the NE.
- BioIDTM provides a means of identifying neighboring and potentially interacting proteins in vivo.
- the method takes advantage of BirA*, a highly promiscuous form of the E. coti BirA biotin protein ligase. BirA* may be targeted to specific subcellular locations by fusion to a "bait" protein. Nearby proteins, biotinylated by BirA*, can then be recovered in a single step on streptavidin-coated beads and identified by mass spectrometry.
- the only requirement for BioIDTM is the expression of a single fusion protein.
- BioIDTM is applicable to map protein associations in essentially any accessible cell type, mammalian or otherwise.
- the yeast two-hybrid ( ⁇ 2 ⁇ ) system takes advantage of the ability of hybrid transcription factor domains to functionally associate, thereby driving expression of reporter genes.
- the second strategy is based upon co-immunoprecipitation or pull-down, frequently involving expression of single- or double-tagged bait proteins. Irnmunoprecipitated proteins are then identified by mass spectrometry.
- a significant attribute of the Y2H approach is that because it is based on a cDNA library screen, it is more likely to detect weak interactions or interactions between low abundance proteins. Furthermore, it is the method of choice where the focus is on proteins that may only be expressed in rare cell types.
- the pull-down approach has provided valuable data in a variety of systems.
- the first of these is the problem of scale when dealing with low abundance proteins. Simply put, such proteins may not be detected where it proves impractical to prepare or manipulate sufficient start material.
- the second limitation concerns solubility. Conditions required to sohibilize many bait proteins may not be compatible with preserving interactions with partner proteins and vice versa. This becomes especially significant when considering weak interactions. In the case of lamin A, a highly insoluble protein, this has proved to be a serious stumbling block in the reliable identification of interacting proteins. Recently, Kubben et al. (2010) have introduced a work-around for this problem. They have used chemical cross-linking to stabilize lamin complexes before solubilization and pull- down.
- BioIDTM provides an improvement to these prior approaches in the characterization of potential protein-protein interactions and near-neighbor analyses. BioIDTM uniquely combines two important attributes. The first of these is that it detects potential interactions in their normal cellular context. The second is that it sidesteps issues associated with bait or prey protein solubility. Because the key step of biotinylation occurs before solubilization it will detect both weak and transient interactions. Both of these features are highlighted in the present example using BioIDTM-LaA data where both soluble and membrane proteins were efficiently detected.
- BioIDTM relies on the expression of an exogenous protein that is fused to BirA*, a protein slightly larger than GFP. With respect to myc-BirA*LaA, the fusion protein appears to be targeted appropriately to the nuclear lamina where it shares the same solubility properties as both wild-type and GFP-taggcd LaA. A more subtle issue may arise through biotinylation. Although we observed no evidence of a detrimental effect, our studies have used the addition of excess biotin to cell culture media to enhance the biotinylation of vicinal proteins. The covalent attachment of biorin to primary amines, predominantly lysines, leads to the loss of charge on these sites and at the same time could inhibit other secondary modifications.
- BioIDTM is contingent upon the ability to biotinylate neighboring proteins, which is in turn dependent on the number and availability of primary amines in these proteins. Consequently, the abundance of the biotinylated proteins should not be used to indicate the strength or abundance of an association. Similarly, the absence of biotinylation does not rule out interaction or proximity. BioIDTM-mediated biotinylation is not generally used to validate an actual protein interaction, but instead is used as a screen to identify candidates that can be subsequently investigated systematically or in a hypothesis-based manner.
- BioIDTM identifies well-characterized protein interactors of LaA, including a number of proteins detected by Kubben et al. (2010) using complex purification in combination with chemical cross-linking (LAP1, LAP2 isoforms, Emerin, and MAN1). It is clear that most (if not all) of the more abundant proteins identified with BioIDTM-LaA, amounting to more than 50% of those detected, largely reside in close proximity to the INM. These could fell into the transient, indirect, or vicinal categories.
- BioIDTM identified SLAP75, a previously uncharacterized protein that is clearly enriched at the nuclear lamina.
- Other NE proteins in the dataset are lamins B 1 and B2 and lamin B receptor (LBR).
- LBR lamin B receptor
- the relatively low level of detection of B-type lamins could be a reflection of findings that A- and B-type lamins may be segregated into separate filament systems (Shiim et al., 2008).
- LBR is not known to interact with A-type lamins and its appearance could be simply a consequence of indirect interactions and/or proximity. However, it was detected by Kubben et al. (2010) using their approach of LaA affinity-capture combined with chemical cross-linking.
- nesprin- 1 and nesprin-2 are LINC complex and NE components.
- FHL1 although apparently nucleoplasmic and cytoplasmic (there are three splice isoforms), was detected by BioIDTM-LaA. This raises the possibility that these proteins may constitute an interaction network that if disrupted gives rise to the common phenotype of EDMD (Simon and Wilson, 2011).
- BioIDTM-LaA Some of the proteins identified by BioIDTM-LaA are classified as either cytoplasmic or ER residents. The latter are all membrane proteins. It is possible that at least some of these could have access to the ⁇ ⁇ , although not concentrate there. Certainly there is precedent for this (Torrisi and Bonatti, 1985; Torrisi et al., 1 87). Alternatively, these cytoplasmic and ER proteins might become biotinylated during mitosis when the NE breaks down and lamins aie dispersed throughout the cytoplasm. We are currently investigating the application of BioID in synchronized cell populations that may shed light on these possibilities.
- ACE angiotensin-converting enzyme
- cytoplasmically oriented NPC proteins including Nup214 and Nup358, were found to be biotinylated. As with the cytoplasmic and ER proteins, it is possible that this biotinylation occurs during mitosis. It is also possible that this might occur during nuclear import of myc-BirA*LaA. However, the fact that import receptors were not detected in the screen places a question mark over this. On the other hand, the large size of these nucleoporins may have biased their identification by mass spectrometry.
- nucleoplasmic NPC proteins such as Nupl 53, Nup50, ELYS, and TPR likely reflect the close association between the NPCs and the lamina (Daigle et at., 2001).
- Nupl53 has already been shown to interact with LaA (Al-Haboubi et al., 201 1). These interactions could explain the altered distribution of NPCs observed in LMNA- deficient cells (Sullivan et al., 1 99).
- BioIDTM A significant finding that lends additional credence to the utility of BioIDTM is the observation that histones, which are lysine rich and highly abundant in the nucleus, constitute a disproportionately small f action of the identified proteins. This indicates that BioIDTM is not generating widespread biotinylation, but is more selectively labeling only those proteins in immediate proximity to the fusion protein. This could also be inferred from our fluorescence microscopy data where the streptavidin labeling is colocalized with the myc-BirA*LaA and restricted largely to the NE. It should also be noted that low levels of histones are reported to be biotinylated in vivo ( uroishi et al., 2011). We detected biotinylated histones (HI .3/H1.4 and HI .0) in our control preparations at levels substantially lower than the four endogenously biotinylated mammalian carboxylases.
- BioIDTM provides a powerful new approach to probe protein interactions and proximity in a variety of cell types, it is a technique that is accessible to a broad range of researchers comfortable working with conventional molecular and cell biology techniques, and does not require specialized equipment other than the proteomic analysis that has become a commonly available service. . While the present example focuses on mammalian cells, BiolDTM can be applied in cells from a wide variety of species as well as in model organisms.
- Humanized BirA (Mechold et al., 2005) was mutated to Rl 18G by overlap extension PCR. Products for both the WT and Rl 18G contain a 5' Sail site and at the 3' end, an Xhol, stop codon and Aflll. These were digested with Sail and Aflll and inserted into pcDNA3.1 C-terminal to a myc-epitope digested with Xhol and Aflll. Human LaA was excised from pcDNA3.1 by Xhol and Aflll and inserted in frame with the mycBirA* in pcDNA3.1 using the same restriction sites.
- pRetroXTM.Tight.puro is a puromycin selectable mammalian expression vector that contains a Tet-on-based tetracycline- inducible promoter to inducibly regulate expression.
- pRetroXTM Tet-ON Advanced HEK293 cells that stably express the doxycycline-regulated transactivator protein were transiently transfected with pRetroXTM-Tight.puro myc-BirA*LaA with Lipofectamine 2000 (Invitrogen; Roux et al., 2009). Cells began selection with 0.5 ⁇ g ml puromycin 48 h after transfection. Upon colony formation, subclones were isolated and screened by immunofluorescence after induction by the addition of 1 ml doxycycline for 24 h. Immunofluorescence
- Other antibodies include rabbit ann-FAM16 A SLAP75 (1 :200; Sigma-Aldrich), mouse anti-HA (1:200 12CA5; Covance), mouse anti-Nupl53 (1:2, SA1; Bodoor et al., 1999), and mouse anti-LaA (1:100, XB10; Horton et al., 1992). Proteins were visualized with goat anti-mouse, goat anti-rabbit, or streptavadin coupled to Alexa Fluor 488 or -568 (1:1,000; Invitrogen). DNA was detected with Hoechst dye 33258. Coverslips were mounted in 10% Mowiol 4-88.
- Cells were incubated for 24 h in complete media supplemented with 1 ⁇ ' ⁇ doxycycline and 50 ⁇ biorin. After three PBS washes, cells (for small-scale analysis, ⁇ 10 7 ; for large scale analysis, 4 1 7 ) were lysed at 25°C in 1 ml lysis buffer (50 mM Tris, pH 7.4, 500 mM NaCl, 0.4% SDS, 5 mM EDTA, 1 mM DTT, and lx Complete protease inhibitor [Roche]) and sonicated. Triton X-100 was added to 2% final concentration.
- 1 ml lysis buffer 50 mM Tris, pH 7.4, 500 mM NaCl, 0.4% SDS, 5 mM EDTA, 1 mM DTT, and lx Complete protease inhibitor [Roche]
- wash buffer 2 (0.1% deoxycholate, 1% Triton X-100, 500 mM NaCl, 1 mM EDTA, and 50 mM Hepes, pH 7.5
- wash buffer 3 250 mM LiCl, 0.5% NP-40, 0.5% deoxycholate, 1 mM EDTA, and 10 mM Tris, pH 8.1
- wash buffer 4 50 mM Tris, pH 7.4, and 50 mM NaCl.
- 10% of the sample was reserved for Western blot analysis. Bound proteins were removed from the magnetic beads with 50 ⁇ of Laemmli SDS-sample buffer saturated with biotin at 98°C. For the larger scale preparation, 90% of the sample to be analyzed by mass spectrometry was washed twice in 50 mM NH 4 HCO 3 . Protein identification by mass spectrometry
- proteins eluted from the streptavidin beads by SDS-sample buffer were reduced and alkylated and separated by ID SDS-PAGE. Separated proteins were visualized by colloidal Coomassie blue staining.
- the whole gel lane was cut in 24 equal-sized gel bands, destained, and submitted to tryptic in-gel digestion, all using perforated microtiter plates (Proxeon) with exchange of solvents by low-speed centritugation.
- Peptides were eluted into V-bottom polypropylene microtiter plates, freeze-dried, dissolved in 0.1% formic acid in water, and submitted to nano-flow HPLC coupled to a QTOF mass spectrometer (1260 nanoHPLC [Agilent Technologies] and QTOF 6554 with ChipCube [Agilent Technologies]). Separation of peptides was performed on a 150-mm ⁇ 75-um C 18 Reprosil column in a chip (Chip II; Agilent Technologies). The applied gradient was from 8% acetonitrile in water with 0.2% formic acid to 35% acetonitrile in water with 0.2% formic acid over 35 min.
- the mass spectrometer calibration was maintained by continuous submission of a calibrant solution and recalibration of the acquired spectra after the analytical run.
- the LC-MS/MS system was controlled by MassHunterTM Acquisition software (Agilent Technologies), 4 MS spectra per second and 3 MS/MS spectra per second were collected.
- the MS to MS/MS switching was done data dependent with a threshold of 1,000 counts and a charge of 2-4 for the peptides.
- Raw data were converted into mzdata.xml using MassHunterTM Qualitative Analysis software
- Formic acid was added to the peptide solution (to 2%) before on-line analysis of peptides by high-resolution, high-accuracy LC- MS MS, consisting of a MichromTM HPLC, a 15-cm Michrom MagicTM CI 8 column, a low-flow ADVANCED MichromTM MS source, and a LTQ-OrbitrapTM XL (Thermo Fisher Scientific).
- a 120-min gradient of 10-30%B (0.1% formic acid, 100% acetonitrile) was used to separate the peptides. The total LC time was 141 min.
- the LTQ-OrbitrapTM XL was set to scan precursors in the Orbitrap at a resolution of 60,000, followed by data-dependent MS/MS of the top four precursors.
- Raw LC- MS MS data were submitted to Sorcerer EnterpriseTM (Sagc-N Research Inc.) for protein identification against the IPI human protein database, which contains semi- tryptic peptide sequences with the allowance of up to two missed cleavages and precursor mass tolerance of 50.0 ppm.
- a molecular mass of 57 D was added to all cysteines to account for carboxyamidometliylation.
- Differential search included 16 D for methionine oxidation, and 226 D on N terminus and lysine for biotinylation. Search results were sorted, filtered, statically analyzed, and displayed using
- TPP trans-proteomic pipeline
- Lamina-associated polypeptide 2alpha binds intranuclear A-type lamins.
- Integral membrane proteins of the nuclear envelope interact with lamins and chromosomes, and binding is modulated by mitotic phosphorylation. Cell. 73: 1267-79.
- Nuclear lamins building blocks of nuclear architecture.
- Sampl is functionally associated with the LINC complex and A-type lamina networks. Journal of cell science. 124:2077-85.
- Emery-Dreifuss muscular dystrophy The anatomical record. Part A,
- Tpr as the architectural element of the nuclear basket. Molecular biology of the cell. 15 :4261 -77.
- Biotinylation is a natural, albeit rare, modification of human histones.
- Nesprin 4 is an outer nuclear membrane protein that can induce kinesin- mediated cell polarization. Proceedings of the National Academy of Sciences of the United States of America. 106:2194-9.
- nucleoporin Nupl53 is required for nuclear pore basket formation, nuclear pore complex anchoring and import of a subset of nuclear proteins.
- G2-BioIDTM 2 nd generation BioIDTM whose primary improvement is a considerable reduction in the size of the promiscuous biotin ligase.
- Gl- BioIDTM is based on a mutation (Rl 18G) of the E. coli BirA gene
- G2-BioIDTM is based on a mutation (R40G) of biotin protein ligase (BPL) from A. aeolicus.
- BPL biotin protein ligase
- the original version has an N-terminal myc-epitope tag for protein detection.
- G2-BioIDTM is 234 amino acids (26.56 kDa), whereas Gl -BioIDTM is 322 amino acids (35.22 kDa), both calculated without the 9 amino acid myc-epitope tag.
- the wild type ligases that are the basis for Gl -BioIDTM and G2-BioIDTM have been structurally characterized.
- the dimensions (length and width) of Gl-BioIDTM are 7.4 x 4.0 nm and for G2-BioIDTM are 5.2 x 4.2 nm. This considerable size difference ( ⁇ 30% difference) is due to the presence of a naturally occurring DNA-binding domain that is found in Gl -BioIDTM but not in the G2-BioIDTM.
- HEK293 cells were either mock transfected (control) or transfected with myc- Gl- or myc- G2-BioIDTM either in the presence or absence of 50 ⁇ biotin.
- FIG. 5 As compared to the control lanes, the expression of the Gl - (*) and G2- BioIDTM (*) proteins is detected by anti-myc immunoblotting. Promiscuous biotinylation of endogenous proteins is observed with streptavidin-HRP for both Gland G2-BioIDTM. This effect is enhanced by the presence of exogenous biotin.
- HEK293 cells were either mock transfected (control) or transfected with myc-Gl - or myc-G2-BioID -LaA either in the presence of 50 ⁇ biotin. Similar levels of promiscuous biotinylation of endogenous proteins is observed with streptavidin-HRP for both G l- and G2-BiolDTM-LaA. Expression of the Gl- and G2- BioID-LaA proteins is detected by anti-myc immunoblotting.
- BioIDTM The overall process of BioIDTM can be broken down into two stages; 1) generation and characterization of a BiolDTM-fusion protein in a mammalian (or other) expression vector for stable expression in a cell line and 2) use of that cell line for a large-scale BioIDTM pull-down to identify protein candidates by mass spectrometry.
- this exemplary protocol is specific to the application of BioIDTM in common mammalian cell lines. Based on the teachings herein, it will be well within the level of skill in the art to apply BioIDTM to a variety of cell types and species.
- BioIDTM The fundamental component of this system is the BioIDTM fusion protein. If the purpose of BioIDTM is to screen for physiologically relevant protein interactions for a protein of interest then the ideal fusion protein will represent a functional replacement of the original protein. Thus care should be taken in deciding where in the protein the biotin ligase will be incorporated. Prior evidence of successful fusions with similarly sized proteins, such as GFP, can be a good starting point to guide this decision. Consideration of posttranslational modifications, either to the N- or C- terminus must be considered as well. For example, C-terminal prenylation or an N- terminal signal peptide should not be disrupted.
- Another consideration is how to ensure that the fusion protein is functional. In many cases this is not possible unless the function of the protein is well known, some form of knockout/knockdown system exists and there is a phenotypic outcome to rescue. However, it is typically possible to at least compare the localization of the fusion protein to a more minimally tagged, or preferably endogenous, protein by immunofluorescence microscopy. Attention should be paid to expression levels as overexpression of proteins can often lead to mis-localization. In most cases low levels of BioIDTM fusion proteins, at or below the level of the endogenous protein, are sufficient for the identification of candidates. Thus when validating by transient transfection, attention should be paid to cells expressing lower levels of the fusion protein as this is the desired, and often likely, outcome for the cells that stably express the fusion protein. Choice of cells and expression method
- BioIDTM fusion protein Once the BioIDTM fusion protein has been designed and an expression plasmid generated it is necessary to consider the cell type in which to express the BioIDTM fusion protein. This is a situation-specific decision, however there are a few concepts to consider. Preferably the expression will be similar in all cells within the population to be used for large-scale BioIDTM pull-down. Thus traditional transient transfection is best limited to preliminary functional validation. A population of cells stably expressing similar amounts of the fusion protein is preferred. This can be accomplished with standard random integration or various viral expression systems. Since it is usually sufficient to screen for BioIDTM candidates more likely to be physiologically relevant, an extremely low level of protein expression is ideal.
- Inducible expression of the fusion protein can be utilized, especially if expression of the bait is toxic. However, inducible expression is not necessary in most instances since biotinylation itself is induced by the addition of excess biotin.
- This protocol describes exemplary techniques for creation and characterization of a BioIDTM-fusion protein expression plasmid followed by its stable expression in mammalian cells. Techniques such as PCR cloning and generation of stably expressing cells will not be described as they vary depending on the situation. The methods needed to test the targeting of the fusion protein and its expression and activity in cells prior to a large-scale BiolD pull-down will be described. Materials
- Streptavidin-HRP High Sensitivity Streptavidin-HRP, Thermo Scientific
- ABS Blocking buffer (see recipe)
- BSA is preferred for blocking to eliminate any free biotin that is potentially present in milk or serum. Free biotin will compete with biotinylate proteins for binding to streptavidin-HRP.
- This step can be used to reduce background signal on membrane.
- ECL enhanced chemiluminescence
- This step removes residual sodium azide and hydrogen peroxide that may affect the following Western blot analysis. 15. Proceed to immunoblot membrane with antibodies specific to the BioID fusion protein (eg. anti-myc/HA) to confirm its expression and migration by SDS-PAGE.
- BioID fusion protein eg. anti-myc/HA
- immunofluorescence Perform immunoblot analysis on subclones that pass the immunofluorescence screening. If viral infection is utilized, screen population of infected cells by immunofluorescence and immunoblot.
- biotin can be added to the cells as in Step 3 to monitor biotinylation function of BioIDTM fusion protein.
- This protocol describes an exemplary embodiment of the invention using cells stably expressing a BioIDTM fusion protein (along with non-expressing control cells) to perform large-scale BioIDTM pull-down experiments.
- the purpose of these experiments is to isolate sufficient amounts of proteins biotinylated by the BioIDTM fusion protein to be identified by mass spectrometry.
- the starting material for these experiments may vary depending on a number of factors. These include the efficiency of biotinylation by the BioIDTM fusion protein and the number of desired candidate proteins.
- This protocol describes the analysis of four confluent 10 cm plates of cells/condition ( ⁇ xl0 7 cells). The protocol ends immediately prior to analysis by mass spectrometry, a service typically performed by a core facility.
- lysis and wash steps can be carried out at RT to avoid precipitation of SDS and deoxycholic acid.
- lysis and wash steps can be carried out at RT to avoid precipitation of SDS and deoxycholic acid.
- a tube cap opener used to reduce keratin contamination.
- the number of cells utilized will correlate with numbers of candidates identified,
- Incubation time may vary depending on the goals of the experiment.
- This step helps to remove residual free-biotin from the medium.
- the purpose of this harsh lysis is to try to disrupt all protein interactions and completely denature/solubilize the proteins. Transfer lysed cells to a 15 ml conical tube.
- Triton X-100 dilutes out the SDSand prevents its precipitation at 4°C. Position the probe tip in the sample just above the tube bottom.
- This dilution provides more favorable conditions for affinity capture.
- This step helps solubilize any precipitated proteins that may be present due to reduced salt and detergent concentrations from the previous step and assists in mixing the sample. Aliquot the sample evenly to three pre-chilled 2 ml tubes.
- This step junctions to equilibrate the beads in the binding buffer. Once the beads accumulate at one side of tube wall, remove the supernatant gently by pipetting.
- This step removes the buffer the beads equilibrated in. After sample centrifugation, carefully transfer supernatant to the tubes prepared in step 16. Do not disturb the small insoluble pellet on the tube wall when removing the supernatant.
- Step 17 can be performed quickly to prevent the beads from drying out. Resuspend the samples and beads with gentle pipetting.
- wash buffer 1 0.5 ml of wash buffer 1 to each tube and resuspend beads gently by pipetting.
- wash buffer 1 1.5 ml of wash buffer 1 to one of the three tubes and resuspend gently.
- wash buffer 2 as described in steps 20-24.
- wash buffer 3 as described in steps 20-24.
- Final wash buffer may vary depending on analysis method. If necessary, freeze the sample quickly in liquid nitrogen. Frozen samples can be stored at -
- Biotin (Sigma, B4501) in 50 ml of serum-free DMEM (or standard tissue culture media). Pipetting may be required to dissolve biotin completely.
- Deoxycholic acid stock solution must be protected from light.
- Deoxycholic acid does not solubilize well under pH 7.1. Add Hepes last. Store at RT
- BioIDTM makes this solution fresh with each use
- the subcellular locations in which BioIDTM has successfully been applied to date include the nucleus, cytoplasm, endoplasmic reticulum (ER) and mitochondrial matrix. This protocol does not provide specific details for identification of proteins by mass-spectrometry. BioIDTM experiments performed to date have utilized on-bead tryptic digestion to release peptides for analysis by ID LC-MS MS (Roux et al., 2012). Advantages of this approach include circumvention of the difficulty in removing biotinylation proteins/peptides from streptavidin matrix without also removing the streptavidin itself. An additional benefit from this approach is removal of the SDS-PAGE separation of proteins, a common source of keratin contamination and mass-spectrometry interference due to residual SDS.
- the expected outcome is the identification of proteins that interact with and or are proximate to the BioIDTM fusion protein.
- the results will likely arrive as a list of proteins that were identified by mass- spectrometry. This list should not only contain the names of the proteins but the peptide sequences and number of times each peptide was identified. This provides some information as to the relative abundance of that protein in the sample. In parallel there will be results from the control cells that do not express a BioIDTM fusion protein.
- This list usually contains the five naturally biotinylated carboxylases (pyruvate carboxylase, PC; methylcrotonoyl-CoA carboxylase subunit alpha, MCCl; acetyl-CoA carboxylase 1, ACACA; acetyl-CoA carboxylase 1, ACACB; propionyl- CoA carboxylase alpha chain, PCCA), limited contaminating keratins and ribosomal proteins, and a small number of histones. These can be subtracted from the mass spectrometry results from the BioIDTM-fusion protein samples. After this subtraction, the rcrnairiing proteins are candidate interactors or proximate proteins that can be validated by other methods.
- BioIDTM typically results in a relatively large list of candidates, from -50 to 250, depending on the experiment. These can be ranked by abundance and/or based on what is already known about those proteins. It should be noted that the abundance, or lack thereof, does not necessarily imply anything about biological relevance and the user will have to make decisions as how to best utilize the results.
- PCR cloning and validation of the fusion protein by transient transfection can be accomplished in as little as a week.
- Generation and selection of stably expressing cells could take as little as a week if viral expression is utilized or 3-4 weeks if relying on random integration following transient transfection and selection.
- a large-scale BioIDTM pull-down can be accomplished in under a week. This includes the time required for the generation of sufficient cell numbers (typically 4xl0 7 cells).
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Engineering & Computer Science (AREA)
- Hematology (AREA)
- Physics & Mathematics (AREA)
- Chemical & Material Sciences (AREA)
- Urology & Nephrology (AREA)
- Immunology (AREA)
- Biomedical Technology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Food Science & Technology (AREA)
- Biotechnology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Cell Biology (AREA)
- Medicinal Chemistry (AREA)
- Analytical Chemistry (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Pathology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
The present invention provides methods for identifying in vivo proximate proteins and reagents to use in such methods. Eg, a promiscuous biotin ligase fusion protein identifies proximal and interacting proteins in mammalian cells.
Description
Methods and reagents for identifying proximate proteins
Cross Reference
This application claims priority to U.S. Provisional Patent Application Serial No. 61/722145 filed November 3, 2012, incorporated by reference herein in its entirety.
Background
The elucidation of protein-protein interactions represents a significant barrier to the understanding of complex biological processes. In recent years it has become increasingly clear that the functions of many proteins can only be fully understood in the context of networks of interactions. Furthermore, the description of such networks provides keys to our understanding of disease processes . Biochemical and genetic techniques, including affinity-capture complex purification and yeast two-hybrid strategies have provided powerful tools in the search for new molecular associations. However, these methods also display fundamental limitations. For high-throughput genetic approaches, protein interactions are commonly assessed in a cellular environment different to that in which they would normally occur, often lacking the proper machinery for post-translational modifications and the normal complement of associated binding partners, including molecular chaperones. This can lead to incomplete or erroneous datasets. Biochemical approaches suffer loss of candidates through protein insolubility and transient or weak interactions. As a consequence of these limitations many proteins remain refractory to conventional methods used to screen for protein interactions. These issues are more relevant than ever, as we collectively look to the daunting task of unraveling the protein "interactome."
Summary of the Invention
In a first aspect, the present invention provides methods for identifying in vivo proximate proteins, comprising (a) culturing recombinant cells comprising a recombinant nucleic acid capable of directing expression of a fusion protein comprising (i) a heterologous promiscuous biotin protein ligase (BPL), or functional equivalent thereof, and (ii) a bait polypeptide, wherein the culturing is carried out
under conditions suitable for expression of the fusion protein in the eukaryotic cells; and (b) identifying biotinylated proteins, wherein the biotinylated proteins are proteins present in the cells proximate to the bait polypeptide. In one embodiment, the BPL, or functional equivalent thereof, comprises a protein with an amino acid sequence of general formula 1;
X1-X2-X3-X4-X5-X6-X7-X8-X9-X10-X11-X12-X13-X14-X15-X16-X17-X18-X19- X20-X21-X22-X23-X24-X25-X26-X27-X28-X29-X30-X31-X32 (SEQ ID NO: 13), wherein
I is selected from the group consisting of A, L, I, T, V, or is absent;
X2 is selected from the group consisting of C, L, V, H, A, or is absent;
X3 is selected from the group consisting of I, V, and L;
X4 is selected from the group consisting of A and G;
X5 is selected from the group consisting of E, D, R, T, N, V, and A;
X6 is selected from the group consisting of Y, R_ K, E, and I;
X7 is Q;
X8 selected from the group consisting of Q, T, N, F, V, and S;
X9 is selected from the group consisting of A, K, N, S, Q, and E;
X10 is G;
XI I is selected from the group consisting of R and K;
X12 is G;
X13 is any residue other than R;
X14 is selected from the group consisting of R, L, W, G, and S;
X15 is selected from the group consisting of G, Q, P, and ;
X16 is selected from the group consisting of R and N;
X17 is selected from the group consisting of , Q, V, E, T, Mf and A,
X18 is W;
X19 is selected from the group consisting of F, L, Y, E, I, and V;
X20 is selected from the group consisting of S, M, and N;
X21 is selected from the group consisting of P, Q, and D;
X22 is selected from the group consisting of F, E, A, , Y, and V;
X23 is selected from the group consisting of G and A
X24 is selected from the group consisting of V and C, or is absent;
X25 is selected from the group consisting of C and A, or is absent;
X26 is selected from the group consisting of A and L, or is absent;
X27 is selected from the group consisting of A, G, and S, or is absent;
X28 is selected from the group consisting of N, G, Q, T, and C;
X29 is selected from the group consisting of L, I, A, and F;
X30 is selected from the group consisting of Y, M, A, V, and L;
X31 is selected from the group consisting of L, Gf V, I, and F; and
X32 is selected from the group consisting of S, T, and F.
In another aspect, the present invention provides recombinant nucleic acids, comprising (a) a first nucleic acid domain encoding a promiscuous biotin protein ligase (BPL); and (b) a second nucleic acid domain encoding a bait polypeptide. In one embodiment, the first nucleic acid domain encodes a protein comprising an amino acid sequence of general formula 1 :
X1-X2-X3-X4-X5-X6-X7-X8-X9-X10-X11-X12-X13-X14-X15-X16-X17-X18-X19- X20-X21-X22-X23-X24-X25-X26-X27-X28-X29-X30-X31-X32 (SEQ ID NO: 13), wherein
XI is selected from the group consisting of A, L, I, T, V, or is absent;
X2 is selected from the group consisting of C, L, V, H, A, or is absent;
X3 is selected from the group consisting of I, V, and L;
X4 is selected from the group consisting of A and G;
X5 is selected from the group consisting of E, D, R, T, N, V, and A;
X6 is selected from the group consisting of Y, R, K, E, and I;
X7 is Q;
X8 selected from the group consisting of Q, T, N, F, V, and S;
X9 is selected from the group consisting of A, , N, S, Q, and E;
XlO is G;
XI I is selected from the group consisting of R and K,
X12 is G;
13 is any residue other than R;
14 is selected from the group consisting of R, L, W, G, and S;
X15 is selected from the group consisting of G, Q, P, and K;
X16 is selected from the group consisting of R and N;
X17 is selected from the group consisting of , Q, V, E, T, M, and A;
X18 s W;
X19 is selected from the group consisting of F, L, Y, E, I, and V;
X20 is selected from the group consisting of S, M, and N;
X21 is selected from the group consisting of P, Q, and D;
X22 is selected from the group consisting of F, E, A, K, Y, and V;
X23 is selected from the group consisting of G and A
X24 is selected from the group consisting of V and C, or is absent;
X25 is selected from the group consisting of C and A, or is absent;
X26 is selected from the group consisting of A and L, or is absent;
X27 is selected from the group consisting of A, G, and S, or is absent;
X28 is selected from the group consisting of N, G, Q, T, and C;
X29 is selected from the group consisting of L, I, A, and F;
X30 is selected from the group consisting of Y, M, A, V, and L;
X31 is selected from the group consisting of L, G, V, I, and F; and
X32 is selected from the group consisting of S, T, and F.
In another aspect, the present invention provides recombinant expression vectors comprising a recombinant nucleic acid of the invention. In another aspect, the invention provides recombinant host cells comprising a recombinant expression vector of the invention. In another aspect, the invention provides transgenic, non- human organism comprising a recombinant host cell of the invention.
In a further aspect, the present invention provides recombinant fusion proteins, comprising (a) a first domain encoding a biotin protein ligase (BPL); and (b) a second domain encoding a bait polypeptide. In one embodiment, the first domain comprises an amino acid sequence of general formula 1 :
X1-X2-X3-X4-X5-X6-X7-X8-X9-X10-X11-X12-X13-X14-X15-X16-X17-X18-X19- X20-X21 -X22-X23-X24-X25-X26-X27-X28-X29-X30-X31 -X32 (SEQ ID NO: 13), wherein
XI is selected from the group consisting of A, L, I, T, V, or is absent,
X2 is selected from the group consisting of C, L, V, H, A, or is absent;
X3 is selected from the group consisting of I, V, and L;
X4 is selected from the group consisting of A and G;
X5 is selected from the group consisting of E, D, R, T, N, V, and A;
X6 is selected from the group consisting of Y, R, K, E, and I;
X7 is Q;
X8 selected from the group consisting of Q, T, N, F, V, and S;
X9 is selected from the group consisting of A, K, N, S, Q, and E;
X10 is G;
XI 1 is selected from the group consisting of R and K.;
X12 is G;
X 13 is any residue other than R;
X14 is selected from the group consisting of R, L, W, G, and S;
X 15 is selected from the group consisting of G, Q, P, and ;
X16 is selected from the group consisting of R and N;
X17 is selected from the group consisting of , Q, V, E, T, M, and A;
X18 is W;
X19 is selected from the group consisting of F, L, Y, E, I, and V;
X20 is selected from the group consisting of S, M, and N;
X21 is selected from the group consisting of P, Q, and D;
X22 is selected from the group consisting of F, E, A, K, Y, and V;
X23 is selected from the group consisting of G and A
X24 is selected from the group consisting of V and C, or is absent;
X25 is selected from the group consisting of C and A, or is absent;
X26 is selected from the group consisting of A and L, or is absent;
X27 is selected from the group consisting of A, G, and S, or is absent;
X28 is selected from the group consisting of N, G, Q, T, and C;
X29 is selected from the group consisting of L, I, A, and F;
X30 is selected from the group consisting of Y, M, A, V, and L;
X31 is selected from the group consisting of L, G, V, I, and F; and
X32 is selected from the group consisting of S, T, and F.
Description of the Figures
Figure 1, Model for application of BioID method, (a) Expression of a promiscuous biotin-ligase fusion protein in live cells leads to the selective biotinylation of proteins proximate to that fusion protein. After stringent cell lysis and protein denaturation, biotinylated proteins are affinity purified. These candidate proteins can be identified by mass spectrometry or immunoblot analysis, (b) In our application of BioID™ to LaA to identify candidate proteins we used HEK293 cells stably expressing inducible mycBirA*LaA. 24 h before lysis, cells were induced to express mycBirA*LaA with doxycycline and to biotinylate endogenous proteins with 50 μΜ biotin. Cells were lysed under stringent conditions and biotinylated.
Figure 2. BirA* promiscuously biotinylates endogenous proteins in mammalian cells. HeLa cells were analyzed 24 h after transient transfection with myc-BirA- WT or myc-BirA* (Rl 18G). After transfection, cells were cultured either with or without supplemental biotin (50 μΜ). (a) By Western blot analysis similar levels of the exogenous BirA (asterisk) are detected in all samples with anti-myc. Biotinylated proteins, including both exogenous BirA (asterisk) and endogenous proteins, were detected with HRP-streptavidin. Enhanced protein biotinylation is observed in the myc-BirA* samples as compared with the WT isoform. This difference is dramatically enhanced by the presence of excess biotin. (b) By fluorescence microscopy the BirA is predominantly nuclear as observed with anti- myc. Biotinylated proteins were detected with fluorescently labeled streptavidin. Considerable biotinylation is only observed in cells expressing myc-BirA* and supplemented with excess biotin. The biotin signal predominantly co-localizes with myc-BirA*. DNA is labeled with Hoechst. Bar, 4 μιη.
Figure 3. Proximity-dependent promiscuous biotinylation by BioID-
LaA. HEK293 cells inducibly expressing myc-BirA*LaA, or parental controls, were analyzed 24 h after induction with or without excess biotin. (a) By immunoblot analysis the LaA fusion protein (asterisk) is detected with anti-myc. In control cells levels of endogenously biotinylated proteins are unaffected by the supplemental biotin; however, the biotinylation of endogenous proteins by myc-BirA*LaA is dramatically enhanced in the presence of excess biotin. (b) The myc-BirA*LaA is detected at the nuclear rim and to a lesser extent in the nucleoplasm. Biotinylated proteins colocalize with the LaA fusion protein. DNA is labeled with Hoechst. Bar, 5 pm. (c) To monitor the relative rate of biotinylation by BioID, myc-BirA*LaA HEK293 cells were provided with 50 μΜ biotin at different time points. The levels of myc-BirA*LaA (asterisk in panel a) are similar for all conditions; however, the extent of biotinylation increases with duration of biotin supplementation until it is saturated by 24 h. (d) Similar results were observed by fluorescence microscopy. Myc- BirA*LaA was detected with anti-myc and biotinylated proteins with fluorescently labeled streptavidia DNA was labeled with Hoechst. Bar, 25 pm. (e) The identity of the proteins biotinylated by BioI -LaA was determined by mass spectrometry of proteins isolated with streptavidin-coupled magnetic beads. The relative abundance (percentage of total identified spectra adjusted for protein size, identified spectra per 103 amino acids) and classification of proteins uniquely biotinylated by myc-
BirA*LaA is depicted in the chart. Conservatively estimated, 50% of the detected proteins predominantly reside at the nuclear lamina, INM, or nucleoplasmic face of the NPCs.
Figure 4. SLAP75 is a novel peripheral membrane constituent of the NE identified with BioID™-LaA. To determine if F AM 169A/SL AP75 is a novel peripheral membrane constituent of the NE its subcellular localization was analyzed by immunofluorescence microcopy, (a) As detected with anti-SLAP75, the endogenous protein is co-localized with LaA at the NE of HEK293 cells, (b) Although endogenous SLAP75 is not detected in HeLa cells (not depicted), transiently expressed HA-SLAP75 detected with anti-FAM169A/SLAP75 is localized to the NE, labeled with anti-Nu l53. DNA was labeled with Hoechst . Bar, 10 μιτι.
Figure 5. Comparison of Gl- and G2-BioID™ by immune- and streptavidin-blot analysis. To compare the relative efficacy of the 2nd generation (G2) of BioID™ to the 1 st (Gl ), we transiently transfected human HEK293 cells with myc-tagged Gl (*) or G2 (**) BioID™ in the presence or absence of exogenous biotin. Protein lysates from these cells prepared 24 hours after transfection were separated by SDS-PAGE and analyzed by immuno- and streptavidin-blotting. As compared to control samples without BioID™ expression, similar levels of biotinylation of endogenous proteins were observed with Gl and G2.
Figure 6. Comparison of Gl- and G2-BioID™ by fluorescence microscopy. To compare the relative efficacy of the 2nd generation (G2) of BioID™ to the 1st (Gl), we transiently transfected human U20S cells with myc-tagged Gl (*) or G2 (**) BioID™ in the presence or absence of exogenous biotin. 24 hours after transfection cells we fixed and analyzed these cells by immuno- and streptavidin- fluorescence. In cells expressing either BioID™-fusion protein, biotinylation of endogenous proteins was observed.
Figure 7. Comparison of Gl- and G2-BioID™-LaA by immuno- and streptavidin-blot analysis. To compare the relative efficacy of the 2nd generation (G2) of BioID™ to the 1* (Gl), we generated human HEK293 cells stat stable express myc-tagged Gl (*) or G2 (**) BioK)™-LaA. Protein lysates from these cells prepared 24 hours after the addition of excess biotin were separated by SDS-PAGE and analyzed by immuno- and streptavidin-blotting. As compared to control samples
without BioID -LaA expression, a similar level and pattern of biotinylation of endogenous proteins was observed with Gl- and G2-BioID™-LaA .
Figure 8a through 8c. Exemplary Biotin Protein Ligase Amino Acid Sequences. A list of BPL amino acid sequences from exemplary species.
Figure 9. Alignment of exemplary Biotin Protein Ligase Amino Acid
Sequences. A ClustalW alignment of BPL sequences from exemplary species. The conserved BPL active site is bracketed.
Detailed Description of the Invention
All references cited are herein incorporated by reference in their entirety.
Within this application, unless otherwise stated, the techniques utilized may be found in any of several well-known references such as: Molecular Cloning: A Laboratory Manual {Sambrook, et al., 1989, Cold Spring Harbor Laboratory Press), Gene Expression Technology (Methods in Enzymology, Vol. 185, edited by D. Goeddel, 1991. Academic Press, San Diego, CA), "Guide to Protein Purification" in Methods in Enzymology (M.P. Deutshcer, ed., (1990) Academic Press, Inc.); PCR Protocols: A Guide to Methods and Applications (Innis, et al. 1 90. Academic Press, San Diego, CA), Culture of Animal Cells: A Manual of Basic Technique, ά Ed. (R.I. Freshney. 1987. Liss, Inc. New York, NY), Gene Transfer and Expression Protocols, pp. 109- 128, ed. E.J. Murray, The Humana Press Inc., Clifton, N.J.), and the Ambion 1 98 Catalog (Ambion, Austin, TX).
As used herein, the singular forms "a", "an" and "the" include plural referents unless the context clearly dictates otherwise. "And" as used herein is interchangeably used with "or" unless expressly stated otherwise.
As used herein, the term "about" means +/- 5% of the relevant unit of measurement.
As used herein, the amino acid residues are abbreviated as follows: alanine (Ala; A), asparagine (Asn; N), aspartic acid (Asp; D), arginine (Arg; R), cysteine (Cys; C), glutamic acid (Glu; E), glutamine (Gin; Q), glycine (Gly; G), histidine (His; H), isoleucine (He; I), leucine (Leu; L), lysine (Lys; ), methionine (Met; M), phenylalanine (Phc; F), proline (Pro; P), serine (Scr; S), threonine (Thr; T), tryptophan (Trp; W), tyrosine (Tyr; Y), and valine (Val; V).
All embodiments of any aspect of the invention can be used in combination, unless the context clearly dictates otherwise.
In a first aspect, the present invention provides methods for identifying in vivo proximate proteins, comprising
(a) culturing recombinant cells comprising recombinant nucleic acids capable of directing expression of a fusion protein comprising (i) a heterologous promiscuous biotin protein ligase (BPL), or functional equivalent thereof, and (ii) a bait polypeptide, wherein the culturing is carried out under conditions suitable for expression of the fusion protein in the eukaryotic cells; and
(b) identifying biotinylated proteins, wherein the biotinylated proteins are proteins present in the cells proximate to the bait polypeptide.
The methods and reagents of the present invention can be used, for example, for proximity-dependent labeling of proteins in live cells. Named "BioID™" for proximity-dependent biotin identification, this approach is based on fusion of a promiscuous biotin protein ligase to a targeting protein (the "bait polypeptide"). BioID™ features proximity-dependent biotinylation of proteins that are near- neighbors of the fusion protein. The examples presented herein demonstrate that the methods and reagents of the invention are generally applicable method to screen for both interacting and neighboring proteins in their native cellular environment.
BioID™ is effective when applied to insoluble and membrane associated proteins, two classes of proteins that may be refractory to screening with conventional approaches. Additional advantages include the ability to identify weak and/or transient interactions, to screen for interactions in a relatively natural cellular setting and temporal inducibility of biotin labeling
In one embodiment the proteins present in the cells proximate to the bait polypeptide are candidate proteins for interacting with the bait polypeptide.
As used herein, the term "proximate" means within at least about 25 nm or closer. In a further embodiment, the biotinylated proteins are proteins present in the cells within at least about 20 nm of the bait polypeptide. In various further embodiments, the biotinylated proteins are proteins present in the cells within at least about 19, 18, 17, 16, 15, 14, 13, 12, 11, 10 nm of the bait polypeptide, or closer.
The methods can be used with any recombinant cell type that capable of expressing the fusion protein. The cells may be prokaryotic or eukaryotic. Non- limiting embodiments include bacterial cells, trypanosomal cells, protozoan cells, fungal cells, and mammalian cells.
As used herein, "heterologous" means that the BPL is not naturally expressed in the cell (i.e., the gene encoding the promiscuous biotin protein ligase is recombinantly introduced in the cell).
The BPL is "promiscuous," in that it is not selective for its endogenous substrate, and thus shows improved proximity-dependent biotinylation of proteins. In one embodiment, the promiscuous BPL may be a naturally occurring BPL f agment; in another embodiment, the promiscuous biotin protein ligase is a mutated biotin protein ligase, or a fragment thereof.
Any suitable promiscuous BPL can be used in the methods of the invention. Virtually every organism has a BPL that has been identified and its amino acid sequence demonstrated. Any such promiscuous BPL fragment, or BPL mutation can be used in the methods of the invention, so long as it is not endogenous to the recombinant cell.
Thus, in one embodiment the BPL is a fragment of a wild-type BPL, including but not limited to a fragment of any of the BPLs in Figure 9, or functional equivalent thereof. The functional equivalent can be any modification to the promiscuous BPL that retains BPL activity (i.e.: insertions, deletions, point mutations, etc.) In one embodiment, the BPL is deleted for all or part of its DNA binding domain. The DNA binding domain is not conserved beyond some prokaryotes and is fragmented in some BPLs. In a non-limiting example, the E. eeli BPL (SEQ ID NO: 1) is found between residues 22-46. As used herein, "deleted for" includes both BPLs that naturally lack all or a part of their DNA binding domain, and BPLs that are engineered to delete all or a part of their DNA binding domain. In other embodiments, the BPL may be full length but with one or more mutations, such as mutations to increase "promiscuity" of the BPL. Exemplary such embodiments are shown in Figure 8, which possess an R to G mutation to increase promiscuity (SEQ ID NOS: 1-4, 6-12). One such embodiment is the E. colt BirA protein (SEQ ID NO: 1).
In one embodiment, the BPL comprises a protein with an amino acid sequence of general formula 1 :
X1-X2-X3-X4-X5-X6-X7-X8-X9-X10-X11-X12-X13-X14-X15-X16-X17-X18-X19- X20-X21-X22-X23-X24-X25-X26-X27-X28-X29-X30-X31-X32 (SEQ ID NO: 13), wherein
I is selected from the group consisting of A, L, Γ, T, V, or is absent;
X2 is selected from the group consisting of C, L, V, H, A, or is absent;
X3 is selected from the group consisting of I, V, and L;
X4 is selected from the group consisting of A and G;
X5 is selected from the group consisting of E, D, R, T, N, V, and A;
X6 is selected from the group consisting of Y, R, K, E, and I;
X7 is Q;
X8 selected from the group consisting of Q, T, N, F, V, and S;
X9 is selected from the group consisting of A, K, N, S, Q, and E;
X10 is G;
XI 1 is selected from the group consisting of R and K.;
X12 is G;
X13 is any amino acid other than R;
X14 is selected from the group consisting of R, L, W, G, and S;
X15 is selected from the group consisting of G, Q, P, and K;
X16 is selected from the group consisting of R and N;
X17 is selected from the group consisting of , Q, V, E, T, M, and A;
X18 is W;
X19 is selected from the group consisting of F, L, Y, E, I, and V;
X20 is selected from the group consisting of S, M, and N;
X21 is selected from the group consisting of P, Q, and D;
X22 is selected from the group consisting of F, E, A, K, Y, and V;
X23 is selected from the group consisting of G and A
X24 is selected from the group consisting of V and C, or is absent;
X25 is selected from the group consisting of C and A, or is absent;
X26 is selected from the group consisting of A and L, or is absent;
X27 is selected from the group consisting of A, G, and S, or is absent,
X28 is selected from the group consisting of N, G, Q, T, and C;
X29 is selected from the group consisting of L, I, A, and F;
X30 is selected from the group consisting of Y, M, A, V, and L;
X31 is selected from the group consisting of L, G, V, I, and F; and
X32 is selected from the group consisting of S, T, and F.
Proteins according to general formula 1 include the BPL active site based on a number of known BPLs (See Figure 9— sequence alignment; residues 511 to 542). In many BPLs, the XI 3 position is R; mutating this to any other amino acid helps to make the resulting BPL more promiscuous. In an exemplary embodiment, X13 is G.
As will be understood by those of skill in the art, there are many BPLs that in which the active site may include residues different from those recited in general formula I. Thus, this embodiment is an exemplary embodiment and is not intended to be limiting on the scope of the invention.
In various embodiments, at least one of the following is true of the BPL of general formula 1 (SEQ ID NO: 17):
X4 is A;
X8 is T or S;
X13 is G;
XlS is G;
X1 is L, I, or V;
X20 is S;
X21 is P
X23 is G;
X2 is L or I;
X30 is A. V, or L;
X31 is L, G, V, or I; and/or
X32 is S.
In another embodiment, the BPL comprises or consists of an amino acid sequence according to SEQ ID NO: 14, as represented in Table 1 below. Numbering in the Table is based on the numbering in the sequence alignment shown in Figure 9.
Proteins with an amino acid sequence according to SEQ ID NO: 14 include the BPL active site and additional residues required for BPL activity based on a number of known BPLs (See Figure 9; residues 506 to 796). As will be understood by those of skill in the art, there arc many BPLs that in which the amino acid sequence required for activity may include residues different from those recited in SEQ ID NO: 14.
Thus, this embodiment is an exemplary embodiment and is not intended to be limiting on the scope of the invention.
Position Residues Position Residues
506 P, T, Q, absent 507 E, N, absent
508 S, T, M, absent 509 T, G, absent
510 T, F, L, absent 511 A, L, I, V, T, absenl
512 C, L, V, H, A, absent 513 I, V, L
514 Α,σ 515 E, D, R, T, N,V,A
516 Y,R,K,E,I 517 Q
518 T,S,Q,N, V,F 519 Α,Κ,Ν, S, Q,E
520 G 521 R,K
522 G 523 Any AA other than R,
524 R, L, W, G, S 525 G, K, Q, P
526 R,N 527 K, Q, V, E, T, M, A
528 W 529 L, V, I, F, Y, E
530 S,M,N 531 P,Q,D
532 F, E, A, K, Y, V 533 G, A
534 V, C, absent 535 C, A, absent
536 A, L, absent 537 A, G, S, absent
538 N,G, Q, T,C 539 L, I, A, F
540 A,V, L,Y,M 541 L, G,V,I,F
542 R, T, F 543 , F, Γ, 1
544 F, L, Y, V, P, I, S 545 L, I, V, W
546 R,N,A,D,Q 547 L, P, F, S, V, M
548 F., K, Π, Q, P 549 1., G, V, A, Q, F, P,D
550 G, F, K, Y, T 551 N, R, Q, absent
552 R, F, V, absent 553 N, S, absent
554 T, absent 555 T, I, absent
556 S, P, absent 557 V, I, absent
558 P, A, V, absent 559 A, Ε,Κ,Ρ, L, F
560 A,V,L,N, S,K 561 A,L,I,V,Q
562 I, L, E, A, Y, H, absent 563 G, L, V, Q, absent
564 L, I, V, S, M 565 S,P,N, M, A
566 L, V, F 567 V,L,A
568 A,I,L, V,Y 569 G, P,A,C,U,T
570 I, L, V, K, R, E 571 V, A, G, S, C,E
572 V, I, L, A, M 573 A, I, L,V,S,Y,R,K
574 D, E, S, R 575 V,A,I, Y5S
576 L,A,I,C 577 R, Ε,Ν,Κ, G.P.H
578 K, E, N, L, G, E, absent 579 L, I, Y, A, L, F, ., absent
580 G, T, V, S, C, E, Q, absent 581 A, E, D, G, N, L, absent
582 P, absent 583 F, A, absent
584 I, A, L, D, K, absent 585 K, P, E, R, D, N, absent
586 V,L, A, F 587 R, K, S, Q, G7 A, F
588 V,L,I 589 K
9 W 591 P
592 N 593 D
594 , V 595 Y,L,M
596 L,V,A,F,Y 597 Q, D, , N, L, R, S
598 D, E, V, G, S 599 N, P, D, L, absent
600 T, K, absent 601 Y, G, absent
602 Y, G, absent 603 K, absent
604 R, absent 605 , absent
606 N, absent 607 L, absent
608 K, absent 609 L, absent
610 V, absent 611 N, absent
612 T, absent 613 G, absent
614 F, absent 615 E, absent
616 II, absent 617 T, absent
618 K, absent 619 L, absent
620 P, absent 621 L, I, absent
622 G, N, absent 623 D, F, absent
624 I, Q, absent 625 E, G, absent
626 P, K, D, absent 627 N, A, Q, absent
628 Y, absent 629 R, , G, L, M
630 K 631 L, V, I
632 A,G,S 633 G
634 I, v 635 L, I
636 V.C.T 637 Ε,Ν,Τ
638 S, L,I 639 I, S,A,H,N
640 G, N, absent 641 K,S,H,F, Y, L, absent
642 T, K, G, I, R, V, M, absent 643 G, D, N, P, K, S, absent
644 D, , G, N, E 645 A, M, V, K, T, absent
646 A, L, D, Q, A, Y, F, absent 647 Q, Y, A, C, H, N absent
648 I, L, V, M 649 A,V,I,L
650 I, L, A, V 651 G
652 A,1,C 653 G
654 I, V, L, F 655 N
656 V, L, M 657 A, N, S, T, D
658 N,Q,M,L, G,S 659 I^S.Q^A.D.L.
660 R, E, K, I, P, G, N 661 R, D; P, absent
662 A, T, a sent 663 V, G, I, T, absent
664 E, T, A, S, C, absent 665 V, 1., I, E, P, Q, absent
666 Q,N,S,E, T,D 667 V,E,P, T, A, D
668 V, I, G, L, W 669 N, K, S, R, A, I, V, Q
670 Q,D,E,T,R 671 G, R, I, P, E
672 W, Α,Υ,Ρ,ί,Η 673 J,T, A,V,N,S
674 T, E, S, K, absent 675 E, N, absent
676 R, absent 677 Q, absent
678 Q, absent 679 L, V, S, Q, absent
680 Q, Y, C, K, H, D, N, absent 681 E,G,L,N, P,
682 A, I, E, D, P 683 G, T, D, L, R, A, G, E
684 I, G, L, P, S 685 N,K,D,H,T,P,E,Q,K
686 D, E, A, K, P, absent 687 L, V, I, W, F
688 D, E, K, S, R, Q 689 R, K, P, A, F
690 E,D,N,K,Q,V 691 T,E,M,K,C,Y
692 L, V, I, F 693 A, 1, L, Q
694 A,L, P, S, D,T 695 M, K, N, L, S, A, R
696 L, V, A, F, Y 697 V, A, I, L, M, Γ
698 R, K, Q, P, Υ,Ν,Τ 699 E, R, Η,Τ,Α, K,N,Q,V
700 L,I,F 701 D, E, R, S, Y, A
702 A, E, T, S, R, V, K 703 A, Γ, I,, N, R, Y, W
704 L,I,Q,H,Y 705 Ε,Κ,Ν, A,R, D
706 L, K, I, Q, R, E 707 F, L
708 E,K,D,I,L,Q 709 Q, E, N, T, D
710 E, K, N, Y, Q 711 G, S,E
712 L, F, 1, A, P 713 A, K, D, S, Q, N
714 P, E, D, L, S, O 715 E, Y, absent
716 E, absent 717 I,L. V,Y,F
718 L,K,Q,R,E 719 S, Q, G.K.N, P,E
720 R, , A, S, E, L 721 Y, W,I
722 E, Q, K, L, Y 723 R, ,S,A,E,Q
724 L, K, Y, R, T 725 D, M, N, S, A, W
726 L,A,V,I,N 727 H, R, F, Y, T
728 I,L, S,G 729 G, N, D
730 R, E, S, K, Q 731 P, E, Q, I, T, R
732 V,I 733 K, R, H, N, I,T
734 L,V, AJ 735 G, A, I, L, R, P, Q
736 I,G,T, L,D,S,E 737 G, E, S, H, N, T, A
738 D, G, Q, N, T, S, E 739 K,G, N, A
740 E, D= absent 741 Q, absent
742 M, absent 743 V, absent
744 E, V, T, P, absent 745 S, I, Q,K, A, absent
746 F,T,S, H, A, V 747 G, I, V: , S
748 Ι,Τ,Κ 749 S,L, E, V, F, T,Q,I
750 R; V, Q, Y, E, G 751 G,D,I
752 L,I,T 753 D, E, S, T
754 D, E, K, F, T, S 755 Q,K,R,D, S,Y,F
756 G 757 A, G, Y, L, F, absent
758 L, A, I 759 L, I, V, K, Q
760 V,A,I,L 761 E, L,G,R, K,Q,I,H
762 Q, T, C, O, E, L 763 E, G, P, L, absent
764 D,E,N,K,A,V,G 765 G,K,S,E,N
766 G, N, V, E, absent 767 S, N absent
768 S, absent 769 T, absent
770 Q, E, absent 77 L F, P, absent
772 T, absent 773 G, T, absenl
774 N, K, absent 775 V, M, absent
776 I, V, R, Y, C, absent 777 I,Q,E,N,II, S,T
778 V, L, K, M, R 779 P, E,F,I,Q,H
780 P, W, I, N 781 M, L, S, A, D, absent
782 G, S 783 G, T, E, N
784 E,C,V, T, S,R 785 F,I, V,N
786 S, H, F, D, N 787 L, V, G, I, M, absent
788 L, G, R, F, M, D 789 R, It, H, absent
790 S, N, Q, absent 791 L, V, absent
792 S, I, R, absent 793 A, V, L, K, T, absent
794 E, K, R, P, absenl 795 K, A, absenl
796 V, T, L, R, absent
Residues X1-X32 of general formula 1 are shown in Table 1 at residues 511- 542. In one embodiment, the BPL for use in the invention comprises any residue other than R, including but not limited to a G residue at position 523 (XI 3 in general formula 1 ). In this embodiment, the invariant R residue in wild type BPLs at position X13 is modified, which results in enhanced promiscuity of the resulting BPL
In one embodiment, the BPL compiises or consists of a BPL comprising or consisting of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-4 and 6-12. These amino acid sequences are also shown in Figure 8a through 8c (SEQ ID NOs: 1 -4 and 6-12). In specific embodiments, the E. coli BirA* protein of SEQ ID NO: 1 or the A. eolicus BPL protein mutant of SEQ ID NO: 2, or functional equivalents thereof, are used
Similarly, the bait polypeptide maybe any bait polypeptide deemed suitable for a given use. The bait polypeptide can be anything that will directly or indirectly target the protein to specific cellular regions. It can be a full length protein or any
fragment that imparts some kind of targeting or binding property. In various non- limiting embodiments, the bait polypeptide may comprise lamin-A (P02545; SEQ ID NO: 18), frataxin (Q16595; SEQ ID NO: 19), sun2 (Q9UH99; SEQ ID NO: 20), Nupl07 (P57740; SEQ ID NO: 21), torsin-A (014656; SEQ ID NO: 22), or fragments or homologues thereof.
The BPL and the bait polypeptide may be immediately adjacent in the fusion protein, or may be separated by any number of additional amino acids as desirable for a given purpose. Including such a "linker" region between the BPL and the bait polypeptide may, for example, extend the range of biotinylation and/or provide the enzyme with increased accessibility to label adjacent proteins. The BPL protein may be anywhere within the fusion protein, even internal between other domains N- terminal and C-terminal to the promiscuous BPL.
The recombinant cells are capable of expressing the fusion protein; thus, the recombinant cells comprise nucleic acids encoding the fusion protein, including but not limited to recombinant nucleic acids and vectors as described below. In one embodiment, the fusion protein-encoding nucleic acids in the recombinant cells may be present in an expression vector, such as a plasmid or viral-based vector. Any expression vector suitable for an intended use can be used in the present invention. Such expression vectors can be of any type known in the art, including but not limited to plasmid and viral-based expression vectors. The construction of expression vectors for use in transfccting prokaryotic and eukaryotic cells is also well known in the art, and thus can be accomplished via standard techniques. (See, for example, Sambrook, Fritsch, and Maniatis, in: Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, 1 89; Gene Transfer and Expression Protocols, pp. 109- 128, ed, E.J, Murray, The Humana Press Inc., Clifton, N.J.), and the Ambion 1998 Catalog (Ambion, Austin, TX). The expression vector must be rcplicablc in the host organisms either as an episome or by integration into host chromosomal DNA. In a preferred embodiment, the expression vector comprises a plasmid. However, the invention is intended to include other expression vectors that serve equivalent functions, such as viral vectors. Specifics of the expression vector will depend on the ultimate desired use. Designing appropriate expression vectors for an intended use is well within the level of those of skill in the art based on the teachings herein.
The fusion protein-encoding nucleic acids in the expression vector are under control of a promoter capable of directing expression of the encoded fusion protein in
the recombinant cell.. Any suitable promoter may be used that can direct expression (i.e.: is "operatively linked") of the encoded proteins. The term "promoter" includes any nucleic acid sequence sufficient to direct expression of the encoded protein(s), including inducible promoters, repressible promoters and constitutive promoters. If inducible, there are sequences present which mediate regulation of protein expression so that the polynucleotide is transcribed only when an inducer molecule is present. Such cis-active sequences for regulated expression of an associated polynucleotide in response to environmental signals are well known to the art. The expression vector may comprise any other control sequences as may be suitable for an intended use. The control sequences need not be contiguous with the nucleic acid sequences, so long as they function to direct the expression thereof. Thus, for example, intervening untranslated yet transcribed sequences can be present between a promoter sequence and the nucleic acid sequences and the promoter sequence can still be considered "operably linked" to the coding sequence. Other such control sequences include, but are not limited to, polyadenylation signals, enhancers, termination signals, and ribosome binding sites.
The promoter sequence used to drive expression of the fusion protein may be constitutive (driven by any of a variety of promoters, including but not limited to, CMV, SV40, RSV, actin, EF) or inducible (driven by any of a number of inducible promoters including, but not limited to, tetracycline, ecdysone, steroid-responsive, etc.).
In another embodiment, the fusion protein-encoding nucleic acids may be integrated into cellular chromosomal DNA In the latter embodiment, the fusion protein-encoding nucleic acids may comprise a "knock-in" of the fusion protein- encoding nucleic acids into an endogenous gene. In this embodiment, the fusion protein-encoding nucleic acid may be knocked-in to any suitable cellular chromosomal gene. In one embodiment, the knock-in comprises a mouse knock-in to any suitable mouse cellular chromosomal gene.
In another embodiment, the method comprises supplementation of the culture medium with biotin. This embodiment is particularly useful when the recombinant cells do not endogenously produce biotin. Based on the teaching provided herein, it will be well within the level of skill in the art to determine appropriate timing and concentration of biotin supplementation of the culture medium, based on specifics of the assay to be carried out. Thus, in embodiments where biotin is added to the culture
medium, it can be added at any suitable concentration, including but not limited to those described in the examples that follow. In one non-limiting embodiment, biotin can be added at con concentrations ranging from about 1 um to about 100 urn, or at about 50 um.
Suitable culture conditions will depend on the cell type used and all other specifics of a given assay, as will be understood by those of skill in the art. In one non-limiting embodiment, the recombinant cells are cultured in the presence of biotin (endogenous or exogenously added) for between about 0.5 and about 96 hours. In various other non-limiting embodiments, the recombinant cells are cultured in the presence of biotin (endogenous or exogenously added) for between about 1 and about 96 hours, about 2 and 96 hours, about 6 and 96 hours, about 12 and 96 hours, about 24 and 96 hours, about 0.5 and 72 hours, about 0.5 and 48 hours, about 0.5 and 24 hours, about 1 and 96 hours, about 1 and 72 hours, about 1 and 48 hours, about 1 and 24 hours, about 2 and 96 hours, about 2 and 72 hours, about 2 and 48 hours, about 2 and 24 hours, about 6 and 96 hours, about 6 and 72 hours, about 6 and 48 hours, about 6 and 24 hours, about 12 and 96 hours, about 12 and 72 hours, about 12 and 48 hours, about 12 and 24 hours, about 24 and 96 hours, about 24 and 72 hours, or about 24 and 48 hours.
In another embodiment, identifying the biotinylated proteins comprises isolating the biotinylated polypeptides. This embodiment is optional, as one can use any technique for identifying the biotinylated proteins, including but not limited to by mass spectrometry analysis of crude cell lysates. In embodiments where the biotinylated proteins are isolated, any suitable method can be used, including but not limited to affinity purification, mass spectrometry, and Western blotting.
In one exemplary embodiment, the method comprises the following;
1. Prepare an expression plasmid encoding a fusion protein of interest.
2. For each experimental condition (eg. control, fusion protein) plate at least two wells of cells, one with glass coverslips and the other without.
3. Express the fusion protein in the cells of choice by transient transfection.
Process mock or non-transfected cells in parallel. Apply a suitable concentration of biotin to cells at time of transfection, or shortly after transfection if the transfection protocol used requires replacement of media shortly after transfection.
Culture cells in presence of biotin for a suitable period of time, such as between 18-24 hours.
Process the cells (using any suitable technique) on coverslips for analysis by immunofluorescence microscopy following transfcction and culture.
Process the cells for immunoblot analysis following transfection and culture. Following whole cell lysis, electrophoretic separation of proteins and Western blotting, agitate the resulting membrane in a suitable blocking conditions. Agitate membrane in streptavidin-horseradish peroxidase (HRP) at a suitable dilution lin blocking buffer under suitable conditions.
Wash membrane to remove unbound streptavidin HRP.
Add enhanced chemiluminescence (ECL) reagent to observe biotinylated proteins.
Following successful analysis of biotinylated proteins, quench HRP signal on membrane.
Wash membrane to remove quenching solution.
Proceed to immunoblot membrane with antibodies specific to the fusion protein to confirm its expression and migration by SDS-PAGE.
In another exemplary embodiment, the method comprises the following: Begin with four 10 cm dishes for each experimental condition (cells expressing fusion protein or control cells).
When cells reach approximately 80% confluency, change medium to fresh complete medium containing suitable concentration of biotin.
Incubate cells for suitable time period, such as 18-24 hours..
Remove medium and rinse the cells.
Add lysis buffer and scrape cells gently to harvest the cells.
Transfer lysed cells to a conical tube.
Add a suitable amount of Triton X- 100 (such as a final concentration -2%) and mix.
Sonicate under suitable conditions.
Aliquot the sample to separate tubes.
Spin down at suitable speed (such as 16,500 x g relative centrifugal force (RCF)).
11. Place new tubes in a Magnetic Separation Stand and add a suitable amount of lysis buffer to each tube.
12. Equilibrate the beads in the binding buffer.
13. Remove the buffer the beads equilibrated in.
14. Resuspend the samples and beads.
15. Incubate the tube on a rotator under suitable conditions.
16. Place the tubes on the Magnetic Separation Stand and collect beads.
17. Remove the supernatant.
18. Add wash buffer to each rube and resuspend beads gently by pipetting.
19. Place tubes on a rotator under suitable conditions.
20. Remove the supernatant.
21. Pool beads in wash buffer and resuspend gently.
22. Remove detergents in the sample that may interfere with mass-spectrometry.
23. Prepare samples for mass-spec analysis.
In another aspect, the present invention provides recombinant nucleic acids, comprising:
(a) a first nucleic acid domain encoding a biotin protein ligase (BPL); and
(b) a second nucleic acid domain encoding a bait polypeptide.
The BPL may be any BPL as deemed appropriate for a given use, including but not limited to any of those disclosed herein. In one embodiment, the BPL comprises a protein with an amino acid sequence of general formula 1 :
X 1 -X2-X3-X4-X5-X6-X7-X8-X9-X 10-X 11 -X 12-X 13-X 14-X 15-X 16-X 17-X 18-X 19- X20-X21 -X22-X23-X24-X25-X26-X27-X28-X29-X30-X31 -X32 (SEQ ID NO: 13), wherein
XI is selected from the group consisting of A, L, I, T, V, or is absent;
X2 is selected from the group consisting of C, L, V, H, A, or is absent;
X3 is selected from the group consisting of I, V, and L;
X4 is selected from the group consisting of A and G;
X5 is selected from the group consisting of E, D, R, T, N, V, and A;
X6 is selected from the group consisting of Y, R, K, E, and I;
X7 is Q;
X8 selected from the group consisting of Q, T, N, F, V, and S;
X9 is selected from the group consisting of A, K, N, S, Q, and E;
X10 is G;
XI 1 is selected from the group consisting of R and K;
X12 is G;
X13 is any residue other than R;
X14 is selected from the group consisting of R, L, W, G, and S;
X15 is selected from the group consisting of G, Q, P, and K;
X16 is selected from the group consisting of R and N;
X17 is selected from the group consisting of K, Q, V, E, T, M, and A;
X18 is W;
X19 is selected from the group consisting of F, L, Y, E, I, and V;
X20 is selected from the group consisting of S, M, and N;
X21 is selected from the group consisting of P, Q, and D;
X22 is selected from the group consisting of F, E, A, , Y, and V;
X23 is selected from the group consisting of G and A
X24 is selected from the group consisting of V and C, or is absent;
X25 is selected from the group consisting of C and A, or is absent;
X26 is selected from the group consisting of A and L, or is absent;
X27 is selected from the group consisting of A, G, and S, or is absent;
X28 is selected from the group consisting of N, G, Q, T, and C;
X29 is selected from the group consisting of L, I, A, and F;
X30 is selected from the group consisting of Y, M, A, V, and L;
X31 is selected from the group consisting of L, G, V, I, and F; and
X32 is selected from the group consisting of S, T, and F.
Proteins according to general formula 1 include the BPL active site based on a number of known BPLs (Sec Figure — sequence alignment, residues 511 to 542). As will be understood by those of skill in the art, there are many BPLs that in which the active site may include residues different from those recited in general formula I. Thus, this embodiment is an exemplary embodiment and is not intended to be limiting on the scope of the invention.
In various embodiments, at least one of the following is true of the BPL of general formula 1 (SEQ ID NO: 17):
X4 is A;
X8 is T or S;
X13 is G;
X15 is G;
X1 is L, I, or V;
X20 is S;
X21 is P
X23 is G;
X29 is L or I;
X30 is A. V, or L;
X31 is L, G, V, or I; and/or
X32 is S.
In a preferred embodiment, X13 is G. In this embodiment, the invariant R residue in wild type BPLs at position XI 3 is modified to any other amino acid residue, including but not limited to G, which results in enhanced promiscuity of the resulting BPL.
In another embodiment, the BPL comprises or consists of an amino acid sequence according to SEQ ID NO: 14, as represented in Table 1 below. Numbering in the Table is based on the numbering in the sequence alignment shown in Figure 9.
Proteins with an amino acid sequence according to SEQ ID NO: 14 include the BPL active site and additional residues required for BPL activity based on a number of known BPLs (See Figure 9; residues 506 to 796). As will be understood by those of skill in the art, there are many BPLs that in which the amino acid sequence required for activity may include residues different from those recited in SEQ ID NO: 14.
Thus, this embodiment is an exemplary embodiment and is not intended to be limiting on the scope of the invention.
In one embodiment, the BPL for use in the invention comprises any residue other than R, including but not limited to a G residue at position 523 (XI 3 in general formula 1). In this embodiment, the invariant R residue in wild type BPLs at position
X13 is modified, which results in enhanced promiscuity of the resulting BPL
In one embodiment, the BPL comprises or consists of a BPL comprising or consisting of an amino acid sequence selected from the group consisting of SEQ ID NO:l-4 and 6-12. These amino acid sequences are also shown in Figure 8a through
Figure 8c (SEQ ID NOs: 1-4 and 6-12). In specific embodiments, the E. coli BirA* protein of SEQ ID NO: 1 or the A. aeolicus BPL mutant protein of SEQ ID NO: 2, or functional equivalents thereof, are used.
Similarly, the nucleic acid encoding the bait polypeptide may be any bait polypeptide deemed suitable for a given use. In various non-limiting embodiments, the bait polypeptide may comprise lamin-A (SEQ ID NO: 18), f ataxin (SEQ ID NO: 19), sun2 (SEQ ID NO: 20), Nup 107 (SEQ ID NO: 21), torsin-A (SEQ ID NO: 22), or fragments or homologues thereof.
The coding regions for the BPL and the bait polypeptide may be immediately adjacent in the fusion protein, or may be separated by any number of additional nucleic acids as desirable for a given purpose. Including such a "linker" region between the BPL and the bait polypeptide coding regions may, for example, extend the range of biotinylation and/or provide the encoded fusion protein with increased accessibility to label adjacent proteins.
The recombinant nucleic acid may be of any type, including DNA and RNA, and may be single stranded or double stranded. It will be understood by those of skill in the art that the first domain and the second domain can be present in any order in the recombinant nucleic acid (i.e. : the first domain may be 5 ' or 3 ' to the second domain); similarly the BPL protein may be anywhere within the fusion protein, even internal between other domains N-terminal and C-terminal to the BPL. The recombinant nucleic acid may further comprise "spacer" nucleotides between the first domain and the second domain; such spacer sequences may be of any length suitable for a given purpose, as will be understood by those of skill in the art.
In one embodiment, the fusion protein-encoding nucleic acids in the recombinant cells may be present in an expression vector, such as a plasmid or viral- based vector. Any expression vector suitable for an intended use can be used in the immunogenic compositions of the present invention. Such expression vectors can be of any type known in the art, including but not limited to plasmid and viral-based expression vectors. The construction of expression vectors for use in transfecting prokar otic or eukaryotic cells is also well known in the art, and thus can be accomplished via standard techniques. (See, for example, Sambrook, Fritsch, and Maniatis, in; Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, 1 89; Gene Transfer and Expression Protocols, pp. 109-128, ed. E.J. Murray, The Humana Press Inc., Clifton, N.J.), and the Ambion 1998 Catalog (Ambion, Austin, TX). The expression vector must be replicable in the host organisms either as an episome τ by integration into host chromosomal DNA. In a preferred embodiment, the expression vector comprises a plasmid. However, the
invention is intended to include other expression vectors that serve equivalent functions, such as viral vectors. Specifics of the expression vector will depend on the ultimate desired use. Designing appropriate expression vectors for an intended use is well within the level of those of skill in the art based on the teachings herein.
In expression vectors of the invention, the first domain and the second domain are operatively linked to a promoter. Any suitable promoter may be used that can direct expression (i.e.: is "operatively linked") of the encoded fusion protein. The term "promoter" includes any nucleic acid sequence sufficient to direct expression of the encoded protein(s), including inducible promoters, repressible promoters and constitutive promoters. If inducible, there are sequences present which mediate regulation of protein expression so that the polynucleotide is transcribed only when an inducer molecule is present. Such cis-active sequences for regulated expression of an associated polynucleotide in response to environmental signals are well known to the art. The expression vector may comprise any other control sequences as may be suitable for an intended use. The control sequences need not be contiguous with the nucleic acid sequences, so long as they function to direct the expression thereof. Thus, for example, intervening untranslated yet transcribed sequences can be present between a promoter sequence and the nucleic acid sequences and the promoter sequence can still be considered "operably linked" to the coding sequence. Other such control sequences include, but are not limited to, polyadenylation signals, enhancers, termination signals, and ribosome binding sites.
The promoter sequence used to drive expression of the fusion protein may be constitutive (driven by any of a variety of promoters, including but not limited to, CMV, SV40, RSV, actin, EF) or inducible (driven by any of a number of inducible promoters including, but not limited to, tetracycline, ecdysone, steroid-responsive, etc.).
In a further aspect, the present invention provides recombinant fusion proteins expressed from the recombinant nucleic acids of the invention. Thus, in one embodiment, the fusion proteins comprises a BPL according to any embodiment or combination of the embodiments disclosed herein and a bait polypeptide, including but not limited to lamin-A, frataxin, sun2, Nupl07, torsin-A, or portions thereof.
The isolated polypeptides can be used, for example, in contacting candidate proteins in an extracellular environment to test verify candidate protein interactions with the bait polypeptide.
In a further aspect, the present invention provides recombinant host cells comprising an expression vector of the present invention. The cells may be prokaryotic or eukaryotic. Non-limiting embodiments include bacterial cells, trypanosomal cells, protozoan cells, fungal cells, vertebrate cells, and mammalian cells (such as mouse cells). The expression vectors may be transiently or stably present in the host cell. As will be apparent to those of skill in the art, the recombinant host cells of the invention are capable of expressing the fusion protein, and thus can be used in the methods of the invention.
In one embodiment, the recombinant host cell comprises a recombinant nucleic acid of any embodiment or combination of embodiments of the present invention knocked-in to an endogenous host cell gene. As is understood by those of skill in the art, a "knock-in" refers to insertion of the recombinant nucleic acid encoding the fusion protein at a specific chromosomal locus (i.e., a targeted insertion). The fusion protein-encoding nucleic acid may be knocked-in to any suitable cellular chromosomal gene. In a further embodiment, the knock-in is generated in a cell (such as an embryonic stem cell) of a non-human organism, such as a mammal including but not limited to a mouse. Knock-in procedures are well defined in a variety of organisms where embryonic stem cells are easily obtained and manipulated.
Thus, in another aspect, the present invention provides a transgenic, non- human organism comprising a recombinant host cell of the present invention. In one embodiment, the organism is a mammal. In a specific embodiment, the organism is a mouse. In one embodiments, recombinant host cells of the present invention comprise, embryonic stem cells which are then used to produce the transgenic non- human organisms of the present invention using standard techniques in the art.
Example 1
Here we describe an approach to screen for proximate proteins in a relatively natural cellular environment. Our method to identify neighboring and potentially interacting proteins is based on the use of a promiscuous prokaryotic biotin protein ligase. The biotin ligase is fused to a protein of interest, and then introduced into mammalian (or other) cells where it will biotinylatc vicinal proteins upon supplementation of the culture medium with biotin. Biotinylated proteins can then be selectively isolated and identified by conventional methods, most notably mass spectrometry. We have applied this strategy, which we call BioID™, to identify
candidate proteins that are proximate to and/or interact with human lamin A (LaA), a well-characterized component of the nuclear envelope (NE), a specialized extension of the endoplasmic reticulum that surrounds the nuclear contents during interphase.
LaA is an intermediate filament protein and member of the A-type lamin family that is encoded by the LMNA gene (Gerace and Huber, 2012). Together with B-type lamins, the A-type lamins are constituents of the nuclear lamina, a filamentous protein meshwork that is intimately associated with the inner nuclear membrane (INM), the membranous portion of the NE that faces the interior of the nucleus. This association is mediated, at least in part, by multiple interactions with integral INM proteins. In addition, nuclear pore complexes (NPCs), large multi-protein channels that span the nuclear membranes and which mediate nucleocytoplasmic trafficking of macromolecules, are anchored to the nuclear lamina (Aaronson and Blobel, 1975; Dwyer and Blobel, 1976). Although the bulk of the A- and B-type lamins are localized to the nuclear lamina, a nucleoplasmic population is thought to function in various aspects of nuclear metabolism, including transcription and replication (Mon¬ et al., 2000; Goldman et al., 2002).
In mammalian somatic cells, the nuclear lamina is roughly 15-20 tun thick and is considered to represent an important structural element of the NE (Gerace and Huber, 2012). Indeed, the role of the nuclear lamina as a determinant of both NE and global nuclear architecture has been highlighted by findings that mutations in the LMNA gene are linked to multiple human diseases including muscular dystrophy, lipodystrophy, and premature aging syndromes {Worman et al., 2009; Worman, 2012). Many of these disorders, known as laminopathies, are associated with oftentimes gross perturbations in nuclear and NE organization. To better understand the etiology of the laminopathies, much effort has been focused on identifying lamin- interacting proteins. However, both A- and B-type lamins are highly insoluble and consequently it has proven extremely difficult to define their molecular associations using conventional approaches. For these reasons we felt that LaA represented an ideal candidate with which to evaluate the utility of BioID™ as a general proximity- based approach to screen for potential protein-protein interactions. At the same time BioID™ introduces a new strategy with which to further explore LaA function.
Our development and use of BioID™ to identify LaA-proximal proteins has revealed a number of abundant candidates among which are known inteiactors of LaA. These include integral proteins of the INM as well as NPC components. Less
abundant candidates fall into functional categories that include transcription, chromatin regulation, RNA processing, and DNA repair. An uncharacterized protein was also among the more prominent candidates revealed by LaA BiolD™. We demonstrate that this protein, which we have named SLAP75, is a novel constituent of the NE that appears to be expressed in a cell type-specific fashion. Taken together, these findings demonstrate that BiolD™ is an effective method to screen for proximate and interacting proteins. This relatively simple and rapid technique has broad applicability to monitor protein behavior in live cells, providing a number of advantages over existing methods.
We sought to generate a method for labeling proteins in a proximity- dependent manner in mammalian cells. We envisioned a system based on the fusion of a protein of interest to an enzyme that could selectively modify vicinal proteins in vivo (Fig. 1 a). There are two requirements for such a system. The first is that the fusion protein must be targeted appropriately when expressed in cells. The second is that the modification itself must facilitate isolation of the specifically labeled proteins. Because it is relatively uncommon in vivo and amenable to selective isolation,we chose to focus on biotinylation.
BirA is a 35-kD DNA-binding biotin protein ligase in Escherichia coli that regulates the biotinylation of a subunit of acetyl-CoA carboxylase and acts as a transcriptional repressor for the biotin biosynthetic operon (Chapman-Smith and
Cronan, 1 99). BirA has been harnessed for experimental applications, including use in eukaryotic cells. The BirA acceptor-peptide system takes advantage of the extreme specificity of BirA in biotinylating its substrate peptide (Beckett et al., 1999). With this system, a minimal recognition sequence, a biotin acceptor tag (BAT), is fused to a protein of interest and coexpressed with BirA. This leads to the biotinylation of the BAT sequence permitting one-step high affinity (Kj = 10~H M; Green, 1 63) avidin streptavidin-mediated purification of the tagged protein. Because biotinylation is a rare modification, in mammalian cells it is restricted primarily to only a few carboxylases (Chapman-Smith and Cronan, 1 99); BAT-independent binding is minimal.
Biotinylation by BirA is a two-step process. The first of these combines biotin and ATP to form biotinoyl-5'-AMP (bioAMP; Lane et al., 1964). This activated biotin is held within the BirA active site until it reacts with a specific lysine residue of the BAT sequence in the second step. For our purposes, the problem with BirA lies with
its stringent selectivity for its endogenous substrate. What we desired was a far more promiscuous biotin ligase. This requirement led us to certain BirA mutants that prematurely release the highly reactive yet labile bioAMP (Kwon and Beckett, 2000; Streaker and Beckett, 2006). One such BirA mutant (Rl 18G, hereafter called BirA*), which is defective in both self-association and DNA binding (Kwon et al., 2000), and displays an affinity for bioAMP two orders of magnitude less than that of the wild- type enzyme (BirA-WT; Kwon and Beckett, 2000). In E, coli, BirA* expression results in promiscuous protein biotinylation because free bioAMP will readily react with primary amines. More significantly however, it has been demonstrated in vitro that BirA* will promiscuously biotinylate proteins in a proximity-dependent fashion (Choi-Rhee et al., 2004; Cronan, 2005).
We explored the possibility that BirA* would promiscuously biotinylate proteins in live mammalian cells. To this end we generated myc epitope-tagged, humanized BirA-WT and BirA* for transient expression in HeLa cells. Western blot analysis using streptavidin-HRP revealed modest levels of biotinylated proteins with BirA* as compared with BirA-WT (Fig. 2 a). Addition of 50 μΜ biotin to tissue culture medium, however, results in a massive stimulation of promiscuous biotinylation by BirA* but not BirA-WT (Fig. 2 a). By fluorescence microscopy the distribution of biotinylated proteins appears similar to that of myc-BirA* itself, which is predominantly nuclear with a subpopulation found in the cytoplasm (Fig. 2 b). These results indicate that BirA* promiscuously biotinylates proteins in mammalian cells. Furthermore, the level of that biotinylation is primarily regulated by the concentration of available free biotin. In conventional tissue culture media formulations fetal calf serum is the source of biotin. Our results indicate that the concentrations of biotin in standard complete media are insufficient to fuel significant biotinylation by BirA*. This has also been demonstrated for BAT biotinylation by BirA-WT (Nesbeth et al., 2006; Kulman etal., 2007), suggesting that it is not a BirA*-specifk phenomenon. Application of BioID™ to the nuclear lamina
Wc next wished to determine whether BirA* could be used as a tool to identify vicinal proteins in vivo. To this end we fused myc-BirA* to the N terminus of human LaA. a well-characterized constituent of the nuclear lamina. During interphase LaA has a relatively restricted distribution within the cell. It is detected for the most
2Θ
part at the NE with a subpopulation found throughout the nucleoplasm (Goldman et at., 2002). To provide consistent and controllable expression levels, we generated HEK293 cells that stably and inducibly express myc-BirA*LaA (Fig. 1 b). In these cells, myc-BirA*LaA localizes predominantly to the nuclear envelope, similar to both endogenous LaA and LaA harboring an N-terminal GFP tag, a modification that does not appear to alter the function of LaA (Broers et al., 1999; Shumaker et al.5 2006). Biotinylation of endogenous proteins in cells expressing myc-BirA*LaA, either in the presence or absence of exogenous biotin, was monitored on Western blots probed with streptavidin-HRP. As is the case with myc-BirA* alone, the presence of 50 μΜ biotin in the culture medium strongly stimulates biotinylation of a wide range of endogenous proteins, in addition to myc-BirA *LaA itself (Fig. 3 a). Microscopy using fluorescent streptavidin reveals that the bulk of these biotinylated proteins must reside at the NE and colocalize with myc-BirA*LaA (Fig. 3 b). The implication is that proteins in the vicinity of myc-BirA*LaA are preferentially biotinylated. It should be noted that not only does the intracellular localization of these biotinylated proteins differ between myc-BirA* (predominantly nucleoplasmic) and myc-BirA*LaA (predominantly at the NE), but their electrophoretic mobilities and hence identities also differ as revealed by Western blot analysis (Figs. 2 a and 3 a). These results suggest that BirA* can be targeted to a specific cellular location and will biotinylate endogenous proteins in a proximity-dependent manner.
Temporal regulation of BioIDI M
The requirement for exogenous biolin suggested to us a means to modulate BirA* activity. To explore this further, HE 293 cells expressing myc-BirA*LaA were analyzed by Western blot at various times after addition of 50 μΜ biotin to their culture medium (Fig. 3 c). Levels of biotinylated proteins increase in parallel with the duration of biotin exposure. This effect reaches saturation within 6 to 24 h with no obvious increase observed at later time points. A similar increase in biotinylation can also be observed by fluorescence microscopy (Fig. 3 d). Both methods reveal a time- dependent accumulation of biotinylated proteins, the majority of which appear to be endogenous and which evidently co-localize with myc-BirA*LaA. These studies indicate that by controlling access to biotin we can temporally regulate biotinylation by BirA*. This opens up the possibility of performing pulse-chase type experiments using this technique.
Identification of vicinal proteins with BioID -LaA
We next set out to test our hypothesis that proteins biotinylated by myc- BirA*LaA should be enriched with known interactors of LaA as well as with near neighbors within the nuclear lamina and I M, and to a lesser extent within the nucleoplasm. To accomplish this we induced myc-BirA*LaA expression in HEK293 cells in the presence of doxycycline and 50 μΜ biotin for 24 h and then lysed the cells under stringent denaturing conditions using an SDS-containing buffer (Fig. 1 b). Parental HE 293 cells, processed in parallel, were used as controls. For these experiments 4.0 * 107 cells (four confluent 10-cm dishes) were analyzed. Biotinylated proteins were captured with streptavidin immobilized on paramagnetic beads, rigorously washed, and bound proteins analyzed by mass spectrometry. Proteins unique to the BiolD™-LaA (myc-BirA*LaA) pull-down, and not detected with identical pull-downs from control cells, were categorized based on localization and function (Fig. 3 e). The relative abundance of the identified proteins within each category is given as a percentage of the total. The bulk of the proteins identified by BioID™-LaA are known NE components, including a number of INM proteins. The most abundant of these are the β and γ iso forms of lamina-associated polypeptide 2 (LAP2, TMPO) and lamina-associated polypeptide 1 (LAPl, TORIAIP). LAPl has a documented association with LaA (Foisner and Gerace, 1993), as does LAP2a, a soluble LAP2 isoform that also appeared prominently in the dataset (Dechat et al., 2000). Two other INM proteins, emerin (EMD; Lee et al., 2001) and MAN1 (LEMD3; Mansharamani and Wilson, 2005) identified by BioID™-LaA are also known to interact with LaA. An additional ΓΝΜ protein detected in our screen was SAMP1 (TMEM201). Also known as NET5, SAMP1 was originally identified in a proteomic analysis of rat liver nuclear membrane proteins (Schirmer et al., 2003). A recent study (Gudise et al., 2011) suggests that SAMP1 is part of a protein network that includes A-type lamins and L1NC complexes. The latter are evolutionarily conserved protein assemblies that span the NE and couple nucleoskeletal and cytoskeletal structures (Burke and Roux, 2009).
12 proteins associated with nuclcocytoplasmic transport were detected by BioID-LaA. The three most prominent of these, Nupl53, Nup50, and ELYS have been localized to the nucleoplasmic face of NPCs where they would be situated in the vicinity of the nuclear lamina (Sukegawa and Blobel, 1993; Guan et al., 2000;
Walther et al., 2001; Rasala et al., 2008). At least one of these, Nupl53, has previously been shown to interact directly with LaA (Al-Haboubi et al., 2011). A fourth NPC protein, Tpr, which is itself associated with Nu l53 (Hase and Cordes, 2003; Krull et al., 2004), also appeared in the BioID™ screen. The detection of these NPC proteins by BioID™-LaA is consistent with an NPC anchorage function for the nuclear lamina.
Several additional classes of proteins were represented among the BioID™- LaA candidates, albeit at lower levels. These included proteins associated with DNA repair, transcription, chromatin regulation, and RNA-processing. Proteins considered to be components of a nucleoskeleton were also detected, the most abundant of which was filamin A (FLNA; Castano et al., 2010).
Identification of a novel ΝΈ constituent detected by BioID™-LaA
An uncharacterized protein of 75 kD, FAM169A (KIAA0888), featured prominently in the BioID™-LaA dataset. FAM169A has no predicted transmembrane domain and lacks any sequence motifs that might provide clues to its function. To test the possibility that FAM169A is a novel NE constituent we examined the localization of the endogenous protein in HEK293 cells by immunofluorescence microscopy. Fig. 4 a clearly shows that FAM169A is concentrated at the NE. Differential permeabilization of HE 293 cells with digitonin versus Triton X-100 indicates that FAM169A resides on the nuclear face of the NE. We also introduced human HA epitope-tagged FAM1 9A into HeLa cells, which do not appear to express this protein. Consistent with the findings in HEK293 cells, recombinant FAM169A, detected using the anti-FAM16 A localizes predominantly to the NE (Fig. 4 b), although in both cell types we could always observe what appeared to be a nucleoplasmic population. In neither cell line was there any obvious association with NPCs. Taken together, these findings indicate that FAM169A is a novel NE component that must be enriched at the nuclear lamina or at the interface of the lamina and ΓΝΜ. We therefore propose to name this protein, SLAP75 (for soluble lamina-associated protein of 75 kD). Besides SLAP75, only two other soluble proteins (other than the lamins themselves), barrier to autointegration factor (BAF; Segura-Totten et al., 2002) and germ cell-less (GCL; Holaska and Wilson, 2006), have been shown to accumulate at the nuclear lamina. Proteomic screens have identified scores of membrane proteins thai are enriched at the nuclear periphery
(Schirmer et al., 2003). Our identification of an entirely new peripheral membrane constituent of the NE highlights the use of BioID™ as a valuable complement to these earlier studies. Furthermore, it confirms the use of BioID™ as an effective proximity- based tool to screen for neighboring and potentially interacting proteins. With this in mind, we have identified 10 other uncharacterized proteins, including UPF0428 protein CXorf56, UPF0414 transmembrane protein C20orf30, UPF0552 protein C15orf38, and uncharacterized protein C9orf78.
Discussion
We have devised a simple and rapid technique, BioID™, which provides a means of identifying neighboring and potentially interacting proteins in vivo. The method takes advantage of BirA*, a highly promiscuous form of the E. coti BirA biotin protein ligase. BirA* may be targeted to specific subcellular locations by fusion to a "bait" protein. Nearby proteins, biotinylated by BirA*, can then be recovered in a single step on streptavidin-coated beads and identified by mass spectrometry. The only requirement for BioID™ is the expression of a single fusion protein.
Consequently, BioID™ is applicable to map protein associations in essentially any accessible cell type, mammalian or otherwise.
There are currently two strategies that are widely used to detect protein interactions. The first of these involves the yeast two-hybrid (Ύ2Η) system and takes advantage of the ability of hybrid transcription factor domains to functionally associate, thereby driving expression of reporter genes. The second strategy is based upon co-immunoprecipitation or pull-down, frequently involving expression of single- or double-tagged bait proteins. Irnmunoprecipitated proteins are then identified by mass spectrometry. A significant attribute of the Y2H approach is that because it is based on a cDNA library screen, it is more likely to detect weak interactions or interactions between low abundance proteins. Furthermore, it is the method of choice where the focus is on proteins that may only be expressed in rare cell types. On the other hand, it is contingent upon proteins, or protein fragments maintaining their ability to fold correctly and to associate when removed from their normal cellular environment because by definition these interactions must take place within yeast, often in subcellular regions unlike that which they normally inhabit and without their normal complement of associated proteins and post-translational modifications. In many situations this may present a significant problem, especially when membrane
proteins enter the equation. The other side of the coin is that incorrectly folded "bait" or "prey" proteins, while failing to interact with their cognate partners, may display other spurious interactions and hence give rise to false positives.
The pull-down approach has provided valuable data in a variety of systems. However, it has two limitations. The first of these is the problem of scale when dealing with low abundance proteins. Simply put, such proteins may not be detected where it proves impractical to prepare or manipulate sufficient start material. The second limitation concerns solubility. Conditions required to sohibilize many bait proteins may not be compatible with preserving interactions with partner proteins and vice versa. This becomes especially significant when considering weak interactions. In the case of lamin A, a highly insoluble protein, this has proved to be a serious stumbling block in the reliable identification of interacting proteins. Recently, Kubben et al. (2010) have introduced a work-around for this problem. They have used chemical cross-linking to stabilize lamin complexes before solubilization and pull- down. Significantly, this approach detected many of the same putative LaA interactors that we have now identified using BioID™-LaA. Cross-linking certainly represents a valuable enhancement to the pull-down strategy. However, as an added variable it may in turn introduce additional artifacts such as aggregation.
BioID™ provides an improvement to these prior approaches in the characterization of potential protein-protein interactions and near-neighbor analyses. BioID™ uniquely combines two important attributes. The first of these is that it detects potential interactions in their normal cellular context. The second is that it sidesteps issues associated with bait or prey protein solubility. Because the key step of biotinylation occurs before solubilization it will detect both weak and transient interactions. Both of these features are highlighted in the present example using BioID™-LaA data where both soluble and membrane proteins were efficiently detected.
BioID™ relies on the expression of an exogenous protein that is fused to BirA*, a protein slightly larger than GFP. With respect to myc-BirA*LaA, the fusion protein appears to be targeted appropriately to the nuclear lamina where it shares the same solubility properties as both wild-type and GFP-taggcd LaA. A more subtle issue may arise through biotinylation. Although we observed no evidence of a detrimental effect, our studies have used the addition of excess biotin to cell culture media to enhance the biotinylation of vicinal proteins. The covalent attachment of
biorin to primary amines, predominantly lysines, leads to the loss of charge on these sites and at the same time could inhibit other secondary modifications. These effects might in turn alter the behaviors of both the fusion protein and neighboring proteins. The efficacy of BioID™ is contingent upon the ability to biotinylate neighboring proteins, which is in turn dependent on the number and availability of primary amines in these proteins. Consequently, the abundance of the biotinylated proteins should not be used to indicate the strength or abundance of an association. Similarly, the absence of biotinylation does not rule out interaction or proximity. BioID™-mediated biotinylation is not generally used to validate an actual protein interaction, but instead is used as a screen to identify candidates that can be subsequently investigated systematically or in a hypothesis-based manner.
Based on our mass spectrometry results we see clear evidence that BioID™ identifies well-characterized protein interactors of LaA, including a number of proteins detected by Kubben et al. (2010) using complex purification in combination with chemical cross-linking (LAP1, LAP2 isoforms, Emerin, and MAN1). It is clear that most (if not all) of the more abundant proteins identified with BioID™-LaA, amounting to more than 50% of those detected, largely reside in close proximity to the INM. These could fell into the transient, indirect, or vicinal categories.
Furthermore, BioID™ identified SLAP75, a previously uncharacterized protein that is clearly enriched at the nuclear lamina. Other NE proteins in the dataset are lamins B 1 and B2 and lamin B receptor (LBR). The relatively low level of detection of B-type lamins could be a reflection of findings that A- and B-type lamins may be segregated into separate filament systems (Shiim et al., 2008). LBR is not known to interact with A-type lamins and its appearance could be simply a consequence of indirect interactions and/or proximity. However, it was detected by Kubben et al. (2010) using their approach of LaA affinity-capture combined with chemical cross-linking.
Also included in the list of identified proteins, albeit at reduced levels, are many nuclear proteins associated with DNA repair, transcription, chromatin regulation, and RNA processing. These proteins are not predominantly enriched at the NE, raising the question of how they were biotinylated by BioID™-LaA. We propose that these represent either a subpopulation of nuclear proteins that transiently associate with LaA at the NE and/or were biotinylated by nucleoplasm^ BioH)™- LaA (Goldman et al., 2002). Several of these proteins have what could be described as a circumstantial connection to LaA and might therefore be pan of a LaA interaction
network. PARP1, MDCl, NUMA, and NONO were all detected as part of the BAF proteome (Montes de Oca et al., 2009), i.e., they associate either directly or indirectly with BAF, while BAF itself is known to associate directly with LaA (Holaska et al., 2003). Consequently, the detection of these proteins by BioID™-LaA could be a reflection of these reciprocal associations. However, BAF itself was not picked up in the BioID™-LaA screen, potentially due tc its small size (89 residues) and/or due to limited association in these cells.
Mutations in LMNA and EMD, the gene encoding emerin, both give rise to Emery-Dreifuss muscular dystrophy (EDMD2, 3, and EDMD1, respectively; Bione et al., 1994; Bonne et al., 1999, 2000; Raffaele Di Barletta et al., 2000). LaA and emerin are known to interact (Lee et al., 2001); indeed, emerin was one of the more abundant proteins detected in the BioID™-LaA screen. Defects in the genes encoding at least three other proteins, nesprin-1, nesprin-2, and FHL1 (four-and-a-half LIM protein 1) are also known to cause EDMD (EDMD4-6, respectively; Zhang et al., 2007;
Gueneau et al., 2009). Both nesprin- 1 and nesprin-2 are LINC complex and NE components. FHL1, although apparently nucleoplasmic and cytoplasmic (there are three splice isoforms), was detected by BioID™-LaA. This raises the possibility that these proteins may constitute an interaction network that if disrupted gives rise to the common phenotype of EDMD (Simon and Wilson, 2011).
Some of the proteins identified by BioID™-LaA are classified as either cytoplasmic or ER residents. The latter are all membrane proteins. It is possible that at least some of these could have access to the Π Μ, although not concentrate there. Certainly there is precedent for this (Torrisi and Bonatti, 1985; Torrisi et al., 1 87). Alternatively, these cytoplasmic and ER proteins might become biotinylated during mitosis when the NE breaks down and lamins aie dispersed throughout the cytoplasm. We are currently investigating the application of BioID in synchronized cell populations that may shed light on these possibilities. It should be noted that ACE (angiotensin-converting enzyme), a type-I membrane protein synthesized in the ER, was likely identified due to nonspecific binding, as there are no available primary amines for biotinylation in its cytoplasmic domain.
Several cytoplasmically oriented NPC proteins, including Nup214 and Nup358, were found to be biotinylated. As with the cytoplasmic and ER proteins, it is possible that this biotinylation occurs during mitosis. It is also possible that this might occur during nuclear import of myc-BirA*LaA. However, the fact that import
receptors were not detected in the screen places a question mark over this. On the other hand, the large size of these nucleoporins may have biased their identification by mass spectrometry. The significantly more abundantly represented nucleoplasmic NPC proteins such as Nupl 53, Nup50, ELYS, and TPR likely reflect the close association between the NPCs and the lamina (Daigle et at., 2001). Certainly, Nupl53 has already been shown to interact with LaA (Al-Haboubi et al., 201 1). These interactions could explain the altered distribution of NPCs observed in LMNA- deficient cells (Sullivan et al., 1 99).
Based on our results here, we conservatively estimate that 50% of the proteins detected by BioID™-LaA predominantly reside in the INM, the nuclear lamina, or the nucleoplasmic face of NPCs. The nuclear lamina in mammalian somatic cells is generally agreed to be ~ 15 20 nm thick (Aaronson and Blobel, 1 75; Dwyer and Blobcl, 1976) and is closely apposed to the INM (Geracc and Hubcr, 2012). Nupl53 and Nup50 appear to be associated with the nuclear ring of NPCs, as does the N- terminal region of TPR (Guan et al., 2000; rull et al., 2004). This would place these nucleoporins at about the level of the nuclear lamina. Taken together, these findings suggest that roughly 50% of detected proteins likely reside within -20-30 nm of the nearest LaA molecule, and could well be much closer. More accurate measurements are hmited by the population of nucleoplasmic mycBirA*LaA and the considerable mobility of most proteins over the 24-h labeling period.
A significant finding that lends additional credence to the utility of BioID™ is the observation that histones, which are lysine rich and highly abundant in the nucleus, constitute a disproportionately small f action of the identified proteins. This indicates that BioID™ is not generating widespread biotinylation, but is more selectively labeling only those proteins in immediate proximity to the fusion protein. This could also be inferred from our fluorescence microscopy data where the streptavidin labeling is colocalized with the myc-BirA*LaA and restricted largely to the NE. It should also be noted that low levels of histones are reported to be biotinylated in vivo ( uroishi et al., 2011). We detected biotinylated histones (HI .3/H1.4 and HI .0) in our control preparations at levels substantially lower than the four endogenously biotinylated mammalian carboxylases.
In summary, BioID™ provides a powerful new approach to probe protein interactions and proximity in a variety of cell types, it is a technique that is accessible to a broad range of researchers comfortable working with conventional molecular and
cell biology techniques, and does not require specialized equipment other than the proteomic analysis that has become a commonly available service. . While the present example focuses on mammalian cells, BiolD™ can be applied in cells from a wide variety of species as well as in model organisms.
Materials and methods
Plasmids
Humanized BirA (Mechold et al., 2005) was mutated to Rl 18G by overlap extension PCR. Products for both the WT and Rl 18G contain a 5' Sail site and at the 3' end, an Xhol, stop codon and Aflll. These were digested with Sail and Aflll and inserted into pcDNA3.1 C-terminal to a myc-epitope digested with Xhol and Aflll. Human LaA was excised from pcDNA3.1 by Xhol and Aflll and inserted in frame with the mycBirA* in pcDNA3.1 using the same restriction sites. The entire myc- BirA*LaA sequence was removed from pcDNA3.1 by Nhel and Aflll, bunted and inserted into pRetroX™.Tight.puro that was digested with EcoRI and blunted. Clones were screened for proper directionality. pRetroX™.Tight.puro is a puromycin selectable mammalian expression vector that contains a Tet-on-based tetracycline- inducible promoter to inducibly regulate expression.
Cell culture and generation of stable cell lines
pRetroX™ Tet-ON Advanced HEK293 cells (Takara Bio Inc.) that stably express the doxycycline-regulated transactivator protein were transiently transfected with pRetroX™-Tight.puro myc-BirA*LaA with Lipofectamine 2000 (Invitrogen; Roux et al., 2009). Cells began selection with 0.5 μg ml puromycin 48 h after transfection. Upon colony formation, subclones were isolated and screened by immunofluorescence after induction by the addition of 1 ml doxycycline for 24 h. Immunofluorescence
Cells were fixed with 3% paraformaldehyde PBS and permeabilized in 0.4% Triton X-100/PBS (Roux et al., 2009). Differential permeabilization was performed after paraformaldehyde fixation with 0.001% digitonin at 4°C for 10 min (Crisp et al., 2006; Liu et al, 2007; Roux et al., 2009). Mouse anti-myc (1:10 9E10; American Type Culture Collection) and strcptavidin-568 (1:1,000; Invitrogen) were used to identify myc fusion proteins and biotinylated proteins, respectively. Other antibodies include rabbit ann-FAM16 A SLAP75 (1 :200; Sigma-Aldrich), mouse anti-HA (1:200 12CA5; Covance), mouse anti-Nupl53 (1:2, SA1; Bodoor et al., 1999), and
mouse anti-LaA (1:100, XB10; Horton et al., 1992). Proteins were visualized with goat anti-mouse, goat anti-rabbit, or streptavadin coupled to Alexa Fluor 488 or -568 (1:1,000; Invitrogen). DNA was detected with Hoechst dye 33258. Coverslips were mounted in 10% Mowiol 4-88. The majority of images were obtained at 25°C using either a Leica DM B microscope (40x/l .00 PL FLUOTAR™ oil PH3 and 63x/l .32 HCL PL APO oil PH3 Leica objectives) running IPLab IVisian software, or an Applied Precision Delta Vision Core system based on an Olympus LX71 microscope equipped with a 60x NA 1.42 lens. Image acquisition and processing was accomplished using DeltaVision Resolve3D and Softworx 4.1.0 software. Both microscope systems were equipped with Photometries CoolSnap HQ cameras. Some conventional epifluorescence images of HA-SLAP75-transfected HeLa cells were acquired using a Zeiss Axioimager.Zl equipped with a 63x NA 1.4 lens and CoolSnap HQ camera.
Western blotting
Cells were lysed in Laemmli SDS-sample buffer, separated by SDS-PAGE and transferred to nitrocellulose (Liu et al., 2007). Immunoblotting was performed (Liu et al., 2007) with rabbit anti-myc (1:50,000; Abeam). Biotinylated proteins were detected similarly with the following modifications. Membranes were blocked in 2.5% bovine serum albumin in PBS with 0.4% Triton X-100 and incubated in the same buffer with HRP-conjugated streptavidin ( 1 :40,000; Invitrogen).
Affinity capture of biotinylated proteins
Cells were incubated for 24 h in complete media supplemented with 1 μ^'ηιΐ doxycycline and 50 μΜ biorin. After three PBS washes, cells (for small-scale analysis, <107; for large scale analysis, 4 1 7) were lysed at 25°C in 1 ml lysis buffer (50 mM Tris, pH 7.4, 500 mM NaCl, 0.4% SDS, 5 mM EDTA, 1 mM DTT, and lx Complete protease inhibitor [Roche]) and sonicated. Triton X-100 was added to 2% final concentration. After further sonication, an equal volume of 4°C 50 mM Tris (pH 7.4) was added before additional sonication (subsequent steps at 4°C) and centrifugation at 16,000 relative centrifugal force. Supernatants were incubated with 600 μΐ Dynabeads™ (MyOne Steptavadin CI; Invitrogen) overnight. Beads were collected and washed twice for 8 min at 25°C (all subsequent steps at 25°C) in 1 ml wash buffer 1 (2% SDS in dr^O). This was repeated once with wash buffer 2 (0.1% deoxycholate, 1% Triton X-100, 500 mM NaCl, 1 mM EDTA, and 50 mM Hepes, pH 7.5), once with wash buffer 3 (250 mM LiCl, 0.5% NP-40, 0.5% deoxycholate, 1 mM
EDTA, and 10 mM Tris, pH 8.1) and twice with wash buffer 4 (50 mM Tris, pH 7.4, and 50 mM NaCl). 10% of the sample was reserved for Western blot analysis. Bound proteins were removed from the magnetic beads with 50 μΐ of Laemmli SDS-sample buffer saturated with biotin at 98°C. For the larger scale preparation, 90% of the sample to be analyzed by mass spectrometry was washed twice in 50 mM NH4HCO3. Protein identification by mass spectrometry
For reduced scale experiments, proteins eluted from the streptavidin beads by SDS-sample buffer were reduced and alkylated and separated by ID SDS-PAGE. Separated proteins were visualized by colloidal Coomassie blue staining. The whole gel lane was cut in 24 equal-sized gel bands, destained, and submitted to tryptic in-gel digestion, all using perforated microtiter plates (Proxeon) with exchange of solvents by low-speed centritugation. Peptides were eluted into V-bottom polypropylene microtiter plates, freeze-dried, dissolved in 0.1% formic acid in water, and submitted to nano-flow HPLC coupled to a QTOF mass spectrometer (1260 nanoHPLC [Agilent Technologies] and QTOF 6554 with ChipCube [Agilent Technologies]). Separation of peptides was performed on a 150-mm χ 75-um C 18 Reprosil column in a chip (Chip II; Agilent Technologies). The applied gradient was from 8% acetonitrile in water with 0.2% formic acid to 35% acetonitrile in water with 0.2% formic acid over 35 min. The mass spectrometer calibration was maintained by continuous submission of a calibrant solution and recalibration of the acquired spectra after the analytical run. The LC-MS/MS system was controlled by MassHunter™ Acquisition software (Agilent Technologies), 4 MS spectra per second and 3 MS/MS spectra per second were collected. The MS to MS/MS switching was done data dependent with a threshold of 1,000 counts and a charge of 2-4 for the peptides. Raw data were converted into mzdata.xml using MassHunter™ Qualitative Analysis software
(Agilent Technologies) and database search was performed using MASCOT™ 3.2 (MatrixScience) and human IPI database (version 3.65). Carboxymethylated Cys was set as fixed modification, oxidized Met, deamidation of Asn and Gin, pyroGlu formation of the N terminus and acetylation of the N terminus as variable modification. The resulting .dat files were loaded into SCAFFOLD Q+ (Proteome Software). The acceptance level for proteins was two identified peptides with minimum 95% probability each. Spectra of candidates were verified visually.
For large-scale analysis, on bead tryptic digests were analyzed by 1 D LC/MS MS by the Sanford-Burnham Proteomic Facility (La Jo 11a, CA). Tris(2-
carboxyethyljphosphine (TCEP) was added to 100 μΐ of beads suspension mix and proteins were reduced at 37°C for 30 min. Iodoacetamide was added (to 20 mM) and proteins were alkylated at 37°C for 40 min in the dark. Mass spectrometry grade trypsin (Promega) was added (—1 :50 ratio) for overnight digestion at 37°C. Magnetic beads were removed by centrifugation. Formic acid was added to the peptide solution (to 2%) before on-line analysis of peptides by high-resolution, high-accuracy LC- MS MS, consisting of a Michrom™ HPLC, a 15-cm Michrom Magic™ CI 8 column, a low-flow ADVANCED Michrom™ MS source, and a LTQ-Orbitrap™ XL (Thermo Fisher Scientific). A 120-min gradient of 10-30%B (0.1% formic acid, 100% acetonitrile) was used to separate the peptides. The total LC time was 141 min. The LTQ-Orbitrap™ XL was set to scan precursors in the Orbitrap at a resolution of 60,000, followed by data-dependent MS/MS of the top four precursors. Raw LC- MS MS data were submitted to Sorcerer Enterprise™ (Sagc-N Research Inc.) for protein identification against the IPI human protein database, which contains semi- tryptic peptide sequences with the allowance of up to two missed cleavages and precursor mass tolerance of 50.0 ppm. A molecular mass of 57 D was added to all cysteines to account for carboxyamidometliylation. Differential search included 16 D for methionine oxidation, and 226 D on N terminus and lysine for biotinylation. Search results were sorted, filtered, statically analyzed, and displayed using
PeptideProphet™ and ProteinProphet™ (Institute for Systems Biology). The minimum trans-proteomic pipeline (TPP) probability score for proteins was set to
0.95, to assure TPP error rate of lower than 0.01.
References
Aaronson, R.P., and G. Blobel. 1975, Isolation of nuclear pore complexes in
association with a lamina. Proceedings of the National Academy of Sciences of the United States of America. 72: 1007- 11.
Al-Haboubi, T., D.K. Shumaker, J. oser, M. Wehnert, and B. Fahrenkrog. 2011.
Distinct association of the nuclear pore protein Nupl53 with A- and B-type lamins. Nucleus. 2.
Beckett, D., E. Kovaleva, and P.J. Schatz. 1999. A minimal peptide substrate in biotin holoenzyme synthctasc-catalyzed biotinylation. Protein Sci. 8:921-9.
Bione, S., E. Maestrini, S. Rivella, M. Mancini, S. Regis, G. Romeo, and D. Toniolo.
1994. Identification of a novel X-linked gene responsible for Emery-Dreiruss muscular dystrophy. Nature genetics. 8:323-7.
Bodoor, ., S. Shaikh, D. Salina, W.H. Raharjo, R. Bastos, M. Lohka, and B. Burke. 1999. Sequential recruitment of PC proteins to the nuclear periphery at the end of mitosis. Journal of cell science. 112 ( Pt 13):2253-64.
Bonne, G., M.R. Di Barletta, S. Varnous, H.M. Becane, E.H. Hanunouda, L. Merlini, F. Muntoni, C.R. Greenberg, F. Gary, J.A. Urtizberea, D. Duboc, M. Fardeau,
D. Toniolo, and K. Schwartz. 1 99. Mutations in the gene encoding lamin A/C cause autosomal dominant Emery-Dreifuss muscular dystrophy. Nature genetics. 21 :285-8.
Bonne, G., E. Mercuri, A. Muchir, A. Urtizberea, H.M. Becane, D. Recan, L. Merlini, M. Wehnert, R. Boor, U. Reuner, M. Vorgerd, E.M. Wicklein, B. Eymard, D.
Duboc, I. Penisson-Besnier, J.M. Cuisset, X. Ferrer, I. Desguerre, D.
Lacombe, K. Bushby, C. Pollitt, D. Toniolo, M. Fardeau, . Schwartz, and F. Muntoni. 2000. Clinical and molecular genetic spectrum of autosomal dominant Emery-Dreifuss muscular dystrophy due to mutations of the lamin A C gene. Annals of neurology. 48:170-80.
Broers, J.L., B.M. Machiels, GJ. van Eys, HJ. uijpers, E.M. Manders, R. van Driel, and F.C. Ramaekers. 1999. Dynamics of the nuclear lamina as monitored by GFP-tagged A-type lamins. Journal of cell science. 1 12 ( Pt 20):3463-75.
Burke, B., and K J. Roux. 2009. Nuclei take a position: managing nuclear location.
Developmental cell. 17:587-97.
Castano, E., V.V. Philimonenko, M. ahle, J. Fukalova, A. Kalendova, S. Yildirim, R. Dzijak, H. Dingova-Krasna, and P. Hozak. 2010. Actin complexes in the cell nucleus: new stones in an old field. Histochemistry and cell biology. 133:607-26.
Chapman-Smith, A., and J.E, Cronan, Jr. 1999, Molecular biology of biotin
attachment to proteins. JNutr. 129:477S-484S.
Choi-Rhee, E., H. Schulman, and J.E. Cronan. 2004. Promiscuous protein
biotinylation by Escherichia coli biotin protein Hgase. Protein Sci. 13:3043-50. Crisp, M., Q. Liu, K. Roux, J.B. Rattner, C. Shanahan, B. Burke, P.D. Stahl, and D.
Hodzic. 2006. Coupling of the nucleus and cytoplasm: role of the LINC complex. The Journal of cell biology. 172:41-53.
Cronan, J.E. 2005. Targeted and proximity-dependent promiscuous protein
biotinylation by a mutant Escherichia coli biotin protein ligase. JNutr
Biochem. 16:416-8.
Daigle, N., J. Beaudouin, L. Hartnell, G. Imreh, E. Hallberg, J. Lippincott-Schwartz, and J. Ellenberg. 2001. Nuclear pore complexes form immobile networks and have a very low turnover in live mammalian cells. The Journal of cell biology. 154:71-84.
Dechat, T., B. Korbei, OA. Vaughan, S. Vlcek, C.J. Hutchison, and R. Foisner. 2000.
Lamina-associated polypeptide 2alpha binds intranuclear A-type lamins.
Journal of cell science. 113 Pt 19:3473-84.
Dwyer, N., and G. Blobel. 1 76. A modified procedure for the isolation of a pore complex-lamina fraction from rat liver nuclei. The Journal of cell biology. 70:581-91.
Foisner, R., and L. Gerace. 1993. Integral membrane proteins of the nuclear envelope interact with lamins and chromosomes, and binding is modulated by mitotic phosphorylation. Cell. 73: 1267-79.
Gerace, L., and M.D. Huber. 2012. Nuclear lamina at the crossroads of the cytoplasm and nucleus. Journal of structural biology, 177:24-31.
Goldman, R.D., Y. Gruenbaum, R.D. Moir, D.K. Shumaker, and T.P. Spann. 2002.
Nuclear lamins: building blocks of nuclear architecture. Genes &
development. 16:533-47.
Green, N.M. 1 63. Avidin. 1. The Use of (l4-C)Biotin for Kinetic Studies and for Assay. Biochem J. 89:585-91.
Guan, T., R.H. Kehlenbach, E.C. Schirmer, A. Kehlenbach, F. Fan, B.E. Clurman, N.
Arnheim, and L. Gerace. 2000. Nup50, a nucleoplasmically oriented nuclcoporin with a role in nuclear protein export. Molecular and cellular biology. 20:5619-30.
Gudise, S., R.A. Figueroa, R. Lindberg, V. Larsson, and E. Hallberg. 201 1 , Sampl is functionally associated with the LINC complex and A-type lamina networks. Journal of cell science. 124:2077-85.
Gueneau, L., A.T. Bertrand, J.P. Jais, M.A. Salih, T. Stojkovic, M. Wehnert, M.
Hoeltzenbein, S. Spuler, S. Saitoh, A. Verschueren, C. Tranchant, M. Beuvin, E. Lacene, N.B. Romero, S. Heath, D. Zelenika, T. Voit, B. Eymard, R. Ben
Yaou, and G. Bonne. 2009. Mutations of the FHL1 gene cause Emcry- Dreifuss muscular dystrophy. American journal of human genetics. 85:338-53.
Hase, M.E., and V.C. Cordes. 2003. Direct interaction with nupl53 mediates binding of Tpr to the periphery of the nuclear pore complex. Molecular biology of the cell. 14: 1923-40.
Holaska, J.M., .K. Lee, A.K. Kowalski, and K.L. Wilson. 2003. Transcriptional repressor germ cell-less (GCL) and barrier to autointegration factor (RAF) compete for binding to emerin in vitro. The Journal of biological chemistry. 278:6969-75.
Holaska, J.M., and K.L. Wilson. 2006. Multiple roles for emerin: implications for
Emery-Dreifuss muscular dystrophy. The anatomical record. Part A,
Discoveries in molecular, cellular, and evolutionary biology. 288:676-80. Horton, H., I. McMorrow, and B. Burke. 1 92. Independent expression and assembly properties of heterologous lamins A and C in murine embryonal carcinomas.
European journal of cell biology. 57: 172-83.
Krull, S., J. Thyberg, B. Bjorkroth, H.R. Rackwitz, and V.C. Cordes. 2004.
Nucleoporins as components of the nuclear pore complex core structure and
Tpr as the architectural element of the nuclear basket. Molecular biology of the cell. 15 :4261 -77.
Kubben, N., J.W. Voncken, J. Demmers, C. Calis, G. van Almen, Y. Pinto, and T.
Misteli. 2010. Identification of differential protein interactors of lamin A and progeria Nucleus. 1 :513-25.
Kulman, J.D., M. Satake, and J.E. Harris. 2007. A versatile system for site-specific enzymatic biotinylation and regulated expression of proteins in cultured mammalian cells. Protein expression and purification. 52:320-8.
Kuroishi, T.5 L. Rios-Avila, V. Pestinger, S.S. Wijeratne, and J. Zempleni. 2011.
Biotinylation is a natural, albeit rare, modification of human histones.
Molecular genetics and metabolism. 104:537-45.
Kwon, ., and D. Beckett. 2000. Function of a conserved sequence motif in biotin holoenzyme synthetases. Protein science : a publication of the Protein
Society, 9: 1530-9.
Kwon, K., E,D. Streaker, S. Ruparelia, and D. Beckett. 2000. Multiple disordered loops function in corcprcssor-induccd dimerization of the biotin repressor. J Mol Biol. 304:821-33.
Lane, M.D., .L. Romioger, D.L. Young, and F. Lynen. 1964. The Enzymatic
Synthesis of Holotranscaiboxylase from Apotranscarboxylase and (+)-Biotin.
Ii. Investigation of the Reaction Mechanism. J Biol Chem. 239:2865-71. Lee, . ., T. Haraguchi, R.S. Lee, T. Koujin, Y. Hiraoka, and K.L. Wilson. 2001.
Distinct functional domains in emerin bind lamin A and DNA-bridging protein
BAF. Journal of cell science. 114:4567-73.
Liu, Q., N. Pante, T. Misteli, M. Elsagga, M. Crisp, D. Hodzic, B. Burke, and KJ,
Roux. 2007. Functional association of Sunl with nuclear pore complexes. The
Journal of cell biology. 178:785-98,
Mansharamani, M., and K.L. Wilson. 2005. Direct binding of nuclear membrane protein MANl to emerin in vitro and two modes of binding to barrier-to- autointegration factor. The Journal of biological chemistry. 280: 13863-70. Mechold, U., C. Gilbert, and V. Ogryzko. 2005. Codon optimization of the BirA enzyme gene leads to higher expression and an improved efficiency of biotinylation of target proteins in mammalian cells. Journal of biotechnology.
116:245-9.
Moir, R.D., M. Yoon, S. Khuon, and R.D. Goldman. 2000. Nuclear lamins A and B 1 : different pathways of assembly during nuclear envelope formation in living cells. The Journal of cell biology. 151 : 1155-68.
Montes de Oca, R., C.J. Shoemaker, M. Gucek, R.N. Cole, and K.L. Wilson. 2009.
Barrier-to-autointegration factor proteome reveals chromatin-regulatory partners. PloSone. 4:e7050.
Nesbeth, D., S.L. Williams, L. Chan, T. Brain, N.K. Slater, F. Farzaneh, and D.
Darling. 2006. Metabolic biotinylation of lentiviral pseudotypes for scalable paramagnetic microparticle-dependent manipulation. Molecular therapy : the journal of the American Society of Gene Therapy. 13:814-22.
Raffaele Di Barletta, M., E. Ricci, G. Galluzzi, P. Tonali, M. Mora, L. Morandi, A.
Romorini, T. Voit, K.H. Orstavik, L. Merlini, C. Trevisan, V. Biancalana, I. Housmanowa-Petrusewicz, S. Bione, R. Ricotti, K. Schwartz, G. Bonne, and D. Toniolo. 2000. Different mutations in the LMNA gene cause autosomal dominant and autosomal recessive Emcry-Drcifuss muscular dystrophy. American journal of human genetics. 66: 1407-12.
Rasala, B.A., C. Ramos, A. Harel, and D.J. Forbes. 2008. Capture of AT-rich chromatin by ELYS recruits POM121 and NDC1 to initiate nuclear pore assembly. Molecular biology of the cell. 1 :3982-96.
Roux, K.J., M.L. Crisp, Q. Liu, D. Kim, S. Kozlov, C.L. Stewart, and B. Burke. 2009.
Nesprin 4 is an outer nuclear membrane protein that can induce kinesin- mediated cell polarization. Proceedings of the National Academy of Sciences of the United States of America. 106:2194-9.
Sang, L., JJ. Miller, K.C. Corbit, RH. Giles, M.J. Brauer, E.A. Otto, L.M. Baye, X.
Wen, S J. Scales, M. Kwong, E.G. Huntzicker, M.K. Sfakianos, W. Sandoval, J.F. Bazan, P. Kulkarni, F.R. Garcia-Gonzalo, A.D. Seol, J.F. O'Toole, S.
Held, H.M Reutter, W.S. Lane, M.A. Rafiq, A. Noor, M. Ansar, A.R. Devi, V.C. Sheffield, D.C. Slusarski, J.B. Vincent, D.A. Doherty, F. HUdebrandt, J.F. Reiter, and P.K. Jackson. 2011. Mapping the NPHP-JBTS-MKS protein network reveals ciliopathy disease genes and pathways. Cell. 145:513-28. Schirmer, E.C., L. Florens, T. Guan, J.R. Yates, 3rd, and L. Gerace. 2003. Nuclear membrane proteins with potential disease links found by subtractive proteomics. Science. 301 : 1380-2.
Segura-Totten, M., A.K. Kowalski, R. Craigie, and K.L. Wilson. 2002. Bamer-to- autointegration factor: major roles in chromatin decondensation and nuclear assembly. The Journal of cell biology. 158:475-85.
Shimi, T„ K. Pfleghaar, S. Kojima, C.G. Pack, I. Solovei, A.E. Goldman, S.A. Adam, D.K. Shumaker, M. Kinjo, T. Cremer, and R.D. Goldman. 2008. The A- and B-type nuclear lamin networks: microdomains involved in chromatin organization and transcription. Genes & development. 22:3409-21.
Shumaker, D,K,, T. Dechat, A. Kohlmaier, S.A. Adam, M.R, Bozovsky, M.R. Erdos, M. Eriksson, A.E. Goldman, S. Khuon, F.S. Collins, T. Jenuwein, and R.D. Goldman. 2006. Mutant nuclear lamin A leads to progressive alterations of epigenetic control in premature aging. Proceedings of the National Academy of Sciences of the United States of America. 103 8703-8.
Simon, D.N., and K.L. Wilson. 2011. The micleoskeleton as a genome-associated dynamic 'network of networks'. Nature reviews. Molecular cell biology. 12:695-708.
Streaker, E.D., and D. Beckett. 2006. Nonenzymatic biotinylation of a biotin carboxyl carrier protein: unusual reactivity of the physiological target lysine. Protein Sci. 15:1928-35.
Sukegawa, J., and G. Blobcl. 1993. A nuclear pore complex protein that contains zinc finger motifs, binds DNA, and faces the nucleoplasm. Cell. 72:29-38.
Sullivan, T., D. Escalante-Alcalde, H. Bhatt, M. Anver, N. Bhat, K. Nagashima, C.L.
Stewart, and B. Burke. 1999. Loss of A-type lamin expression compromises nuclear envelope integrity leading to muscular dystrophy. The Journal of cell biology. 147:913-20.
Torrisi, .R., and S. Bonatti. 1985. Immunocytochemical study of the partition and distribution of Sindbis virus glycoproteins in freeze-fractured membranes of infected baby hamster kidney cells. The Journal of cell biology. 101 : 1300-6.
Torrisi, .R., L.V. Lotti, A. Pavan, G. Migliaccio, and S. Bonatti. 1987. Free
diffusion to and from the inner nuclear membrane of newly synthesized plasma membrane glycoproteins. The Journal of cell biology. 104:733-7. van Steensel, B., and S. Henikoff. 2000. Identification of in vivo DNA targets of chromatin proteins using tethered dam methyltransferase. Nature
biotechnology. 18:424-8.
Walther, T.C., M. Fornerod, H. PickersgiU, M. Goldberg, T.D. Allen, and I.W. Mattaj.
2001. The nucleoporin Nupl53 is required for nuclear pore basket formation, nuclear pore complex anchoring and import of a subset of nuclear proteins. The EMBO journal. 20:5703-14.
Worman, H.J. 2012. Nuclear lamins and laminopathies. The Journal of pathology.
226:316-25.
Worman, H.J., L.G. Fong, A, Muchir, and S.G. Young, 2009. Laminopathies and the long strange trip from basic cell biology to therapy. The Journal of clinical investigation. 1 19: 1825-36.
Zhang, Q., C. Bethmann, N.F. Worth, J.D. Davies, C. Wasner, A. Feuer, CD.
Ragnauth, Q. Yi, J.A. Mellad, D.T. Warren, M.A. Wheeler, J.A. Ellis, J.N. Skepper, M. Vorgerd, B. Schlotter-Weigel, P.L. Weissberg, R.G. Roberts, M. Wchnert, and CM. Shanahan. 2007. Ncsprin-1 and -2 arc involved in the pathogenesis of Emery Dreifuss muscular dystrophy and are critical for nuclear envelope integrity. Human molecular genetics. 16:2816-33.
Example 2. 2nd Generation BioID™
To optimize the original 1 generation BioID™ method (Gl-BioID™) we have developed a 2nd generation BioID™ (G2-BioID™) whose primary improvement is a considerable reduction in the size of the promiscuous biotin ligase. Whereas Gl- BioID™ is based on a mutation (Rl 18G) of the E. coli BirA gene, G2-BioID™ is based on a mutation (R40G) of biotin protein ligase (BPL) from A. aeolicus. A humanized nucleic acid sequence for G2-BioID was generated by gene synthesis such that it could be inserted into a mammalian expression vector (pcDNA3.1). The original version has an N-terminal myc-epitope tag for protein detection. G2-BioID™ is 234 amino acids (26.56 kDa), whereas Gl -BioID™ is 322 amino acids (35.22 kDa), both calculated without the 9 amino acid myc-epitope tag. The wild type ligases that are the basis for Gl -BioID™ and G2-BioID™ have been structurally characterized. The dimensions (length and width) of Gl-BioID™ are 7.4 x 4.0 nm and for G2-BioID™ are 5.2 x 4.2 nm. This considerable size difference (~30% difference) is due to the presence of a naturally occurring DNA-binding domain that is found in Gl -BioID™ but not in the G2-BioID™.
Nucleic acid sequence for G2-BioID (untagged)
ATGTTCAAGAACCTGATCTGGCTGAAGGAGGTGGACAGCACCCAGGAGA G
ACTGAAGGAGTGGAACGTGAGCTACGGCACCGCCCTGGTGGCCGACAGA C
AGACCAAGGGCAGAGGCGGCCTGGGCAGAAAGTGGCTGAGCCAGGAGGG C
GGCCTGTACTTCAGCTTCCTGCTGAACCCCAAGGAGTTCGAGAACCTGCT GCAGCTGCCCCTGGTGCTGGGCCTGAGCGTGAGCGAGGCCCTGGAGGAGA TCACCGAGATCCCCTTCAGCCTGAAGTGGCCCAACGACGTGTACTTCCAG GAGAAGAAGGTGAGCGGCGTGCTGTGCGAGCTGAGCAAGGACAAGCTGA T
CGTGGGCATCGGCATCAACGTGAACCAGAGAGAGATCCCCGAGGAGATC A
AGGACAGAGCCACCACCCTGTACGAGATCACCGGCAAGGACTGGGACAG A
AAGGAGGTGCTGCTGAAGGTGCTGAAGAGAATCAGCGAGAACCTGAAGA A
GTTCAAGGAGAAGAGCTTCAAGGAGTTCAAGGGCAAGATCGAGAGCAAG A TGCTGTACCTGGGCGAGGAGGTGAAGCTGCTGGGCGAGGGCAAGATCACC GGCAAGCTGGTGGGCCTGAGCGAGAAGGGCGGCGCCCTGATCCTGACCG A
GGAGGGCATCAAGGAGATCCTGAGCGGCGAGTTCAGCCTGAGAAGAAGC T GA (SEQ ID NO: 15)
Amino acid sequence for G2-BioID (untagged)
MF NLIWLKEVDSTQERL EWNVSYGTALVADRQTKGRGGLGRKWLSQEG GLYFSFLI-NPKEFENLLQLPLVLGLSVSEALEEITEIPFSLKWPNDVYFQ E KVSGVLCELS DKLIVGIGINVNQREIPEE1 DRATTLYEITGKDWDR KEVI KVLKRISENLKKFKEKSFKEFKGKIESKMLYLGEEVKLLGEGKIT GKLVGLSE GGALU^TEEGIKEILSGEFSLRRS (SEQ ID NO; 16)
HEK293 cells were either mock transfected (control) or transfected with myc- Gl- or myc- G2-BioID™ either in the presence or absence of 50 μΜ biotin.
Experiments were carried out as described in Example 1. Results are shown in
Figure 5. As compared to the control lanes, the expression of the Gl - (*) and G2- BioID™ (*) proteins is detected by anti-myc immunoblotting. Promiscuous biotinylation of endogenous proteins is observed with streptavidin-HRP for both Gland G2-BioID™. This effect is enhanced by the presence of exogenous biotin.
We then compared Gl- and G2-BioID™ by fluorescence microscopy. U20S cells were transfected with myc-Gl- or myc-G2-BioID™ either in the presence or absence of 50 uM biotin. Gl - (*) and G2-BioID™ (*) proteins were detected with anti-myc. Promiscuous biotinylation (biotin) of endogenous proteins was observed with streptavidin-488 for both Gl - and G2-BioID™ (See Figure 6). This promiscuous biotinylation is enhanced by the presence of exogenous biotin. Nuclei are labeled with Hoechst dye. Scale bar is 20 um.
We then compared Gl- and G2-BioID™-LaA by immuno- and streptavidin- blot analysis. HEK293 cells were either mock transfected (control) or transfected with
myc-Gl - or myc-G2-BioID -LaA either in the presence of 50 μΜ biotin. Similar levels of promiscuous biotinylation of endogenous proteins is observed with streptavidin-HRP for both G l- and G2-BiolD™-LaA. Expression of the Gl- and G2- BioID-LaA proteins is detected by anti-myc immunoblotting.
Example 3. Exemplary BioID™ protocols
Introduction
The overall process of BioID™ can be broken down into two stages; 1) generation and characterization of a BiolD™-fusion protein in a mammalian (or other) expression vector for stable expression in a cell line and 2) use of that cell line for a large-scale BioID™ pull-down to identify protein candidates by mass spectrometry. Note that this exemplary protocol is specific to the application of BioID™ in common mammalian cell lines. Based on the teachings herein, it will be well within the level of skill in the art to apply BioID™ to a variety of cell types and species.
Design of the BioID fusion protein
The fundamental component of this system is the BioID™ fusion protein. If the purpose of BioID™ is to screen for physiologically relevant protein interactions for a protein of interest then the ideal fusion protein will represent a functional replacement of the original protein. Thus care should be taken in deciding where in the protein the biotin ligase will be incorporated. Prior evidence of successful fusions with similarly sized proteins, such as GFP, can be a good starting point to guide this decision. Consideration of posttranslational modifications, either to the N- or C- terminus must be considered as well. For example, C-terminal prenylation or an N- terminal signal peptide should not be disrupted. This may complicate the molecular cloning process of generating the expression plasmid for the fusion protein, but is worth the effort. If there is little information as to how N- or C-terminal fusions might affect the protein of interest it is recommended to try both in parallel, at least to the point of validating each fusion protein for proper targeting.
Functional validation of the fusion protein
Another consideration is how to ensure that the fusion protein is functional. In many cases this is not possible unless the function of the protein is well known, some form of knockout/knockdown system exists and there is a phenotypic outcome to rescue. However, it is typically possible to at least compare the localization of the fusion protein to a more minimally tagged, or preferably endogenous, protein by immunofluorescence microscopy. Attention should be paid to expression levels as overexpression of proteins can often lead to mis-localization. In most cases low levels of BioID™ fusion proteins, at or below the level of the endogenous protein, are sufficient for the identification of candidates. Thus when validating by transient transfection, attention should be paid to cells expressing lower levels of the fusion protein as this is the desired, and often likely, outcome for the cells that stably express the fusion protein. Choice of cells and expression method
Once the BioID™ fusion protein has been designed and an expression plasmid generated it is necessary to consider the cell type in which to express the BioID™ fusion protein. This is a situation-specific decision, however there are a few concepts to consider. Preferably the expression will be similar in all cells within the population to be used for large-scale BioID™ pull-down. Thus traditional transient transfection is best limited to preliminary functional validation. A population of cells stably expressing similar amounts of the fusion protein is preferred. This can be accomplished with standard random integration or various viral expression systems. Since it is usually sufficient to screen for BioID™ candidates more likely to be physiologically relevant, an extremely low level of protein expression is ideal.
Inducible expression of the fusion protein can be utilized, especially if expression of the bait is toxic. However, inducible expression is not necessary in most instances since biotinylation itself is induced by the addition of excess biotin. BASIC PROTOCOL 1
GENERATION OF CELLS EXPRESSING A BioID FUSION PROTEIN
This protocol describes exemplary techniques for creation and characterization of a BioID™-fusion protein expression plasmid followed by its stable expression in mammalian cells. Techniques such as PCR cloning and generation of stably
expressing cells will not be described as they vary depending on the situation. The methods needed to test the targeting of the fusion protein and its expression and activity in cells prior to a large-scale BiolD pull-down will be described. Materials
1 mM Biotin (see recipe)
PBS
BSA blocking buffer (see recipe)
Streptavidin-Alexa Fluor® (Invitrogen)
Streptavidin-HRP (High Sensitivity Streptavidin-HRP, Thermo Scientific)
ABS Blocking buffer (see recipe)
Quenching solution (see recipe)
SDS-PAGE Electrophoresis Unit (Mini-PROTEAN® II Electrophoresis Cell, Bio- Rad)
Semi-Dry Transfer Cell (Trans-Blot®, Bio-Rad)
Validation of BiolD fusion protein
1. Prepare the expression plasmid for ihe BiolD™ fusion protein.
2. For each experimental condition (eg. control, BiolD™ fusion protein) plate two wells of cells in a standard 6-well plate, one with glass coverslips and the other without
3. Express the fusion protein in the cells of choice by transient transfection.
Process mock or non-transfected cells in parallel. Apply biotin (50 μΜ final concentration) to cells at time of transfection. If transfection protocol requires replacement of media shortly after transfection add biotin at that time.
a. The presence of excess biotin promotes biotinylation by ihe BiolD™ fusion protein (usually over a period of 18-24 hrs). This permits simultaneous analysis of expression, targeting and function of the BiolD™ fusion protein by immunofluorescence microscopy and immunoblot.
4. Process the cells on coverslips for analysis by immunofluorescence
microscopy 1 day following transient transfection.
a. It is recommended to use PFA fixation for 10 min followed by TX-100 permeabilization for 15 min. Use of MeOH fixation will result in strong mitochondrial signal observed with fluorescentfy labeled streptavidin. Use
the labeled streptavidin at 1:1,000 with the secondary antibody and DNA labeling reagent (eg. hoechst, DAPI),
5. Process the cells for immunoblot analysis 1 day following transient
transfection and addition of biotin.
6. Following whole cell lysis, SDS-PAGE separation and protein transfer, agitate membrane in BSA blocking buffer for 20-30 min at RT.
a. BSA is preferred for blocking to eliminate any free biotin that is potentially present in milk or serum. Free biotin will compete with biotinylate proteins for binding to streptavidin-HRP.
7. Agitate membrane in streptavidin-HRP at 1:40,000 in blocking buffer for 40 min at room temperature (RT).
a. May need to optimize concentration depending on source of streptavidin- HRP. This incubation can be performed overnight at 4 °C for convenience.
8. Quickly wash membrane in PBS 2-3 times to wash away unbound streptavidin HRP.
9. Agitate membrane in ABS blocking buffer (see recipe) for 5 min.
a. This step can be used to reduce background signal on membrane.
10. Quickly wash membrane in PBS 2-3 times.
1 1. Agitate membrane in PBS for 5 min.
12. Add enhanced chemiluminescence (ECL) reagent to observe biotinylated proteins.
13. Following successful analysis of biotinylated proteins, quench HRP signal on membrane with 20 min agitation in quenching solution.
a. Removal of streptavidin-HRP with standard membrane stripping methods is futile given the strength of the biotin-streptavidin interaction. The quenching permanently inactivates the HRP allowing further probing with additional antibodies. Consider reapplication of ECL to the membrane and visualization to confirm success of quenching.
14. Wash membrane with PBS multiple times to remove quenching solution.
a. This step removes residual sodium azide and hydrogen peroxide that may affect the following Western blot analysis.
15. Proceed to immunoblot membrane with antibodies specific to the BioID fusion protein (eg. anti-myc/HA) to confirm its expression and migration by SDS-PAGE.
a. Use standard protocol for this beginning with blocking in ABS or equivalent (BSA blocking not required). As an alternative to HA/myc tag detection of the BioID™ fusion protein an anti-BirA antibody is effective by immunofluorescence and immvnoblot (chicken anti-BirA ab 14002, Abeam).
1 . Generation of cells stably expressing BioID™ fusion protein
17. Initiate generation of stable cell lines with validated cell line.
a. This is a highly variable process depending on the strategy and cell type.
18. If subcloning of cells is performed, screen subclones first by
immunofluorescence. Perform immunoblot analysis on subclones that pass the immunofluorescence screening. If viral infection is utilized, screen population of infected cells by immunofluorescence and immunoblot.
a. 50 μΜ biotin can be added to the cells as in Step 3 to monitor biotinylation function of BioID™ fusion protein.
1 . Freeze down multiple vials of stably expressing cells for future BioID1M experiments. BASIC PROTOCOL 2
BioID™ PULL-DOWN TO IDENTIFY CANDIDATE PROTEINS
This protocol describes an exemplary embodiment of the invention using cells stably expressing a BioID™ fusion protein (along with non-expressing control cells) to perform large-scale BioID™ pull-down experiments. The purpose of these experiments is to isolate sufficient amounts of proteins biotinylated by the BioID™ fusion protein to be identified by mass spectrometry. The starting material for these experiments may vary depending on a number of factors. These include the efficiency of biotinylation by the BioID™ fusion protein and the number of desired candidate proteins. This protocol describes the analysis of four confluent 10 cm plates of cells/condition (~ xl07 cells). The protocol ends immediately prior to analysis by mass spectrometry, a service typically performed by a core facility.
Materials
1 mM Biotin (see recipe)
PBS
Protease inhibitor (Halt Protease Inhibitor Cocktail, EDTA-Free, Thermo Scientific)
20% Triton X-100
Lysis buffer (see recipe)
50 mM Tris (pH 7.4)
Dynabeads® (MyOne Steptavadin CI, Life Technologies)
Wash buffer 1 (see recipe)
Wash buffer 2 (see recipe)
Wash buffer 3 (see recipe)
50 mM Ammonium bicarbonate (NHtHCC^)
Sonicator (Sonifier-250, Branson)
Centrifuge (Legend Micro 21R, Thermo Scientific)
MagneSphere® Technology Magnetic Separation Stand (Promega)
Note: Unless otherwise specified, lysis and wash steps can be carried out at RT to avoid precipitation of SDS and deoxycholic acid. To reduce keratin contamination, use unopened DNAse RNAse-free tubes, wear gloves and use a tube cap opener.
Perform cell lysis
1. Begin with four 10 cm dishes for each experimental condition (cells expressing BioID™ constructs or control cells).
The number of cells utilized will correlate with numbers of candidates identified,
2. When cells reach approximately 80% confluency, change medium to fresh complete medium containing 50 μΜ biotin (IX).
3. Incubate cells for 24 hr.
Incubation time may vary depending on the goals of the experiment.
4. Remove medium completely by aspiration and rinse the cells twice at room temperature with 5 ml/dish of PBS.
This step helps to remove residual free-biotin from the medium.
5. Add 600 μΐ of lysis buffer/dish and scrape cells gently to harvest the cells.
This can be be done at RT.
The purpose of this harsh lysis is to try to disrupt all protein interactions and completely denature/solubilize the proteins. Transfer lysed cells to a 15 ml conical tube.
Add 240 μΐ of 20% Triton X-100 (final concentration -2%) and mix by trituration. This tube can be kept on ice during subsequent sonication.
Adding a five fold excess of Triton X-100 dilutes out the SDSand prevents its precipitation at 4°C. Position the probe tip in the sample just above the tube bottom.
Apply sonication for two sessions with 30 pulses using a Branson Sonifier 250 (or equivalent) at 30% duty cycle and an output level of 3. Let the tube sit on ice for 2 min between each session to prevent overheating.
If the sample is still viscous and cloudy after sonication, apply an additional period of sonication. Sonication also functions to shear DNA. Add 2.16 ml of prc-chillcd 50 mM Tris (pH 7.4) and mix well.
This dilution provides more favorable conditions for affinity capture.
Apply one session of sonication (30 pulses using a Branson Sonifier 250 at 30% duty cycle and an output level of 3).
This step helps solubilize any precipitated proteins that may be present due to reduced salt and detergent concentrations from the previous step and assists in mixing the sample. Aliquot the sample evenly to three pre-chilled 2 ml tubes.
Spin down at 16,500 x g relative centrifugal force (RCF) for 10 min at 4°C. Perform affinity purification ofbiotinylated proteins During the centrifugatkra in step 13, place three new 2 ml tubes in the Magnetic Separation Stand and add 0.75 ml of RT lysis buffer and 0.75 ml of RT 50 mM Tris (pH 7.5) to each tube.
Mix the stock of magnetic streptavidin beads well with gentle tapping to resuspend and add 200 μΐ of beads to each tube prepared in step 14. Wait for 3 min RT.
This step junctions to equilibrate the beads in the binding buffer.
Once the beads accumulate at one side of tube wall, remove the supernatant gently by pipetting.
This step removes the buffer the beads equilibrated in. After sample centrifugation, carefully transfer supernatant to the tubes prepared in step 16. Do not disturb the small insoluble pellet on the tube wall when removing the supernatant.
Step 17 can be performed quickly to prevent the beads from drying out. Resuspend the samples and beads with gentle pipetting.
Incubate the tube on a rotator at 4°C overnight.
Place the tubes on the Magnetic Separation Stand and wait 3 min to collect beads at RT.
Remove the supernatant gently by pipetting. Try not to disturb beads on the wall of the tube.
Add 1.5 ml of wash buffer 1 to each tube and resuspend beads gently by pipetting.
Place tubes on a rotator for 8 min at RT.
Remove the supernatant as described in step 20-21.
To pool beads into one tube, add 1.5 ml of wash buffer 1 to one of the three tubes and resuspend gently.
Transfer resuspended sample in step 25 to another tube. Repeat this step to combine three samples into one tube in a final volume of 1.5 ml buffer 1. Place tubes on a rotator for 8 mins at RT.
Wash once with 1.5 ml of wash buffer 2 as described in steps 20-24.
Wash once with 1.5 ml of wash buffer 3 as described in steps 20-24.
Add 1.5 ml of 50 mM Tris (pH 7.4) and resuspend gently with pipetting. This step can be used to remove detergents in the sample that may interfere with mass-spectrometry. Save 150 μΐ (10% of total) of resupended beads for further analysis by Western blot.
Spin beads (both 1.35 ml and 150 ul samples) at 6,000 x g RCF for 5 min to collect beads.
Remove the supernatant completely. Do not disrupt beads at the tube bottom.
34. Add SO μΙ of SO m ammonium bicarbonate to the tube that contained 1.3S ml sample (destined for mass-spec analysis) and resuspend gently with pipetting.
Final wash buffer may vary depending on analysis method. If necessary, freeze the sample quickly in liquid nitrogen. Frozen samples can be stored at -
35. For the 150 μΐ sample, after removing supernatant, add 100 μΐ of IX SDS- PAGE sample buffer and resuspend gently by pipetting. Heat samples at 98°C for 5 min. Samples can be stored at -20°C for further analysis by immunoblot as described in Basic Protocol 1.
Prior to sending samples for analysis by mass spectrometry it is recommended to perform immunoblot analysis of ~5 μΐ from the protein in the SDS-PAGE sample buffer. This should follow the same protocol in Basic Protocol I to check for evidence of biotinylated proteins from BioID fusion proteins. A considerable difference in biotinylated proteins between the control and BioID fusion protein samples justifies proceeding to mass spectrometry.
REAGENTS AND SOLUTIONS
Use Milli-Q purified water or equivalent in all recipes.
1 mM Biotin (20X)
Dissolve 12.2 mg Biotin (Sigma, B4501) in 50 ml of serum-free DMEM (or standard tissue culture media). Pipetting may be required to dissolve biotin completely.
Sterilize by passing through a 0.22 μιη syringe-driven filter unit (Millex). Dispense into sterile 50 ml tube, cap tighdy. Store at4°C.
Lysis Buffer
50 mM Tris (pH 7.4)
500 mM NaCl
0.2% SDS (w/v)
Store at RT
I proteinase inhibitor (add just before use)
1 mM DTT (add just before use)
Note: This lysis buffer composition differs from that described in the initial publication of BioID (Roux et al., 2012).
Wash buffer 1
2% SDS (w/v)
Store at T
Wash buffer 2
0.1% (w/v) deoxycholic acid
1% (w/v) Triton X-100
1 mM EDTA
500 mM NaCl
50 mM Hepes (pH 7.5)
Deoxycholic acid stock solution must be protected from light.
Deoxycholic acid does not solubilize well under pH 7.1. Add Hepes last. Store at RT
Wash buffer 3
0.5% (w/v) deoxycholic acid
0.5% (w/v) NP-40
1 mM EDTA
250 mM LiCl
Store at RT
BSA blocking buffer
1% bovine serum albumin, fraction V
0.2% (w/v) Triton X-100
Bring up to final volume with IX PBS.
Store at 4°C
ABS blocking buffer
10% (v/v) adult bovine serum
1% (w/v) Triton X-100
Bring up to final volume with IX PBS
Store at 4°C
Quenching solution
3% (v/v) sodium azide
4.5% (v/v) hydrogen peroxide
Bring up to volume with IX PBS
Make this solution fresh with each use The subcellular locations in which BioID™ has successfully been applied to date include the nucleus, cytoplasm, endoplasmic reticulum (ER) and mitochondrial matrix. This protocol does not provide specific details for identification of proteins by mass-spectrometry. BioID™ experiments performed to date have utilized on-bead tryptic digestion to release peptides for analysis by ID LC-MS MS (Roux et al., 2012). Advantages of this approach include circumvention of the difficulty in removing biotinylation proteins/peptides from streptavidin matrix without also removing the streptavidin itself. An additional benefit from this approach is removal of the SDS-PAGE separation of proteins, a common source of keratin contamination and mass-spectrometry interference due to residual SDS.
The expected outcome is the identification of proteins that interact with and or are proximate to the BioID™ fusion protein. For a iull-scale BioID™ pull-down the results will likely arrive as a list of proteins that were identified by mass- spectrometry. This list should not only contain the names of the proteins but the peptide sequences and number of times each peptide was identified. This provides some information as to the relative abundance of that protein in the sample. In parallel there will be results from the control cells that do not express a BioID™ fusion protein. This list usually contains the five naturally biotinylated carboxylases (pyruvate carboxylase, PC; methylcrotonoyl-CoA carboxylase subunit alpha, MCCl; acetyl-CoA carboxylase 1, ACACA; acetyl-CoA carboxylase 1, ACACB; propionyl- CoA carboxylase alpha chain, PCCA), limited contaminating keratins and ribosomal proteins, and a small number of histones. These can be subtracted from the mass spectrometry results from the BioID™-fusion protein samples. After this subtraction, the rcrnairiing proteins are candidate interactors or proximate proteins that can be
validated by other methods. It is cost-effective to coordinate experiments so that a single control is used for several BioID™ pull-downs. This is acceptable if the control pull-down is performed in parallel under the same conditions and in the same cellular background as the cells expressing BioID™ fusion proteins. BioID™ typically results in a relatively large list of candidates, from -50 to 250, depending on the experiment. These can be ranked by abundance and/or based on what is already known about those proteins. It should be noted that the abundance, or lack thereof, does not necessarily imply anything about biological relevance and the user will have to make decisions as how to best utilize the results.
The process of generating the BioID™-fusion protein expression plasmid by
PCR cloning and validation of the fusion protein by transient transfection can be accomplished in as little as a week. Generation and selection of stably expressing cells could take as little as a week if viral expression is utilized or 3-4 weeks if relying on random integration following transient transfection and selection. A large-scale BioID™ pull-down can be accomplished in under a week. This includes the time required for the generation of sufficient cell numbers (typically 4xl07 cells).
Identification of candidates by mass-spectrometry typically takes 1-2 weeks. Thus a standard BioID™ experiment can be accomplished in 1-2 months from start to finish. LITERATURE CITED
Choi-Rhee, E., H. Schulman, and J.E. Cronan. 2004. Promiscuous protein
biotinylation by Escherichia coli biotin protein ligase. Protein science : a publication of the Protein Society. 13:3043-50.
Cronan, J.E. 2005. Targeted and proximity-dependent promiscuous protein
biotinylation by a mutant Escherichia coli biotin protein ligase. The Journal of nutritional biochemistry. 16:416-8.
Kwon, ., and D. Beckett. 2000. Function of a conserved sequence motif in biotin holoenzyme synthetases. Protein science : a publication of the Protein Society.
9:1530-9.
Roux, K.J., D.I. Kim, M. Raida, and B. Burke. 2012. A promiscuous biotin ligase fusion protein identifies proximal and interacting proteins in mammalian cells. The Journal of cell biology.
van Steensel, B., and S. Henikoff. 2000. Identification of in vivo DNA targets of chromatin proteins using tethered dam methyltransferase. Nature biotechnology. 18:424-8.
Claims
1. A method for identifying in vtvo proximate proteins, comprising
(a) culturing recombinant cells comprising a recombinant nucleic acid capable of directing expression of a fusion protein comprising if) a hetcrologoas promiscuous biotin protein ligase (BPL), or functional equivalent thereof, and (if) a bait: polypeptide, wherein the cuiftn ing is carried out under conditions suitable o expression of the fusion protein in the eukaryotie cells; and
fb) identifying biotinyfated proteins, wherein the biottnylated proteins are proteins present in the cells proximate to the bait polypeptide.
2. The method of claim I, wherein the BPL, or functional equivalent thereof, comprises a protein with an amino acid sequence of general formula 1 :
Xi-X2~X3-X4-X5-X6-X7-XS-X9-Xi 0-Xi 1 -X.12-X I3-X14-X! 5-X 16-Xi 7- 1 S-X 1 - X2O-X21-X22-X23-X24-X25-X26-X27-X28-X29-X30-X31 -X32 ('SEQ ID NO: 13), wherein
XI is selected from the group consisting of A , L, l, T, V, or is absent;
X2 is selected from the group consisting of C, L, V, H, A, or is absent;
X3 is selected from the group consisting of I, V, and L;
X4 is selected from the group consisting of A and G;
X5 is selected from the group consisting of E, D, R, T, N, V, and A;
X6 is selected from the group consisting of Y, R, K, E, and 1;
X7 is Q;
XH selected from the group consisting of Q, T, N« F, V, and S;
X.9 is selected from the group consisting of A , , M, S, Q, and E;
Xl is G;
I 1 is selected from the group consisting of R and :
X12 is G .
X.1.3 is any residue other than R;
ΧΪ4 is selected front the group consisting of R, L, W, G, and S;
X 15 is selected from the group consisting of G. Q, P, a d K;
XJ.6 is selected from the group consisting of R and :
X17 is selected front the group consisting of K, Q, V, E, T, M , and A;
XI 8 is ;
X ί 9 is selected from the group consisting of i'\ L, Y, E, I, and V ;
X20 is selected from the group consisting of S, M, and N;
X21 is selected from the group consisting of P, Q, and D;
X22 is selected from the group consisting of F, E, A, R'„ Y, and V;
X23 is selected from the group consisting of G and A
X24 is selected from the group consisting of V and C. or is abscot;
X25 is selected from the group consisting of C and A, or is absent:
X26 is selected from the group consisting of A and L, or is absent;
X27 is selected from the group consisting of A. G, and S, or is absent; .X28 is selected from the group consisting of N, G, Q, T, and C;
X29 is selected from the group consisting o L, ?„ A, and F:
X30 is selected, from the group consisting of Y. M, A, V, and L;
X31 is selected from the group consisting of L, G. V, 1, and F: and X.32 is selected from the group consisting of S, T, and .
3. The method of claim 2, wherein at least one of the following is true of the BPL of general formula 1 f SEQ ID NO: 17) :
X4 is A :
X8 is T or S;
XI 3 is G;
XI 5 is G;
X i 'Hs ! . or V;
X2 is S;
X2 i is P
X23 is G;
X29 is i, or Ϊ:
X30 is A. V, or L:
X31 is L5 G, V, or Ϊ; and/or
X32 is S,
4. The method of claim 3, wherein X.13 is G.
5. The meihod of any one of claims l~4„ wherein tbe BPL, or functional equivalent thereof, comprises of an ammo acid sequence according to SEQ ID NO: 14.
6. The method of claim 5. wherein the BPL comprises a G residue at position 523.
7. Toe method of any one of claims 1-6, wherein, the BPL comprises an amino acid sequence selected from the group consisting of SEQ ID NO .1 -4 and 6- 12.
8. Hie method of any one of c laims i -7, wherein the recombinant nucleic ac id is present i an expression vector,
9. The method of any one of claims 1-7, wherein the recombinant nucleic acid is integrated into cellular chromosomal DNA.
Hi. The method of any one of claims 1 -9, wherein the colturing comprises adding bkrtin to culture medium that the recombinant cel ls arc cultured in.
i 1. The method of any one of claims 1-10, wherein die cuituring is carried for between about 0.5 hours and about 96 hours.
12. The method of any one of claims 1-1 1 , where in the identifying the biotin lated proteins comprises isolating the biotmy!ated polypeptides,
. 3. A recombinant nucleic acid, comprising
(a) a first nucleic acid domain encoding a promiscuous biotin protein itgase (BPL); and
(b) a second nucleic acid domain encoding a bait polypeptide.
14. The recombinant nucleic acid of claim 13, wherein the firs nucleic acid domain encodes a protein comprising an amino acid sequence of general formula 1 : XI -X.-.X3-X4-.X5-X6-X7-X8-X9-X10-X1 l-Xl 2-Xt 3-ΧΊ 4- 1 5-X16-X 1 7-X18-. 19- X20-X2 J.-X22-X23-X24-X25-X26-X27-X28-X29-X30-X3 1 -X32 (S'EQ ID NO: 13), wherein
XI is selected from the group consisting of A, L, 1, T, V, or is absent;
X2 is selected from the group consisting of C, L, V, H, A, or is absent;
X3 is selected from the group consisting of 1, V, and L;
X4 is selected from the group consisting of A and G;
XS is selected from the group consisting of E, D, R, T, N, V, and A;
X6 is selected from the group consisting of Y, R, K. E. and 1;
X7 is 0;
X8 selected from the group consisting of Q, T, N, F, V, and S;
X9 is selected from the group consisting of A, , , S, Q, and E;
iO is G ;
XI I is selected from the group consisting of R and ;
12 is G;
X ! 3 i any residue other than R ,
X'f4 is selected from the group consisting of R, L, W, G , and S;
XI 5 is selected from the group consisting of G. Q, P, and ;
X ! 6 is selected
Χί7 is selected
XI 8 is W;
XI9 is selected
Λ Κ> IS selected
X2I is selected
X22 is selected
X23 is selected
X25 is setected
X26 is selected
X27 is selected
X2X is selected
X29 is selected
X30 is selected
X31 is selected
X32 is selected
.
15. The recombinant nucleic acid of claim Ϊ 4, wherein. XI 3 ts G.
16. Hie recombinant nucteie acid of any one of claims 13-15, wherein the first nucleic acid domain encodes a BPL protein comprising an amino acid sequence according to SEQ ID NO: 14,
17. The recombinant nucleic acid of claim 16, wherein the first n uc leic acid domain encodes a BPL protein comprising a G residue at position 523.
1 . The recombinant nucleic acid of any one of claims 13-17, wherein the first nucleic acid domain encodes a BPL protein comprising an amino acid sequence selected from the group consisting of SEQ ID NO: 1-4 and 6-12.
19. .An expressio vector comprising the recombinant nucleic acid of any one of claims 1 -18 operatively linked to a promoter.
20. A recombinant host cell comprising the recombinant nucleic acid, of an one of claims 13-18 or the expression vector of claim 19.
21. The recombinant host cell of claim 20, wherein the recombinant host ceil is an embryonic stein cell.
22 A transgenic, non-human organism comprising a recombinant host ceil of claim 20 or 21 .
23. The transgenic, non-human organism of claim 22, wherein die organism is a mammal
24. The transgenic, non-human organism of claim 23, wherein the mammal is a mouse
25. A recombinant fusion protein, comprising
fa) a first domain encoding a bio iii rotein ligase (BP.L); and
(b) a second domain encoding a bait -polypeptide.
26. lite recombinant fusion protein of claim 25, wherein the first domain comprises an amino acid sequence of general formula i:
X 1 -X2-X3 -X4-X5 -X6-X7-X8 -X9- 10-X i i -X 12-X 13-X i 4-X i >-X 16-X 17-X18-X i 9- X2O.X2 -X22-X23-X24-X25-X26-X27-X2S-X29-X30-X31 -X32 (SEQ ID NO: 13), wherein
.XI is selected from the group consisting of A, L, 1, T, V, or is absent;
X2 is selected from the group consisting of C, L, V, H, A, or is absent;
X3 is selected from the group consisting of 1, V, and L:
X.4 is selected from the group consisting of A and G;
X5 is selected from the group consisting of E, D, ' , T, N, V, and A;
X6 is selected from the group consisting of Y, R, , E, and I;
X7 is Q;
X selected from the group consisting of Q, T, N, F, V, and S;
X9 is selected from the group consisting of A, , N. S, Q, and E;
XIO is G;
X I 1 is selected from the group consistin of and K;
Xf.2 is G;
XI 3 is any residue other than R;
X14 is selected from the group consisting of R, L, W, G, and S;
XI is selected from the group consisting of G, Q, P, an K;
X.1.6 is selected from the group consisting of R and N;
XI 7 is selected from the group consisting o , Q, V, E, T, M, and A;
X18 is W;
Xf. is selected from the group consisting of F, L Y, E, Ϊ, and V;
X2 is selected from the group consisting of S, , and N;
X21 is selected from the group consisting of P, Q, and D;
X22 is selected from the group consistin of 1\ £, .A, , Y, and V:
X23 is selected from the group consisting of G and A
X24 is selected from the group consisting of V and C. or is abseot;
X25 is selected from the group consisting of C and A, or is absent;
X26 is selected from the group consisting of A and L. or is absent;
X27 is selected from the group consisting of A, G, and 8, or is absent;
X28 is selected from the group consisting of N, G, Q, T, and C;
X29 is selected from the group consisting of L, Ϊ, A, and P:
X30 is selected from the group consisting of Y, M, A. V, and L;
X. 1 is selected from the group consisting of L, G, V, I, and F; and
X32 is selected from the group consisting of S, T, and F.
27. The recombinant fusion protein of claim 26, wherein X i 3 is G.
28. The recombinant fusion protein o aty one of claims 25-27, wherein the first domain comprises an amino acid sequence according to SEQ ID NO: 14.
29. The recombinant fusion protein of claim 28, wherein the first domain comprises a G residue at position 523.
30. The recombinant fusion protein of an one of claims 25-29, wherein the first domain comprises an amino acid sequence selected from the group consisting of SEQ ID O: 1 -4 and 6-12.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201261722145P | 2012-11-03 | 2012-11-03 | |
US61/722,145 | 2012-11-03 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2014070227A1 true WO2014070227A1 (en) | 2014-05-08 |
Family
ID=47998521
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2013/030261 WO2014070227A1 (en) | 2012-11-03 | 2013-03-11 | Methods and reagents for identifying proximate proteins |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2014070227A1 (en) |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2018053053A1 (en) * | 2016-09-13 | 2018-03-22 | The Broad Institute, Inc. | Proximity-dependent biotinylation and uses thereof |
WO2018208755A1 (en) * | 2017-05-09 | 2018-11-15 | The Regents Of The University Of California | Compositions and methods for tagging target proteins in proximity to a nucleotide sequence of interest |
CN112813153A (en) * | 2021-01-18 | 2021-05-18 | 复旦大学 | Method for determining downstream transcription factor interacting with target protein and DNA binding sequence thereof and application thereof |
CN114149980A (en) * | 2021-11-04 | 2022-03-08 | 中山大学 | A Novel Protein Biotin Ligase and Its Based Proximity Labeling System PhastID |
CN114457048A (en) * | 2022-02-11 | 2022-05-10 | 中山大学 | A protein biotin ligase and its adjacent protein labeling system and application |
CN114606208A (en) * | 2022-03-31 | 2022-06-10 | 中山大学 | A high-confidence and rapid proximity protein labeling system ScPhastID and labeling method |
WO2022216157A1 (en) * | 2021-04-09 | 2022-10-13 | Stichting Radboud Universiteit | Off the shelf proximity biotinylation enzyme |
US12110522B2 (en) | 2018-01-17 | 2024-10-08 | The Board Of Trustees Of The Leland Stanford Junior University | Engineered promiscuous biotin ligases for efficient proximity labeling |
-
2013
- 2013-03-11 WO PCT/US2013/030261 patent/WO2014070227A1/en active Application Filing
Non-Patent Citations (76)
Title |
---|
"Culture afAnimal Cells: A Manual of Basic Technique", 1987, LISS, INC. |
"Gene Expression Technology", vol. 185, 1991, ACADEMIC PRESS, article "Methods in Enzymology" |
"Gene Transfer and Expression Protocols", 1998, THE HUMANA PRESS INC., pages: 109 - 128 |
"Methods in Enzymology", 1990, ACADEMIC PRESS, INC., article "Guide to Protein Purification" |
"PCR Protocols: A Guide to Methods and Applications", 1990, ACADEMIC PRESS |
AARONSON, R.P.; G. BLOBEL: "Isolation of nuclear pore complexes in association with a lamina", PROCEEDINGS OFTHE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, vol. 72, 1975, pages 1007 - 11 |
AL-HABOUBI, T.; D.K. SHUMAKER; J. KOSER; M. WEHNERT; B. FAHRENKROG: "Distinct association of the nuclear pore protein Nup153 with A- and B-type Jamins", NUCLEUS, 2011, pages 2 |
BECKETT, D.; E. KOVALCVA; P.J. SCHATZ: "A minimal peptide substrate in biotin holoenzyme synthetase-catalyzed biotinylation", PROTEIN SCI., vol. 8, 1999, pages 921 - 9, XP002971557 |
BEN SHORT BEN SHORT: "Meet the neighbors", THE JOURNAL OF CELL BIOLOGY, vol. 196, no. 6, 12 March 2012 (2012-03-12), pages 666, XP055069809 * |
BIONE, S.; E. MAESTRINI; S. RIVELLA; M. MANCINI; S. REGIS; G. ROMEO; D. TONIOLO: "Identification of a novel X-linked gene responsible for Emery-Dreifuss muscular dystrophy", NATURE GENETICS, vol. 8, 1994, pages 323 - 7 |
BODOOR, K.; S. SHAIKH; D. SALINA; W.H. RAHARJO; R BASTOS; M. LOHKA; B. BURKE: "Sequential recruitment ofNPC proteins to the nuclear periphery at the end of mitosis", JOURNAL OF CELL SCIENCE, vol. 112, 1999, pages 2253 - 64 |
BONNE, G.; E. MERCURI; A. MUCHIR; A. URTIZBEREA; H.M. BECANE; D. RECAN; L. MERTINI; M. WEHNERT; R. BOOR; U. REUNER: "Annals of neurology", vol. 48, 2000, article "Clinical and molecular genetic spectrum of autosomal dominant Emery-Dreifuss muscular dystrophy due to mutations of the lamin A/C gene", pages: 170 - 80 |
BONNE, G.; M.R. DI BARLETTA; S. VAMOUS; H.M. BCCANE; EH. HAMMOUDA; L. MCRLINI; F. MUNTONI; C.R. GREENBERG; F. GAY; J.A. URTIZBEREA: "Mutations in the gene encoding lamin A/C cause autosomal dominant Emery-Dreifuss muscular dystrophy", NATURE GENETICS, vol. 21, 1999, pages 285 - 8, XP008054131, DOI: doi:10.1038/6799 |
BROERS, J.L.; BM. MACHIELS; GJ. VAN EYS; H.J. KUIJPERS; E.M. MANDERS; R. VAN DRIEL; F.C. RAMAEKERS: "Dynamics of the nuclear lamina as monitored by GFP-tagged A-type lamins", JOURNAL OFCELL SCIENCE, vol. 112, 1999, pages 3463 - 75 |
BURKE, B.; K.J. ROUX: "Nuclei take a position: managing nuclear location", DEVELOPMENTAL CELL, vol. 17, 2009, pages 587 - 97 |
CASTANO, E.; V.V. PHILIMONENKO; M. KAHLE; J. FUKALOVA; A. KALENDOVA; S. YILDIRIM; R DZIJAK; H. DINGOVA-KRASNA; P. HOZAK: "Actin complexes in the cell nucleus: new stones in an old field", HISTOCHEMISTRY AND CELL BIOLOGY, vol. 133, 2010, pages 607 - 26, XP019848071 |
CHAPMAN-SMITH, A.; J.E. CRONAN, JR.: "Molecular biology of biotin attachment to proteins", JNUTR., vol. 129, 1999, pages 477S - 484S |
CHOI-RHEE, E.; H. SCHULMAN; J.E. CRONAN: "Promiscuous protein biotinylation by Escherichia coli biotin protein ligase", PROTEIN SCI., vol. 13, 2004, pages 3043 - 50, XP055069906, DOI: doi:10.1110/ps.04911804 |
CHOI-RHEE, E.; H. SCHULMAN; J.E. CRONAN: "Promiscuous protein biotinylation by Escherichia coli biotin protein ligase", PROTEIN SCIENCE : A PUBLICATION OF THE PROTEIN SOCIETY, vol. 13, 2004, pages 3043 - 50, XP055069906 * |
CHOI-RHEE, E.; H. SCHULMAN; J.E. CRONAN: "Promiscuous protein biotinylation by Escherichia coli biotin protein ligase", PROTEIN SCIENCE : A PUBLICATION OF THE PROTEIN SOCIETY, vol. 13, 2004, pages 3043 - 50, XP055069906, DOI: doi:10.1110/ps.04911804 |
CRISP, M.; Q. LIU; K. ROUX; J.B. RATTNER; C. SHANAHAN; B. BURKE; P.D. STAHL; D. HODZIC: "Coupling of the nucleus and cytoplasm: role of the LINC complex", THE JOURNAL OF CELL BIOLOGY, vol. 172, 2006, pages 41 - 53 |
CRONAN, J.E.: "Targeted and proximity-dependent promiscuous protein biotinylation by a mutant Escherichia coli biotin protein ligase", J NURR BTOCHEM., vol. 16, 2005, pages 416 - 8, XP004962819, DOI: doi:10.1016/j.jnutbio.2005.03.017 |
CRONAN, J.E.: "Targeted and proximity-dependent promiscuous protein biotinylation by a mutant Escherichia coli biotin protein ligase", THE JOURNAL OF NUTRITIONAL BIOCHEMISTRY, vol. 16, 2005, pages 416 - 8, XP004962819, DOI: doi:10.1016/j.jnutbio.2005.03.017 |
CRONAN, J.E.: "Targeted and proximity-dependent promiscuous protein biotinylation by a mutant Escherichia coli biotin protein ligase", THE JOURNAL OF NUTRITIONAL BIOCHEMISTRY, vol. 16, 2005, pages 416 - 8, XP027618841 * |
DAIGLE, N.; J. BEAUDOUIN; L. HARTNELL; G. IMREH; E. HALLBERG; J. LIPPINCOTT-SCHWARTZ; J. ELLENBERG: "Nuclear pore complexes form immobile networks and have a very low turnover in live mammalian cells", THE JOURNAL OF CELL BIOLOGY, vol. 154, 2001, pages 71 - 84 |
DECHAT, T.; B. KORBEI; OA. VAUGHAN; S. VLCEK; C.J. HUTCHISON; R. FOISNER: "Lamina-associated polypeptide 2alpha binds intranuclear A-type lamins", JOURNAL OF CELL SCIENCE, vol. 113, 2000, pages 3473 - 84 |
DWYER, N.; G. BLOBEL: "A modified procedure for the isolation of a pore complex-lamina fraction from rat liver nuclei", THE JOURNAL OF CELL BIOLOGY, vol. 70, 1976, pages 581 - 91 |
FOISNER, R.; L. GERACE: "Integral membrane proteins of the nuclear envelope interact with lamins and chromosomes, and binding is modulated by mitotic phosphorylation.", CELL, vol. 73, 1993, pages 1267 - 79, XP024245859, DOI: doi:10.1016/0092-8674(93)90355-T |
GERACE, L.; M.D. HUBER: "Nuclear lamina at the crossroads of the cytoplasm and nucleus", JOURNAL OFSTRUCTURAL BIOLOGY, vol. 177, 2012, pages 24 - 31, XP028444074, DOI: doi:10.1016/j.jsb.2011.11.007 |
GOLDMAN, R.D.; Y. GRUENBAUM; R.D. MOIR; D.K. SHUMAKER; T.P. SPANN: "Nuclear lamins: building blocks of nuclear architecture", GENES & DEVELOPMENT, vol. 16, 2002, pages 533 - 47 |
GREEN, N.M.: "Avidin. 1. The Use of (14-C)Biotin for Kinetic Studies and for Assay", BIOCHEM J., vol. 89, 1963, pages 585 - 91 |
GUAN, T.; R.H. KEHLENBACH; E.C. SCHIRMER; A. KEHLENBACH; F. FAN; B.E. CLUNNAN; N. ARNHEIM; L. GERACE: "Nup50, a nucleoplasmically oriented nucleoporin with a role in nuclear protein export", MOLECULAR AND CELLULAR BIOLOGY, vol. 20, 2000, pages 5619 - 30 |
GUDISC, S.; R.A. FIGUEROA; R. LINDBERG; V. LARSSON; E. HALLBERG: "Samp is functionally associated with the LINC complex and A-type lamina networks", JOURNAL OF CELL SCIENCE, vol. 124, 2011, pages 2077 - 85 |
GUENEAU, L.; A.T. BERTRAND; J.P. JAIS; M.A. SALIB; T. STOJKOVIC; M. WEHNERT; M. HOELTZENBEIN; S. SPULER; S. SAITOH; A. VERSCHUEREN: "Mutations of the FHLI gene cause Emery-Dreifuss muscular dystrophy", AMERICAN JOURNAL OFHUMAN GENETICS, vol. 85, 2009, pages 338 - 53 |
HASE, M.E.; V.C. CORDES: "Direct interaction with nupl53 mediates binding of Tpr to the periphery of the nuclear pore complex", MOLECULAR BIOLOGY OF THE CELL, vol. 14, 2003, pages 1923 - 40 |
HOLASKA, J.M.; K.K. LEE; A.K. KOWALSKI; K.L. WILSON: "Transcriptional repressor germ cell-less (GCL) and barrier to autointegration factor (BAF) compete for binding to emerin in vitro", THE JOURNAL OF BIOLOGICAL CHEMISTRY, vol. 278, 2003, pages 6969 - 75 |
HOLASKA, J.M.; KL. WILSON: "Multiple roles for emerin: implications for Emery-Dreifuss muscular dystrophy", THE ANATOMICAL RECORD. PART A, DISCOVERIES IN MOLECULAR, CELLULAR, AND EVOLUTIONARY BIOLOGY, vol. 288, 2006, pages 676 - 80 |
HORTON, H.; I. MCMORROW; B. BURKE: "Independent expression and assembly properties of heterologous lamins A and C in murine embryonal carcinomas", EUROPEAN JOURNAL OF CELL BIOLOGY, vol. 57, 1992, pages 172 - 83 |
K. J. ROUX ET AL: "A promiscuous biotin ligase fusion protein identifies proximal and interacting proteins in mammalian cells", HUMAN MOLECULAR GENETICS, vol. 16, no. 23, 12 March 2012 (2012-03-12), pages 2816 - 810, XP055069796, ISSN: 0964-6906, DOI: 10.1093/hmg/ddm238 * |
KRULL, S.; J. THYBERG; B. BJORKROTH; H.R. RACKWITZ; V.C. CORDES: "Nucleoporins as components of the nuclear pore complex core structure and Tpr as the architectural element of the nuclear basket", MOLECULAR BIOLOGY OF THE CELL, vol. 15, 2004, pages 4261 - 77 |
KUBBEN, N.; J.W. VONCKEN; J. DEMMERS; C. CALIS; G. VAN ALMEN; Y. PINTO; T. MISTELI: "Identification of differential protein interactors of lamin A and progerin", NUCLEUS, vol. 1, 2010, pages 513 - 25 |
KULMAN, J.D.; M. SATAKE; J.E. HARRIS: "A versatile system for site-specific enzymatic biotinylation and regulated expression of proteins in cultured mammalian cells", PROTEIN EXPRESSION AND PURIFICATION, vol. 52, 2007, pages 320 - 8, XP005737472, DOI: doi:10.1016/j.pep.2006.09.011 |
KUROISHI, T.; L. RIOS-AVILA; V. PESTINGER; S.S. WIJERATNE; J. ZEMPLENI: "Biotinylation is a natural, albeit rare, modification of human histones", MOLECULAR GENETICS AND METABOLISM, vol. 104, 2011, pages 537 - 45, XP028117219, DOI: doi:10.1016/j.ymgme.2011.08.030 |
KWON, K.; D. BECKETT: "Function of a conserved sequence motif in biotin holoenzyme synthetases", PROTEIN SCIENCE: A PUBLICATION OF THE PROTEIN SOCIETY, vol. 9, 2000, pages 1530 - 9 |
KWON, K.; E.D. STREAKER; S. RUPARELIA; D. BECKETT: "Multiple disordered loops function in corepressor-induced dimerization of the biotin repressor", J MOL BIOL., vol. 304, 2000, pages 821 - 33 |
LANE, M.D.; K.L. ROMINGER; D.L. YOUNG; F. LYNEN: "The Enzymatic Synthesis of Holotranscarboxylase from Apotranscarboxylase and (+)-Biotin. Ii, Investigation of the Reaction Mechanism", J BIOL CHEM., vol. 239, 1964, pages 2865 - 71 |
LEE, K.K.; T. HARAGUCHI; R.S. LEE; T. KOUJIN; Y. HIRAOKA; K.L. WILSON: "Distinct functional domains in emeiin bind lamin A and DNA-bridging protein BAF", JOURNAL OF CELL SCIENCE, vol. 114, 2001, pages 4567 - 73 |
LIU, Q.; N. PANTE; T. MISTELI; M. ELSAGGA; M. CRISP; D. HODZIC; B. BURKE; K.J. ROUX: "Functional association of Sun 1 with nuclear pore complexes", THE JOURNAL OF CELL BIOLOGY, vol. 178, 2007, pages 785 - 98 |
MANSHARAMANI, M.; K.L. WILSON: "Direct binding of nuclear membrane protein MAN1 to emerin in vitro and two modes of binding to barrier-to-autointegration factor", THE JOURNAL OFBIOLOGICAL CHEMISTRY, vol. 280, 2005, pages 13863 - 70 |
MECHOLD, U.; C. GILBERT; V. OGRYZKO: "Codon optimization of the BirA enzyme gene leads to higher expression and an improved efficiency of biotinylation of target proteins in mammalian cells", JOURNAL OF BIOTECHNOLOGY, vol. 116, 2005, pages 245 - 9, XP004737497, DOI: doi:10.1016/j.jbiotec.2004.12.003 |
MOIR, RD.; M. YOON; S. KHUON; R.D. GOLDMAN: "Nuclear lamins A and B 1: different pathways of assembly during nuclear envelope formation in living cells", THE JOURNAL OF CELL BIOLOGY, vol. 151, 2000, pages 1155 - 68 |
MONTES DE OCA, R.; C.J. SHOEMAKER; M. GUCEK; R.N. COLE; K.L. WILSON: "Barrier-to-autointegration factor proteome reveals chromatin-regulatory partners", PLOS ONE, vol. 4, 2009, pages 7050 |
NESBETH, D.; S.L. WILLIAMS; L. CHAN; T. BRAIN; N.K. SLATER; F. FARZANEH; D. DARLING: "Metabolic biotinylation of lentiviral pseudotypes for scalable paramagnetic microparticle-dependent manipulation", MOLECULAR THERAPY : THE JOURNAL OF THE AMERICAN SOCIETY OF GENE THERAPY, vol. 13, 2006, pages 814 - 22, XP005358616, DOI: doi:10.1016/j.ymthe.2005.09.016 |
RAFFAELE DI BARLETTA, M.; E. RICCI; G. GALLUZZI; P. TONALI; M. MORA; L. MORANDI; A. ROMORINI; T. VOIT; K.H. ORSTAVIK; L. MERLINI: "Different mutations in the LMNA gene cause autosomal dominant and autosomal recessive Emery-Dreifuss muscular dystrophy", AMERICAN JOURNAL OFHUMAN GENETICS, vol. 66, 2000, pages 1407 - 12 |
RASALA, B.A.; C. RAMOS; A. HAREL; D.J. FORBES.: "Capture of AT-rich chromatin by ELYS recruits POM 121 and NDC1 to initiate nuclear pore assembly", MOLECULAR BIOLOGY OF THE CELL., vol. 19, 2008, pages 3982 - 96 |
ROUX, K.J.; D.I. KIM; M. RAIDA; B. BURKE: "A promiscuous biotin ligase fusion protein identifies proximal and interacting proteins in mammalian cells", THE JOURNAL OF CELL BIOLOGY, 2012 |
ROUX, K.J.; M.L. CRISP; Q. LIU; D. KIM; S. KOZLOV; C.L. STEWART; B. BURKE: "Nesprin 4 is an outer nuclear membrane protein that can induce kinesin- mediated cell polarization", PROCEEDINGS OF THE NATIONAL ACADEMY AF SCIENCES OF THE UNITED STATES OFAMERICA, vol. 106, 2009, pages 2194 - 9 |
SAMBROOK ET AL.: "Molecular Cloning: A Laboratory Manual", 1989, COLD SPRING HARBOR LABORATORY PRESS |
SAMBROOK; FRITSCH; MANIATIS: "Molecular Cloning, A Laboratory Manual", 1989, COLD SPRING HARBOR LABORATORY PRESS |
SANG, L.; JJ. MILLER; K.C. CORBIT; R.H. GILES; M.J. BRAUER; E.A. OTTO; L.M. BAYE; X. WEN; S.J. SCALES; M. KWONG: "Mapping the NPHP-JBTS-MKS protein network reveals ciliopathy disease genes and pathways", CELL, vol. 145, 2011, pages 513 - 28, XP028374836, DOI: doi:10.1016/j.cell.2011.04.019 |
SCHIRMER, E.C.; L. FLORENS; T. GUAN; J.R. YATES, 3RD; L. GERACE: "Nuclear membrane proteins with potential disease links found by subtractive proteomics", SCIENCE, vol. 301, 2003, pages 1380 - 2 |
SEGURA-TOTTEN, M.; A.K. KOWALSKI; R. CRAIGIE; K.L. WILSON: "Barrier-to-autointegration factor: major roles in chromatin decondensation and nuclear assembly", THE JOURNAL OF CELL BIOLOGY, vol. 158, 2002, pages 475 - 85 |
SHIMI, T.; K. PFLEGHAAR; S. KOJIMA; C.G..PACK; I. SOLOVEI; A.E. GOLDMAN; S.A. ADAM; D.K. SHUMAKER; M. KINJO; T. CREMER: "The A- and B-type nuclear lamin networks: microdomains involved in chromatin organization and transcription", GENES & DEVELOPMENT, vol. 22, 2008, pages 3409 - 21 |
SHUMAKER, D.K.; T. DECHAT; A. KOHLMAIER; S.A. ADAM; M.R. BOZOVSKY; M.R. ERDOS; M. ERIKSSON; A.E. GOLDMAN; S. KHUON; F.S. COLLINS: "Mutant nuclear lamin A leads to progressive alterations of epigenetic control in premature aging", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED SLATES OF AMERICA, vol. 103, 2006, pages 8703 - 8 |
SIMON, D.N.; K.L. WILSON: "The nucleoskeleton as a genome-associated dynamic 'network of networks", NATURE REVIEWS. MOLECULAR CELL BIOLOGY, vol. 12, 2011, pages 695 - 708 |
STREAKER, E.D.; D. BECKETT: "Nonenzymatic biotinylation of a biotin carboxyl carrier protein: unusual reactivity of the physiological target lysine", PROTEIN SCI., vol. 15, 2006, pages 1928 - 35 |
SUKEGAWA, J.; G. BLOBEL: "A nuclear pore complex protein that contains zinc finger motifs, binds DNA, and faces the nucleoplasm", CELL, vol. 72, 1993, pages 29 - 38, XP024245436, DOI: doi:10.1016/0092-8674(93)90047-T |
SULLIVAN, T.; D. ESCALANTE-ALCALDE; H. BHATT; M. ANVER; N. BHAT; K. NAGASHIMA; C.L. STEWART; B. BURKE: "Loss of A-type lamin expression compromises nuclear envelope integrity leading to muscular dystrophy", THE JOURNAL OFCELL BIOLOGY, vol. 147, 1999, pages 913 - 20 |
TORRISI, M.R.; L.V. LOTTI; A. PAVAN; G. MIGLIACCIO; S. BONATTI: "Free diffusion to and from the inner nuclear membrane of newly synthesized plasma membrane glycoproteins", THE JOURNAL OFCELL BIOLOGY, vol. 104, 1987, pages 733 - 7 |
TORRISI, M.R.; S. BONATTI: "Immunocytochemical study of the partition and distribution ofSindbis virus glycoproteins in freeze-fractured membranes of infected baby hamster kidney cells", THE JOURNAL OF CELL BIOLOGY, vol. 101, 1985, pages 1300 - 6 |
VAN STEENSEL, B.; S. HENIKOFF: "Identification of in vivo DNA targets of chromatin proteins using tethered dam methyltransferase", NATURE BIOTECHNOLOGY, vol. 18, 2000, pages 424 - 8, XP002187608, DOI: doi:10.1038/74487 |
VAN STEENSET, B.; S. HENIKOFF: "Identification of in vivo DNA targets of chromatin proteins using tethered dam methyltransferase", NATURE BIOTECHNOLOGY, vol. 18, 2000, pages 424 - 8, XP002187608, DOI: doi:10.1038/74487 |
WALTHER, T.C.; M. FOMEROD; H. PICKERSGILL; M. GOLDBERG; T.D. ALLEN; I.W. MATTAJ.: "The nucleoporin Nup 153 is required for nuclear pore basket formation, nuclear pore complex anchoring and import of a subset of nuclear proteins", THE EMBOJOURNAL., vol. 20, 2001, pages 5703 - 14 |
WORMAN, H.J.: "Nuclear lamins and laminopathies", THE JOURNAL OF PATHOLOGY, vol. 226, 2012, pages 316 - 25, XP055053091, DOI: doi:10.1002/path.2999 |
WORMAN, H.J.; L.G. FONG; A. MUCHIR; S.G. YOUNG: "Laminopathies and the long strange trip from basic cell biology to therapy", THE JOURNAL OFCLINICAL INVESTIGATION, vol. 119, 2009, pages 1825 - 36, XP055060705, DOI: doi:10.1172/JCI37679 |
ZHANG, Q.; C. BETHMANN; N.F. WORTH; J.D. DAVIES; C. WASNER; A. FEUER; C.D. RAGRAUTH; Q. YI; J.A. MELLAD; D.T. WARREN: "Nesprin-1 and -2 are involved in the pathogenesis of Emery Dreifuss muscular dystrophy and are critical for nuclear envelope integrity", HUMAN MOLECULAR GENERICS, vol. 16, 2007, pages 2816 - 33 |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2018053053A1 (en) * | 2016-09-13 | 2018-03-22 | The Broad Institute, Inc. | Proximity-dependent biotinylation and uses thereof |
US12188059B2 (en) | 2016-09-13 | 2025-01-07 | The Broad Institute, Inc. | Proximity-dependent biotinylation and uses thereof |
WO2018208755A1 (en) * | 2017-05-09 | 2018-11-15 | The Regents Of The University Of California | Compositions and methods for tagging target proteins in proximity to a nucleotide sequence of interest |
US12110522B2 (en) | 2018-01-17 | 2024-10-08 | The Board Of Trustees Of The Leland Stanford Junior University | Engineered promiscuous biotin ligases for efficient proximity labeling |
CN112813153B (en) * | 2021-01-18 | 2022-10-18 | 复旦大学 | A method for determining the downstream transcription factor interacting with target protein and its DNA binding sequence and its application |
CN112813153A (en) * | 2021-01-18 | 2021-05-18 | 复旦大学 | Method for determining downstream transcription factor interacting with target protein and DNA binding sequence thereof and application thereof |
WO2022216157A1 (en) * | 2021-04-09 | 2022-10-13 | Stichting Radboud Universiteit | Off the shelf proximity biotinylation enzyme |
CN114149980A (en) * | 2021-11-04 | 2022-03-08 | 中山大学 | A Novel Protein Biotin Ligase and Its Based Proximity Labeling System PhastID |
CN114149980B (en) * | 2021-11-04 | 2023-10-13 | 中山大学 | A novel protein biotin ligase and its proximity labeling system PhastID |
CN114457048B (en) * | 2022-02-11 | 2024-04-16 | 中山大学 | Protein biotin ligase and adjacent protein marking system and application thereof |
CN114457048A (en) * | 2022-02-11 | 2022-05-10 | 中山大学 | A protein biotin ligase and its adjacent protein labeling system and application |
CN114606208B (en) * | 2022-03-31 | 2024-06-11 | 中山大学 | Quick adjacent protein marking system SCPHASTID with high credibility and marking method |
CN114606208A (en) * | 2022-03-31 | 2022-06-10 | 中山大学 | A high-confidence and rapid proximity protein labeling system ScPhastID and labeling method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Roux et al. | A promiscuous biotin ligase fusion protein identifies proximal and interacting proteins in mammalian cells | |
WO2014070227A1 (en) | Methods and reagents for identifying proximate proteins | |
Hase et al. | Direct interaction with nup153 mediates binding of Tpr to the periphery of the nuclear pore complex | |
Matic et al. | In vivo identification of human small ubiquitin-like modifier polymerization sites by high accuracy mass spectrometry and an in vitro to in vivo strategy | |
Rousová et al. | Novel mechanistic insights into the role of Mer2 as the keystone of meiotic DNA break formation | |
Serfling et al. | Incorporation of unnatural amino acids into proteins expressed in mammalian cells | |
Shinde et al. | The ancestral ESCRT protein TOM1L2 selects ubiquitinated cargoes for retrieval from cilia | |
Henneberg et al. | Activity-based profiling of cullin–RING E3 networks by conformation-specific probes | |
Laidlaw et al. | Recycling of cell surface membrane proteins from yeast endosomes is regulated by ubiquitinated Ist1 | |
Li et al. | Nuclear localization of Desmoplakin and its involvement in telomere maintenance | |
EP3601325B1 (en) | New tau species | |
JP4683572B2 (en) | Quantification of recombinant protein | |
Sangith et al. | Discovery of novel interacting partners of PSMD9, a proteasomal chaperone: Role of an Atypical and versatile PDZ-domain motif interaction and identification of putative functional modules | |
Nahlé et al. | Defining the interactomes of proteins involved in cytoskeletal dynamics using high-throughput proximity-dependent biotinylation in cellulo | |
Hernandez et al. | Bimolecular fluorescence complementation analysis to reveal protein interactions in herpes virus infected cells | |
Draczkowski et al. | Structural basis of K11/K48-branched ubiquitin chain recognition by the human 26S proteasome | |
Jäverfelt et al. | The MYO1B and MYO5B motor proteins and the SNX27 sorting nexin regulate membrane mucin MUC17 trafficking in enterocytes | |
Kuppa et al. | Rtt105 configurationally staples RPA and blocks facilitated exchange and interactions with RPA-interacting proteins | |
Lloyd et al. | A method for the detection and enrichment of endogenous cereblon substrates | |
JP6229908B2 (en) | Protein tag, tagged protein and protein purification method | |
Lapointe et al. | eIF5B and eIF1A remodel human translation initiation complexes to mediate ribosomal subunit joining | |
Owen et al. | Cargo selective vesicle tethering: the structural basis for binding of specific cargo proteins by the Golgi tether component TBC1D23 | |
Thiel et al. | Development of a p62 biodegrader for autophagy targeted degradation | |
James et al. | Phosphorylation-dependent interactions of VAPB and ELYS contribute to the temporal progression of mitosis | |
Ikeuchi et al. | Recognition and deubiquitination of free 40S for translational reset by Otu2 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 13712626 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 13712626 Country of ref document: EP Kind code of ref document: A1 |