US20030049649A1 - Targeted modification of chromatin structure - Google Patents
Targeted modification of chromatin structure Download PDFInfo
- Publication number
- US20030049649A1 US20030049649A1 US10/084,826 US8482602A US2003049649A1 US 20030049649 A1 US20030049649 A1 US 20030049649A1 US 8482602 A US8482602 A US 8482602A US 2003049649 A1 US2003049649 A1 US 2003049649A1
- Authority
- US
- United States
- Prior art keywords
- molecule
- chromatin
- dna
- fusion
- gene
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 108010077544 Chromatin Proteins 0.000 title claims abstract description 378
- 210000003483 chromatin Anatomy 0.000 title claims abstract description 378
- 230000004048 modification Effects 0.000 title claims abstract description 75
- 238000012986 modification Methods 0.000 title claims abstract description 75
- 238000000034 method Methods 0.000 claims abstract description 220
- 230000001413 cellular effect Effects 0.000 claims abstract description 87
- 230000006798 recombination Effects 0.000 claims abstract description 9
- 238000005215 recombination Methods 0.000 claims abstract description 9
- 108090000623 proteins and genes Proteins 0.000 claims description 344
- 210000004027 cell Anatomy 0.000 claims description 255
- 230000004927 fusion Effects 0.000 claims description 217
- 238000007634 remodeling Methods 0.000 claims description 168
- 230000004568 DNA-binding Effects 0.000 claims description 164
- 238000009739 binding Methods 0.000 claims description 164
- 230000027455 binding Effects 0.000 claims description 162
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 136
- 150000007523 nucleic acids Chemical class 0.000 claims description 130
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 125
- 229920001184 polypeptide Polymers 0.000 claims description 121
- 230000014509 gene expression Effects 0.000 claims description 118
- 102000039446 nucleic acids Human genes 0.000 claims description 118
- 108020004707 nucleic acids Proteins 0.000 claims description 118
- 239000012634 fragment Substances 0.000 claims description 64
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 claims description 49
- 229910052725 zinc Inorganic materials 0.000 claims description 49
- 239000002773 nucleotide Substances 0.000 claims description 48
- 239000011701 zinc Substances 0.000 claims description 48
- 125000003729 nucleotide group Chemical group 0.000 claims description 46
- 230000004913 activation Effects 0.000 claims description 43
- 108010033040 Histones Proteins 0.000 claims description 42
- 102000003964 Histone deacetylase Human genes 0.000 claims description 37
- 108090000353 Histone deacetylase Proteins 0.000 claims description 37
- 230000002255 enzymatic effect Effects 0.000 claims description 32
- 101000702560 Homo sapiens Probable global transcription activator SNF2L1 Proteins 0.000 claims description 28
- 102000044753 ISWI Human genes 0.000 claims description 26
- 108010073929 Vascular Endothelial Growth Factor A Proteins 0.000 claims description 26
- 108010019530 Vascular Endothelial Growth Factors Proteins 0.000 claims description 26
- 102000003893 Histone acetyltransferases Human genes 0.000 claims description 23
- 108090000246 Histone acetyltransferases Proteins 0.000 claims description 23
- 101000702559 Homo sapiens Probable global transcription activator SNF2L2 Proteins 0.000 claims description 21
- 102000040945 Transcription factor Human genes 0.000 claims description 21
- 108091023040 Transcription factor Proteins 0.000 claims description 21
- 102000040430 polynucleotide Human genes 0.000 claims description 20
- 108091033319 polynucleotide Proteins 0.000 claims description 20
- 239000002157 polynucleotide Substances 0.000 claims description 20
- 102000003951 Erythropoietin Human genes 0.000 claims description 18
- 108090000394 Erythropoietin Proteins 0.000 claims description 18
- 229940105423 erythropoietin Drugs 0.000 claims description 18
- 230000029279 positive regulation of transcription, DNA-dependent Effects 0.000 claims description 18
- OXCMYAYHXIHQOA-UHFFFAOYSA-N potassium;[2-butyl-5-chloro-3-[[4-[2-(1,2,4-triaza-3-azanidacyclopenta-1,4-dien-5-yl)phenyl]phenyl]methyl]imidazol-4-yl]methanol Chemical compound [K+].CCCCC1=NC(Cl)=C(CO)N1CC1=CC=C(C=2C(=CC=CC=2)C2=N[N-]N=N2)C=C1 OXCMYAYHXIHQOA-UHFFFAOYSA-N 0.000 claims description 18
- 239000011230 binding agent Substances 0.000 claims description 17
- 238000004519 manufacturing process Methods 0.000 claims description 16
- 101000702544 Homo sapiens SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily A member 5 Proteins 0.000 claims description 15
- 230000037426 transcriptional repression Effects 0.000 claims description 11
- 210000005260 human cell Anatomy 0.000 claims description 10
- 238000001514 detection method Methods 0.000 claims description 9
- 210000004102 animal cell Anatomy 0.000 claims description 7
- 102000004190 Enzymes Human genes 0.000 claims description 6
- 108090000790 Enzymes Proteins 0.000 claims description 6
- 101000709520 Chlamydia trachomatis serovar L2 (strain 434/Bu / ATCC VR-902B) Atypical response regulator protein ChxR Proteins 0.000 claims description 5
- 108010036115 Histone Methyltransferases Proteins 0.000 claims description 5
- 102000011787 Histone Methyltransferases Human genes 0.000 claims description 5
- 239000003814 drug Substances 0.000 claims description 5
- 150000003384 small molecules Chemical class 0.000 claims description 5
- 108010069091 Dystrophin Proteins 0.000 claims description 4
- 102000045595 Phosphoprotein Phosphatases Human genes 0.000 claims description 4
- 108700019535 Phosphoprotein Phosphatases Proteins 0.000 claims description 4
- 108010043400 Protamine Kinase Proteins 0.000 claims description 4
- 102000001253 Protein Kinase Human genes 0.000 claims description 4
- 108010080146 androgen receptors Proteins 0.000 claims description 4
- 102000008157 Histone Demethylases Human genes 0.000 claims description 3
- 108010074870 Histone Demethylases Proteins 0.000 claims description 3
- 102000001307 androgen receptors Human genes 0.000 claims description 3
- 101100220553 Caenorhabditis elegans chd-3 gene Proteins 0.000 claims description 2
- 101100220549 Mus musculus Chd1 gene Proteins 0.000 claims description 2
- 108091005804 Peptidases Proteins 0.000 claims description 2
- 102000001039 Dystrophin Human genes 0.000 claims 2
- 102000005789 Vascular Endothelial Growth Factors Human genes 0.000 claims 2
- 239000004365 Protease Substances 0.000 claims 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 claims 1
- 239000000203 mixture Substances 0.000 abstract description 48
- 230000035897 transcription Effects 0.000 abstract description 34
- 238000013518 transcription Methods 0.000 abstract description 34
- 230000008569 process Effects 0.000 abstract description 30
- 108091028043 Nucleic acid sequence Proteins 0.000 abstract description 18
- 239000013611 chromosomal DNA Substances 0.000 abstract description 5
- 102000004169 proteins and genes Human genes 0.000 description 144
- 235000018102 proteins Nutrition 0.000 description 137
- 108020004414 DNA Proteins 0.000 description 76
- 230000000694 effects Effects 0.000 description 68
- 239000013598 vector Substances 0.000 description 63
- 108010047956 Nucleosomes Proteins 0.000 description 52
- 210000001623 nucleosome Anatomy 0.000 description 52
- 235000001014 amino acid Nutrition 0.000 description 46
- 150000001413 amino acids Chemical class 0.000 description 45
- 239000000047 product Substances 0.000 description 43
- 238000001994 activation Methods 0.000 description 42
- -1 for example Proteins 0.000 description 41
- 238000003556 assay Methods 0.000 description 36
- 239000002502 liposome Substances 0.000 description 33
- 239000013612 plasmid Substances 0.000 description 31
- 230000001105 regulatory effect Effects 0.000 description 30
- 108091006112 ATPases Proteins 0.000 description 28
- 102000057290 Adenosine Triphosphatases Human genes 0.000 description 28
- 102000006947 Histones Human genes 0.000 description 28
- 230000002103 transcriptional effect Effects 0.000 description 28
- 239000013615 primer Substances 0.000 description 27
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 24
- 102000009524 Vascular Endothelial Growth Factor A Human genes 0.000 description 23
- 239000013604 expression vector Substances 0.000 description 23
- 230000003321 amplification Effects 0.000 description 22
- 238000003199 nucleic acid amplification method Methods 0.000 description 22
- 230000003612 virological effect Effects 0.000 description 22
- 108091034117 Oligonucleotide Proteins 0.000 description 21
- 101000702545 Homo sapiens Transcription activator BRG1 Proteins 0.000 description 20
- 102100031021 Probable global transcription activator SNF2L2 Human genes 0.000 description 20
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 20
- 230000003993 interaction Effects 0.000 description 20
- 239000000523 sample Substances 0.000 description 20
- 230000006870 function Effects 0.000 description 19
- 125000003275 alpha amino acid group Chemical group 0.000 description 18
- 102000004217 thyroid hormone receptors Human genes 0.000 description 18
- 108090000721 thyroid hormone receptors Proteins 0.000 description 18
- 238000004458 analytical method Methods 0.000 description 17
- 239000002585 base Substances 0.000 description 16
- 150000001875 compounds Chemical class 0.000 description 16
- 230000007423 decrease Effects 0.000 description 16
- 108020001507 fusion proteins Proteins 0.000 description 16
- 102000037865 fusion proteins Human genes 0.000 description 16
- 239000003446 ligand Substances 0.000 description 16
- 239000012528 membrane Substances 0.000 description 16
- 210000004940 nucleus Anatomy 0.000 description 16
- 210000001519 tissue Anatomy 0.000 description 16
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 15
- 241000700605 Viruses Species 0.000 description 15
- 108020004999 messenger RNA Proteins 0.000 description 15
- 238000001890 transfection Methods 0.000 description 15
- 238000001415 gene therapy Methods 0.000 description 14
- 238000001727 in vivo Methods 0.000 description 14
- 108020001756 ligand binding domains Proteins 0.000 description 14
- 102000005962 receptors Human genes 0.000 description 14
- 108020003175 receptors Proteins 0.000 description 14
- 239000000126 substance Substances 0.000 description 14
- 238000011144 upstream manufacturing Methods 0.000 description 14
- 102000004966 Nuclear Receptor Coactivator 1 Human genes 0.000 description 13
- 108090001146 Nuclear Receptor Coactivator 1 Proteins 0.000 description 13
- 230000033228 biological regulation Effects 0.000 description 13
- 150000002632 lipids Chemical class 0.000 description 13
- 230000001404 mediated effect Effects 0.000 description 13
- 230000010076 replication Effects 0.000 description 13
- 108020004705 Codon Proteins 0.000 description 12
- 241000196324 Embryophyta Species 0.000 description 12
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 12
- 101001035011 Homo sapiens Histone deacetylase 2 Proteins 0.000 description 12
- 102000007399 Nuclear hormone receptor Human genes 0.000 description 12
- 108020005497 Nuclear hormone receptor Proteins 0.000 description 12
- 239000000427 antigen Substances 0.000 description 12
- 108091007433 antigens Proteins 0.000 description 12
- 102000036639 antigens Human genes 0.000 description 12
- 230000015572 biosynthetic process Effects 0.000 description 12
- 230000002759 chromosomal effect Effects 0.000 description 12
- 210000000349 chromosome Anatomy 0.000 description 12
- 238000010276 construction Methods 0.000 description 12
- 230000001419 dependent effect Effects 0.000 description 12
- 239000000499 gel Substances 0.000 description 12
- 238000003752 polymerase chain reaction Methods 0.000 description 12
- 108091026890 Coding region Proteins 0.000 description 11
- 101000808011 Homo sapiens Vascular endothelial growth factor A Proteins 0.000 description 11
- 108700007305 ISWI Proteins 0.000 description 11
- 101710169053 SWI/SNF complex subunit SMARCC1 Proteins 0.000 description 11
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 11
- 238000006243 chemical reaction Methods 0.000 description 11
- 238000000338 in vitro Methods 0.000 description 11
- 230000010354 integration Effects 0.000 description 11
- 210000004962 mammalian cell Anatomy 0.000 description 11
- 230000011987 methylation Effects 0.000 description 11
- 238000007069 methylation reaction Methods 0.000 description 11
- 239000013603 viral vector Substances 0.000 description 11
- 241000588724 Escherichia coli Species 0.000 description 10
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 10
- 101710185494 Zinc finger protein Proteins 0.000 description 10
- 102100023597 Zinc finger protein 816 Human genes 0.000 description 10
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 10
- 230000001225 therapeutic effect Effects 0.000 description 10
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 9
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 9
- 102000011931 Nucleoproteins Human genes 0.000 description 9
- 108010061100 Nucleoproteins Proteins 0.000 description 9
- 102100024793 SWI/SNF complex subunit SMARCC1 Human genes 0.000 description 9
- 230000004075 alteration Effects 0.000 description 9
- 239000002299 complementary DNA Substances 0.000 description 9
- 238000013461 design Methods 0.000 description 9
- 238000002337 electrophoretic mobility shift assay Methods 0.000 description 9
- 230000005764 inhibitory process Effects 0.000 description 9
- 108091008146 restriction endonucleases Proteins 0.000 description 9
- 230000008685 targeting Effects 0.000 description 9
- 238000012546 transfer Methods 0.000 description 9
- 230000014616 translation Effects 0.000 description 9
- 230000005945 translocation Effects 0.000 description 9
- 241000701161 unidentified adenovirus Species 0.000 description 9
- 108010040163 CREB-Binding Protein Proteins 0.000 description 8
- 102100021975 CREB-binding protein Human genes 0.000 description 8
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 8
- 102100030012 Deoxyribonuclease-1 Human genes 0.000 description 8
- 101000581507 Homo sapiens Methyl-CpG-binding domain protein 1 Proteins 0.000 description 8
- 101000615488 Homo sapiens Methyl-CpG-binding domain protein 2 Proteins 0.000 description 8
- 102100027383 Methyl-CpG-binding domain protein 1 Human genes 0.000 description 8
- 102100021299 Methyl-CpG-binding domain protein 2 Human genes 0.000 description 8
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 8
- 230000021736 acetylation Effects 0.000 description 8
- 238000006640 acetylation reaction Methods 0.000 description 8
- 210000000170 cell membrane Anatomy 0.000 description 8
- 230000006196 deacetylation Effects 0.000 description 8
- 238000003381 deacetylation reaction Methods 0.000 description 8
- 239000000243 solution Substances 0.000 description 8
- 101000700937 Amsacta albistriga Sex-specific storage protein 1 Proteins 0.000 description 7
- 101100042630 Caenorhabditis elegans sin-3 gene Proteins 0.000 description 7
- 102100039996 Histone deacetylase 1 Human genes 0.000 description 7
- 102100039999 Histone deacetylase 2 Human genes 0.000 description 7
- 102100021455 Histone deacetylase 3 Human genes 0.000 description 7
- 101001035024 Homo sapiens Histone deacetylase 1 Proteins 0.000 description 7
- 101000899282 Homo sapiens Histone deacetylase 3 Proteins 0.000 description 7
- 206010020751 Hypersensitivity Diseases 0.000 description 7
- 101150045244 ISW2 gene Proteins 0.000 description 7
- 206010028980 Neoplasm Diseases 0.000 description 7
- 108700008625 Reporter Genes Proteins 0.000 description 7
- 102000007508 Retinoblastoma-Binding Protein 4 Human genes 0.000 description 7
- 108010071034 Retinoblastoma-Binding Protein 4 Proteins 0.000 description 7
- 101100509370 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) ISW1 gene Proteins 0.000 description 7
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 7
- 108700019146 Transgenes Proteins 0.000 description 7
- 230000009471 action Effects 0.000 description 7
- 230000001580 bacterial effect Effects 0.000 description 7
- 239000000872 buffer Substances 0.000 description 7
- 238000010790 dilution Methods 0.000 description 7
- 239000012895 dilution Substances 0.000 description 7
- 238000009472 formulation Methods 0.000 description 7
- 239000002609 medium Substances 0.000 description 7
- 239000008188 pellet Substances 0.000 description 7
- 238000002360 preparation method Methods 0.000 description 7
- 230000007115 recruitment Effects 0.000 description 7
- 108091008025 regulatory factors Proteins 0.000 description 7
- 238000006467 substitution reaction Methods 0.000 description 7
- 238000013519 translation Methods 0.000 description 7
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 6
- 102100036279 DNA (cytosine-5)-methyltransferase 1 Human genes 0.000 description 6
- 102000016911 Deoxyribonucleases Human genes 0.000 description 6
- 108010053770 Deoxyribonucleases Proteins 0.000 description 6
- 108090000079 Glucocorticoid Receptors Proteins 0.000 description 6
- 102100033417 Glucocorticoid receptor Human genes 0.000 description 6
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 6
- 241000238631 Hexapoda Species 0.000 description 6
- 102100038885 Histone acetyltransferase p300 Human genes 0.000 description 6
- 101000882390 Homo sapiens Histone acetyltransferase p300 Proteins 0.000 description 6
- 101001032118 Homo sapiens Histone deacetylase 8 Proteins 0.000 description 6
- 101000835860 Homo sapiens SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily B member 1 Proteins 0.000 description 6
- 101000978776 Mus musculus Neurogenic locus notch homolog protein 1 Proteins 0.000 description 6
- 101710163270 Nuclease Proteins 0.000 description 6
- 102100025746 SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily B member 1 Human genes 0.000 description 6
- 239000007983 Tris buffer Substances 0.000 description 6
- 230000003213 activating effect Effects 0.000 description 6
- 238000007792 addition Methods 0.000 description 6
- 230000029087 digestion Effects 0.000 description 6
- 239000003623 enhancer Substances 0.000 description 6
- 210000003527 eukaryotic cell Anatomy 0.000 description 6
- 230000002538 fungal effect Effects 0.000 description 6
- 238000009396 hybridization Methods 0.000 description 6
- 238000005259 measurement Methods 0.000 description 6
- 230000035772 mutation Effects 0.000 description 6
- 238000004806 packaging method and process Methods 0.000 description 6
- 230000008488 polyadenylation Effects 0.000 description 6
- 102000037983 regulatory factors Human genes 0.000 description 6
- 230000001177 retroviral effect Effects 0.000 description 6
- 238000003786 synthesis reaction Methods 0.000 description 6
- 239000003053 toxin Substances 0.000 description 6
- 231100000765 toxin Toxicity 0.000 description 6
- 108700012359 toxins Proteins 0.000 description 6
- 230000005026 transcription initiation Effects 0.000 description 6
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 6
- 239000003981 vehicle Substances 0.000 description 6
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 5
- 102000014914 Carrier Proteins Human genes 0.000 description 5
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 5
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 5
- 238000002965 ELISA Methods 0.000 description 5
- 102000018233 Fibroblast Growth Factor Human genes 0.000 description 5
- 108050007372 Fibroblast Growth Factor Proteins 0.000 description 5
- 108010001515 Galectin 4 Proteins 0.000 description 5
- 102100039556 Galectin-4 Human genes 0.000 description 5
- 108010068250 Herpes Simplex Virus Protein Vmw65 Proteins 0.000 description 5
- 241000725303 Human immunodeficiency virus Species 0.000 description 5
- 108091036060 Linker DNA Proteins 0.000 description 5
- 108060004795 Methyltransferase Proteins 0.000 description 5
- 102000003945 NF-kappa B Human genes 0.000 description 5
- 108010057466 NF-kappa B Proteins 0.000 description 5
- 108091008324 binding proteins Proteins 0.000 description 5
- 108010006025 bovine growth hormone Proteins 0.000 description 5
- 108091092356 cellular DNA Proteins 0.000 description 5
- 230000008859 change Effects 0.000 description 5
- 239000003795 chemical substances by application Substances 0.000 description 5
- 230000000875 corresponding effect Effects 0.000 description 5
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 5
- 235000011180 diphosphates Nutrition 0.000 description 5
- 201000010099 disease Diseases 0.000 description 5
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 5
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 5
- 229940126864 fibroblast growth factor Drugs 0.000 description 5
- 230000002068 genetic effect Effects 0.000 description 5
- 229940088597 hormone Drugs 0.000 description 5
- 239000005556 hormone Substances 0.000 description 5
- 229910001629 magnesium chloride Inorganic materials 0.000 description 5
- 244000005700 microbiome Species 0.000 description 5
- 108020004017 nuclear receptors Proteins 0.000 description 5
- 230000030648 nucleus localization Effects 0.000 description 5
- 239000002245 particle Substances 0.000 description 5
- 230000026731 phosphorylation Effects 0.000 description 5
- 238000006366 phosphorylation reaction Methods 0.000 description 5
- 229920000642 polymer Polymers 0.000 description 5
- 229920000136 polysorbate Polymers 0.000 description 5
- 230000002265 prevention Effects 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 230000014493 regulation of gene expression Effects 0.000 description 5
- 230000008439 repair process Effects 0.000 description 5
- 230000001718 repressive effect Effects 0.000 description 5
- 230000004044 response Effects 0.000 description 5
- 238000002741 site-directed mutagenesis Methods 0.000 description 5
- 230000009870 specific binding Effects 0.000 description 5
- 238000010361 transduction Methods 0.000 description 5
- 230000026683 transduction Effects 0.000 description 5
- 230000014621 translational initiation Effects 0.000 description 5
- 238000010798 ubiquitination Methods 0.000 description 5
- 241001430294 unidentified retrovirus Species 0.000 description 5
- 239000013607 AAV vector Substances 0.000 description 4
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 4
- 102000053602 DNA Human genes 0.000 description 4
- 102100022846 Histone acetyltransferase KAT2B Human genes 0.000 description 4
- 101150013550 MBD2 gene Proteins 0.000 description 4
- 241001465754 Metazoa Species 0.000 description 4
- 108091061960 Naked DNA Proteins 0.000 description 4
- 108091005461 Nucleic proteins Proteins 0.000 description 4
- 238000012408 PCR amplification Methods 0.000 description 4
- 108700009124 Transcription Initiation Site Proteins 0.000 description 4
- 102100033254 Tumor suppressor ARF Human genes 0.000 description 4
- 239000003242 anti bacterial agent Substances 0.000 description 4
- 210000004369 blood Anatomy 0.000 description 4
- 239000008280 blood Substances 0.000 description 4
- 238000012512 characterization method Methods 0.000 description 4
- 239000003937 drug carrier Substances 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 230000007613 environmental effect Effects 0.000 description 4
- MQLVWQSVRZVNIP-UHFFFAOYSA-L ferrous ammonium sulfate hexahydrate Chemical compound [NH4+].[NH4+].O.O.O.O.O.O.[Fe+2].[O-]S([O-])(=O)=O.[O-]S([O-])(=O)=O MQLVWQSVRZVNIP-UHFFFAOYSA-L 0.000 description 4
- 239000000835 fiber Substances 0.000 description 4
- 102000034356 gene-regulatory proteins Human genes 0.000 description 4
- 108091006104 gene-regulatory proteins Proteins 0.000 description 4
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 4
- 102000058223 human VEGFA Human genes 0.000 description 4
- 238000002955 isolation Methods 0.000 description 4
- 238000001638 lipofection Methods 0.000 description 4
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- 238000010561 standard procedure Methods 0.000 description 4
- 210000000130 stem cell Anatomy 0.000 description 4
- 230000032258 transport Effects 0.000 description 4
- 239000011592 zinc chloride Substances 0.000 description 4
- JIAARYAFYJHUJI-UHFFFAOYSA-L zinc dichloride Chemical compound [Cl-].[Cl-].[Zn+2] JIAARYAFYJHUJI-UHFFFAOYSA-L 0.000 description 4
- 230000005730 ADP ribosylation Effects 0.000 description 3
- 102000007469 Actins Human genes 0.000 description 3
- 108010085238 Actins Proteins 0.000 description 3
- 108700028369 Alleles Proteins 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 102000001805 Bromodomains Human genes 0.000 description 3
- 108050009021 Bromodomains Proteins 0.000 description 3
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 3
- 102100025064 Cellular tumor antigen p53 Human genes 0.000 description 3
- 108010060434 Co-Repressor Proteins Proteins 0.000 description 3
- 102000008169 Co-Repressor Proteins Human genes 0.000 description 3
- 102000004127 Cytokines Human genes 0.000 description 3
- 108090000695 Cytokines Proteins 0.000 description 3
- 108010009540 DNA (Cytosine-5-)-Methyltransferase 1 Proteins 0.000 description 3
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 3
- 241000702421 Dependoparvovirus Species 0.000 description 3
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- 102100031181 Glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 3
- 239000004471 Glycine Substances 0.000 description 3
- 102100039620 Granulocyte-macrophage colony-stimulating factor Human genes 0.000 description 3
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 3
- 101710083341 Histone acetyltransferase KAT2B Proteins 0.000 description 3
- 102000009331 Homeodomain Proteins Human genes 0.000 description 3
- 108010048671 Homeodomain Proteins Proteins 0.000 description 3
- 101000721661 Homo sapiens Cellular tumor antigen p53 Proteins 0.000 description 3
- 101000931098 Homo sapiens DNA (cytosine-5)-methyltransferase 1 Proteins 0.000 description 3
- 101000987586 Homo sapiens Eosinophil peroxidase Proteins 0.000 description 3
- 101000920686 Homo sapiens Erythropoietin Proteins 0.000 description 3
- 101001040800 Homo sapiens Integral membrane protein GPR180 Proteins 0.000 description 3
- 102100021244 Integral membrane protein GPR180 Human genes 0.000 description 3
- 102000012330 Integrases Human genes 0.000 description 3
- 108010061833 Integrases Proteins 0.000 description 3
- 102000000589 Interleukin-1 Human genes 0.000 description 3
- 108010002352 Interleukin-1 Proteins 0.000 description 3
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 3
- 241000124008 Mammalia Species 0.000 description 3
- 101150071869 Mbd1 gene Proteins 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 102000006890 Methyl-CpG-Binding Protein 2 Human genes 0.000 description 3
- 108010072388 Methyl-CpG-Binding Protein 2 Proteins 0.000 description 3
- 108010059724 Micrococcal Nuclease Proteins 0.000 description 3
- 241001529936 Murinae Species 0.000 description 3
- 102100030569 Nuclear receptor corepressor 2 Human genes 0.000 description 3
- 101710153660 Nuclear receptor corepressor 2 Proteins 0.000 description 3
- 108700020796 Oncogene Proteins 0.000 description 3
- 241001631646 Papillomaviridae Species 0.000 description 3
- 241000288906 Primates Species 0.000 description 3
- 102000055027 Protein Methyltransferases Human genes 0.000 description 3
- 108700040121 Protein Methyltransferases Proteins 0.000 description 3
- 108010076504 Protein Sorting Signals Proteins 0.000 description 3
- 108700033844 Pseudomonas aeruginosa toxA Proteins 0.000 description 3
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 3
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 3
- 108010091086 Recombinases Proteins 0.000 description 3
- 102000018120 Recombinases Human genes 0.000 description 3
- 108091027981 Response element Proteins 0.000 description 3
- 101100536259 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) TAF14 gene Proteins 0.000 description 3
- 241000235343 Saccharomycetales Species 0.000 description 3
- 108091081024 Start codon Proteins 0.000 description 3
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 3
- 108700005077 Viral Genes Proteins 0.000 description 3
- MMWCIQZXVOZEGG-HOZKJCLWSA-N [(1S,2R,3S,4S,5R,6S)-2,3,5-trihydroxy-4,6-diphosphonooxycyclohexyl] dihydrogen phosphate Chemical compound O[C@H]1[C@@H](O)[C@H](OP(O)(O)=O)[C@@H](OP(O)(O)=O)[C@H](O)[C@H]1OP(O)(O)=O MMWCIQZXVOZEGG-HOZKJCLWSA-N 0.000 description 3
- 238000000137 annealing Methods 0.000 description 3
- 229940088710 antibiotic agent Drugs 0.000 description 3
- 239000008346 aqueous phase Substances 0.000 description 3
- 238000003491 array Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 230000003115 biocidal effect Effects 0.000 description 3
- 201000011510 cancer Diseases 0.000 description 3
- 238000007385 chemical modification Methods 0.000 description 3
- 210000003763 chloroplast Anatomy 0.000 description 3
- 239000013599 cloning vector Substances 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 230000002596 correlated effect Effects 0.000 description 3
- 230000017858 demethylation Effects 0.000 description 3
- 238000010520 demethylation reaction Methods 0.000 description 3
- 230000030609 dephosphorylation Effects 0.000 description 3
- 238000006209 dephosphorylation reaction Methods 0.000 description 3
- 230000008021 deposition Effects 0.000 description 3
- 239000013024 dilution buffer Substances 0.000 description 3
- 238000010494 dissociation reaction Methods 0.000 description 3
- 230000005593 dissociations Effects 0.000 description 3
- 238000001962 electrophoresis Methods 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 235000013861 fat-free Nutrition 0.000 description 3
- 108060003196 globin Proteins 0.000 description 3
- 239000005090 green fluorescent protein Substances 0.000 description 3
- 230000006195 histone acetylation Effects 0.000 description 3
- 230000002209 hydrophobic effect Effects 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 238000001802 infusion Methods 0.000 description 3
- 238000002347 injection Methods 0.000 description 3
- 239000007924 injection Substances 0.000 description 3
- 239000000138 intercalating agent Substances 0.000 description 3
- 230000003834 intracellular effect Effects 0.000 description 3
- 238000001990 intravenous administration Methods 0.000 description 3
- 230000000670 limiting effect Effects 0.000 description 3
- 239000012139 lysis buffer Substances 0.000 description 3
- 235000013336 milk Nutrition 0.000 description 3
- 239000008267 milk Substances 0.000 description 3
- 210000004080 milk Anatomy 0.000 description 3
- 210000003205 muscle Anatomy 0.000 description 3
- 230000007935 neutral effect Effects 0.000 description 3
- 239000008194 pharmaceutical composition Substances 0.000 description 3
- 229920002401 polyacrylamide Polymers 0.000 description 3
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 3
- 210000001236 prokaryotic cell Anatomy 0.000 description 3
- 230000007420 reactivation Effects 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- QZAYGJVTTNCVMB-UHFFFAOYSA-N serotonin Chemical compound C1=C(O)C=C2C(CCN)=CNC2=C1 QZAYGJVTTNCVMB-UHFFFAOYSA-N 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 239000000725 suspension Substances 0.000 description 3
- 108091006106 transcriptional activators Proteins 0.000 description 3
- 230000034512 ubiquitination Effects 0.000 description 3
- 241000701447 unidentified baculovirus Species 0.000 description 3
- AXAVXPMQTGXXJZ-UHFFFAOYSA-N 2-aminoacetic acid;2-amino-2-(hydroxymethyl)propane-1,3-diol Chemical compound NCC(O)=O.OCC(N)(CO)CO AXAVXPMQTGXXJZ-UHFFFAOYSA-N 0.000 description 2
- ZOOGRGPOEVQQDX-UUOKFMHZSA-N 3',5'-cyclic GMP Chemical compound C([C@H]1O2)OP(O)(=O)O[C@H]1[C@@H](O)[C@@H]2N1C(N=C(NC2=O)N)=C2N=C1 ZOOGRGPOEVQQDX-UUOKFMHZSA-N 0.000 description 2
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 2
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 2
- 102100023635 Alpha-fetoprotein Human genes 0.000 description 2
- 101710137189 Amyloid-beta A4 protein Proteins 0.000 description 2
- 102100022704 Amyloid-beta precursor protein Human genes 0.000 description 2
- 101710151993 Amyloid-beta precursor protein Proteins 0.000 description 2
- 108700031308 Antennapedia Homeodomain Proteins 0.000 description 2
- 101710095342 Apolipoprotein B Proteins 0.000 description 2
- 102100040202 Apolipoprotein B-100 Human genes 0.000 description 2
- 102000007592 Apolipoproteins Human genes 0.000 description 2
- 108010071619 Apolipoproteins Proteins 0.000 description 2
- 101100493735 Arabidopsis thaliana BBX25 gene Proteins 0.000 description 2
- 101001030716 Arabidopsis thaliana Histone deacetylase HDT1 Proteins 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 241000271566 Aves Species 0.000 description 2
- 241000193738 Bacillus anthracis Species 0.000 description 2
- 102100026189 Beta-galactosidase Human genes 0.000 description 2
- 102100023995 Beta-nerve growth factor Human genes 0.000 description 2
- 206010006187 Breast cancer Diseases 0.000 description 2
- 102100035875 C-C chemokine receptor type 5 Human genes 0.000 description 2
- 101710149870 C-C chemokine receptor type 5 Proteins 0.000 description 2
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 2
- 108010022366 Carcinoembryonic Antigen Proteins 0.000 description 2
- 102100025475 Carcinoembryonic antigen-related cell adhesion molecule 5 Human genes 0.000 description 2
- 102000000844 Cell Surface Receptors Human genes 0.000 description 2
- 108010001857 Cell Surface Receptors Proteins 0.000 description 2
- 102000004410 Cholesterol 7-alpha-monooxygenases Human genes 0.000 description 2
- 108090000943 Cholesterol 7-alpha-monooxygenases Proteins 0.000 description 2
- 108091029430 CpG site Proteins 0.000 description 2
- 108010079245 Cystic Fibrosis Transmembrane Conductance Regulator Proteins 0.000 description 2
- 102100023419 Cystic fibrosis transmembrane conductance regulator Human genes 0.000 description 2
- 238000001712 DNA sequencing Methods 0.000 description 2
- 101710096438 DNA-binding protein Proteins 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 2
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 2
- 229920002307 Dextran Polymers 0.000 description 2
- 108010053187 Diphtheria Toxin Proteins 0.000 description 2
- 102000016607 Diphtheria Toxin Human genes 0.000 description 2
- AOJJSUZBOXZQNB-TZSSRYMLSA-N Doxorubicin Chemical compound O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(=O)CO)[C@H]1C[C@H](N)[C@H](O)[C@H](C)O1 AOJJSUZBOXZQNB-TZSSRYMLSA-N 0.000 description 2
- 102100024108 Dystrophin Human genes 0.000 description 2
- 238000008157 ELISA kit Methods 0.000 description 2
- 101150002621 EPO gene Proteins 0.000 description 2
- 102000004533 Endonucleases Human genes 0.000 description 2
- 108010042407 Endonucleases Proteins 0.000 description 2
- 108010067770 Endopeptidase K Proteins 0.000 description 2
- 102000008946 Fibrinogen Human genes 0.000 description 2
- 108010049003 Fibrinogen Proteins 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- 241000713813 Gibbon ape leukemia virus Species 0.000 description 2
- 102000003886 Glycoproteins Human genes 0.000 description 2
- 108090000288 Glycoproteins Proteins 0.000 description 2
- 108010017213 Granulocyte-Macrophage Colony-Stimulating Factor Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- 101800003471 Helicase Proteins 0.000 description 2
- 108010034791 Heterochromatin Proteins 0.000 description 2
- 102100021454 Histone deacetylase 4 Human genes 0.000 description 2
- 102100021453 Histone deacetylase 5 Human genes 0.000 description 2
- 102100022537 Histone deacetylase 6 Human genes 0.000 description 2
- 102100025539 Histone deacetylase complex subunit SAP18 Human genes 0.000 description 2
- 102100023357 Histone deacetylase complex subunit SAP30 Human genes 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 101000899259 Homo sapiens Histone deacetylase 4 Proteins 0.000 description 2
- 101000899255 Homo sapiens Histone deacetylase 5 Proteins 0.000 description 2
- 101000899330 Homo sapiens Histone deacetylase 6 Proteins 0.000 description 2
- 101000693664 Homo sapiens Histone deacetylase complex subunit SAP18 Proteins 0.000 description 2
- 101000686001 Homo sapiens Histone deacetylase complex subunit SAP30 Proteins 0.000 description 2
- 101000615492 Homo sapiens Methyl-CpG-binding domain protein 4 Proteins 0.000 description 2
- 101000611023 Homo sapiens Tumor necrosis factor receptor superfamily member 6 Proteins 0.000 description 2
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 2
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 2
- MHAJPDPJQMAIIY-UHFFFAOYSA-N Hydrogen peroxide Chemical compound OO MHAJPDPJQMAIIY-UHFFFAOYSA-N 0.000 description 2
- 108060003951 Immunoglobulin Proteins 0.000 description 2
- 206010061218 Inflammation Diseases 0.000 description 2
- 102100037850 Interferon gamma Human genes 0.000 description 2
- 108010074328 Interferon-gamma Proteins 0.000 description 2
- 108091092195 Intron Proteins 0.000 description 2
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 2
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical group CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical group CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 2
- 108010001831 LDL receptors Proteins 0.000 description 2
- 108010054278 Lac Repressors Proteins 0.000 description 2
- 101710128836 Large T antigen Proteins 0.000 description 2
- 241000713666 Lentivirus Species 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- 102100024640 Low-density lipoprotein receptor Human genes 0.000 description 2
- 108060001084 Luciferase Proteins 0.000 description 2
- 239000005089 Luciferase Substances 0.000 description 2
- 102100021290 Methyl-CpG-binding domain protein 4 Human genes 0.000 description 2
- 102000007474 Multiprotein Complexes Human genes 0.000 description 2
- 108010085220 Multiprotein Complexes Proteins 0.000 description 2
- 241000714177 Murine leukemia virus Species 0.000 description 2
- WWGBHDIHIVGYLZ-UHFFFAOYSA-N N-[4-[3-[[[7-(hydroxyamino)-7-oxoheptyl]amino]-oxomethyl]-5-isoxazolyl]phenyl]carbamic acid tert-butyl ester Chemical compound C1=CC(NC(=O)OC(C)(C)C)=CC=C1C1=CC(C(=O)NCCCCCCC(=O)NO)=NO1 WWGBHDIHIVGYLZ-UHFFFAOYSA-N 0.000 description 2
- 101710167853 N-methyltransferase Proteins 0.000 description 2
- 108010025020 Nerve Growth Factor Proteins 0.000 description 2
- 101000587759 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) Transcription elongation factor spt5 Proteins 0.000 description 2
- 102000006570 Non-Histone Chromosomal Proteins Human genes 0.000 description 2
- 108010008964 Non-Histone Chromosomal Proteins Proteins 0.000 description 2
- 238000000636 Northern blotting Methods 0.000 description 2
- 102000043276 Oncogene Human genes 0.000 description 2
- 239000012124 Opti-MEM Substances 0.000 description 2
- 108020002230 Pancreatic Ribonuclease Proteins 0.000 description 2
- 102000005891 Pancreatic ribonuclease Human genes 0.000 description 2
- 108091093037 Peptide nucleic acid Proteins 0.000 description 2
- 108010081690 Pertussis Toxin Proteins 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- 108700001094 Plant Genes Proteins 0.000 description 2
- 108010038512 Platelet-Derived Growth Factor Proteins 0.000 description 2
- 102000010780 Platelet-Derived Growth Factor Human genes 0.000 description 2
- 229920000776 Poly(Adenosine diphosphate-ribose) polymerase Polymers 0.000 description 2
- 101710182846 Polyhedrin Proteins 0.000 description 2
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 2
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- ATUOYWHBWRKTHZ-UHFFFAOYSA-N Propane Chemical compound CCC ATUOYWHBWRKTHZ-UHFFFAOYSA-N 0.000 description 2
- 108010001267 Protein Subunits Proteins 0.000 description 2
- 102000002067 Protein Subunits Human genes 0.000 description 2
- 101710149951 Protein Tat Proteins 0.000 description 2
- 102000002727 Protein Tyrosine Phosphatase Human genes 0.000 description 2
- 102000004022 Protein-Tyrosine Kinases Human genes 0.000 description 2
- 108090000412 Protein-Tyrosine Kinases Proteins 0.000 description 2
- 241000125945 Protoparvovirus Species 0.000 description 2
- 102000009572 RNA Polymerase II Human genes 0.000 description 2
- 108010009460 RNA Polymerase II Proteins 0.000 description 2
- 230000004570 RNA-binding Effects 0.000 description 2
- 238000011529 RT qPCR Methods 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 108091028664 Ribonucleotide Proteins 0.000 description 2
- 102100031029 SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily E member 1 Human genes 0.000 description 2
- 101710089108 SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily E member 1 Proteins 0.000 description 2
- 101100311254 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) STH1 gene Proteins 0.000 description 2
- 241000607142 Salmonella Species 0.000 description 2
- 108090000184 Selectins Proteins 0.000 description 2
- 102000003800 Selectins Human genes 0.000 description 2
- 241000713311 Simian immunodeficiency virus Species 0.000 description 2
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 2
- 108010085012 Steroid Receptors Proteins 0.000 description 2
- 101100277996 Symbiobacterium thermophilum (strain T / IAM 14863) dnaA gene Proteins 0.000 description 2
- 210000001744 T-lymphocyte Anatomy 0.000 description 2
- 108020005038 Terminator Codon Proteins 0.000 description 2
- 101710183280 Topoisomerase Proteins 0.000 description 2
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 2
- 102100031988 Tumor necrosis factor ligand superfamily member 6 Human genes 0.000 description 2
- 102100040403 Tumor necrosis factor receptor superfamily member 6 Human genes 0.000 description 2
- 108700042462 X-linked Nuclear Proteins 0.000 description 2
- 102000056014 X-linked Nuclear Human genes 0.000 description 2
- 108091007916 Zinc finger transcription factors Proteins 0.000 description 2
- 102000038627 Zinc finger transcription factors Human genes 0.000 description 2
- 102000030621 adenylate cyclase Human genes 0.000 description 2
- 108060000200 adenylate cyclase Proteins 0.000 description 2
- 239000000443 aerosol Substances 0.000 description 2
- 239000011543 agarose gel Substances 0.000 description 2
- 235000004279 alanine Nutrition 0.000 description 2
- 208000026935 allergic disease Diseases 0.000 description 2
- 108010026331 alpha-Fetoproteins Proteins 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- DZHSAHHDTRWUTF-SIQRNXPUSA-N amyloid-beta polypeptide 42 Chemical compound C([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O)[C@@H](C)CC)C(C)C)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC=1N=CNC=1)NC(=O)[C@H](CC=1N=CNC=1)NC(=O)[C@@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)CNC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC=1N=CNC=1)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O)C(C)C)C(C)C)C1=CC=CC=C1 DZHSAHHDTRWUTF-SIQRNXPUSA-N 0.000 description 2
- 238000000376 autoradiography Methods 0.000 description 2
- 230000004888 barrier function Effects 0.000 description 2
- 108010005774 beta-Galactosidase Proteins 0.000 description 2
- 230000000903 blocking effect Effects 0.000 description 2
- 210000001185 bone marrow Anatomy 0.000 description 2
- 210000002798 bone marrow cell Anatomy 0.000 description 2
- 229940098773 bovine serum albumin Drugs 0.000 description 2
- 239000011575 calcium Substances 0.000 description 2
- 229910052791 calcium Inorganic materials 0.000 description 2
- 239000001506 calcium phosphate Substances 0.000 description 2
- 229910000389 calcium phosphate Inorganic materials 0.000 description 2
- 235000011010 calcium phosphates Nutrition 0.000 description 2
- 150000001720 carbohydrates Chemical class 0.000 description 2
- 230000010261 cell growth Effects 0.000 description 2
- 239000013592 cell lysate Substances 0.000 description 2
- 108091092328 cellular RNA Proteins 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 230000021615 conjugation Effects 0.000 description 2
- 239000000470 constituent Substances 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 235000018417 cysteine Nutrition 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical group NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 210000000172 cytosol Anatomy 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- MWRBNPKJOOWZPW-CLFAGFIQSA-N dioleoyl phosphatidylethanolamine Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OCC(COP(O)(=O)OCCN)OC(=O)CCCCCCC\C=C/CCCCCCCC MWRBNPKJOOWZPW-CLFAGFIQSA-N 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 102000015694 estrogen receptors Human genes 0.000 description 2
- 108010038795 estrogen receptors Proteins 0.000 description 2
- 239000012091 fetal bovine serum Substances 0.000 description 2
- 239000012894 fetal calf serum Substances 0.000 description 2
- 229940012952 fibrinogen Drugs 0.000 description 2
- 102000018146 globin Human genes 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 210000003958 hematopoietic stem cell Anatomy 0.000 description 2
- 208000006454 hepatitis Diseases 0.000 description 2
- 231100000283 hepatitis Toxicity 0.000 description 2
- 210000004458 heterochromatin Anatomy 0.000 description 2
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 2
- 108091008039 hormone receptors Proteins 0.000 description 2
- 230000007062 hydrolysis Effects 0.000 description 2
- 238000006460 hydrolysis reaction Methods 0.000 description 2
- 230000009610 hypersensitivity Effects 0.000 description 2
- 230000001900 immune effect Effects 0.000 description 2
- 102000018358 immunoglobulin Human genes 0.000 description 2
- 238000001114 immunoprecipitation Methods 0.000 description 2
- 230000002779 inactivation Effects 0.000 description 2
- 238000010348 incorporation Methods 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 230000004054 inflammatory process Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 2
- 102000006495 integrins Human genes 0.000 description 2
- 108010044426 integrins Proteins 0.000 description 2
- 238000007918 intramuscular administration Methods 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 210000003734 kidney Anatomy 0.000 description 2
- 238000002372 labelling Methods 0.000 description 2
- 150000002634 lipophilic molecules Chemical class 0.000 description 2
- KWGKDLIKAYFUFQ-UHFFFAOYSA-M lithium chloride Chemical compound [Li+].[Cl-] KWGKDLIKAYFUFQ-UHFFFAOYSA-M 0.000 description 2
- 230000033001 locomotion Effects 0.000 description 2
- 229920002521 macromolecule Polymers 0.000 description 2
- 238000012423 maintenance Methods 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 2
- 229930182817 methionine Chemical group 0.000 description 2
- 238000000520 microinjection Methods 0.000 description 2
- 230000002438 mitochondrial effect Effects 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 229940053128 nerve growth factor Drugs 0.000 description 2
- IDBIFFKSXLYUOT-UHFFFAOYSA-N netropsin Chemical compound C1=C(C(=O)NCCC(N)=N)N(C)C=C1NC(=O)C1=CC(NC(=O)CN=C(N)N)=CN1C IDBIFFKSXLYUOT-UHFFFAOYSA-N 0.000 description 2
- 230000001575 pathological effect Effects 0.000 description 2
- 230000000737 periodic effect Effects 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- 229930029653 phosphoenolpyruvate Natural products 0.000 description 2
- DTBNBXWJWCWCIK-UHFFFAOYSA-N phosphoenolpyruvic acid Chemical compound OC(=O)C(=C)OP(O)(O)=O DTBNBXWJWCWCIK-UHFFFAOYSA-N 0.000 description 2
- 230000023603 positive regulation of transcription initiation, DNA-dependent Effects 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 239000002987 primer (paints) Substances 0.000 description 2
- 230000000644 propagated effect Effects 0.000 description 2
- 235000004252 protein component Nutrition 0.000 description 2
- 108020000494 protein-tyrosine phosphatase Proteins 0.000 description 2
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 2
- 230000009711 regulatory function Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 239000011347 resin Substances 0.000 description 2
- 229920005989 resin Polymers 0.000 description 2
- 239000002336 ribonucleotide Substances 0.000 description 2
- 239000012146 running buffer Substances 0.000 description 2
- 102000023888 sequence-specific DNA binding proteins Human genes 0.000 description 2
- 108091008420 sequence-specific DNA binding proteins Proteins 0.000 description 2
- 230000019491 signal transduction Effects 0.000 description 2
- 239000001632 sodium acetate Substances 0.000 description 2
- 235000017281 sodium acetate Nutrition 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 102000005969 steroid hormone receptors Human genes 0.000 description 2
- 230000000638 stimulation Effects 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 102100032270 tRNA (cytosine(38)-C(5))-methyltransferase Human genes 0.000 description 2
- 101710184308 tRNA (cytosine(38)-C(5))-methyltransferase Proteins 0.000 description 2
- 108091035539 telomere Proteins 0.000 description 2
- 102000055501 telomere Human genes 0.000 description 2
- 210000003411 telomere Anatomy 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 238000002560 therapeutic procedure Methods 0.000 description 2
- 238000004809 thin layer chromatography Methods 0.000 description 2
- 230000005029 transcription elongation Effects 0.000 description 2
- 230000005030 transcription termination Effects 0.000 description 2
- 108091008023 transcriptional regulators Proteins 0.000 description 2
- 108091006107 transcriptional repressors Proteins 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 230000009261 transgenic effect Effects 0.000 description 2
- 210000003956 transport vesicle Anatomy 0.000 description 2
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 2
- LWIHDJKSTIGBAC-UHFFFAOYSA-K tripotassium phosphate Chemical compound [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 2
- 238000010200 validation analysis Methods 0.000 description 2
- 108700026220 vif Genes Proteins 0.000 description 2
- 238000003260 vortexing Methods 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 description 1
- MZOFCQQQCNRIBI-VMXHOPILSA-N (3s)-4-[[(2s)-1-[[(2s)-1-[[(1s)-1-carboxy-2-hydroxyethyl]amino]-4-methyl-1-oxopentan-2-yl]amino]-5-(diaminomethylideneamino)-1-oxopentan-2-yl]amino]-3-[[2-[[(2s)-2,6-diaminohexanoyl]amino]acetyl]amino]-4-oxobutanoic acid Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN MZOFCQQQCNRIBI-VMXHOPILSA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- UKAUYVFTDYCKQA-UHFFFAOYSA-N -2-Amino-4-hydroxybutanoic acid Natural products OC(=O)C(N)CCO UKAUYVFTDYCKQA-UHFFFAOYSA-N 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- VGONTNSXDCQUGY-RRKCRQDMSA-N 2'-deoxyinosine Chemical group C1[C@H](O)[C@@H](CO)O[C@H]1N1C(N=CNC2=O)=C2N=C1 VGONTNSXDCQUGY-RRKCRQDMSA-N 0.000 description 1
- 108010020183 3-phosphoshikimate 1-carboxyvinyltransferase Proteins 0.000 description 1
- HGFIOWHPOGLXPU-UHFFFAOYSA-L 4,7-diphenyl-1,10-phenanthroline 4',4''-disulfonate Chemical compound C1=CC(S(=O)(=O)[O-])=CC=C1C1=CC=NC2=C1C=CC1=C(C=3C=CC(=CC=3)S([O-])(=O)=O)C=CN=C21 HGFIOWHPOGLXPU-UHFFFAOYSA-L 0.000 description 1
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 1
- QFVHZQCOUORWEI-UHFFFAOYSA-N 4-[(4-anilino-5-sulfonaphthalen-1-yl)diazenyl]-5-hydroxynaphthalene-2,7-disulfonic acid Chemical compound C=12C(O)=CC(S(O)(=O)=O)=CC2=CC(S(O)(=O)=O)=CC=1N=NC(C1=CC=CC(=C11)S(O)(=O)=O)=CC=C1NC1=CC=CC=C1 QFVHZQCOUORWEI-UHFFFAOYSA-N 0.000 description 1
- JYCQQPHGFMYQCF-UHFFFAOYSA-N 4-tert-Octylphenol monoethoxylate Chemical compound CC(C)(C)CC(C)(C)C1=CC=C(OCCO)C=C1 JYCQQPHGFMYQCF-UHFFFAOYSA-N 0.000 description 1
- 102000040125 5-hydroxytryptamine receptor family Human genes 0.000 description 1
- 108091032151 5-hydroxytryptamine receptor family Proteins 0.000 description 1
- 101150037123 APOE gene Proteins 0.000 description 1
- 101150066797 ARP7 gene Proteins 0.000 description 1
- 101150061796 ARP9 gene Proteins 0.000 description 1
- 101150020330 ATRX gene Proteins 0.000 description 1
- 108010016219 Acetyl-CoA carboxylase Proteins 0.000 description 1
- 102000000452 Acetyl-CoA carboxylase Human genes 0.000 description 1
- 108010013043 Acetylesterase Proteins 0.000 description 1
- 241000702423 Adeno-associated virus - 2 Species 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- 208000024827 Alzheimer disease Diseases 0.000 description 1
- 229920000856 Amylose Polymers 0.000 description 1
- 244000099147 Ananas comosus Species 0.000 description 1
- 235000007119 Ananas comosus Nutrition 0.000 description 1
- 102100032187 Androgen receptor Human genes 0.000 description 1
- 102400000068 Angiostatin Human genes 0.000 description 1
- 108010079709 Angiostatins Proteins 0.000 description 1
- 108050000824 Angiotensin II receptor Proteins 0.000 description 1
- 102000008873 Angiotensin II receptor Human genes 0.000 description 1
- 108020005544 Antisense RNA Proteins 0.000 description 1
- 102100029470 Apolipoprotein E Human genes 0.000 description 1
- 101710095339 Apolipoprotein E Proteins 0.000 description 1
- 102100021569 Apoptosis regulator Bcl-2 Human genes 0.000 description 1
- 101100444285 Arabidopsis thaliana DYAD gene Proteins 0.000 description 1
- 101100421779 Arabidopsis thaliana SNL3 gene Proteins 0.000 description 1
- 208000006400 Arbovirus Encephalitis Diseases 0.000 description 1
- 108700042296 Archaeal Genes Proteins 0.000 description 1
- 241000228212 Aspergillus Species 0.000 description 1
- 101800001288 Atrial natriuretic factor Proteins 0.000 description 1
- 102400001282 Atrial natriuretic peptide Human genes 0.000 description 1
- 101800001890 Atrial natriuretic peptide Proteins 0.000 description 1
- 241000304886 Bacilli Species 0.000 description 1
- 241000194110 Bacillus sp. (in: Bacteria) Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 231100000699 Bacterial toxin Toxicity 0.000 description 1
- 102000000806 Basic-Leucine Zipper Transcription Factors Human genes 0.000 description 1
- 108010001572 Basic-Leucine Zipper Transcription Factors Proteins 0.000 description 1
- 108010006654 Bleomycin Proteins 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 208000003508 Botulism Diseases 0.000 description 1
- 238000009010 Bradford assay Methods 0.000 description 1
- 208000026310 Breast neoplasm Diseases 0.000 description 1
- 102100031151 C-C chemokine receptor type 2 Human genes 0.000 description 1
- 101710149815 C-C chemokine receptor type 2 Proteins 0.000 description 1
- 102100024167 C-C chemokine receptor type 3 Human genes 0.000 description 1
- 101710149862 C-C chemokine receptor type 3 Proteins 0.000 description 1
- 102100031102 C-C motif chemokine 4 Human genes 0.000 description 1
- 102100031650 C-X-C chemokine receptor type 4 Human genes 0.000 description 1
- 101710082513 C-X-C chemokine receptor type 4 Proteins 0.000 description 1
- QCMYYKRYFNMIEC-UHFFFAOYSA-N COP(O)=O Chemical class COP(O)=O QCMYYKRYFNMIEC-UHFFFAOYSA-N 0.000 description 1
- 101100220616 Caenorhabditis elegans chk-2 gene Proteins 0.000 description 1
- 101100123577 Caenorhabditis elegans hda-1 gene Proteins 0.000 description 1
- 101100534223 Caenorhabditis elegans src-1 gene Proteins 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 102100025580 Calmodulin-1 Human genes 0.000 description 1
- 102100025465 Calpain-10 Human genes 0.000 description 1
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 1
- 241000282465 Canis Species 0.000 description 1
- 108090000565 Capsid Proteins Proteins 0.000 description 1
- 101710132601 Capsid protein Proteins 0.000 description 1
- 208000005623 Carcinogenesis Diseases 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 102000011933 Cathepsin W Human genes 0.000 description 1
- 108010061112 Cathepsin W Proteins 0.000 description 1
- 102000005600 Cathepsins Human genes 0.000 description 1
- 108010084457 Cathepsins Proteins 0.000 description 1
- 102000009410 Chemokine receptor Human genes 0.000 description 1
- 108050000299 Chemokine receptor Proteins 0.000 description 1
- 241000606161 Chlamydia Species 0.000 description 1
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 description 1
- 206010008631 Cholera Diseases 0.000 description 1
- 101710181272 Choline dehydrogenase, mitochondrial Proteins 0.000 description 1
- 102000009660 Cholinergic Receptors Human genes 0.000 description 1
- 108010009685 Cholinergic Receptors Proteins 0.000 description 1
- 102000017589 Chromo domains Human genes 0.000 description 1
- 108050005811 Chromo domains Proteins 0.000 description 1
- 208000037051 Chromosomal Instability Diseases 0.000 description 1
- 241000193403 Clostridium Species 0.000 description 1
- 108010021408 Clostridium perfringens iota toxin Proteins 0.000 description 1
- 101100007328 Cocos nucifera COS-1 gene Proteins 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 108010071942 Colony-Stimulating Factors Proteins 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 241000709687 Coxsackievirus Species 0.000 description 1
- 108091029523 CpG island Proteins 0.000 description 1
- 101710095468 Cyclase Proteins 0.000 description 1
- 102000016736 Cyclin Human genes 0.000 description 1
- 108050006400 Cyclin Proteins 0.000 description 1
- 201000003883 Cystic fibrosis Diseases 0.000 description 1
- 108010052832 Cytochromes Proteins 0.000 description 1
- 102000018832 Cytochromes Human genes 0.000 description 1
- 101710155335 DELLA protein SLR1 Proteins 0.000 description 1
- 102100024812 DNA (cytosine-5)-methyltransferase 3A Human genes 0.000 description 1
- 102100024810 DNA (cytosine-5)-methyltransferase 3B Human genes 0.000 description 1
- 101710123222 DNA (cytosine-5)-methyltransferase 3B Proteins 0.000 description 1
- 108010024491 DNA Methyltransferase 3A Proteins 0.000 description 1
- 108020003215 DNA Probes Proteins 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 239000003298 DNA probe Substances 0.000 description 1
- 241000450599 DNA viruses Species 0.000 description 1
- 108010092160 Dactinomycin Proteins 0.000 description 1
- 101100216294 Danio rerio apoeb gene Proteins 0.000 description 1
- 101100540419 Danio rerio kdrl gene Proteins 0.000 description 1
- 101100239628 Danio rerio myca gene Proteins 0.000 description 1
- 241000710829 Dengue virus group Species 0.000 description 1
- 108010054576 Deoxyribonuclease EcoRI Proteins 0.000 description 1
- 102000001477 Deubiquitinating Enzymes Human genes 0.000 description 1
- 108010093668 Deubiquitinating Enzymes Proteins 0.000 description 1
- 206010012689 Diabetic retinopathy Diseases 0.000 description 1
- 239000004338 Dichlorodifluoromethane Substances 0.000 description 1
- 102000015554 Dopamine receptor Human genes 0.000 description 1
- 108050004812 Dopamine receptor Proteins 0.000 description 1
- 108700016023 Drosophila Smr Proteins 0.000 description 1
- 108700007302 Drosophila brm Proteins 0.000 description 1
- 108700025095 Drosophila gro Proteins 0.000 description 1
- 101100058112 Drosophila melanogaster Bap60 gene Proteins 0.000 description 1
- 101000932863 Drosophila melanogaster Chromatin assembly factor 1 p55 subunit Proteins 0.000 description 1
- 101001056834 Drosophila melanogaster Chromatin-remodeling complex ATPase chain Iswi Proteins 0.000 description 1
- 101100506416 Drosophila melanogaster HDAC1 gene Proteins 0.000 description 1
- 101100439546 Drosophila melanogaster Mi-2 gene Proteins 0.000 description 1
- 101000970235 Drosophila melanogaster Nucleosome-remodeling factor subunit NURF301 Proteins 0.000 description 1
- 101100126252 Drosophila melanogaster Nurf-38 gene Proteins 0.000 description 1
- 108010024212 E-Selectin Proteins 0.000 description 1
- 102100023471 E-selectin Human genes 0.000 description 1
- 101710140859 E3 ubiquitin ligase TRAF3IP2 Proteins 0.000 description 1
- 102100026620 E3 ubiquitin ligase TRAF3IP2 Human genes 0.000 description 1
- 102000001301 EGF receptor Human genes 0.000 description 1
- 108060006698 EGF receptor Proteins 0.000 description 1
- 201000011001 Ebola Hemorrhagic Fever Diseases 0.000 description 1
- 241001466953 Echovirus Species 0.000 description 1
- 102100031780 Endonuclease Human genes 0.000 description 1
- 241000224431 Entamoeba Species 0.000 description 1
- 241000709661 Enterovirus Species 0.000 description 1
- 241000991587 Enterovirus C Species 0.000 description 1
- 101800001467 Envelope glycoprotein E2 Proteins 0.000 description 1
- 101710091045 Envelope protein Proteins 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 102100031690 Erythroid transcription factor Human genes 0.000 description 1
- 101710100588 Erythroid transcription factor Proteins 0.000 description 1
- 241000672609 Escherichia coli BL21 Species 0.000 description 1
- 101001091269 Escherichia coli Hygromycin-B 4-O-kinase Proteins 0.000 description 1
- 108010022894 Euchromatin Proteins 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108091029865 Exogenous DNA Proteins 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- 108091008794 FGF receptors Proteins 0.000 description 1
- XZWYTXMRWQJBGX-VXBMVYAYSA-N FLAG peptide Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 XZWYTXMRWQJBGX-VXBMVYAYSA-N 0.000 description 1
- 101150118938 FLK gene Proteins 0.000 description 1
- 108010039471 Fas Ligand Protein Proteins 0.000 description 1
- 102000015303 Fatty Acid Synthases Human genes 0.000 description 1
- 102100034543 Fatty acid desaturase 3 Human genes 0.000 description 1
- 108010087894 Fatty acid desaturases Proteins 0.000 description 1
- 241000282324 Felis Species 0.000 description 1
- 102000044168 Fibroblast Growth Factor Receptor Human genes 0.000 description 1
- 241000724791 Filamentous phage Species 0.000 description 1
- 241000710831 Flavivirus Species 0.000 description 1
- 108091006027 G proteins Proteins 0.000 description 1
- 102000003688 G-Protein-Coupled Receptors Human genes 0.000 description 1
- 108090000045 G-Protein-Coupled Receptors Proteins 0.000 description 1
- 102000005915 GABA Receptors Human genes 0.000 description 1
- 108010005551 GABA Receptors Proteins 0.000 description 1
- 102000030782 GTP binding Human genes 0.000 description 1
- 108091000058 GTP-Binding Proteins 0.000 description 1
- 241001123946 Gaga Species 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- 241000224466 Giardia Species 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 108700023224 Glucose-1-phosphate adenylyltransferases Proteins 0.000 description 1
- 108010060309 Glucuronidase Proteins 0.000 description 1
- 102000053187 Glucuronidase Human genes 0.000 description 1
- 102000018899 Glutamate Receptors Human genes 0.000 description 1
- 108010027915 Glutamate Receptors Proteins 0.000 description 1
- JZNWSCPGTDBMEW-UHFFFAOYSA-N Glycerophosphorylethanolamin Natural products NCCOP(O)(=O)OCC(O)CO JZNWSCPGTDBMEW-UHFFFAOYSA-N 0.000 description 1
- 101100446349 Glycine max FAD2-1 gene Proteins 0.000 description 1
- 108010017080 Granulocyte Colony-Stimulating Factor Proteins 0.000 description 1
- 102000004269 Granulocyte Colony-Stimulating Factor Human genes 0.000 description 1
- 108010078321 Guanylate Cyclase Proteins 0.000 description 1
- 102000014469 Guanylate cyclase Human genes 0.000 description 1
- 208000031886 HIV Infections Diseases 0.000 description 1
- 208000037357 HIV infectious disease Diseases 0.000 description 1
- 101150004167 HMG gene Proteins 0.000 description 1
- 101150031823 HSP70 gene Proteins 0.000 description 1
- 206010061192 Haemorrhagic fever Diseases 0.000 description 1
- 101100028493 Haloferax volcanii (strain ATCC 29605 / DSM 3757 / JCM 8879 / NBRC 14742 / NCIMB 2012 / VKM B-1768 / DS2) pan2 gene Proteins 0.000 description 1
- 101710154606 Hemagglutinin Proteins 0.000 description 1
- 102100031573 Hematopoietic progenitor cell antigen CD34 Human genes 0.000 description 1
- 108700024845 Hepatitis B virus P Proteins 0.000 description 1
- 208000005176 Hepatitis C Diseases 0.000 description 1
- 108010049606 Hepatocyte Nuclear Factors Proteins 0.000 description 1
- 102000008088 Hepatocyte Nuclear Factors Human genes 0.000 description 1
- 241000175212 Herpesvirales Species 0.000 description 1
- 229920000209 Hexadimethrine bromide Polymers 0.000 description 1
- 102100037907 High mobility group protein B1 Human genes 0.000 description 1
- 101710168537 High mobility group protein B1 Proteins 0.000 description 1
- 102100039869 Histone H2B type F-S Human genes 0.000 description 1
- 102100021467 Histone acetyltransferase type B catalytic subunit Human genes 0.000 description 1
- 102100038719 Histone deacetylase 7 Human genes 0.000 description 1
- 102100038715 Histone deacetylase 8 Human genes 0.000 description 1
- 101710159917 Histone deacetylase HDA1 Proteins 0.000 description 1
- 101710082969 Histone deacetylase RPD3 Proteins 0.000 description 1
- 101000971171 Homo sapiens Apoptosis regulator Bcl-2 Proteins 0.000 description 1
- 101000984149 Homo sapiens Calpain-10 Proteins 0.000 description 1
- 101000938351 Homo sapiens Ephrin type-A receptor 3 Proteins 0.000 description 1
- 101000851181 Homo sapiens Epidermal growth factor receptor Proteins 0.000 description 1
- 101000777663 Homo sapiens Hematopoietic progenitor cell antigen CD34 Proteins 0.000 description 1
- 101001035372 Homo sapiens Histone H2B type F-S Proteins 0.000 description 1
- 101001047006 Homo sapiens Histone acetyltransferase KAT2B Proteins 0.000 description 1
- 101000898976 Homo sapiens Histone acetyltransferase type B catalytic subunit Proteins 0.000 description 1
- 101001032113 Homo sapiens Histone deacetylase 7 Proteins 0.000 description 1
- 101001091610 Homo sapiens Krev interaction trapped protein 1 Proteins 0.000 description 1
- 101100183128 Homo sapiens MBD1 gene Proteins 0.000 description 1
- 101001028019 Homo sapiens Metastasis-associated protein MTA2 Proteins 0.000 description 1
- 101000687346 Homo sapiens PR domain zinc finger protein 2 Proteins 0.000 description 1
- 101001109800 Homo sapiens Pro-neuregulin-1, membrane-bound isoform Proteins 0.000 description 1
- 101001012157 Homo sapiens Receptor tyrosine-protein kinase erbB-2 Proteins 0.000 description 1
- 101000738771 Homo sapiens Receptor-type tyrosine-protein phosphatase C Proteins 0.000 description 1
- 101000891649 Homo sapiens Transcription elongation factor A protein-like 1 Proteins 0.000 description 1
- 101000802101 Homo sapiens mRNA decay activator protein ZFP36L2 Proteins 0.000 description 1
- 241000598436 Human T-cell lymphotropic virus Species 0.000 description 1
- 241000700588 Human alphaherpesvirus 1 Species 0.000 description 1
- 241000701085 Human alphaherpesvirus 3 Species 0.000 description 1
- 241000701027 Human herpesvirus 6 Species 0.000 description 1
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 1
- 206010021143 Hypoxia Diseases 0.000 description 1
- 206010061598 Immunodeficiency Diseases 0.000 description 1
- 208000029462 Immunodeficiency disease Diseases 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 102100023915 Insulin Human genes 0.000 description 1
- 108090001061 Insulin Proteins 0.000 description 1
- 102100034349 Integrase Human genes 0.000 description 1
- 108010064593 Intercellular Adhesion Molecule-1 Proteins 0.000 description 1
- 102100037877 Intercellular adhesion molecule 1 Human genes 0.000 description 1
- 102100026720 Interferon beta Human genes 0.000 description 1
- 108010047761 Interferon-alpha Proteins 0.000 description 1
- 102000006992 Interferon-alpha Human genes 0.000 description 1
- 108090000467 Interferon-beta Proteins 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108050006617 Interleukin-1 receptor Proteins 0.000 description 1
- 102000019223 Interleukin-1 receptor Human genes 0.000 description 1
- 102000013462 Interleukin-12 Human genes 0.000 description 1
- 108010065805 Interleukin-12 Proteins 0.000 description 1
- 102000003812 Interleukin-15 Human genes 0.000 description 1
- 108090000172 Interleukin-15 Proteins 0.000 description 1
- 108010002350 Interleukin-2 Proteins 0.000 description 1
- 102000000588 Interleukin-2 Human genes 0.000 description 1
- 108010002386 Interleukin-3 Proteins 0.000 description 1
- 102000000646 Interleukin-3 Human genes 0.000 description 1
- 102000004388 Interleukin-4 Human genes 0.000 description 1
- 108090000978 Interleukin-4 Proteins 0.000 description 1
- 108010002616 Interleukin-5 Proteins 0.000 description 1
- 102000000743 Interleukin-5 Human genes 0.000 description 1
- 102000004889 Interleukin-6 Human genes 0.000 description 1
- 108090001005 Interleukin-6 Proteins 0.000 description 1
- 102000015696 Interleukins Human genes 0.000 description 1
- 108010063738 Interleukins Proteins 0.000 description 1
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 1
- 108090000862 Ion Channels Proteins 0.000 description 1
- 102000004310 Ion Channels Human genes 0.000 description 1
- 241000588748 Klebsiella Species 0.000 description 1
- UKAUYVFTDYCKQA-VKHMYHEASA-N L-homoserine Chemical group OC(=O)[C@@H](N)CCO UKAUYVFTDYCKQA-VKHMYHEASA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- QEFRNWWLZKMPFJ-ZXPFJRLXSA-N L-methionine (R)-S-oxide Chemical group C[S@@](=O)CC[C@H]([NH3+])C([O-])=O QEFRNWWLZKMPFJ-ZXPFJRLXSA-N 0.000 description 1
- QEFRNWWLZKMPFJ-UHFFFAOYSA-N L-methionine sulphoxide Chemical group CS(=O)CCC(N)C(O)=O QEFRNWWLZKMPFJ-UHFFFAOYSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- 241000589248 Legionella Species 0.000 description 1
- 208000007764 Legionnaires' Disease Diseases 0.000 description 1
- 241000222722 Leishmania <genus> Species 0.000 description 1
- 102000016267 Leptin Human genes 0.000 description 1
- 108010092277 Leptin Proteins 0.000 description 1
- 206010024238 Leptospirosis Diseases 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 239000000232 Lipid Bilayer Substances 0.000 description 1
- 239000012097 Lipofectamine 2000 Substances 0.000 description 1
- 208000016604 Lyme disease Diseases 0.000 description 1
- 108010074338 Lymphokines Proteins 0.000 description 1
- 102000008072 Lymphokines Human genes 0.000 description 1
- 102000004083 Lymphotoxin-alpha Human genes 0.000 description 1
- 108090000542 Lymphotoxin-alpha Proteins 0.000 description 1
- 108700043128 MBD2 Proteins 0.000 description 1
- 101150078498 MYB gene Proteins 0.000 description 1
- 101150039798 MYC gene Proteins 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 102100025169 Max-binding protein MNT Human genes 0.000 description 1
- 241000712079 Measles morbillivirus Species 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 206010027476 Metastases Diseases 0.000 description 1
- 102100037511 Metastasis-associated protein MTA2 Human genes 0.000 description 1
- 102000016397 Methyltransferase Human genes 0.000 description 1
- 102100029820 Mitochondrial brown fat uncoupling protein 1 Human genes 0.000 description 1
- 108050002686 Mitochondrial brown fat uncoupling protein 1 Proteins 0.000 description 1
- 108010074633 Mixed Function Oxygenases Proteins 0.000 description 1
- 102000008109 Mixed Function Oxygenases Human genes 0.000 description 1
- 241001225774 Moira Species 0.000 description 1
- 241000713869 Moloney murine leukemia virus Species 0.000 description 1
- 241000711386 Mumps virus Species 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 101000777470 Mus musculus C-C motif chemokine 4 Proteins 0.000 description 1
- 101100372761 Mus musculus Flt1 gene Proteins 0.000 description 1
- 101000596402 Mus musculus Neuronal vesicle trafficking-associated protein 1 Proteins 0.000 description 1
- 101000835859 Mus musculus SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily B member 1 Proteins 0.000 description 1
- 101000800539 Mus musculus Translationally-controlled tumor protein Proteins 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 102000014415 Muscarinic acetylcholine receptor Human genes 0.000 description 1
- 108050003473 Muscarinic acetylcholine receptor Proteins 0.000 description 1
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 1
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 1
- PCNLLVFKBKMRDB-UHFFFAOYSA-N N-ethyl-N-[[2-(1-pentylindol-3-yl)-1,3-thiazol-4-yl]methyl]ethanamine Chemical compound C(C)N(CC=1N=C(SC=1)C1=CN(C2=CC=CC=C12)CCCCC)CC PCNLLVFKBKMRDB-UHFFFAOYSA-N 0.000 description 1
- 108091008604 NGF receptors Proteins 0.000 description 1
- 102000048850 Neoplasm Genes Human genes 0.000 description 1
- 108700019961 Neoplasm Genes Proteins 0.000 description 1
- 206010029113 Neovascularisation Diseases 0.000 description 1
- 108010042309 Netropsin Proteins 0.000 description 1
- 208000009869 Neu-Laxova syndrome Diseases 0.000 description 1
- 108010040722 Neurokinin-2 Receptors Proteins 0.000 description 1
- 102100021877 Neuronal pentraxin receptor Human genes 0.000 description 1
- 102000019315 Nicotinic acetylcholine receptors Human genes 0.000 description 1
- 108050006807 Nicotinic acetylcholine receptors Proteins 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 102100023050 Nuclear factor NF-kappa-B p105 subunit Human genes 0.000 description 1
- 102100022935 Nuclear receptor corepressor 1 Human genes 0.000 description 1
- 101710153661 Nuclear receptor corepressor 1 Proteins 0.000 description 1
- XDMCWZFLLGVIID-SXPRBRBTSA-N O-(3-O-D-galactosyl-N-acetyl-beta-D-galactosaminyl)-L-serine Chemical compound CC(=O)N[C@H]1[C@H](OC[C@H]([NH3+])C([O-])=O)O[C@H](CO)[C@H](O)[C@@H]1OC1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 XDMCWZFLLGVIID-SXPRBRBTSA-N 0.000 description 1
- 101100046877 Oryza sativa subsp. japonica TRAB1 gene Proteins 0.000 description 1
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 1
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 1
- 101710105116 Oxygen-dependent choline dehydrogenase Proteins 0.000 description 1
- 108091008606 PDGF receptors Proteins 0.000 description 1
- 102100024885 PR domain zinc finger protein 2 Human genes 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 102000003728 Peroxisome Proliferator-Activated Receptors Human genes 0.000 description 1
- 108090000029 Peroxisome Proliferator-Activated Receptors Proteins 0.000 description 1
- 201000005702 Pertussis Diseases 0.000 description 1
- 101100440941 Petroselinum crispum CPRF1 gene Proteins 0.000 description 1
- 108010064785 Phospholipases Proteins 0.000 description 1
- 102000015439 Phospholipases Human genes 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 206010035148 Plague Diseases 0.000 description 1
- 102000011653 Platelet-Derived Growth Factor Receptors Human genes 0.000 description 1
- 108700023400 Platelet-activating factor receptors Proteins 0.000 description 1
- 108010061844 Poly(ADP-ribose) Polymerases Proteins 0.000 description 1
- 102000012338 Poly(ADP-ribose) Polymerases Human genes 0.000 description 1
- 239000004952 Polyamide Substances 0.000 description 1
- 229920002873 Polyethylenimine Polymers 0.000 description 1
- 108010059820 Polygalacturonase Proteins 0.000 description 1
- 206010060862 Prostate cancer Diseases 0.000 description 1
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 1
- 101710176177 Protein A56 Proteins 0.000 description 1
- 108010029485 Protein Isoforms Proteins 0.000 description 1
- 102000001708 Protein Isoforms Human genes 0.000 description 1
- 101710188315 Protein X Proteins 0.000 description 1
- 241000588769 Proteus <enterobacteria> Species 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 201000004681 Psoriasis Diseases 0.000 description 1
- 239000013614 RNA sample Substances 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 101150065817 ROM2 gene Proteins 0.000 description 1
- 241000711798 Rabies lyssavirus Species 0.000 description 1
- 101100173553 Rattus norvegicus Fer gene Proteins 0.000 description 1
- 102100030086 Receptor tyrosine-protein kinase erbB-2 Human genes 0.000 description 1
- 102100037422 Receptor-type tyrosine-protein phosphatase C Human genes 0.000 description 1
- 108090000783 Renin Proteins 0.000 description 1
- 102100028255 Renin Human genes 0.000 description 1
- 241000725643 Respiratory syncytial virus Species 0.000 description 1
- 102000007503 Retinoblastoma-Binding Protein 7 Human genes 0.000 description 1
- 108010071000 Retinoblastoma-Binding Protein 7 Proteins 0.000 description 1
- 241000606651 Rickettsiales Species 0.000 description 1
- 241000702670 Rotavirus Species 0.000 description 1
- 241000714474 Rous sarcoma virus Species 0.000 description 1
- 241000710799 Rubella virus Species 0.000 description 1
- 101150100588 SAM gene Proteins 0.000 description 1
- 102000014011 SANT domains Human genes 0.000 description 1
- 108050003888 SANT domains Proteins 0.000 description 1
- 101150010457 SAS5 gene Proteins 0.000 description 1
- 108700039010 SET domains Proteins 0.000 description 1
- 102000051614 SET domains Human genes 0.000 description 1
- 102100022340 SHC-transforming protein 1 Human genes 0.000 description 1
- 101150055709 SNF1 gene Proteins 0.000 description 1
- 102100024790 SWI/SNF complex subunit SMARCC2 Human genes 0.000 description 1
- 101710169052 SWI/SNF complex subunit SMARCC2 Proteins 0.000 description 1
- 101150016929 SWI1 gene Proteins 0.000 description 1
- 101150011461 SWI3 gene Proteins 0.000 description 1
- 101100042631 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) SIN3 gene Proteins 0.000 description 1
- 101100477851 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) SNF12 gene Proteins 0.000 description 1
- 101100533773 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) SNF6 gene Proteins 0.000 description 1
- 101100367258 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) SWP82 gene Proteins 0.000 description 1
- 101000781972 Schizosaccharomyces pombe (strain 972 / ATCC 24843) Protein wos2 Proteins 0.000 description 1
- 101100368917 Schizosaccharomyces pombe (strain 972 / ATCC 24843) taz1 gene Proteins 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 241000607720 Serratia Species 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 241000295644 Staphylococcaceae Species 0.000 description 1
- 108010039811 Starch synthase Proteins 0.000 description 1
- 101001091268 Streptomyces hygroscopicus Hygromycin-B 7''-O-kinase Proteins 0.000 description 1
- 208000006011 Stroke Diseases 0.000 description 1
- 102100037342 Substance-K receptor Human genes 0.000 description 1
- 108010043934 Sucrose synthase Proteins 0.000 description 1
- 101800001271 Surface protein Proteins 0.000 description 1
- 108700005078 Synthetic Genes Proteins 0.000 description 1
- 101710137500 T7 RNA polymerase Proteins 0.000 description 1
- 239000008049 TAE buffer Substances 0.000 description 1
- 102100040296 TATA-box-binding protein Human genes 0.000 description 1
- 102000004399 TNF receptor-associated factor 3 Human genes 0.000 description 1
- 108090000922 TNF receptor-associated factor 3 Proteins 0.000 description 1
- 101710192266 Tegument protein VP22 Proteins 0.000 description 1
- 108010017842 Telomerase Proteins 0.000 description 1
- 206010043376 Tetanus Diseases 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-N Thiophosphoric acid Chemical class OP(O)(S)=O RYYWUUFWQRZTIU-UHFFFAOYSA-N 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- AUYYCJSJGJYCDS-LBPRGKRZSA-N Thyrolar Chemical class IC1=CC(C[C@H](N)C(O)=O)=CC(I)=C1OC1=CC=C(O)C(I)=C1 AUYYCJSJGJYCDS-LBPRGKRZSA-N 0.000 description 1
- 101150115343 Tnfsf15 gene Proteins 0.000 description 1
- 101001009610 Toxoplasma gondii Dense granule protein 5 Proteins 0.000 description 1
- 108010083268 Transcription Factor TFIID Proteins 0.000 description 1
- 102100031027 Transcription activator BRG1 Human genes 0.000 description 1
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- 102000004887 Transforming Growth Factor beta Human genes 0.000 description 1
- 108090001012 Transforming Growth Factor beta Proteins 0.000 description 1
- 108010009583 Transforming Growth Factors Proteins 0.000 description 1
- 102000009618 Transforming Growth Factors Human genes 0.000 description 1
- 102400001320 Transforming growth factor alpha Human genes 0.000 description 1
- 101800004564 Transforming growth factor alpha Proteins 0.000 description 1
- 241000224526 Trichomonas Species 0.000 description 1
- 241000223104 Trypanosoma Species 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 102000000160 Tumor Necrosis Factor Receptor-Associated Peptides and Proteins Human genes 0.000 description 1
- 108010080432 Tumor Necrosis Factor Receptor-Associated Peptides and Proteins Proteins 0.000 description 1
- 102000000852 Tumor Necrosis Factor-alpha Human genes 0.000 description 1
- 108700025716 Tumor Suppressor Genes Proteins 0.000 description 1
- 102000044209 Tumor Suppressor Genes Human genes 0.000 description 1
- 108050002568 Tumor necrosis factor ligand superfamily member 6 Proteins 0.000 description 1
- 102100033725 Tumor necrosis factor receptor superfamily member 16 Human genes 0.000 description 1
- 102000014384 Type C Phospholipases Human genes 0.000 description 1
- 108010079194 Type C Phospholipases Proteins 0.000 description 1
- 102000007537 Type II DNA Topoisomerases Human genes 0.000 description 1
- 108010046308 Type II DNA Topoisomerases Proteins 0.000 description 1
- 102000003425 Tyrosinase Human genes 0.000 description 1
- 108060008724 Tyrosinase Proteins 0.000 description 1
- 108091000117 Tyrosine 3-Monooxygenase Proteins 0.000 description 1
- 102000048218 Tyrosine 3-monooxygenases Human genes 0.000 description 1
- 102000006275 Ubiquitin-Protein Ligases Human genes 0.000 description 1
- 108010083111 Ubiquitin-Protein Ligases Proteins 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- 108010000134 Vascular Cell Adhesion Molecule-1 Proteins 0.000 description 1
- 102100023543 Vascular cell adhesion protein 1 Human genes 0.000 description 1
- 241000607598 Vibrio Species 0.000 description 1
- 108020005202 Viral DNA Proteins 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 108070000030 Viral receptors Proteins 0.000 description 1
- 101100459258 Xenopus laevis myc-a gene Proteins 0.000 description 1
- 241000607734 Yersinia <bacteria> Species 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- PTFCDOFLOPIGGS-UHFFFAOYSA-N Zinc dication Chemical compound [Zn+2] PTFCDOFLOPIGGS-UHFFFAOYSA-N 0.000 description 1
- ZKHQWZAMYRWXGA-KNYAHOBESA-N [[(2r,3s,4r,5r)-5-(6-aminopurin-9-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl] dihydroxyphosphoryl hydrogen phosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)O[32P](O)(O)=O)[C@@H](O)[C@H]1O ZKHQWZAMYRWXGA-KNYAHOBESA-N 0.000 description 1
- 239000000370 acceptor Substances 0.000 description 1
- HGEVZDLYZYVYHD-UHFFFAOYSA-N acetic acid;2-amino-2-(hydroxymethyl)propane-1,3-diol;2-[2-[bis(carboxymethyl)amino]ethyl-(carboxymethyl)amino]acetic acid Chemical compound CC(O)=O.OCC(N)(CO)CO.OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O HGEVZDLYZYVYHD-UHFFFAOYSA-N 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 229930183665 actinomycin Natural products 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 108700021044 acyl-ACP thioesterase Proteins 0.000 description 1
- 230000002730 additional effect Effects 0.000 description 1
- 229940009456 adriamycin Drugs 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- 102000004305 alpha Adrenergic Receptors Human genes 0.000 description 1
- 108090000861 alpha Adrenergic Receptors Proteins 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 230000000259 anti-tumor effect Effects 0.000 description 1
- 210000000612 antigen-presenting cell Anatomy 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 230000001640 apoptogenic effect Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- FZCSTZYAHCUGEM-UHFFFAOYSA-N aspergillomarasmine B Natural products OC(=O)CNC(C(O)=O)CNC(C(O)=O)CC(O)=O FZCSTZYAHCUGEM-UHFFFAOYSA-N 0.000 description 1
- 238000000211 autoradiogram Methods 0.000 description 1
- 229940065181 bacillus anthracis Drugs 0.000 description 1
- 244000052616 bacterial pathogen Species 0.000 description 1
- 239000000688 bacterial toxin Substances 0.000 description 1
- 102000012740 beta Adrenergic Receptors Human genes 0.000 description 1
- 108010079452 beta Adrenergic Receptors Proteins 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 125000003164 beta-aspartyl group Chemical group 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 229960001561 bleomycin Drugs 0.000 description 1
- OYVAGSVQBOHSSS-UAPAGMARSA-O bleomycin A2 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC=C(N=1)C=1SC=C(N=1)C(=O)NCCC[S+](C)C)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C OYVAGSVQBOHSSS-UAPAGMARSA-O 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 125000001246 bromo group Chemical group Br* 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000036952 cancer formation Effects 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 231100000504 carcinogenesis Toxicity 0.000 description 1
- NSQLIUXCMFBZME-MPVJKSABSA-N carperitide Chemical compound C([C@H]1C(=O)NCC(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CSSC[C@@H](C(=O)N1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)=O)[C@@H](C)CC)C1=CC=CC=C1 NSQLIUXCMFBZME-MPVJKSABSA-N 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000012820 cell cycle checkpoint Effects 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 230000007910 cell fusion Effects 0.000 description 1
- 230000030570 cellular localization Effects 0.000 description 1
- 210000003850 cellular structure Anatomy 0.000 description 1
- 230000004700 cellular uptake Effects 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 108010040093 cellulose synthase Proteins 0.000 description 1
- 101150113535 chek1 gene Proteins 0.000 description 1
- 239000002738 chelating agent Substances 0.000 description 1
- 239000005482 chemotactic factor Substances 0.000 description 1
- 238000000749 co-immunoprecipitation Methods 0.000 description 1
- 238000000975 co-precipitation Methods 0.000 description 1
- 238000012761 co-transfection Methods 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 108010011713 delta-15 desaturase Proteins 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 230000001687 destabilization Effects 0.000 description 1
- 230000000368 destabilizing effect Effects 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 239000000032 diagnostic agent Substances 0.000 description 1
- 229940039227 diagnostic agent Drugs 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- PXBRQCKWGAHEHS-UHFFFAOYSA-N dichlorodifluoromethane Chemical compound FC(F)(Cl)Cl PXBRQCKWGAHEHS-UHFFFAOYSA-N 0.000 description 1
- 235000019404 dichlorodifluoromethane Nutrition 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 210000002249 digestive system Anatomy 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- 206010013023 diphtheria Diseases 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000012377 drug delivery Methods 0.000 description 1
- 241001493065 dsRNA viruses Species 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 230000009881 electrostatic interaction Effects 0.000 description 1
- 230000013020 embryo development Effects 0.000 description 1
- 238000005538 encapsulation Methods 0.000 description 1
- 210000001163 endosome Anatomy 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- 210000000632 euchromatin Anatomy 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 238000001125 extrusion Methods 0.000 description 1
- 210000003754 fetus Anatomy 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 230000000799 fusogenic effect Effects 0.000 description 1
- UHBYWPGGCSDKFX-VKHMYHEASA-N gamma-carboxy-L-glutamic acid Chemical compound OC(=O)[C@@H](N)CC(C(O)=O)C(O)=O UHBYWPGGCSDKFX-VKHMYHEASA-N 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000001476 gene delivery Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 230000004034 genetic regulation Effects 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 239000008187 granular material Substances 0.000 description 1
- 210000003714 granulocyte Anatomy 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- 230000003394 haemopoietic effect Effects 0.000 description 1
- 210000002216 heart Anatomy 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 229910001385 heavy metal Inorganic materials 0.000 description 1
- 239000000185 hemagglutinin Substances 0.000 description 1
- 208000002672 hepatitis B Diseases 0.000 description 1
- 108700025184 hepatitis B virus X Proteins 0.000 description 1
- 238000013537 high throughput screening Methods 0.000 description 1
- 230000006197 histone deacetylation Effects 0.000 description 1
- 238000000265 homogenisation Methods 0.000 description 1
- 102000057382 human EPHA3 Human genes 0.000 description 1
- 102000044890 human EPO Human genes 0.000 description 1
- 102000055650 human NRG1 Human genes 0.000 description 1
- 208000033519 human immunodeficiency virus infectious disease Diseases 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 1
- 108010064894 hydroperoxide lyase Proteins 0.000 description 1
- 229960002591 hydroxyproline Drugs 0.000 description 1
- 230000007954 hypoxia Effects 0.000 description 1
- 210000002865 immune cell Anatomy 0.000 description 1
- 230000003053 immunization Effects 0.000 description 1
- 238000002649 immunization Methods 0.000 description 1
- 238000003018 immunoassay Methods 0.000 description 1
- 230000007813 immunodeficiency Effects 0.000 description 1
- 229940072221 immunoglobulins Drugs 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 239000012212 insulator Substances 0.000 description 1
- 229940125396 insulin Drugs 0.000 description 1
- 229940047124 interferons Drugs 0.000 description 1
- 102000002467 interleukin receptors Human genes 0.000 description 1
- 108010093036 interleukin receptors Proteins 0.000 description 1
- 229940047122 interleukins Drugs 0.000 description 1
- 238000007917 intracranial administration Methods 0.000 description 1
- 238000010255 intramuscular injection Methods 0.000 description 1
- 239000007927 intramuscular injection Substances 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- 208000028867 ischemia Diseases 0.000 description 1
- 210000003292 kidney cell Anatomy 0.000 description 1
- 229940039781 leptin Drugs 0.000 description 1
- NRYBAZVQPHGZNS-ZSOCWYAHSA-N leptin Chemical compound O=C([C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)CCSC)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CS)C(O)=O NRYBAZVQPHGZNS-ZSOCWYAHSA-N 0.000 description 1
- 108010019813 leptin receptors Proteins 0.000 description 1
- 102000005861 leptin receptors Human genes 0.000 description 1
- 238000000670 ligand binding assay Methods 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 239000012160 loading buffer Substances 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 210000004698 lymphocyte Anatomy 0.000 description 1
- 102100034703 mRNA decay activator protein ZFP36L2 Human genes 0.000 description 1
- 208000002780 macular degeneration Diseases 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000009401 metastasis Effects 0.000 description 1
- WSFSSNUMVMOOMR-NJFSPNSNSA-N methanone Chemical compound O=[14CH2] WSFSSNUMVMOOMR-NJFSPNSNSA-N 0.000 description 1
- LSDPWZHWYPCBBB-UHFFFAOYSA-O methylsulfide anion Chemical group [SH2+]C LSDPWZHWYPCBBB-UHFFFAOYSA-O 0.000 description 1
- 238000012737 microarray-based gene expression Methods 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- 239000003226 mitogen Substances 0.000 description 1
- 230000034839 mitotic sister chromatid segregation Effects 0.000 description 1
- 108091005601 modified peptides Proteins 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000000329 molecular dynamics simulation Methods 0.000 description 1
- 238000012243 multiplex automated genomic engineering Methods 0.000 description 1
- 210000000663 muscle cell Anatomy 0.000 description 1
- 201000006938 muscular dystrophy Diseases 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- UPBAOYRENQEPJO-UHFFFAOYSA-N n-[5-[[5-[(3-amino-3-iminopropyl)carbamoyl]-1-methylpyrrol-3-yl]carbamoyl]-1-methylpyrrol-3-yl]-4-formamido-1-methylpyrrole-2-carboxamide Chemical compound CN1C=C(NC=O)C=C1C(=O)NC1=CN(C)C(C(=O)NC2=CN(C)C(C(=O)NCCC(N)=N)=C2)=C1 UPBAOYRENQEPJO-UHFFFAOYSA-N 0.000 description 1
- OHDXDNUPVVYWOV-UHFFFAOYSA-N n-methyl-1-(2-naphthalen-1-ylsulfanylphenyl)methanamine Chemical compound CNCC1=CC=CC=C1SC1=CC=CC2=CC=CC=C12 OHDXDNUPVVYWOV-UHFFFAOYSA-N 0.000 description 1
- 230000017239 negative regulation of gene expression Effects 0.000 description 1
- 210000000653 nervous system Anatomy 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 230000004770 neurodegeneration Effects 0.000 description 1
- 208000015122 neurodegenerative disease Diseases 0.000 description 1
- 108010001839 neuronal pentraxin receptor Proteins 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 238000007500 overflow downdraw method Methods 0.000 description 1
- 102000002574 p38 Mitogen-Activated Protein Kinases Human genes 0.000 description 1
- 108010068338 p38 Mitogen-Activated Protein Kinases Proteins 0.000 description 1
- 108700025694 p53 Genes Proteins 0.000 description 1
- 101150081585 panB gene Proteins 0.000 description 1
- 238000004091 panning Methods 0.000 description 1
- 238000007911 parenteral administration Methods 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 239000013610 patient sample Substances 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- 230000035699 permeability Effects 0.000 description 1
- 238000002823 phage display Methods 0.000 description 1
- 210000000680 phagosome Anatomy 0.000 description 1
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 1
- 150000008104 phosphatidylethanolamines Chemical class 0.000 description 1
- 150000008298 phosphoramidates Chemical class 0.000 description 1
- 238000003322 phosphorimaging Methods 0.000 description 1
- BZQFBWGGLXLEPQ-REOHCLBHSA-N phosphoserine Chemical compound OC(=O)[C@@H](N)COP(O)(O)=O BZQFBWGGLXLEPQ-REOHCLBHSA-N 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 102000030769 platelet activating factor receptor Human genes 0.000 description 1
- 229920002647 polyamide Polymers 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 230000019532 positive regulation of gene expression Effects 0.000 description 1
- 229910000160 potassium phosphate Inorganic materials 0.000 description 1
- 235000011009 potassium phosphates Nutrition 0.000 description 1
- 238000004321 preservation Methods 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 230000037452 priming Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 239000001294 propane Substances 0.000 description 1
- 239000003380 propellant Substances 0.000 description 1
- 230000000069 prophylactic effect Effects 0.000 description 1
- 235000019833 protease Nutrition 0.000 description 1
- 230000004952 protein activity Effects 0.000 description 1
- 238000000159 protein binding assay Methods 0.000 description 1
- 102000021127 protein binding proteins Human genes 0.000 description 1
- 108091011138 protein binding proteins Proteins 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- 230000004853 protein function Effects 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 230000012743 protein tagging Effects 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- UOWVMDUEMSNCAV-WYENRQIDSA-N rachelmycin Chemical compound C1([C@]23C[C@@H]2CN1C(=O)C=1NC=2C(OC)=C(O)C4=C(C=2C=1)CCN4C(=O)C1=CC=2C=4CCN(C=4C(O)=C(C=2N1)OC)C(N)=O)=CC(=O)C1=C3C(C)=CN1 UOWVMDUEMSNCAV-WYENRQIDSA-N 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 230000000754 repressing effect Effects 0.000 description 1
- 210000004994 reproductive system Anatomy 0.000 description 1
- 210000002345 respiratory system Anatomy 0.000 description 1
- 102000027483 retinoid hormone receptors Human genes 0.000 description 1
- 108091008679 retinoid hormone receptors Proteins 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 206010039073 rheumatoid arthritis Diseases 0.000 description 1
- 239000003161 ribonuclease inhibitor Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 229940016590 sarkosyl Drugs 0.000 description 1
- 108700004121 sarkosyl Proteins 0.000 description 1
- 238000003345 scintillation counting Methods 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 230000009758 senescence Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 210000002265 sensory receptor cell Anatomy 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 229940076279 serotonin Drugs 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 239000013605 shuttle vector Substances 0.000 description 1
- 208000007056 sickle cell anemia Diseases 0.000 description 1
- 230000018381 sister chromatid cohesion Effects 0.000 description 1
- 101150117195 snf gene Proteins 0.000 description 1
- KSAVQLQVUXSOCR-UHFFFAOYSA-M sodium lauroyl sarcosinate Chemical compound [Na+].CCCCCCCCCCCC(=O)N(C)CC([O-])=O KSAVQLQVUXSOCR-UHFFFAOYSA-M 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 108010042747 stallimycin Proteins 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 150000003431 steroids Chemical class 0.000 description 1
- 239000011550 stock solution Substances 0.000 description 1
- 208000003265 stomatitis Diseases 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 230000004960 subcellular localization Effects 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 239000000375 suspending agent Substances 0.000 description 1
- 238000007910 systemic administration Methods 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
- 239000003826 tablet Substances 0.000 description 1
- ZRKFYGHZFMAOKI-QMGMOQQFSA-N tgfbeta Chemical compound C([C@H](NC(=O)[C@H](C(C)C)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)C(C)C)[C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O)C1=CC=C(O)C=C1 ZRKFYGHZFMAOKI-QMGMOQQFSA-N 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 239000002562 thickening agent Substances 0.000 description 1
- 239000005495 thyroid hormone Substances 0.000 description 1
- 229940036555 thyroid hormone Drugs 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- FGMPLJWBKKVCDB-UHFFFAOYSA-N trans-L-hydroxy-proline Natural products ON1CCCC1C(O)=O FGMPLJWBKKVCDB-UHFFFAOYSA-N 0.000 description 1
- 108010014677 transcription factor TFIIE Proteins 0.000 description 1
- 230000002463 transducing effect Effects 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- 238000003146 transient transfection Methods 0.000 description 1
- 230000010415 tropism Effects 0.000 description 1
- 102000003390 tumor necrosis factor Human genes 0.000 description 1
- 238000003160 two-hybrid assay Methods 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241000712461 unidentified influenza virus Species 0.000 description 1
- VBEQCZHXXJYVRD-GACYYNSASA-N uroanthelone Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C(C)C)[C@@H](C)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)CNC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CS)NC(=O)CNC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O)C(C)C)[C@@H](C)CC)C1=CC=C(O)C=C1 VBEQCZHXXJYVRD-GACYYNSASA-N 0.000 description 1
- 210000002229 urogenital system Anatomy 0.000 description 1
- 208000019553 vascular disease Diseases 0.000 description 1
- 230000002227 vasoactive effect Effects 0.000 description 1
- 208000005925 vesicular stomatitis Diseases 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
- 239000000277 virosome Substances 0.000 description 1
- 239000012130 whole-cell lysate Substances 0.000 description 1
- 235000005074 zinc chloride Nutrition 0.000 description 1
- 230000004572 zinc-binding Effects 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
- C07K14/4701—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
- C07K14/4702—Regulators; Modulating activity
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/102—Mutagenizing nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1058—Directional evolution of libraries, e.g. evolution of libraries is achieved by mutagenesis and screening or selection of mixed population of organisms
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/66—General methods for inserting a gene into a vector to form a recombinant vector using cleavage and ligation; Use of non-functional linkers or adaptors, e.g. linkers containing the sequence for a restriction endonuclease
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/67—General methods for enhancing the expression
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6811—Selection methods for production or design of target specific oligonucleotides or binding molecules
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
- G16B30/10—Sequence alignment; Homology search
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
- C07K2319/02—Fusion polypeptide containing a localisation/targetting motif containing a signal sequence
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/01—Fusion polypeptide containing a localisation/targetting motif
- C07K2319/09—Fusion polypeptide containing a localisation/targetting motif containing a nuclear localisation signal
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/20—Fusion polypeptide containing a tag with affinity for a non-protein ligand
- C07K2319/24—Fusion polypeptide containing a tag with affinity for a non-protein ligand containing a MBP (maltose binding protein)-tag
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/40—Fusion polypeptide containing a tag for immunodetection, or an epitope for immunisation
- C07K2319/43—Fusion polypeptide containing a tag for immunodetection, or an epitope for immunisation containing a FLAG-tag
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/70—Fusion polypeptide containing domain for protein-protein interaction
- C07K2319/71—Fusion polypeptide containing domain for protein-protein interaction containing domain for transcriptional activaation, e.g. VP16
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/70—Fusion polypeptide containing domain for protein-protein interaction
- C07K2319/74—Fusion polypeptide containing domain for protein-protein interaction containing a fusion for binding to a cell surface receptor
- C07K2319/75—Fusion polypeptide containing domain for protein-protein interaction containing a fusion for binding to a cell surface receptor containing a fusion for activation of a cell surface receptor, e.g. thrombopoeitin, NPY and other peptide hormones
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/80—Fusion polypeptide containing a DNA binding domain, e.g. Lacl or Tet-repressor
- C07K2319/81—Fusion polypeptide containing a DNA binding domain, e.g. Lacl or Tet-repressor containing a Zn-finger domain for DNA binding
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
Definitions
- the present disclosure is in the fields of chromatin structure and genetic regulation, in particular, the modification of chromatin structure to facilitate interaction of molecules with a region of interest in cellular chromatin.
- Regulation of gene expression in a cell is generally mediated by sequence-specific binding of gene regulatory molecules, often proteins, to chromosomal DNA. Regulatory proteins can effect either positive or negative regulation of gene expression. Generally, a regulatory protein will exhibit preference for binding to a particular binding sequence, or target site. Target sites for many regulatory proteins (and other molecules) are known or can be determined by one of skill in the art.
- chromosomal DNA is packaged into nucleosomes.
- a nucleosome comprises a core and a linker.
- the nucleosome core comprises an octamer of core histones (two each of H2A, H2B, H3 and H4) around which is wrapped approximately 150 base pairs of chromosomal DNA.
- linker DNA segment of approximately 50 base pairs is associated with linker histone Hi (or a related linker histone in certain specialized cells).
- Nucleosomes are organized into a higher-order chromatin fiber (sometimes denoted a “solenoid” or a 30 nm fiber) and chromatin fibers are organized into chromosomes. See, for example, Wolffe “Chromatin: Structure and Function” 3 rd Ed., Academic Press, San Diego, 1998 and Kornberg et al. (1999) Cell 98:285-294.
- Chromatin structure is not static, but is subject to modification by processes collectively known as chromatin remodeling.
- Chromatin remodeling can serve, for example, to remove nucleosomes from a region of DNA, move nucleosomes from one region of DNA to another, change the spacing between nucleosomes or add nucleosomes to a region of DNA in the chromosome.
- Chromatin remodeling can also result in changes in higher order structure, thereby influencing the balance between transcriptionally active chromatin (open chromatin or euchromatin) and transcriptionally inactive chromatin (closed chromatin or heterochromatin).
- Chromosomal proteins are subject to numerous types of chemical modification, some or all of which influence chromatin structure.
- histones are subject to acetylation by histone acetyltransferases, deacetylation by histone deacetylases, methylation by histone methyltransferases (and therefore presumably to demethylation by histone demethylases), ubiquitination by ubiquitin ligases, de-ubiquitination by ubiquitin hydrolases, phosphorylation by histone kinases, dephosphorylation by histone phosphatases, and reversible ADP-ribosylation by poly-ADP ribose polymerase (PARP, also known as TFIIC).
- PARP poly-ADP ribose polymerase
- chromatin-resident transcriptional regulators such as, for example, TFIIE (Imhof et al. (1997) Curr. Biol. 7:689-692), p53 (Gu et al. (1997) Cell 90:595-606) and GATA-1 (Boyes et al. (1998) Nature 396:594-598).
- TFIIE Imhof et al. (1997) Curr. Biol. 7:689-692
- p53 Gu et al. (1997) Cell 90:595-606
- GATA-1 Boyes et al. (1998) Nature 396:594-598.
- Chemical modification of histone and/or non-histone proteins is often a step in the chromatin remodeling process, and can have either positive or negative effects on gene expression.
- histone acetylation is correlated with gene activation; while deacetylation of histones is correlated with gene repression.
- histone acetyl transferases include Gcn5p, p300/CBP-associated factor (P/CAF), p300, CREB-binding protein (CBP), HAT1, TFIID-associated factor 250 (TAF II 250), and steroid receptor coactivator-1 (SRC-1).
- P/CAF p300/CBP-associated factor
- CBP CREB-binding protein
- SRC-1 steroid receptor coactivator-1
- the HDAC family of proteins have been identified as histone deacetylases and include homologues to the budding yeast histone deacetylase RPD3 (e.g., HDAC1, HDAC2, HDAC3 and HDAC8) and homologues to the budding yeast histone deacetylase HDA1 (e.g., HDAC4, HDAC5, HDAC6 and HDAC7). Ng et al. (2000) Trends Biochem. Sci. 25:121-126.
- the Rsk-2 (RKS90) kinase has been identified as a histone kinase. Sassone-Corsi et al. (1999) Science 285:886-891.
- a histone methyltransferase (CARM-1) has also been identified. Chen et al. (1999) Science 284:2174-2177.
- chromatin structure Because of the dynamic structure of cellular chromatin, the ability of a regulatory molecule to bind its target site in a chromosome may be limited, in certain circumstances, by chromatin structure. For example, if a target site is present in “open” chromatin (generally thought of as nucleosome-free or having an altered nucleosomal conformation compared to bulk chromatin) structural barriers to the binding of a regulatory molecule to its target site are unlikely. By contrast, if a target site is present in “closed” chromatin (i.e. having extensive higher-order structure and/or close nucleosome spacing), steric barriers to binding are likely to exist.
- the ability of a regulatory molecule to bind to a target site in cellular chromatin will depend on the structure of the chromatin surrounding that particular target site.
- the chromatin structure of a particular gene can vary depending on, for example, cell type and/or developmental stage. For this reason, the regulation of a given gene in a particular cell can be influenced not only by the presence or absence of gene regulatory factors, but also by the chromatin structure of the gene.
- Remodeling of chromatin can lead to activation of gene expression in vitro.
- the NURF chromatin remodeling complex stimulates the transcriptional activation activity of the GAGA transcription factor.
- Transcriptional activation by a GAL4-VP 16 fusion requires the RSF chromatin remodeling complex.
- LeRoy et al. (1998) Science 282:1900-1904.
- the SWI/SNF chromatin remodeling complex potentiates transcriptional activation by the VP16 activation domain and by ligand-bound glucocorticoid receptor. Neely et al. (1999) Mol. Cell 4:649-655; Wallberg et al. (2000) Mol. Cell. Biol. 20:2004-2013.
- chromatin remodeling complexes for gene activation in vivo.
- the human SWI/SNF chromatin remodeling complex is required for the activity of the glucocorticoid receptor. Fryer et al. (1998) Nature 393:88-91.
- the mammalian SWI/SNF chromatin remodeling complex is required for activation of the hsp70 gene. de La Serna et al. (2000) Mol. Cell. Biol. 20:2839-2851. Mutations in the Drosophila ISWI protein adversely affect expression of the engrailed and Ultrabithorax genes. Deuring et al. (2000) Mol. Cell 5:355-365.
- compositions and methods useful for targeted modification of chromatin are useful for facilitating processes that depend upon access of cellular DNA sequences to DNA-binding molecules, for example, transcription, replication, recombination, repair and integration.
- targeted modification of chromatin facilitates regulation of gene expression by endogenous or exogenous molecules, by providing access to cellular DNA sequences. Modification is any change in chromatin structure, compared to the normal state of the chromatin in the cell in which it resides.
- a method for modifying a region of interest in cellular chromatin comprises contacting cellular chromatin with a fusion molecule that binds to a binding site in the region of interest.
- the fusion molecule comprises a DNA-binding domain and a component of a chromatin remodeling complex or a functional fragment thereof.
- the fusion molecule is a polypeptide.
- Cellular chromatin can be present in any type of cell, including prokaryotic, eucaryotic or archaeal. Eucaryotic cells include microorganisms, fungal cells, plants and animals, including vertebrate, mammalian and human cells.
- the DNA-binding domain of a fusion molecule comprises a triplex-forming nucleic acid, an intercalator, an antibiotic, or a minor groove binder.
- the DNA-binding domain comprises a zinc finger DNA-binding domain.
- a fusion molecule is a fusion polypeptide comprising a zinc finger DNA-binding domain. Other polypeptide DNA-binding domains are also useful.
- Chromatin remodeling complexes generally contain an enzymatic component, which is often an ATPase, a histone acetyl transferase or a histone deacetylase.
- ATPase components include, but are not limited to, the following polypeptides: SWI2/SNF2, Mi-2, ISWI, BRM, BRG/BAF, Chd-1, Chd-2, Chd-3, Chd-4 and Mot-1.
- chromatin remodeling complexes can be used as a portion of a fusion molecule.
- Many components of chromatin remodeling complexes have been identified by sequence homology. Accordingly, additional chromatin remodeling complexes and their components are likely to be discovered and their use is contemplated by the present disclosure.
- chromatin modification facilitates modulation of expression of a gene of interest. Modulation of expression comprises activation or repression of a gene of interest.
- chromatin modification facilitates recombination between an exogenous nucleic acid and cellular chromatin. In this way, targeted integration of transgenes is accomplished more efficiently.
- a fusion molecule can be a polypeptide.
- chromatin modification is accomplished by contacting a cell with a polynucleotide encoding a fusion polypeptide, such that the polynucleotide is introduced into the cell and the fusion polypeptide is expressed in the cell.
- fusion polypeptides comprising a fusion between a DNA-binding domain and a component of a chromatin remodeling complex (or functional fragment thereof), as well as polynucleotides encoding them, are provided.
- cells comprising these fusion polypeptides and cells comprising polynucleotides encoding these fusion polypeptides.
- fusion polypeptides comprising a zinc finger DNA binding domain and polynucleotides encoding them.
- a region of interest in cellular chromatin which is to be modified, comprises a gene.
- genes whose chromatin structure can be modified through the use of the compositions and methods disclosed herein include, but are not limited to, vascular endothelial growth factor (VEGF), erythropoietin (EPO), androgen receptor, PPAR- ⁇ 2, p16, p53, Rb, dystrophin and e-cadherin.
- VEGF vascular endothelial growth factor
- EPO erythropoietin
- PPAR- ⁇ 2 phosphatidherin
- the DNA binding domain of the fusion molecule is selected to bind to a sequence (i.e., a target site) in one of the aforementioned genes.
- modification of chromatin structure is accompanied by an additional step of contacting cellular chromatin with a second molecule.
- the modification of chromatin structure effected by the binding of the fusion molecule facilitates the binding of the second molecule.
- the second molecule can be a transcription regulatory molecule, either an endogenous factor or one that is exogenously supplied to a cell.
- the second molecule is also a fusion molecule, preferably a fusion polypeptide.
- the second molecule comprises a zinc finger DNA-binding domain.
- the second molecule can also comprise, for example, a transcriptional activation domain or a transcriptional repression domain.
- modification of chromatin structure, in a region of interest, by a fusion molecule as disclosed herein provides access for the binding of a second molecule which can regulate the transcription of a gene in or near the region of interest.
- a second molecule is a fusion comprising a DNA binding domain and an enzyme (or functional fragment thereof) that covalently modifies histones, for example, a histone acetyl transferase or a histone deacetylase.
- a first fusion molecule facilitates remodeling of chromatin, making it a substrate for the activity of a second fusion molecule that facilitates covalent modification of the remodeled chromatin.
- a second molecule can comprise a fusion between a DNA binding domain and a component of a chromatin remodeling complex that is different from the one present in the first molecule. In this way, it is possible to recruit multiple chromatin remodeling complexes to a region of interest in cellular chromatin.
- cellular chromatin is contacted with three molecules.
- the first comprises a fusion between a DNA binding domain and a component of a chromatin remodeling complex or a functional fragment thereof.
- the second molecule can comprise, for example, a transcriptional regulatory molecule (endogenous or exogenous), a fusion between a DNA binding domain and a component of a chromatin remodeling complex or a fusion between a DNA binding domain and an enzyme that covalently modifies histones.
- the third molecule can be an endogenous or exogenous transcriptional regulatory molecule, or a fusion molecule.
- a fusion molecule can be a fusion polypeptide and can comprise a DNA binding domain (e.g., a zinc finger DNA binding domain) and a transcriptional regulatory domain, such as, for example, an activation domain or a repression domain.
- a DNA binding domain e.g., a zinc finger DNA binding domain
- a transcriptional regulatory domain such as, for example, an activation domain or a repression domain.
- the first and second molecules can be involved in modifying chromatin structure in a region of interest to allow access to that region by a third molecule which can be, for example, a molecule with transcriptional regulatory function.
- the first molecule can be involved in the modification of chromatin structure to allow access by the second and third molecules (both of which can be, for example, transcriptional regulatory molecules) in a region of interest in cellular chromatin.
- the first molecule can facilitate chromatin remodeling in the region of interest
- the second molecule can be involved in covalent modification of histones in the region of interest
- the third molecule can bind in the region of interest and possess transcriptional regulatory function.
- fourth, fifth, etc. molecules can also be contacted with cellular chromatin to modify its structure in a region of interest and effect regulation of a gene in that region.
- methods for modulating expression of a gene comprise the steps of contacting cellular chromatin with a first fusion molecule that binds to a binding site in cellular chromatin, wherein the binding site is in the gene, and wherein the first fusion molecule comprises a DNA-binding domain and a component of a chromatin remodeling complex or a functional fragment thereof, and further contacting the cellular chromatin with a second molecule that binds to a target site in the gene and modulates expression of the gene.
- the DNA-binding domain of the first fusion molecule is a zinc finger DNA-binding domain.
- the second molecule can be, for example, a small molecule therapeutic, a minor groove binder, a peptide, a polyamide, a DNA molecule, a triplex-forming oligonucleotide, an RNA molecule, or a polypeptide.
- Exemplary polypeptides include, but are not limited to, transcription factors, recombinases, integrases, helicases, and DNA or RNA polymerases. Any of the aforementioned molecules can be either exogenous or endogenous.
- the second molecule can be a second fusion molecule, For example, a fusion polypeptide.
- the second molecule is a fusion polypeptide comprising a zinc finger DNA binding domain.
- the second fusion molecule can also comprise a transcriptional activation domain or a transcriptional repression domain.
- a plurality of first fusion molecules are contacted with cellular chromatin.
- a plurality of second molecules each having a distinct target site in the gene, can be contacted with cellular chromatin in the practice of methods to modulate expression of a gene.
- the disclosed methods for modulating expression of a gene can include the use of a single first fusion molecule and a single second molecule, a single first fusion molecule and a plurality of second molecules, a plurality of first fusion molecules and a single second molecule, and a plurality of first fusion molecules and a plurality of second molecules.
- expression of a plurality of genes is modulated according to the disclosed methods. This can be accomplished in several ways.
- a plurality of first fusion molecules, each binding to a distinct binding site, wherein each distinct binding site is in a distinct gene are contacted with cellular chromatin.
- One or more of the first fusion molecules can be a zinc finger fusion polypeptide comprising a zinc-finger DNA-binding domain.
- a first fusion molecule can bind to a shared binding site in two or more of the plurality of genes.
- a single first fusion molecule binds to a shared binding site in all of the plurality of genes whose expression is modulated.
- Additional methods for modulating the expression of a plurality of genes involve contacting a plurality of second molecules with cellular chromatin, in combination with the contact of one or more first fusion molecules with cellular chromatin.
- Each of the plurality of second molecules can bind to a distinct target site, wherein each distinct target site is in a distinct gene.
- a single second molecule can bind to a shared target site in two or more different genes.
- a single second molecule binds to a shared target site in all of the plurality of genes whose expression is modulated.
- one or more accessible regions within the region of interest are identified and one or more target sites for the DNA-binding portion of the fusion molecule are identified within the accessible region.
- the DNA-binding domain is capable of binding to nucleosomal DNA sequences and identification of an accessible region is not necessary.
- chromatin modification as disclosed herein, often results in the generation of an accessible region in cellular chromatin in the region of interest, which can facilitate the binding of other molecules, either exogenous or endogenous.
- Exogenous molecules whose binding can be facilitated by the generation of an accessible region through chromatin modification include, but are not limited to, minor groove binders, major groove binders, intercalators, small molecule therapeutics, nucleic acids, and polypeptides, including fusion polypeptides, preferably comprising a zinc finger DNA-binding domain.
- Fusion polypeptides comprising a DNA-binding domain and a component of a chromatin remodeling complex, and methods for producing such fusion polypeptides, are also provided.
- such fusion polypeptides are produced by expressing a polynucleotide as described in the preceding paragraph in a suitable host cell.
- FIG. 1 shows the PCR amplification scheme for production of constructs encoding the Veg1 and Veg3a DNA-binding domains.
- FIG. 2 shows the results of DNA-binding affinity determination for the Veg1 DNA-binding subdomain.
- FIG. 2A shows EMSA analysis. Unbound probe is at the bottom of the gel and shifted probe (bound to Veg1) is indicated by the arrow to the right of the gel photo. Concentration of Veg1 is given at the top.
- MBP-VEGF1 indicates a binding reaction in which 15 nM of the Veg1-maltose binding protein fusion was used.
- FIG. 3 shows an autoradiogram of a DNA gel, indicating the existence and location of DNaseI-hypersensitive sites in the human VEGF-A gene.
- FIG. 4 is a schematic diagram of the plasmid pSRC1b-EPO2C.
- the rightward-pointing arrow represents the start site of a transcription unit encoding a fusion protein that includes a nuclear localization signal (NLS), a ZFP binding domain targeted to nucleotide ⁇ 862 of the human erythropoietin gene (EPO2c ZFP), a portion of the SRC1 protein from amino acids 781-1385 (SRC1b), and a FLAG epitope (Flag).
- pCMV represents a CMV promoter. Selected restriction enzyme recognition sites are also indicated.
- FIG. 5 shows erythropoietin (EPO) levels in transfected and control cells, as determined by ELISA.
- the bar labeled pSRC1b-EPO2C represents levels of EPO secreted into the medium by cells transfected with a plasmid encoding a fusion between an EPO-targeted ZFP and a portion of the SRC1 protein.
- pCDNA3.1 represents secreted EPO levels in cells transfected with a control plasmid that does not encode a ZFP-SRC1 fusion.
- FIG. 6 is a schematic diagram depicting the structure of a set of fusion molecules described in Example 13.
- NLS refers to a nuclear localization sequence
- ZFP-DBD refers to the VEGF3a/l zinc finger DNA-binding domain
- MBD refers to a portion of a methyl binding domain protein
- DNMT refers to a portion of a DNA N-methyl transferase protein
- Flag refers to a FLAG epitope.
- FIG. 7 shows VEGF levels in transfected and control cells, as determined by ELISA.
- MOCK refers to cells transfected with a vector that does not contain a ZFP-MBD or ZFP-DNMT fusion.
- pEGFP-KRAB refers to cells transfected with a green fluorescent protein-encoding plasmid.
- PVF3a/1 refers to the VEGF3a/1 DNA binding domain described in Examples 3 and 13.
- MBD refers to various methyl binding domain proteins.
- DNMT refers to various DNA N-methyl transferases.
- compositions and methods useful for modifying chromatin structure in a predetermined region of interest in cellular chromatin facilitates many processes involving nucleotide sequence-specific interaction of molecules with cellular chromatin.
- modification of chromatin structure is a prerequisite for binding of a regulatory molecule to its target site in cellular chromatin. Such binding can be useful in the regulation of an endogenous cellular gene by one or more endogenous and/or exogenous molecules.
- Regulation of gene expression often involves recruitment of a chromatin remodeling complex to a region of cellular chromatin (e.g., the promoter of a gene). Recruitment can occur, for example, by protein-protein interactions between a sequence-specific DNA-binding transcriptional regulatory protein bound at a promoter and a component of the remodeling complex. See, for example, Peterson et al. (2000) Curr. Opin. Genet. Devel. 10:187-192. Alterations in chromatin structure in the vicinity of the promoter, mediated by the recruited remodeling complex, facilitate subsequent interactions that result in transcriptional activation or repression.
- a remodeling complex can be localized is limited by the sequence specificity of the DNA-binding transcriptional regulatory protein, since most, if not all, protein components of chromatin remodeling complexes do not possess sequence-specific DNA-binding activity.
- it is not easy to target chromatin remodeling to a particular region of interest in cellular chromatin unless one possesses a protein that is: (1) capable of binding to chromatin in or near the region of interest, and (2) capable of interacting with at least one component of a multi-subunit chromatin remodeling complex.
- the methods and compositions disclosed herein allow targeted modification of any region of interest in cellular chromatin, by employing a fusion molecule comprising a DNA-binding domain and a component of a chromatin remodeling complex or functional fragment thereof.
- the DNA-binding domain is selected or designed to bind to a target site within or near the region of interest. Any DNA-binding entity having the requisite specificity is suitable.
- the DNA-binding domain is a zinc finger DNA-binding domain.
- Binding of the DNA-binding portion of the fusion molecule localizes the portion of the fusion molecule comprising a component of a chromatin remodeling complex to the region of binding, where it interacts with other components to reconstitute a functional chromatin remodeling complex in the vicinity of the target site.
- Chromatin remodeling ensues in the vicinity of the target site, which renders the region of binding (e.g., a gene promoter) susceptible to the action of endogenous regulatory factors, and/or to the regulatory activities of exogenous molecules.
- targeted remodeling of chromatin will facilitate the regulation of many processes involving access of molecules to DNA in cellular chromatin including, but not limited to, replication, recombination, repair, transcription, telomere function and maintenance, sister chromatid cohesion, and mitotic chromosome segregation.
- targeted integration of exogenous DNA into cellular chromatin will be enhanced by chromatin remodeling in the region of the desired integration site.
- MOLECULAR CLONING A LABORATORY MANUAL, Second edition, Cold Spring Harbor Laboratory Press, 1989; Ausubel et al., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, New York, 1987 and periodic updates; the series METHODS IN ENZYMOLOGY, Academic Press, San Diego; Wolffe, CHROMATIN STRUCTURE AND FUNCTION, Third edition, Academic Press, San Diego, 1998; METHODS IN ENZYMOLOGY, Vol. 304, “Chromatin” (P. M. Wassarman and A. P. Wolffe, eds.), Academic Press, San Diego, 1999; and METHODS IN MOLECULAR BIOLOGY, Vol. 119, “Chromatin Protocols” (P. B. Becker, ed.) Humana Press, Totowa, 1999.
- nucleic acid polynucleotide, and oligonucleotide are used interchangeably and refer to a deoxyribonucleotide or ribonucleotide polymer in either single- or double-stranded form.
- these terms are not to be construed as limiting with respect to the length of a polymer.
- the terms can encompass known analogues of natural nucleotides, as well as nucleotides that are modified in the base, sugar and/or phosphate moieties.
- an analogue of a particular nucleotide has the same base-pairing specificity; i.e., an analogue of A will base-pair with T.
- nucleic acids containing modified backbone residues or linkages which are synthetic, naturally occurring, and non-naturally occurring, which have similar binding properties as the reference nucleic acid, and which are metabolized in a manner similar to the reference nucleotides.
- analogs include, without limitation, phosphorothioates, phosphoramidates, methyl phosphonates, chiral-methyl phosphonates, 2-O-methyl ribonucleotides, peptide-nucleic acids (PNAs).
- nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon substitutions) and complementary sequences, as well as the sequence explicitly indicated.
- Nucleic acids include, for example, genes, cDNAs, and mRNAs. Polynucleotide sequences are displayed herein in the conventional 5′-3′ orientation.
- Chromatin is the nucleoprotein structure comprising the cellular genome.
- Cellular chromatin comprises nucleic acid, primarily DNA, and protein, including histones and non-histone chromosomal proteins.
- the majority of eukaryotic cellular chromatin exists in the form of nucleosomes, wherein a nucleosome core comprises approximately 150 base pairs of DNA associated with an octamer comprising two each of histones H2A, H2B, H3 and H4; and linker DNA (of variable length depending on the organism) extends between nucleosome cores.
- a molecule of histone H1 is generally associated with the linker DNA.
- the term “chromatin” is meant to encompass all types of cellular nucleoprotein, both prokaryotic and eukaryotic.
- Cellular chromatin includes both chromosomal and episomal chromatin.
- Chromatin modification refers to any process by which the structure of chromatin or its constituents is altered. Remodeling can include, for example, removal or repositioning of nucleosomes, addition of nucleosomes, changes in nucleosome density, changes in the path of DNA along the histone octamer, and/or changes in higher-order chromatin structure such as, for example, unwinding of the chromatin solenoid. Chromatin modification can also include modifications to histones or nucleic acid which might not necessarily change the structure of chromatin as assayable by current methods. For example, acetylation or deacetylation of histones, as well as methylation or demethylation of nucleic acid, are instances of chromatin modification.
- a chromosome as is known to one of skill in the art, is a chromatin complex comprising all or a portion of the genome of a cell.
- the genome of a cell is often characterized by its karyotype, which is the collection of all the chromosomes that comprise the genome of the cell.
- the genome of a cell can comprise one or more chromosomes.
- An episome is a replicating nucleic acid, nucleoprotein complex or other structure comprising a nucleic acid that is not part of the chromosomal karyotype of a cell.
- Examples of episomes include plasmids and certain viral genomes.
- a target site is a nucleic acid sequence that defines a portion of a nucleic acid to which a binding molecule will bind, provided sufficient conditions for binding exist.
- the sequence 5′-GAATTC-3′ is a target site for the Eco RI restriction endonuclease.
- a target site may be required for binding of a molecule to a nucleic acid at the target site.
- binding of a molecule to a polynucleotide comprising a target site may require both a particular nucleotide sequence and a particular protein composition adjacent to, or in the vicinity of, the target site.
- Conditions such as, for example, temperature, pH, and ionic strength can also affect binding of a molecule to its target site.
- Target sites for various transcription factors are known. See, for example, Wingender et al. (1997) Nucleic Acids Res. 25:265-268 and the TRANSFAC Transcription Factor database at http://transfac.gbf.de/TRANSFAC/, accessed on Apr. 13, 2000.
- target sites for newly-discovered transcription factors, as well as other types of exogenous molecule can be determined by methods that are well-known to those of skill in the art such as, for example, electrophoretic mobility shift assay, exonuclease protection, DNase footprinting, chemical footprinting and/or direct nucleotide sequence determination of a binding site. See, for example, Ausubel et al., supra, Chapter 12.
- a binding site in cellular chromatin is a region at which a particular molecule, for example a protein, will bind to a target site in the chromatin.
- a binding site will generally comprise a target site, but not every target site will constitute a binding site in cellular chromatin.
- a target site may be occluded by one or more chromosomal components, such as histones or nonhistone proteins, or might be rendered inaccessible to its binding molecule because of nucleosomal or higher-order chromatin structure.
- the presence of one or more chromosomal proteins may be required, in addition to a target site, to define a binding site.
- An accessible region is a site in a chromosome, episome or other cellular structure comprising a nucleic acid, in which a target site present in the nucleic acid can be bound by an exogenous molecule which recognizes the target site.
- an accessible region is one that is not packaged into a nucleosomal structure.
- the distinct structure of an accessible region can often be detected by its sensitivity to chemical and enzymatic probes, for example, nucleases.
- An exogenous molecule is a molecule that is not normally present in a cell, but can be introduced into a cell by one or more genetic, biochemical or other methods. Normal presence in the cell is determined with respect to the particular developmental stage and environmental conditions of the cell. Thus, for example, a molecule that is present only during embryonic development of muscle is an exogenous molecule with respect to an adult muscle cell. Similarly, a molecule induced by heat shock is an exogenous molecule with respect to a non-heat-shocked cell.
- An exogenous molecule can comprise, for example, a functioning version of a malfunctioning endogenous molecule or a malfunctioning version of a normally-functioning endogenous molecule.
- An exogenous molecule can be, among other things, a small molecule, such as is generated by a combinatorial chemistry process, or a macromolecule such as a protein, nucleic acid, carbohydrate, lipid, glycoprotein, lipoprotien, polysaccharide, any modified derivative of the above molecules, or any complex comprising one or more of the above molecules.
- Nucleic acids include DNA and RNA, can be single- or double-stranded; can be linear, branched or circular; and can be of any length. Nucleic acids include those capable of forming duplexes, as well as triplex-forming nucleic acids. See, for example, U.S. Pat. Nos. 5,176,996 and 5,422,251.
- Proteins include, but are not limited to, DNA-binding proteins, transcription factors, chromatin remodeling factors, methylated DNA binding proteins, polymerases, methylases, demethylases, acetylases, deacetylases, kinases, phosphatases, integrases, recombinases, ligases, topoisomerases, gyrases and helicases.
- An exogenous molecule can be the same type of molecule as an endogenous molecule, e.g., protein or nucleic acid, providing it has a sequence that is different from an endogenous molecule.
- an exogenous nucleic acid can comprise an infecting viral genome, a plasmid or episome introduced into a cell, or a chromosome that is not normally present in the cell.
- lipid-mediated transfer i.e., liposomes, including neutral and cationic lipids
- electroporation direct injection
- cell fusion cell fusion
- particle bombardment particle bombardment
- calcium phosphate co-precipitation DEAE-dextran-mediated transfer
- viral vector-mediated transfer viral vector-mediated transfer.
- an endogenous molecule is one that is normally present in a particular cell at a particular developmental stage under particular environmental conditions.
- an endogenous nucleic acid can comprise a chromosome, the genome of a mitochondrion, chloroplast or other organelle, or a naturally-occurring episomal nucleic acid.
- Additional endogenous molecules can include proteins, for example, transcription factors and components of chromatin remodeling complexes.
- a fusion molecule is a molecule in which two or more subunit molecules are linked, preferably covalently.
- the subunit molecules can be the same chemical type of molecule, or can be different chemical types of molecules.
- Examples of the first type of fusion molecule include, but are not limited to, fusion polypeptides (for example, a fusion between a ZFP DNA-binding domain and a transcriptional activation domain) and fusion nucleic acids (for example, a nucleic acid encoding the fusion polypeptide described supra).
- fusion molecules examples include, but are not limited to, a fusion between a triplex-forming nucleic acid and a polypeptide, and a fusion between a minor groove binder and a nucleic acid.
- a fusion molecule is a nucleic acid which encodes a ZFP DNA-binding domain in operative linkage with a component of a chromatin remodeling complex or functional fragment thereof.
- a gene for the purposes of the present disclosure, includes a DNA region encoding a gene product (see infra), as well as all DNA regions which regulate the production of the gene product, whether or not such regulatory sequences are adjacent to coding and/or transcribed sequences. Accordingly, a gene includes, but is not necessarily limited to, promoter sequences, terminators, translational regulatory sequences such as ribosome binding sites and internal ribosome entry sites, enhancers, silencers, insulators, boundary elements, replication origins, matrix attachment sites and locus control regions.
- Gene expression refers to the conversion of the information, contained in a gene, into a gene product.
- a gene product can be the direct transcriptional product of a gene (e.g., mRNA, tRNA, rRNA, antisense RNA, ribozyme, structural RNA or any other type of RNA) or a protein produced by translation of a mRNA.
- Gene products also include RNAs which are modified, by processes such as capping, polyadenylation, methylation, and editing, and proteins modified by, for example, methylation, acetylation, phosphorylation, ubiquitination, ADP-ribosylation, myristilation, and glycosylation.
- Modulation of gene expression refers to a change in the activity of a gene.
- Modulation of expression can include, but is not limited to, gene activation and gene repression. Modulation can be assayed by determining any parameter that is indirectly or directly affected by the expression of the target gene. Such parameters include, e.g., changes in RNA or protein levels; changes in protein activity; changes in product levels;
- reporter genes such as, for example, luciferase, CAT, beta-galactosidase, or GFP (see, e.g., Mistili & Spector, (1997) Nature Biotechnology 15:961-964)
- changes in signal transduction changes in
- Such functional effects can be measured by conventional methods, e.g., measurement of RNA or protein levels, measurement of RNA stability, and/or identification of downstream or reporter gene expression.
- Readout can be by way of, for example, chemiluminescence, fluorescence, colorimetric reactions, antibody binding, inducible markers, ligand binding assays; changes in intracellular second messengers such as cGMP and inositol triphosphate (IP 3 ); changes in intracellular calcium levels; cytokine release, and the like.
- Gene activation is any process which results in an increase in production of a gene product.
- a gene product can be either RNA (including, but not limited to, mRNA, rRNA, tRNA, and structural RNA) or protein.
- gene activation includes those processes which increase transcription of a gene and/or translation of a mRNA. Examples of gene activation processes which increase transcription include, but are not limited to, those which facilitate formation of a transcription initiation complex, those which increase transcription initiation rate, those which increase transcription elongation rate, those which increase processivity of transcription and those which relieve transcriptional repression (by, for example, blocking the binding of a transcriptional repressor).
- Gene activation can constitute, for example, inhibition of repression as well as stimulation of expression above an existing level.
- Examples of gene activation processes which increase translation include those which increase translational initiation, those which increase translational elongation and those which increase mRNA stability.
- gene activation comprises any detectable increase in the production of a gene product, preferably an increase in production of a gene product by about 2-fold, more preferably from about 2- to about 5-fold or any integer therebetween, more preferably between about 5- and about 10-fold or any integer therebetween, more preferably between about 10- and about 20-fold or any integer therebetween, still more preferably between about 20- and about 50-fold or any integer therebetween, more preferably between about 50- and about 100-fold or any integer therebetween, more preferably 100-fold or more.
- Gene repression is any process which results in a decrease in production of a gene product.
- a gene product can be either RNA (including, but not limited to, mRNA, rRNA, tRNA, and structural RNA) or protein.
- gene repression includes those processes which decrease transcription of a gene and/or translation of a mRNA.
- Examples of gene repression processes which decrease transcription include, but are not limited to, those which inhibit formation of a transcription initiation complex, those which decrease transcription initiation rate, those which decrease transcription elongation rate, those which decrease processivity of transcription and those which antagonize transcriptional activation (by, for example, blocking the binding of a transcriptional activator).
- Gene repression can constitute, for example, prevention of activation as well as inhibition of expression below an existing level.
- Examples of gene repression processes which decrease translation include those which decrease translational initiation, those which decrease translational elongation and those which decrease mRNA stability.
- Transcriptional repression includes both reversible and irreversible inactivation of gene transcription.
- gene repression comprises any detectable decrease in the production of a gene product, preferably a decrease in production of a gene product by about 2-fold, more preferably from about 2- to about 5-fold or any integer therebetween, more preferably between about 5- and about 10-fold or any integer therebetween, more preferably between about 10- and about 20-fold or any integer therebetween, still more preferably between about 20- and about 50-fold or any integer therebetween, more preferably between about 50- and about 100-fold or any integer therebetween, more preferably 100-fold or more.
- gene repression results in complete inhibition of gene expression, such that no gene product is detectable.
- modulating expression, inhibiting expression and activating expression of a gene can refer to the ability of a molecule to activate or inhibit transcription of a gene.
- Activation includes prevention of transcriptional inhibition (i.e., prevention of repression of gene expression) and inhibition includes prevention of transcriptional activation (i.e., prevention of gene activation).
- cells contacted with, for example, ZFPs can be compared to control cells, e.g., without the zinc finger protein or with a non-specific ZFP, to examine the extent of inhibition or activation.
- Control samples can be assigned a relative gene expression activity value of 100%. Modulation/inhibition of gene expression is achieved when the gene expression activity value relative to the control is about 80% or below, preferably 50% or below (i.e., 0.5x or less the activity of the control), more preferably 25% or below, more preferably 0-5%.
- Modulation/activation of gene expression is achieved when the gene expression activity value relative to the control is greater than 100%, preferably 110% or more, more preferably 150% or more (i.e., 1.5 ⁇ the activity of the control or greater), more preferably 200-500% or more, still more preferably 1000-2000% or more.
- Eucaryotic cells include, but are not limited to, fungal cells (such as yeast), protozoal cells, plant cells, insect cells, animal cells, including avian cells, teleost cells, amphibian cells, reptilian cells, mammalian cells, canine cells, porcine cells, feline cells, murine cells, ovine cells, bovine cell, equine cells, primate cells and human cells.
- fungal cells such as yeast
- protozoal cells such as plant cells
- insect cells such as yeast
- animal cells including avian cells, teleost cells, amphibian cells, reptilian cells, mammalian cells, canine cells, porcine cells, feline cells, murine cells, ovine cells, bovine cell, equine cells, primate cells and human cells.
- a region of interest is any region of cellular chromatin, such as, for example, a gene or a non-coding sequence within or adjacent to a gene, in which it is desirable to, for example, modify chromatin structure and/or bind an exogenous molecule.
- a region of interest can be present in a chromosome, an episome, an organellar genome (e.g., mitochondrial, chloroplast), or an infecting viral genome, for example.
- a region of interest can be within the coding region of a gene, within transcribed non-coding regions such as, for example, leader sequences, trailer sequences or introns, or within non-transcribed regions, either upstream or downstream of the coding region.
- operable linkage operative linkage, operably linked and operatively linked are used with reference to a juxtaposition of two or more components (such as sequence elements), in which the components are arranged such that both components function normally and allow the possibility that at least one of the components can mediate a function that is exerted upon at least one of the other components.
- a transcriptional regulatory sequence such as a promoter
- An operatively linked transcriptional regulatory sequence is generally joined in cis with a coding sequence, but need not be directly adjacent to it.
- an enhancer can constitute a transcriptional regulatory sequence that is operatively-linked to a coding sequence, even though they are not contiguous.
- the term operatively linked can refer to the fact that each of the components performs the same function in linkage to the other component as it would if it were not so linked.
- the ZFP DNA-binding domain and the component of the chromatin remodeling complex (or functional fragment thereof) are in operative linkage if, in the fusion polypeptide, the ZFP DNA-binding domain portion is able to bind its target site and/or its binding site, while the component of the chromatin remodeling complex (or functional fragment thereof) is able to interact with other members of its cognate chromatin remodeling complex.
- a functional fragment of a protein, polypeptide or nucleic acid is a protein, polypeptide or nucleic acid whose sequence is not identical to the full-length protein, polypeptide or nucleic acid, yet retains the same function as the full-length protein, polypeptide or nucleic acid.
- a functional fragment can possess more, fewer, or the same number of residues as the corresponding native molecule, and/or can contain one or more amino acid or nucleotide substitutions.
- the DNA-binding function of a polypeptide can be determined, for example, by filter-binding, electrophoretic mobility-shift, or immunoprecipitation assays. See Ausubel et al., supra.
- the ability of a protein to interact with another protein can be determined, for example, by co-immunoprecipitation, two-hybrid assays or complementation, both genetic and biochemical. See, for example, Fields et al. (1989) Nature 340:245-246; U.S. Pat. No. 5,585,245 and PCT WO 98/44350.
- recombinant when used with reference to a cell, indicates that the cell replicates an exogenous nucleic acid, or expresses a peptide or protein encoded by an exogenous nucleic acid.
- Recombinant cells can contain genes that are not found within the native (non-recombinant) form of the cell.
- Recombinant cells can also contain genes found in the native form of the cell wherein the genes are modified and re-introduced into the cell by artificial means.
- the term also encompasses cells that contain a nucleic acid endogenous to the cell that has been modified without removing the nucleic acid from the cell; such modifications include those obtained by gene replacement, site-specific mutation, and related techniques.
- recombinant cells express genes that are not found within the native (naturally occurring) form of the cell or express a second copy of a native gene that is otherwise normally or abnormally expressed, underexpressed or not expressed at all.
- Recombinant cells also include cells or cell lines derived from cells that have been modified as described.
- nucleic acid, protein, or vector When used with reference, e.g., to a nucleic acid, protein, or vector, the term recombinant refers to nucleic acids, proteins or vectors that have been modified by the introduction of heterologous nucleic acid or amino acid sequence, and includes any other alterations of a native nucleic acid or protein.
- An expression vector is a nucleic acid construct, generated recombinantly or synthetically, with a series of specified nucleic acid elements that permit transcription of a particular nucleic acid in a host cell, and optionally integration and/or replication of the expression vector in a host cell.
- the expression vector can be part of a plasmid, viral genome, or nucleic acid fragment, of viral or non-viral origin.
- Expression vectors can be, for example, naked DNA molecules, or can comprise nucleic acid of viral or nonviral origin packaged into viral particles.
- the expression vector includes an expression cassette, which comprises a nucleic acid to be transcribed operably linked to control elements that are capable of effecting expression of a nucleic acid that is operatively linked to the control elements in hosts compatible with such sequences.
- Expression cassettes include at least promoters and optionally, transcription termination signals.
- a recombinant expression cassette includes at least a nucleic acid to be transcribed (e.g., a nucleic acid encoding a desired polypeptide) and a promoter. Additional factors necessary or helpful in effecting expression can also be used, for example, an expression cassette can also include nucleotide sequences that encode a signal sequence that directs secretion of an expressed protein from the host cell. Transcription termination signals, enhancers, and other nucleic acid sequences that influence gene expression can also be included in an expression cassette.
- polypeptide, peptide and protein are used interchangeably herein to refer to a polymer of amino acid residues.
- the terms apply to amino acid polymers in which one or more amino acid residue is an analog or mimetic of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers.
- Polypeptides can be modified, e.g., by phosphorylation, methylation, myristilation, acetylation and/or the addition of carbohydrate residues to form glycoproteins.
- polypeptide, peptide and protein include all of these modified polypeptides, as well as polypeptides comprising any additional covalent or non-covalent modification.
- Polypeptide sequences are displayed herein in the conventional N-terminal to C-terminal orientation.
- a subsequence or segment when used in reference to a nucleic acid or polypeptide, refers to a sequence of nucleotides or amino acids that comprise a part of a longer sequence of nucleotides or amino acids (e.g., a polynucleotide or polypeptide), respectively.
- Specific binding between an antibody or other binding agent and an antigen, or between two binding partners means that the dissociation constant for the interaction is less than 10 ⁇ 6 M.
- Preferred antibody/antigen or binding partner complexes have a dissociation constant of less than about 10 ⁇ 7 M, and preferably 10 ⁇ 8 M to 10 ⁇ 9 M or 10 ⁇ 10 M or lower.
- a binding domain or binding molecule is a compound that is able to bind, either covalently or non-covalently, to another molecule.
- the other molecule can be, for example, a polynucleotide (e.g., DNA or RNA) or a polypeptide.
- Binding domains can comprise any compound able to bind another molecule; exemplary binding domains are polypeptides and are denoted binding proteins.
- a binding protein can bind to, for example, a DNA molecule (a DNA-binding domain), an RNA molecule (an RNA-binding domain) and/or a protein molecule (a protein-binding domain).
- a protein-binding protein In the case of a protein-binding protein, it can bind to itself (to form homodimers, homotrimers, etc.) and/or it can bind to one or more molecules of a different protein or proteins.
- a binding domain can have more than one type of binding activity.
- zinc finger proteins have DNA-binding, RNA-binding and protein-binding activity.
- a zinc finger binding protein is a protein or polypeptide that binds DNA, RNA and/or protein, preferably in a sequence-specific manner, as a result of stabilization of protein structure through coordination of a zinc ion.
- the term zinc finger binding protein is often abbreviated as zinc finger protein or ZFP.
- the individual DNA binding domains are typically referred to as fingers.
- a ZFP has least one finger, typically two fingers, three fingers, four fingers, five fingers, or six or more fingers. Each finger binds from two to four base pairs of DNA, typically three or four base pairs of DNA.
- a ZFP binds to a nucleic acid sequence called a target site or target segment. Each finger typically comprises an approximately 30 amino acid, zinc-chelating, DNA-binding subdomain.
- C 2 H 2 class An exemplary motif characterizing one class of these proteins (C 2 H 2 class) is -Cys-(X) 2-4 -Cys-(X) 12 -His-(X) 3-5 -His (where X is any amino acid).
- a single zinc finger of this class consists of an alpha helix containing the two invariant histidine residues and two beta sheets, which form a beta turn containing the two invariant cysteine residues. The two cysteine and two histidine residues coordinate a single zinc atom (see, e.g., Berg & Shi, Science 271:1081-1085 (1996)).
- Zinc finger proteins can be engineered to bind to predetermined sequences.
- Examples of zinc finger engineering include designed zinc finger proteins and selected zinc finger proteins.
- a designed zinc finger protein is a protein not occurring in nature whose structure and composition result principally from rational criteria. Rational criteria for design include application of substitution rules and computerized algorithms for processing information in a database storing information of existing ZFP designs and binding data, for example as described in PCT WO 98/53058, WO 98/53059, WO 99/53060 and WO 00/42219.
- a selected zinc finger protein is a protein not found in nature whose production results primarily from an empirical process such as phage display. See e.g., U.S. Pat. No. 5,789,538; U.S. Pat. No. 6,007,988; U.S. Pat. No. 6,013,453; WO 95/19431; WO 96/06166 WO 98/53057 and WO 98/54311.
- a target site or target sequence for a ZFP can be a nucleotide sequence (either DNA or RNA) or an amino acid sequence.
- a ZFP target site typically has about four to about ten base pairs, but can be as long as 18-20 base pairs, e.g., for a six-finger ZFP.
- a two-fingered ZFP recognizes a four to seven base pair target site
- a three-fingered ZFP recognizes a six to ten base pair target site.
- a DNA target sequence for a three-finger ZFP is generally either 9 or 10 nucleotides in length, depending upon the presence and/or nature of cross-strand interactions between the ZFP and the target sequence.
- Target sequences can be found in any DNA or RNA sequence, including regulatory sequences, exons, introns, or any non-coding sequence.
- a target subsite or subsite is the portion of a DNA target site that is bound by a single zinc finger.
- a subsite in the absence of cross-strand interactions, a subsite is generally three nucleotides in length.
- a cross-strand interaction occurs (e.g., a “D-able subsite,” as described for example in co-owned PCT WO 00/42219, incorporated by reference in its entirety herein)
- a subsite is four nucleotides in length and overlaps with another 3- or 4-nucleotide subsite.
- K d refers to the dissociation constant for the compound, i.e., the concentration of a compound (e.g., a zinc finger protein) that gives half maximal binding of the compound to its target (i.e., half of the compound molecules are bound to the target) under given conditions (i.e., when [target] ⁇ K d ), as measured using a given assay system (see, e.g., U.S. Pat. No. 5,789,538). Any assay system can be used, as long is it gives an accurate measurement of the actual k d .
- the k d for a ZFP is measured using an electrophoretic mobility shift assay (“EMSA”), as described, for example, in WO 00/441566 and WO 00/42219.
- ESA electrophoretic mobility shift assay
- Administering an expression vector, nucleic acid, ZFP, or a delivery vehicle to a cell comprises transducing, transfecting, electroporating, translocating, fusing, phagocytosing, shooting or ballistic methods, etc., i.e., any means by which a protein or nucleic acid can be transported across a cell membrane and preferably into the nucleus of a cell.
- the term effective amount includes that amount which results in the desired result, for example, remodeling of cellular chromatin structure in a region of interest, repression of an active gene, activation of a repressed gene, or inhibition of transcription of a structural gene or translation of RNA.
- a delivery vehicle refers to a compound, e.g., a liposome, toxin, or a membrane translocation polypeptide, which is used to administer an exogenous molecule.
- Delivery vehicles can be used, for example, to administer nucleic acids encoding fusion molecules.
- Exemplary delivery vehicles include lipid:nucleic acid complexes, expression vectors, viruses, and the like.
- a promoter is defined as an array of nucleic acid control sequences that direct transcription.
- a promoter typically includes necessary nucleic acid sequences near the start site of transcription, such as, in the case of certain RNA polymerase II type promoters, a TATA element, enhancer, CCAAT box, SP-1 site, etc.
- a promoter also optionally includes distal enhancer or repressor elements, which can be located as much as several thousand base pairs from the start site of transcription.
- the promoters often have an element that is responsive to transactivation by a DNA-binding moiety such as a polypeptide, e.g., a nuclear receptor, Gal4, the lac repressor and the like.
- a constitutive promoter is a promoter that is active under most environmental and developmental conditions.
- An inducible promoter is a promoter that is active under certain environmental or developmental conditions.
- a regulatory domain or functional domain refers to a protein or a polypeptide sequence (or portion thereof) that has transcriptional modulation activity, or that is capable of interacting with proteins and/or protein domains that have transcriptional modulation activity.
- proteins include, e.g., transcription factors and co-factors (e.g., KRAB, MAD, ERD, SID, nuclear factor kappa B subunit p65, early growth response factor 1, and nuclear hormone receptors, VP16, VP64), endonucleases, integrases, recombinases, methyltransferases, histone acetyltransferases, histone deacetylases and polypeptides which are components of a chromatin remodeling complex, and their functional fragments.
- transcription factors and co-factors e.g., KRAB, MAD, ERD, SID, nuclear factor kappa B subunit p65, early growth response factor 1, and nuclear hormone receptors, VP16, VP64
- a functional domain can be covalently or non-covalently linked to a DNA-binding domain (e.g., a ZFP) to modulate transcription of a gene of interest.
- a DNA-binding domain e.g., a ZFP
- some binding domains, such as for example ZFPs can act in the absence of a functional domain to modulate transcription.
- transcription of a gene of interest can be modulated by a binding domain, such as a ZFP, linked to multiple functional domains.
- heterologous is a relative term, which when used with reference to portions of a nucleic acid indicates that the nucleic acid comprises two or more subsequences that are not found in the same relationship to each other in nature.
- a nucleic acid that is recombinantly produced typically has two or more sequences from unrelated genes synthetically arranged to make a new functional nucleic acid, e.g., a promoter from one source and a coding region from another source or a fusion of coding sequences from two different genes.
- the two nucleic acids are thus heterologous to each other in this context.
- the recombinant nucleic acids When added to a cell, the recombinant nucleic acids would also be heterologous to the endogenous genes of the cell.
- a heterologous nucleic acid would include a recombinant nucleic acid that has integrated into the chromosome, or a recombinant extrachromosomal nucleic acid.
- a heterologous protein indicates that the protein comprises two or more subsequences that are not found in the same relationship to each other in nature (e.g., a fusion protein, wherein sequences from two or more different proteins are encoded by a single nucleic acid sequence). See, e.g., Ausubel, supra, for an introduction to recombinant techniques.
- a host cell is a cell that contains one or more exogenous molecules such as, for example, expression vectors and/or heterologous nucleic acids.
- the host cell typically supports the replication or expression of an expression vector.
- Host cells may be prokaryotic cells such as, for example, E. coli and B. subtilis, or eukaryotic cells such as fungal cells (e.g., yeast), protozoal cells, plant cells, insect cells, animal cells, avian cells, teleost cells, amphibian cells, mammalian cells, primate cells or human cells.
- Exemplary mammalian cell lines include CHO, HeLa, 293, COS-1, and the like, e.g., cultured cells (in vitro), explants and primary cultures (in vitro and ex vivo), and cells in vivo.
- amino acid refers to naturally occurring and synthetic amino acids, as well as amino acid analogues and amino acid mimetics that function in a manner similar to the naturally occurring amino acids.
- Naturally occurring amino acids are those encoded by the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, carboxyglutamate, and O-phosphoserine.
- Amino acid analogue refers to compounds that have the same basic chemical structure as a naturally occurring amino acid, i.e., an ⁇ carbon that is bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, norleucine, methionine sulfoxide, methionine, and methyl sulfonium.
- Such analogues have modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic chemical structure as a naturally occurring amino acid.
- Amino acid mimetics refers to chemical compounds that have a structure that is different from the general chemical structure of an amino acid, but that functions in a manner similar to a naturally occurring amino acid.
- Conservatively modified variants applies to both amino acid and nucleic acid sequences.
- conservatively modified variants refers to those nucleic acids which encode identical or essentially identical amino acid sequences, or where the nucleic acid does not encode an amino acid sequence, to essentially identical sequences.
- degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues (Batzer et al., Nucleic Acid Res. 19:5081 (1991); Ohtsuka et al., J. Biol. Chem.
- nucleic acid variations are silent variations, which are one species of conservatively modified variations.
- Every nucleic acid sequence herein which encodes a polypeptide also describes every possible silent variation of the nucleic acid.
- each codon in a nucleic acid except AUG, which is ordinarily the only codon for methionine, and TGG, which is ordinarily the only codon for tryptophan
- TGG which is ordinarily the only codon for tryptophan
- amino acid and nucleic acid sequences individual substitutions, deletions or additions that alter, add or delete a single amino acid or nucleotide or a small percentage of amino acids or nucleotides in the sequence create a conservatively modified variant, wherein the alteration results in the substitution of an amino acid with a chemically similar amino acid.
- Conservative substitution tables providing functionally similar amino acids are well known in the art.
- conservatively modified variants are in addition to and do not exclude polymorphic variants and alleles. See, e.g., Creighton, Proteins (1984) for a discussion of amino acid properties.
- compositions and methods disclosed herein involve fusions between a DNA-binding domain and a component of a chromatin remodeling complex.
- compositions and methods disclosed herein involve fusions between a DNA-binding domain and a domain which participates in modulation of gene expression such as, for example a transcriptional activation domain or a transcriptional repression domain.
- a DNA-binding domain can comprise any molecular entity capable of sequence-specific binding to chromosomal DNA. Binding can be mediated by electrostatic interactions, hydrophobic interactions, or any other type of chemical interaction.
- moieties which can comprise part of a DNA-binding domain include, but are not limited to, minor groove binders, major groove binders, antibiotics, intercalating agents, peptides, polypeptides, oligonucleotides, and nucleic acids.
- An example of a DNA-binding nucleic acid is a triplex-forming oligonucleotide.
- Minor groove binders include substances which, by virtue of their steric and/or electrostatic properties, interact preferentially with the minor groove of double-stranded nucleic acids. Certain minor groove binders exhibit a preference for particular sequence compositions. For instance, netropsin, distamycin and CC-1065 are examples of minor groove binders which bind specifically to AT-rich sequences, particularly runs of A or T. WO 96/32496.
- antibiotics are known to exert their effects by binding to DNA. Binding of antibiotics to DNA is often sequence-specific or exhibits sequence preferences. Actinomycin, for instance, is a relatively GC-specific DNA binding agent.
- a DNA-binding domain is a polypeptide.
- Certain peptide and polypeptide sequences bind to double-stranded DNA in a sequence-specific manner.
- transcription factors participate in transcription initiation by RNA Polymerase II through sequence-specific interactions with DNA in the promoter and/or enhancer regions of genes. Defmed regions within the polypeptide sequence of various transcription factors have been shown to be responsible for sequence-specific binding to DNA. See, for example, Pabo et al. (1992) Ann. Rev. Biochem. 61:1053-1095 and references cited therein.
- regions include, but are not limited to, motifs known as leucine zippers, helix-loop-helix (HLH) domains, helix-turn-helix domains, zinc fingers, ⁇ -sheet motifs, steroid receptor motifs, bZIP domains homeodomains, AT-hooks and others.
- the amino acid sequences of these motifs are known and, in some cases, amino acids that are critical for sequence specificity have been identified.
- Polypeptides involved in other process involving DNA, such as replication, recombination and repair will also have regions involved in specific interactions with DNA.
- Peptide sequences involved in specific DNA recognition such as those found in transcription factors, can be obtained through recombinant DNA cloning and expression techniques or by chemical synthesis, and can be attached to other components of a fusion molecule by methods known in the art.
- Proteins containing methyl binding domains, or functional fragments thereof, can also be used as DNA-binding domains.
- Methyl binding domain proteins recognize and bind to CpG dinucleotide sequences in which the C residue is methylated.
- Proteins containing a methyl-binding domain include, but are not limited to, MBD1, MBD2, MBD3, MBD4, MeCP1 and MeCP2. See, for example, Bird et al. (1999) Cell 99:451-454.
- DNA methyl transferases which methylate the 5-position of C residues in CpG dinucleotides such as, for example, DNMT1, DNMT2, DNMT3a and DNMT3b, or functional fragments thereof, can be used as a DNA-binding domain.
- enzymes which demethylate methylated CpG, or functional fragments thereof can be used as a DNA-binding domain. Fremant et al. (1997) Nucleic Acids Res. 25:2375-2380; Okano et al. (1998) Nature Genet. 19:219-220; Bhattacharya et al. (1999) Nature 397:579-583; and Robertson et al. (2000) Carcinogenesis 21:461-467.
- a DNA-binding domain comprises a zinc finger DNA-binding domain. See, for example, Miller et al. (1985) EMBO J. 4:1609-1614; Rhodes et al. (1993) Scientific American Feb.: 56-65; and Klug (1999) J. Mol. Biol. 293:215-218.
- a target site for a zinc finger DNA-binding domain is identified according to site selection rules disclosed in co-owned WO 00/42219.
- ZFP DNA-binding domains are designed and/or selected to recognize a particular target site as described in co-owned WO 00/42219; WO 00/41566; and U.S. Ser. Nos.
- Certain DNA-binding domains are capable of binding to DNA that is packaged in nucleosomes. See, for example, Cordingley et al. (1987) Cell 48:261-270; Pina et al. (1990) Cell 60:719-731; and Cirillo et al. (1998) EMBO J. 17:244-254.
- Certain ZFP-containing proteins such as, for example, members of the nuclear hormone receptor superfamily, are capable of binding DNA sequences packaged into chromatin. These include, but are not limited to, the glucocorticoid receptor and the thyroid hormone receptor. Archer et al. (1992) Science 255:1573-1576; Wong et al. (1997) EMBO J. 16:7130-7145.
- binding domains are able to bind to internucleosomal (linker) DNA sequences. See, e.g., Zhang et al. (2000) J. Biol. Chem. 275:33,850-33,860.
- the binding specificity of the DNA-binding domain can be determined by identifying accessible regions in the cellular chromatin. Accessible regions can be determined as described in co-owned PCT/US01/40617, the disclosure of which is hereby incorporated by reference herein.
- a DNA-binding domain is then designed and/or selected to bind to a target site within the accessible region.
- chromatin modification Two major types have been described. The first is dependent on covalent modification. Covalent modification of histones occurs by processes such as, for example, acetylation and deacetylation. Covalent modification of DNA is exemplified by methylation of cytosine residues in CpG dinucleotides. The second type of modification results in changes in nucleosome location and/or conformation, and relies on the activity of ATP-driven chromatin remodeling machines. Both types of chromatin modification are carried out in vivo by multiprotein complexes. For the purposes of the present disclosure, proteins involved in either of these types of chromatin modification can comprise a component of a chromatin remodeling complex.
- Modifications of the first type often comprise histone acetylation, catalyzed by a complex containing a histone acetyl transferase (HAT), or histone deacetylation, catalyzed by a complex containing a histone deacetylase (HDAC).
- HAT histone acetyl transferase
- HDAC histone deacetylase
- An example of a complex involved in this type of chromatin modification is a histone deacetylase complex, examples of which include the SIN3 and Mi-2 complexes.
- Knoepfler et al. (1999) Cell 99:447-450 These complexes generally comprise one or more enzymatic components (i.e., a HDAC) as well as one or more non-enzymatic components.
- a component of a chromatin remodeling complex can be either an enzymatic or a non-enzymatic component (or a functional fragment of an enzymatic or non-enzymatic component) of a complex involved in the covalent modification of histones.
- Additional types of covalent modification of chromosomal proteins include, but are not limited to, methylation, demethylation, phosphorylation, dephosphorylation, ubiquitination, de-ubiquitination, ADP-ribosylation and de-ribosylation.
- Proteolysis of chromosomal proteins can also influence chromatin structure.
- Covalent modification of nucleosomal histones is the basis of a histone code that is involved in regulation of gene expression, at least in part through effects on chromatin structure. See, for example, Jenuwein et al. (2001) Science 293:1074-1080.
- proteins that participate in covalent modification of histones comprise enzymatic components of chromatin remodeling complexes.
- histones such as, for example, histone kinases, histone phosphatases, histone methyl transferases, histone demethylases, SAM synthetases, HP1, Su(Var) proteins and E(var) proteins
- proteins that interact with the aforementioned proteins and their functional fragments are useful in the disclosed methods and compositions.
- proteins involved in histone modification and regulation of chromatin structure contain conserved domains: these include the bromodomain, the chromodomain, the SET domain, the SANT domain, and the PHD domain. Accordingly, any protein comprising one of these domains is useful as a component of a fusion with a DNA-binding domain for use in the disclosed methods and compositions.
- a remodeling complex comprises an enzymatic component (an ATPase protein subunit) and one or more non-enzymatic protein subunits.
- ATPase subunits are grouped into three major families: the SWI/SNF family, the ISWI family, and the Mi-2/CHD family. See Tyler et al. (1999) Cell 99:443-446.
- a component of a chromatin remodeling complex can comprise one of its constituent proteins or a functional fragment thereof.
- a component of a chromatin remodeling complex can be an enzymatic component or a non-enzymatic component.
- Enzymatic components of chromatin remodeling complexes include, but are not limited to, the following ATPases: SWI2/SNF2, STH1, BRM, HBRM, BRG1, Mi-2/CHD, ISW1, ISW2, ISWI, and hSNF2h.
- SWI2/SNF2, STH1, BRM, HBRM, BRG1, Mi-2/CHD, ISW1, ISW2, ISWI, and hSNF2h include, but are not limited to, the following ATPases: SWI2/SNF2, STH1, BRM, HBRM, BRG1, Mi-2/CHD, ISW1, ISW2, ISWI, and hSNF2h.
- Tyler et al., supra Armstrong et al. (1998) Curr. Opin. Genet. Dev. 8:165-172; Guschin et al. (1999) Curr. Biol. 9:R742-746; and Wolffe et al. (2000) J. Struct. Biol. 129:102
- Modifications in chromatin structure include those which render chromosomal sequences more accessible to regulatory factors (i.e., formation of “open” chromatin) as well as those which make chromosomal sequences less accessible (i.e., formation of “closed” chromatin).
- Such modifications can include, for example, removal of nucleosomes from DNA, deposition of nucleosomes onto DNA, repositioning of nucleosomes, changes in nucleosome spacing, changes in nucleosome density, changes in the degree and/or nature of the interaction between DNA and histones in the nucleosome, changes in the path of DNA along the surface of the nucleosome, and/or changes in higher-order chromatin structure such as, for example, unwinding of the chromatin solenoid.
- compositions and methods disclosed herein involve fusions between a DNA-binding domain and a component of a chromatin remodeling complex, as described supra, or a polynucleotide encoding such a fusion.
- chromatin remodeling complexes have been identified and characterized in several organisms and cell types.
- Complexes known as SWI/SNF, RSC, ISW1 and ISW2 have been isolated and characterized in yeast.
- the NURF, CHRAC, ACF and brahma (dSWI/SNF or BRM) complexes have been isolated and characterized.
- Chromatin remodeling complexes from human cells named brm/BRG (hSWI/SNF), NURD and RSF have been isolated and characterized. See, for example, Cairns (1998) Trends Biochem. Sci. 23:20-25; Murchardt et al. (1999) J. Mol. Biol.
- the SWI/SNF chromatin remodeling complex of yeast comprises the SWI2/SNF2 helicase/ATPase and products of the SNF5, SWI3, SWP73, ARP7, ARP9, SWI1, SNF6, SWP82, SWP29 and SNF1 genes.
- Arp7 and Arp9 are actin-related proteins.
- the SWP29 gene product is also known as either TFG-3 or TAF30.
- the Drosophila brahma (brm) complex also known as dSWI/SNF contains an ATPase subunit, homologous to SWI2/SNF2, called brahma (brm), as well as SNR1, BAP155 (moira), BAP60, BAP111, BAP55, BAP74 and BAP47/ACT1/ACT2 subunits.
- chromatin remodeling complexes contain either of the two SWI2/SNF2 homologues mBRM or mBRG-1, along with subunits named mSNF5 and mBAF60a.
- SWI2/SNF2 homologues hBRM also known as hSNF2 ⁇
- BRG-1 also known as hSNF2 ⁇
- hSNF5 also known as INI-1
- hBAF170 also known as INI-1
- hBAF155 also known as INI-1
- hBAF60a or hBAF60b or hBAF60c
- hBAF57 ⁇ -actin
- hBAF53 also known as p270
- hBAF110 subunits also known as p270
- chromatin remodeling complexes have been discovered by virtue of their participation in the regulation of globin gene expression in human cells. These include E-RC1, comprising BRG-1 and the BAF57 protein, and the PYR complex, comprising hSNF5/INI1, BAF57, BAF60a, and BAF170.
- the RSC complex (“ r emodels the s nestture of c hromatin”), first identified in yeast, is a 15-subunit complex comprising the SWI2/SNF2 homologous ATPase STH1, along with SFH-1, RSC-8, actin-related proteins, RSC-6 and SAS-5.
- RSC The RSC complex
- Rsc1 and Rsc2 Two recently characterized subunits of RSC, denoted Rsc1 and Rsc2, each contains two bromodomains, a BAH (“bromo adjacent homology”) domain and an A/T hook motif, and thus likely participates in the interaction between the RSC complex and chromatin. Cairns et al. (1999) Mol. Cell 4:715-723.
- a family of helicase/ATPase proteins with homology to SNF2 have been described. These proteins contain seven conserved domains and are involved in a range of cellular functions, including transcription, recombination and repair.
- the mammalian ATRX protein is an example of this group of proteins. See Picketts et al. (1996) Hum. Mol. Genet. 5:1899-1907.
- NURF Nu cleosome R emodeling F actor
- the components of NURF include ISWI (a SWI2-related DNA-dependent ATPase, also known as NURF-140), NURF-38, NURF-55 and NURF-215. Additional properties of the NURF complex are disclosed in Sandaltzopoulos et al. (1999) Meth. Enzymology 304:757-765 and references cited therein.
- CHRAC Chr omatin A ccessibility C omplex
- ATP-dependent nucleosome spacing activity mediates ATP-dependent accessibility of chromatin to restriction endonucleases.
- the CHRAC complex includes the ISWI ATPase and four additional polypeptides: p15, p20, p175 and DNA topoisomerase II.
- ACF complex A TP-utilizing c hromatin assembly and remodeling f actor
- Drosophila Drosophila
- ACF contains the ISWI ATPase and three additional polypeptides: pl7, ACFI (p185) and ACFII (p170).
- the RSF complex ( r emodeling and s pacing f actor), found in human cells, contains the ISWI homologue hSNF2h and a subunit known as p325. Its activities include ATP-dependent nucleosome remodeling and spacing. LeRoy et al. (1998) Science 282:1900-1904.
- ISW1 Chromatin remodeling complexes in yeast, with ATPase subunits homologous to the Drosophila ISWI ATPase, include ISW1 and ISW2.
- ISW1 contains the ISW1 ATPase subunit, p74, p105 and p110. ISW1 has been characterized as possessing nucleosome-stimulated ATPase activity and ATP-dependent nucleosome disruption and spacing activities. Tsukiyama et al. (1999) Genes Dev. 13:686-697.
- the yeast ISW2 complex contains the ISW2 ATPase along with a second subunit having a molecular weight of 140 kD. ISW2 possesses nucleosome-stimulated ATPase activity and ATP-dependent nucleosome disruption activity. Tsukiyama et al. (1999)supra.
- the WCRF chromatin remodeling complex was isolated from human (HeLa) cells and contains an ISWI-homologous ATPase known as WCRF 135 (SNF2h) and a subunit known as WCRF 180.
- WCRF 180 has several hallmarks of a transcription factor, including a heterochromatin localization domain, a PHD finger (a cysteine-rich zinc-binding domain) and a bromodomain (a domain reported to be involved in interaction with histones).
- a heterochromatin localization domain including a heterochromatin localization domain, a PHD finger (a cysteine-rich zinc-binding domain) and a bromodomain (a domain reported to be involved in interaction with histones).
- Chromatin remodeling complexes from human (NRD/NURD complex) and amphibian cells (Mi-2 complex) contain a nucleosome-dependent ATPase activity called Mi-2 (also known as CHD). Additional protein components of the amphibian Mi-2 complex include Mtal-like (a DNA-binding protein homologous to metastasis-associated protein), RPD3 (the amphibian homologue of histone deacetylases HDAC1 and HDAC2), RbAp48 (a protein which interacts with histone H4), and MBD3 (a protein containing a methylated CpG binding domain).
- the amphibian complex additionally contains a serine- and proline-rich subunit, p66.
- Activities of the amphibian Mi-2 complex include a nucleosome-dependent ATPase that is not stimulated by free histones or DNA, translational movement of histone octamers relative to DNA, and deacetylation of core histones within a nucleosome. Guschin et al. (2000) Biochemistry 39:5238-5245. Inasmuch as RbAp48 appears to comprise a key structural component of the Mi-2 complex, it is particularly suitable for fusion with a DNA-binding domain for use in the methods disclosed herein.
- Human NRD complexes contain, in addition to Mi-2, homologues of amphibian Mtal-like (MTA-2), RPD3 (HDAC1 and HDAC2), RbAp48 and MBD3, as well as additional proteins.
- MTA-2 amphibian Mtal-like
- RPD3 HDAC1 and HDAC2
- RbAp48 RbAp48 and MBD3
- additional proteins See Zhang et al. (1999) Genes Dev. 13:1924-1935; and Kornberg et al. (1999) Curr. Opin. Genet. Dev. 9:148-151.
- the methyl-binding-domain protein MBD3 is a component of Mi-2-containing chromatin remodeling complexes.
- MBD3 and related methyl binding domain proteins recognize and bind to CpG dinucleotide sequences in which the C residue is methylated.
- MBD proteins are capable of recruiting histone deacetylases to regions of chromatin rich in methylated CpG.
- a MBD protein can comprise a component of a chromatin remodeling complex.
- Proteins containing a methyl-binding domain include, but are not limited to, MBD1, MBD2, MBD3, MBD4, MeCP1 and MeCP2. See, for example, Bird et al. (1999) Cell 99:451-454.
- DNA methyl transferases which methylate the 5-position of C residues in CpG dinucleotides such as, for example, DNMT1, DNMT2, DNMT3a and DNMT3b, can be used as components of a chromatin remodeling complex.
- each cell type contains a multiplicity of chromatin remodeling complexes which can share certain common subunits, and that the composition of a chromatin remodeling complex can vary with cell type.
- the number of polypeptide subunits in a chromatin remodeling complex varies over a wide range, from two in the ISW2 and RSF complexes to over 15 in the yeast RSC complex. It also appears to be the case that different chromatin remodeling complexes can have partially overlapping activities (i.e., that a degree of functional redundancy exists among different chromatin remodeling complexes).
- the present disclosure is therefore intended to embrace any and all polypeptides present in any type of chromatin remodeling complex, currently known or to be discovered.
- chromatin remodeling complexes In the process of gene activation, binding of chromatin remodeling complexes to chromatin generally precedes binding of histone acetyl transferase (HAT) and/or histone deacetylase (HDAC) complexes, suggesting that HAT and HDAC complexes are recruited by the chromatin remodeling complex, or that remodeled chromatin is more conducive to binding of HAT and HDAC complexes.
- HAT histone acetyl transferase
- HDAC histone deacetylase
- chromatin modification facilitates covalent modification of nucleosomal histones by acetylation or deacetylation.
- Histone acetylation is generally correlated with transcriptional activation; while deacetylation of histones is generally associated with transcriptional repression.
- HAT enzymes including budding yeast Gcn5p, which is required for expression of a subset of the yeast genome, its mammalian orthologue CREB-binding protein (CBP), p300 (both of the latter two used as coactivators by a wide variety of mammalian transcription factors), TAF II 250 (a component of the basal transcriptional machinery), and steroid receptor coactivator 1 (SRC-1), which potentiates transcriptional activation by a number of nuclear hormone receptors.
- CBP mammalian orthologue CREB-binding protein
- p300 both of the latter two used as coactivators by a wide variety of mammalian transcription factors
- TAF II 250 a component of the basal transcriptional machinery
- SRC-1 steroid receptor coactivator 1
- HDAC1 Two major classes of functionally distinct HDACs have been identified in higher eukaryotes.
- Class I includes HDAC1, HDAC2 and HDAC3, which are homologous to the yeast Rpd3 histone deacetylase.
- Class II includes HDAC4, HDAC5 and HDAC6; and are homologous to the yeast Hda1 histone deacetylase. Ng et al., supra.
- a ZFP DNA-binding domain is fused to a histone acetyl transferase or to a histone deacetylase, to effect chromatin modification in the form of covalent modification (acetylation or deacetylation) of histones.
- modification of chromatin by a chromatin remodeling complex is followed by binding of a ZFP-HAT fusion or a ZFP-HDAC fusion, to establish an active or inactive chromatin state, respectively.
- a fusion between a DNA-binding domain and a protein that is a component of a HAT- or HDAC-containing complex is provided.
- HAT- and HDAC-containing complexes, and their component polypeptide subunits have been described. See, for example, Grunstein (1997) Nature 389:349-352; Hartzog et al. (1997) Curr. Opin. Genet. Devel. 7:192-198; Kadonaga (1998) Cell 92:307-313; Kuo et al.
- HAT-containing complexes in yeast there are several HAT-containing complexes in yeast, one of which is the SAGA complex (Spt-Ada-Gcn5-acetyltransferase). Grant et al. (1997) Genes Devel. 11:1640-1650; Ikeda et al. (1999) Mol. Cell. Biol. 19:855-863.
- HDAC-containing complexes include the Sin3 complex, which is conserved in organisms from yeast to mammals.
- the components of the yeast Sin3 complex include Sin3p, RPD3 (a histone deacetylase), RbAp48, and RbAp46.
- the components of the mammalian Sin3 complex include mSin3A, mSin3B, HDAC1, HDAC2, RbAp48, RbAp49, SAP30 and SAP18.
- Sin3 proteins from yeast, Drosophila, and vertebrates contain a PAH (paired amphipathic helices) domain, comprising four conserved repeats which form two amphipathic helices separated by a flexible linker.
- HDAC1, HDAC2 and RPD3 are histone deacetylases.
- the RbAp48 and RbAP49 proteins interact with histones.
- SAP30 and SAP18 are specificity determinants.
- Mi-2 complex Another HDAC-containing complex (which also possess chromatin remodeling activity, see supra) is the Mi-2 complex.
- the mammalian Mi-2 complex (also known as NuRD) comprises the following polypeptides: Mi-2 (also known as CHD), HDAC1, HDAC2, MTA-2 and MBD3. See, for example, Ahringer (2000) Trends Genet. 16:351-356.
- the amphibian Mi-2 complex comprises Mi-2, Mta1-like (homologous to mammalian MTA2), p66, RbAp48, RPD3 and MBD3. Guschin et al. (2000) Biochemistry 39:5238-5245.
- Coactivators and corepressors which associate with the Sin3 complex to aid in targeting and in its interaction with receptors and other transcriptional regulatory proteins have been described. Examples include, but are not limited to, the vertebrate N-CoR, Rb and SMRT proteins and their homologues, as well as the Drosophila SMRTER and Groucho proteins and their homologues. For the purposes of the present disclosure, such coactivators and corepressors are considered to be components of chromatin remodeling complexes, inasmuch as they are capable of targeting various types of chromatin modification, if fused to a DNA-binding domain.
- TR thyroid hormone receptor
- T3 thyroid hormone
- T3 thyroid hormone
- SMRT and NCoR proteins that interact directly with the receptor, as well as Sin3, which interacts with SMRT/NCoR.
- Sin3 also interacts with a number of histone deacetylases, for example, HDACs 1 through 8 (some of which may also interact directly with TR).
- HDACs 1 through 8 some of which may also interact directly with TR.
- Recruitment of histone deacetylases by DNA-bound TR is believed to play a major role in its ability to confer repression; however, it is also possible that repressive factors other than HDACs are recruited by TR.
- Binding of ligand to DNA-bound TR results in the decay of the repressive complex associated with the TR and recruitment of activating factors to the DNA-bound, ligand-bound TR.
- activating factors include, but are not limited to, the histone acetyltransferases SRC-1, CBP/p300 and P/CAF.
- Oligomeric activation complexes can also be recruited by ligand-bound TR, such as, for example, DRIP and ARC. Rachez et al. (1999) Nature 398:824-827; and Naar et al. (1999) Nature 398:828-832.
- GR glucocorticoid receptor
- TR and related nuclear receptors are modular proteins comprising an amino-terminal region (of undefined function), a central DNA binding domain and a carboxy-terminal ligand binding domain (LBD).
- the LBD in addition to binding hormone, is responsible for interactions with both the repressive and activating factors described above.
- Gal4 heterologous DNA binding domain
- T3-dependent activation of transcription can be achieved using a fusion of the TR LBD with the Gal4 DNA-binding domain Tone et al. (1994) J. Biol. Chem. 269:31,157-31,161.
- a mutant nuclear hormone receptor LBD derived, for example, from TR or GR can be used as a component of a fusion with a DNA-binding domain, to recruit activating or repressing protein complexes to a region of interest in cellular chromatin.
- Certain naturally-occurring mutant LBDs are available; and new mutants can be constructed by methods well-known to those of skill in the art.
- the site of action of such complexes is determined by the specificity of the DNA-binding domain; while their activity is determined by the nature of the mutation to the LBD and is independent of ligand concentration.
- a fusion comprising a LBD that has been mutated such that it is unable to bind hormone will facilitate formation of repressive complexes; while a fusion molecule comprising a LBD mutation that changes the conformation of the LBD such that it resembles a ligand-bound LBD will stimulate the formation of complexes that facilitate transcriptional activation.
- a mutant nuclear hormone receptor LBD can be considered a component of a chromatin remodeling complex.
- the methods and compositions disclosed herein include fusion molecules comprising a DNA-binding domain and a component of a chromatin remodeling complex.
- the component of a chromatin remodeling complex can be either an enzymatic component or a non-enzymatic component.
- a fusion molecule comprising an enzymatic component will result in modification of a more limited region of cellular chromatin, compared to a fusion molecule comprising a non-enzymatic component. This is because, when the enzymatic component is directly fused to a DNA-binding domain, its activity is regionally restricted to the vicinity of the target site of the DNA-binding domain.
- Fusion molecules are constructed by methods of cloning and biochemical conjugation that are well-known to those of skill in the art. Fusion molecules comprise a DNA-binding domain and a component of a chromatin remodeling complex or a functional fragment thereof. Fusion molecules also optionally comprise nuclear localization signals (such as, for example, that from the SV40 medium T-antigen) and epitope tags (such as, for example, FLAG and hemagglutinin). Fusion proteins (and nucleic acids encoding them) are designed such that the translational reading frame is preserved among the components of the fusion. See Examples 2 and 4, infra for additional details on the construction of fusion molecules.
- Fusions between a polypeptide component of a chromatin remodeling complex (or a functional fragment thereof) on the one hand, and a non-protein DNA-binding domain (e.g., antibiotic, intercalator, minor groove binder, nucleic acid) on the other, are constructed by methods of biochemical conjugation known to those of skill in the art. See, for example, the Pierce Chemical Company (Rockford, Ill.) Catalogue. Methods and compositions for making fusions between a minor groove binder and a polypeptide have been described. Mapp et al. (2000) Proc. Natl. Acad. Sci. USA 97:3930-3935.
- a fusion between a polypeptide DNA-binding domain and a component of a chromatin remodeling complex is encoded by a fusion nucleic acid.
- the nucleic acid can be cloned into intermediate vectors for transformation into prokaryotic or eukaryotic cells for replication and/or expression.
- Intermediate vectors for storage or manipulation of the fusion nucleic acid or production of fusion protein can be prokaryotic vectors, (e.g., plasmids), shuttle vectors, insect vectors, or viral vectors for example.
- a fusion nucleic acid can also cloned into an expression vector, for administration to a bacterial cell, fungal cell, protozoal cell, plant cell, or animal cell, preferably a mammalian cell, more preferably a human cell.
- a cloned fusion nucleic acid it is typically subcloned into an expression vector that contains a promoter to direct transcription.
- Suitable bacterial and eukaryotic promoters are well known in the art and described, e.g., in Sambrook et al., supra; Ausubel et al., supra; and Kriegler, Gene Transfer and Expression: A Laboratory Manual (1990).
- Bacterial expression systems are available in, e.g., E. coli, Bacillus sp., and Salmonella. Palva et al. (1983) Gene 22:229-235. Kits for such expression systems are commercially available.
- Eukaryotic expression systems for mammalian cells, yeast, and insect cells are well known in the art and are also commercially available, for example, from Invitrogen, Carlsbad, Calif. and Clontech, Palo Alto, Calif.
- the promoter used to direct expression of a fusion nucleic acid depends on the particular application. For example, a strong constitutive promoter is typically used for expression and purification of a fusion protein. In contrast, when a fusion protein is used in vivo, either a constitutive or an inducible promoter is used, depending on the particular use of the fusion protein. In addition, a weak promoter can be used, such as HSV TK or a promoter having similar activity.
- the promoter typically can also include elements that are responsive to transactivation, e.g., hypoxia response elements, Gal4 response elements, lac repressor response element, and small molecule control systems such as tet-regulated systems and the RU-486 system.
- an expression vector typically contains a transcription unit or expression cassette that contains additional elements required for the expression of the nucleic acid in host cells, either prokaryotic or eukaryotic.
- a typical expression cassette thus contains a promoter operably linked, e.g., to the fusion nucleic acid sequence, and signals required, e.g., for efficient polyadenylation of the transcript, transcriptional termination, ribosome binding, and/or translation termination. Additional elements of the cassette may include, e.g., enhancers, and heterologous spliced intronic signals.
- the particular expression vector used to transport the genetic information into the cell is selected with regard to the intended use of the fusion polypeptide, e.g., expression in plants, animals, bacteria, fungi, protozoa etc.
- Standard bacterial expression vectors include plasmids such as pBR322, pBR322-based plasmids, pSKF, pET23D, and commercially available fusion expression systems such as GST and LacZ.
- Epitope tags can also be added to recombinant proteins to provide convenient methods of isolation, for monitoring expression, and for monitoring cellular and subcellular localization, e.g., c-myc or FLAG.
- Expression vectors containing regulatory elements from eukaryotic viruses are often used in eukaryotic expression vectors, e.g., SV40 vectors, papilloma virus vectors, and vectors derived from Epstein-Barr virus.
- exemplary eukaryotic vectors include pMSG, pAV009/A+, pMTO10/A+, pMAMneo-5, baculovirus pDSVE, and any other allowing expression of proteins under the direction of the SV40 early promoter, SV40 late promoter, metallothionein promoter, murine mammary tumor virus promoter, Rous sarcoma virus promoter, polyhedrin promoter, or other promoters shown effective for expression in eukaryotic cells.
- Some expression systems have markers for selection of stably transfected cell lines such as thymidine kinase, hygromycin B phosphotransferase, and dihydrofolate reductase.
- High-yield expression systems are also suitable, such as baculovirus vectors in insect cells, with a fusion nucleic acid sequence under the transcriptional control of the polyhedrin promoter or any other strong baculovirus promoter.
- Standard transfection methods can be used to produce bacterial, mammalian, yeast, insect, or other cell lines that express large quantities of fusion protein, which can be purified, if desired, using standard techniques. See, e.g., Colley et al. (1989) J. Biol. Chem. 264:17619-17622; and Guide to Protein Purification, in Methods in Enzymology, vol. 182 (Deutscher, ed.) 1990. Transformation of eukaryotic and prokaryotic cells are performed according to standard techniques. See, e.g., Morrison (1977) J. Bacteriol. 132:349-351; Clark-Curtiss et al. (1983) in Methods in Enzymology 101:347-362 (Wu et al., eds).
- Any procedure for introducing foreign nucleotide sequences into host cells can be used. These include, but are not limited to, the use of calcium phosphate transfection, DEAE-dextran-mediated transfection, polybrene, protoplast fusion, electroporation, lipid-mediated delivery (e.g., liposomes), microinjection, particle bombardment, introduction of naked DNA, plasmid vectors, viral vectors (both episomal and integrative) and any of the other well known methods for introducing cloned genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see, e.g., Sambrook et al., supra). It is only necessary that the particular genetic engineering procedure used be capable of successfully introducing at least one gene into the host cell capable of expressing the protein of choice.
- Non-viral vector delivery systems include DNA plasmids, naked nucleic acid, and nucleic acid complexed with a delivery vehicle such as a liposome.
- Viral vector delivery systems include DNA and RNA viruses, which have either episomal or integrated genomes after delivery to the cell. For reviews of gene therapy procedures, see, for example, Anderson (1992) Science 256:808-813; Nabel et al.
- Methods of non-viral delivery of nucleic acids include lipofection, microinjection, ballistics, virosomes, liposomes, immunoliposomes, polycation or lipid:nucleic acid conjugates, naked DNA, artificial virions, and agent-enhanced uptake of DNA.
- Lipofection is described in, e.g., U.S. Pat. Nos. 5,049,386; 4,946,787; and 4,897,355 and lipofection reagents are sold commercially (e.g., TransfectamTM and LipofectinTM).
- Cationic and neutral lipids that are suitable for efficient receptor-recognition lipofection of polynucleotides include those of Felgner, WO 91/17424 and WO 91/16024. Nucleic acid can be delivered to cells (ex vivo administration) or to target tissues (in vivo administration).
- lipid:nucleic acid complexes including targeted liposomes such as immunolipid complexes
- RNA or DNA virus-based systems for the delivery of nucleic acids take advantage of highly evolved processes for targeting a virus to specific cells in the body and trafficking the viral payload to the nucleus.
- Viral vectors can be administered directly to patients (in vivo) or they can be used to treat cells in vitro, wherein the modified cells are administered to patients (ex vivo).
- Conventional viral based systems for the delivery of ZFPs include retroviral, lentiviral, poxviral, adenoviral, adeno-associated viral, vesicular stomatitis viral and herpesviral vectors.
- Lentiviral vectors are retroviral vector that are able to transduce or infect non-dividing cells and typically produce high viral titers. Selection of a retroviral gene transfer system would therefore depend on the target tissue.
- Retroviral vectors have a packaging capacity of up to 6-10 kb of foreign sequence and are comprised of cis-acting long terminal repeats (LTRs). The minimum cis-acting LTRs are sufficient for replication and packaging of the vectors, which are then used to integrate the therapeutic gene into the target cell to provide permanent transgene expression.
- LTRs long terminal repeats
- Widely used retroviral vectors include those based upon murine leukemia virus (MuLV), gibbon ape leukemia virus (GaLV), simian immunodeficiency virus (SIV), human immunodeficiency virus (HIV), and combinations thereof.
- MiLV murine leukemia virus
- GaLV gibbon ape leukemia virus
- SIV simian immunodeficiency virus
- HAV human immunodeficiency virus
- Adeno-associated virus (AAV) vectors are also used to transduce cells with target nucleic acids, e.g., in the in vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy procedures. See, e.g., West et al. (1987) Virology 160:38-47; U.S. Pat. No. 4,797,368; WO 93/24641; Kotin (1994) Hum. Gene Ther. 5:793-801; and Muzyczka (1994) J. Clin. Invest. 94:1351. Construction of recombinant AAV vectors are described in a number of publications, including U.S. Pat. No.
- AAV-2 parvovirus adeno-associated virus type 2
- Exemplary AAV vectors are derived from a plasmid containing the AAV 145 bp inverted terminal repeats flanking a transgene expression cassette. Efficient gene transfer and stable transgene delivery due to integration into the genomes of the transduced cell are key features for this vector system.
- Wagner et al. (1998) Lancet 351 (9117):1702-3; and Kearns et al. (1996) Gene Ther. 9:748-55.
- pLASN and MFG-S are examples are retroviral vectors that have been used in clinical trials.
- PA317/pLASN was the first therapeutic vector used in a gene therapy trial. (Blaese et al. (1995) Science 270:475-480. Transduction efficiencies of 50% or greater have been observed for MFG-S packaged vectors. Ellem et al. (1997) Immunol Immunother. 44(1):10-20; Dranoff et al. (1997) Hum. Gene Ther. 1:111-2.
- Adenoviral-based systems are useful.
- Adenoviral based vectors are capable of very high transduction efficiency in many cell types and are capable of infecting, and hence delivering nucleic acid to, both dividing and non-dividing cells. With such vectors, high titers and levels of expression have been obtained.
- Adenovirus vectors can be produced in large quantities in a relatively simple system.
- Ad Replication-deficient recombinant adenoviral
- Ad can be produced at high titer and they readily infect a number of different cell types.
- Most adenovirus vectors are engineered such that a transgene replaces the Ad E1a, E1b, and/or E3 genes; the replication defector vector is propagated in human 293 cells that supply the required E1 functions in trans.
- Ad vectors can transduce multiple types of tissues in vivo, including non-dividing, differentiated cells such as those found in the liver, kidney and muscle. Conventional Ad vectors have a large carrying capacity for inserted DNA.
- Ad vector An example of the use of an Ad vector in a clinical trial involved polynucleotide therapy for antitumor immunization with intramuscular injection. Sterman et al. (1998) Hum. Gene Ther. 7:1083-1089. Additional examples of the use of adenovirus vectors for gene transfer in clinical trials include Rosenecker et al. (1996) Infection 24:5-10; Sterman et al., supra; Welsh et al. (1995) Hum. Gene Ther. 2:205-218; Alvarez et al. (1997) Hum. Gene Ther. 5:597-613; and Topf et al. (1998) Gene Ther. 5:507-513.
- Packaging cells are used to form virus particles that are capable of infecting a host cell. Such cells include 293 cells, which package adenovirus, and ⁇ 2 cells or PA317 cells, which package retroviruses.
- Viral vectors used in gene therapy are usually generated by a producer cell line that packages a nucleic acid vector into a viral particle. The vectors typically contain the minimal viral sequences required for packaging and subsequent integration into a host, other viral sequences being replaced by an expression cassette for the protein to be expressed. Missing viral functions are supplied in trans, if necessary, by the packaging cell line. For example, AAV vectors used in gene therapy typically only possess ITR sequences from the AAV genome, which are required for packaging and integration into the host genome.
- Viral DNA is packaged in a cell line, which contains a helper plasmid encoding the other AAV genes, namely rep and cap, but lacking ITR sequences.
- the cell line is also infected with adenovirus as a helper.
- the helper virus promotes replication of the AAV vector and expression of AAV genes from the helper plasmid.
- the helper plasmid is not packaged in significant amounts due to a lack of ITR sequences. Contamination with adenovirus can be reduced by, e.g., heat treatment, which preferentially inactivates adenoviruses.
- a viral vector can be modified to have specificity for a given cell type by expressing a ligand as a fusion protein with a viral coat protein on the outer surface of the virus.
- the ligand is chosen to have affinity for a receptor known to be present on the cell type of interest.
- Han et al. (1995) Proc. Natl. Acad. Sci. USA 92:9747-9751 reported that Moloney murine leukemia virus can be modified to express human heregulin fused to gp70, and the recombinant virus infects certain human breast cancer cells expressing human epidermal growth factor receptor.
- filamentous phage can be engineered to display antibody fragments (e.g., F ab or F v ) having specific binding affinity for virtually any chosen cellular receptor.
- F ab or F v antibody fragments
- non-viral vectors Such vectors can be engineered to contain specific uptake sequences thought to favor uptake by specific target cells.
- Gene therapy vectors can be delivered in vivo by administration to an individual patient, typically by systemic administration (e.g., intravenous, intraperitoneal, intramuscular, subdermal, or intracranial infusion) or topical application, as described infra.
- vectors can be delivered to cells ex vivo, such as cells explanted from an individual patient (e.g., lymphocytes, bone marrow aspirates, tissue biopsy) or universal donor hematopoietic stem cells, followed by reimplantation of the cells into a patient, usually after selection for cells which have incorporated the vector.
- Ex vivo cell transfection for diagnostics, research, or for gene therapy is well known to those of skill in the art.
- cells are isolated from the subject organism, transfected with a nucleic acid (gene or cDNA), and re-infused back into the subject organism (e.g., patient).
- a nucleic acid gene or cDNA
- Various cell types suitable for ex vivo transfection are well known to those of skill in the art. See, e.g., Freshney et al., Culture of Animal Cells, A Manual of Basic Technique, 3rd ed., 1994, and references cited therein, for a discussion of isolation and culture of cells from patients.
- hematopoietic stem cells are used in ex vivo procedures for cell transfection and gene therapy.
- the advantage to using stem cells is that they can be differentiated into other cell types in vitro, or can be introduced into a mammal (such as the donor of the cells) where they will engraft in the bone marrow.
- Methods for differentiating CD34+ stem cells in vitro into clinically important immune cell types using cytokines such a GM-CSF, IFN- ⁇ and TNF- ⁇ are known. Inaba et al. (1992) J. Exp. Med. 176:1693-1702.
- Stem cells are isolated for transduction and differentiation using known methods. For example, stem cells are isolated from bone marrow cells by panning the bone marrow cells with antibodies which bind unwanted cells, such as CD4+ and CD8+ (T cells), CD45+ (panB cells), GR-1 (granulocytes), and Iad (differentiated antigen presenting cells). See Inaba et al., supra.
- Vectors e.g., retroviruses, adenoviruses, liposomes, etc.
- therapeutic nucleic acids can be also administered directly to the organism for transduction of cells in vivo.
- naked DNA can be administered.
- Administration is by any of the routes normally used for introducing a molecule into ultimate contact with blood or tissue cells. Suitable methods of administering such nucleic acids are available and well known to those of skill in the art, and, although more than one route can be used to administer a particular composition, a particular route can often provide a more immediate and more effective reaction than another route.
- compositions of the present invention are determined in part by the particular composition being administered, as well as by the particular method used to administer the composition. Accordingly, there is a wide variety of suitable formulations of pharmaceutical compositions of the present invention, as described below. See, e.g., Remington's Pharmaceutical Sciences, 17th ed., 1989.
- one or more polypeptides comprising a fusion between a DNA-binding domain and a component of a chromatin remodeling complex, can be introduced into a cell.
- An important factor in the administration of polypeptide compounds is ensuring that the polypeptide has the ability to traverse the plasma membrane of a cell, or the membrane of an intra-cellular compartment such as the nucleus.
- Cellular membranes are composed of lipid-protein bilayers that are freely permeable to small, nonionic lipophilic compounds and are inherently impermeable to polar compounds, macromolecules, and therapeutic or diagnostic agents.
- proteins, lipids and other compounds which have the ability to translocate polypeptides across a cell membrane, have been described.
- membrane translocation polypeptides have amphiphilic or hydrophobic amino acid subsequences that have the ability to act as membrane-translocating carriers.
- homeodomain proteins have the ability to translocate across cell membranes.
- the shortest internalizable peptide of a homeodomain protein, Antennapedia was found to be the third helix of the protein, from amino acid position 43 to 58. Prochiantz (1996) Curr. Opin. Neurobiol. 6:629-634.
- Another subsequence, the h (hydrophobic) domain of signal peptides was found to have similar cell membrane translocation characteristics. Lin et al. (1995) J. Biol. Chem. 270:14255-14258.
- Examples of peptide sequences which can be linked to a fusion polypeptide for facilitating its uptake into cells include, but are not limited to: an 11 amino acid peptide of the tat protein of HIV; a 20 residue peptide sequence which corresponds to amino acids 84-103 of the p16 protein (see Fahraeus et al. (1996) Curr. Biol. 6:84); the third helix of the 60-amino acid long homeodomain of Antennapedia (Derossi et al. (1994) J. Biol. Chem.
- K-FGF Kaposi fibroblast growth factor
- VP22 translocation domain from HSV (Elliot et al. (1997) Cell 88:223-233).
- Other suitable chemical moieties that provide enhanced cellular uptake can also be linked, either covalently or non-covalently, to fusion polypeptides.
- Toxin molecules also have the ability to transport polypeptides across cell membranes. Often, such molecules (called “binary toxins”) are composed of at least two parts: a translocation or binding domain and a separate toxin domain. Typically, the translocation domain, which can optionally be a polypeptide, binds to a cellular receptor, facilitating transport of the toxin into the cell.
- Clostridium perfringens iota toxin diphtheria toxin (DT), Pseudomonas exotoxin A (PE), pertussis toxin (PT), Bacillus anthracis toxin, and pertussis adenylate cyclase (CYA) have been used to deliver peptides to the cell cytosol as internal or amino-terminal fusions.
- DT diphtheria toxin
- PE Pseudomonas exotoxin A
- PE pertussis toxin
- PT Bacillus anthracis toxin
- CYA pertussis adenylate cyclase
- Such subsequences can be used to translocate polypeptides, including fusion polypeptides as disclosed herein, across a cell membrane. This is accomplished, for example, by derivatizing the fusion polypeptide with one of these translocation sequences, or by forming an additional fusion of the translocation sequence with the fusion polypeptide.
- a linker can be used to link the fusion polypeptide and the translocation sequence. Any suitable linker can be used, e.g., a peptide linker.
- a fusion polypeptide can also be introduced into an animal cell, preferably a mammalian cell, via liposomes and liposome derivatives such as immunoliposomes.
- liposome refers to vesicles comprised of one or more concentrically ordered lipid bilayers, which encapsulate an aqueous phase.
- the aqueous phase typically contains the compound to be delivered to the cell.
- the liposome fuses with the plasma membrane, thereby releasing the compound into the cytosol.
- the liposome is phagocytosed or taken up by the cell in a transport vesicle. Once in the endosome or phagosome, the liposome is either degraded or it fuses with the membrane of the transport vesicle and releases its contents.
- the liposome In current methods of drug delivery via liposomes, the liposome ultimately becomes permeable and releases the encapsulated compound at the target tissue or cell. For systemic or tissue specific delivery, this can be accomplished, for example, in a passive manner wherein the liposome bilayer is degraded over time through the action of various agents in the body. Alternatively, active drug release involves using an agent to induce a permeability change in the liposome vesicle. Liposome membranes can be constructed so that they become destabilized when the environment becomes acidic near the liposome membrane. See, e.g., Proc. Natl. Acad. Sci. USA 84:7851 (1987); Biochemistry 28:908 (1989).
- DOPE Dioleoylphosphatidylethanolamine
- liposomes typically comprise a fusion polypeptide as disclosed herein, a lipid component, e.g., a neutral and/or cationic lipid, and optionally include a receptor-recognition molecule such as an antibody that binds to a predetermined cell surface receptor or ligand (e.g., an antigen).
- a lipid component e.g., a neutral and/or cationic lipid
- a receptor-recognition molecule such as an antibody that binds to a predetermined cell surface receptor or ligand (e.g., an antigen).
- Suitable methods include, for example, sonication, extrusion, high pressure/homogenization, microfluidization, detergent dialysis, calcium-induced fusion of small liposome vesicles and ether-fusion methods, all of which are well known in the art.
- a liposome it may be desirable to target a liposome using targeting moieties that are specific to a particular cell type, tissue, and the like.
- targeting moieties e.g., ligands, receptors, and monoclonal antibodies
- targeting moieties include monoclonal antibodies specific to antigens associated with neoplasms, such as prostate cancer specific antigen and MAGE. Tumors can also be diagnosed by detecting gene products resulting from the activation or over-expression of oncogenes, such as ras or c-erbB2. In addition, many tumors express antigens normally expressed by fetal tissue, such as the alphafetoprotein (AFP) and carcinoembryonic antigen (CEA).
- AFP alphafetoprotein
- CEA carcinoembryonic antigen
- Sites of viral infection can be diagnosed using various viral antigens such as hepatitis B core and surface antigens (HBVc, HBVs) hepatitis C antigens, Epstein-Barr virus antigens, human immunodeficiency type-1 virus (HIV-1) and papilloma virus antigens.
- Inflammation can be detected using molecules specifically recognized by surface molecules which are expressed at sites of inflammation such as integrins (e.g., VCAM-1), selectin receptors (e.g., ELAM-1) and the like.
- Standard methods for coupling targeting agents to liposomes are used. These methods generally involve the incorporation into liposomes of lipid components, e.g., phosphatidylethanolamine, which can be activated for attachment of targeting agents, or incorporation of derivatized lipophilic compounds, such as lipid derivatized bleomycin.
- Antibody targeted liposomes can be constructed using, for instance, liposomes which incorporate protein A. See, Renneisen et al. (1990) J. Biol. Chem. 265:16337-16342 and Leonetti et al. (1990) Proc. Natl. Acad. Sci. USA 87:2448-2451.
- Fusion polypeptides as disclosed herein, and expression vectors encoding fusion polypeptides can be used in conjunction with various methods of gene therapy to facilitate the action of a therapeutic gene product.
- a fusion polypeptide can be administered directly to a patient, e.g., to facilitate the modulation of gene expression and for therapeutic or prophylactic applications, for example, cancer, ischemia, diabetic retinopathy, macular degeneration, rheumatoid arthritis, psoriasis, HIV infection, sickle cell anemia, Alzheimer's disease, muscular dystrophy, neurodegenerative diseases, vascular disease, cystic fibrosis, stroke, and the like.
- microorganisms whose inhibition can be facilitated through use of the methods and compositions disclosed herein include pathogenic bacteria, e.g., Chlamydia, Rickettsial bacteria, Mycobacteria, Staphylococci, Streptococci, Pneumococci, Meningococci and Conococci, Klebsiella, Proteus, Serratia, Pseudomonas, Legionella, Diphtheria, Salmonella, Bacilli (e.g., anthrax), Vibrio (e.g., cholera), Clostridium (e.g., tetanus, botulism), Yersinia (e.g., plague), Leptospirosis, and Borrellia (e.g., Lyme disease bacteria); infectious fungus, e.g., Aspergillus, Candida species; protozoa such as sporozoa (e.g., Plasmodia), rhizopods,
- fusion polypeptide or a nucleic acid encoding a fusion polypeptide is by any of the routes normally used for introducing polypeptides or nucleic acids into ultimate contact with the tissue to be treated.
- the fusion polypeptides or nucleic acids are administered in any suitable manner, preferably with pharmaceutically acceptable carriers. Suitable methods of administering such modulators are available and well known to those of skill in the art, and, although more than one route can be used to administer a particular composition, a particular route can often provide a more immediate and more effective reaction than another route.
- compositions are determined in part by the particular composition being administered, as well as by the particular method used to administer the composition. Accordingly, there is a wide variety of suitable formulations of pharmaceutical compositions. See, e.g., Remington's Pharmaceutical Sciences, 17 th ed. 1985.
- Fusion polypeptides or nucleic acids can be made into aerosol formulations (i.e., they can be “nebulized”) to be administered via inhalation. Aerosol formulations can be placed into pressurized acceptable propellants, such as dichlorodifluoromethane, propane, nitrogen, and the like.
- Formulations suitable for parenteral administration include aqueous and non-aqueous, isotonic sterile injection solutions, which can contain antioxidants, buffers, bacteriostats, and solutes that render the formulation isotonic with the blood of the intended recipient, and aqueous and non-aqueous sterile suspensions that can include suspending agents, solubilizers, thickening agents, stabilizers, and preservatives.
- Compositions can be administered, for example, by intravenous infusion, orally, topically, intraperitoneally, intravesically or intrathecally.
- formulations of compounds can be presented in unit-dose or multi-dose sealed containers, such as ampoules and vials.
- Injection solutions and suspensions can be prepared from sterile powders, granules, and tablets of the kind known to those of skill in the art.
- chromatin remodeling complexes Numerous activities of chromatin remodeling complexes have been described, including but not limited to the following.
- a characteristic activity of all chromatin remodeling complexes is nucleosome- or DNA-dependent ATPase activity.
- Chromatin remodeling complexes can facilitate binding of transcription factors to genes in a chromatin context and facilitate accessibility of sequences in chromatin to restriction enzymes and other nucleases.
- Certain remodeling complexes (those containing the ISWI ATPase) also possess the ability to assemble periodic nucleosome arrays (i.e. they are capable of spacing nucleosomes). Changes in DNA topology (i.
- Chromatin remodeling complexes are also capable of transferring histones from chromatin to either DNA or protein acceptors. Stimulation of transcription initiation can also result from the action of chromatin remodeling complexes.
- the mechanism underlying all the above-mentioned activities may be the ability of chromatin remodeling complexes to promote nucleosome sliding or, more basically, to destabilize the histone-DNA interaction. Accordingly, any protein or multiprotein complex capable of destabilizing histone-DNA interactions and/or promoting nucleosome movement is suitable for use as a component of a chromatin remodeling complex.
- chromatin remodeling complexes can be assayed by a number of techniques, as are known to those of skill in the art, and as have been described in publications disclosing the isolation and characterization of the various chromatin remodeling complexes, as set forth supra. See also Imblazano et al. (1994) Nature 370:481-485 and Cote et al. (1993) Science 265:53-60 for descriptions of assays involving facilitation of transcription factor binding. Assays involving nucleosome repositioning are described by, for example, Hamiche et al. (1999) Cell 97:833-842 and Guschin et al. (2000) Biochemistry 39:5238-5245.
- An additional assay for chromatin modification is modulation of gene expression, when the modification is part of a two-step process in which chromatin modification allows binding of a molecule which modulates gene expression (e.g., a polypeptide comprising a fusion between a zinc finger DNA-binding domain and a transcriptional regulatory domain).
- a molecule which modulates gene expression e.g., a polypeptide comprising a fusion between a zinc finger DNA-binding domain and a transcriptional regulatory domain.
- Assays for gene modulation e.g., transcriptional activation and/or repression, reporter gene activity, measurement of protein levels
- transcriptional activation and/or repression, reporter gene activity, measurement of protein levels are well-known to those of skill in the art and are described, for example, in co-owned WO 00/41566.
- compositions and methods disclosed herein can be used to facilitate and/or modulate a number of processes involving cellular chromatin. These processes include, but are not limited to, transcription, replication, recombination, repair, integration, maintenance of telomeres, and processes involved in chromosome stability and disjunction. Accordingly, the methods and compositions disclosed herein can be used to affect any of these processes, as well as any other process which can be influenced by chromatin structure such as, for example, detection of specific sequences or sequence variants in cellular chromatin.
- chromatin modification facilitates access of one or more transcriptional regulatory factors, either endogenous or exogenous, to a target site in cellular chromatin, thereby participating in modulation of gene expression.
- Modulation of gene expression can include either increases or decreases in the level of gene expression.
- chromatin modification increases the efficiency of recombination, thereby facilitating, for example, targeted integration of an exogenous nucleic acid.
- modification of chromatin is used to facilitate the modulation of gene expression.
- Modulation can include gene activation and gene repression, as well as more subtle increases or decreases in the level of gene expression.
- Activation of gene expression can be mediated, for instance, by the activity of a histone acetyl transferase that has been recruited to a region of interest by the methods and compositions disclosed herein.
- Repression of gene expression can be mediated, for instance, by the activity of a histone deacetylase that has been recruited to a region of interest by the methods and compositions disclosed herein.
- chromatin modification could render regulatory sequences more accessible to transcriptional repressors or less accessible to positive transcriptional regulatory factors.
- any gene in any organism can be modulated by chromatin modification as disclosed herein, including therapeutically relevant genes, genes of infecting microorganisms, viral genes, and genes whose expression is modulated in the process of target validation.
- genes include, but are not limited to, vascular endothelial growth factor (VEGF), VEGF receptors flt and flk, CCR-5, low density lipoprotein receptor (LDLR), estrogen receptor, HER-2/neu, BRCA-1, BRCA-2, phosphoenolpyruvate carboxykinase (PEPCK), CYP7, fibrinogen, apolipoprotein A (ApoA), apolipoprotein B (ApoB), renin, phosphoenolpyruvate carboxykinase (PEPCK), CYP7, fibrinogen, nuclear factor ⁇ B (NF- ⁇ B), inhibitor of NF- ⁇ B (I- ⁇ B), tumor necrosis factors (e.g.,
- viral genes include, but are not limited to, hepatitis virus genes such as, for example, HBV-C, HBV-S, HBV-X and HBV-P; and HIV genes such as, for example, tat and rev. Modulation of expression of genes encoding antigens of a pathogenic organism can be achieved using the disclosed methods and compositions.
- Additional genes include those encoding cytokines, lymphokines, interleukins, growth factors, mitogenic factors, apoptotic factors, cytochromes, chemotactic factors, chemokine receptors (e.g., CCR-2, CCR-3, CCR-5, CXCR-4), phospholipases (e.g., phospholipase C), nuclear receptors, retinoid receptors, organellar receptors, hormones, hormone receptors, oncogenes, tumor suppressors, cyclins, cell cycle checkpoint proteins (e.g.,Chk1, Chk2), senescence-associated genes, immunoglobulins, genes encoding heavy metal chelators, protein tyrosine kinases, protein tyrosine phosphatases, tumor necrosis factor receptor-associated factors (e.g., Traf-3, Traf-6), apolipoproteins, thrombic factors, vasoactive factors, neuroreceptors, cell surface receptor
- tissue or cell type such as, for example, brain, muscle, heart, nervous system, circulatory system, reproductive system, genitourinary system, digestive system and respiratory system
- a particular tissue or cell type such as, for example, brain, muscle, heart, nervous system, circulatory system, reproductive system, genitourinary system, digestive system and respiratory system
- chromatin includes any cellular nucleoprotein structure. This can include, but is not limited to chromosomes (i.e., nuclear genomes), episomes, organellar nucleoproteins, such as mitochondrial and chloroplast genomes, and nucleoproteins associated with infecting bacterial or viral genomes. It is known that non-eukaryotic genomes are organized into nucleoprotein structures. In eukaryotic cells, the genome is enclosed in the nucleus. Accordingly, contact of a molecule with cellular chromatin includes introduction of the molecule into the nucleus of a cell.
- Cells include, but are not limited to, prokaryotic, eukaryotic and Archaeal cells.
- Eukaryotic cells include plant, fungal, protozoal and animal cells, including mammalian cells, primate cells and human cells.
- Methods of chromatin modification in a region of interest can also be combined with methods involving binding of endogenous or exogenous transcriptional regulators in the region of interest to achieve modulation of gene expression.
- Modulation of gene expression can be in the form of repression as, for example, when the target gene resides in a pathological infecting microorganism or in an endogenous gene of the subject, such as an oncogene or a viral receptor, that contributes to a disease state.
- modulation can be in the form of activation, if activation of a gene (e.g., a tumor suppressor gene) can ameliorate a disease state.
- an exogenous molecule can be formulated with a pharmaceutically acceptable carrier, as is known to those of skill in the art. See, for example, Remington's Pharmaceutical Sciences, 17 th ed., 1985; and co-owned WO 00/42219.
- certain embodiments include the use of a fusion molecule comprising a DNA-binding domain and a component of a chromatin remodeling complex, to modify chromatin structure in a region of interest, in combination with a second molecule having transcriptional regulatory activity which binds in the region of interest after modification of chromatin structure in the region of interest.
- the second molecule comprises a fusion between a DNA-binding domain and either a transcriptional activation domain or a transcriptional repression domain. Any polypeptide sequence or domain capable of influencing gene expression, which can be fused to a DNA-binding domain, is suitable for use. Activation and repression domains are known to those of skill in the art and are disclosed, for example, in co-owned WO 00/41566.
- Exemplary activation domains include, but are not limited to, VP16, VP64, the p65 subunit of NF-kappa B, ligand-bound thyroid hormone receptor and its functional fragments, p300, CBP, PCAF,SRC1 PvALF, AtHD2A and ERF-2. See, for example, Robyr et al. (2000) Mol. Endocrinol. 14:329-347; Collingwood et al. (1999) J. Mol. Endocrinol. 23:255-275; Leo et al. (2000) Gene 245:1-11; Manteuffel-Cymborowska (1999) Acta Biochim. Pol.
- Additional exemplary activation domains include, but are not limited to, OsGAI, HALF-1, C1, AP1, ARF-5, -6, -7, and -8, CPRF1, CPRF4, MYC-RP/GP, and TRAB1. See, for example, Ogawa et al. (2000) Gene 245:21-29; Okanami et al.
- Exemplary repression domains include, but are not limited to, KRAB, SID, v-erbA, unliganded thyroid hormone receptor and its functional fragments, MBD2, MBD3, members of the DNMT family (e.g., DNMT1, DNMT3A, DNMT3B), Rb, and MeCP2.
- KRAB KRAB
- SID v-erbA
- v-erbA unliganded thyroid hormone receptor and its functional fragments
- MBD2, MBD3, members of the DNMT family e.g., DNMT1, DNMT3A, DNMT3B
- Rb DNMT3B
- MeCP2 MeCP2. See, for example, Bird et al. (1999) Cell 99:451-454; Tyler et al. (1999) Cell 99:443-446; Knoepfler et al. (1999) Cell 99:447-450; and Robertson et al. (2000) Nature Genet. 25:338-3
- Additional exemplary repression domains include, but are not limited to, ROM2 and AtHD2A. See, for example, Chem et al. (1996) Plant Cell 8:305-321; and Wu et al. (2000) Plant J. 22:19-27.
- compositions and methods disclosed herein are useful in a variety of applications and provide advantages over existing methods. These include therapeutic methods in which an exogenous molecule is administered to a subject and used to modulate expression of a target gene within the subject. See, for example, co-pending WO 00/41566.
- the disclosed compositions and methods can also facilitate detection of particular sequences by binding of an exogenous molecule to a binding site in cellular chromatin as in, for example, diagnostic applications. Methods for detection of a target sequence using, for example, a ZFP are described in co-owned WO 00/42219.
- an exogenous molecule such as a sequence-specific DNA binding protein
- a variant allele can be used to detect variant alleles associated with a disease or with a particular phenotype in patient samples and to detect the presence of pathological microorganisms in clinical samples.
- a variant allele comprises a single-nucleotide polymorphism (SNP).
- the sequence-specific DNA binding protein is a ZFP.
- Exogenous molecules can also be used to quantify copy number of a gene in a sample. For example, detection of the loss of one copy of a p53 gene in a clinical sample is an indicator of susceptibility to cancer.
- identification of transgenic plants and animals can be accomplished through detection of a transgene using, for example, binding of a sequence-specific exogenous molecule (such as, for example, a ZFP) as an assay. All of these procedures can be enhanced by recruitment of a chromatin remodeling complex to a region of interest in cellular chromatin to facilitate binding of a binding molecule in the region of interest.
- a sequence-specific exogenous molecule such as, for example, a ZFP
- compositions when used in conjunction with methods of binding of exogenous molecules to cellular chromatin, can be used in assays to determine gene function and to determine changes in phenotype resulting from specific modulation of gene expression. See, for example, co-owned PCT WO 01/19981.
- a zinc finger DNA-binding domain which recognizes the human vascular endothelial growth factor-A (VEGF) gene, was designed and constructed according to design rules and methods disclosed in co-owned WO 00/42219, WO 00/41566, and co-owned U.S. Patent Applications Ser. Nos. 09/444,241 filed Nov. 19, 1999, and 09/535,088 filed Mar. 23, 2000.
- the target site which overlaps the transcription initiation site for the human VEGF-A gene, is shown below as SEQ ID NO: 1, with the arrow indicating the transcription startsite.
- SEQ ID NO: 1 3′-CCCCTCCTA-5′
- SP-1 zinc finger transcription factor was used as backbone for the construction of a designed three-finger DNA binding domain, Veg1, capable of recognizing this sequence.
- SP-1 has a three finger DNA-binding domain related to the well-studied murine zinc finger protein Zif268.
- Site-directed mutagenesis experiments using this domain have shown that correlations between the amino acid sequence of a zinc finger and its target nucleotide sequence, derived from analyses of Zif268, are also applicable to SP-1 and hence can be used to adapt the specificity of SP-1 to DNA sequences other than its normal target site.
- the portion of the SP-1 sequence used for construction of designed zinc finger DNA binding domains corresponds to amino acids 533 to 624.
- Amino acid sequences of designed DNA-binding domains are illustrated in Table 1.
- the designed Veg1 protein comprises three zinc fingers (F1, F2 and F3) which together recognize a 9-base pair target site.
- the amino acid sequence of the recognition helix (positions ⁇ 1 through +6, where +1 is the first amino acid in the ⁇ -helix) for each of the DNA-binding fingers is given.
- Target sites and ZFP DNA-binding domains in the human VEGF-A gene Name Target site Location AA sequence Veg 1 5′-GGGGAGGAT-3′ ⁇ 8 to +1 F1: TTSNLRR (SEQ ID NO:3) (SEQ ID NO:2) F2: RSSNLQR (SEQ ID NO:4) F3: RSDHLSR (SEQ ID NO:5) Veg 3a 5′-GCGGAGGCT-3′ +3 to +11 F1: QSSDLQR (SEQ ID NO:7) (SEQ ID NO:6) F2: RSSNLQR (SEQ ID NO:8) F3: RSDELSR (SEQ ID NO:9)
- PCR polymerase chain reaction
- oligonucleotides contained different sequences encoding amino acids at positions ⁇ 1, +2, +3 and +6 in each recognition helix, depending on its target triplet sequence. Codon bias was chosen to allow expression in both mammalian cells and E. coli. Assembly of Veg 1 coding sequences was carried out as follows. First, the six oligonucleotides (three universal and three specific, as described above) were combined and annealed at 25° C. to form a gapped DNA scaffold. Next, gaps were filled by conducting a four-cycle PCR reaction (using Taq and Pfu thermostable DNA polymerases) to generate a double-stranded template.
- This template was amplified (for thirty cycles) using a pair of external primers containing Kpn I and Hind III restriction sites. PCR products were directly cloned into the Kpn I and Hind III sites of the Tac promoter vector, pMal-c2 (New England Biolabs, Beverly, Mass.). The Veg1 zinc finger DNA-binding domain was expressed from this vector and purified as a fusion with the maltose binding protein according to the manufacturer's instructions (New England Biolabs, Beverly, Mass.).
- Veg1 nucleotide sequence KpnI GGTACCCATACCTGGCAAGAAGAAGCAGCACATCTGCCACATCCAGG (SEQ ID NO:10) GCTGTGGTAAAGTTTACGGCACAACCTCAAATCTGCGTCGTCACCTGCGCTGG CACACCGGCGAGAGGCCTTTCATGTGTACCTGGTCCTACTGTGGTAAACGCTT CACCCGTTCGTCAAACCTGCAGCGTCACAAGCGTACCCACACCGGTGAGAAG AAATTTGCTTGCCCGGAGTGTCCGAAGCGCTTCATGCGTAGTGACCACCTGTC CCGTCACATCAAGACCCACCAGAATAAGAAGGGTGGATCC BamHI Veg1 amino acid sequence VPIPGKKKQHICHIQGCGKVYGTTSNLRRHLRWHTGERPFMCTWSYCGK (SEQ ID NO:11) RF
- Partially pure unfused ZFPs were produced as follows (adapted from Desjarlais et al. (1992) Proteins: Structure, Function and Genetics 12:101-104). A frozen cell pellet was resuspended in 1/50th volume of 1 M NaCl, 25 mM Tris-HCl (pH 8.0), 100 ⁇ M ZnCl 2 , 5 mM DTT. Samples were boiled for 10 min and centrifuged for 10 min at ⁇ 3,000 ⁇ g. At this point, ZFP protein in the supernatant was >50% pure (as estimated by staining of SDS-polyacrylamide gels with Coomassie blue), and the product migrated at the predicted molecular weight of around 11 kDa.
- the second method for producing ZFPs was to express them as fusions to the E. coli Maltose Binding Protein (MBP).
- MBP Maltose Binding Protein
- N-terminal MBP fusions to ZFPs were constructed by PCR amplification of the pET15b clones and insertion into the vector pMal-c2 (New England Biolabs) under the control of the Tac promoter. The fusion allows simple purification and detection of recombinant protein. It had been reported previously that zinc finger DNA-binding proteins can be expressed from this vector in soluble form to high levels in E. coli and can bind efficiently to the appropriate DNA target without refolding. Liu et al. (1997) Proc. Natl. Acad. Sci.
- MBP-fused proteins were as described by the manufacturer (New England Biolabs, Beverly, Mass.). Transformants were grown in LB medium supplemented with glucose and ampicillin, and were induced with IPTG for 3 hrs at 37° C. The cells were lysed by French press, then exposed to an agarose-based amylose resin, which specifically binds to the MBP moiety, thus acting as an affinity resin for the MBP fusion protein. The MBP fusion protein was eluted with 10 mM maltose to release ZFP of >50% purity. In some cases, protein was further concentrated using a Centricon 30 filter unit (Amicon).
- a 29-mer duplex oligonucleotide was used as a binding target for electrophoretic mobility shift analysis of Veg1.
- the sequence of the duplex was as follows: 5′-CATGCAT AGC GGGGAGGAT CGC CATCGAT-3′ (SEQ ID NO:12) 3′-GTACGTA TCGCCCCTCCTAGCG GTAGCTA-5′
- top strand was labeled, prior to annealing, with polynucleotide kinase and ⁇ - 32 P ATP. Top and bottom strands were annealed in a reaction containing each oligonucleotide at 0.5 ⁇ M, 10 mM Tris-HCl (pH 8.0), 1 mM EDTA, and 50 mM NaCl. The mix was heated to 95° C. for 5 min and slow-cooled to 30° C. over 60 min. Duplex formation was confirmed by polyacrylamide gel electrophoresis. Free label and single stranded DNA remaining in the target preparations did not appear to interfere with the binding reactions.
- Binding reactions contained 50 pM 5′ 32 P labeled double stranded target DNA, 10 mM Tris-HCl (pH 7.5), 100 mM KCl, 1 mM MgCl 2 , 1 mM dithiothreitol, 10% glycerol, 200 ⁇ g/ml bovine serum albumin, 0.02% NP-40, 20 ⁇ g/ml poly dI-dC (optionally), and 100 ⁇ M ZnCl 2 , in a final volume of 20 ⁇ l.
- Protein was added to the binding reaction as one-fifth volume from a dilution series made in 200 mM NaCl, 20 mM Tris (pH 7.5), 1 mM DTT. Binding was allowed to proceed for 45 min at room temperature. Polyacrylamide gel electrophoresis was carried out at room temperature using precast 10% or 10-20% Tris-HCl gels (BioRad, Hercules, Calif.) and Tris-Glycine running buffer (25 mM Tris-HCl, 192 mM glycine, pH 8.3) containing 0.1 mM ZnCl 2 . Radioactive signals were quantitated with a Phosphorimager.
- FIG. 2 shows the results of EMSA analysis of Veg1, using a four-fold dilution series of the Veg1 protein. Shifted product, indicative of labeled target with bound protein, is indicated by an arrow in FIG. 2A. The amount of shifted product was determined at each protein concentration and quantitated on a Phosphorimager (Molecular Dynamics). The relative signal (percent of maximal amount of shifted product) was plotted as a function of log 10 protein concentration. In this case, the protein concentration yielding half-maximal binding of Veg1 to its target site (i.e., the apparent k d ) was approximately 50 nM. MBP-fused and unfused versions of Veg1 bound to the target site with similar affinities.
- the Veg1 DNA binding domain is subcloned into a eukaryotic expression vector, in such a way that it is fused to the hBAF155 subunit of the brm/BRG chromatin remodeling complex.
- a cDNA sequence encoding a full length BAF155 protein is cloned using long range PCR. Barnes (1994) Proc. Natl. Acad. Sci. USA 91:2216-2220; Cheng et al. (1994) Proc. Natl. Acad. Sci. USA 91:5695-5699.
- oligonucleotide primers are homologous to sequences just upstream of the translation initiation codon at nucleotide 55 and just downstream of the final codon (proline at nucleotide 3366).
- the BAF15 numbering scheme refers to the Genbank Accession number U66615.
- the primer upstream of nucleotide 55 contains a Bam-HI site positioned such that, when upstream sequences encoding the Veg1 DNA-binding domain are fused to BAF155 sequences, the translational reading frame is preserved.
- nucleotide 3366 contains a HindIII site just downstream of the final codon of BAF155 positioned such that, if BAF155 sequences are fused to downstream sequences encoding a FLAG epitope tag, the translational reading frame is preserved.
- PCR is performed using cDNA from HeLa cells as template.
- Amplified product having a size of approximately 3400 base pairs is gel-purified and cloned directly into a Topo2 cloning vector (Invitrogen, Carlsbad, Calif.).
- Site-directed mutagenesis is used to eliminate the BamHI site at BAF155 position 2304, without altering the coding capacity or translational reading frame of the gene.
- a similar approach is used to eliminate the KpnI sites at nucleotides 2235 and 3243, and the HindIII sites at nucleotides 656 and 2365.
- the cloned and modified BAF155 gene is then removed from the Topo2 vector by digestion with BamHI and HindII, and gel-purified.
- the expression vector is modified from pcDNA3.1( ⁇ ) (Invitrogen, Carlsbad, Calif.), by digesting it with EcoRI and HindIII, and inserting a double-stranded oligonucleotide encoding an EcoRI site, a translation initiation sequence (Kozak (1991) J. Biol. Chem. 266:19,867-19,870), a nuclear localization signal (NLS), a KpnI site, a BamHI site and a HindIII site.
- the NLS is derived from the SV40 large T-antigen (Kalderon et al. (1984) Cell 39:499-509), and has the amino acid sequence MAPKKKRKVGIHGV (SEQ ID NO: 13).
- This plasmid is then digested with BamHI and HindIII, and the BamHI-HindIII fragment comprising the BAF155 gene (supra) is inserted.
- a double-stranded oligonucleotide encoding a FLAG epitope (having the sequence DYKDDDDK, SEQ ID NO: 14), and containing HindIII sites at both ends is inserted concurrently with the BAF155-containing fragment.
- the FLAG-containing HindIII fragment can be inserted in a separate, subsequent ligation.
- the resulting construct comprises, in order, CMV immediate early promoter, EcoRI site, translation initiation sequence, SV40 large T-antigen nuclear localization sequence, KpnI site, BamHI site, HBAF155 coding sequence, HindIII site, FLAG epitope, HindIII site, bovine growth hormone (bGH) polyadenylation signal, in a pcDNA3.1 (Invitrogen, Carlsbad, Calif.) plasmid backbone.
- the CMV promoter and bGH polyadenylation signal are derived from the original pcDNA3.1 vector, as are sequences for replication and selection.
- the Veg1 ZFP DNA-binding domain (see Example 1) is inserted, as a KpnI-BamHI fragment, into the vector described in the preceding paragraph to generate a vector encoding a protein having the structure (from N- to C-terminus): Nuclear localization sequence—Veg1 DNA binding domain—hBAF155-FLAG epitope tag.
- Nuclear localization sequence Veg1 DNA binding domain—hBAF155-FLAG epitope tag.
- the integrity of these constructs, and the preservation of the reading frame, is confirmed at each step of the procedure by nucleotide sequence analysis.
- this vector Upon transfection into mammalian cells this vector produces a NLS-Veg1-BAF155-FLAG fusion, whose transcription is controlled by a CMV immediate early promoter and a bovine growth hormone polyadenylation signal.
- a polynucleotide encoding a component of a chromatin remodeling complex (or a functional fragment thereof) is obtained by PCR from cDNA (or optionally genomic DNA) using primers containing flanking BamHI and HindIII sites. BamHI, KpnI and HindIII sites, if present in the amplified product, are removed by site-directed mutagenesis, preserving the reading frame and coding capacity in the process.
- the amplified gene is introduced into a BamHI/HindIII-digested expression vector constructed as described above, optionally along with a HindIII fragment containing a FLAG epitope.
- the resulting construct is digested with KpnI and BamHI and a KpnI/BamHI fragment, encoding a DNA-binding domain, preferably a ZFP DNA-binding domain, is inserted.
- Sequences encoding nuclear localization sequences and FLAG epitopes, for immunological detection of the fusion protein, are optionally included in the construct.
- Plasmids encoding these fusions are propagated in any suitable host strain, preferably E. coli strains JM109 or HB101.
- a zinc finger DNA-binding domain which recognizes the human vascular endothelial growth factor-A (VEGF) gene, was designed and constructed according to design rules and methods disclosed in co-owned WO 00/42219, WO 00/41566, and co-owned U.S. patent applications Ser. Nos. 09/444,241 filed Nov. 19, 1999 and 09/535,088 filed Mar. 23, 2000.
- the target site which overlaps the transcription initiation site for the human VEGF-A gene, is shown below as SEQ ID NO: 15, with the arrow indicating the transcription startsite.
- SEQ ID NO: 15 3′-CCCCTCCTAGCGCCTCCGA-5′
- Amino acids 533-624 of the human SP-1 zinc finger transcription factor were used as backbone for the construction of a designed six-finger DNA binding domain, Veg3a/1, capable of recognizing this sequence.
- Amino acid sequences of the designed DNA-binding domains are illustrated in Table 1.
- the designed Veg3a/1 protein comprises two subdomains, Veg1 and Veg3a, each comprising three zinc fingers (F1, F2 and F3) and each recognizing a 9-base pair subsite of the target site, joined by the linker sequence DGGGS (SEQ ID NO: 16).
- the amino acid sequence of the recognition helix (positions ⁇ 1 through +6, where +1 is the first amino acid in the ⁇ -helix) for each of the DNA-binding fingers is given in Table 1.
- Veg3a nucleotide sequence KpnI GGTACCCATACCTGGCAAGAAGAAGCAGCACATCTGCCACATCCAGG (SEQ ID NO:17) GCTGTGGTAAAGTTTACGGCCAGTCCTCCGACCTGCAGCGTCACCTGCGCTG GCACACCGGCGAGAGGCCTTTCATGTGTACCTGGTCCTACTGTGGTAAACGCT TCACCCGTTCGTCAAACCTACAGAGGCACAAGCGTACACACACCGGTGAGAA GAAATTTGCTTGCCCGGAGTGTCCGAAGCGCTTCATGCGAAGTGACGAGCTG TCACGACATATCAAGACCCACCAGAACAAGAAGGGTGGATCC BamHI Veg3a amino acid sequence: VPIPGKKKQHICHIQGCGKVYG QSSDLQR HLRWHTGERPFMCTWSYCG
- the purified Veg3a zinc finger DNA-binding domain is tested for affinity to its 20 DNA target site by electrophoretic mobility shift analysis.
- a double-stranded target oligonucleotide was constructed by annealing complementary 29-mers, then end-labeled using polynucleotide kinase and ⁇ - 32 P-ATP.
- the sequence of the target was as follows: 5′-CATGCAT ATC GCGGAGGCT TGG CATCGAT-3′ (SEQ ID NO:19) 3′-GTACGTA TAGCGCCTCCGAACC GTAGCTA-5′
- Veg1 and Veg3a binding subdomains were joined to each other, using the linker sequence DGGGS (SEQ ID NO: 16). This particular linker sequence was chosen because it permits binding of two three-finger ZFP binding subdomains to two 9-bp sites that are separated by a one nucleotide gap, as is the case for the Veg1 and Veg3a sites. See also Liu et al., supra.
- the 6-finger Veg3a/1 protein encoding sequence was generated as follows. Sequences encoding Veg3a recognition helices were PCR-amplified from the Veg3a-encoding vector (supra) using the primers SPE7 (5′-GAGCA GAATTC GGCAAGAAGAAGCAGCAC) (SEQ ID NO: 20) and SPEamp12 (5′-GTGG TCTAGA CAGCTCGTCACTTCGC) (SEQ ID NO: 21) to generate a double-stranded fragment bounded by EcoRI and XbaI restriction sites (underlined in the sequences of the primers). The amplification product was digested with EcoRI and XbaI.
- Sequences encoding Veg1 recognition helices were PCR-amplified from the Veg1-encoding vector (Example 1) using the primers SPEamp13 (5′-GGAG CCAAGG CTGTGGTAAAGTTTACGG) (SEQ ID NO: 22) and SPEamp11 (5′-GGAG AAGCTT GGATCCTCATTATCCC) (SEQ ID NO: 23) to generate a double-stranded amplification product bounded by StyI and HindIII restriction sites (underlined in the sequences of the primers). The resulting amplification product was digested with StyI and HindIII.
- a third double-stranded fragment was constructed, using synthetic oligonucleotides, which encodes the DGGGS linker, flanked by the remainders of the Veg1 and Veg3a DNA-binding domains, and bounded by XbaI and StyI sites.
- sequence of this third fragment is as follows, with the XbaI and StyI sites underlined: XbaI 5′ CTAGA CACATCAAAACCCACCAGAACAAGAAAGACGGCGGTGGC 3′ T GTGTAGTTTTGGGTGGTCTTGTTCTTTCTGCCGCCACCG AGCGGCAAAAAGAAACAGCACATATGTCACAT C 3′ SEQ ID NO:24 TCGCCGTTTTTCTTTGTCGTGTATACAGTGTA GGTTC 5′ SEQ ID NO:25 StyI
- the nucleotide sequence encoding the designed 6-finger ZFP Veg3a/1, from the KpnI site to the BamHI site is: GGTACCCATACCTGGCAAGAAGAAGCAGCACATCTGCCACATCCAGGGCTGT (SEQ ID NO:28) GGTAAAGTTTACGGCCAGTCCTCCGACCTGCAGCGTCACCTGCGCTGGCACA CCGGCGAGAGGCCTTTCATGTGTACCTGGTCCTACTGTGGTAAACGCTTCACA CGTTCGTCAAACCTACAGAGGCACAAGCGTACACACACAGGTGAGAAGAAA TTTGCTTGCCCGGAGTGTCCGAAGCGCTTCATGCGAAGTGACGAGCTGTCTAG ACACATCAAAACCCACCAGAACAAGAAAGACGGCGGTGGCAGCGGCAAAAAAA GAAACAGCACATATGTCACATCCAAGTCCAAGTTTACGGCACAACC TCAAATCTGCGTCGTCACCTGCGCTGGCACACCGGCGAGAGGCCTTTCATGT GT
- VEGF3a/1 amino acid sequence (using single letter code) is: VPIPGKKKQHICHIQGCGKVYGQSSDLQRHLRWHTGERFMCTWSYCGKRFTRS (SEQ ID NO:29) SNLQRHKRTHTGEKKFACPECPKRFMRSDELSPHIKTHQNKKDGGGSGKKKQHI CHIQGCGKVYGTTSNLRRHLRWHTGERPFMCTWSYCGKRFTRSSNLQRHKRTH TGEKKFACPECPKRFMRSDHLSRHIKTHQNKKGGS
- the Veg3a/1 protein was expressed in E. coli as an MBP fusion, purified by affinity chromatography, and tested in EMSA experiments as described supra.
- a labeled double-stranded oligonucleotide comprising the target site was prepared by synthesis and annealing of two overlapping oligonucleotides, one of which was labeled with 32 P.
- the oligonucleotides comprised the following sequences (with the target site over/underlined): AGCGAGC GGGGAGGATCGCGGAGGCT TGGGGCAGCCGGGTAG (SEQ ID NO:30) TCGCCCCTCCTAGCGCCTCCGAACCCCGTCGGCCCATCTCGC (SEQ ID NO:31)
- Binding analysis was conducted as described in Example 1 for the Veg1 protein. Binding was allowed to proceed for 60 min at either room temperature or 37° C., and polyacrylamide gel electrophoresis was carried out at room temperature or 37° C. using precast 10% or 10-20% Tris-HCl gels (BioRad) and standard Tris-Glycine running buffer. The room temperature assays yielded an apparent k d (determined as described supra) for this Veg3a/1 protein of approximately 1.5 nM. When binding and electrophoresis were performed at 37° C., the apparent K d of Veg3a/1 was approximately 9 nM when tested against the 18-bp target. Thus, the six finger Veg3a/l ZFP bound with high affinity to its target site.
- a plasmid encoding a fusion between the human MBD1 gene and the Veg3a/1 DNA-binding domain is constructed using methods similar to those described above for the BAF155/Veg1 fusion (Example 2). Sequences encoding MBD1 (GenBank accession No. NM015846) are isolated by PCR from genomic DNA or cDNA. Amplification primers are designed such that the primer corresponding to the upstream region of the gene comprises a BamHI Site at or near its upstream terminus, and the primer corresponding to the downstream region of the gene comprises a HindIII site at or near its downstream terminus.
- the primers are designed to amplify the region between nucleotides 140 (MBD1 initiation codon) and 1,957 (MBD1 termination codon), and to retain the correct reading frame of the MBD1 gene when the amplification product is incorporated as a component of a fusion gene.
- the amplification product is optionally cloned, a BamHI site at nucleotide 264 of the MBD1 sequence is removed by site-specific mutagenesis, and the BamHI/HindIII fragment is released from the cloning vector and purified. Sequences encoding the Veg3a/1 DNA-binding domain are obtained as a KpnI/BamHI fragment (Example 3).
- the MBD1-encoding BamHI/HindIII fragment and the Veg3a/1-coding KpnI/BamHI fragment are inserted into pcDNA3.1( ⁇ ) or a modified derivative (Example 2).
- a nuclear localization signal and/or a FLAG epitope are optionally included in the fusion construct.
- the MBD1 gene can be divided into at least two functional fragments: a methylated DNA binding domain (encoded by nucleotides 158-322) and a functional domain. Accordingly, a MBD/ZFP fusion gene is constructed that lacks sequences encoding the methylated DNA-binding domain, but contains the functional domain of the MBD1 protein.
- the BamHI/HindIII-terminated amplification product comprises nucleotides 322 through 1,957 of the MBD1 gene.
- a similar fusion gene is constructed, in which the MBD2 gene (GenBank accession No. NM003927), or a functional fragment thereof, is fused to a ZFP DNA-binding domain.
- the amplification primers are designed to amplify the region between nucleotides 230 (MBD2 initiation codon) and 1,465 (MBD2 termination codon), and to retain the correct reading frame of the MBD2 gene when the amplification product is incorporated as a component of a fusion gene.
- the amplification product is optionally cloned, a KpnI site at nucleotide 813 and a HindIII site at nucleotide1308 of the MBD1 sequence are removed by site-specific mutagenesis, and the BamHI/HindIII fragment is released from the cloning vector and purified. Sequences encoding the Veg3a/1 DNA-binding domain are obtained as a KpnI/BamHI fragment (Example 3). The MBD2-encoding BamHI/HindIII fragment and the Veg3a/1-coding KpnI/BamHI fragment are inserted into pcDNA3.1( ⁇ ) or a modified derivative (Example 2). A nuclear localization signal and/or a FLAG epitope are optionally included in the fusion construct.
- the methylated DNA-binding domain of the MBD2 gene is encoded by nucleotides 680-862. Accordingly, a MBD/ZFP fusion gene is constructed that lacks sequences encoding the methylated DNA-binding domain, but contains the functional domain of the MBD2 protein, by designing the amplification primers to amplify the region of the MBD2 gene located between nucleotides 862 and 1,465. As in previous examples, the amplification primers comprise BamHI and HindIII sites at or near their termini, to maintain the MBD2 reading frame and facilitate construction of the fusion protein by the methods described supra. In this case, the HindIII site at nucleotide 1,308 is removed subsequent to amplification and prior to construction of the fusion nucleic acid.
- HEK 293 Human embryonic kidney cells (HEK 293) are grown in DMEM (Dulbecco's modified Eagle medium) supplemented with 10% fetal calf serum. Cells are plated in 10 cm dishes at a density of 2.5 ⁇ 10 6 per plate and grown for 24 hours in a CO 2 incubator at 37° C.
- DMEM Dulbecco's modified Eagle medium
- Lipofectamine 2000 is diluted in 2.5 ml Opti-MEM.
- the diluted DNA and lipid are mixed and incubated for 20 minutes at room temperature. Medium is then removed from the cells and replaced with the lipid/DNA mixture.
- Cells are incubated at 37° C. for 3 hours in a CO 2 incubator, then 10 ml of DMEM+10% FBS is added. Cells are harvested 40 hours after transfection for analysis of chromatin structure (Example 6) and gene expression (Example 7).
- Intercalator-protein fusions, MGB-protein fusions and/or TFO-protein fusions are introduced into cells after encapsulation into liposomes, using standard procedures that are well-known in the art.
- Transformed human embryonic kidney 293 cells are grown in DMEM+10% fetal calf serum, supplemented with penicillin and streptomycin, in a 37° C. incubator at 5% CO 2 .
- DMEM+10% fetal calf serum supplemented with penicillin and streptomycin
- two 255 cm 2 plates of cells are used in an experiment.
- the cells reach greater than 90% confluence ( ⁇ 2.5 ⁇ 10 7 cells per plate)
- medium is removed and the cells are rinsed twice with 5 ml of ice-cold PBS (Gibco/Life Technologies, Gaithersburg, Md.). Cells are then scraped from the plates in 5 ml of ice-cold PBS and combined in a 50 ml conical centrifuge tube.
- the plates are washed with 10 ml of ice-cold PBS and the washes are added to the tube. Nuclei are pelleted by centrifugation (1400 rpm for 5 min) and the supernatant is removed. The pellet is mixed by vortexing and, while vortexing, 20 ml of lysis buffer (10 mM Tris pH 7.5, 1.5 mM MgCl 2 , 10 mM KCl, 0.5% IGEPAL CA-630 (Sigma), 1 mM phenylmethylsulfonyl fluoride, 1 mM dithiothreitol) is added.
- lysis buffer 10 mM Tris pH 7.5, 1.5 mM MgCl 2 , 10 mM KCl, 0.5% IGEPAL CA-630 (Sigma), 1 mM phenylmethylsulfonyl fluoride, 1 mM dithiothreitol
- the cell pellet is resuspended in lysis buffer by pipetting and the tube is centrifuged at 1400 rpm for 5 min. The supernatant is removed and the pellet is resuspended in 20 ml of lysis buffer and centrifuged as before. The final pellet is resuspended in 1.5 ml dilution buffer (15 mM Tris pH 7.5, 60 mM KCl, 15 mM NaCl, 5 mM MgCl 2 , 0.1 mM dithiothreitol, 10% glycerol), nuclei are counted in a microscope and the solution is adjusted so that a concentration of approximately 107 nuclei per ml is obtained.
- Nuclei at a concentration of 10 7 per ml in dilution buffer, are digested with different concentrations of DNase I.
- DNase I dilutions are prepared by diluting deoxyribonuclease I (Worthington, Freehold, N.J.) in dilution buffer (supra), optionally supplemented with 0.4 mM CaCl 2 .
- To 100 ⁇ l of resuspended nuclei is added 25 ⁇ l of a DNase I dilution to give final DNase I concentrations ranging from 0.07 Units/ml to 486 Units/ml in three-fold concentration increments. Digestions are conducted at room temperature for 5 min.
- Digestion reactions are then stopped by addition of 125 ⁇ l of Buffer AL (Qiagen DNeasyTM Tissue Kit) and 12.5 ⁇ l of a 20 mg/ml solution of Proteinase K (Qiagen DNeasyTM Tissue Kit), followed by incubation at 70° C. for 10 min.
- Digested DNA is purified using the DNeasyTM Tissue Kit (Qiagen, Valencia, Calif.) according to the manufacturer's instructions.
- Micrococcal nuclease can be used as an alternative to DNase I for examination of chromatin structure. Treatment of nuclei, obtained as described supra, with micrococcal nuclease is conducted as described by Livingstone-Zatchej et al. in Methods in Molecular Biology, Vol. 119, Humana Press, Totowa, N.J., pp. 363-378.
- Nuclei are treated with MPE using the following procedure adapted from Cartwright et al., supra.
- a freshly-diluted stock of 0.4 M H 2 O 2 is prepared by making a 25-fold dilution of a 30% stock solution.
- a freshly-prepared stock of 0.5 M ferrous ammonium sulfate is diluted 400-fold in water.
- a solution of methidiumpropyl EDTA (MPE) is prepared by adding 30 ⁇ l of 5 mM MPE to 90 ⁇ l of water. To this MPE solution is added 120 ⁇ l of the ferrous ammonium sulfate dilution and 2.5 ⁇ l of 1 M dithiothreitol (DTT, freshly prepared from powder).
- nuclei obtained as described supra, are added, in sequence: 3.5 ⁇ l of 0.4 M H 2 O 2 and 37.5 ⁇ l of the MPE/ferrous ammonium sulfate/DTT mixture.
- the reaction is terminated after an appropriate time period (determined empirically) by addition of 40 ⁇ l of 50 mM bathophenanthroline disulfonate, 0.1 ml of 2.5% sodium dodecyl sulfate/50 mM EDTA/50 mM Tris-Cl, pH 7.5 and 10 ⁇ l of Proteinase K (10-14 mg/ml). Proteinase digestion is conducted at 37° C.
- Nucleic acids are precipitated from the aqueous phase by addition of sodium acetate to 0.3 M and 0.7 volume of isopropyl alcohol, incubation on ice for at least 2 hr, and centrifugation. The pellet is washed with 70% ethanol, dried, resuspended in 10 mM Tris-Cl, pH 8 and treated with RNase A (approximately 0.1 mg/ml) for 15 min at 37° C.
- the gel is treated with alkali, neutralized, blotted onto a Nytran membrane (Schleicher & Schuell, Keene, N.H.), and the blotted DNA is crosslinked to the membrane by ultraviolet irradiation.
- Probes are labeled by random priming, using the Prime-It Random Primer Labeling Kit (Stratagene, La Jolla, Calif.) according to the manufacturer's instructions. In a typical labeling reaction, 25-50 ng of DNA template is used in a final volume of 50 ⁇ l. A specific activity of 10 9 cpm/ ⁇ g is typically obtained. Labeled probes are purified on a NucTrap probe column (Stratagene #400702, La Jolla, Calif.).
- the membrane is placed in a hybridization bottle and pre-hybridized in Rapid Hybridization Buffer (Amersham, Arlington Heights, Ill.) at 65° C. for 15 min.
- Probe a 0.1 kb XbaI-KpnI fragment, see FIG. 1A
- hybridization is conducted at 65° C. for 2 hours.
- the membrane is washed once at 65° C. for 10 min. with 2 ⁇ SSC+0.1% SDS, and twice at 65° C. for 10 min. with 0.1 ⁇ SSC+0.1% SDS.
- the membrane is then dried and analyzed either by autoradiography or with a phosphorimager.
- Results are shown in FIG. 3 for analysis of DNase hypersensitivity, in HEK293 cells, within an approximately 1,000 base-pair region upstream of the human VEGF-A gene transcriptional startsite.
- Increasing DNase concentration resulted in the generation of two new sets of DNA fragment doublets, centered at approximately 500 and 1,000 nucleotides, indicating the presence of two DNase hypersensitive regions.
- One of these regions is centered approximately 500 base pairs upstream of the transcriptional startsite; the other is centered on the transcriptional startsite.
- Remodeling of VEGF chromatin can involve, among other things, loss of one or both of these hypersensitive regions, or the generation of one or more additional hypersensitive regions, either upstream or downstream of the transcriptional startsite.
- Activation or repression of transcription resulting from localized chromatin remodeling is determined by measurement of RNA and/or protein gene products. These methods are well-known to those of skill in the art.
- RNA blots RNA blots, nuclease protection and/or quantitative real-time PCR (colloquially known as the “Taqman” assay), as is known to those of skill in the art. See, for example, Ausubel et al., supra.
- Protein production can be measured by immunoassay (e.g., ELISA, immunoprecipitation), gel electrophoresis, and/or immunological detection of protein blots (“Western” blots), as is known to those of skill in the art. See, for example, Ausubel et al., supra.
- Reporter genes can also be used to assay activation and/or repression of specific promoters. Accordingly, effect of chromatin remodeling on a promoter that is operatively linked to a reporter gene (such as, for example, alkaline phosphatase, ⁇ -galactosidase, ⁇ -glucuronidase, chloramphenicol acetyl transferase, horseradish peroxidase, luciferase, or green fluorescent protein) can be assayed by measuring the levels and/or activity of the reporter. Methods for fusion of a promoter to a reporter gene, and methods for assay of reporter gene products, are known to those of skill in the art. See, for example, Ausubel et al., supra.
- RNA analysis Transient transfection of HEK293 cells, seeded in 6-well plates, is carried out as described in Example 6, supra. Cell lysates are harvested 40 hours post-transfection. To assay the activation of the endogenous chromosomal VEGF gene, RNA blotting (“Northern” blotting) is used to measure VEGF mRNA levels. Briefly, PolyA+RNA is isolated from HEK 293 cells transfected with a fusion plasmid or from mock-transfected HEK293 cells, using the Oligotex kit (Qiagen, Valencia, Calif.), according to the manufacturer's instructions.
- the fusion plasmid encodes a fusion protein comprising a nuclear localization sequence, the Veg1 DNA-binding domain, BAF155 and a FLAG epitope (see Example 2, supra).
- 7 ⁇ g of RNA are resolved on a 2.4% agarose gel containing 2.4 M formaldehyde, and the gel is blotted onto Nytran SuPerCharge membrane (Schliecher & Schuell, Keene, N.H.) using 20 ⁇ SSC.
- the membrane is hybridized at 65° C. for 1 hour in Rapid-Hyb Buffer (Amersham-Pharmacia Biotech, Piscataway, N.J.) containing a 32 P-labeled VEGF cDNA probe.
- the VEGF cDNA construct is generated by inserting a human VEGF cDNA fragment, obtained by PCR amplification, into the pCDNA3.1 vector (Invitrogen, Carlsbad Calif.) at the XbaI and EcoRI sites. Structure of the clone is confirmed by sequencing. After hybridization, the VEGF probe is stripped from the membrane, and the blot is re-hybridized with a 32 P-labeled GAPDH DNA probe. VEGF mRNA levels, as determined by RNA blotting, are normalized to GAPDH mRNA levels.
- RNA samples 25 ng are mixed with 0.3 ⁇ M of each primer, 0.1 ⁇ M of probe, 5.5 mM MgCl 2 , 0.3 mM of each dNTP, 0.625 unit of AmpliTaq Gold RNA Polymerase, 6.25 units of Multiscribe Reverse Transcriptase, and 5 units of RNase inhibitor, in Taqman buffer A from Perkin Elmer. Reverse transcription is performed at 48° C. for 30 min. After denaturing at 95° C.
- PCR is conducted for 40 cycles at 95° C. for 15 seconds and 60° C. for one minute. Analysis is conducted, during the amplification reaction, in a 96-well format on an ABI 7700 SDS machine (PE BioSystems, Foster City, Calif.) and data is analyzed with SDS version 1.6.3 software. Exemplary probes and primers for analysis of VEGF and GAPDH genes are presented in Table 2.
- Protein analysis Analysis of protein levels is performed by resolving 10 ⁇ g of whole cell lysate on a 10-20% polyacrylamide gel run in Tris/glycine/SDS buffer (BioRad, Hercules, Calif.). Proteins separated in the gel are transferred onto a nitrocellulose membrane using Tris/glycine/SDS buffer supplemented with 20% methanol, and the filter is blocked with 5% non-fat dry milk for 1 hour at room temperature. The blot is probed for 1 hour at room temperature with anti-Flag M2 monoclonal antibody (Sigma, St.
- cell lysates are prepared (as described above) or culture medium is harvested and analyzed using a commercially available ELISA kit.
- levels of secreted VEGF protein are determined by assay of culture medium using a human VEGF ELISA kit (R & D systems, Minneapolis, Minn.).
- Chromatin remodeling can take the form of, for example, deposition, removal or repositioning of nucleosomes within chromatin.
- Means for detecting chromatin remodeling include, but are not limited to, detecting changes in accessibility of specific sites in chromatin to sequence-specific nucleases such as restriction enzymes, determination of the appearance or disappearance of a regularly repeating pattern of chromatin digestion by non-sequence specific endonucleases such as micrococcal nuclease and DNase I, determination of nucleosome spacing, and nucleosome-binding assays.
- sequence-specific nucleases such as restriction enzymes
- non-sequence specific endonucleases such as micrococcal nuclease and DNase I
- determination of nucleosome spacing and nucleosome-binding assays.
- chromatin remodeling complexes possess ATPase activity; therefore ATP hydrolysis assays can be used in the identification and/or characterization of chromatin remodeling
- Chromatin remodeling complexes utilize the energy of ATP hydrolysis to modify chromatin structure. Consequently, nucleosome- or DNA-dependent ATPase activity can be used to assay for a chromatin remodeling complex.
- ATPase activity is the release of labeled pyrophosphate from ⁇ - 32 P-labeled ATP. Release is measured as the amount of radioactivity that does not bind to activated charcoal in 20 mM phosphoric acid.
- An alternative method for measuring pyrophosphate release is to measure labeled pyrophosphate directly by thin layer chromatography.
- the reaction mixture contains 0.02 ⁇ g/ml DNA (or reconstituted nucleosomal array, see Example 11 infra), 5 ⁇ M SWI/SNF complex (or any other known or putative chromatin remodeling complex), 20 mM Tris, pH 8.0, 5 mM MgCl 2 , 0.2 mM dithiothreitol, 0.1% Tween, 5% glycerol, 100 ⁇ g/ml bovine serum albumin, 100 ⁇ M ATP, and 0.2 ⁇ Ci ( ⁇ - 32 P)ATP (3 Ci/mmol) in a final volume of 20 ⁇ l and is incubated at 37° C.
- Deposition of purified histone octamers onto a specific template under defined conditions can generate a nucleosomal array in which the positions of one or more individual nucleosomes, with respect to the nucleotide sequence of the template, are known.
- Such an array can be used as a substrate in an assay for chromatin remodeling activity, by testing for changes in nucleosome position with respect to nucleotide sequence.
- One such test is restriction endonuclease accessibility. See infra.
- ISWI-encoding sequences were amplified from a recombinant plasmid encoding Drosophila ISWI. Corona et al. (1999) Mol. Cell 3:239-245.
- One of the primers contained, outside of the ISWI-complementary region, sequences encoding a FLAG epitope and, at the 5′ terminus, a 5′ extension encoding Hind III and Xba I sites.
- the other primer contained a 5′ extension encoding a Bam HI site.
- sequences of the primers were as follows: cgatc GGATCC TCCAAAACAGATACAGCTGCC (SEQ ID NO:38) BamHI ISWI seq gatcgcc TCTAGA CTCGAG AAGCTT ACTTGTCATCGTCGTCCTTGTAGTCGCTGCC CTTCTTCTTCTTTTTCGAGTT (SEQ ID NO:39) XbaI HindIII FLAG sequence ISWI seq
- Amplification was conducted at 95° C. for 2 min, followed by 30 cycles of 95° C. for 30 sec, 55° C. for 30 sec, 72° C. for 5 min, and a final step of 72° C. for 5 min. resulted in the generation of an amplification product comprising ISWI- and FLAG-encoding sequences flanked by Bam HI and Hind III sites.
- the amplification product was purified using a PCR Cleanup Kit (Qiagen, Valencia, Calif.) according to the manufacturer's instructions, then digested with Bam HI and Hind III.
- a vector encoding a nuclear localization signal (NLS), a ZFP binding domain targeted to the human erythropoietin gene (Epo 2C), a VP16 activation domain and a FLAG epitope was digested with Bam HI and Hind III to release VP16- and FLAG-encoding sequences.
- the Bam HI/Hind III fragment described in the preceding paragraph was ligated to the vector backbone to generate a vector encoding a fusion protein comprising a NLS, the Epo2C binding domain, ISWI and FLAG.
- the nucleotide sequences of the target sites, and the amino acid sequences of the recognition helices ( ⁇ 1 through +6) for the Epo2C and Epo3B binding domains are provided in Table 3.
- the steroid receptor coactivator 1 (SRC1) protein is a histone acetyltransferase which is capable of recruiting the p300 and CBP proteins (both of which are also histone acetyltransferases).
- SRC1 steroid receptor coactivator 1
- a plasmid encoding SRC1 was used as a template for PCR amplification using the following primers, and the amplification product was digested with Not I. 5′-GGATCCGGCCACCGCGGCCGCATGGATCCATGTAATACAAACCCAACC (SEQ ID NO:48) 5′-ATGAATTCGCGGCCGCCCTGGGTTCCATCTGCTTCTGTTTTGAG (SEQ ID NO:49)
- the pVP16-EPOZFP-862c vector containing a transcription unit encoding a nuclear localization signal (NLS), the EPO ZFP-862 zinc finger binding domain, a VP16 transcriptional activation domain and a FLAG epitope, under transcriptional control of a CMV promoter and a bovine growth hormone polyadenylation signal, was digested with Not I to release VP1 6-encoding sequences. See Zhang et al. (2000) J. Biol. Chem. 275:33,850-33,860 for the design and properties of EPOZFP-862, which binds to a site 862 nucleotides upstream of the EPO transcriptional startsite.
- NLS nuclear localization signal
- the Not I-digested amplification product described in the previous paragraph was inserted into the ZFP-862c vector backbone by ligation, to generate a plasmid encoding a NLS, the EPO ZFP-862 binding domain, amino acids 781-1385 of SRC1 and a FLAG epitope.
- the structure of the resulting construct, pSRC1b-EPO2c, is illustrated schematically in FIG. 4.
- This construct was introduced into human HEK 293 cells by transfection (200 ug of plasmid plus 5 ug of Lipofectamine; Lipofectamine obtained from Gibco/Life Technologies, Gaithersburg, Md.). Approximately 12 hours after exposure of cells to plasmid, the medium was removed and replaced with fresh DMEM supplemented with 10% fetal bovine serum. Twenty-four hours later, the medium was harvested and assayed for secreted EPO, using an erythropoietin ELISA from R&D Systems (Minneapolis, Minn.).
- a control plasmid pcDNA3.1
- MBDs Methyl binding domain proteins
- DNA N-methyl transferases methylate cytosine residues present in certain CpG dinucleotide sequences in cellular DNA. Such methylation can lead to chromatin remodeling at or in the vicinity of the methylated sequence(s) by, for example, binding of one or more MBDs and concomitant or subsequent recruitment of chromatin remodeling complexes.
- the DNMT1 protein can also associate with histone deacetylases (HDACs), which themselves are involved in chromatin remodeling.
- HDACs histone deacetylases
- a series of ZFP-MBD and ZFP-DNMT fusions were tested for their ability to regulate expression of the human VEGF-A gene. Accordingly, a series of plasmids was constructed, in which the VEGF3a/1 ZFP binding domain (see Example 3,supra) was fused to MBD2b, MBD3, MBD3S, MBD3L, DNMT1, DNMT3a or DNMT3b. See, for example, GenBank accession numbers AF072243, AF170347, AW872007, and NM013595.
- the fusion genes also comprised a nuclear localization signal and a FLAG epitope, similar to the constructs described in Examples 11 and 12.
- FIG. 6 shows a schematic diagram of these constructs.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Biotechnology (AREA)
- Wood Science & Technology (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Analytical Chemistry (AREA)
- Bioinformatics & Computational Biology (AREA)
- Crystallography & Structural Chemistry (AREA)
- Theoretical Computer Science (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Medical Informatics (AREA)
- Evolutionary Biology (AREA)
- Immunology (AREA)
- Toxicology (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Ecology (AREA)
- Peptides Or Proteins (AREA)
Abstract
Methods and compositions for targeted modification of chromatin structure, within a region of interest in cellular chromatin, are provided. Such methods and compositions are useful for facilitating processes such as, for example, transcription and recombination, that require access of exogenous molecules to chromosomal DNA sequences.
Description
- This application is a continuation-in-part of copending U.S. patent application Ser. No. 09/844,508 (filed Apr. 27, 2001), which in turn claims priority to U.S. Provisional Patent Application Serial No. 60/200,590, filed Apr. 28, 2000 and to U.S. Provisional Patent Application Serial No. 60/228,523, filed Aug. 28, 2000. The disclosures of all of the aforementioned patent applications are hereby incorporated by reference in their entireties.
- The present disclosure is in the fields of chromatin structure and genetic regulation, in particular, the modification of chromatin structure to facilitate interaction of molecules with a region of interest in cellular chromatin.
- Regulation of gene expression in a cell is generally mediated by sequence-specific binding of gene regulatory molecules, often proteins, to chromosomal DNA. Regulatory proteins can effect either positive or negative regulation of gene expression. Generally, a regulatory protein will exhibit preference for binding to a particular binding sequence, or target site. Target sites for many regulatory proteins (and other molecules) are known or can be determined by one of skill in the art.
- Despite advances in the selection and design of sequence-specific DNA binding gene regulatory proteins, their application to the regulation of an endogenous cellular gene can, in some cases, be limited if their access to the target site is restricted in the cell. Possible sources of restricted access could be related to one or more aspects of the chromatin structure of the gene. Access can be influenced by the structure of the gene per se (e.g., nucleotide methylation) or by the structure of the chromosomal domain in which the gene resides.
- Cellular DNA, including the cellular genome, generally exists in the form of chromatin, a complex comprising nucleic acid and protein. Indeed, most cellular RNAs also exist in the form of nucleoprotein complexes. The nucleoprotein structure of chromatin has been the subject of extensive research, as is known to those of skill in the art. In general, chromosomal DNA is packaged into nucleosomes. A nucleosome comprises a core and a linker. The nucleosome core comprises an octamer of core histones (two each of H2A, H2B, H3 and H4) around which is wrapped approximately 150 base pairs of chromosomal DNA. In addition, a linker DNA segment of approximately 50 base pairs is associated with linker histone Hi (or a related linker histone in certain specialized cells). Nucleosomes are organized into a higher-order chromatin fiber (sometimes denoted a “solenoid” or a 30 nm fiber) and chromatin fibers are organized into chromosomes. See, for example, Wolffe “Chromatin: Structure and Function” 3rd Ed., Academic Press, San Diego, 1998 and Kornberg et al. (1999) Cell 98:285-294.
- Chromatin structure is not static, but is subject to modification by processes collectively known as chromatin remodeling. Chromatin remodeling can serve, for example, to remove nucleosomes from a region of DNA, move nucleosomes from one region of DNA to another, change the spacing between nucleosomes or add nucleosomes to a region of DNA in the chromosome. Chromatin remodeling can also result in changes in higher order structure, thereby influencing the balance between transcriptionally active chromatin (open chromatin or euchromatin) and transcriptionally inactive chromatin (closed chromatin or heterochromatin).
- Chromosomal proteins are subject to numerous types of chemical modification, some or all of which influence chromatin structure. For example, histones are subject to acetylation by histone acetyltransferases, deacetylation by histone deacetylases, methylation by histone methyltransferases (and therefore presumably to demethylation by histone demethylases), ubiquitination by ubiquitin ligases, de-ubiquitination by ubiquitin hydrolases, phosphorylation by histone kinases, dephosphorylation by histone phosphatases, and reversible ADP-ribosylation by poly-ADP ribose polymerase (PARP, also known as TFIIC). Strahl et al. (2000)Nature 403:41-45. Regulation of chromatin structure by methylation of histone H3 has been described. Rea et al. (2000) Nature 406:593-599. Modifications of non-histone chromosomal proteins include, for example, acetylation of HMG-1 (Munshi et al. (1998) Mol. Cell 2:457-467); HMGs 14 and 17 (Sterner et al. (1981) J. Biol. Chem. 256:8892-8895; Herrera et al. (1999) Mol. Cell. Biol. 19:3466-3473; Bergel et al. (2000) J. Biol. Chem. 275:11,514-11,520) and chromatin-resident transcriptional regulators such as, for example, TFIIE (Imhof et al. (1997) Curr. Biol. 7:689-692), p53 (Gu et al. (1997) Cell 90:595-606) and GATA-1 (Boyes et al. (1998) Nature 396:594-598). Chemical modification of histone and/or non-histone proteins is often a step in the chromatin remodeling process, and can have either positive or negative effects on gene expression. Generally, histone acetylation is correlated with gene activation; while deacetylation of histones is correlated with gene repression.
- A number of enzymes capable of chemical modification of histones have been described and partially characterized. For example, histone acetyl transferases include Gcn5p, p300/CBP-associated factor (P/CAF), p300, CREB-binding protein (CBP), HAT1, TFIID-associated factor 250 (TAFII250), and steroid receptor coactivator-1 (SRC-1). Wade et al. (1997) Trends Biochem. Sci. 22:128-132; Kouzarides (1999) Curr. Opin. Genet. Devel. 9:40-48; Sterner et al. (2000) Microbiol. Mol. Biol. Rev. 64:435-459. The HDAC family of proteins have been identified as histone deacetylases and include homologues to the budding yeast histone deacetylase RPD3 (e.g., HDAC1, HDAC2, HDAC3 and HDAC8) and homologues to the budding yeast histone deacetylase HDA1 (e.g., HDAC4, HDAC5, HDAC6 and HDAC7). Ng et al. (2000) Trends Biochem. Sci. 25:121-126. The Rsk-2 (RKS90) kinase has been identified as a histone kinase. Sassone-Corsi et al. (1999) Science 285:886-891. A histone methyltransferase (CARM-1) has also been identified. Chen et al. (1999) Science 284:2174-2177.
- Effects of alterations in chromatin structure upon gene expression have been reported or inferred. Fryer et al. (1998)Nature 393:88-91; and Kehle et al. (1998) Science 282:1897-1900.
- Because of the dynamic structure of cellular chromatin, the ability of a regulatory molecule to bind its target site in a chromosome may be limited, in certain circumstances, by chromatin structure. For example, if a target site is present in “open” chromatin (generally thought of as nucleosome-free or having an altered nucleosomal conformation compared to bulk chromatin) structural barriers to the binding of a regulatory molecule to its target site are unlikely. By contrast, if a target site is present in “closed” chromatin (i.e. having extensive higher-order structure and/or close nucleosome spacing), steric barriers to binding are likely to exist. Thus, the ability of a regulatory molecule to bind to a target site in cellular chromatin will depend on the structure of the chromatin surrounding that particular target site. The chromatin structure of a particular gene can vary depending on, for example, cell type and/or developmental stage. For this reason, the regulation of a given gene in a particular cell can be influenced not only by the presence or absence of gene regulatory factors, but also by the chromatin structure of the gene.
- Remodeling of chromatin can lead to activation of gene expression in vitro. For example, the NURF chromatin remodeling complex stimulates the transcriptional activation activity of the GAGA transcription factor. Tsukiyama et al. (1995)Cell 83:1011-1020. Transcriptional activation by a GAL4-VP 16 fusion requires the RSF chromatin remodeling complex. LeRoy et al. (1998) Science 282:1900-1904. The SWI/SNF chromatin remodeling complex potentiates transcriptional activation by the VP16 activation domain and by ligand-bound glucocorticoid receptor. Neely et al. (1999) Mol. Cell 4:649-655; Wallberg et al. (2000) Mol. Cell. Biol. 20:2004-2013.
- There are also several examples of a requirement for the activity of chromatin remodeling complexes for gene activation in vivo. The human SWI/SNF chromatin remodeling complex is required for the activity of the glucocorticoid receptor. Fryer et al. (1998)Nature 393:88-91. The mammalian SWI/SNF chromatin remodeling complex is required for activation of the hsp70 gene. de La Serna et al. (2000) Mol. Cell. Biol. 20:2839-2851. Mutations in the Drosophila ISWI protein adversely affect expression of the engrailed and Ultrabithorax genes. Deuring et al. (2000) Mol. Cell 5:355-365. Finally, mutations in the yeast SWI/SNF gene result in a decrease in expression of one group of genes and an increase in expression of another group of genes, showing that chromatin remodeling can have both positive and negative effects on gene expression. Holsteege et al. (1998) Cell 95:717-728; Sudarsanam et al. (2000) Proc. Natl. Acad. Sci. USA 97:3364-3369.
- Despite this knowledge of the effects of chromatin remodeling on gene expression in vitro and in vivo, methods for directed manipulation of chromatin structure are not available. Accordingly, for situations in which a regulatory molecule is prevented, by chromatin structure, from interacting with its target site, methods for targeted modification of chromatin structure are needed. Such methods would be useful, for example, to facilitate binding of regulatory molecules to cellular chromatin and/or to facilitate access of DNA-binding molecules to cellular DNA sequences. This, in turn, would facilitate regulation of gene expression, either positively or negatively, by endogenous and exogenous molecules, and provide additional methods for binding these molecules to binding sites within regions of interest in cellular chromatin.
- Disclosed herein are compositions and methods useful for targeted modification of chromatin. These compositions and methods are useful for facilitating processes that depend upon access of cellular DNA sequences to DNA-binding molecules, for example, transcription, replication, recombination, repair and integration. In one embodiment, targeted modification of chromatin facilitates regulation of gene expression by endogenous or exogenous molecules, by providing access to cellular DNA sequences. Modification is any change in chromatin structure, compared to the normal state of the chromatin in the cell in which it resides.
- Accordingly, in one embodiment, a method for modifying a region of interest in cellular chromatin is provided, wherein the method comprises contacting cellular chromatin with a fusion molecule that binds to a binding site in the region of interest. The fusion molecule comprises a DNA-binding domain and a component of a chromatin remodeling complex or a functional fragment thereof. In a preferred embodiment, the fusion molecule is a polypeptide. Cellular chromatin can be present in any type of cell, including prokaryotic, eucaryotic or archaeal. Eucaryotic cells include microorganisms, fungal cells, plants and animals, including vertebrate, mammalian and human cells.
- In certain embodiments, the DNA-binding domain of a fusion molecule comprises a triplex-forming nucleic acid, an intercalator, an antibiotic, or a minor groove binder. In a preferred embodiment, the DNA-binding domain comprises a zinc finger DNA-binding domain. In a more preferred embodiment, a fusion molecule is a fusion polypeptide comprising a zinc finger DNA-binding domain. Other polypeptide DNA-binding domains are also useful.
- The other portion of the fusion molecule is a component of a chromatin remodeling complex. Numerous chromatin remodeling complexes are known to those of skill in the art. Chromatin remodeling complexes generally contain an enzymatic component, which is often an ATPase, a histone acetyl transferase or a histone deacetylase. ATPase components include, but are not limited to, the following polypeptides: SWI2/SNF2, Mi-2, ISWI, BRM, BRG/BAF, Chd-1, Chd-2, Chd-3, Chd-4 and Mot-1. Additional non-enzymatic components, involved in positioning the enzymatic component with respect to its substrate and/or for interaction with other proteins, are also present in chromatin remodeling complexes and can be used as a portion of a fusion molecule. Many components of chromatin remodeling complexes have been identified by sequence homology. Accordingly, additional chromatin remodeling complexes and their components are likely to be discovered and their use is contemplated by the present disclosure.
- Modification of chromatin structure will facilitate many processes that require access to cellular DNA. In one embodiment, chromatin modification facilitates modulation of expression of a gene of interest. Modulation of expression comprises activation or repression of a gene of interest. In a separate embodiment, chromatin modification facilitates recombination between an exogenous nucleic acid and cellular chromatin. In this way, targeted integration of transgenes is accomplished more efficiently.
- As noted, a fusion molecule can be a polypeptide. Accordingly, in one embodiment, chromatin modification is accomplished by contacting a cell with a polynucleotide encoding a fusion polypeptide, such that the polynucleotide is introduced into the cell and the fusion polypeptide is expressed in the cell. In this regard, fusion polypeptides comprising a fusion between a DNA-binding domain and a component of a chromatin remodeling complex (or functional fragment thereof), as well as polynucleotides encoding them, are provided. Also provided are cells comprising these fusion polypeptides and cells comprising polynucleotides encoding these fusion polypeptides. Preferred are fusion polypeptides comprising a zinc finger DNA binding domain and polynucleotides encoding them.
- In one embodiment, a region of interest in cellular chromatin, which is to be modified, comprises a gene. Exemplary genes whose chromatin structure can be modified through the use of the compositions and methods disclosed herein include, but are not limited to, vascular endothelial growth factor (VEGF), erythropoietin (EPO), androgen receptor, PPAR-γ2, p16, p53, Rb, dystrophin and e-cadherin. Accordingly, in certain embodiments, the DNA binding domain of the fusion molecule is selected to bind to a sequence (i.e., a target site) in one of the aforementioned genes.
- In certain embodiments, modification of chromatin structure, using a fusion molecule as disclosed herein, is accompanied by an additional step of contacting cellular chromatin with a second molecule. Often, the modification of chromatin structure effected by the binding of the fusion molecule facilitates the binding of the second molecule. The second molecule can be a transcription regulatory molecule, either an endogenous factor or one that is exogenously supplied to a cell. In certain embodiments, the second molecule is also a fusion molecule, preferably a fusion polypeptide. In a preferred embodiment, the second molecule comprises a zinc finger DNA-binding domain. The second molecule can also comprise, for example, a transcriptional activation domain or a transcriptional repression domain. Thus, in one embodiment, modification of chromatin structure, in a region of interest, by a fusion molecule as disclosed herein provides access for the binding of a second molecule which can regulate the transcription of a gene in or near the region of interest.
- In another embodiment, a second molecule is a fusion comprising a DNA binding domain and an enzyme (or functional fragment thereof) that covalently modifies histones, for example, a histone acetyl transferase or a histone deacetylase. In this way, a first fusion molecule facilitates remodeling of chromatin, making it a substrate for the activity of a second fusion molecule that facilitates covalent modification of the remodeled chromatin. Alternatively, a second molecule can comprise a fusion between a DNA binding domain and a component of a chromatin remodeling complex that is different from the one present in the first molecule. In this way, it is possible to recruit multiple chromatin remodeling complexes to a region of interest in cellular chromatin.
- In yet another embodiment, cellular chromatin is contacted with three molecules. The first comprises a fusion between a DNA binding domain and a component of a chromatin remodeling complex or a functional fragment thereof. The second molecule can comprise, for example, a transcriptional regulatory molecule (endogenous or exogenous), a fusion between a DNA binding domain and a component of a chromatin remodeling complex or a fusion between a DNA binding domain and an enzyme that covalently modifies histones. The third molecule can be an endogenous or exogenous transcriptional regulatory molecule, or a fusion molecule. A fusion molecule can be a fusion polypeptide and can comprise a DNA binding domain (e.g., a zinc finger DNA binding domain) and a transcriptional regulatory domain, such as, for example, an activation domain or a repression domain. Thus, several combinations of molecules are possible. For example, the first and second molecules can be involved in modifying chromatin structure in a region of interest to allow access to that region by a third molecule which can be, for example, a molecule with transcriptional regulatory function. Alternatively, the first molecule can be involved in the modification of chromatin structure to allow access by the second and third molecules (both of which can be, for example, transcriptional regulatory molecules) in a region of interest in cellular chromatin. In another embodiment, the first molecule can facilitate chromatin remodeling in the region of interest, the second molecule can be involved in covalent modification of histones in the region of interest, and the third molecule can bind in the region of interest and possess transcriptional regulatory function. In similar fashion, fourth, fifth, etc. molecules can also be contacted with cellular chromatin to modify its structure in a region of interest and effect regulation of a gene in that region.
- In one embodiment, methods for modulating expression of a gene comprise the steps of contacting cellular chromatin with a first fusion molecule that binds to a binding site in cellular chromatin, wherein the binding site is in the gene, and wherein the first fusion molecule comprises a DNA-binding domain and a component of a chromatin remodeling complex or a functional fragment thereof, and further contacting the cellular chromatin with a second molecule that binds to a target site in the gene and modulates expression of the gene. In a preferred embodiment, the DNA-binding domain of the first fusion molecule is a zinc finger DNA-binding domain.
- The second molecule can be, for example, a small molecule therapeutic, a minor groove binder, a peptide, a polyamide, a DNA molecule, a triplex-forming oligonucleotide, an RNA molecule, or a polypeptide. Exemplary polypeptides include, but are not limited to, transcription factors, recombinases, integrases, helicases, and DNA or RNA polymerases. Any of the aforementioned molecules can be either exogenous or endogenous. Alternatively, the second molecule can be a second fusion molecule, For example, a fusion polypeptide. In a preferred embodiment, the second molecule is a fusion polypeptide comprising a zinc finger DNA binding domain. The second fusion molecule can also comprise a transcriptional activation domain or a transcriptional repression domain.
- In certain embodiments of methods for modulating expression of a gene, a plurality of first fusion molecules, each having a distinct binding site in the gene, are contacted with cellular chromatin. Similarly, a plurality of second molecules, each having a distinct target site in the gene, can be contacted with cellular chromatin in the practice of methods to modulate expression of a gene. Thus, the disclosed methods for modulating expression of a gene can include the use of a single first fusion molecule and a single second molecule, a single first fusion molecule and a plurality of second molecules, a plurality of first fusion molecules and a single second molecule, and a plurality of first fusion molecules and a plurality of second molecules.
- In additional embodiments, expression of a plurality of genes is modulated according to the disclosed methods. This can be accomplished in several ways. In one embodiment, a plurality of first fusion molecules, each binding to a distinct binding site, wherein each distinct binding site is in a distinct gene, are contacted with cellular chromatin. One or more of the first fusion molecules can be a zinc finger fusion polypeptide comprising a zinc-finger DNA-binding domain. In certain embodiments, a first fusion molecule can bind to a shared binding site in two or more of the plurality of genes. In one embodiment of a method for modulating the expression of a plurality of genes, a single first fusion molecule binds to a shared binding site in all of the plurality of genes whose expression is modulated.
- Additional methods for modulating the expression of a plurality of genes involve contacting a plurality of second molecules with cellular chromatin, in combination with the contact of one or more first fusion molecules with cellular chromatin. Each of the plurality of second molecules can bind to a distinct target site, wherein each distinct target site is in a distinct gene. Alternatively, a single second molecule can bind to a shared target site in two or more different genes. In one embodiment, a single second molecule binds to a shared target site in all of the plurality of genes whose expression is modulated.
- In certain embodiments, to facilitate the binding of the fusion molecule to the cellular chromatin, one or more accessible regions within the region of interest are identified and one or more target sites for the DNA-binding portion of the fusion molecule are identified within the accessible region. In separate embodiments, the DNA-binding domain is capable of binding to nucleosomal DNA sequences and identification of an accessible region is not necessary. In the latter case, chromatin modification, as disclosed herein, often results in the generation of an accessible region in cellular chromatin in the region of interest, which can facilitate the binding of other molecules, either exogenous or endogenous. Exogenous molecules whose binding can be facilitated by the generation of an accessible region through chromatin modification include, but are not limited to, minor groove binders, major groove binders, intercalators, small molecule therapeutics, nucleic acids, and polypeptides, including fusion polypeptides, preferably comprising a zinc finger DNA-binding domain.
- Polynucleotides encoding fusions between a DNA-binding domain and a component of a chromatin remodeling complex, and methods for their construction, are also provided.
- Fusion polypeptides, comprising a DNA-binding domain and a component of a chromatin remodeling complex, and methods for producing such fusion polypeptides, are also provided. In one embodiment, such fusion polypeptides are produced by expressing a polynucleotide as described in the preceding paragraph in a suitable host cell.
- Methods for binding an exogenous molecule to cellular chromatin, wherein the methods comprise targeted modification of chromatin structure as disclosed herein, are also provided.
- FIG. 1 shows the PCR amplification scheme for production of constructs encoding the Veg1 and Veg3a DNA-binding domains.
- FIG. 2 shows the results of DNA-binding affinity determination for the Veg1 DNA-binding subdomain. FIG. 2A shows EMSA analysis. Unbound probe is at the bottom of the gel and shifted probe (bound to Veg1) is indicated by the arrow to the right of the gel photo. Concentration of Veg1 is given at the top. MBP-VEGF1 indicates a binding reaction in which 15 nM of the Veg1-maltose binding protein fusion was used.
- FIG. 3 shows an autoradiogram of a DNA gel, indicating the existence and location of DNaseI-hypersensitive sites in the human VEGF-A gene.
- FIG. 4 is a schematic diagram of the plasmid pSRC1b-EPO2C. The rightward-pointing arrow represents the start site of a transcription unit encoding a fusion protein that includes a nuclear localization signal (NLS), a ZFP binding domain targeted to nucleotide −862 of the human erythropoietin gene (EPO2c ZFP), a portion of the SRC1 protein from amino acids 781-1385 (SRC1b), and a FLAG epitope (Flag). pCMV represents a CMV promoter. Selected restriction enzyme recognition sites are also indicated.
- FIG. 5 shows erythropoietin (EPO) levels in transfected and control cells, as determined by ELISA. The bar labeled pSRC1b-EPO2C represents levels of EPO secreted into the medium by cells transfected with a plasmid encoding a fusion between an EPO-targeted ZFP and a portion of the SRC1 protein. pCDNA3.1 represents secreted EPO levels in cells transfected with a control plasmid that does not encode a ZFP-SRC1 fusion.
- FIG. 6 is a schematic diagram depicting the structure of a set of fusion molecules described in Example 13. NLS refers to a nuclear localization sequence, ZFP-DBD refers to the VEGF3a/l zinc finger DNA-binding domain, MBD refers to a portion of a methyl binding domain protein, DNMT refers to a portion of a DNA N-methyl transferase protein, and Flag refers to a FLAG epitope.
- FIG. 7 shows VEGF levels in transfected and control cells, as determined by ELISA. MOCK refers to cells transfected with a vector that does not contain a ZFP-MBD or ZFP-DNMT fusion. pEGFP-KRAB refers to cells transfected with a green fluorescent protein-encoding plasmid. PVF3a/1 refers to the VEGF3a/1 DNA binding domain described in Examples 3 and 13. MBD refers to various methyl binding domain proteins. DNMT refers to various DNA N-methyl transferases.
- Disclosed herein are compositions and methods useful for modifying chromatin structure in a predetermined region of interest in cellular chromatin. Modification of chromatin structure facilitates many processes involving nucleotide sequence-specific interaction of molecules with cellular chromatin. In certain embodiments, modification of chromatin structure is a prerequisite for binding of a regulatory molecule to its target site in cellular chromatin. Such binding can be useful in the regulation of an endogenous cellular gene by one or more endogenous and/or exogenous molecules.
- Regulation of gene expression often involves recruitment of a chromatin remodeling complex to a region of cellular chromatin (e.g., the promoter of a gene). Recruitment can occur, for example, by protein-protein interactions between a sequence-specific DNA-binding transcriptional regulatory protein bound at a promoter and a component of the remodeling complex. See, for example, Peterson et al. (2000)Curr. Opin. Genet. Devel. 10:187-192. Alterations in chromatin structure in the vicinity of the promoter, mediated by the recruited remodeling complex, facilitate subsequent interactions that result in transcriptional activation or repression. However, the region to which a remodeling complex can be localized is limited by the sequence specificity of the DNA-binding transcriptional regulatory protein, since most, if not all, protein components of chromatin remodeling complexes do not possess sequence-specific DNA-binding activity. Thus, it is not easy to target chromatin remodeling to a particular region of interest in cellular chromatin unless one possesses a protein that is: (1) capable of binding to chromatin in or near the region of interest, and (2) capable of interacting with at least one component of a multi-subunit chromatin remodeling complex.
- The methods and compositions disclosed herein allow targeted modification of any region of interest in cellular chromatin, by employing a fusion molecule comprising a DNA-binding domain and a component of a chromatin remodeling complex or functional fragment thereof. The DNA-binding domain is selected or designed to bind to a target site within or near the region of interest. Any DNA-binding entity having the requisite specificity is suitable. In a preferred embodiment, the DNA-binding domain is a zinc finger DNA-binding domain. Binding of the DNA-binding portion of the fusion molecule localizes the portion of the fusion molecule comprising a component of a chromatin remodeling complex to the region of binding, where it interacts with other components to reconstitute a functional chromatin remodeling complex in the vicinity of the target site. Chromatin remodeling ensues in the vicinity of the target site, which renders the region of binding (e.g., a gene promoter) susceptible to the action of endogenous regulatory factors, and/or to the regulatory activities of exogenous molecules.
- It will be apparent to one of skill in the art that targeted remodeling of chromatin will facilitate the regulation of many processes involving access of molecules to DNA in cellular chromatin including, but not limited to, replication, recombination, repair, transcription, telomere function and maintenance, sister chromatid cohesion, and mitotic chromosome segregation. For example, targeted integration of exogenous DNA into cellular chromatin will be enhanced by chromatin remodeling in the region of the desired integration site.
- The practice of the invention employs, unless otherwise indicated, conventional techniques in molecular biology, biochemistry, chromatin structure and analysis, computational chemistry, cell culture, recombinant DNA and related fields as are within the skill of the art. These techniques are fully explained in the literature. See, for example, Sambrook et al. MOLECULAR CLONING: A LABORATORY MANUAL, Second edition, Cold Spring Harbor Laboratory Press, 1989; Ausubel et al., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, New York, 1987 and periodic updates; the series METHODS IN ENZYMOLOGY, Academic Press, San Diego; Wolffe, CHROMATIN STRUCTURE AND FUNCTION, Third edition, Academic Press, San Diego, 1998; METHODS IN ENZYMOLOGY, Vol. 304, “Chromatin” (P. M. Wassarman and A. P. Wolffe, eds.), Academic Press, San Diego, 1999; and METHODS IN MOLECULAR BIOLOGY, Vol. 119, “Chromatin Protocols” (P. B. Becker, ed.) Humana Press, Totowa, 1999.
- The terms nucleic acid, polynucleotide, and oligonucleotide are used interchangeably and refer to a deoxyribonucleotide or ribonucleotide polymer in either single- or double-stranded form. For the purposes of the present disclosure, these terms are not to be construed as limiting with respect to the length of a polymer. The terms can encompass known analogues of natural nucleotides, as well as nucleotides that are modified in the base, sugar and/or phosphate moieties. In general, an analogue of a particular nucleotide has the same base-pairing specificity; i.e., an analogue of A will base-pair with T. The terms also encompasses nucleic acids containing modified backbone residues or linkages, which are synthetic, naturally occurring, and non-naturally occurring, which have similar binding properties as the reference nucleic acid, and which are metabolized in a manner similar to the reference nucleotides. Examples of such analogs include, without limitation, phosphorothioates, phosphoramidates, methyl phosphonates, chiral-methyl phosphonates, 2-O-methyl ribonucleotides, peptide-nucleic acids (PNAs).
- Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon substitutions) and complementary sequences, as well as the sequence explicitly indicated. Nucleic acids include, for example, genes, cDNAs, and mRNAs. Polynucleotide sequences are displayed herein in the conventional 5′-3′ orientation.
- Chromatin is the nucleoprotein structure comprising the cellular genome. Cellular chromatin comprises nucleic acid, primarily DNA, and protein, including histones and non-histone chromosomal proteins. The majority of eukaryotic cellular chromatin exists in the form of nucleosomes, wherein a nucleosome core comprises approximately 150 base pairs of DNA associated with an octamer comprising two each of histones H2A, H2B, H3 and H4; and linker DNA (of variable length depending on the organism) extends between nucleosome cores. A molecule of histone H1 is generally associated with the linker DNA. For the purposes of the present disclosure, the term “chromatin” is meant to encompass all types of cellular nucleoprotein, both prokaryotic and eukaryotic. Cellular chromatin includes both chromosomal and episomal chromatin.
- Chromatin modification, or chromatin remodeling, refers to any process by which the structure of chromatin or its constituents is altered. Remodeling can include, for example, removal or repositioning of nucleosomes, addition of nucleosomes, changes in nucleosome density, changes in the path of DNA along the histone octamer, and/or changes in higher-order chromatin structure such as, for example, unwinding of the chromatin solenoid. Chromatin modification can also include modifications to histones or nucleic acid which might not necessarily change the structure of chromatin as assayable by current methods. For example, acetylation or deacetylation of histones, as well as methylation or demethylation of nucleic acid, are instances of chromatin modification.
- A chromosome, as is known to one of skill in the art, is a chromatin complex comprising all or a portion of the genome of a cell. The genome of a cell is often characterized by its karyotype, which is the collection of all the chromosomes that comprise the genome of the cell. The genome of a cell can comprise one or more chromosomes.
- An episome is a replicating nucleic acid, nucleoprotein complex or other structure comprising a nucleic acid that is not part of the chromosomal karyotype of a cell. Examples of episomes include plasmids and certain viral genomes.
- A target site is a nucleic acid sequence that defines a portion of a nucleic acid to which a binding molecule will bind, provided sufficient conditions for binding exist. For example, the
sequence 5′-GAATTC-3′ is a target site for the Eco RI restriction endonuclease. Although binding of a molecule to its target site will generally occur in a naked nucleic acid molecule, a binding molecule may be incapable of binding to its target site in cellular chromatin, as a result of some aspect of the structure of the chromatin in which the target site is located which makes the target site inaccessible to the binding molecule. In other cases, factors in addition to a target site may be required for binding of a molecule to a nucleic acid at the target site. For instance, binding of a molecule to a polynucleotide comprising a target site may require both a particular nucleotide sequence and a particular protein composition adjacent to, or in the vicinity of, the target site. Conditions such as, for example, temperature, pH, and ionic strength can also affect binding of a molecule to its target site. - Target sites for various transcription factors are known. See, for example, Wingender et al. (1997)Nucleic Acids Res. 25:265-268 and the TRANSFAC Transcription Factor database at http://transfac.gbf.de/TRANSFAC/, accessed on Apr. 13, 2000. In general, target sites for newly-discovered transcription factors, as well as other types of exogenous molecule, can be determined by methods that are well-known to those of skill in the art such as, for example, electrophoretic mobility shift assay, exonuclease protection, DNase footprinting, chemical footprinting and/or direct nucleotide sequence determination of a binding site. See, for example, Ausubel et al., supra, Chapter 12.
- A binding site in cellular chromatin is a region at which a particular molecule, for example a protein, will bind to a target site in the chromatin. A binding site will generally comprise a target site, but not every target site will constitute a binding site in cellular chromatin. For example, a target site may be occluded by one or more chromosomal components, such as histones or nonhistone proteins, or might be rendered inaccessible to its binding molecule because of nucleosomal or higher-order chromatin structure. On the other hand, the presence of one or more chromosomal proteins may be required, in addition to a target site, to define a binding site.
- An accessible region is a site in a chromosome, episome or other cellular structure comprising a nucleic acid, in which a target site present in the nucleic acid can be bound by an exogenous molecule which recognizes the target site. Without wishing to be bound by any particular theory, it is believed that an accessible region is one that is not packaged into a nucleosomal structure. The distinct structure of an accessible region can often be detected by its sensitivity to chemical and enzymatic probes, for example, nucleases.
- An exogenous molecule is a molecule that is not normally present in a cell, but can be introduced into a cell by one or more genetic, biochemical or other methods. Normal presence in the cell is determined with respect to the particular developmental stage and environmental conditions of the cell. Thus, for example, a molecule that is present only during embryonic development of muscle is an exogenous molecule with respect to an adult muscle cell. Similarly, a molecule induced by heat shock is an exogenous molecule with respect to a non-heat-shocked cell. An exogenous molecule can comprise, for example, a functioning version of a malfunctioning endogenous molecule or a malfunctioning version of a normally-functioning endogenous molecule.
- An exogenous molecule can be, among other things, a small molecule, such as is generated by a combinatorial chemistry process, or a macromolecule such as a protein, nucleic acid, carbohydrate, lipid, glycoprotein, lipoprotien, polysaccharide, any modified derivative of the above molecules, or any complex comprising one or more of the above molecules. Nucleic acids include DNA and RNA, can be single- or double-stranded; can be linear, branched or circular; and can be of any length. Nucleic acids include those capable of forming duplexes, as well as triplex-forming nucleic acids. See, for example, U.S. Pat. Nos. 5,176,996 and 5,422,251. Proteins include, but are not limited to, DNA-binding proteins, transcription factors, chromatin remodeling factors, methylated DNA binding proteins, polymerases, methylases, demethylases, acetylases, deacetylases, kinases, phosphatases, integrases, recombinases, ligases, topoisomerases, gyrases and helicases.
- An exogenous molecule can be the same type of molecule as an endogenous molecule, e.g., protein or nucleic acid, providing it has a sequence that is different from an endogenous molecule. For example, an exogenous nucleic acid can comprise an infecting viral genome, a plasmid or episome introduced into a cell, or a chromosome that is not normally present in the cell. Methods for the introduction of exogenous molecules into cells are known to those of skill in the art and include, but are not limited to, lipid-mediated transfer (i.e., liposomes, including neutral and cationic lipids), electroporation, direct injection, cell fusion, particle bombardment, calcium phosphate co-precipitation, DEAE-dextran-mediated transfer and viral vector-mediated transfer.
- By contrast, an endogenous molecule is one that is normally present in a particular cell at a particular developmental stage under particular environmental conditions. For example, an endogenous nucleic acid can comprise a chromosome, the genome of a mitochondrion, chloroplast or other organelle, or a naturally-occurring episomal nucleic acid. Additional endogenous molecules can include proteins, for example, transcription factors and components of chromatin remodeling complexes.
- A fusion molecule is a molecule in which two or more subunit molecules are linked, preferably covalently. The subunit molecules can be the same chemical type of molecule, or can be different chemical types of molecules. Examples of the first type of fusion molecule include, but are not limited to, fusion polypeptides (for example, a fusion between a ZFP DNA-binding domain and a transcriptional activation domain) and fusion nucleic acids (for example, a nucleic acid encoding the fusion polypeptide described supra). Examples of the second type of fusion molecule include, but are not limited to, a fusion between a triplex-forming nucleic acid and a polypeptide, and a fusion between a minor groove binder and a nucleic acid. In a preferred embodiment, a fusion molecule is a nucleic acid which encodes a ZFP DNA-binding domain in operative linkage with a component of a chromatin remodeling complex or functional fragment thereof.
- A gene, for the purposes of the present disclosure, includes a DNA region encoding a gene product (see infra), as well as all DNA regions which regulate the production of the gene product, whether or not such regulatory sequences are adjacent to coding and/or transcribed sequences. Accordingly, a gene includes, but is not necessarily limited to, promoter sequences, terminators, translational regulatory sequences such as ribosome binding sites and internal ribosome entry sites, enhancers, silencers, insulators, boundary elements, replication origins, matrix attachment sites and locus control regions.
- Gene expression refers to the conversion of the information, contained in a gene, into a gene product. A gene product can be the direct transcriptional product of a gene (e.g., mRNA, tRNA, rRNA, antisense RNA, ribozyme, structural RNA or any other type of RNA) or a protein produced by translation of a mRNA. Gene products also include RNAs which are modified, by processes such as capping, polyadenylation, methylation, and editing, and proteins modified by, for example, methylation, acetylation, phosphorylation, ubiquitination, ADP-ribosylation, myristilation, and glycosylation.
- Modulation of gene expression refers to a change in the activity of a gene.
- Modulation of expression can include, but is not limited to, gene activation and gene repression. Modulation can be assayed by determining any parameter that is indirectly or directly affected by the expression of the target gene. Such parameters include, e.g., changes in RNA or protein levels; changes in protein activity; changes in product levels;
- changes in downstream gene expression; changes in transcription or activity of reporter genes such as, for example, luciferase, CAT, beta-galactosidase, or GFP (see, e.g., Mistili & Spector, (1997)Nature Biotechnology 15:961-964); changes in signal transduction; changes in phosphorylation and dephosphorylation; changes in receptor-ligand interactions; changes in concentrations of second messengers such as, for example, cGMP, cAMP, IP3, and Ca2+; changes in cell growth, changes in neovascularization, and/or changes in any functional effect of gene expression. Measurements can be made in vitro, in vivo, and/or ex vivo. Such functional effects can be measured by conventional methods, e.g., measurement of RNA or protein levels, measurement of RNA stability, and/or identification of downstream or reporter gene expression. Readout can be by way of, for example, chemiluminescence, fluorescence, colorimetric reactions, antibody binding, inducible markers, ligand binding assays; changes in intracellular second messengers such as cGMP and inositol triphosphate (IP3); changes in intracellular calcium levels; cytokine release, and the like.
- Gene activation is any process which results in an increase in production of a gene product. A gene product can be either RNA (including, but not limited to, mRNA, rRNA, tRNA, and structural RNA) or protein. Accordingly, gene activation includes those processes which increase transcription of a gene and/or translation of a mRNA. Examples of gene activation processes which increase transcription include, but are not limited to, those which facilitate formation of a transcription initiation complex, those which increase transcription initiation rate, those which increase transcription elongation rate, those which increase processivity of transcription and those which relieve transcriptional repression (by, for example, blocking the binding of a transcriptional repressor). Gene activation can constitute, for example, inhibition of repression as well as stimulation of expression above an existing level. Examples of gene activation processes which increase translation include those which increase translational initiation, those which increase translational elongation and those which increase mRNA stability. In general, gene activation comprises any detectable increase in the production of a gene product, preferably an increase in production of a gene product by about 2-fold, more preferably from about 2- to about 5-fold or any integer therebetween, more preferably between about 5- and about 10-fold or any integer therebetween, more preferably between about 10- and about 20-fold or any integer therebetween, still more preferably between about 20- and about 50-fold or any integer therebetween, more preferably between about 50- and about 100-fold or any integer therebetween, more preferably 100-fold or more.
- Gene repression is any process which results in a decrease in production of a gene product. A gene product can be either RNA (including, but not limited to, mRNA, rRNA, tRNA, and structural RNA) or protein. Accordingly, gene repression includes those processes which decrease transcription of a gene and/or translation of a mRNA. Examples of gene repression processes which decrease transcription include, but are not limited to, those which inhibit formation of a transcription initiation complex, those which decrease transcription initiation rate, those which decrease transcription elongation rate, those which decrease processivity of transcription and those which antagonize transcriptional activation (by, for example, blocking the binding of a transcriptional activator). Gene repression can constitute, for example, prevention of activation as well as inhibition of expression below an existing level. Examples of gene repression processes which decrease translation include those which decrease translational initiation, those which decrease translational elongation and those which decrease mRNA stability. Transcriptional repression includes both reversible and irreversible inactivation of gene transcription. In general, gene repression comprises any detectable decrease in the production of a gene product, preferably a decrease in production of a gene product by about 2-fold, more preferably from about 2- to about 5-fold or any integer therebetween, more preferably between about 5- and about 10-fold or any integer therebetween, more preferably between about 10- and about 20-fold or any integer therebetween, still more preferably between about 20- and about 50-fold or any integer therebetween, more preferably between about 50- and about 100-fold or any integer therebetween, more preferably 100-fold or more. Most preferably, gene repression results in complete inhibition of gene expression, such that no gene product is detectable.
- Accordingly, the terms modulating expression, inhibiting expression and activating expression of a gene can refer to the ability of a molecule to activate or inhibit transcription of a gene. Activation includes prevention of transcriptional inhibition (i.e., prevention of repression of gene expression) and inhibition includes prevention of transcriptional activation (i.e., prevention of gene activation).
- To determine the level of gene expression modulation by a ZFP, cells contacted with, for example, ZFPs can be compared to control cells, e.g., without the zinc finger protein or with a non-specific ZFP, to examine the extent of inhibition or activation. Control samples can be assigned a relative gene expression activity value of 100%. Modulation/inhibition of gene expression is achieved when the gene expression activity value relative to the control is about 80% or below, preferably 50% or below (i.e., 0.5x or less the activity of the control), more preferably 25% or below, more preferably 0-5%. Modulation/activation of gene expression is achieved when the gene expression activity value relative to the control is greater than 100%, preferably 110% or more, more preferably 150% or more (i.e., 1.5×the activity of the control or greater), more preferably 200-500% or more, still more preferably 1000-2000% or more.
- Eucaryotic cells include, but are not limited to, fungal cells (such as yeast), protozoal cells, plant cells, insect cells, animal cells, including avian cells, teleost cells, amphibian cells, reptilian cells, mammalian cells, canine cells, porcine cells, feline cells, murine cells, ovine cells, bovine cell, equine cells, primate cells and human cells.
- A region of interest is any region of cellular chromatin, such as, for example, a gene or a non-coding sequence within or adjacent to a gene, in which it is desirable to, for example, modify chromatin structure and/or bind an exogenous molecule. A region of interest can be present in a chromosome, an episome, an organellar genome (e.g., mitochondrial, chloroplast), or an infecting viral genome, for example. A region of interest can be within the coding region of a gene, within transcribed non-coding regions such as, for example, leader sequences, trailer sequences or introns, or within non-transcribed regions, either upstream or downstream of the coding region.
- The terms operable linkage, operative linkage, operably linked and operatively linked are used with reference to a juxtaposition of two or more components (such as sequence elements), in which the components are arranged such that both components function normally and allow the possibility that at least one of the components can mediate a function that is exerted upon at least one of the other components. By way of illustration, a transcriptional regulatory sequence, such as a promoter, is operatively linked to a coding sequence if the transcriptional regulatory sequence controls the level of transcription of the coding sequence in response to the presence or absence of one or more transcriptional regulatory factors. An operatively linked transcriptional regulatory sequence is generally joined in cis with a coding sequence, but need not be directly adjacent to it. For example, an enhancer can constitute a transcriptional regulatory sequence that is operatively-linked to a coding sequence, even though they are not contiguous.
- With respect to fusion polypeptides, the term operatively linked can refer to the fact that each of the components performs the same function in linkage to the other component as it would if it were not so linked. For example, with respect to a fusion polypeptide in which a ZFP DNA-binding domain is fused to a component of a chromatin remodeling complex (or functional fragment thereof), the ZFP DNA-binding domain and the component of the chromatin remodeling complex (or functional fragment thereof) are in operative linkage if, in the fusion polypeptide, the ZFP DNA-binding domain portion is able to bind its target site and/or its binding site, while the component of the chromatin remodeling complex (or functional fragment thereof) is able to interact with other members of its cognate chromatin remodeling complex.
- A functional fragment of a protein, polypeptide or nucleic acid is a protein, polypeptide or nucleic acid whose sequence is not identical to the full-length protein, polypeptide or nucleic acid, yet retains the same function as the full-length protein, polypeptide or nucleic acid. A functional fragment can possess more, fewer, or the same number of residues as the corresponding native molecule, and/or can contain one or more amino acid or nucleotide substitutions. Methods for determining the function of a nucleic acid (e.g., coding function, ability to hybridize to another nucleic acid) are well-known in the art. Similarly, methods for determining protein function are well-known. For example, the DNA-binding function of a polypeptide can be determined, for example, by filter-binding, electrophoretic mobility-shift, or immunoprecipitation assays. See Ausubel et al., supra. The ability of a protein to interact with another protein can be determined, for example, by co-immunoprecipitation, two-hybrid assays or complementation, both genetic and biochemical. See, for example, Fields et al. (1989)Nature 340:245-246; U.S. Pat. No. 5,585,245 and PCT WO 98/44350.
- The term recombinant, when used with reference to a cell, indicates that the cell replicates an exogenous nucleic acid, or expresses a peptide or protein encoded by an exogenous nucleic acid. Recombinant cells can contain genes that are not found within the native (non-recombinant) form of the cell. Recombinant cells can also contain genes found in the native form of the cell wherein the genes are modified and re-introduced into the cell by artificial means. The term also encompasses cells that contain a nucleic acid endogenous to the cell that has been modified without removing the nucleic acid from the cell; such modifications include those obtained by gene replacement, site-specific mutation, and related techniques. Thus, for example, recombinant cells express genes that are not found within the native (naturally occurring) form of the cell or express a second copy of a native gene that is otherwise normally or abnormally expressed, underexpressed or not expressed at all. Recombinant cells also include cells or cell lines derived from cells that have been modified as described.
- When used with reference, e.g., to a nucleic acid, protein, or vector, the term recombinant refers to nucleic acids, proteins or vectors that have been modified by the introduction of heterologous nucleic acid or amino acid sequence, and includes any other alterations of a native nucleic acid or protein.
- An expression vector is a nucleic acid construct, generated recombinantly or synthetically, with a series of specified nucleic acid elements that permit transcription of a particular nucleic acid in a host cell, and optionally integration and/or replication of the expression vector in a host cell. The expression vector can be part of a plasmid, viral genome, or nucleic acid fragment, of viral or non-viral origin. Expression vectors can be, for example, naked DNA molecules, or can comprise nucleic acid of viral or nonviral origin packaged into viral particles. Typically, the expression vector includes an expression cassette, which comprises a nucleic acid to be transcribed operably linked to control elements that are capable of effecting expression of a nucleic acid that is operatively linked to the control elements in hosts compatible with such sequences. Expression cassettes include at least promoters and optionally, transcription termination signals. Typically, a recombinant expression cassette includes at least a nucleic acid to be transcribed (e.g., a nucleic acid encoding a desired polypeptide) and a promoter. Additional factors necessary or helpful in effecting expression can also be used, for example, an expression cassette can also include nucleotide sequences that encode a signal sequence that directs secretion of an expressed protein from the host cell. Transcription termination signals, enhancers, and other nucleic acid sequences that influence gene expression can also be included in an expression cassette.
- The terms polypeptide, peptide and protein are used interchangeably herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which one or more amino acid residue is an analog or mimetic of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers. Polypeptides can be modified, e.g., by phosphorylation, methylation, myristilation, acetylation and/or the addition of carbohydrate residues to form glycoproteins. The terms polypeptide, peptide and protein include all of these modified polypeptides, as well as polypeptides comprising any additional covalent or non-covalent modification. Polypeptide sequences are displayed herein in the conventional N-terminal to C-terminal orientation.
- A subsequence or segment, when used in reference to a nucleic acid or polypeptide, refers to a sequence of nucleotides or amino acids that comprise a part of a longer sequence of nucleotides or amino acids (e.g., a polynucleotide or polypeptide), respectively.
- Specific binding between an antibody or other binding agent and an antigen, or between two binding partners, means that the dissociation constant for the interaction is less than 10−6 M. Preferred antibody/antigen or binding partner complexes have a dissociation constant of less than about 10−7 M, and preferably 10−8 M to 10−9 M or 10−10 M or lower.
- A binding domain or binding molecule is a compound that is able to bind, either covalently or non-covalently, to another molecule. The other molecule can be, for example, a polynucleotide (e.g., DNA or RNA) or a polypeptide. Binding domains can comprise any compound able to bind another molecule; exemplary binding domains are polypeptides and are denoted binding proteins. A binding protein can bind to, for example, a DNA molecule (a DNA-binding domain), an RNA molecule (an RNA-binding domain) and/or a protein molecule (a protein-binding domain). In the case of a protein-binding protein, it can bind to itself (to form homodimers, homotrimers, etc.) and/or it can bind to one or more molecules of a different protein or proteins. A binding domain can have more than one type of binding activity. For example, zinc finger proteins have DNA-binding, RNA-binding and protein-binding activity.
- A zinc finger binding protein is a protein or polypeptide that binds DNA, RNA and/or protein, preferably in a sequence-specific manner, as a result of stabilization of protein structure through coordination of a zinc ion. The term zinc finger binding protein is often abbreviated as zinc finger protein or ZFP. The individual DNA binding domains are typically referred to as fingers. A ZFP has least one finger, typically two fingers, three fingers, four fingers, five fingers, or six or more fingers. Each finger binds from two to four base pairs of DNA, typically three or four base pairs of DNA. A ZFP binds to a nucleic acid sequence called a target site or target segment. Each finger typically comprises an approximately 30 amino acid, zinc-chelating, DNA-binding subdomain. An exemplary motif characterizing one class of these proteins (C2H2 class) is -Cys-(X)2-4-Cys-(X)12-His-(X)3-5-His (where X is any amino acid). A single zinc finger of this class consists of an alpha helix containing the two invariant histidine residues and two beta sheets, which form a beta turn containing the two invariant cysteine residues. The two cysteine and two histidine residues coordinate a single zinc atom (see, e.g., Berg & Shi, Science 271:1081-1085 (1996)).
- Zinc finger proteins can be engineered to bind to predetermined sequences. Examples of zinc finger engineering include designed zinc finger proteins and selected zinc finger proteins. A designed zinc finger protein is a protein not occurring in nature whose structure and composition result principally from rational criteria. Rational criteria for design include application of substitution rules and computerized algorithms for processing information in a database storing information of existing ZFP designs and binding data, for example as described in PCT WO 98/53058, WO 98/53059, WO 99/53060 and WO 00/42219. A selected zinc finger protein is a protein not found in nature whose production results primarily from an empirical process such as phage display. See e.g., U.S. Pat. No. 5,789,538; U.S. Pat. No. 6,007,988; U.S. Pat. No. 6,013,453; WO 95/19431; WO 96/06166 WO 98/53057 and WO 98/54311.
- A target site or target sequence for a ZFP can be a nucleotide sequence (either DNA or RNA) or an amino acid sequence. A ZFP target site typically has about four to about ten base pairs, but can be as long as 18-20 base pairs, e.g., for a six-finger ZFP. Typically, a two-fingered ZFP recognizes a four to seven base pair target site, and a three-fingered ZFP recognizes a six to ten base pair target site. By way of example, a DNA target sequence for a three-finger ZFP is generally either 9 or 10 nucleotides in length, depending upon the presence and/or nature of cross-strand interactions between the ZFP and the target sequence. Target sequences can be found in any DNA or RNA sequence, including regulatory sequences, exons, introns, or any non-coding sequence.
- A target subsite or subsite is the portion of a DNA target site that is bound by a single zinc finger. Thus, in the absence of cross-strand interactions, a subsite is generally three nucleotides in length. In cases in which a cross-strand interaction occurs (e.g., a “D-able subsite,” as described for example in co-owned PCT WO 00/42219, incorporated by reference in its entirety herein) a subsite is four nucleotides in length and overlaps with another 3- or 4-nucleotide subsite.
- Kd refers to the dissociation constant for the compound, i.e., the concentration of a compound (e.g., a zinc finger protein) that gives half maximal binding of the compound to its target (i.e., half of the compound molecules are bound to the target) under given conditions (i.e., when [target]<<Kd), as measured using a given assay system (see, e.g., U.S. Pat. No. 5,789,538). Any assay system can be used, as long is it gives an accurate measurement of the actual kd. In one embodiment, the kd for a ZFP is measured using an electrophoretic mobility shift assay (“EMSA”), as described, for example, in WO 00/441566 and WO 00/42219.
- Administering an expression vector, nucleic acid, ZFP, or a delivery vehicle to a cell comprises transducing, transfecting, electroporating, translocating, fusing, phagocytosing, shooting or ballistic methods, etc., i.e., any means by which a protein or nucleic acid can be transported across a cell membrane and preferably into the nucleus of a cell.
- The term effective amount includes that amount which results in the desired result, for example, remodeling of cellular chromatin structure in a region of interest, repression of an active gene, activation of a repressed gene, or inhibition of transcription of a structural gene or translation of RNA.
- A delivery vehicle refers to a compound, e.g., a liposome, toxin, or a membrane translocation polypeptide, which is used to administer an exogenous molecule. Delivery vehicles can be used, for example, to administer nucleic acids encoding fusion molecules. Exemplary delivery vehicles include lipid:nucleic acid complexes, expression vectors, viruses, and the like.
- A promoter is defined as an array of nucleic acid control sequences that direct transcription. As used herein, a promoter typically includes necessary nucleic acid sequences near the start site of transcription, such as, in the case of certain RNA polymerase II type promoters, a TATA element, enhancer, CCAAT box, SP-1 site, etc.
- As used herein, a promoter also optionally includes distal enhancer or repressor elements, which can be located as much as several thousand base pairs from the start site of transcription. The promoters often have an element that is responsive to transactivation by a DNA-binding moiety such as a polypeptide, e.g., a nuclear receptor, Gal4, the lac repressor and the like.
- A constitutive promoter is a promoter that is active under most environmental and developmental conditions. An inducible promoter is a promoter that is active under certain environmental or developmental conditions.
- A regulatory domain or functional domain refers to a protein or a polypeptide sequence (or portion thereof) that has transcriptional modulation activity, or that is capable of interacting with proteins and/or protein domains that have transcriptional modulation activity. Such proteins include, e.g., transcription factors and co-factors (e.g., KRAB, MAD, ERD, SID, nuclear factor kappa B subunit p65, early
growth response factor 1, and nuclear hormone receptors, VP16, VP64), endonucleases, integrases, recombinases, methyltransferases, histone acetyltransferases, histone deacetylases and polypeptides which are components of a chromatin remodeling complex, and their functional fragments. A functional domain can be covalently or non-covalently linked to a DNA-binding domain (e.g., a ZFP) to modulate transcription of a gene of interest. Alternatively, some binding domains, such as for example ZFPs can act in the absence of a functional domain to modulate transcription. Furthermore, transcription of a gene of interest can be modulated by a binding domain, such as a ZFP, linked to multiple functional domains. - The term heterologous is a relative term, which when used with reference to portions of a nucleic acid indicates that the nucleic acid comprises two or more subsequences that are not found in the same relationship to each other in nature. For instance, a nucleic acid that is recombinantly produced typically has two or more sequences from unrelated genes synthetically arranged to make a new functional nucleic acid, e.g., a promoter from one source and a coding region from another source or a fusion of coding sequences from two different genes. The two nucleic acids are thus heterologous to each other in this context. When added to a cell, the recombinant nucleic acids would also be heterologous to the endogenous genes of the cell. Thus, in a cell, a heterologous nucleic acid would include a recombinant nucleic acid that has integrated into the chromosome, or a recombinant extrachromosomal nucleic acid.
- Similarly, a heterologous protein indicates that the protein comprises two or more subsequences that are not found in the same relationship to each other in nature (e.g., a fusion protein, wherein sequences from two or more different proteins are encoded by a single nucleic acid sequence). See, e.g., Ausubel, supra, for an introduction to recombinant techniques.
- A host cell is a cell that contains one or more exogenous molecules such as, for example, expression vectors and/or heterologous nucleic acids. The host cell typically supports the replication or expression of an expression vector. Host cells may be prokaryotic cells such as, for example,E. coli and B. subtilis, or eukaryotic cells such as fungal cells (e.g., yeast), protozoal cells, plant cells, insect cells, animal cells, avian cells, teleost cells, amphibian cells, mammalian cells, primate cells or human cells. Exemplary mammalian cell lines include CHO, HeLa, 293, COS-1, and the like, e.g., cultured cells (in vitro), explants and primary cultures (in vitro and ex vivo), and cells in vivo.
- The term amino acid refers to naturally occurring and synthetic amino acids, as well as amino acid analogues and amino acid mimetics that function in a manner similar to the naturally occurring amino acids. Naturally occurring amino acids are those encoded by the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, carboxyglutamate, and O-phosphoserine. Amino acid analogue refers to compounds that have the same basic chemical structure as a naturally occurring amino acid, i.e., an α carbon that is bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, norleucine, methionine sulfoxide, methionine, and methyl sulfonium. Such analogues have modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic chemical structure as a naturally occurring amino acid. Amino acid mimetics refers to chemical compounds that have a structure that is different from the general chemical structure of an amino acid, but that functions in a manner similar to a naturally occurring amino acid.
- Conservatively modified variants applies to both amino acid and nucleic acid sequences. With respect to particular nucleic acid sequences, conservatively modified variants refers to those nucleic acids which encode identical or essentially identical amino acid sequences, or where the nucleic acid does not encode an amino acid sequence, to essentially identical sequences. Specifically, degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues (Batzer et al.,Nucleic Acid Res. 19:5081 (1991); Ohtsuka et al., J. Biol. Chem. 260:2605-2608 (1985); Rossolini et al., Mol. Cell. Probes 8:91-98 (1994)). Because of the degeneracy of the genetic code, a large number of functionally identical nucleic acids encode any given protein. For instance, the codons GCA, GCC, GCG and GCU all encode the amino acid alanine. Thus, for example, at any position where an alanine is specified by a codon in an amino acid herein, the codon can be altered to any of the corresponding codons described without altering the encoded polypeptide. Such nucleic acid variations are silent variations, which are one species of conservatively modified variations. Every nucleic acid sequence herein which encodes a polypeptide also describes every possible silent variation of the nucleic acid. One of skill will recognize that each codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and TGG, which is ordinarily the only codon for tryptophan) can be modified to yield a functionally identical molecule. Accordingly, each silent variation of a nucleic acid which encodes a polypeptide is implicit in each described sequence.
- As to amino acid and nucleic acid sequences, individual substitutions, deletions or additions that alter, add or delete a single amino acid or nucleotide or a small percentage of amino acids or nucleotides in the sequence create a conservatively modified variant, wherein the alteration results in the substitution of an amino acid with a chemically similar amino acid. Conservative substitution tables providing functionally similar amino acids are well known in the art. Such conservatively modified variants are in addition to and do not exclude polymorphic variants and alleles. See, e.g., Creighton,Proteins (1984) for a discussion of amino acid properties.
- In certain embodiments, the compositions and methods disclosed herein involve fusions between a DNA-binding domain and a component of a chromatin remodeling complex. In additional embodiments, the compositions and methods disclosed herein involve fusions between a DNA-binding domain and a domain which participates in modulation of gene expression such as, for example a transcriptional activation domain or a transcriptional repression domain. A DNA-binding domain can comprise any molecular entity capable of sequence-specific binding to chromosomal DNA. Binding can be mediated by electrostatic interactions, hydrophobic interactions, or any other type of chemical interaction. Examples of moieties which can comprise part of a DNA-binding domain include, but are not limited to, minor groove binders, major groove binders, antibiotics, intercalating agents, peptides, polypeptides, oligonucleotides, and nucleic acids. An example of a DNA-binding nucleic acid is a triplex-forming oligonucleotide.
- Minor groove binders include substances which, by virtue of their steric and/or electrostatic properties, interact preferentially with the minor groove of double-stranded nucleic acids. Certain minor groove binders exhibit a preference for particular sequence compositions. For instance, netropsin, distamycin and CC-1065 are examples of minor groove binders which bind specifically to AT-rich sequences, particularly runs of A or T. WO 96/32496.
- Many antibiotics are known to exert their effects by binding to DNA. Binding of antibiotics to DNA is often sequence-specific or exhibits sequence preferences. Actinomycin, for instance, is a relatively GC-specific DNA binding agent.
- In a preferred embodiment, a DNA-binding domain is a polypeptide. Certain peptide and polypeptide sequences bind to double-stranded DNA in a sequence-specific manner. For example, transcription factors participate in transcription initiation by RNA Polymerase II through sequence-specific interactions with DNA in the promoter and/or enhancer regions of genes. Defmed regions within the polypeptide sequence of various transcription factors have been shown to be responsible for sequence-specific binding to DNA. See, for example, Pabo et al. (1992)Ann. Rev. Biochem. 61:1053-1095 and references cited therein. These regions include, but are not limited to, motifs known as leucine zippers, helix-loop-helix (HLH) domains, helix-turn-helix domains, zinc fingers, β-sheet motifs, steroid receptor motifs, bZIP domains homeodomains, AT-hooks and others. The amino acid sequences of these motifs are known and, in some cases, amino acids that are critical for sequence specificity have been identified. Polypeptides involved in other process involving DNA, such as replication, recombination and repair, will also have regions involved in specific interactions with DNA. Peptide sequences involved in specific DNA recognition, such as those found in transcription factors, can be obtained through recombinant DNA cloning and expression techniques or by chemical synthesis, and can be attached to other components of a fusion molecule by methods known in the art.
- Proteins containing methyl binding domains, or functional fragments thereof, can also be used as DNA-binding domains. Methyl binding domain proteins recognize and bind to CpG dinucleotide sequences in which the C residue is methylated. Proteins containing a methyl-binding domain include, but are not limited to, MBD1, MBD2, MBD3, MBD4, MeCP1 and MeCP2. See, for example, Bird et al. (1999)Cell 99:451-454.
- Additionally, DNA methyl transferases, which methylate the 5-position of C residues in CpG dinucleotides such as, for example, DNMT1, DNMT2, DNMT3a and DNMT3b, or functional fragments thereof, can be used as a DNA-binding domain. Furthermore, enzymes which demethylate methylated CpG, or functional fragments thereof, can be used as a DNA-binding domain. Fremant et al. (1997)Nucleic Acids Res. 25:2375-2380; Okano et al. (1998) Nature Genet. 19:219-220; Bhattacharya et al. (1999) Nature 397:579-583; and Robertson et al. (2000) Carcinogenesis 21:461-467.
- In a more preferred embodiment, a DNA-binding domain comprises a zinc finger DNA-binding domain. See, for example, Miller et al. (1985)EMBO J. 4:1609-1614; Rhodes et al. (1993) Scientific American Feb.:56-65; and Klug (1999) J. Mol. Biol. 293:215-218. In one embodiment, a target site for a zinc finger DNA-binding domain is identified according to site selection rules disclosed in co-owned WO 00/42219. ZFP DNA-binding domains are designed and/or selected to recognize a particular target site as described in co-owned WO 00/42219; WO 00/41566; and U.S. Ser. Nos. 09/444,241 filed Nov. 19, 1999 and 09/535,088 filed Mar. 23, 2000; as well as U.S. Pats. 5,789,538; 6,007,408; and 6,013,453; and PCT publications WO 95/19431, WO 98/54311, WO 00/23464 and WO 00/27878.
- Certain DNA-binding domains are capable of binding to DNA that is packaged in nucleosomes. See, for example, Cordingley et al. (1987)Cell 48:261-270; Pina et al. (1990) Cell 60:719-731; and Cirillo et al. (1998) EMBO J. 17:244-254. Certain ZFP-containing proteins such as, for example, members of the nuclear hormone receptor superfamily, are capable of binding DNA sequences packaged into chromatin. These include, but are not limited to, the glucocorticoid receptor and the thyroid hormone receptor. Archer et al. (1992) Science 255:1573-1576; Wong et al. (1997) EMBO J. 16:7130-7145. Certain binding domains are able to bind to internucleosomal (linker) DNA sequences. See, e.g., Zhang et al. (2000) J. Biol. Chem. 275:33,850-33,860. Other DNA-binding domains, including certain ZFP-containing binding domains, require more accessible DNA for binding. In the latter case, the binding specificity of the DNA-binding domain can be determined by identifying accessible regions in the cellular chromatin. Accessible regions can be determined as described in co-owned PCT/US01/40617, the disclosure of which is hereby incorporated by reference herein. A DNA-binding domain is then designed and/or selected to bind to a target site within the accessible region.
- Two major types of chromatin modification have been described. The first is dependent on covalent modification. Covalent modification of histones occurs by processes such as, for example, acetylation and deacetylation. Covalent modification of DNA is exemplified by methylation of cytosine residues in CpG dinucleotides. The second type of modification results in changes in nucleosome location and/or conformation, and relies on the activity of ATP-driven chromatin remodeling machines. Both types of chromatin modification are carried out in vivo by multiprotein complexes. For the purposes of the present disclosure, proteins involved in either of these types of chromatin modification can comprise a component of a chromatin remodeling complex.
- Modifications of the first type often comprise histone acetylation, catalyzed by a complex containing a histone acetyl transferase (HAT), or histone deacetylation, catalyzed by a complex containing a histone deacetylase (HDAC). An example of a complex involved in this type of chromatin modification is a histone deacetylase complex, examples of which include the SIN3 and Mi-2 complexes. Knoepfler et al. (1999)Cell 99:447-450. These complexes generally comprise one or more enzymatic components (i.e., a HDAC) as well as one or more non-enzymatic components. Thus, a component of a chromatin remodeling complex can be either an enzymatic or a non-enzymatic component (or a functional fragment of an enzymatic or non-enzymatic component) of a complex involved in the covalent modification of histones.
- Additional types of covalent modification of chromosomal proteins include, but are not limited to, methylation, demethylation, phosphorylation, dephosphorylation, ubiquitination, de-ubiquitination, ADP-ribosylation and de-ribosylation. Proteolysis of chromosomal proteins can also influence chromatin structure. Covalent modification of nucleosomal histones is the basis of a histone code that is involved in regulation of gene expression, at least in part through effects on chromatin structure. See, for example, Jenuwein et al. (2001)Science 293:1074-1080. Accordingly, proteins that participate in covalent modification of histones (such as, for example, histone kinases, histone phosphatases, histone methyl transferases, histone demethylases, SAM synthetases, HP1, Su(Var) proteins and E(var) proteins) and their functional fragments, comprise enzymatic components of chromatin remodeling complexes. These proteins, as well as proteins that interact with the aforementioned proteins and their functional fragments, are useful in the disclosed methods and compositions.
- Furthermore, certain proteins involved in histone modification and regulation of chromatin structure contain conserved domains: these include the bromodomain, the chromodomain, the SET domain, the SANT domain, and the PHD domain. Accordingly, any protein comprising one of these domains is useful as a component of a fusion with a DNA-binding domain for use in the disclosed methods and compositions.
- The second type of chromatin modification is mediated by multiprotein chromatin remodeling complexes, exhibits nucleosome-, histone- and/or DNA-dependent ATPase activity and catalyzes various types of modification of chromatin structure (see infra). Generally, a remodeling complex comprises an enzymatic component (an ATPase protein subunit) and one or more non-enzymatic protein subunits. ATPase subunits are grouped into three major families: the SWI/SNF family, the ISWI family, and the Mi-2/CHD family. See Tyler et al. (1999)Cell 99:443-446. A component of a chromatin remodeling complex can comprise one of its constituent proteins or a functional fragment thereof. Thus, a component of a chromatin remodeling complex can be an enzymatic component or a non-enzymatic component.
- Enzymatic components of chromatin remodeling complexes include, but are not limited to, the following ATPases: SWI2/SNF2, STH1, BRM, HBRM, BRG1, Mi-2/CHD, ISW1, ISW2, ISWI, and hSNF2h. Tyler et al., supra; Armstrong et al. (1998)Curr. Opin. Genet. Dev. 8:165-172; Guschin et al. (1999) Curr. Biol. 9:R742-746; and Wolffe et al. (2000) J. Struct. Biol. 129:102-122.
- Modifications in chromatin structure include those which render chromosomal sequences more accessible to regulatory factors (i.e., formation of “open” chromatin) as well as those which make chromosomal sequences less accessible (i.e., formation of “closed” chromatin). Such modifications can include, for example, removal of nucleosomes from DNA, deposition of nucleosomes onto DNA, repositioning of nucleosomes, changes in nucleosome spacing, changes in nucleosome density, changes in the degree and/or nature of the interaction between DNA and histones in the nucleosome, changes in the path of DNA along the surface of the nucleosome, and/or changes in higher-order chromatin structure such as, for example, unwinding of the chromatin solenoid.
- In certain embodiments, the compositions and methods disclosed herein involve fusions between a DNA-binding domain and a component of a chromatin remodeling complex, as described supra, or a polynucleotide encoding such a fusion.
- Various chromatin remodeling complexes, their components and their activities have been identified and characterized in several organisms and cell types. Complexes known as SWI/SNF, RSC, ISW1 and ISW2 have been isolated and characterized in yeast. In Drosophila, the NURF, CHRAC, ACF and brahma (dSWI/SNF or BRM) complexes have been isolated and characterized. Chromatin remodeling complexes from human cells named brm/BRG (hSWI/SNF), NURD and RSF have been isolated and characterized. See, for example, Cairns (1998)Trends Biochem. Sci. 23:20-25; Murchardt et al. (1999) J. Mol. Biol. 293:185-197; Kingston et al. (1999) Genes. Devel. 13:2339-2352 and their cited references. It is likely that, as the field progresses, additional chromatin remodeling complexes, and their components, will be discovered and characterized; the use of such newly-discovered components of chromatin remodeling complexes is contemplated by the present disclosure. Exemplary chromatin remodeling complexes and their components are now described.
- A. SWI/SNF
- The SWI/SNF chromatin remodeling complex of yeast comprises the SWI2/SNF2 helicase/ATPase and products of the SNF5, SWI3, SWP73, ARP7, ARP9, SWI1, SNF6, SWP82, SWP29 and SNF1 genes. Arp7 and Arp9 are actin-related proteins. (The SWP29 gene product is also known as either TFG-3 or TAF30.) Peterson et al. (2000)Curr. Opin. Genet. Devel. 10:187-192.
- B. SWI/SNF Homologues
- Several chromatin remodeling complexes, have been isolated based on their possession of a subunit with homology to SWI2. These include the Brahma (BRM) complex in Drosophila, the brm/BRG complexes in mammals, and others.
- 1. Brahma
- The Drosophila brahma (brm) complex (also known as dSWI/SNF) contains an ATPase subunit, homologous to SWI2/SNF2, called brahma (brm), as well as SNR1, BAP155 (moira), BAP60, BAP111, BAP55, BAP74 and BAP47/ACT1/ACT2 subunits.
- 2. Mammalian Brm/BRG Complexes
- In humans and mouse, several complexes comprising one of two SWI2/SNF2-homologous ATPases have been characterized. In mice, chromatin remodeling complexes contain either of the two SWI2/SNF2 homologues mBRM or mBRG-1, along with subunits named mSNF5 and mBAF60a. Similarly, in human cells, either of the two SWI2/SNF2 homologues hBRM (also known as hSNF2α) or BRG-1 (also known as hSNF2β) are present in chromatin remodeling complexes also containing the hSNF5 (also known as INI-1), hBAF170, hBAF155, hBAF60a (or hBAF60b or hBAF60c), hBAF57, β-actin, hBAF53, hBAF250 (also known as p270) and hBAF110 subunits.
- 3. Chromatin Remodeling Complexes Active in the Regulation of Human Globin Genes
- Several chromatin remodeling complexes have been discovered by virtue of their participation in the regulation of globin gene expression in human cells. These include E-RC1, comprising BRG-1 and the BAF57 protein, and the PYR complex, comprising hSNF5/INI1, BAF57, BAF60a, and BAF170.
- 4. RSC
- The RSC complex (“remodels the structure of chromatin”), first identified in yeast, is a 15-subunit complex comprising the SWI2/SNF2 homologous ATPase STH1, along with SFH-1, RSC-8, actin-related proteins, RSC-6 and SAS-5. Two recently characterized subunits of RSC, denoted Rsc1 and Rsc2, each contains two bromodomains, a BAH (“bromo adjacent homology”) domain and an A/T hook motif, and thus likely participates in the interaction between the RSC complex and chromatin. Cairns et al. (1999) Mol. Cell 4:715-723.
- 5. ATRX and Related Proteins
- A family of helicase/ATPase proteins with homology to SNF2 have been described. These proteins contain seven conserved domains and are involved in a range of cellular functions, including transcription, recombination and repair. The mammalian ATRX protein is an example of this group of proteins. See Picketts et al. (1996)Hum. Mol. Genet. 5:1899-1907.
- C. ISWI-containing Complexes
- Several chromatin remodeling machines, initially characterized in Drosophila cells; contain an ATPase subunit with homology to yeast SWI2, known as ISWI (“imitation SWI”).
- 1. NURF
- NURF (Nucleosome Remodeling Factor) is a complex of four polypeptides, isolated from Drosophila, that is capable of ATP-dependent remodeling of chromatin. Remodeling by NURF is Sarkosyl-sensitive and nucleosome-dependent (in particular, is dependent on histone tails), and can facilitate binding of transcription factors to chromatin. Tsukiyama et al. (1995) Cell 83:1011-1020. The components of NURF include ISWI (a SWI2-related DNA-dependent ATPase, also known as NURF-140), NURF-38, NURF-55 and NURF-215. Additional properties of the NURF complex are disclosed in Sandaltzopoulos et al. (1999) Meth. Enzymology 304:757-765 and references cited therein.
- 2. CHRAC
- CHRAC (Chromatin Accessibility Complex) possesses ATP-dependent nucleosome spacing activity and mediates ATP-dependent accessibility of chromatin to restriction endonucleases. Varga Weisz et al. (1995) EMBO J. 14:2209-2216; Varga Weisz et al. (1997) Nature 388:598-602. The CHRAC complex includes the ISWI ATPase and four additional polypeptides: p15, p20, p175 and DNA topoisomerase II.
- 3. ACF
- The ACF complex (ATP-utilizing chromatin assembly and remodeling factor), characterized in Drosophila, is able to facilitate the binding of transcriptional activators to chromatin and to affect nucleosome spacing. Ito et al. (1997) Cell 90:145-155. ACF contains the ISWI ATPase and three additional polypeptides: pl7, ACFI (p185) and ACFII (p170).
- 4. RSF
- The RSF complex (remodeling and spacing factor), found in human cells, contains the ISWI homologue hSNF2h and a subunit known as p325. Its activities include ATP-dependent nucleosome remodeling and spacing. LeRoy et al. (1998) Science 282:1900-1904.
- 5. ISW1
- Chromatin remodeling complexes in yeast, with ATPase subunits homologous to the Drosophila ISWI ATPase, include ISW1 and ISW2. ISW1 contains the ISW1 ATPase subunit, p74, p105 and p110. ISW1 has been characterized as possessing nucleosome-stimulated ATPase activity and ATP-dependent nucleosome disruption and spacing activities. Tsukiyama et al. (1999)Genes Dev. 13:686-697.
- 6. ISW2
- The yeast ISW2 complex contains the ISW2 ATPase along with a second subunit having a molecular weight of 140 kD. ISW2 possesses nucleosome-stimulated ATPase activity and ATP-dependent nucleosome disruption activity. Tsukiyama et al. (1999)supra.
- 7. WCRF
- The WCRF chromatin remodeling complex was isolated from human (HeLa) cells and contains an ISWI-homologous ATPase known as WCRF 135 (SNF2h) and a subunit known as WCRF 180. WCRF 180 has several hallmarks of a transcription factor, including a heterochromatin localization domain, a PHD finger (a cysteine-rich zinc-binding domain) and a bromodomain (a domain reported to be involved in interaction with histones). Bochar et al. (2000)Proc. Natl. Acad. Sci. USA 97:1038-1043; Jacobson et al. (2000) Science 288:1422-1425.
- D. Mi-2 Containing Complexes
- Chromatin remodeling complexes from human (NRD/NURD complex) and amphibian cells (Mi-2 complex) contain a nucleosome-dependent ATPase activity called Mi-2 (also known as CHD). Additional protein components of the amphibian Mi-2 complex include Mtal-like (a DNA-binding protein homologous to metastasis-associated protein), RPD3 (the amphibian homologue of histone deacetylases HDAC1 and HDAC2), RbAp48 (a protein which interacts with histone H4), and MBD3 (a protein containing a methylated CpG binding domain). The amphibian complex additionally contains a serine- and proline-rich subunit, p66. Activities of the amphibian Mi-2 complex include a nucleosome-dependent ATPase that is not stimulated by free histones or DNA, translational movement of histone octamers relative to DNA, and deacetylation of core histones within a nucleosome. Guschin et al. (2000)Biochemistry 39:5238-5245. Inasmuch as RbAp48 appears to comprise a key structural component of the Mi-2 complex, it is particularly suitable for fusion with a DNA-binding domain for use in the methods disclosed herein.
- Human NRD complexes contain, in addition to Mi-2, homologues of amphibian Mtal-like (MTA-2), RPD3 (HDAC1 and HDAC2), RbAp48 and MBD3, as well as additional proteins. See Zhang et al. (1999)Genes Dev. 13:1924-1935; and Kornberg et al. (1999) Curr. Opin. Genet. Dev. 9:148-151.
- E. DNA Methyl Transferases and Methylated DNA Binding Proteins
- As mentioned above, the methyl-binding-domain protein MBD3 is a component of Mi-2-containing chromatin remodeling complexes. MBD3 and related methyl binding domain proteins recognize and bind to CpG dinucleotide sequences in which the C residue is methylated. Thus MBD proteins are capable of recruiting histone deacetylases to regions of chromatin rich in methylated CpG. Accordingly, a MBD protein can comprise a component of a chromatin remodeling complex. Proteins containing a methyl-binding domain include, but are not limited to, MBD1, MBD2, MBD3, MBD4, MeCP1 and MeCP2. See, for example, Bird et al. (1999)Cell 99:451-454.
- Additionally, DNA methyl transferases, which methylate the 5-position of C residues in CpG dinucleotides such as, for example, DNMT1, DNMT2, DNMT3a and DNMT3b, can be used as components of a chromatin remodeling complex.
- Not all remodeling complexes have the same activities and the same effects on chromatin structure. It is possible that, as more sensitive assay methods are developed and/or more loosely-bound subunits or accessory factors are identified, the various chromatin remodeling complexes will be found to possess common activities. Accordingly, the activities attributed herein to individual chromatin remodeling complexes should not be construed as limiting.
- Nonetheless, it appears, from the information available to date, that each cell type contains a multiplicity of chromatin remodeling complexes which can share certain common subunits, and that the composition of a chromatin remodeling complex can vary with cell type. The number of polypeptide subunits in a chromatin remodeling complex varies over a wide range, from two in the ISW2 and RSF complexes to over 15 in the yeast RSC complex. It also appears to be the case that different chromatin remodeling complexes can have partially overlapping activities (i.e., that a degree of functional redundancy exists among different chromatin remodeling complexes). The present disclosure is therefore intended to embrace any and all polypeptides present in any type of chromatin remodeling complex, currently known or to be discovered.
- In the process of gene activation, binding of chromatin remodeling complexes to chromatin generally precedes binding of histone acetyl transferase (HAT) and/or histone deacetylase (HDAC) complexes, suggesting that HAT and HDAC complexes are recruited by the chromatin remodeling complex, or that remodeled chromatin is more conducive to binding of HAT and HDAC complexes. See, for example, Cosma et al. (1999)Cell 97:299-311; Krebs et al. (1999) Genes Dev. 13:1412-1421. Accordingly, in one embodiment of the claimed methods, chromatin modification facilitates binding of a HAT- or HDAC-containing complex. In this way, chromatin modification facilitates covalent modification of nucleosomal histones by acetylation or deacetylation. Histone acetylation is generally correlated with transcriptional activation; while deacetylation of histones is generally associated with transcriptional repression.
- Numerous HAT enzymes have been described, including budding yeast Gcn5p, which is required for expression of a subset of the yeast genome, its mammalian orthologue CREB-binding protein (CBP), p300 (both of the latter two used as coactivators by a wide variety of mammalian transcription factors), TAFII250 (a component of the basal transcriptional machinery), and steroid receptor coactivator 1 (SRC-1), which potentiates transcriptional activation by a number of nuclear hormone receptors. Kouzarides (1999) supra; Cheung et al. (2000) Curr. Opin. Cell Biol. 12:326-333; and Sterner et al. (2000) supra.
- Two major classes of functionally distinct HDACs have been identified in higher eukaryotes. Class I includes HDAC1, HDAC2 and HDAC3, which are homologous to the yeast Rpd3 histone deacetylase. Class II includes HDAC4, HDAC5 and HDAC6; and are homologous to the yeast Hda1 histone deacetylase. Ng et al., supra.
- In another embodiment, a ZFP DNA-binding domain is fused to a histone acetyl transferase or to a histone deacetylase, to effect chromatin modification in the form of covalent modification (acetylation or deacetylation) of histones. In yet another embodiment, modification of chromatin by a chromatin remodeling complex is followed by binding of a ZFP-HAT fusion or a ZFP-HDAC fusion, to establish an active or inactive chromatin state, respectively.
- In additional embodiments, a fusion between a DNA-binding domain and a protein that is a component of a HAT- or HDAC-containing complex is provided. In this way, it is possible to recruit HAT or HDAC activity to a region of interest in cellular chromatin, depending of the sequence specificity of the DNA-binding domain. HAT- and HDAC-containing complexes, and their component polypeptide subunits, have been described. See, for example, Grunstein (1997)Nature 389:349-352; Hartzog et al. (1997) Curr. Opin. Genet. Devel. 7:192-198; Kadonaga (1998) Cell 92:307-313; Kuo et al. (1998) BioEssays 20:615-626; Mizzzen et al. (1998) Cell. Mol. Life Sci. 54:6-20; Struhl (1998) Genes Devel. 12:599-606; Workman et al. (1998) Ann. Rev. Biochem. 67:545-579; Ng et al. (1999) Trends Biochem. Sci. 25:121-126; and Knoepfler et al. (1999) Cell 99:447-450. Accordingly components of HAT- and HDAC-containing complexes are well-known to those of skill in the art.
- For example, there are several HAT-containing complexes in yeast, one of which is the SAGA complex (Spt-Ada-Gcn5-acetyltransferase). Grant et al. (1997)Genes Devel. 11:1640-1650; Ikeda et al. (1999) Mol. Cell. Biol. 19:855-863.
- HDAC-containing complexes include the Sin3 complex, which is conserved in organisms from yeast to mammals. The components of the yeast Sin3 complex include Sin3p, RPD3 (a histone deacetylase), RbAp48, and RbAp46. The components of the mammalian Sin3 complex include mSin3A, mSin3B, HDAC1, HDAC2, RbAp48, RbAp49, SAP30 and SAP18. Zhang et al. (1998)Mol. Cell 1:1021-1031. Sin3 proteins from yeast, Drosophila, and vertebrates contain a PAH (paired amphipathic helices) domain, comprising four conserved repeats which form two amphipathic helices separated by a flexible linker. HDAC1, HDAC2 and RPD3 are histone deacetylases. The RbAp48 and RbAP49 proteins interact with histones. SAP30 and SAP18 are specificity determinants.
- Another HDAC-containing complex (which also possess chromatin remodeling activity, see supra) is the Mi-2 complex. Several Mi-2 complexes have been described in humans and amphibians. The mammalian Mi-2 complex (also known as NuRD) comprises the following polypeptides: Mi-2 (also known as CHD), HDAC1, HDAC2, MTA-2 and MBD3. See, for example, Ahringer (2000)Trends Genet. 16:351-356. The amphibian Mi-2 complex comprises Mi-2, Mta1-like (homologous to mammalian MTA2), p66, RbAp48, RPD3 and MBD3. Guschin et al. (2000) Biochemistry 39:5238-5245. Binding of the methylated DNA binding protein present in this complex (MBD3) to methylated CpG dinucleotides in upstream regulatory regions localizes the complex and its associated HDAC activity to methylated genes. Thus, it is believed that the Mi-2 complex is involved in the repression of genes whose upstream DNA is methylated at CpG dinucleotides
- Coactivators and corepressors which associate with the Sin3 complex to aid in targeting and in its interaction with receptors and other transcriptional regulatory proteins have been described. Examples include, but are not limited to, the vertebrate N-CoR, Rb and SMRT proteins and their homologues, as well as the Drosophila SMRTER and Groucho proteins and their homologues. For the purposes of the present disclosure, such coactivators and corepressors are considered to be components of chromatin remodeling complexes, inasmuch as they are capable of targeting various types of chromatin modification, if fused to a DNA-binding domain.
- For additional details and lists of HAT- and HDAC-containing complexes and proteins with which they interact, see Knoepfler et al., supra; Ng et al., supra; and Ahringer, supra.
- The thyroid hormone receptor (TR) is a member of the nuclear hormone receptor superfamily and is normally bound constitutively to its target genes. The effect of TR binding (i.e., either repression or activation of gene expression) ordinarily depends upon the presence or absence of its ligand, thyroid hormone (T3). In the absence of T3 the receptor generally represses gene expression to a level below the basal level. A number of proteins have been identified that are recruited by the unliganded receptor and are believed to constitute a repressive complex. Examples of such proteins include SMRT and NCoR, which interact directly with the receptor, as well as Sin3, which interacts with SMRT/NCoR. Sin3 also interacts with a number of histone deacetylases, for example,
HDACs 1 through 8 (some of which may also interact directly with TR). Recruitment of histone deacetylases by DNA-bound TR is believed to play a major role in its ability to confer repression; however, it is also possible that repressive factors other than HDACs are recruited by TR. - Binding of ligand to DNA-bound TR results in the decay of the repressive complex associated with the TR and recruitment of activating factors to the DNA-bound, ligand-bound TR. Such activating factors include, but are not limited to, the histone acetyltransferases SRC-1, CBP/p300 and P/CAF. Oligomeric activation complexes can also be recruited by ligand-bound TR, such as, for example, DRIP and ARC. Rachez et al. (1999)Nature 398:824-827; and Naar et al. (1999) Nature 398:828-832. These have been shown to interact with other nuclear hormone receptors, in response to ligand binding, and facilitate activation of gene expression in the context of a chromatin template. Another member of the nuclear receptor family, the glucocorticoid receptor (GR), recruits the hBRG1/BAF chromatin remodeling complex in response to ligand binding. Fryer et al. (1998) Nature 393:88-91.
- TR and related nuclear receptors are modular proteins comprising an amino-terminal region (of undefined function), a central DNA binding domain and a carboxy-terminal ligand binding domain (LBD). The LBD, in addition to binding hormone, is responsible for interactions with both the repressive and activating factors described above. When the LBD is fused to a heterologous DNA binding domain (Gal4), it mediates repression of a target promoter containing a Gal4 binding site. Collingwood et al. (1998)EMBO J. 17:4760-4770. In addition, T3-dependent activation of transcription can be achieved using a fusion of the TR LBD with the Gal4 DNA-binding domain Tone et al. (1994) J. Biol. Chem. 269:31,157-31,161.
- Knowledge of the structure of the LBD of TR and related nuclear receptors, together with the results of mutagenesis studies, can be used to design mutant receptors whose repression and activation activity are impervious to hormone concentration. For example, single amino acid mutants of TR that are unable to bind physiological levels of T3 (e.g. G344E, Λ430M, and Λ276I) recruit corepressors to their binding site. Collingwood et al. (1994)Mol. Endocrinol. 8:1262-1277; Collingwood et al. (1998) supra. Conversely, mutations causing conformational changes in the ligand binding domain that mimic those induced by hormone binding have been identified in the estrogen receptor (e.g. L536P and Y541D/E/A) and generate constitutively activating forms of the receptor. Eng et al. (1997) Mol. Cell. Biol. 17:4644-4653; White et al. (1997) EMBO J. 16:1427-1435.
- Accordingly, a mutant nuclear hormone receptor LBD derived, for example, from TR or GR can be used as a component of a fusion with a DNA-binding domain, to recruit activating or repressing protein complexes to a region of interest in cellular chromatin. Certain naturally-occurring mutant LBDs are available; and new mutants can be constructed by methods well-known to those of skill in the art. The site of action of such complexes is determined by the specificity of the DNA-binding domain; while their activity is determined by the nature of the mutation to the LBD and is independent of ligand concentration. For instance, a fusion comprising a LBD that has been mutated such that it is unable to bind hormone will facilitate formation of repressive complexes; while a fusion molecule comprising a LBD mutation that changes the conformation of the LBD such that it resembles a ligand-bound LBD will stimulate the formation of complexes that facilitate transcriptional activation.
- Thus, for the purposes of the present disclosure, a mutant nuclear hormone receptor LBD can be considered a component of a chromatin remodeling complex.
- The methods and compositions disclosed herein include fusion molecules comprising a DNA-binding domain and a component of a chromatin remodeling complex. The component of a chromatin remodeling complex can be either an enzymatic component or a non-enzymatic component. Without wishing to be bound by theory, it is believed that a fusion molecule comprising an enzymatic component will result in modification of a more limited region of cellular chromatin, compared to a fusion molecule comprising a non-enzymatic component. This is because, when the enzymatic component is directly fused to a DNA-binding domain, its activity is regionally restricted to the vicinity of the target site of the DNA-binding domain. (A degree of flexibility might be achieved by providing a linker sequence between the enzymatic component and the DNA-binding domain.) By contrast, if the fusion molecule comprises a non-enzymatic component, there are likely to be several proteins intervening between the DNA-binding domain (and, hence, the target site in the chromatin) and the enzymatic component of the reconstituted chromatin remodeling complex. This potentially allows a wider area of action of the enzymatic component, which could result in remodeling of more extensive sections of chromatin.
- Fusion molecules are constructed by methods of cloning and biochemical conjugation that are well-known to those of skill in the art. Fusion molecules comprise a DNA-binding domain and a component of a chromatin remodeling complex or a functional fragment thereof. Fusion molecules also optionally comprise nuclear localization signals (such as, for example, that from the SV40 medium T-antigen) and epitope tags (such as, for example, FLAG and hemagglutinin). Fusion proteins (and nucleic acids encoding them) are designed such that the translational reading frame is preserved among the components of the fusion. See Examples 2 and 4, infra for additional details on the construction of fusion molecules.
- Fusions between a polypeptide component of a chromatin remodeling complex (or a functional fragment thereof) on the one hand, and a non-protein DNA-binding domain (e.g., antibiotic, intercalator, minor groove binder, nucleic acid) on the other, are constructed by methods of biochemical conjugation known to those of skill in the art. See, for example, the Pierce Chemical Company (Rockford, Ill.) Catalogue. Methods and compositions for making fusions between a minor groove binder and a polypeptide have been described. Mapp et al. (2000)Proc. Natl. Acad. Sci. USA 97:3930-3935.
- In certain embodiments, a fusion between a polypeptide DNA-binding domain and a component of a chromatin remodeling complex (or functional fragment thereof) is encoded by a fusion nucleic acid. In such cases, the nucleic acid can be cloned into intermediate vectors for transformation into prokaryotic or eukaryotic cells for replication and/or expression. Intermediate vectors for storage or manipulation of the fusion nucleic acid or production of fusion protein can be prokaryotic vectors, (e.g., plasmids), shuttle vectors, insect vectors, or viral vectors for example. A fusion nucleic acid can also cloned into an expression vector, for administration to a bacterial cell, fungal cell, protozoal cell, plant cell, or animal cell, preferably a mammalian cell, more preferably a human cell.
- To obtain expression of a cloned fusion nucleic acid, it is typically subcloned into an expression vector that contains a promoter to direct transcription. Suitable bacterial and eukaryotic promoters are well known in the art and described, e.g., in Sambrook et al., supra; Ausubel et al., supra; and Kriegler,Gene Transfer and Expression: A Laboratory Manual (1990). Bacterial expression systems are available in, e.g., E. coli, Bacillus sp., and Salmonella. Palva et al. (1983) Gene 22:229-235. Kits for such expression systems are commercially available. Eukaryotic expression systems for mammalian cells, yeast, and insect cells are well known in the art and are also commercially available, for example, from Invitrogen, Carlsbad, Calif. and Clontech, Palo Alto, Calif.
- The promoter used to direct expression of a fusion nucleic acid depends on the particular application. For example, a strong constitutive promoter is typically used for expression and purification of a fusion protein. In contrast, when a fusion protein is used in vivo, either a constitutive or an inducible promoter is used, depending on the particular use of the fusion protein. In addition, a weak promoter can be used, such as HSV TK or a promoter having similar activity. The promoter typically can also include elements that are responsive to transactivation, e.g., hypoxia response elements, Gal4 response elements, lac repressor response element, and small molecule control systems such as tet-regulated systems and the RU-486 system. See, e.g., Gossen et al. (1992)Proc. Natl. Acad. Sci USA 89:5547-5551; Oligino et al.(1998) Gene Ther. 5:491-496; Wang et al. (1997) Gene Ther. 4:432-441; Neering et al. (1996) Blood 88:1147-1155; and Rendahl et al. (1998) Nat. Biotechnol. 16:757-761.
- In addition to a promoter, an expression vector typically contains a transcription unit or expression cassette that contains additional elements required for the expression of the nucleic acid in host cells, either prokaryotic or eukaryotic. A typical expression cassette thus contains a promoter operably linked, e.g., to the fusion nucleic acid sequence, and signals required, e.g., for efficient polyadenylation of the transcript, transcriptional termination, ribosome binding, and/or translation termination. Additional elements of the cassette may include, e.g., enhancers, and heterologous spliced intronic signals.
- The particular expression vector used to transport the genetic information into the cell is selected with regard to the intended use of the fusion polypeptide, e.g., expression in plants, animals, bacteria, fungi, protozoa etc. Standard bacterial expression vectors include plasmids such as pBR322, pBR322-based plasmids, pSKF, pET23D, and commercially available fusion expression systems such as GST and LacZ. Epitope tags can also be added to recombinant proteins to provide convenient methods of isolation, for monitoring expression, and for monitoring cellular and subcellular localization, e.g., c-myc or FLAG.
- Expression vectors containing regulatory elements from eukaryotic viruses are often used in eukaryotic expression vectors, e.g., SV40 vectors, papilloma virus vectors, and vectors derived from Epstein-Barr virus. Other exemplary eukaryotic vectors include pMSG, pAV009/A+, pMTO10/A+, pMAMneo-5, baculovirus pDSVE, and any other allowing expression of proteins under the direction of the SV40 early promoter, SV40 late promoter, metallothionein promoter, murine mammary tumor virus promoter, Rous sarcoma virus promoter, polyhedrin promoter, or other promoters shown effective for expression in eukaryotic cells.
- Some expression systems have markers for selection of stably transfected cell lines such as thymidine kinase, hygromycin B phosphotransferase, and dihydrofolate reductase. High-yield expression systems are also suitable, such as baculovirus vectors in insect cells, with a fusion nucleic acid sequence under the transcriptional control of the polyhedrin promoter or any other strong baculovirus promoter.
- Elements that are typically included in expression vectors also include a replicon that functions inE. coli (or in the prokaryotic host, if other than E. coli), a selective marker, e.g., a gene encoding antibiotic resistance, to permit selection of bacteria that harbor recombinant plasmids, and unique restriction sites in nonessential regions of the vector to allow insertion of recombinant sequences.
- Standard transfection methods can be used to produce bacterial, mammalian, yeast, insect, or other cell lines that express large quantities of fusion protein, which can be purified, if desired, using standard techniques. See, e.g., Colley et al. (1989)J. Biol. Chem. 264:17619-17622; and Guide to Protein Purification, in Methods in Enzymology, vol. 182 (Deutscher, ed.) 1990. Transformation of eukaryotic and prokaryotic cells are performed according to standard techniques. See, e.g., Morrison (1977) J. Bacteriol. 132:349-351; Clark-Curtiss et al. (1983) in Methods in Enzymology 101:347-362 (Wu et al., eds).
- Any procedure for introducing foreign nucleotide sequences into host cells can be used. These include, but are not limited to, the use of calcium phosphate transfection, DEAE-dextran-mediated transfection, polybrene, protoplast fusion, electroporation, lipid-mediated delivery (e.g., liposomes), microinjection, particle bombardment, introduction of naked DNA, plasmid vectors, viral vectors (both episomal and integrative) and any of the other well known methods for introducing cloned genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see, e.g., Sambrook et al., supra). It is only necessary that the particular genetic engineering procedure used be capable of successfully introducing at least one gene into the host cell capable of expressing the protein of choice.
- Conventional viral and non-viral based gene transfer methods can be used to introduce nucleic acids into mammalian cells or target tissues. Such methods can be used to administer nucleic acids encoding fusion polypeptides to cells in vitro. Preferably, nucleic acids are administered for in vivo or ex vivo gene therapy uses. Non-viral vector delivery systems include DNA plasmids, naked nucleic acid, and nucleic acid complexed with a delivery vehicle such as a liposome. Viral vector delivery systems include DNA and RNA viruses, which have either episomal or integrated genomes after delivery to the cell. For reviews of gene therapy procedures, see, for example, Anderson (1992)Science 256:808-813; Nabel et al. (1993) Trends Biotechnol. 11:211-217; Mitani et al. (1993) Trends Biotechnol. 11:162-166; Dillon (1993) Trends Biotechnol. 11:167-175; Miller (1992) Nature 357:455-460; Van Brunt (1988) Biotechnology 6(10):1149-1154; Vigne (1995) Restorative Neurology and Neuroscience 8:35-36; Kremer et al. (1995) British Medical Bulletin 51(1):31-44; Haddada et al., in Current Topics in Microbiology and Immunology, Doerfler and Böhm (eds), 1995; and Yu et al. (1994) Gene Therapy 1:13-26.
- Methods of non-viral delivery of nucleic acids include lipofection, microinjection, ballistics, virosomes, liposomes, immunoliposomes, polycation or lipid:nucleic acid conjugates, naked DNA, artificial virions, and agent-enhanced uptake of DNA. Lipofection is described in, e.g., U.S. Pat. Nos. 5,049,386; 4,946,787; and 4,897,355 and lipofection reagents are sold commercially (e.g., Transfectam™ and Lipofectin™). Cationic and neutral lipids that are suitable for efficient receptor-recognition lipofection of polynucleotides include those of Felgner, WO 91/17424 and WO 91/16024. Nucleic acid can be delivered to cells (ex vivo administration) or to target tissues (in vivo administration).
- The preparation of lipid:nucleic acid complexes, including targeted liposomes such as immunolipid complexes, is well known to those of skill in the art. See, e.g., Crystal (1995)Science 270:404-410; Blaese et al. (1995) Cancer Gene Ther. 2:291-297; Behr et al. (1994) Bioconjugate Chem. 5:382-389; Remy et al. (1994) Bioconjugate Chem. 5:647-654; Gao et al. (1995) Gene Therapy 2:710-722; Ahmad et al. (1992) Cancer Res. 52:4817-4820; and U.S. Pat. Nos. 4,186,183; 4,217,344; 4,235,871; 4,261,975; 4,485,054; 4,501,728; 4,774,085; 4,837,028 and 4,946,787.
- The use of RNA or DNA virus-based systems for the delivery of nucleic acids take advantage of highly evolved processes for targeting a virus to specific cells in the body and trafficking the viral payload to the nucleus. Viral vectors can be administered directly to patients (in vivo) or they can be used to treat cells in vitro, wherein the modified cells are administered to patients (ex vivo). Conventional viral based systems for the delivery of ZFPs include retroviral, lentiviral, poxviral, adenoviral, adeno-associated viral, vesicular stomatitis viral and herpesviral vectors. Integration in the host genome is possible with certain viral vectors, including the retrovirus, lentivirus, and adeno-associated virus gene transfer methods, often resulting in long term expression of the inserted transgene. Additionally, high transduction efficiencies have been observed in many different cell types and target tissues.
- The tropism of a retrovirus can be altered by incorporating foreign envelope proteins, allowing alteration and/or expansion of the potential target cell population. Lentiviral vectors are retroviral vector that are able to transduce or infect non-dividing cells and typically produce high viral titers. Selection of a retroviral gene transfer system would therefore depend on the target tissue. Retroviral vectors have a packaging capacity of up to 6-10 kb of foreign sequence and are comprised of cis-acting long terminal repeats (LTRs). The minimum cis-acting LTRs are sufficient for replication and packaging of the vectors, which are then used to integrate the therapeutic gene into the target cell to provide permanent transgene expression. Widely used retroviral vectors include those based upon murine leukemia virus (MuLV), gibbon ape leukemia virus (GaLV), simian immunodeficiency virus (SIV), human immunodeficiency virus (HIV), and combinations thereof. Buchscher et al. (1992)J. Virol. 66:2731-2739; Johann et al. (1992) J. Virol. 66:1635-1640; Sommerfelt et al. (1990) Virol. 176:58-59; Wilson et al. (1989) J. Virol. 63:2374-2378; Miller et al. (1991) J. Virol. 65:2220-2224; and PCT/US94/05700).
- Adeno-associated virus (AAV) vectors are also used to transduce cells with target nucleic acids, e.g., in the in vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy procedures. See, e.g., West et al. (1987)Virology 160:38-47; U.S. Pat. No. 4,797,368; WO 93/24641; Kotin (1994) Hum. Gene Ther. 5:793-801; and Muzyczka (1994) J. Clin. Invest. 94:1351. Construction of recombinant AAV vectors are described in a number of publications, including U.S. Pat. No. 5,173,414; Tratschin et al. (1985) Mol. Cell. Biol. 5:3251-3260; Tratschin, et al. (1984) Mol. Cell. Biol. 4:2072-2081; Hermonat et al. (1984) Proc. Natl. Acad. Sci. USA 81:6466-6470; and Samulski et al. (1989) J. Virol. 63:3822-3828.
- Recombinant adeno-associated virus vectors based on the defective and nonpathogenic parvovirus adeno-associated virus type 2 (AAV-2) are a promising gene delivery system. Exemplary AAV vectors are derived from a plasmid containing the AAV 145 bp inverted terminal repeats flanking a transgene expression cassette. Efficient gene transfer and stable transgene delivery due to integration into the genomes of the transduced cell are key features for this vector system. Wagner et al. (1998) (9117):1702-3; and Kearns et al. (1996) Gene Ther. 9:748-55.Lancet 351
- pLASN and MFG-S are examples are retroviral vectors that have been used in clinical trials. Dunbar et al. (1995)Blood 85:3048-305; Kohn et al. (1995) Nature Med. 1:1017-102; Malech et al. (1997) Proc. Natl. Acad. Sci. USA 94:12133-12138. PA317/pLASN was the first therapeutic vector used in a gene therapy trial. (Blaese et al. (1995) Science 270:475-480. Transduction efficiencies of 50% or greater have been observed for MFG-S packaged vectors. Ellem et al. (1997) Immunol Immunother. 44(1):10-20; Dranoff et al. (1997) Hum. Gene Ther. 1:111-2.
- In applications for which transient expression is preferred, adenoviral-based systems are useful. Adenoviral based vectors are capable of very high transduction efficiency in many cell types and are capable of infecting, and hence delivering nucleic acid to, both dividing and non-dividing cells. With such vectors, high titers and levels of expression have been obtained. Adenovirus vectors can be produced in large quantities in a relatively simple system.
- Replication-deficient recombinant adenoviral (Ad) can be produced at high titer and they readily infect a number of different cell types. Most adenovirus vectors are engineered such that a transgene replaces the Ad E1a, E1b, and/or E3 genes; the replication defector vector is propagated in
human 293 cells that supply the required E1 functions in trans. Ad vectors can transduce multiple types of tissues in vivo, including non-dividing, differentiated cells such as those found in the liver, kidney and muscle. Conventional Ad vectors have a large carrying capacity for inserted DNA. An example of the use of an Ad vector in a clinical trial involved polynucleotide therapy for antitumor immunization with intramuscular injection. Sterman et al. (1998) Hum. Gene Ther. 7:1083-1089. Additional examples of the use of adenovirus vectors for gene transfer in clinical trials include Rosenecker et al. (1996) Infection 24:5-10; Sterman et al., supra; Welsh et al. (1995) Hum. Gene Ther. 2:205-218; Alvarez et al. (1997) Hum. Gene Ther. 5:597-613; and Topf et al. (1998) Gene Ther. 5:507-513. - Packaging cells are used to form virus particles that are capable of infecting a host cell. Such cells include 293 cells, which package adenovirus, and ψ2 cells or PA317 cells, which package retroviruses. Viral vectors used in gene therapy are usually generated by a producer cell line that packages a nucleic acid vector into a viral particle. The vectors typically contain the minimal viral sequences required for packaging and subsequent integration into a host, other viral sequences being replaced by an expression cassette for the protein to be expressed. Missing viral functions are supplied in trans, if necessary, by the packaging cell line. For example, AAV vectors used in gene therapy typically only possess ITR sequences from the AAV genome, which are required for packaging and integration into the host genome. Viral DNA is packaged in a cell line, which contains a helper plasmid encoding the other AAV genes, namely rep and cap, but lacking ITR sequences. The cell line is also infected with adenovirus as a helper. The helper virus promotes replication of the AAV vector and expression of AAV genes from the helper plasmid. The helper plasmid is not packaged in significant amounts due to a lack of ITR sequences. Contamination with adenovirus can be reduced by, e.g., heat treatment, which preferentially inactivates adenoviruses.
- In many gene therapy applications, it is desirable that the gene therapy vector be delivered with a high degree of specificity to a particular tissue type. A viral vector can be modified to have specificity for a given cell type by expressing a ligand as a fusion protein with a viral coat protein on the outer surface of the virus. The ligand is chosen to have affinity for a receptor known to be present on the cell type of interest. For example, Han et al. (1995)Proc. Natl. Acad. Sci. USA 92:9747-9751 reported that Moloney murine leukemia virus can be modified to express human heregulin fused to gp70, and the recombinant virus infects certain human breast cancer cells expressing human epidermal growth factor receptor. This principle can be extended to other pairs of virus expressing a ligand fusion protein and target cell expressing a receptor. For example, filamentous phage can be engineered to display antibody fragments (e.g., Fab or Fv) having specific binding affinity for virtually any chosen cellular receptor. Although the above description applies primarily to viral vectors, the same principles can be applied to non-viral vectors. Such vectors can be engineered to contain specific uptake sequences thought to favor uptake by specific target cells.
- Gene therapy vectors can be delivered in vivo by administration to an individual patient, typically by systemic administration (e.g., intravenous, intraperitoneal, intramuscular, subdermal, or intracranial infusion) or topical application, as described infra. Alternatively, vectors can be delivered to cells ex vivo, such as cells explanted from an individual patient (e.g., lymphocytes, bone marrow aspirates, tissue biopsy) or universal donor hematopoietic stem cells, followed by reimplantation of the cells into a patient, usually after selection for cells which have incorporated the vector.
- Ex vivo cell transfection for diagnostics, research, or for gene therapy (e.g., via re-infusion of the transfected cells into the host organism) is well known to those of skill in the art. In a preferred embodiment, cells are isolated from the subject organism, transfected with a nucleic acid (gene or cDNA), and re-infused back into the subject organism (e.g., patient). Various cell types suitable for ex vivo transfection are well known to those of skill in the art. See, e.g., Freshney et al.,Culture of Animal Cells, A Manual of Basic Technique, 3rd ed., 1994, and references cited therein, for a discussion of isolation and culture of cells from patients.
- In one embodiment, hematopoietic stem cells are used in ex vivo procedures for cell transfection and gene therapy. The advantage to using stem cells is that they can be differentiated into other cell types in vitro, or can be introduced into a mammal (such as the donor of the cells) where they will engraft in the bone marrow. Methods for differentiating CD34+ stem cells in vitro into clinically important immune cell types using cytokines such a GM-CSF, IFN-γ and TNF-γ are known. Inaba et al. (1992)J. Exp. Med. 176:1693-1702.
- Stem cells are isolated for transduction and differentiation using known methods. For example, stem cells are isolated from bone marrow cells by panning the bone marrow cells with antibodies which bind unwanted cells, such as CD4+ and CD8+ (T cells), CD45+ (panB cells), GR-1 (granulocytes), and Iad (differentiated antigen presenting cells). See Inaba et al., supra.
- Vectors (e.g., retroviruses, adenoviruses, liposomes, etc.) containing therapeutic nucleic acids can be also administered directly to the organism for transduction of cells in vivo. Alternatively, naked DNA can be administered. Administration is by any of the routes normally used for introducing a molecule into ultimate contact with blood or tissue cells. Suitable methods of administering such nucleic acids are available and well known to those of skill in the art, and, although more than one route can be used to administer a particular composition, a particular route can often provide a more immediate and more effective reaction than another route.
- Pharmaceutically acceptable carriers are determined in part by the particular composition being administered, as well as by the particular method used to administer the composition. Accordingly, there is a wide variety of suitable formulations of pharmaceutical compositions of the present invention, as described below. See, e.g.,Remington's Pharmaceutical Sciences, 17th ed., 1989.
- In certain embodiments, one or more polypeptides, comprising a fusion between a DNA-binding domain and a component of a chromatin remodeling complex, can be introduced into a cell. An important factor in the administration of polypeptide compounds is ensuring that the polypeptide has the ability to traverse the plasma membrane of a cell, or the membrane of an intra-cellular compartment such as the nucleus. Cellular membranes are composed of lipid-protein bilayers that are freely permeable to small, nonionic lipophilic compounds and are inherently impermeable to polar compounds, macromolecules, and therapeutic or diagnostic agents. However, proteins, lipids and other compounds, which have the ability to translocate polypeptides across a cell membrane, have been described.
- For example, “membrane translocation polypeptides” have amphiphilic or hydrophobic amino acid subsequences that have the ability to act as membrane-translocating carriers. In one embodiment, homeodomain proteins have the ability to translocate across cell membranes. The shortest internalizable peptide of a homeodomain protein, Antennapedia, was found to be the third helix of the protein, from amino acid position 43 to 58. Prochiantz (1996)Curr. Opin. Neurobiol. 6:629-634. Another subsequence, the h (hydrophobic) domain of signal peptides, was found to have similar cell membrane translocation characteristics. Lin et al. (1995) J. Biol. Chem. 270:14255-14258.
- Examples of peptide sequences which can be linked to a fusion polypeptide for facilitating its uptake into cells include, but are not limited to: an 11 amino acid peptide of the tat protein of HIV; a 20 residue peptide sequence which corresponds to amino acids 84-103 of the p16 protein (see Fahraeus et al. (1996)Curr. Biol. 6:84); the third helix of the 60-amino acid long homeodomain of Antennapedia (Derossi et al. (1994) J. Biol. Chem. 269:10444); the h region of a signal peptide, such as the Kaposi fibroblast growth factor (K-FGF) h region (Lin et al., supra); and the VP22 translocation domain from HSV (Elliot et al. (1997) Cell 88:223-233). Other suitable chemical moieties that provide enhanced cellular uptake can also be linked, either covalently or non-covalently, to fusion polypeptides.
- Toxin molecules also have the ability to transport polypeptides across cell membranes. Often, such molecules (called “binary toxins”) are composed of at least two parts: a translocation or binding domain and a separate toxin domain. Typically, the translocation domain, which can optionally be a polypeptide, binds to a cellular receptor, facilitating transport of the toxin into the cell. Several bacterial toxins, including Clostridium perfringens iota toxin, diphtheria toxin (DT), Pseudomonas exotoxin A (PE), pertussis toxin (PT),Bacillus anthracis toxin, and pertussis adenylate cyclase (CYA), have been used to deliver peptides to the cell cytosol as internal or amino-terminal fusions. Arora et al. (1993) J. Biol. Chem. 268:3334-3341; Perelle et al. (1993) Infect. Immun. 61:5147-5156; Stenmark et al. (1991) J. Cell Biol. 113:1025-1032; Donnelly et al. (1993) Proc. Natl. Acad. Sci. USA 90:3530-3534; Carbonetti et al. (1995) Abstr. Annu. Meet. Am. Soc. Microbiol. 95:295; Sebo et al. (1995) Infect. Immun. 63:3851-3857; Klimpel et al. (1992) Proc. Natl. Acad. Sci. USA. 89:10277-10281; and Novak et al. (1992) J. Biol. Chem. 267:17186-17193.
- Such subsequences can be used to translocate polypeptides, including fusion polypeptides as disclosed herein, across a cell membrane. This is accomplished, for example, by derivatizing the fusion polypeptide with one of these translocation sequences, or by forming an additional fusion of the translocation sequence with the fusion polypeptide. Optionally, a linker can be used to link the fusion polypeptide and the translocation sequence. Any suitable linker can be used, e.g., a peptide linker.
- A fusion polypeptide can also be introduced into an animal cell, preferably a mammalian cell, via liposomes and liposome derivatives such as immunoliposomes. The term “liposome” refers to vesicles comprised of one or more concentrically ordered lipid bilayers, which encapsulate an aqueous phase. The aqueous phase typically contains the compound to be delivered to the cell.
- The liposome fuses with the plasma membrane, thereby releasing the compound into the cytosol. Alternatively, the liposome is phagocytosed or taken up by the cell in a transport vesicle. Once in the endosome or phagosome, the liposome is either degraded or it fuses with the membrane of the transport vesicle and releases its contents.
- In current methods of drug delivery via liposomes, the liposome ultimately becomes permeable and releases the encapsulated compound at the target tissue or cell. For systemic or tissue specific delivery, this can be accomplished, for example, in a passive manner wherein the liposome bilayer is degraded over time through the action of various agents in the body. Alternatively, active drug release involves using an agent to induce a permeability change in the liposome vesicle. Liposome membranes can be constructed so that they become destabilized when the environment becomes acidic near the liposome membrane. See, e.g.,Proc. Natl. Acad. Sci. USA 84:7851 (1987); Biochemistry 28:908 (1989). When liposomes are endocytosed by a target cell, for example, they become destabilized and release their contents. This destabilization is termed fusogenesis. Dioleoylphosphatidylethanolamine (DOPE) is the basis of many “fusogenic” systems.
- For use with the methods and compositions disclosed herein, liposomes typically comprise a fusion polypeptide as disclosed herein, a lipid component, e.g., a neutral and/or cationic lipid, and optionally include a receptor-recognition molecule such as an antibody that binds to a predetermined cell surface receptor or ligand (e.g., an antigen). A variety of methods are available for preparing liposomes as described in, e.g.; U.S. Pat. Nos. 4,186,183; 4,217,344; 4,235,871; 4,261,975; 4,485,054; 4,501,728; 4,774,085; 4,837,028; 4,235,871; 4,261,975; 4,485,054; 4,501,728; 4,774,085; 4,837,028; 4,946,787; PCT Publication No. WO 91/17424; Szoka et al. (1980)Ann. Rev. Biophys. Bioeng. 9:467; Deamer et al. (1976) Biochim. Biophys. Acta 443:629-634; Fraley, et al. (1979) Proc. Natl. Acad. Sci. USA 76:3348-3352; Hope et al. (1985) Biochim. Biophys. Acta 812:55-65; Mayer et al. (1986) Biochim. Biophys. Acta 858:161-168; Williams et al. (1988) Proc. Natl. Acad. Sci. USA 85:242-246; Liposomes, Ostro (ed.), 1983, Chapter 1); Hope et al. (1986) Chem. Phys. Lip. 40:89; Gregoriadis, Liposome Technology (1984) and Lasic, Liposomes: from Physics to Applications (1993). Suitable methods include, for example, sonication, extrusion, high pressure/homogenization, microfluidization, detergent dialysis, calcium-induced fusion of small liposome vesicles and ether-fusion methods, all of which are well known in the art.
- In certain embodiments, it may be desirable to target a liposome using targeting moieties that are specific to a particular cell type, tissue, and the like. Targeting of liposomes using a variety of targeting moieties (e.g., ligands, receptors, and monoclonal antibodies) has been previously described. See, e.g., U.S. Pat. Nos. 4,957,773 and 4,603,044.
- Examples of targeting moieties include monoclonal antibodies specific to antigens associated with neoplasms, such as prostate cancer specific antigen and MAGE. Tumors can also be diagnosed by detecting gene products resulting from the activation or over-expression of oncogenes, such as ras or c-erbB2. In addition, many tumors express antigens normally expressed by fetal tissue, such as the alphafetoprotein (AFP) and carcinoembryonic antigen (CEA). Sites of viral infection can be diagnosed using various viral antigens such as hepatitis B core and surface antigens (HBVc, HBVs) hepatitis C antigens, Epstein-Barr virus antigens, human immunodeficiency type-1 virus (HIV-1) and papilloma virus antigens. Inflammation can be detected using molecules specifically recognized by surface molecules which are expressed at sites of inflammation such as integrins (e.g., VCAM-1), selectin receptors (e.g., ELAM-1) and the like.
- Standard methods for coupling targeting agents to liposomes are used. These methods generally involve the incorporation into liposomes of lipid components, e.g., phosphatidylethanolamine, which can be activated for attachment of targeting agents, or incorporation of derivatized lipophilic compounds, such as lipid derivatized bleomycin. Antibody targeted liposomes can be constructed using, for instance, liposomes which incorporate protein A. See, Renneisen et al. (1990)J. Biol. Chem. 265:16337-16342 and Leonetti et al. (1990) Proc. Natl. Acad. Sci. USA 87:2448-2451.
- Fusion polypeptides as disclosed herein, and expression vectors encoding fusion polypeptides, can be used in conjunction with various methods of gene therapy to facilitate the action of a therapeutic gene product. In such applications, a fusion polypeptide can be administered directly to a patient, e.g., to facilitate the modulation of gene expression and for therapeutic or prophylactic applications, for example, cancer, ischemia, diabetic retinopathy, macular degeneration, rheumatoid arthritis, psoriasis, HIV infection, sickle cell anemia, Alzheimer's disease, muscular dystrophy, neurodegenerative diseases, vascular disease, cystic fibrosis, stroke, and the like. Examples of microorganisms whose inhibition can be facilitated through use of the methods and compositions disclosed herein include pathogenic bacteria, e.g., Chlamydia, Rickettsial bacteria, Mycobacteria, Staphylococci, Streptococci, Pneumococci, Meningococci and Conococci, Klebsiella, Proteus, Serratia, Pseudomonas, Legionella, Diphtheria, Salmonella, Bacilli (e.g., anthrax), Vibrio (e.g., cholera), Clostridium (e.g., tetanus, botulism), Yersinia (e.g., plague), Leptospirosis, and Borrellia (e.g., Lyme disease bacteria); infectious fungus, e.g., Aspergillus, Candida species; protozoa such as sporozoa (e.g., Plasmodia), rhizopods (e.g., Entamoeba) and flagellates (Trypanosoma, Leishmania, Trichomonas, Giardia, etc.);viruses, e.g., hepatitis (A, B, or C), herpes viruses (e.g., VZV, HSV-1, HHV-6, HSV-II, CMV, and EBV), HIV, Ebola, Marburg and related hemorrhagic fever-causing viruses, adenoviruses, influenza viruses, flaviviruses, echoviruses, rhinoviruses, coxsackie viruses, comaviruses, respiratory syncytial viruses, mumps viruses, rotaviruses, measles viruses, rubella viruses, parvoviruses, vaccinia viruses, HTLV viruses, retroviruses, lentiviruses, dengue viruses, papillomaviruses, polioviruses, rabies viruses, and arboviral encephalitis viruses, etc.
- Administration of therapeutically effective amounts of a fusion polypeptide or a nucleic acid encoding a fusion polypeptide is by any of the routes normally used for introducing polypeptides or nucleic acids into ultimate contact with the tissue to be treated. The fusion polypeptides or nucleic acids are administered in any suitable manner, preferably with pharmaceutically acceptable carriers. Suitable methods of administering such modulators are available and well known to those of skill in the art, and, although more than one route can be used to administer a particular composition, a particular route can often provide a more immediate and more effective reaction than another route.
- Pharmaceutically acceptable carriers are determined in part by the particular composition being administered, as well as by the particular method used to administer the composition. Accordingly, there is a wide variety of suitable formulations of pharmaceutical compositions. See, e.g.,Remington's Pharmaceutical Sciences, 17th ed. 1985.
- Fusion polypeptides or nucleic acids, alone or in combination with other suitable components, can be made into aerosol formulations (i.e., they can be “nebulized”) to be administered via inhalation. Aerosol formulations can be placed into pressurized acceptable propellants, such as dichlorodifluoromethane, propane, nitrogen, and the like.
- Formulations suitable for parenteral administration, such as, for example, by intravenous, intramuscular, intradermal, and subcutaneous routes, include aqueous and non-aqueous, isotonic sterile injection solutions, which can contain antioxidants, buffers, bacteriostats, and solutes that render the formulation isotonic with the blood of the intended recipient, and aqueous and non-aqueous sterile suspensions that can include suspending agents, solubilizers, thickening agents, stabilizers, and preservatives. Compositions can be administered, for example, by intravenous infusion, orally, topically, intraperitoneally, intravesically or intrathecally. The formulations of compounds can be presented in unit-dose or multi-dose sealed containers, such as ampoules and vials. Injection solutions and suspensions can be prepared from sterile powders, granules, and tablets of the kind known to those of skill in the art.
- Numerous activities of chromatin remodeling complexes have been described, including but not limited to the following. A characteristic activity of all chromatin remodeling complexes is nucleosome- or DNA-dependent ATPase activity. Chromatin remodeling complexes can facilitate binding of transcription factors to genes in a chromatin context and facilitate accessibility of sequences in chromatin to restriction enzymes and other nucleases. Certain remodeling complexes (those containing the ISWI ATPase) also possess the ability to assemble periodic nucleosome arrays (i.e. they are capable of spacing nucleosomes). Changes in DNA topology (i. e., degree of supercoiling) can also result from the action of chromatin remodeling complexes; these are believed to reflect either alterations of the path of DNA along the nucleosome or alterations in the path of linker DNA along the chromatin fiber. Chromatin remodeling complexes are also capable of transferring histones from chromatin to either DNA or protein acceptors. Stimulation of transcription initiation can also result from the action of chromatin remodeling complexes.
- Without wishing to be bound by any particular theory, the inventors recognize the possibility that the mechanism underlying all the above-mentioned activities may be the ability of chromatin remodeling complexes to promote nucleosome sliding or, more basically, to destabilize the histone-DNA interaction. Accordingly, any protein or multiprotein complex capable of destabilizing histone-DNA interactions and/or promoting nucleosome movement is suitable for use as a component of a chromatin remodeling complex.
- The various activities of chromatin remodeling complexes can be assayed by a number of techniques, as are known to those of skill in the art, and as have been described in publications disclosing the isolation and characterization of the various chromatin remodeling complexes, as set forth supra. See also Imblazano et al. (1994)Nature 370:481-485 and Cote et al. (1993) Science 265:53-60 for descriptions of assays involving facilitation of transcription factor binding. Assays involving nucleosome repositioning are described by, for example, Hamiche et al. (1999) Cell 97:833-842 and Guschin et al. (2000) Biochemistry 39:5238-5245. Accordingly, it is possible for one of skill in the art to determine whether a given multiprotein complex is a chromatin remodeling complex and to determine whether a particular polypeptide is a component of a chromatin remodeling complex or functional fragment thereof. Additional examples of assays for chromatin remodeling activity are provided infra and in publications such as Methods in Enzymology, Vol. 304, “Chromatin” (P. M. Wassarman and A. P. Wolffe, eds.), Academic Press, San Diego, 1999; and Methods in Molecular Biology, Vol. 119, “Chromatin Protocols” (P. B. Becker, ed.) Humana Press, Totowa, 1999. See also U.S. Pat. No. 5,972,608.
- An additional assay for chromatin modification is modulation of gene expression, when the modification is part of a two-step process in which chromatin modification allows binding of a molecule which modulates gene expression (e.g., a polypeptide comprising a fusion between a zinc finger DNA-binding domain and a transcriptional regulatory domain). Assays for gene modulation (e.g., transcriptional activation and/or repression, reporter gene activity, measurement of protein levels) are well-known to those of skill in the art and are described, for example, in co-owned WO 00/41566.
- The compositions and methods disclosed herein can be used to facilitate and/or modulate a number of processes involving cellular chromatin. These processes include, but are not limited to, transcription, replication, recombination, repair, integration, maintenance of telomeres, and processes involved in chromosome stability and disjunction. Accordingly, the methods and compositions disclosed herein can be used to affect any of these processes, as well as any other process which can be influenced by chromatin structure such as, for example, detection of specific sequences or sequence variants in cellular chromatin.
- Targeted modification of chromatin structure, as disclosed herein, can be used in processes such as, for example, therapeutic regulation of disease-related genes, engineering of cells for manufacture of protein pharmaceuticals, pharmaceutical discovery (including target discovery, target validation and engineering of cells for high throughput screening methods) and plant agriculture.
- For example, in one embodiment, chromatin modification facilitates access of one or more transcriptional regulatory factors, either endogenous or exogenous, to a target site in cellular chromatin, thereby participating in modulation of gene expression. Modulation of gene expression can include either increases or decreases in the level of gene expression. In another exemplary embodiment, chromatin modification increases the efficiency of recombination, thereby facilitating, for example, targeted integration of an exogenous nucleic acid.
- Thus, in certain embodiments, modification of chromatin is used to facilitate the modulation of gene expression. Modulation can include gene activation and gene repression, as well as more subtle increases or decreases in the level of gene expression. Activation of gene expression can be mediated, for instance, by the activity of a histone acetyl transferase that has been recruited to a region of interest by the methods and compositions disclosed herein. Repression of gene expression can be mediated, for instance, by the activity of a histone deacetylase that has been recruited to a region of interest by the methods and compositions disclosed herein. Without wishing to be bound by any particular theory, it is believed that modification of chromatin in the vicinity of a particular gene will make that gene's regulatory sequences more (or less, in the case of repression) accessible to transcriptional activators. Alternatively, chromatin modification could render regulatory sequences more accessible to transcriptional repressors or less accessible to positive transcriptional regulatory factors.
- Accordingly, expression of any gene in any organism can be modulated by chromatin modification as disclosed herein, including therapeutically relevant genes, genes of infecting microorganisms, viral genes, and genes whose expression is modulated in the process of target validation. Such genes include, but are not limited to, vascular endothelial growth factor (VEGF), VEGF receptors flt and flk, CCR-5, low density lipoprotein receptor (LDLR), estrogen receptor, HER-2/neu, BRCA-1, BRCA-2, phosphoenolpyruvate carboxykinase (PEPCK), CYP7, fibrinogen, apolipoprotein A (ApoA), apolipoprotein B (ApoB), renin, phosphoenolpyruvate carboxykinase (PEPCK), CYP7, fibrinogen, nuclear factor κB (NF-κB), inhibitor of NF-κB (I-κB), tumor necrosis factors (e.g., TNF-α, TNF-β), interleukin-1 (IL-1), FAS (CD95), FAS ligand (CD95L), atrial natriuretic factor, platelet-derived factor (PDF), amyloid precursor protein (APP), tyrosinase, tyrosine hydroxylase, β-aspartyl hydroxylase, alkaline phosphatase, calpains (e.g., CAPN10) neuronal pentraxin receptor, adriamycin response protein, apolipoprotein E (apoE), leptin, leptin receptor, UCP-1, IL-1, IL-1 receptor, IL-2, IL-3, IL-4, IL-5, IL-6, IL-12, IL-15, interleukin receptors, G-CSF, GM-CSF, colony stimulating factor, erythropoietin (EPO), platelet-derived growth factor (PDGF), PDGF receptor, fibroblast growth factor (FGF), FGF receptor, PAF, p16, p19, p53, Rb, p21, myc, myb, globin, dystrophin, eutrophin, cystic fibrosis transmembrane conductance regulator (CFTR), GNDF, nerve growth factor (NGF), NGF receptor, epidermal growth factor (EGF), EGF receptor, transforming growth factors (e.g., TGF-α, TGF-β), fibroblast growth factor (FGF), interferons (e.g., IFN-α, IFN-β and IFN-γ), insulin-related growth factor-1 (IGF-1), angiostatin, ICAM-1, signal transducer and activator of transcription (STAT), androgen receptors, e-cadherin, cathepsins (e.g., cathepsin W), topoisomerase, telomerase, bcl, bcl-2, Bax, T Cell-specific tyrosine kinase (Lck), p38 mitogen-activated protein kinase, protein tyrosine phosphatase (hPTP), adenylate cyclase, guanylate cyclase, α7 neuronal nicotinic acetylcholine receptor, 5-hydroxytryptamine (serotonin)-2A receptor, transcription elongation factor-3 (TEF-3), phosphatidylcholine transferase, ftz, PTI-1, polygalacturonase, EPSP synthase, FAD2-1, Δ-9 desaturase, Δ-12 desaturase, Δ-15 desaturase, acetyl-Coenzyme A carboxylase, acyl-ACP thioesterase, ADP-glucose pyrophosphorylase, starch synthase, cellulose synthase, sucrose synthase, fatty acid hydroperoxide lyase, and peroxisome proliferator-activated receptors, such as PPAR-γ2.
- Expression of human, mammalian, bacterial, fungal, protozoal, Archaeal, plant and viral genes can be modulated; viral genes include, but are not limited to, hepatitis virus genes such as, for example, HBV-C, HBV-S, HBV-X and HBV-P; and HIV genes such as, for example, tat and rev. Modulation of expression of genes encoding antigens of a pathogenic organism can be achieved using the disclosed methods and compositions.
- Additional genes include those encoding cytokines, lymphokines, interleukins, growth factors, mitogenic factors, apoptotic factors, cytochromes, chemotactic factors, chemokine receptors (e.g., CCR-2, CCR-3, CCR-5, CXCR-4), phospholipases (e.g., phospholipase C), nuclear receptors, retinoid receptors, organellar receptors, hormones, hormone receptors, oncogenes, tumor suppressors, cyclins, cell cycle checkpoint proteins (e.g.,Chk1, Chk2), senescence-associated genes, immunoglobulins, genes encoding heavy metal chelators, protein tyrosine kinases, protein tyrosine phosphatases, tumor necrosis factor receptor-associated factors (e.g., Traf-3, Traf-6), apolipoproteins, thrombic factors, vasoactive factors, neuroreceptors, cell surface receptors, G-proteins, G-protein-coupled receptors (e.g., substance K receptor, angiotensin receptor, α- and β-adrenergic receptors, serotonin receptors, and PAF receptor), muscarinic receptors, acetylcholine receptors, GABA receptors, glutamate receptors, dopamine receptors, adhesion proteins (e.g., CAMs, selectins, integrins and immunoglobulin superfamily members), ion channels, receptor-associated factors, hematopoietic factors, transcription factors, and molecules involved in signal transduction. Expression of disease-related genes, and/or of one or more genes specific to a particular tissue or cell type such as, for example, brain, muscle, heart, nervous system, circulatory system, reproductive system, genitourinary system, digestive system and respiratory system can also be modulated.
- For the purposes of the present disclosure, chromatin includes any cellular nucleoprotein structure. This can include, but is not limited to chromosomes (i.e., nuclear genomes), episomes, organellar nucleoproteins, such as mitochondrial and chloroplast genomes, and nucleoproteins associated with infecting bacterial or viral genomes. It is known that non-eukaryotic genomes are organized into nucleoprotein structures. In eukaryotic cells, the genome is enclosed in the nucleus. Accordingly, contact of a molecule with cellular chromatin includes introduction of the molecule into the nucleus of a cell.
- Cells include, but are not limited to, prokaryotic, eukaryotic and Archaeal cells. Eukaryotic cells include plant, fungal, protozoal and animal cells, including mammalian cells, primate cells and human cells.
- The fusion molecules disclosed herein comprise a DNA-binding domain which binds to a target site. In certain embodiments, the target site is present in an accessible region of cellular chromatin. Accessible regions can be determined as described in co-owned PCT/US01/40617. In additional embodiments, the DNA-binding domain of a fusion molecule is capable of binding to cellular chromatin regardless of whether its target site is in an accessible region or not. For example, such DNA-binding domains are capable of binding to linker DNA and/or nucleosomal DNA. Examples of this type of “pioneer” DNA binding domain are found in certain steroid receptor and in hepatocyte nuclear factor 3 (HNF3). Cordingley et al., supra; Pina et a., supra; and Cirillo et al., supra.
- Methods of chromatin modification in a region of interest can also be combined with methods involving binding of endogenous or exogenous transcriptional regulators in the region of interest to achieve modulation of gene expression. Modulation of gene expression can be in the form of repression as, for example, when the target gene resides in a pathological infecting microorganism or in an endogenous gene of the subject, such as an oncogene or a viral receptor, that contributes to a disease state. Alternatively, modulation can be in the form of activation, if activation of a gene (e.g., a tumor suppressor gene) can ameliorate a disease state. For such applications, an exogenous molecule can be formulated with a pharmaceutically acceptable carrier, as is known to those of skill in the art. See, for example, Remington's Pharmaceutical Sciences, 17th ed., 1985; and co-owned WO 00/42219.
- Thus, certain embodiments include the use of a fusion molecule comprising a DNA-binding domain and a component of a chromatin remodeling complex, to modify chromatin structure in a region of interest, in combination with a second molecule having transcriptional regulatory activity which binds in the region of interest after modification of chromatin structure in the region of interest. In certain embodiments, the second molecule comprises a fusion between a DNA-binding domain and either a transcriptional activation domain or a transcriptional repression domain. Any polypeptide sequence or domain capable of influencing gene expression, which can be fused to a DNA-binding domain, is suitable for use. Activation and repression domains are known to those of skill in the art and are disclosed, for example, in co-owned WO 00/41566.
- Exemplary activation domains include, but are not limited to, VP16, VP64, the p65 subunit of NF-kappa B, ligand-bound thyroid hormone receptor and its functional fragments, p300, CBP, PCAF,SRC1 PvALF, AtHD2A and ERF-2. See, for example, Robyr et al. (2000)Mol. Endocrinol. 14:329-347; Collingwood et al. (1999) J. Mol. Endocrinol. 23:255-275; Leo et al. (2000) Gene 245:1-11; Manteuffel-Cymborowska (1999) Acta Biochim. Pol. 46:77-89; McKenna et al. (1999) J. Steroid Biochem. Mol. Biol. 69:3-12; Malik et al. (2000) Trends Biochem. Sci. 25:277-283; and Lemon et al. (1999) Curr. Opin. Genet. Dev. 9:499-504. Additional exemplary activation domains include, but are not limited to, OsGAI, HALF-1, C1, AP1, ARF-5, -6, -7, and -8, CPRF1, CPRF4, MYC-RP/GP, and TRAB1. See, for example, Ogawa et al. (2000) Gene 245:21-29; Okanami et al. (1996) Genes Cells 1:87-99; Goff et al. (1991) Genes Dev. 5:298-309; Cho et al. (1999) Plant Mol. Biol. 40:419-429; Ulmason et al. (1999) Proc. Natl. Acad. Sci. USA 96:5844-5849; Sprenger-Haussels et al. (2000) Plant J. 22:1-8; Gong et al. (1999) Plant Mol. Biol. 41:33-44; and Hobo et al. (1999) Proc. Natl. Acad. Sci. USA 96:15,348-15,353.
- Exemplary repression domains include, but are not limited to, KRAB, SID, v-erbA, unliganded thyroid hormone receptor and its functional fragments, MBD2, MBD3, members of the DNMT family (e.g., DNMT1, DNMT3A, DNMT3B), Rb, and MeCP2. See, for example, Bird et al. (1999)Cell 99:451-454; Tyler et al. (1999) Cell 99:443-446; Knoepfler et al. (1999) Cell 99:447-450; and Robertson et al. (2000) Nature Genet. 25:338-342. Additional exemplary repression domains include, but are not limited to, ROM2 and AtHD2A. See, for example, Chem et al. (1996) Plant Cell 8:305-321; and Wu et al. (2000) Plant J. 22:19-27.
- It is likely that many transcriptional regulatory molecules, both endogenous and exogenous, are unable to interact with their target sites (and, hence, unable to exert their regulatory effects) when the target site is present in cellular chromatin. Without wishing to be bound by any particular theory, it is believed that chromatin modification in a region of interest can make such target sites accessible to their binding molecules. Accordingly, the methods and compositions disclosed herein complement methods of in vivo gene regulation using exogenous molecules, in those cases in which the target site for the exogenous molecule is not in an accessible region in cellular chromatin. Methods of gene regulation using exogenous molecules are disclosed, for example, in co-owned WO 00/41566. These include applications in regulation of plant gene expression, functional genomics and transgenic animals.
- Significant difficulties currently exist in therapeutic situations which require the reactivation of a developmentally-silenced gene. Developmentally-induced gene inactivation can be mediated by methylation of CpG islands in the upstream region of a gene. Thus, use of a binding domain specific for methylated DNA as the DNA-binding portion of a fusion can facilitate recruitment of a chromatin remodeling complex to the upstream region of a developmentally-silenced gene, making the gene accessible to exogenous regulatory factors, and resulting in therapeutic re-activation of the gene. In another embodiment, a fusion between a methylated DNA-binding domain and a demethylase can be used for reactivation of a gene silenced by methylation.
- The compositions and methods disclosed herein are useful in a variety of applications and provide advantages over existing methods. These include therapeutic methods in which an exogenous molecule is administered to a subject and used to modulate expression of a target gene within the subject. See, for example, co-pending WO 00/41566. The disclosed compositions and methods can also facilitate detection of particular sequences by binding of an exogenous molecule to a binding site in cellular chromatin as in, for example, diagnostic applications. Methods for detection of a target sequence using, for example, a ZFP are described in co-owned WO 00/42219. For example, an exogenous molecule, such as a sequence-specific DNA binding protein, can be used to detect variant alleles associated with a disease or with a particular phenotype in patient samples and to detect the presence of pathological microorganisms in clinical samples. In one embodiment, a variant allele comprises a single-nucleotide polymorphism (SNP). In a non-mutually exclusive embodiment, the sequence-specific DNA binding protein is a ZFP. Exogenous molecules can also be used to quantify copy number of a gene in a sample. For example, detection of the loss of one copy of a p53 gene in a clinical sample is an indicator of susceptibility to cancer. Additionally, identification of transgenic plants and animals can be accomplished through detection of a transgene using, for example, binding of a sequence-specific exogenous molecule (such as, for example, a ZFP) as an assay. All of these procedures can be enhanced by recruitment of a chromatin remodeling complex to a region of interest in cellular chromatin to facilitate binding of a binding molecule in the region of interest.
- The disclosed methods and compositions, when used in conjunction with methods of binding of exogenous molecules to cellular chromatin, can be used in assays to determine gene function and to determine changes in phenotype resulting from specific modulation of gene expression. See, for example, co-owned PCT WO 01/19981.
- The following examples are presented as illustrative of, but not limiting, the claimed subject matter.
- A zinc finger DNA-binding domain, which recognizes the human vascular endothelial growth factor-A (VEGF) gene, was designed and constructed according to design rules and methods disclosed in co-owned WO 00/42219, WO 00/41566, and co-owned U.S. Patent Applications Ser. Nos. 09/444,241 filed Nov. 19, 1999, and 09/535,088 filed Mar. 23, 2000. The target site, which overlaps the transcription initiation site for the human VEGF-A gene, is shown below as SEQ ID NO: 1, with the arrow indicating the transcription startsite.
□ 5′-GGGGAGGAT-3′ (SEQ ID NO:1) 3′-CCCCTCCTA-5′ - The human SP-1 zinc finger transcription factor was used as backbone for the construction of a designed three-finger DNA binding domain, Veg1, capable of recognizing this sequence. SP-1 has a three finger DNA-binding domain related to the well-studied murine zinc finger protein Zif268. Christy et al. (1988)Proc. Natl. Acad. Sci. USA 85:7857-7861. Site-directed mutagenesis experiments using this domain have shown that correlations between the amino acid sequence of a zinc finger and its target nucleotide sequence, derived from analyses of Zif268, are also applicable to SP-1 and hence can be used to adapt the specificity of SP-1 to DNA sequences other than its normal target site. Desjarlais et al. (1994) Proc. Natl. Acad. Sci. USA 91:11099-11103. The portion of the SP-1 sequence used for construction of designed zinc finger DNA binding domains corresponds to amino acids 533 to 624.
- Amino acid sequences of designed DNA-binding domains are illustrated in Table 1. As can be seen in the Table, the designed Veg1 protein comprises three zinc fingers (F1, F2 and F3) which together recognize a 9-base pair target site. The amino acid sequence of the recognition helix (positions −1 through +6, where +1 is the first amino acid in the α-helix) for each of the DNA-binding fingers is given.
TABLE 1 Target sites and ZFP DNA-binding domains in the human VEGF-A gene Name Target site Location AA sequence Veg 1 5′-GGGGAGGAT-3′ −8 to +1 F1: TTSNLRR (SEQ ID NO:3) (SEQ ID NO:2) F2: RSSNLQR (SEQ ID NO:4) F3: RSDHLSR (SEQ ID NO:5) Veg 3a 5′-GCGGAGGCT-3′ +3 to +11 F1: QSSDLQR (SEQ ID NO:7) (SEQ ID NO:6) F2: RSSNLQR (SEQ ID NO:8) F3: RSDELSR (SEQ ID NO:9) - A polymerase chain reaction (PCR)-based assembly procedure, using six overlapping oligonucleotides, was applied to the synthesis of a synthetic gene encoding the Veg1 DNA-binding domain. See FIG. 1. Three of the oligonucleotides (1, 3, and 5 in FIG. 1) correspond to “universal” sequences that encode portions of the DNA-binding domain between the recognition helices. These oligonucleotides are constant for any given zinc finger construct. The other three “specific” oligonucleotides (2, 4, and 6 in FIG. 1) were designed to encode the recognition helices. These oligonucleotides contained different sequences encoding amino acids at positions −1, +2, +3 and +6 in each recognition helix, depending on its target triplet sequence. Codon bias was chosen to allow expression in both mammalian cells and E. coli. Assembly of Veg1 coding sequences was carried out as follows. First, the six oligonucleotides (three universal and three specific, as described above) were combined and annealed at 25° C. to form a gapped DNA scaffold. Next, gaps were filled by conducting a four-cycle PCR reaction (using Taq and Pfu thermostable DNA polymerases) to generate a double-stranded template. This template was amplified (for thirty cycles) using a pair of external primers containing Kpn I and Hind III restriction sites. PCR products were directly cloned into the Kpn I and Hind III sites of the Tac promoter vector, pMal-c2 (New England Biolabs, Beverly, Mass.). The Veg1 zinc finger DNA-binding domain was expressed from this vector and purified as a fusion with the maltose binding protein according to the manufacturer's instructions (New England Biolabs, Beverly, Mass.).
- Accuracy of the Veg1 clone was verified by DNA sequencing. The Veg1 nucleotide and amino acid sequences are as follows.
Veg1 nucleotide sequence: KpnI GGTACCCATACCTGGCAAGAAGAAGCAGCACATCTGCCACATCCAGG (SEQ ID NO:10) GCTGTGGTAAAGTTTACGGCACAACCTCAAATCTGCGTCGTCACCTGCGCTGG CACACCGGCGAGAGGCCTTTCATGTGTACCTGGTCCTACTGTGGTAAACGCTT CACCCGTTCGTCAAACCTGCAGCGTCACAAGCGTACCCACACCGGTGAGAAG AAATTTGCTTGCCCGGAGTGTCCGAAGCGCTTCATGCGTAGTGACCACCTGTC CCGTCACATCAAGACCCACCAGAATAAGAAGGGTGGATCC BamHI Veg1 amino acid sequence VPIPGKKKQHICHIQGCGKVYGTTSNLRRHLRWHTGERPFMCTWSYCGK (SEQ ID NO:11) RFTRSSNLQRHKRTHTGEKKFACPECPKRFMRSDHLSRHIKTHQNKKGGS - Expression of designed ZFPs was carried out in two different systems. In the first, the DNA-binding peptides were expressed inE. coli by inserting them into the commercially available pET15b vector (Novagen). This vector contains a T7 promoter sequence to drive expression of the recombinant protein. Constructs were introduced into E. coli BL21/DE3 (lacIq) cells, which contain an IPTG-inducible T7 RNA polymerase. Cultures were supplemented with 50 μM ZnCl2, were grown at 37° C. to an OD at 600 mn of 0.5-0.6, and protein production was induced with IPTG for 2 hrs. These proteins are referred to as “unfused” ZFPs.
- Partially pure unfused ZFPs were produced as follows (adapted from Desjarlais et al. (1992)Proteins: Structure, Function and Genetics 12:101-104). A frozen cell pellet was resuspended in 1/50th volume of 1 M NaCl, 25 mM Tris-HCl (pH 8.0), 100 μM ZnCl2, 5 mM DTT. Samples were boiled for 10 min and centrifuged for 10 min at ˜3,000×g. At this point, ZFP protein in the supernatant was >50% pure (as estimated by staining of SDS-polyacrylamide gels with Coomassie blue), and the product migrated at the predicted molecular weight of around 11 kDa.
- The second method for producing ZFPs was to express them as fusions to theE. coli Maltose Binding Protein (MBP). N-terminal MBP fusions to ZFPs were constructed by PCR amplification of the pET15b clones and insertion into the vector pMal-c2 (New England Biolabs) under the control of the Tac promoter. The fusion allows simple purification and detection of recombinant protein. It had been reported previously that zinc finger DNA-binding proteins can be expressed from this vector in soluble form to high levels in E. coli and can bind efficiently to the appropriate DNA target without refolding. Liu et al. (1997) Proc. Natl. Acad. Sci. USA 94:5525-5530. Production of MBP-fused proteins was as described by the manufacturer (New England Biolabs, Beverly, Mass.). Transformants were grown in LB medium supplemented with glucose and ampicillin, and were induced with IPTG for 3 hrs at 37° C. The cells were lysed by French press, then exposed to an agarose-based amylose resin, which specifically binds to the MBP moiety, thus acting as an affinity resin for the MBP fusion protein. The MBP fusion protein was eluted with 10 mM maltose to release ZFP of >50% purity. In some cases, protein was further concentrated using a
Centricon 30 filter unit (Amicon). - Partially purified ZFPs (both unfused and MBP fusions) were tested by electrophoretic mobility shift assay (EMSA) to assess their ability to bind to their target DNA sequences. Protein concentrations were measured by Bradford assay (BioRad). Since SDS-polyacrylamide gels demonstrated >50% homogeneity of ZFP produced by either purification method, no adjustment was made for ZFP purity in the calculations. For this reason, the data generated by EMSA (shown below) represent an underestimate of the true affinity of the proteins for their targets (i.e., kd will be overestimated). In addition, inactive protein in the preparations could also contribute to an underestimate of the binding affinity of the active molecules in the preparation. Two separate preparations of protein were used for determination of kd, to help control for differences in ZFP activity.
- A 29-mer duplex oligonucleotide was used as a binding target for electrophoretic mobility shift analysis of Veg1. The sequence of the duplex (with VEGF sequences in bold and target site under/overlined) was as follows:
5′-CATGCATAGC GGGGAGGAT CGCCATCGAT-3′ (SEQ ID NO:12) 3′-GTACGTATCGCCCCTCCTAGCGGTAGCTA-5′ - The top strand was labeled, prior to annealing, with polynucleotide kinase and γ-32P ATP. Top and bottom strands were annealed in a reaction containing each oligonucleotide at 0.5 μM, 10 mM Tris-HCl (pH 8.0), 1 mM EDTA, and 50 mM NaCl. The mix was heated to 95° C. for 5 min and slow-cooled to 30° C. over 60 min. Duplex formation was confirmed by polyacrylamide gel electrophoresis. Free label and single stranded DNA remaining in the target preparations did not appear to interfere with the binding reactions.
- Assays for binding of Veg1 to the target oligonucleotide (above) were performed by titrating protein against a fixed amount of duplex target. Binding reactions contained 50
pM 5′ 32P labeled double stranded target DNA, 10 mM Tris-HCl (pH 7.5), 100 mM KCl, 1 mM MgCl2, 1 mM dithiothreitol, 10% glycerol, 200 μg/ml bovine serum albumin, 0.02% NP-40, 20 μg/ml poly dI-dC (optionally), and 100 μM ZnCl2, in a final volume of 20 μl. Protein was added to the binding reaction as one-fifth volume from a dilution series made in 200 mM NaCl, 20 mM Tris (pH 7.5), 1 mM DTT. Binding was allowed to proceed for 45 min at room temperature. Polyacrylamide gel electrophoresis was carried out at room temperature using precast 10% or 10-20% Tris-HCl gels (BioRad, Hercules, Calif.) and Tris-Glycine running buffer (25 mM Tris-HCl, 192 mM glycine, pH 8.3) containing 0.1 mM ZnCl2. Radioactive signals were quantitated with a Phosphorimager. - FIG. 2 shows the results of EMSA analysis of Veg1, using a four-fold dilution series of the Veg1 protein. Shifted product, indicative of labeled target with bound protein, is indicated by an arrow in FIG. 2A. The amount of shifted product was determined at each protein concentration and quantitated on a Phosphorimager (Molecular Dynamics). The relative signal (percent of maximal amount of shifted product) was plotted as a function of log10 protein concentration. In this case, the protein concentration yielding half-maximal binding of Veg1 to its target site (i.e., the apparent kd) was approximately 50 nM. MBP-fused and unfused versions of Veg1 bound to the target site with similar affinities.
- The Veg1 DNA binding domain is subcloned into a eukaryotic expression vector, in such a way that it is fused to the hBAF155 subunit of the brm/BRG chromatin remodeling complex. First, a cDNA sequence encoding a full length BAF155 protein is cloned using long range PCR. Barnes (1994)Proc. Natl. Acad. Sci. USA 91:2216-2220; Cheng et al. (1994) Proc. Natl. Acad. Sci. USA 91:5695-5699. Reagents and enzymes for performing long-range PCR are available from Roche Molecular Biochemicals (Indianapolis, Ind.) under the name “Expand PCR System.” The oligonucleotide primers are homologous to sequences just upstream of the translation initiation codon at nucleotide 55 and just downstream of the final codon (proline at nucleotide 3366). (The BAF15 numbering scheme refers to the Genbank Accession number U66615.) In addition, the primer upstream of nucleotide 55 contains a Bam-HI site positioned such that, when upstream sequences encoding the Veg1 DNA-binding domain are fused to BAF155 sequences, the translational reading frame is preserved. Furthermore, the primer downstream of nucleotide 3366 contains a HindIII site just downstream of the final codon of BAF155 positioned such that, if BAF155 sequences are fused to downstream sequences encoding a FLAG epitope tag, the translational reading frame is preserved.
- PCR is performed using cDNA from HeLa cells as template. Amplified product having a size of approximately 3400 base pairs is gel-purified and cloned directly into a Topo2 cloning vector (Invitrogen, Carlsbad, Calif.). Site-directed mutagenesis is used to eliminate the BamHI site at BAF155 position 2304, without altering the coding capacity or translational reading frame of the gene. A similar approach is used to eliminate the KpnI sites at nucleotides 2235 and 3243, and the HindIII sites at nucleotides 656 and 2365. The cloned and modified BAF155 gene is then removed from the Topo2 vector by digestion with BamHI and HindII, and gel-purified.
- The expression vector is modified from pcDNA3.1(−) (Invitrogen, Carlsbad, Calif.), by digesting it with EcoRI and HindIII, and inserting a double-stranded oligonucleotide encoding an EcoRI site, a translation initiation sequence (Kozak (1991)J. Biol. Chem. 266:19,867-19,870), a nuclear localization signal (NLS), a KpnI site, a BamHI site and a HindIII site. The NLS is derived from the SV40 large T-antigen (Kalderon et al. (1984) Cell 39:499-509), and has the amino acid sequence MAPKKKRKVGIHGV (SEQ ID NO: 13).
- This plasmid is then digested with BamHI and HindIII, and the BamHI-HindIII fragment comprising the BAF155 gene (supra) is inserted. A double-stranded oligonucleotide encoding a FLAG epitope (having the sequence DYKDDDDK, SEQ ID NO: 14), and containing HindIII sites at both ends is inserted concurrently with the BAF155-containing fragment. Alternatively, the FLAG-containing HindIII fragment can be inserted in a separate, subsequent ligation. The resulting construct comprises, in order, CMV immediate early promoter, EcoRI site, translation initiation sequence, SV40 large T-antigen nuclear localization sequence, KpnI site, BamHI site, HBAF155 coding sequence, HindIII site, FLAG epitope, HindIII site, bovine growth hormone (bGH) polyadenylation signal, in a pcDNA3.1 (Invitrogen, Carlsbad, Calif.) plasmid backbone. The CMV promoter and bGH polyadenylation signal are derived from the original pcDNA3.1 vector, as are sequences for replication and selection. Next, the Veg1 ZFP DNA-binding domain (see Example 1) is inserted, as a KpnI-BamHI fragment, into the vector described in the preceding paragraph to generate a vector encoding a protein having the structure (from N- to C-terminus): Nuclear localization sequence—Veg1 DNA binding domain—hBAF155-FLAG epitope tag. The integrity of these constructs, and the preservation of the reading frame, is confirmed at each step of the procedure by nucleotide sequence analysis. Upon transfection into mammalian cells this vector produces a NLS-Veg1-BAF155-FLAG fusion, whose transcription is controlled by a CMV immediate early promoter and a bovine growth hormone polyadenylation signal.
- Similar procedures are used to construct a plasmid encoding a fusion of a DNA-binding domain with any component of a chromatin remodeling complex. In brief, a polynucleotide encoding a component of a chromatin remodeling complex (or a functional fragment thereof) is obtained by PCR from cDNA (or optionally genomic DNA) using primers containing flanking BamHI and HindIII sites. BamHI, KpnI and HindIII sites, if present in the amplified product, are removed by site-directed mutagenesis, preserving the reading frame and coding capacity in the process. The amplified gene is introduced into a BamHI/HindIII-digested expression vector constructed as described above, optionally along with a HindIII fragment containing a FLAG epitope. The resulting construct is digested with KpnI and BamHI and a KpnI/BamHI fragment, encoding a DNA-binding domain, preferably a ZFP DNA-binding domain, is inserted. Sequences encoding nuclear localization sequences and FLAG epitopes, for immunological detection of the fusion protein, are optionally included in the construct.
- Plasmids encoding these fusions are propagated in any suitable host strain, preferablyE. coli strains JM109 or HB101.
- A zinc finger DNA-binding domain, which recognizes the human vascular endothelial growth factor-A (VEGF) gene, was designed and constructed according to design rules and methods disclosed in co-owned WO 00/42219, WO 00/41566, and co-owned U.S. patent applications Ser. Nos. 09/444,241 filed Nov. 19, 1999 and 09/535,088 filed Mar. 23, 2000. The target site, which overlaps the transcription initiation site for the human VEGF-A gene, is shown below as SEQ ID NO: 15, with the arrow indicating the transcription startsite.
□ 5′-GGGGAGGATCGCGGAGGCT-3′ (SEQ ID NO:15) 3′-CCCCTCCTAGCGCCTCCGA-5′ - Amino acids 533-624 of the human SP-1 zinc finger transcription factor were used as backbone for the construction of a designed six-finger DNA binding domain, Veg3a/1, capable of recognizing this sequence.
- Amino acid sequences of the designed DNA-binding domains are illustrated in Table 1. As can be seen in the Table, the designed Veg3a/1 protein comprises two subdomains, Veg1 and Veg3a, each comprising three zinc fingers (F1, F2 and F3) and each recognizing a 9-base pair subsite of the target site, joined by the linker sequence DGGGS (SEQ ID NO: 16). The amino acid sequence of the recognition helix (positions −1 through +6, where +1 is the first amino acid in the α-helix) for each of the DNA-binding fingers is given in Table 1.
- Synthesis of the VegI binding domain was described in Example 1. Assembly of the Veg3a coding sequences was carried out as described above for Veg1 (Example 1 and FIG. 1) except that different specific oligonucleotides were used to encode the Veg3a recognition helices.
- Accuracy of the Veg3a clone was verified by DNA sequencing. The Veg3a nucleotide and amino acid sequences are as follows.
Veg3a nucleotide sequence: KpnI GGTACCCATACCTGGCAAGAAGAAGCAGCACATCTGCCACATCCAGG (SEQ ID NO:17) GCTGTGGTAAAGTTTACGGCCAGTCCTCCGACCTGCAGCGTCACCTGCGCTG GCACACCGGCGAGAGGCCTTTCATGTGTACCTGGTCCTACTGTGGTAAACGCT TCACCCGTTCGTCAAACCTACAGAGGCACAAGCGTACACACACCGGTGAGAA GAAATTTGCTTGCCCGGAGTGTCCGAAGCGCTTCATGCGAAGTGACGAGCTG TCACGACATATCAAGACCCACCAGAACAAGAAGGGTGGATCC BamHI Veg3a amino acid sequence: VPIPGKKKQHICHIQGCGKVYG QSSDLQR HLRWHTGERPFMCTWSYCGK (SEQ ID NO:18). RFT RSSNLQR HKRTHTGEKKFACPECPKRFM RSDELSR HIKTHQNKKGGS - The recognition regions of the Veg3a polypeptide (amino acids −1 through +6 of the zinc finger recognition helices) are shown in bold underline.
- The purified Veg3a zinc finger DNA-binding domain is tested for affinity to its 20 DNA target site by electrophoretic mobility shift analysis. A double-stranded target oligonucleotide was constructed by annealing complementary 29-mers, then end-labeled using polynucleotide kinase and γ-32P-ATP. The sequence of the target (with VEGF sequences in bold and target sites under/overlined) was as follows:
5′-CATGCATATC GCGGAGGCT TGGCATCGAT-3′ (SEQ ID NO:19) 3′-GTACGTATAGCGCCTCCGAACCGTAGCTA-5′ - Veg1 and Veg3a binding subdomains were joined to each other, using the linker sequence DGGGS (SEQ ID NO: 16). This particular linker sequence was chosen because it permits binding of two three-finger ZFP binding subdomains to two 9-bp sites that are separated by a one nucleotide gap, as is the case for the Veg1 and Veg3a sites. See also Liu et al., supra.
- The 6-finger Veg3a/1 protein encoding sequence was generated as follows. Sequences encoding Veg3a recognition helices were PCR-amplified from the Veg3a-encoding vector (supra) using the primers SPE7 (5′-GAGCAGAATTCGGCAAGAAGAAGCAGCAC) (SEQ ID NO: 20) and SPEamp12 (5′-GTGGTCTAGACAGCTCGTCACTTCGC) (SEQ ID NO: 21) to generate a double-stranded fragment bounded by EcoRI and XbaI restriction sites (underlined in the sequences of the primers). The amplification product was digested with EcoRI and XbaI. Sequences encoding Veg1 recognition helices were PCR-amplified from the Veg1-encoding vector (Example 1) using the primers SPEamp13 (5′-GGAGCCAAGGCTGTGGTAAAGTTTACGG) (SEQ ID NO: 22) and SPEamp11 (5′-GGAGAAGCTTGGATCCTCATTATCCC) (SEQ ID NO: 23) to generate a double-stranded amplification product bounded by StyI and HindIII restriction sites (underlined in the sequences of the primers). The resulting amplification product was digested with StyI and HindIII. A third double-stranded fragment was constructed, using synthetic oligonucleotides, which encodes the DGGGS linker, flanked by the remainders of the Veg1 and Veg3a DNA-binding domains, and bounded by XbaI and StyI sites. The sequence of this third fragment is as follows, with the XbaI and StyI sites underlined:
XbaI 5′ CTAGA CACATCAAAACCCACCAGAACAAGAAAGACGGCGGTGGC 3′ T GTGTAGTTTTGGGTGGTCTTGTTCTTTCTGCCGCCACCG AGCGGCAAAAAGAAACAGCACATATGTCACATC 3′ SEQ ID NO:24 TCGCCGTTTTTCTTTGTCGTGTATACAGTGTAGGTTC 5′SEQ ID NO:25 StyI - These three fragments were ligated to one another, the ligation product was amplified with primers SPE7 and SPEamp11, and the resulting amplification product was digested with EcoRI and HindIII and cloned into the EcoRI and HindIII sites of pUC19.
- The linked Veg3a- and Veg 1-encoding sequence, in the pUC19 backbone, was then amplified with the primers GB19 (5′-GCCATGCCGGTACCCATACCTGGCAAGAAGAAGCAGCAC) (SEQ ID NO: 26) and GB10 (5′-CAGATCGGATCCACCCTTCTTATTCTGGTGGGT (SEQ ID NO: 27) to introduce KpnI and BamHI sites at the ends of the amplification products (restriction sites underlined). The amplification products were then digested with KpnI and BamHI, and cloned into the modified pMAL-c2 expression vector described above.
- The nucleotide sequence encoding the designed 6-finger ZFP Veg3a/1, from the KpnI site to the BamHI site is:
GGTACCCATACCTGGCAAGAAGAAGCAGCACATCTGCCACATCCAGGGCTGT (SEQ ID NO:28) GGTAAAGTTTACGGCCAGTCCTCCGACCTGCAGCGTCACCTGCGCTGGCACA CCGGCGAGAGGCCTTTCATGTGTACCTGGTCCTACTGTGGTAAACGCTTCACA CGTTCGTCAAACCTACAGAGGCACAAGCGTACACACACAGGTGAGAAGAAA TTTGCTTGCCCGGAGTGTCCGAAGCGCTTCATGCGAAGTGACGAGCTGTCTAG ACACATCAAAACCCACCAGAACAAGAAAGACGGCGGTGGCAGCGGCAAAAA GAAACAGCACATATGTCACATCCAAGGCTGTGGTAAAGTTTACGGCACAACC TCAAATCTGCGTCGTCACCTGCGCTGGCACACCGGCGAGAGGCCTTTCATGT GTACCTGGTCCTACTGTGGTAAACGCTTCACCCGTTCGTCAAACCTGCAGCGT CACAAGCGTACCCACACCGGTGAGAAGAAATTTGCTTGCCCGGAGTGTCCGA AGCGCTTCATGCGTAGTGACCACCTGTCCCGTCACATCAAGACCCACCAGAA TAAGAAGGGTGGATCC - The VEGF3a/1 amino acid sequence (using single letter code) is:
VPIPGKKKQHICHIQGCGKVYGQSSDLQRHLRWHTGERFMCTWSYCGKRFTRS (SEQ ID NO:29) SNLQRHKRTHTGEKKFACPECPKRFMRSDELSPHIKTHQNKKDGGGSGKKKQHI CHIQGCGKVYGTTSNLRRHLRWHTGERPFMCTWSYCGKRFTRSSNLQRHKRTH TGEKKFACPECPKRFMRSDHLSRHIKTHQNKKGGS - The Veg3a/1 protein was expressed inE. coli as an MBP fusion, purified by affinity chromatography, and tested in EMSA experiments as described supra. A labeled double-stranded oligonucleotide comprising the target site was prepared by synthesis and annealing of two overlapping oligonucleotides, one of which was labeled with 32P. The oligonucleotides comprised the following sequences (with the target site over/underlined):
AGCGAGCGGGGAGGATCGCGGAGGCTTGGGGCAGCCGGGTAG (SEQ ID NO:30) TCGCCCCTCCTAGCGCCTCCGAACCCCGTCGGCCCATCTCGC (SEQ ID NO:31) - Binding analysis was conducted as described in Example 1 for the Veg1 protein. Binding was allowed to proceed for 60 min at either room temperature or 37° C., and polyacrylamide gel electrophoresis was carried out at room temperature or 37° C. using precast 10% or 10-20% Tris-HCl gels (BioRad) and standard Tris-Glycine running buffer. The room temperature assays yielded an apparent kd (determined as described supra) for this Veg3a/1 protein of approximately 1.5 nM. When binding and electrophoresis were performed at 37° C., the apparent Kd of Veg3a/1 was approximately 9 nM when tested against the 18-bp target. Thus, the six finger Veg3a/l ZFP bound with high affinity to its target site.
- A plasmid encoding a fusion between the human MBD1 gene and the Veg3a/1 DNA-binding domain is constructed using methods similar to those described above for the BAF155/Veg1 fusion (Example 2). Sequences encoding MBD1 (GenBank accession No. NM015846) are isolated by PCR from genomic DNA or cDNA. Amplification primers are designed such that the primer corresponding to the upstream region of the gene comprises a BamHI Site at or near its upstream terminus, and the primer corresponding to the downstream region of the gene comprises a HindIII site at or near its downstream terminus. The primers are designed to amplify the region between nucleotides 140 (MBD1 initiation codon) and 1,957 (MBD1 termination codon), and to retain the correct reading frame of the MBD1 gene when the amplification product is incorporated as a component of a fusion gene. The amplification product is optionally cloned, a BamHI site at nucleotide 264 of the MBD1 sequence is removed by site-specific mutagenesis, and the BamHI/HindIII fragment is released from the cloning vector and purified. Sequences encoding the Veg3a/1 DNA-binding domain are obtained as a KpnI/BamHI fragment (Example 3). The MBD1-encoding BamHI/HindIII fragment and the Veg3a/1-coding KpnI/BamHI fragment are inserted into pcDNA3.1(−) or a modified derivative (Example 2). A nuclear localization signal and/or a FLAG epitope are optionally included in the fusion construct.
- The MBD1 gene can be divided into at least two functional fragments: a methylated DNA binding domain (encoded by nucleotides 158-322) and a functional domain. Accordingly, a MBD/ZFP fusion gene is constructed that lacks sequences encoding the methylated DNA-binding domain, but contains the functional domain of the MBD1 protein. In this case, the BamHI/HindIII-terminated amplification product comprises nucleotides 322 through 1,957 of the MBD1 gene.
- A similar fusion gene is constructed, in which the MBD2 gene (GenBank accession No. NM003927), or a functional fragment thereof, is fused to a ZFP DNA-binding domain. In this case, the amplification primers are designed to amplify the region between nucleotides 230 (MBD2 initiation codon) and 1,465 (MBD2 termination codon), and to retain the correct reading frame of the MBD2 gene when the amplification product is incorporated as a component of a fusion gene. The amplification product is optionally cloned, a KpnI site at nucleotide 813 and a HindIII site at nucleotide1308 of the MBD1 sequence are removed by site-specific mutagenesis, and the BamHI/HindIII fragment is released from the cloning vector and purified. Sequences encoding the Veg3a/1 DNA-binding domain are obtained as a KpnI/BamHI fragment (Example 3). The MBD2-encoding BamHI/HindIII fragment and the Veg3a/1-coding KpnI/BamHI fragment are inserted into pcDNA3.1(−) or a modified derivative (Example 2). A nuclear localization signal and/or a FLAG epitope are optionally included in the fusion construct.
- The methylated DNA-binding domain of the MBD2 gene is encoded by nucleotides 680-862. Accordingly, a MBD/ZFP fusion gene is constructed that lacks sequences encoding the methylated DNA-binding domain, but contains the functional domain of the MBD2 protein, by designing the amplification primers to amplify the region of the MBD2 gene located between nucleotides 862 and 1,465. As in previous examples, the amplification primers comprise BamHI and HindIII sites at or near their termini, to maintain the MBD2 reading frame and facilitate construction of the fusion protein by the methods described supra. In this case, the HindIII site at nucleotide 1,308 is removed subsequent to amplification and prior to construction of the fusion nucleic acid.
- Human embryonic kidney cells (HEK 293) are grown in DMEM (Dulbecco's modified Eagle medium) supplemented with 10% fetal calf serum. Cells are plated in 10 cm dishes at a density of 2.5×106 per plate and grown for 24 hours in a CO2 incubator at 37° C. For transfection, 10 μg of plasmid DNA is diluted in 2.5 ml Opti-MEM (Life Technologies), and 50 μl of
Lipofectamine 2000 is diluted in 2.5 ml Opti-MEM. The diluted DNA and lipid are mixed and incubated for 20 minutes at room temperature. Medium is then removed from the cells and replaced with the lipid/DNA mixture. Cells are incubated at 37° C. for 3 hours in a CO2 incubator, then 10 ml of DMEM+10% FBS is added. Cells are harvested 40 hours after transfection for analysis of chromatin structure (Example 6) and gene expression (Example 7). - Intercalator-protein fusions, MGB-protein fusions and/or TFO-protein fusions are introduced into cells after encapsulation into liposomes, using standard procedures that are well-known in the art.
- Recruitment of a chromatin remodeling complex to a region of interest in cellular chromatin, by a fusion molecule comprising a DNA-binding domain and a component of a chromatin remodeling complex, is evidenced by alteration of chromatin structure in the region of interest. Alteration of chromatin structure mediated by the Veg1-BAF155 fusion molecule described supra (Example 2) is assessed by investigating nuclease hypersensitive sites in the vicinity of the Veg1 binding site, as described in this example.
- Transformed human
embryonic kidney 293 cells are grown in DMEM+10% fetal calf serum, supplemented with penicillin and streptomycin, in a 37° C. incubator at 5% CO2. Typically, two 255 cm2 plates of cells are used in an experiment. When the cells reach greater than 90% confluence (˜2.5×107 cells per plate), medium is removed and the cells are rinsed twice with 5 ml of ice-cold PBS (Gibco/Life Technologies, Gaithersburg, Md.). Cells are then scraped from the plates in 5 ml of ice-cold PBS and combined in a 50 ml conical centrifuge tube. The plates are washed with 10 ml of ice-cold PBS and the washes are added to the tube. Nuclei are pelleted by centrifugation (1400 rpm for 5 min) and the supernatant is removed. The pellet is mixed by vortexing and, while vortexing, 20 ml of lysis buffer (10 mM Tris pH 7.5, 1.5 mM MgCl2, 10 mM KCl, 0.5% IGEPAL CA-630 (Sigma), 1 mM phenylmethylsulfonyl fluoride, 1 mM dithiothreitol) is added. The cell pellet is resuspended in lysis buffer by pipetting and the tube is centrifuged at 1400 rpm for 5 min. The supernatant is removed and the pellet is resuspended in 20 ml of lysis buffer and centrifuged as before. The final pellet is resuspended in 1.5 ml dilution buffer (15 mM Tris pH 7.5, 60 mM KCl, 15 mM NaCl, 5 mM MgCl2, 0.1 mM dithiothreitol, 10% glycerol), nuclei are counted in a microscope and the solution is adjusted so that a concentration of approximately 107 nuclei per ml is obtained. - Nuclei, at a concentration of 107 per ml in dilution buffer, are digested with different concentrations of DNase I. DNase I dilutions are prepared by diluting deoxyribonuclease I (Worthington, Freehold, N.J.) in dilution buffer (supra), optionally supplemented with 0.4 mM CaCl2. To 100 μl of resuspended nuclei is added 25 μl of a DNase I dilution to give final DNase I concentrations ranging from 0.07 Units/ml to 486 Units/ml in three-fold concentration increments. Digestions are conducted at room temperature for 5 min. Digestion reactions are then stopped by addition of 125 μl of Buffer AL (Qiagen DNeasy™ Tissue Kit) and 12.5 μl of a 20 mg/ml solution of Proteinase K (Qiagen DNeasy™ Tissue Kit), followed by incubation at 70° C. for 10 min. Digested DNA is purified using the DNeasy™ Tissue Kit (Qiagen, Valencia, Calif.) according to the manufacturer's instructions.
- Purified DNase-treated DNA is digested with restriction enzyme at 37° C. overnight with 40 Units of restriction enzyme in the presence of 0.4 mg/ml RNase A. After digestion, DNA is ethanol-precipitated from 0.3 M sodium acetate.
- Micrococcal nuclease can be used as an alternative to DNase I for examination of chromatin structure. Treatment of nuclei, obtained as described supra, with micrococcal nuclease is conducted as described by Livingstone-Zatchej et al. inMethods in Molecular Biology, Vol. 119, Humana Press, Totowa, N.J., pp. 363-378.
- Nuclei are treated with MPE using the following procedure adapted from Cartwright et al., supra. A freshly-diluted stock of 0.4 M H2O2 is prepared by making a 25-fold dilution of a 30% stock solution. A freshly-prepared stock of 0.5 M ferrous ammonium sulfate is diluted 400-fold in water. A solution of methidiumpropyl EDTA (MPE) is prepared by adding 30 μl of 5 mM MPE to 90 μl of water. To this MPE solution is added 120 μl of the ferrous ammonium sulfate dilution and 2.5 μl of 1 M dithiothreitol (DTT, freshly prepared from powder). To a suspension of nuclei, obtained as described supra, are added, in sequence: 3.5 μl of 0.4 M H2O2 and 37.5 μl of the MPE/ferrous ammonium sulfate/DTT mixture. The reaction is terminated after an appropriate time period (determined empirically) by addition of 40 μl of 50 mM bathophenanthroline disulfonate, 0.1 ml of 2.5% sodium dodecyl sulfate/50 mM EDTA/50 mM Tris-Cl, pH 7.5 and 10 μl of Proteinase K (10-14 mg/ml). Proteinase digestion is conducted at 37° C. for at least 8 hours and the mixture is then extracted twice with phenol/chloroform and once with chloroform. Nucleic acids are precipitated from the aqueous phase by addition of sodium acetate to 0.3 M and 0.7 volume of isopropyl alcohol, incubation on ice for at least 2 hr, and centrifugation. The pellet is washed with 70% ethanol, dried, resuspended in 10 mM Tris-Cl, pH 8 and treated with RNase A (approximately 0.1 mg/ml) for 15 min at 37° C.
- Pellets of precipitated, digested DNA, obtained after treatment with enzymatic or chemical probes as described supra, are resuspended in 22 μl of loading buffer containing glycerol and tracking dyes (“Gel loading solution,” Sigma Chemical Corp., St. Louis, Mo.) and incubated at 55° C. for 3-4 hours. Twenty microliters of resuspended sample is loaded onto a 1% agarose gel containing 1×TAE buffer and 0.5 μg/ml ethidium bromide, and electrophoresis is conducted at 22 Volts for 16 hours in Tris-acetate-EDTA buffer. After electrophoresis, the gel is treated with alkali, neutralized, blotted onto a Nytran membrane (Schleicher & Schuell, Keene, N.H.), and the blotted DNA is crosslinked to the membrane by ultraviolet irradiation.
- Probes are labeled by random priming, using the Prime-It Random Primer Labeling Kit (Stratagene, La Jolla, Calif.) according to the manufacturer's instructions. In a typical labeling reaction, 25-50 ng of DNA template is used in a final volume of 50 μl. A specific activity of 109 cpm/μg is typically obtained. Labeled probes are purified on a NucTrap probe column (Stratagene #400702, La Jolla, Calif.).
- The membrane is placed in a hybridization bottle and pre-hybridized in Rapid Hybridization Buffer (Amersham, Arlington Heights, Ill.) at 65° C. for 15 min. Probe (a 0.1 kb XbaI-KpnI fragment, see FIG. 1A) is added (approximately 0.03 μg containing approximately 3.3×107 cpm) and hybridization is conducted at 65° C. for 2 hours. Following hybridization, the membrane is washed once at 65° C. for 10 min. with 2×SSC+0.1% SDS, and twice at 65° C. for 10 min. with 0.1×SSC+0.1% SDS. The membrane is then dried and analyzed either by autoradiography or with a phosphorimager.
- Results are shown in FIG. 3 for analysis of DNase hypersensitivity, in HEK293 cells, within an approximately 1,000 base-pair region upstream of the human VEGF-A gene transcriptional startsite. Increasing DNase concentration resulted in the generation of two new sets of DNA fragment doublets, centered at approximately 500 and 1,000 nucleotides, indicating the presence of two DNase hypersensitive regions. One of these regions is centered approximately 500 base pairs upstream of the transcriptional startsite; the other is centered on the transcriptional startsite.
- Remodeling of VEGF chromatin can involve, among other things, loss of one or both of these hypersensitive regions, or the generation of one or more additional hypersensitive regions, either upstream or downstream of the transcriptional startsite.
- General. Activation or repression of transcription resulting from localized chromatin remodeling is determined by measurement of RNA and/or protein gene products. These methods are well-known to those of skill in the art.
- For example, Mizuguchi et al. (1999) inMethods in Molecular Biology, Vol. 119, “Chromatin Protocols” (P. B. Becker, ed.) Humana Press, Totowa, 1999, pp. 333-342 describe a procedure for in vitro transcription of chromatin that has been remodeled by the Drosophila NURF complex. This assay can be used to detect changes in transcriptional properties of chromatin (either activation or repression) following chromatin remodeling.
- Production and specificity of RNA can also be measured by RNA blots, nuclease protection and/or quantitative real-time PCR (colloquially known as the “Taqman” assay), as is known to those of skill in the art. See, for example, Ausubel et al., supra.
- Protein production can be measured by immunoassay (e.g., ELISA, immunoprecipitation), gel electrophoresis, and/or immunological detection of protein blots (“Western” blots), as is known to those of skill in the art. See, for example, Ausubel et al., supra.
- Reporter genes, either chromosomal or extrachromosomal, can also be used to assay activation and/or repression of specific promoters. Accordingly, effect of chromatin remodeling on a promoter that is operatively linked to a reporter gene (such as, for example, alkaline phosphatase, β-galactosidase, β-glucuronidase, chloramphenicol acetyl transferase, horseradish peroxidase, luciferase, or green fluorescent protein) can be assayed by measuring the levels and/or activity of the reporter. Methods for fusion of a promoter to a reporter gene, and methods for assay of reporter gene products, are known to those of skill in the art. See, for example, Ausubel et al., supra.
- RNA analysis. Transient transfection of HEK293 cells, seeded in 6-well plates, is carried out as described in Example 6, supra. Cell lysates are harvested 40 hours post-transfection. To assay the activation of the endogenous chromosomal VEGF gene, RNA blotting (“Northern” blotting) is used to measure VEGF mRNA levels. Briefly, PolyA+RNA is isolated from
HEK 293 cells transfected with a fusion plasmid or from mock-transfected HEK293 cells, using the Oligotex kit (Qiagen, Valencia, Calif.), according to the manufacturer's instructions. The fusion plasmid encodes a fusion protein comprising a nuclear localization sequence, the Veg1 DNA-binding domain, BAF155 and a FLAG epitope (see Example 2, supra). 7 μg of RNA are resolved on a 2.4% agarose gel containing 2.4 M formaldehyde, and the gel is blotted onto Nytran SuPerCharge membrane (Schliecher & Schuell, Keene, N.H.) using 20×SSC. The membrane is hybridized at 65° C. for 1 hour in Rapid-Hyb Buffer (Amersham-Pharmacia Biotech, Piscataway, N.J.) containing a 32P-labeled VEGF cDNA probe. The VEGF cDNA construct is generated by inserting a human VEGF cDNA fragment, obtained by PCR amplification, into the pCDNA3.1 vector (Invitrogen, Carlsbad Calif.) at the XbaI and EcoRI sites. Structure of the clone is confirmed by sequencing. After hybridization, the VEGF probe is stripped from the membrane, and the blot is re-hybridized with a 32P-labeled GAPDH DNA probe. VEGF mRNA levels, as determined by RNA blotting, are normalized to GAPDH mRNA levels. - For real-time quantitative PCR (“Taqman”) analysis of mRNA abundance, total cellular RNA from
transfected HEK 293 cells is isolated using the Rneasy Kit (Qiagen, Valencia, Calif.). RNA samples (25 ng) are mixed with 0.3 μM of each primer, 0.1 μM of probe, 5.5 mM MgCl2, 0.3 mM of each dNTP, 0.625 unit of AmpliTaq Gold RNA Polymerase, 6.25 units of Multiscribe Reverse Transcriptase, and 5 units of RNase inhibitor, in Taqman buffer A from Perkin Elmer. Reverse transcription is performed at 48° C. for 30 min. After denaturing at 95° C. for 10 minutes, PCR is conducted for 40 cycles at 95° C. for 15 seconds and 60° C. for one minute. Analysis is conducted, during the amplification reaction, in a 96-well format on an ABI 7700 SDS machine (PE BioSystems, Foster City, Calif.) and data is analyzed with SDS version 1.6.3 software. Exemplary probes and primers for analysis of VEGF and GAPDH genes are presented in Table 2.TABLE 2 Primer and Probe sequences for hydrolyzable probe analysis Gene Forward primer Reverse primer Probe VEGF 5′- CTGGTAGCGGGG 5′- GCCACGACCTCCG 5′-CTACCCGGCTGC AGGATCG-3′ AGCTAC-3′ CCCAAGCCTC-3′ (SEQ ID NO:32) (SEQ ID NO:33) (SEQ ID NO:34) GAPDH 5′- CCTTTTGCAGACC 5′- GCAGGGATGATGT 5′-CACTGCCACCCA ACAGTCCA-3′ TCTGGAGA-3′ GAAGACTGTGG-3′ (SEQ ID NO:35) (SEQ ID NO:36) (SEQ ID NO:37) - Protein analysis. Analysis of protein levels is performed by resolving 10 μg of whole cell lysate on a 10-20% polyacrylamide gel run in Tris/glycine/SDS buffer (BioRad, Hercules, Calif.). Proteins separated in the gel are transferred onto a nitrocellulose membrane using Tris/glycine/SDS buffer supplemented with 20% methanol, and the filter is blocked with 5% non-fat dry milk for 1 hour at room temperature. The blot is probed for 1 hour at room temperature with anti-Flag M2 monoclonal antibody (Sigma, St. Louis, Mo.) diluted 1:1000 in 5% (w/v) non-fat dry milk/0.1% PBS-Tween, then washed twice for 5 sec and once for 15 min with 0.1% PBS-Tween. All washes are performed at room temperature. The blot is then incubated for one hour at room temperature with a horseradish peroxidase-conjugated anti-mouse antibody (Amersham-Pharmacia Biotech, Piscataway, N.J.), used at a 1:3000 dilution in 5% (w/v) non fat dry milk /0.1% PBS Tween. This is followed by two 5 sec washes and one 15 min wash with 0.1% PBS-Tween. Protein bands are detected using the ECL system (Amersham-Pharmacia Biotech, Piscataway, N.J.).
- For analysis of protein level by ELISA, cell lysates are prepared (as described above) or culture medium is harvested and analyzed using a commercially available ELISA kit. For example, levels of secreted VEGF protein are determined by assay of culture medium using a human VEGF ELISA kit (R & D systems, Minneapolis, Minn.).
- Results. Transfection of the Veg1 /hBAF155 fusion construct (Example 2) into
cultured HEK 293 cells results in activation of VEGF gene expression, compared to untransfected cells, as evidenced by increases in VEGF mRNA and protein levels. Vectors lacking the ZFP and/or BAF155 portions of the fusion are used as controls. Transfection efficiency is measured by co-transfection of a green fluorescent protein expression vector. A mock transfection control is also carried out. - Introduction of the Veg3a/1-MBD1 or Veg3a/1-MBD2 fusion construct into
cultured HEK 293 cells by transfection results in repression of VEGF gene expression, compared to untransfected cells, as evidenced by decreases in VEGF mRNA and protein levels. Controls similar to those described above are also conducted. - Methods for the purification, assay and characterization of various chromatin remodeling complexes are well-known to those of skill in the art. See, for example,Methods in Enzymology, Vol. 304, “Chromatin” (P. M. Wassarman and A. P. Wolffe, eds.), Academic Press, San Diego, 1999; and Methods in Molecular Biology, Vol.119, “Chromatin Protocols” (P. B. Becker, ed.) Humana Press, Totowa, 1999.
- Chromatin remodeling can take the form of, for example, deposition, removal or repositioning of nucleosomes within chromatin. Means for detecting chromatin remodeling include, but are not limited to, detecting changes in accessibility of specific sites in chromatin to sequence-specific nucleases such as restriction enzymes, determination of the appearance or disappearance of a regularly repeating pattern of chromatin digestion by non-sequence specific endonucleases such as micrococcal nuclease and DNase I, determination of nucleosome spacing, and nucleosome-binding assays. Also, as mentioned supra, chromatin remodeling complexes possess ATPase activity; therefore ATP hydrolysis assays can be used in the identification and/or characterization of chromatin remodeling complexes.
- Restriction endonuclease accessibility assays are described by Logie et al., supra and Varga-Weisz et al. (1999)Meth. Enzymology 304:742-757. Assays for nucleosome spacing, DNase I accessibility, ATPase activity and nucleosome binding are disclosed by Varga-Weisz et al., supra. Assays to detect facilitation of transcription factor binding are described by Cote et al. (1994) Science 265:53-60 and Kwon et al. (1994) Nature 370:477-481. Assays for nucleosome repositioning (i.e., “sliding”) are disclosed by Hamiche et al. (1999) Cell 97:833-842.
- These assays, and others, can be used for the purification and characterization of chromatin remodeling complexes from various species, for example, the yeast SWI/SNF complex (Logie et al., supra), the Drosophila CHRAC complex (Varga-Weisz et al., supra) and the Drosophila NURF complex (Sandaltzopoulos et al., supra).
- Chromatin remodeling complexes utilize the energy of ATP hydrolysis to modify chromatin structure. Consequently, nucleosome- or DNA-dependent ATPase activity can be used to assay for a chromatin remodeling complex.
- Methods and compositions for conducting ATPase assays are well-known to those of skill in the art. One measure of ATPase activity is the release of labeled pyrophosphate from γ-32P-labeled ATP. Release is measured as the amount of radioactivity that does not bind to activated charcoal in 20 mM phosphoric acid.
- An alternative method for measuring pyrophosphate release is to measure labeled pyrophosphate directly by thin layer chromatography. The reaction mixture contains 0.02 μg/ml DNA (or reconstituted nucleosomal array, see Example 11 infra), 5 μM SWI/SNF complex (or any other known or putative chromatin remodeling complex), 20 mM Tris, pH 8.0, 5 mM MgCl2, 0.2 mM dithiothreitol, 0.1% Tween, 5% glycerol, 100 μg/ml bovine serum albumin, 100 μM ATP, and 0.2 μCi (γ-32P)ATP (3 Ci/mmol) in a final volume of 20 μl and is incubated at 37° C. At the conclusion of the assay (under these conditions the reaction rate is linear for 5-10 minutes), 1 μl is pipetted onto a polyethyleneimine cellulose sheet and the sheet is developed in a solution of 0.75 M potassium phosphate, pH 3.5. In this system, ATP and pyrophosphate are clearly resolved from each other and from the origin. Quantitation is carried out either by autoradiography followed by excision and scintillation counting of labeled spots, or by phosphorimaging.
- The preceding methods are adapted from those described by Logie et al. (1999)Meth. Enzymology 304:726-741.
- An alternative solvent for thin-layer chromatography is 0.5 M LiCl/1 M formic acid. In this system, pyrophosphate is separated from unhydrolyzed ATP, which remains at the origin. Varga-Weisz et al. (1999)Meth. Enzymol. 304:742-757.
- Deposition of purified histone octamers onto a specific template under defined conditions can generate a nucleosomal array in which the positions of one or more individual nucleosomes, with respect to the nucleotide sequence of the template, are known. Such an array can be used as a substrate in an assay for chromatin remodeling activity, by testing for changes in nucleosome position with respect to nucleotide sequence. One such test is restriction endonuclease accessibility. See infra.
- Preparation of reconstituted nucleosome arrays can be conducted according to Logie et al., supra and Varga-Weisz et al., supra. Additional methods can be found inMethods in Enzymology, Vol. 304, “Chromatin” (P. M. Wassarman and A. P. Wolffe, eds.), Academic Press, San Diego, 1999; and Methods in Molecular Biology, Vol. 119, “Chromatin Protocols” (P. B. Becker, ed.) Humana Press, Totowa, 1999.
- ISWI-encoding sequences were amplified from a recombinant plasmid encoding Drosophila ISWI. Corona et al. (1999)Mol. Cell 3:239-245. One of the primers contained, outside of the ISWI-complementary region, sequences encoding a FLAG epitope and, at the 5′ terminus, a 5′ extension encoding Hind III and Xba I sites. The other primer contained a 5′ extension encoding a Bam HI site. The sequences of the primers were as follows:
cgatcGGATCCTCCAAAACAGATACAGCTGCC (SEQ ID NO:38) BamHI ISWI seq gatcgccTCTAGACTCGAGAAGCTTACTTGTCATCGTCGTCCTTGTAGTCGCTGCCCTTCTTCTTCTTTTTCGAGTT (SEQ ID NO:39) XbaI HindIII FLAG sequence ISWI seq - Amplification was conducted at 95° C. for 2 min, followed by 30 cycles of 95° C. for 30 sec, 55° C. for 30 sec, 72° C. for 5 min, and a final step of 72° C. for 5 min. resulted in the generation of an amplification product comprising ISWI- and FLAG-encoding sequences flanked by Bam HI and Hind III sites. The amplification product was purified using a PCR Cleanup Kit (Qiagen, Valencia, Calif.) according to the manufacturer's instructions, then digested with Bam HI and Hind III.
- A vector encoding a nuclear localization signal (NLS), a ZFP binding domain targeted to the human erythropoietin gene (Epo 2C), a VP16 activation domain and a FLAG epitope was digested with Bam HI and Hind III to release VP16- and FLAG-encoding sequences. The Bam HI/Hind III fragment described in the preceding paragraph was ligated to the vector backbone to generate a vector encoding a fusion protein comprising a NLS, the Epo2C binding domain, ISWI and FLAG. A vector encoding a protein that is identical, except for the presence of an Epo3B binding domain in place of Epo2C, was constructed by similar methods for use as a control. The nucleotide sequences of the target sites, and the amino acid sequences of the recognition helices (−1 through +6) for the Epo2C and Epo3B binding domains are provided in Table 3.
TABLE 3 Target sites and recognition helix sequences for Epo2C and Epo3B ZFP Target F1 (−1 to +=6) F2 (−1 to +=6) F3 (−1 to +=6) Epo2c GGTGAGGAGT RSDNALR RSDNLAR DSSKLSR (SEQ ID NO:40) (SEQ ID NO:41) (SEQ ID NO:42) (SEQ ID NO:43) Epo3b GCGGTGGCTC QSSDLTR RSDALSR RSDERKR (SEQ ID NO:44) (SEQ ID NO:45) (SEQ ID NO:46) (SEQ ID NO:47) - The steroid receptor coactivator 1 (SRC1) protein is a histone acetyltransferase which is capable of recruiting the p300 and CBP proteins (both of which are also histone acetyltransferases). Liu et al. (1999)Proc. Natl. Acad. Sci. USA 96:9485-9490; Sheppard et al. (2001) Mol. Cell. Biol. 21:39-50 and references cited therein) A construct encoding a portion of
SRC 1 common to the a and e isoforms (amino acids 781 through 1385, Kalkhoven et al. (1998) EMBO J. 17:232-243), fused to a zinc finger binding domain targeted to the human erythropoietin (EPO) gene, was constructed as follows. - A plasmid encoding SRC1 was used as a template for PCR amplification using the following primers, and the amplification product was digested with Not I.
5′-GGATCCGGCCACCGCGGCCGCATGGATCCATGTAATACAAACCCAACC (SEQ ID NO:48) 5′-ATGAATTCGCGGCCGCCCTGGGTTCCATCTGCTTCTGTTTTGAG (SEQ ID NO:49) - The pVP16-EPOZFP-862c vector, containing a transcription unit encoding a nuclear localization signal (NLS), the EPO ZFP-862 zinc finger binding domain, a VP16 transcriptional activation domain and a FLAG epitope, under transcriptional control of a CMV promoter and a bovine growth hormone polyadenylation signal, was digested with Not I to release VP1 6-encoding sequences. See Zhang et al. (2000)J. Biol. Chem. 275:33,850-33,860 for the design and properties of EPOZFP-862, which binds to a site 862 nucleotides upstream of the EPO transcriptional startsite. The Not I-digested amplification product described in the previous paragraph was inserted into the ZFP-862c vector backbone by ligation, to generate a plasmid encoding a NLS, the EPO ZFP-862 binding domain, amino acids 781-1385 of SRC1 and a FLAG epitope. The structure of the resulting construct, pSRC1b-EPO2c, is illustrated schematically in FIG. 4.
- This construct was introduced into
human HEK 293 cells by transfection (200 ug of plasmid plus 5 ug of Lipofectamine; Lipofectamine obtained from Gibco/Life Technologies, Gaithersburg, Md.). Approximately 12 hours after exposure of cells to plasmid, the medium was removed and replaced with fresh DMEM supplemented with 10% fetal bovine serum. Twenty-four hours later, the medium was harvested and assayed for secreted EPO, using an erythropoietin ELISA from R&D Systems (Minneapolis, Minn.). - Results of the assay, shown in FIG. 5, indicated that transfection of 293 cells with the pSRC1b-EPO2c fusion plasmid activated expression of EPO, compared to cells transfected with a control plasmid (pcDNA3.1) not encoding such a fusion. Thus, 7FP-targeted binding, to the EPO gene, of a protein which is capable of chromatin remodeling (by virtue of its histone acetyltransferase activity) and can serve as a component of chromatin remodeling complexes (by virtue of its ability to bind p300 and CBP) resulted in activation of gene expression.
- Methyl binding domain proteins (MBDs) participate in repression of the expression of certain genes by binding to methylated cytosine residues present in CpG dinucleotides and recruiting chromatin remodeling complexes to the site of binding. MBDs are also present as a component of certain chromatin remodeling complexes.
- DNA N-methyl transferases (DNMTs) methylate cytosine residues present in certain CpG dinucleotide sequences in cellular DNA. Such methylation can lead to chromatin remodeling at or in the vicinity of the methylated sequence(s) by, for example, binding of one or more MBDs and concomitant or subsequent recruitment of chromatin remodeling complexes. The DNMT1 protein can also associate with histone deacetylases (HDACs), which themselves are involved in chromatin remodeling.
- A series of ZFP-MBD and ZFP-DNMT fusions were tested for their ability to regulate expression of the human VEGF-A gene. Accordingly, a series of plasmids was constructed, in which the VEGF3a/1 ZFP binding domain (see Example 3,supra) was fused to MBD2b, MBD3, MBD3S, MBD3L, DNMT1, DNMT3a or DNMT3b. See, for example, GenBank accession numbers AF072243, AF170347, AW872007, and NM013595. The fusion genes also comprised a nuclear localization signal and a FLAG epitope, similar to the constructs described in Examples 11 and 12. FIG. 6 shows a schematic diagram of these constructs.
- HeLa cells were transfected with the constructs shown in FIG. 6. Seventy-two hours after transfection, secreted VEGF levels were measured using a VEGF ELISA (R&D Systems, Minneapolis, Minn.) according to the manufacturer's instructions. Cells were co-transfected with a green fluorescent protein-encoding plasmid to allow measurement of transfection efficiency. The results, presented in FIG. 7, show that transfection of HeLa cells with all of the MBD and DNMT fusions tested resulted in repression of VEGF expression. When corrected for transfection efficiency (approximately 50% in this experiment), intracellular expression of the MBD2b-VEGF3a/1 fusion resulted in essentially 100% repression of VEGF expression. Thus, fusions between a targeted ZFP binding domain and proteins whose mechanism of modulating gene expression involves chromatin remodeling are able to repress gene expression.
- All patents, patent applications and publications mentioned herein are hereby incorporated by reference in their entirety.
- Although disclosure has been provided in some detail by way of illustration and example for the purposes of clarity of understanding, it will be apparent to those skilled in the art that various changes and modifications can be practiced without departing from the spirit or scope of the disclosure. Accordingly, the foregoing descriptions and examples should not be construed as limiting.
-
1 49 1 9 DNA Artificial Sequence Description of Artificial Sequence Veg 1 target site 3′ to 5′ 1 cccctccta 9 2 9 DNA Artificial Sequence Description of Artificial Sequence Veg 1 target site 5′ to 3′ 2 ggggaggat 9 3 7 PRT Artificial Sequence Description of Artificial Sequence Veg 1 AA sequence F1 3 Thr Thr Ser Asn Leu Arg Arg 1 5 4 7 PRT Artificial Sequence Description of Artificial Sequence Veg 1 AA sequence F2 4 Arg Ser Ser Asn Leu Gln Arg 1 5 5 7 PRT Artificial Sequence Description of Artificial Sequence Veg 1 AA sequence F3 5 Arg Ser Asp His Leu Ser Arg 1 5 6 9 DNA Artificial Sequence Description of Artificial Sequence Veg 3a target site 6 gcggaggct 9 7 7 PRT Artificial Sequence Description of Artificial Sequence Veg 3a AA sequence F1 7 Gln Ser Ser Asp Leu Gln Arg 1 5 8 7 PRT Artificial Sequence Description of Artificial Sequence Veg 3a AA sequence F2 8 Arg Ser Ser Asn Leu Gln Arg 1 5 9 7 PRT Artificial Sequence Description of Artificial Sequence Veg 3a AA sequence F3 9 Arg Ser Asp Glu Leu Ser Arg 1 5 10 298 DNA Artificial Sequence Description of Artificial Sequence Veg1 nucleotide sequence 10 ggtacccata cctggcaaga agaagcagca catctgccac atccagggct gtggtaaagt 60 ttacggcaca acctcaaatc tgcgtcgtca cctgcgctgg cacaccggcg agaggccttt 120 catgtgtacc tggtcctact gtggtaaacg cttcacccgt tcgtcaaacc tgcagcgtca 180 caagcgtacc cacaccggtg agaagaaatt tgcttgcccg gagtgtccga agcgcttcat 240 gcgtagtgac cacctgtccc gtcacatcaa gacccaccag aataagaagg gtggatcc 298 11 99 PRT Artificial Sequence Description of Artificial Sequence Veg1 amino acid sequence 11 Val Pro Ile Pro Gly Lys Lys Lys Gln His Ile Cys His Ile Gln Gly 1 5 10 15 Cys Gly Lys Val Tyr Gly Thr Thr Ser Asn Leu Arg Arg His Leu Arg 20 25 30 Trp His Thr Gly Glu Arg Pro Phe Met Cys Thr Trp Ser Tyr Cys Gly 35 40 45 Lys Arg Phe Thr Arg Ser Ser Asn Leu Gln Arg His Lys Arg Thr His 50 55 60 Thr Gly Glu Lys Lys Phe Ala Cys Pro Glu Cys Pro Lys Arg Phe Met 65 70 75 80 Arg Ser Asp His Leu Ser Arg His Ile Lys Thr His Gln Asn Lys Lys 85 90 95 Gly Gly Ser 12 29 DNA Artificial Sequence Description of Artificial Sequence duplex oligonucleotide binding target 5′-3′ 12 catgcatagc ggggaggatc gccatcgat 29 13 14 PRT Artificial Sequence Description of Artificial Sequence NLS derived SV40 large T-antigen 13 Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val 1 5 10 14 8 PRT Artificial Sequence Description of Artificial Sequence double-stranded oligonucleotide encoding a FLAG epitope 14 Asp Tyr Lys Asp Asp Asp Asp Lys 1 5 15 19 DNA Artificial Sequence Description of Artificial Sequence target site for human VEGF-A 15 ggggaggatc gcggaggct 19 16 5 PRT Artificial Sequence Description of Artificial Sequence linker sequence 16 Asp Gly Gly Gly Ser 1 5 17 298 DNA Artificial Sequence Description of Artificial Sequence Veg3a nucleotide sequence 17 ggtacccata cctggcaaga agaagcagca catctgccac atccagggct gtggtaaagt 60 ttacggccag tcctccgacc tgcagcgtca cctgcgctgg cacaccggcg agaggccttt 120 catgtgtacc tggtcctact gtggtaaacg cttcacccgt tcgtcaaacc tacagaggca 180 caagcgtaca cacaccggtg agaagaaatt tgcttgcccg gagtgtccga agcgcttcat 240 gcgaagtgac gagctgtcac gacatatcaa gacccaccag aacaagaagg gtggatcc 298 18 99 PRT Artificial Sequence Description of Artificial Sequence Veg3a amino acid sequence 18 Val Pro Ile Pro Gly Lys Lys Lys Gln His Ile Cys His Ile Gln Gly 1 5 10 15 Cys Gly Lys Val Tyr Gly Gln Ser Ser Asp Leu Gln Arg His Leu Arg 20 25 30 Trp His Thr Gly Glu Arg Pro Phe Met Cys Thr Trp Ser Tyr Cys Gly 35 40 45 Lys Arg Phe Thr Arg Ser Ser Asn Leu Gln Arg His Lys Arg Thr His 50 55 60 Thr Gly Glu Lys Lys Phe Ala Cys Pro Glu Cys Pro Lys Arg Phe Met 65 70 75 80 Arg Ser Asp Glu Leu Ser Arg His Ile Lys Thr His Gln Asn Lys Lys 85 90 95 Gly Gly Ser 19 29 DNA Artificial Sequence Description of Artificial Sequence Veg3a DNA target site 19 catgcatatc gcggaggctt ggcatcgat 29 20 29 DNA Artificial Sequence Description of Artificial Sequence primer SPE7 20 gagcagaatt cggcaagaag aagcagcac 29 21 26 DNA Artificial Sequence Description of Artificial Sequence primer SPEamp12 21 gtggtctaga cagctcgtca cttcgc 26 22 28 DNA Artificial Sequence Description of Artificial Sequence primer SPEamp13 22 ggagccaagg ctgtggtaaa gtttacgg 28 23 26 DNA Artificial Sequence Description of Artificial Sequence primer SPEamp11 23 ggagaagctt ggatcctcat tatccc 26 24 77 DNA Artificial Sequence Description of Artificial Sequence fragment encoding DGGGS linker, 5′ to 3′ 24 ctagacacat caaaacccac cagaacaaga aagacggcgg tggcagcggc aaaaagaaac 60 agcacatatg tcacatc 77 25 77 DNA Artificial Sequence Description of Artificial Sequence fragment encoding DGGGS linker, 3′ to 5′ 25 tgtgtagttt tgggtggtct tgttctttct gccgccaccg tcgccgtttt tctttgtcgt 60 gtatacagtg taggttc 77 26 39 DNA Artificial Sequence Description of Artificial Sequence primer GB19 26 gccatgccgg tacccatacc tggcaagaag aagcagcac 39 27 33 DNA Artificial Sequence Description of Artificial Sequence primer GB10 27 cagatcggat ccacccttct tattctggtg ggt 33 28 589 DNA Artificial Sequence Description of Artificial Sequence Veg3a/1 nucleotide sequence 28 ggtacccata cctggcaaga agaagcagca catctgccac atccagggct gtggtaaagt 60 ttacggccag tcctccgacc tgcagcgtca cctgcgctgg cacaccggcg agaggccttt 120 catgtgtacc tggtcctact gtggtaaacg cttcacacgt tcgtcaaacc tacagaggca 180 caagcgtaca cacacaggtg agaagaaatt tgcttgcccg gagtgtccga agcgcttcat 240 gcgaagtgac gagctgtcta gacacatcaa aacccaccag aacaagaaag acggcggtgg 300 cagcggcaaa aagaaacagc acatatgtca catccaaggc tgtggtaaag tttacggcac 360 aacctcaaat ctgcgtcgtc acctgcgctg gcacaccggc gagaggcctt tcatgtgtac 420 ctggtcctac tgtggtaaac gcttcacccg ttcgtcaaac ctgcagcgtc acaagcgtac 480 ccacaccggt gagaagaaat ttgcttgccc ggagtgtccg aagcgcttca tgcgtagtga 540 ccacctgtcc cgtcacatca agacccacca gaataagaag ggtggatcc 589 29 196 PRT Artificial Sequence Description of Artificial Sequence Veg3a/1 amino acid sequence 29 Val Pro Ile Pro Gly Lys Lys Lys Gln His Ile Cys His Ile Gln Gly 1 5 10 15 Cys Gly Lys Val Tyr Gly Gln Ser Ser Asp Leu Gln Arg His Leu Arg 20 25 30 Trp His Thr Gly Glu Arg Pro Phe Met Cys Thr Trp Ser Tyr Cys Gly 35 40 45 Lys Arg Phe Thr Arg Ser Ser Asn Leu Gln Arg His Lys Arg Thr His 50 55 60 Thr Gly Glu Lys Lys Phe Ala Cys Pro Glu Cys Pro Lys Arg Phe Met 65 70 75 80 Arg Ser Asp Glu Leu Ser Arg His Ile Lys Thr His Gln Asn Lys Lys 85 90 95 Asp Gly Gly Gly Ser Gly Lys Lys Lys Gln His Ile Cys His Ile Gln 100 105 110 Gly Cys Gly Lys Val Tyr Gly Thr Thr Ser Asn Leu Arg Arg His Leu 115 120 125 Arg Trp His Thr Gly Glu Arg Pro Phe Met Cys Thr Trp Ser Tyr Cys 130 135 140 Gly Lys Arg Phe Thr Arg Ser Ser Asn Leu Gln Arg His Lys Arg Thr 145 150 155 160 His Thr Gly Glu Lys Lys Phe Ala Cys Pro Glu Cys Pro Lys Arg Phe 165 170 175 Met Arg Ser Asp His Leu Ser Arg His Ile Lys Thr His Gln Asn Lys 180 185 190 Lys Gly Gly Ser 195 30 42 DNA Artificial Sequence Description of Artificial Sequence Veg3a/1 target site 1 30 agcgagcggg gaggatcgcg gaggcttggg gcagccgggt ag 42 31 42 DNA Artificial Sequence Description of Artificial Sequence Veg3a/1 target site 2 31 tcgcccctcc tagcgcctcc gaaccccgtc ggcccatctc gc 42 32 19 DNA Artificial Sequence Description of Artificial Sequence VEGF forward primer 32 ctggtagcgg ggaggatcg 19 33 19 DNA Artificial Sequence Description of Artificial Sequence VEGF reverse primer 33 gccacgacct ccgagctac 19 34 22 DNA Artificial Sequence Description of Artificial Sequence VEGF probe 34 ctacccggct gccccaagcc tc 22 35 21 DNA Artificial Sequence Description of Artificial Sequence GAPDH forward primer 35 ccttttgcag accacagtcc a 21 36 21 DNA Artificial Sequence Description of Artificial Sequence GAPDH reverse primer 36 gcagggatga tgttctggag a 21 37 23 DNA Artificial Sequence Description of Artificial Sequence GAPDH probe 37 cactgccacc cagaagactg tgg 23 38 32 DNA Artificial Sequence Description of Artificial Sequence ISWI primer 1 38 cgatcggatc ctccaaaaca gatacagctg cc 32 39 77 DNA Artificial Sequence Description of Artificial Sequence ISWI primer 2 39 gatcgcctct agactcgaga agcttacttg tcatcgtcgt ccttgtagtc gctgcccttc 60 ttcttctttt tcgagtt 77 40 10 DNA Artificial Sequence Description of Artificial Sequence Epo2c target site 40 ggtgaggagt 10 41 7 PRT Artificial Sequence Description of Artificial Sequence Epo2c recognition helix F1 41 Arg Ser Asp Asn Ala Leu Arg 1 5 42 7 PRT Artificial Sequence Description of Artificial Sequence Epo2c recognition helix F2 42 Arg Ser Asp Asn Leu Ala Arg 1 5 43 7 PRT Artificial Sequence Description of Artificial Sequence Epo2c recognition helix F3 43 Asp Ser Ser Lys Leu Ser Arg 1 5 44 10 DNA Artificial Sequence Description of Artificial Sequence Epo3b target site 44 gcggtggctc 10 45 7 PRT Artificial Sequence Description of Artificial Sequence Epo3b recognition helix F1 45 Gln Ser Ser Asp Leu Thr Arg 1 5 46 7 PRT Artificial Sequence Description of Artificial Sequence Epo3b recognition helix F2 46 Arg Ser Asp Ala Leu Ser Arg 1 5 47 7 PRT Artificial Sequence Description of Artificial Sequence Epo3b recognition helix F3 47 Arg Ser Asp Glu Arg Lys Arg 1 5 48 48 DNA Artificial Sequence Description of Artificial Sequence SRC1 primer 1 48 ggatccggcc accgcggccg catggatcca tgtaatacaa acccaacc 48 49 44 DNA Artificial Sequence Description of Artificial Sequence SRC1 primer 2 49 atgaattcgc ggccgccctg ggttccatct gcttctgttt tgag 44
Claims (73)
1. A method for modifying a region of interest in cellular chromatin, the method comprising the step of contacting the cellular chromatin with a fusion molecule that binds to a binding site in the region of interest, wherein the fusion molecule comprises a DNA binding domain and a component of a chromatin remodeling complex or functional fragment thereof, thereby modifying the region of interest.
2. The method of claim 1 , wherein the cellular chromatin is present in a plant cell.
3. The method of claim 1 , wherein the cellular chromatin is present in an animal cell.
4. The method of claim 3 , wherein the cell is a human cell.
5. The method of claim 1 , wherein the fusion molecule is a fusion polypeptide.
6. The method of claim 1 , wherein the DNA-binding domain comprises a zinc finger DNA-binding domain.
7. The method of claim 1 , wherein the DNA-binding domain is a triplex-forming nucleic acid or a minor groove binder.
8. The method of claim 1 , wherein the component of a chromatin remodeling complex or functional fragment thereof is an enzymatic component.
9. The method of claim 1 , wherein the component of a chromatin remodeling complex or functional fragment thereof is a non-enzymatic component.
10. The method of claim 1 , wherein chromatin modification facilitates detection of a sequence of interest.
11. The method of claim 10 , wherein the sequence of interest comprises a single nucleotide polymorphism.
12. The method of claim 1 , wherein chromatin modification facilitates activation of a gene of interest.
13. The method of claim 1 , wherein chromatin modification facilitates repression of a gene of interest.
14. The method of claim 1 , wherein chromatin modification facilitates recombination between an exogenous nucleic acid and cellular chromatin.
15. The method of claim 5 , wherein the method further comprises the step of contacting a cell with a polynucleotide encoding the fusion polypeptide, wherein the fusion polypeptide is expressed in the cell.
16. The method of claim 1 , further comprising the step of identifying an accessible region in the cellular chromatin, wherein the fusion molecule binds to a target site in the accessible region.
17. The method of claim 1 , wherein the region of interest comprises a gene.
18. The method of claim 17 , wherein the gene encodes a product selected from the group consisting of vascular endothelial growth factor, erythropoietin, androgen receptor, PPAR-γ2, p16, p53, pRb, dystrophin and e-cadherin.
19. The method of claim 1 , further comprising the step of contacting the cellular chromatin with a second molecule.
20. The method of claim 19 , wherein the second molecule is a transcriptional regulatory protein.
21. The method of claim 19 , wherein the second molecule is a fusion molecule.
22. The method of claim 21 , wherein the second molecule is a fusion polypeptide.
23. The method of claim 21 , wherein the second molecule comprises a zinc finger DNA-binding domain.
24. The method of claim 23 , wherein the second molecule further comprises a transcriptional activation domain.
25. The method of claim 23 , wherein the second molecule further comprises a transcriptional repression domain.
26. The method of claim 23 , wherein the second molecule further comprises a polypeptide sequence selected from the group consisting of a histone acetyl transferase, a histone deacetylase, a functional fragment of a histone acetyl transferase, and a functional fragment of a histone deacetylase.
27. The method of claim 19 , further comprising the step of contacting the cellular chromatin with a third molecule.
28. The method of claim 27 , wherein the third molecule is a transcriptional regulatory protein.
29. The method of claim 27 ,wherein the third molecule is a fusion molecule.
30. The method of claim 29 ,wherein the third molecule is a fusion polypeptide.
31. The method of claim 29 , wherein the third molecule comprises a zinc finger DNA-binding domain.
32. The method of claim 31 , wherein the third molecule further comprises a transcriptional activation domain.
33. The method of claim 31 , wherein the third molecule further comprises a transcriptional repression domain.
34. A fusion polypeptide comprising:
a) a DNA binding domain; and
b) a component of a chromatin remodeling complex or a functional fragment thereof.
35. The polypeptide of claim 34 , wherein the DNA-binding domain is a zinc finger DNA binding domain.
36. The polypeptide of claim 34 , wherein the DNA binding domain binds to a target site in a gene encoding a product selected from the group consisting of vascular endothelial growth factor, erythropoietin, androgen receptor, PPAR-γ2, p16, p53, pRb, dystrophin and e-cadherin.
37. The polypeptide of claim 34 , wherein the component of a chromatin remodeling complex or functional fragment thereof is an enzymatic component.
38. The polypeptide of claim 34 , wherein the component of a chromatin remodeling complex or functional fragment thereof is a non-enzymatic component.
39. The polypeptide of claim 37 , wherein the enzymatic component of a chromatin remodeling complex or functional fragment thereof is selected from the group consisting of a SWI/SNF complex family member, an Mi-2 complex family member, an ISWI complex family member, a BRM family member, a BRG/BAF complex family member, a Mot-1 complex family member, a Chd-1 family member, a Chd-2 family member, a Chd-3 family member, a Chd-4 family member, a histone acetyl transferase and a histone deacetylase.
40. The polypeptide of claim 37 , wherein the enzymatic component of a chromatin remodeling complex or functional fragment thereof is selected from the group consisting of a histone methyl transferase, a histone demethylase, a histone kinase, a histone phosphatase, a histone ubiquitinating enzyme, a histone-ADP-ribosylase and a histone protease.
41. A polynucleotide encoding the fusion polypeptide of claim 34 .
42. A cell comprising the fusion polypeptide of claim 34 .
43. A cell comprising the polynucleotide of claim 41 .
44. A method for modulating expression of a gene, the method comprising the steps of:
a) contacting cellular chromatin with a first fusion molecule that binds to a binding site in cellular chromatin, wherein the binding site is in the gene and wherein the first fusion molecule comprises a DNA-binding domain and a component of a chromatin remodeling complex or functional fragment thereof; and
b) further contacting the cellular chromatin with a second molecule that binds to a target site in the gene and modulates expression of the gene.
45. The method of claim 44 , wherein modulation comprises activation of expression of the gene.
46. The method of claim 44 , wherein modulation comprises repression of expression of the gene.
47. The method of claim 44 wherein the DNA-binding domain of the first fusion molecule comprises a zinc finger DNA-binding domain.
48. The method of claim 44 wherein the second molecule is a polypeptide.
49. The method of claim 48 wherein the second molecule comprises a zinc finger DNA-binding domain.
50. The method of claim 49 , wherein the second molecule further comprises an activation domain.
51. The method of claim 49 , wherein the second molecule further comprises a repression domain.
52. The method of claim 44 wherein the second molecule is a transcription factor.
53. The method of claim 52 wherein the transcription factor is an exogenous molecule.
54. The method of claim 52 wherein the transcription factor is an endogenous molecule.
55. The method of claim 44 wherein the first fusion molecule and the second molecule each comprise a zinc finger DNA-binding domain.
56. The method of claim 44 wherein a plurality of first fusion molecules is contacted with cellular chromatin, wherein each of the first fusion molecules binds to a distinct binding site.
57. The method of claim 44 , wherein a plurality of second molecules is contacted with cellular chromatin, wherein each of the second molecules binds to a distinct target site.
58. The method of claim 56 wherein at least one of the first fusion molecules comprises a zinc finger DNA-binding domain.
59. The method of claim 57 wherein at least one of the second molecules comprises a zinc finger DNA-binding domain.
60. The method of claim 44 wherein the expression of a plurality of genes is modulated.
61. The method of claim 60 wherein a plurality of first fusion molecules is contacted with cellular chromatin, wherein each of the first fusion molecules binds to a distinct binding site.
62. The method of claim 61 wherein at least one of the first fusion molecules is a zinc finger fusion polypeptide.
63. The method of claim 60 , wherein a plurality of second molecules is contacted with cellular chromatin, wherein each of the second molecules binds to a distinct binding site.
64. The method of claim 63 wherein at least one of the second molecules is a zinc finger fusion polypeptide.
65. The method of claim 60 wherein the first fusion molecule binds to a shared binding site in two or more of the plurality of genes.
66. The method of claim 65 wherein the first fusion molecule is a zinc finger fusion polypeptide.
67. The method of claim 60 wherein the second molecule binds to a shared target site in two or more of the plurality of genes.
68. The method of claim 67 wherein the second molecule is a zinc finger fusion polypeptide.
69. The method of claim 1 , wherein chromatin modification results in the generation of an accessible region in the cellular chromatin.
70. The method of claim 69 , wherein generation of the accessible region facilitates binding of an exogenous molecule.
71. The method of claim 70 , wherein the exogenous molecule is selected from the group consisting of polypeptides, nucleic acids, small molecule therapeutics, minor groove binders, major groove binders and intercalators.
72. A method for producing a fusion polypeptide, wherein the fusion polypeptide comprises a zinc finger DNA binding domain and a component of a chromatin remodeling complex or a functional fragment thereof, the method comprising the step of expressing the polynucleotide of claim 41 in a suitable host cell.
73. A method for binding an exogenous molecule to a binding site, wherein the binding site is located within a region of interest in cellular chromatin, wherein the method comprises:
(a) contacting cellular chromatin with a fusion molecule that binds to a binding site in the region of interest, wherein the fusion molecule comprises a DNA binding domain and a component of a chromatin remodeling complex or functional fragment thereof, thereby modifying cellular chromatin within the region of interest; and
(b) introducing the exogenous molecule into the cell;
whereby the exogenous molecule binds to the binding site.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/084,826 US20030049649A1 (en) | 2000-04-28 | 2002-02-24 | Targeted modification of chromatin structure |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US20059000P | 2000-04-28 | 2000-04-28 | |
US22852300P | 2000-08-28 | 2000-08-28 | |
US09/844,508 US7001768B2 (en) | 2000-04-28 | 2001-04-27 | Targeted modification of chromatin structure |
US10/084,826 US20030049649A1 (en) | 2000-04-28 | 2002-02-24 | Targeted modification of chromatin structure |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/844,508 Continuation-In-Part US7001768B2 (en) | 2000-04-28 | 2001-04-27 | Targeted modification of chromatin structure |
Publications (1)
Publication Number | Publication Date |
---|---|
US20030049649A1 true US20030049649A1 (en) | 2003-03-13 |
Family
ID=46150079
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/084,826 Abandoned US20030049649A1 (en) | 2000-04-28 | 2002-02-24 | Targeted modification of chromatin structure |
Country Status (1)
Country | Link |
---|---|
US (1) | US20030049649A1 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007016037A3 (en) * | 2005-07-27 | 2007-06-14 | Univ Texas | Methods for trans-differentiating cells |
US7250514B1 (en) | 2002-10-21 | 2007-07-31 | Takeda San Diego, Inc. | Histone deacetylase inhibitors |
US10711045B2 (en) * | 2015-03-30 | 2020-07-14 | Universität Stuttgart | Isolation of nucleosomes having multiple-modified histone protein octamers |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5972608A (en) * | 1997-08-27 | 1999-10-26 | University Of Massachusetts | Assays and reagents for chromatin remodeling enzymes and their modulators |
US20020173006A1 (en) * | 1998-03-02 | 2002-11-21 | Massachusetts Institute Of Technology | Poly zinc finger proteins with improved linkers |
US6534261B1 (en) * | 1999-01-12 | 2003-03-18 | Sangamo Biosciences, Inc. | Regulation of endogenous gene expression in cells using zinc finger proteins |
-
2002
- 2002-02-24 US US10/084,826 patent/US20030049649A1/en not_active Abandoned
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5972608A (en) * | 1997-08-27 | 1999-10-26 | University Of Massachusetts | Assays and reagents for chromatin remodeling enzymes and their modulators |
US20020173006A1 (en) * | 1998-03-02 | 2002-11-21 | Massachusetts Institute Of Technology | Poly zinc finger proteins with improved linkers |
US6534261B1 (en) * | 1999-01-12 | 2003-03-18 | Sangamo Biosciences, Inc. | Regulation of endogenous gene expression in cells using zinc finger proteins |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7250514B1 (en) | 2002-10-21 | 2007-07-31 | Takeda San Diego, Inc. | Histone deacetylase inhibitors |
WO2007016037A3 (en) * | 2005-07-27 | 2007-06-14 | Univ Texas | Methods for trans-differentiating cells |
US8440460B2 (en) | 2005-07-27 | 2013-05-14 | The Board Of Regents Of The University Of Texas System | Methods for transdifferentiating cells |
US10711045B2 (en) * | 2015-03-30 | 2020-07-14 | Universität Stuttgart | Isolation of nucleosomes having multiple-modified histone protein octamers |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7785792B2 (en) | Targeted modification of chromatin structure | |
AU2001253914A1 (en) | Targeted modification of chromatin structure | |
US9234187B2 (en) | Modified zinc finger binding proteins | |
AU2002241946A1 (en) | Modified zinc finger binding proteins | |
US7253273B2 (en) | Treatment of neuropathic pain with zinc finger proteins | |
US6919204B2 (en) | Modulation of gene expression using localization domains | |
US20090181455A1 (en) | Modulation of gene expression using insulator binding proteins | |
US20030049649A1 (en) | Targeted modification of chromatin structure | |
WO2002044386A2 (en) | Targeted regulation of gene expression | |
US20040132033A1 (en) | Human heparanase gene regulatory sequences |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: SANGAMO BIOSCIENCES, INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:WOLFFE, ALAN P. (BY ELIZABEH J. WOLFEE SPECIAL ADMINISTRATOR);COLLINGWOOD, TREVOR;SNOWDEN, ANDREW;REEL/FRAME:012622/0876;SIGNING DATES FROM 20020403 TO 20020404 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |