WO2002034295A1 - Synthetic regulatory compounds - Google Patents
Synthetic regulatory compounds Download PDFInfo
- Publication number
- WO2002034295A1 WO2002034295A1 PCT/US2000/029617 US0029617W WO0234295A1 WO 2002034295 A1 WO2002034295 A1 WO 2002034295A1 US 0029617 W US0029617 W US 0029617W WO 0234295 A1 WO0234295 A1 WO 0234295A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- nucleic acid
- moiety
- acid binding
- synthetic regulatory
- synthetic
- Prior art date
Links
- 150000001875 compounds Chemical class 0.000 title claims abstract description 327
- 230000001105 regulatory effect Effects 0.000 title claims abstract description 257
- 230000027455 binding Effects 0.000 claims abstract description 203
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 182
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 179
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 179
- 238000000034 method Methods 0.000 claims abstract description 82
- 239000000203 mixture Substances 0.000 claims abstract description 64
- 238000012216 screening Methods 0.000 claims abstract description 8
- 125000005647 linker group Chemical group 0.000 claims description 126
- 229920002647 polyamide Polymers 0.000 claims description 125
- 239000004952 Polyamide Substances 0.000 claims description 122
- 108090000623 proteins and genes Proteins 0.000 claims description 115
- 108020004414 DNA Proteins 0.000 claims description 114
- 210000004027 cell Anatomy 0.000 claims description 106
- 238000013518 transcription Methods 0.000 claims description 105
- 230000035897 transcription Effects 0.000 claims description 105
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 96
- 230000004913 activation Effects 0.000 claims description 61
- 230000014509 gene expression Effects 0.000 claims description 57
- 102000053602 DNA Human genes 0.000 claims description 56
- 238000000338 in vitro Methods 0.000 claims description 40
- 230000000694 effects Effects 0.000 claims description 39
- 108091034117 Oligonucleotide Proteins 0.000 claims description 28
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 claims description 21
- 108010077544 Chromatin Proteins 0.000 claims description 20
- 210000003483 chromatin Anatomy 0.000 claims description 20
- 150000003839 salts Chemical class 0.000 claims description 18
- 238000012360 testing method Methods 0.000 claims description 16
- 229910052739 hydrogen Inorganic materials 0.000 claims description 14
- 239000002253 acid Substances 0.000 claims description 13
- 125000001072 heteroaryl group Chemical group 0.000 claims description 13
- 239000001257 hydrogen Substances 0.000 claims description 13
- 150000002460 imidazoles Chemical class 0.000 claims description 12
- 125000000168 pyrrolyl group Chemical group 0.000 claims description 11
- BTCSSZJGUNDROE-UHFFFAOYSA-N gamma-aminobutyric acid Chemical compound NCCCC(O)=O BTCSSZJGUNDROE-UHFFFAOYSA-N 0.000 claims description 10
- 238000001727 in vivo Methods 0.000 claims description 10
- 238000007634 remodeling Methods 0.000 claims description 10
- 230000004962 physiological condition Effects 0.000 claims description 9
- KAESVJOAVNADME-UHFFFAOYSA-N 1H-pyrrole Natural products C=1C=CNC=1 KAESVJOAVNADME-UHFFFAOYSA-N 0.000 claims description 8
- UCMIRNVEIXFBKS-UHFFFAOYSA-N beta-alanine Chemical compound NCCC(O)=O UCMIRNVEIXFBKS-UHFFFAOYSA-N 0.000 claims description 7
- 150000003384 small molecules Chemical class 0.000 claims description 7
- OGNSCSPNOLGXSM-UHFFFAOYSA-N (+/-)-DABA Natural products NCCC(N)C(O)=O OGNSCSPNOLGXSM-UHFFFAOYSA-N 0.000 claims description 6
- 150000001720 carbohydrates Chemical class 0.000 claims description 6
- 229960003692 gamma aminobutyric acid Drugs 0.000 claims description 6
- 150000003233 pyrroles Chemical class 0.000 claims description 6
- 241000283690 Bos taurus Species 0.000 claims description 5
- 241000282465 Canis Species 0.000 claims description 5
- 241000283073 Equus caballus Species 0.000 claims description 5
- 241000282324 Felis Species 0.000 claims description 5
- 241000288906 Primates Species 0.000 claims description 5
- 150000002632 lipids Chemical class 0.000 claims description 5
- 239000007788 liquid Substances 0.000 claims description 5
- 241001529936 Murinae Species 0.000 claims description 4
- 210000004102 animal cell Anatomy 0.000 claims description 4
- 125000005392 carboxamide group Chemical group NC(=O)* 0.000 claims description 4
- 230000007423 decrease Effects 0.000 claims description 4
- 239000003937 drug carrier Substances 0.000 claims description 4
- 102000000490 Mediator Complex Human genes 0.000 claims description 3
- 108010080991 Mediator Complex Proteins 0.000 claims description 3
- 229940000635 beta-alanine Drugs 0.000 claims description 3
- 125000004122 cyclic group Chemical group 0.000 claims description 3
- 230000009881 electrostatic interaction Effects 0.000 claims description 3
- 238000005411 Van der Waals force Methods 0.000 claims description 2
- 239000003550 marker Substances 0.000 claims description 2
- 108091093037 Peptide nucleic acid Proteins 0.000 claims 2
- 210000005260 human cell Anatomy 0.000 claims 1
- 241001465754 Metazoa Species 0.000 abstract description 15
- 201000010099 disease Diseases 0.000 abstract description 12
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 12
- 241000282414 Homo sapiens Species 0.000 abstract description 11
- 230000002194 synthesizing effect Effects 0.000 abstract description 7
- 239000003814 drug Substances 0.000 abstract description 6
- 241000282412 Homo Species 0.000 abstract description 4
- 239000000562 conjugate Substances 0.000 description 73
- 239000012190 activator Substances 0.000 description 66
- 239000002773 nucleotide Substances 0.000 description 63
- 125000003729 nucleotide group Chemical group 0.000 description 63
- 238000006243 chemical reaction Methods 0.000 description 52
- 102000004196 processed proteins & peptides Human genes 0.000 description 46
- 230000015572 biosynthetic process Effects 0.000 description 45
- 239000002585 base Substances 0.000 description 38
- 230000003213 activating effect Effects 0.000 description 34
- 235000018102 proteins Nutrition 0.000 description 33
- 102000004169 proteins and genes Human genes 0.000 description 33
- 238000003786 synthesis reaction Methods 0.000 description 32
- 230000002103 transcriptional effect Effects 0.000 description 32
- 230000006870 function Effects 0.000 description 31
- 230000003993 interaction Effects 0.000 description 30
- -1 designated AH Proteins 0.000 description 28
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 24
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 24
- 150000007970 thio esters Chemical class 0.000 description 24
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 23
- 238000006471 dimerization reaction Methods 0.000 description 23
- 230000004568 DNA-binding Effects 0.000 description 22
- 239000000243 solution Substances 0.000 description 22
- 238000003556 assay Methods 0.000 description 21
- 238000002474 experimental method Methods 0.000 description 21
- 238000004448 titration Methods 0.000 description 20
- 235000001014 amino acid Nutrition 0.000 description 18
- 238000013459 approach Methods 0.000 description 17
- 229940126214 compound 3 Drugs 0.000 description 17
- 238000011144 upstream manufacturing Methods 0.000 description 16
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 15
- 150000001413 amino acids Chemical class 0.000 description 15
- 238000009472 formulation Methods 0.000 description 15
- 239000000126 substance Substances 0.000 description 15
- SECXISVLQFMRJM-UHFFFAOYSA-N N-Methylpyrrolidone Chemical compound CN1CCCC1=O SECXISVLQFMRJM-UHFFFAOYSA-N 0.000 description 14
- 125000004429 atom Chemical group 0.000 description 13
- 239000002552 dosage form Substances 0.000 description 13
- 125000000539 amino acid group Chemical group 0.000 description 12
- 239000003446 ligand Substances 0.000 description 12
- 229920001223 polyethylene glycol Polymers 0.000 description 12
- 239000000047 product Substances 0.000 description 12
- 241000196324 Embryophyta Species 0.000 description 11
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 11
- 108091028043 Nucleic acid sequence Proteins 0.000 description 11
- 108700008625 Reporter Genes Proteins 0.000 description 11
- 230000000295 complement effect Effects 0.000 description 11
- 239000000499 gel Substances 0.000 description 11
- 230000008569 process Effects 0.000 description 11
- 239000007787 solid Substances 0.000 description 11
- QFLWZFQWSBQYPS-AWRAUJHKSA-N (3S)-3-[[(2S)-2-[[(2S)-2-[5-[(3aS,6aR)-2-oxo-1,3,3a,4,6,6a-hexahydrothieno[3,4-d]imidazol-4-yl]pentanoylamino]-3-methylbutanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-4-[1-bis(4-chlorophenoxy)phosphorylbutylamino]-4-oxobutanoic acid Chemical compound CCCC(NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H](NC(=O)CCCCC1SC[C@@H]2NC(=O)N[C@H]12)C(C)C)P(=O)(Oc1ccc(Cl)cc1)Oc1ccc(Cl)cc1 QFLWZFQWSBQYPS-AWRAUJHKSA-N 0.000 description 10
- 229940125782 compound 2 Drugs 0.000 description 10
- 239000013612 plasmid Substances 0.000 description 10
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 9
- 108010025076 Holoenzymes Proteins 0.000 description 9
- 101710081079 Minor spike protein H Proteins 0.000 description 9
- 108700026226 TATA Box Proteins 0.000 description 9
- 102000006290 Transcription Factor TFIID Human genes 0.000 description 9
- 108010083268 Transcription Factor TFIID Proteins 0.000 description 9
- 239000012634 fragment Substances 0.000 description 9
- 238000001840 matrix-assisted laser desorption--ionisation time-of-flight mass spectrometry Methods 0.000 description 9
- 244000052769 pathogen Species 0.000 description 9
- 238000010532 solid phase synthesis reaction Methods 0.000 description 9
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 8
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 8
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 8
- 239000011230 binding agent Substances 0.000 description 8
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 8
- 108020004999 messenger RNA Proteins 0.000 description 8
- 230000035699 permeability Effects 0.000 description 8
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 8
- 102100039556 Galectin-4 Human genes 0.000 description 7
- 108060001084 Luciferase Proteins 0.000 description 7
- 239000002202 Polyethylene glycol Substances 0.000 description 7
- 101710156592 Putative TATA-binding protein pB263R Proteins 0.000 description 7
- 102000009572 RNA Polymerase II Human genes 0.000 description 7
- 108010009460 RNA Polymerase II Proteins 0.000 description 7
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 7
- 102100040296 TATA-box-binding protein Human genes 0.000 description 7
- 101710145783 TATA-box-binding protein Proteins 0.000 description 7
- 239000000370 acceptor Substances 0.000 description 7
- AGEZXYOZHKGVCM-UHFFFAOYSA-N benzyl bromide Chemical compound BrCC1=CC=CC=C1 AGEZXYOZHKGVCM-UHFFFAOYSA-N 0.000 description 7
- 238000013461 design Methods 0.000 description 7
- 230000001404 mediated effect Effects 0.000 description 7
- 239000002904 solvent Substances 0.000 description 7
- 238000003860 storage Methods 0.000 description 7
- INKIPSJJIQBVKP-UHFFFAOYSA-N thiolane-2,5-dione Chemical compound O=C1CCC(=O)S1 INKIPSJJIQBVKP-UHFFFAOYSA-N 0.000 description 7
- 238000011282 treatment Methods 0.000 description 7
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 6
- 108010001515 Galectin 4 Proteins 0.000 description 6
- 239000005089 Luciferase Substances 0.000 description 6
- 206010028980 Neoplasm Diseases 0.000 description 6
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 6
- 235000010724 Wisteria floribunda Nutrition 0.000 description 6
- 108091006088 activator proteins Proteins 0.000 description 6
- 239000004480 active ingredient Substances 0.000 description 6
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 6
- 125000003275 alpha amino acid group Chemical group 0.000 description 6
- 239000000872 buffer Substances 0.000 description 6
- 210000004899 c-terminal region Anatomy 0.000 description 6
- 229940125898 compound 5 Drugs 0.000 description 6
- 230000021615 conjugation Effects 0.000 description 6
- 239000000284 extract Substances 0.000 description 6
- 230000002401 inhibitory effect Effects 0.000 description 6
- 230000000977 initiatory effect Effects 0.000 description 6
- 239000002062 molecular scaffold Substances 0.000 description 6
- 229920001184 polypeptide Polymers 0.000 description 6
- 150000003141 primary amines Chemical class 0.000 description 6
- 238000000746 purification Methods 0.000 description 6
- 230000009711 regulatory function Effects 0.000 description 6
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- 108010033040 Histones Proteins 0.000 description 5
- JGFZNNIVVJXRND-UHFFFAOYSA-N N,N-Diisopropylethylamine (DIPEA) Chemical compound CCN(C(C)C)C(C)C JGFZNNIVVJXRND-UHFFFAOYSA-N 0.000 description 5
- 239000004721 Polyphenylene oxide Substances 0.000 description 5
- 102000040945 Transcription factor Human genes 0.000 description 5
- 108091023040 Transcription factor Proteins 0.000 description 5
- 201000011510 cancer Diseases 0.000 description 5
- 235000014633 carbohydrates Nutrition 0.000 description 5
- 230000003197 catalytic effect Effects 0.000 description 5
- 230000001965 increasing effect Effects 0.000 description 5
- 230000005764 inhibitory process Effects 0.000 description 5
- 239000000543 intermediate Substances 0.000 description 5
- 239000012669 liquid formulation Substances 0.000 description 5
- 239000000463 material Substances 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 210000004940 nucleus Anatomy 0.000 description 5
- 230000001717 pathogenic effect Effects 0.000 description 5
- 239000000546 pharmaceutical excipient Substances 0.000 description 5
- 229920000570 polyether Polymers 0.000 description 5
- 229920000642 polymer Polymers 0.000 description 5
- 230000035755 proliferation Effects 0.000 description 5
- 238000004007 reversed phase HPLC Methods 0.000 description 5
- 239000007909 solid dosage form Substances 0.000 description 5
- 230000008685 targeting Effects 0.000 description 5
- RMVRSNDYEFQCLF-UHFFFAOYSA-N thiophenol Chemical compound SC1=CC=CC=C1 RMVRSNDYEFQCLF-UHFFFAOYSA-N 0.000 description 5
- 108091006106 transcriptional activators Proteins 0.000 description 5
- OXHNLMTVIGZXSG-UHFFFAOYSA-N 1-Methylpyrrole Chemical compound CN1C=CC=C1 OXHNLMTVIGZXSG-UHFFFAOYSA-N 0.000 description 4
- MCTWTZJPVLRJOU-UHFFFAOYSA-N 1-methyl-1H-imidazole Chemical compound CN1C=CN=C1 MCTWTZJPVLRJOU-UHFFFAOYSA-N 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 241000271566 Aves Species 0.000 description 4
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 4
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 4
- 102000003893 Histone acetyltransferases Human genes 0.000 description 4
- 108090000246 Histone acetyltransferases Proteins 0.000 description 4
- 108010047956 Nucleosomes Proteins 0.000 description 4
- 229920002472 Starch Polymers 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- 230000004071 biological effect Effects 0.000 description 4
- 230000033228 biological regulation Effects 0.000 description 4
- 239000002775 capsule Substances 0.000 description 4
- 239000004202 carbamide Substances 0.000 description 4
- 238000003776 cleavage reaction Methods 0.000 description 4
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 4
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 4
- 239000000539 dimer Substances 0.000 description 4
- 229940079593 drug Drugs 0.000 description 4
- 239000003623 enhancer Substances 0.000 description 4
- 229940093476 ethylene glycol Drugs 0.000 description 4
- 238000004128 high performance liquid chromatography Methods 0.000 description 4
- 208000015181 infectious disease Diseases 0.000 description 4
- 230000003834 intracellular effect Effects 0.000 description 4
- 210000001623 nucleosome Anatomy 0.000 description 4
- 239000000863 peptide conjugate Substances 0.000 description 4
- 239000012071 phase Substances 0.000 description 4
- 229920002401 polyacrylamide Polymers 0.000 description 4
- 239000000843 powder Substances 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- 235000019698 starch Nutrition 0.000 description 4
- 230000000638 stimulation Effects 0.000 description 4
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 4
- WWTBZEKOSBFBEM-SPWPXUSOSA-N (2s)-2-[[2-benzyl-3-[hydroxy-[(1r)-2-phenyl-1-(phenylmethoxycarbonylamino)ethyl]phosphoryl]propanoyl]amino]-3-(1h-indol-3-yl)propanoic acid Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)O)C(=O)C(CP(O)(=O)[C@H](CC=1C=CC=CC=1)NC(=O)OCC=1C=CC=CC=1)CC1=CC=CC=C1 WWTBZEKOSBFBEM-SPWPXUSOSA-N 0.000 description 3
- 241000251468 Actinopterygii Species 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 3
- 101710096438 DNA-binding protein Proteins 0.000 description 3
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- 108010058643 Fungal Proteins Proteins 0.000 description 3
- 241000233866 Fungi Species 0.000 description 3
- 102000006947 Histones Human genes 0.000 description 3
- OPFJDXRVMFKJJO-ZHHKINOHSA-N N-{[3-(2-benzamido-4-methyl-1,3-thiazol-5-yl)-pyrazol-5-yl]carbonyl}-G-dR-G-dD-dD-dD-NH2 Chemical compound S1C(C=2NN=C(C=2)C(=O)NCC(=O)N[C@H](CCCN=C(N)N)C(=O)NCC(=O)N[C@H](CC(O)=O)C(=O)N[C@H](CC(O)=O)C(=O)N[C@H](CC(O)=O)C(N)=O)=C(C)N=C1NC(=O)C1=CC=CC=C1 OPFJDXRVMFKJJO-ZHHKINOHSA-N 0.000 description 3
- DNIAPMSPPWPWGF-UHFFFAOYSA-N Propylene glycol Chemical compound CC(O)CO DNIAPMSPPWPWGF-UHFFFAOYSA-N 0.000 description 3
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 3
- 108020004682 Single-Stranded DNA Proteins 0.000 description 3
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 3
- 102000018068 TATA-Binding Protein Associated Factors Human genes 0.000 description 3
- 108010091120 TATA-Binding Protein Associated Factors Proteins 0.000 description 3
- YTPLMLYBLZKORZ-UHFFFAOYSA-N Thiophene Chemical class C=1C=CSC=1 YTPLMLYBLZKORZ-UHFFFAOYSA-N 0.000 description 3
- 241000700605 Viruses Species 0.000 description 3
- 150000007513 acids Chemical class 0.000 description 3
- 229960005305 adenosine Drugs 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 238000000211 autoradiogram Methods 0.000 description 3
- 239000001506 calcium phosphate Substances 0.000 description 3
- 229910000389 calcium phosphate Inorganic materials 0.000 description 3
- 235000011010 calcium phosphates Nutrition 0.000 description 3
- 150000003857 carboxamides Chemical class 0.000 description 3
- 239000000969 carrier Substances 0.000 description 3
- 238000000423 cell based assay Methods 0.000 description 3
- 229920002678 cellulose Polymers 0.000 description 3
- 235000010980 cellulose Nutrition 0.000 description 3
- 238000012512 characterization method Methods 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 229940126086 compound 21 Drugs 0.000 description 3
- 229940126208 compound 22 Drugs 0.000 description 3
- 235000018417 cysteine Nutrition 0.000 description 3
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 3
- 239000013578 denaturing buffer Substances 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 230000018109 developmental process Effects 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 239000012091 fetal bovine serum Substances 0.000 description 3
- 125000000524 functional group Chemical group 0.000 description 3
- 238000000099 in vitro assay Methods 0.000 description 3
- 230000000670 limiting effect Effects 0.000 description 3
- 210000004962 mammalian cell Anatomy 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 239000012528 membrane Substances 0.000 description 3
- 239000000178 monomer Substances 0.000 description 3
- 239000003921 oil Substances 0.000 description 3
- 235000019198 oils Nutrition 0.000 description 3
- 238000005580 one pot reaction Methods 0.000 description 3
- 239000006187 pill Substances 0.000 description 3
- 230000003389 potentiating effect Effects 0.000 description 3
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 239000011541 reaction mixture Substances 0.000 description 3
- 108020003175 receptors Proteins 0.000 description 3
- 102000005962 receptors Human genes 0.000 description 3
- 230000007115 recruitment Effects 0.000 description 3
- 230000014493 regulation of gene expression Effects 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 239000008107 starch Substances 0.000 description 3
- 239000011550 stock solution Substances 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 150000005846 sugar alcohols Polymers 0.000 description 3
- 238000010189 synthetic method Methods 0.000 description 3
- 239000003826 tablet Substances 0.000 description 3
- 108091006107 transcriptional repressors Proteins 0.000 description 3
- 238000001890 transfection Methods 0.000 description 3
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 3
- AAWZDTNXLSGCEK-LNVDRNJUSA-N (3r,5r)-1,3,4,5-tetrahydroxycyclohexane-1-carboxylic acid Chemical class O[C@@H]1CC(O)(C(O)=O)C[C@@H](O)C1O AAWZDTNXLSGCEK-LNVDRNJUSA-N 0.000 description 2
- PIYNUZCGMLCXKJ-UHFFFAOYSA-N 1,4-dioxane-2,6-dione Chemical compound O=C1COCC(=O)O1 PIYNUZCGMLCXKJ-UHFFFAOYSA-N 0.000 description 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- VTYYLEPIZMXCLO-UHFFFAOYSA-L Calcium carbonate Chemical compound [Ca+2].[O-]C([O-])=O VTYYLEPIZMXCLO-UHFFFAOYSA-L 0.000 description 2
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 2
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 2
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 2
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 2
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 2
- FEWJPZIEWOKRBE-JCYAYHJZSA-N Dextrotartaric acid Chemical compound OC(=O)[C@H](O)[C@@H](O)C(O)=O FEWJPZIEWOKRBE-JCYAYHJZSA-N 0.000 description 2
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 2
- IAZDPXIOMUYVGZ-WFGJKAKNSA-N Dimethyl sulfoxide Chemical compound [2H]C([2H])([2H])S(=O)C([2H])([2H])[2H] IAZDPXIOMUYVGZ-WFGJKAKNSA-N 0.000 description 2
- AOJJSUZBOXZQNB-TZSSRYMLSA-N Doxorubicin Chemical compound O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(=O)CO)[C@H]1C[C@H](N)[C@H](O)[C@H](C)O1 AOJJSUZBOXZQNB-TZSSRYMLSA-N 0.000 description 2
- 102000012199 E3 ubiquitin-protein ligase Mdm2 Human genes 0.000 description 2
- 108050002772 E3 ubiquitin-protein ligase Mdm2 Proteins 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 108050007372 Fibroblast Growth Factor Proteins 0.000 description 2
- 102000018233 Fibroblast Growth Factor Human genes 0.000 description 2
- YLQBMQCUIZJEEH-UHFFFAOYSA-N Furan Chemical class C=1C=COC=1 YLQBMQCUIZJEEH-UHFFFAOYSA-N 0.000 description 2
- 108010010803 Gelatin Proteins 0.000 description 2
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- 235000010469 Glycine max Nutrition 0.000 description 2
- 229920002527 Glycogen Polymers 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- SQUHHTBVTRBESD-UHFFFAOYSA-N Hexa-Ac-myo-Inositol Natural products CC(=O)OC1C(OC(C)=O)C(OC(C)=O)C(OC(C)=O)C(OC(C)=O)C1OC(C)=O SQUHHTBVTRBESD-UHFFFAOYSA-N 0.000 description 2
- 102000003964 Histone deacetylase Human genes 0.000 description 2
- 108090000353 Histone deacetylase Proteins 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- OFOBLEOULBTSOW-UHFFFAOYSA-N Malonic acid Chemical compound OC(=O)CC(O)=O OFOBLEOULBTSOW-UHFFFAOYSA-N 0.000 description 2
- 229930195725 Mannitol Natural products 0.000 description 2
- 240000004658 Medicago sativa Species 0.000 description 2
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 description 2
- AFVFQIVMOAPDHO-UHFFFAOYSA-N Methanesulfonic acid Chemical compound CS(O)(=O)=O AFVFQIVMOAPDHO-UHFFFAOYSA-N 0.000 description 2
- 101710163270 Nuclease Proteins 0.000 description 2
- 102000043276 Oncogene Human genes 0.000 description 2
- 108700020796 Oncogene Proteins 0.000 description 2
- 240000007594 Oryza sativa Species 0.000 description 2
- 235000007164 Oryza sativa Nutrition 0.000 description 2
- 108091081548 Palindromic sequence Proteins 0.000 description 2
- 102000035195 Peptidases Human genes 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 108010034634 Repressor Proteins Proteins 0.000 description 2
- 102000009661 Repressor Proteins Human genes 0.000 description 2
- 240000006394 Sorghum bicolor Species 0.000 description 2
- 235000011684 Sorghum saccharatum Nutrition 0.000 description 2
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- 108091008874 T cell receptors Proteins 0.000 description 2
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 description 2
- 102000006289 Transcription Factor TFIIA Human genes 0.000 description 2
- 108010083262 Transcription Factor TFIIA Proteins 0.000 description 2
- 108700009124 Transcription Initiation Site Proteins 0.000 description 2
- 108020004566 Transfer RNA Proteins 0.000 description 2
- 235000021307 Triticum Nutrition 0.000 description 2
- 241000209140 Triticum Species 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 240000008042 Zea mays Species 0.000 description 2
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 2
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 2
- 230000002378 acidificating effect Effects 0.000 description 2
- 125000001931 aliphatic group Chemical group 0.000 description 2
- 125000000217 alkyl group Chemical group 0.000 description 2
- 230000029936 alkylation Effects 0.000 description 2
- 238000005804 alkylation reaction Methods 0.000 description 2
- 150000001412 amines Chemical class 0.000 description 2
- 230000003466 anti-cipated effect Effects 0.000 description 2
- 239000000427 antigen Substances 0.000 description 2
- 108091007433 antigens Proteins 0.000 description 2
- 102000036639 antigens Human genes 0.000 description 2
- 239000012131 assay buffer Substances 0.000 description 2
- XRWSZZJLZRKHHD-WVWIJVSJSA-N asunaprevir Chemical compound O=C([C@@H]1C[C@H](CN1C(=O)[C@@H](NC(=O)OC(C)(C)C)C(C)(C)C)OC1=NC=C(C2=CC=C(Cl)C=C21)OC)N[C@]1(C(=O)NS(=O)(=O)C2CC2)C[C@H]1C=C XRWSZZJLZRKHHD-WVWIJVSJSA-N 0.000 description 2
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 2
- SRSXLGNVWSONIS-UHFFFAOYSA-N benzenesulfonic acid Chemical class OS(=O)(=O)C1=CC=CC=C1 SRSXLGNVWSONIS-UHFFFAOYSA-N 0.000 description 2
- KGNDCEVUMONOKF-UGPLYTSKSA-N benzyl n-[(2r)-1-[(2s,4r)-2-[[(2s)-6-amino-1-(1,3-benzoxazol-2-yl)-1,1-dihydroxyhexan-2-yl]carbamoyl]-4-[(4-methylphenyl)methoxy]pyrrolidin-1-yl]-1-oxo-4-phenylbutan-2-yl]carbamate Chemical compound C1=CC(C)=CC=C1CO[C@H]1CN(C(=O)[C@@H](CCC=2C=CC=CC=2)NC(=O)OCC=2C=CC=CC=2)[C@H](C(=O)N[C@@H](CCCCN)C(O)(O)C=2OC3=CC=CC=C3N=2)C1 KGNDCEVUMONOKF-UGPLYTSKSA-N 0.000 description 2
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 2
- 239000001110 calcium chloride Substances 0.000 description 2
- 229910001628 calcium chloride Inorganic materials 0.000 description 2
- 244000309466 calf Species 0.000 description 2
- 229910052799 carbon Inorganic materials 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 210000000170 cell membrane Anatomy 0.000 description 2
- 239000001913 cellulose Substances 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 229940125833 compound 23 Drugs 0.000 description 2
- 229940125961 compound 24 Drugs 0.000 description 2
- 230000001268 conjugating effect Effects 0.000 description 2
- 235000005822 corn Nutrition 0.000 description 2
- 210000000805 cytoplasm Anatomy 0.000 description 2
- 229940104302 cytosine Drugs 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- JJMDCOVWQOJGCB-UHFFFAOYSA-N delta-aminovaleric acid Natural products [NH3+]CCCCC([O-])=O JJMDCOVWQOJGCB-UHFFFAOYSA-N 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 238000010494 dissociation reaction Methods 0.000 description 2
- 230000005593 dissociations Effects 0.000 description 2
- 238000009510 drug design Methods 0.000 description 2
- 239000003995 emulsifying agent Substances 0.000 description 2
- 239000003925 fat Substances 0.000 description 2
- 230000002496 gastric effect Effects 0.000 description 2
- 239000008273 gelatin Substances 0.000 description 2
- 229920000159 gelatin Polymers 0.000 description 2
- 235000019322 gelatine Nutrition 0.000 description 2
- 235000011852 gelatine desserts Nutrition 0.000 description 2
- 102000034356 gene-regulatory proteins Human genes 0.000 description 2
- 108091006104 gene-regulatory proteins Proteins 0.000 description 2
- 229940096919 glycogen Drugs 0.000 description 2
- 239000005090 green fluorescent protein Substances 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- 239000003102 growth factor Substances 0.000 description 2
- 125000005842 heteroatom Chemical group 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 230000006951 hyperphosphorylation Effects 0.000 description 2
- 230000001976 improved effect Effects 0.000 description 2
- 238000012750 in vivo screening Methods 0.000 description 2
- 230000002757 inflammatory effect Effects 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- CDAISMWEOUEBRE-GPIVLXJGSA-N inositol Chemical compound O[C@H]1[C@H](O)[C@@H](O)[C@H](O)[C@H](O)[C@@H]1O CDAISMWEOUEBRE-GPIVLXJGSA-N 0.000 description 2
- 229960000367 inositol Drugs 0.000 description 2
- 238000007918 intramuscular administration Methods 0.000 description 2
- 238000001990 intravenous administration Methods 0.000 description 2
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 description 2
- 239000008297 liquid dosage form Substances 0.000 description 2
- 239000012160 loading buffer Substances 0.000 description 2
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 2
- 238000012423 maintenance Methods 0.000 description 2
- 239000000594 mannitol Substances 0.000 description 2
- 235000010355 mannitol Nutrition 0.000 description 2
- 230000003278 mimic effect Effects 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 230000000051 modifying effect Effects 0.000 description 2
- 229930014626 natural product Natural products 0.000 description 2
- IDBIFFKSXLYUOT-UHFFFAOYSA-N netropsin Chemical compound C1=C(C(=O)NCCC(N)=N)N(C)C=C1NC(=O)C1=CC(NC(=O)CN=C(N)N)=CN1C IDBIFFKSXLYUOT-UHFFFAOYSA-N 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 239000003960 organic solvent Substances 0.000 description 2
- 229910052760 oxygen Inorganic materials 0.000 description 2
- 239000001301 oxygen Substances 0.000 description 2
- 238000004806 packaging method and process Methods 0.000 description 2
- 239000000816 peptidomimetic Substances 0.000 description 2
- 238000003322 phosphorimaging Methods 0.000 description 2
- 230000035790 physiological processes and functions Effects 0.000 description 2
- 230000029279 positive regulation of transcription, DNA-dependent Effects 0.000 description 2
- 239000008057 potassium phosphate buffer Substances 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 230000002265 prevention Effects 0.000 description 2
- 230000002685 pulmonary effect Effects 0.000 description 2
- 239000000376 reactant Substances 0.000 description 2
- 238000010188 recombinant method Methods 0.000 description 2
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 2
- 229920005989 resin Polymers 0.000 description 2
- 239000011347 resin Substances 0.000 description 2
- 206010039073 rheumatoid arthritis Diseases 0.000 description 2
- 108020004418 ribosomal RNA Proteins 0.000 description 2
- 235000009566 rice Nutrition 0.000 description 2
- 229920006395 saturated elastomer Polymers 0.000 description 2
- CDAISMWEOUEBRE-UHFFFAOYSA-N scyllo-inosotol Natural products OC1C(O)C(O)C(O)C(O)C1O CDAISMWEOUEBRE-UHFFFAOYSA-N 0.000 description 2
- 230000019491 signal transduction Effects 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 239000000600 sorbitol Substances 0.000 description 2
- 238000005507 spraying Methods 0.000 description 2
- 230000004936 stimulating effect Effects 0.000 description 2
- 235000000346 sugar Nutrition 0.000 description 2
- 150000008163 sugars Chemical class 0.000 description 2
- 229910052717 sulfur Inorganic materials 0.000 description 2
- 239000011593 sulfur Substances 0.000 description 2
- 230000001225 therapeutic effect Effects 0.000 description 2
- 239000002562 thickening agent Substances 0.000 description 2
- 229940113082 thymine Drugs 0.000 description 2
- 210000001541 thymus gland Anatomy 0.000 description 2
- 230000003827 upregulation Effects 0.000 description 2
- 230000003612 virological effect Effects 0.000 description 2
- 230000001018 virulence Effects 0.000 description 2
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 description 1
- GHYOCDFICYLMRF-UTIIJYGPSA-N (2S,3R)-N-[(2S)-3-(cyclopenten-1-yl)-1-[(2R)-2-methyloxiran-2-yl]-1-oxopropan-2-yl]-3-hydroxy-3-(4-methoxyphenyl)-2-[[(2S)-2-[(2-morpholin-4-ylacetyl)amino]propanoyl]amino]propanamide Chemical compound C1(=CCCC1)C[C@@H](C(=O)[C@@]1(OC1)C)NC([C@H]([C@@H](C1=CC=C(C=C1)OC)O)NC([C@H](C)NC(CN1CCOCC1)=O)=O)=O GHYOCDFICYLMRF-UTIIJYGPSA-N 0.000 description 1
- VZQHRKZCAZCACO-PYJNHQTQSA-N (2s)-2-[[(2s)-2-[2-[[(2s)-2-[[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]amino]propanoyl]amino]prop-2-enoylamino]-3-methylbutanoyl]amino]propanoic acid Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)C(=C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VZQHRKZCAZCACO-PYJNHQTQSA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- UGUHFDPGDQDVGX-UHFFFAOYSA-N 1,2,3-thiadiazole Chemical class C1=CSN=N1 UGUHFDPGDQDVGX-UHFFFAOYSA-N 0.000 description 1
- BBVIDBNAYOIXOE-UHFFFAOYSA-N 1,2,4-oxadiazole Chemical class C=1N=CON=1 BBVIDBNAYOIXOE-UHFFFAOYSA-N 0.000 description 1
- YGTAZGSLCXNBQL-UHFFFAOYSA-N 1,2,4-thiadiazole Chemical class C=1N=CSN=1 YGTAZGSLCXNBQL-UHFFFAOYSA-N 0.000 description 1
- UDGKZGLPXCRRAM-UHFFFAOYSA-N 1,2,5-thiadiazole Chemical class C=1C=NSN=1 UDGKZGLPXCRRAM-UHFFFAOYSA-N 0.000 description 1
- FKASFBLJDCHBNZ-UHFFFAOYSA-N 1,3,4-oxadiazole Chemical class C1=NN=CO1 FKASFBLJDCHBNZ-UHFFFAOYSA-N 0.000 description 1
- MBIZXFATKUQOOA-UHFFFAOYSA-N 1,3,4-thiadiazole Chemical class C1=NN=CS1 MBIZXFATKUQOOA-UHFFFAOYSA-N 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- 238000005160 1H NMR spectroscopy Methods 0.000 description 1
- QWENRTYMTSOGBR-UHFFFAOYSA-N 1H-1,2,3-Triazole Chemical class C=1C=NNN=1 QWENRTYMTSOGBR-UHFFFAOYSA-N 0.000 description 1
- ZPOROQKDAPEMOL-UHFFFAOYSA-N 1h-pyrrol-3-ol Chemical compound OC=1C=CNC=1 ZPOROQKDAPEMOL-UHFFFAOYSA-N 0.000 description 1
- KSXTUUUQYQYKCR-LQDDAWAPSA-M 2,3-bis[[(z)-octadec-9-enoyl]oxy]propyl-trimethylazanium;chloride Chemical compound [Cl-].CCCCCCCC\C=C/CCCCCCCC(=O)OCC(C[N+](C)(C)C)OC(=O)CCCCCCC\C=C/CCCCCCCC KSXTUUUQYQYKCR-LQDDAWAPSA-M 0.000 description 1
- PIINGYXNCHTJTF-UHFFFAOYSA-N 2-(2-azaniumylethylamino)acetate Chemical group NCCNCC(O)=O PIINGYXNCHTJTF-UHFFFAOYSA-N 0.000 description 1
- FALRKNHUBBKYCC-UHFFFAOYSA-N 2-(chloromethyl)pyridine-3-carbonitrile Chemical compound ClCC1=NC=CC=C1C#N FALRKNHUBBKYCC-UHFFFAOYSA-N 0.000 description 1
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- DJQYYYCQOZMCRC-UHFFFAOYSA-N 2-aminopropane-1,3-dithiol Chemical compound SCC(N)CS DJQYYYCQOZMCRC-UHFFFAOYSA-N 0.000 description 1
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical group OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 description 1
- RVEQXKRJJIVQII-UHFFFAOYSA-N 3-hydroxy-n-methyl-1h-pyrrole-2-carboxamide Chemical compound CNC(=O)C=1NC=CC=1O RVEQXKRJJIVQII-UHFFFAOYSA-N 0.000 description 1
- NSPMIYGKQJPBQR-UHFFFAOYSA-N 4H-1,2,4-triazole Chemical class C=1N=CNN=1 NSPMIYGKQJPBQR-UHFFFAOYSA-N 0.000 description 1
- UHPMCKVQTMMPCG-UHFFFAOYSA-N 5,8-dihydroxy-2-methoxy-6-methyl-7-(2-oxopropyl)naphthalene-1,4-dione Chemical compound CC1=C(CC(C)=O)C(O)=C2C(=O)C(OC)=CC(=O)C2=C1O UHPMCKVQTMMPCG-UHFFFAOYSA-N 0.000 description 1
- STQGQHZAVUOBTE-UHFFFAOYSA-N 7-Cyan-hept-2t-en-4,6-diinsaeure Natural products C1=2C(O)=C3C(=O)C=4C(OC)=CC=CC=4C(=O)C3=C(O)C=2CC(O)(C(C)=O)CC1OC1CC(N)C(O)C(C)O1 STQGQHZAVUOBTE-UHFFFAOYSA-N 0.000 description 1
- HBAQYPYDRFILMT-UHFFFAOYSA-N 8-[3-(1-cyclopropylpyrazol-4-yl)-1H-pyrazolo[4,3-d]pyrimidin-5-yl]-3-methyl-3,8-diazabicyclo[3.2.1]octan-2-one Chemical class C1(CC1)N1N=CC(=C1)C1=NNC2=C1N=C(N=C2)N1C2C(N(CC1CC2)C)=O HBAQYPYDRFILMT-UHFFFAOYSA-N 0.000 description 1
- 241000589291 Acinetobacter Species 0.000 description 1
- 102400000069 Activation peptide Human genes 0.000 description 1
- 101800001401 Activation peptide Proteins 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 241000588986 Alcaligenes Species 0.000 description 1
- 108700028369 Alleles Proteins 0.000 description 1
- 241000239223 Arachnida Species 0.000 description 1
- 241000228212 Aspergillus Species 0.000 description 1
- 201000001320 Atherosclerosis Diseases 0.000 description 1
- 208000023275 Autoimmune disease Diseases 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241001148536 Bacteroides sp. Species 0.000 description 1
- 102100023995 Beta-nerve growth factor Human genes 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 101150035324 CDK9 gene Proteins 0.000 description 1
- 241000589876 Campylobacter Species 0.000 description 1
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 1
- 239000004215 Carbon black (E152) Substances 0.000 description 1
- 201000009030 Carcinoma Diseases 0.000 description 1
- 206010008609 Cholangitis sclerosing Diseases 0.000 description 1
- 241000588923 Citrobacter Species 0.000 description 1
- 241000193403 Clostridium Species 0.000 description 1
- 102000008169 Co-Repressor Proteins Human genes 0.000 description 1
- 108010060434 Co-Repressor Proteins Proteins 0.000 description 1
- 208000026372 Congenital cystic kidney disease Diseases 0.000 description 1
- AAWZDTNXLSGCEK-UHFFFAOYSA-N Cordycepinsaeure Natural products OC1CC(O)(C(O)=O)CC(O)C1O AAWZDTNXLSGCEK-UHFFFAOYSA-N 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 241001337994 Cryptococcus <scale insect> Species 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 1
- WEAHRLBPCANXCN-UHFFFAOYSA-N Daunomycin Natural products CCC1(O)CC(OC2CC(N)C(O)C(C)O2)c3cc4C(=O)c5c(OC)cccc5C(=O)c4c(O)c3C1 WEAHRLBPCANXCN-UHFFFAOYSA-N 0.000 description 1
- MYMOFIZGZYHOMD-UHFFFAOYSA-N Dioxygen Chemical compound O=O MYMOFIZGZYHOMD-UHFFFAOYSA-N 0.000 description 1
- 108010066486 EGF Family of Proteins Proteins 0.000 description 1
- 102000018386 EGF Family of Proteins Human genes 0.000 description 1
- 241000588914 Enterobacter Species 0.000 description 1
- 241000194033 Enterococcus Species 0.000 description 1
- 206010053155 Epigastric discomfort Diseases 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 208000007217 Esophageal Stenosis Diseases 0.000 description 1
- 108010037362 Extracellular Matrix Proteins Proteins 0.000 description 1
- 102000010834 Extracellular Matrix Proteins Human genes 0.000 description 1
- 208000004248 Familial Primary Pulmonary Hypertension Diseases 0.000 description 1
- 208000000571 Fibrocystic breast disease Diseases 0.000 description 1
- 206010016654 Fibrosis Diseases 0.000 description 1
- 241000223218 Fusarium Species 0.000 description 1
- 102000006580 General Transcription Factors Human genes 0.000 description 1
- 108010008945 General Transcription Factors Proteins 0.000 description 1
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Natural products NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 239000007995 HEPES buffer Substances 0.000 description 1
- 241000606790 Haemophilus Species 0.000 description 1
- 208000001204 Hashimoto Disease Diseases 0.000 description 1
- 208000030836 Hashimoto thyroiditis Diseases 0.000 description 1
- 241000589989 Helicobacter Species 0.000 description 1
- 108010068250 Herpes Simplex Virus Protein Vmw65 Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 102100039869 Histone H2B type F-S Human genes 0.000 description 1
- 101000608765 Homo sapiens Galectin-4 Proteins 0.000 description 1
- 101001035372 Homo sapiens Histone H2B type F-S Proteins 0.000 description 1
- 101000918983 Homo sapiens Neutrophil defensin 1 Proteins 0.000 description 1
- 240000005979 Hordeum vulgare Species 0.000 description 1
- 235000007340 Hordeum vulgare Nutrition 0.000 description 1
- 208000014919 IgG4-related retroperitoneal fibrosis Diseases 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- 108010002352 Interleukin-1 Proteins 0.000 description 1
- 102000000589 Interleukin-1 Human genes 0.000 description 1
- 108010002350 Interleukin-2 Proteins 0.000 description 1
- 108010063738 Interleukins Proteins 0.000 description 1
- 102000015696 Interleukins Human genes 0.000 description 1
- 208000002260 Keloid Diseases 0.000 description 1
- 206010023330 Keloid scar Diseases 0.000 description 1
- 150000008575 L-amino acids Chemical class 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 240000007472 Leucaena leucocephala Species 0.000 description 1
- 235000010643 Leucaena leucocephala Nutrition 0.000 description 1
- 239000000232 Lipid Bilayer Substances 0.000 description 1
- 241000186781 Listeria Species 0.000 description 1
- 206010067125 Liver injury Diseases 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 241000192041 Micrococcus Species 0.000 description 1
- 229930192392 Mitomycin Natural products 0.000 description 1
- 206010028124 Mucosal ulceration Diseases 0.000 description 1
- 241000204031 Mycoplasma Species 0.000 description 1
- 201000003793 Myelodysplastic syndrome Diseases 0.000 description 1
- NWIBSHFKIJFRCO-WUDYKRTCSA-N Mytomycin Chemical compound C1N2C(C(C(C)=C(N)C3=O)=O)=C3[C@@H](COC(N)=O)[C@@]2(OC)[C@@H]2[C@H]1N2 NWIBSHFKIJFRCO-WUDYKRTCSA-N 0.000 description 1
- JQLLUCKSSWNYKG-UHFFFAOYSA-N NCCC(=O)C(N)=O Chemical class NCCC(=O)C(N)=O JQLLUCKSSWNYKG-UHFFFAOYSA-N 0.000 description 1
- 238000005481 NMR spectroscopy Methods 0.000 description 1
- 206010028813 Nausea Diseases 0.000 description 1
- MRWXACSTFXYYMV-UHFFFAOYSA-N Nebularine Natural products OC1C(O)C(CO)OC1N1C2=NC=NC=C2N=C1 MRWXACSTFXYYMV-UHFFFAOYSA-N 0.000 description 1
- 241000588653 Neisseria Species 0.000 description 1
- 108010025020 Nerve Growth Factor Proteins 0.000 description 1
- 108010042309 Netropsin Proteins 0.000 description 1
- 208000009905 Neurofibromatoses Diseases 0.000 description 1
- 102100029494 Neutrophil defensin 1 Human genes 0.000 description 1
- 102000007399 Nuclear hormone receptor Human genes 0.000 description 1
- 108020005497 Nuclear hormone receptor Proteins 0.000 description 1
- 229920001007 Nylon 4 Polymers 0.000 description 1
- 206010030194 Oesophageal stenosis Diseases 0.000 description 1
- 206010033128 Ovarian cancer Diseases 0.000 description 1
- 206010061535 Ovarian neoplasm Diseases 0.000 description 1
- ZCQWOFVYLHDMMC-UHFFFAOYSA-N Oxazole Chemical class C1=COC=N1 ZCQWOFVYLHDMMC-UHFFFAOYSA-N 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 241001236817 Paecilomyces <Clavicipitaceae> Species 0.000 description 1
- 102000016387 Pancreatic elastase Human genes 0.000 description 1
- 108010067372 Pancreatic elastase Proteins 0.000 description 1
- 206010033661 Pancytopenia Diseases 0.000 description 1
- 241000191992 Peptostreptococcus Species 0.000 description 1
- 108700020962 Peroxidase Proteins 0.000 description 1
- 102000003992 Peroxidases Human genes 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 206010036049 Polycystic ovaries Diseases 0.000 description 1
- 239000004698 Polyethylene Substances 0.000 description 1
- 239000004743 Polypropylene Substances 0.000 description 1
- 241000605861 Prevotella Species 0.000 description 1
- 241000186429 Propionibacterium Species 0.000 description 1
- 102000055027 Protein Methyltransferases Human genes 0.000 description 1
- 108700040121 Protein Methyltransferases Proteins 0.000 description 1
- 241000588769 Proteus <enterobacteria> Species 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 201000004681 Psoriasis Diseases 0.000 description 1
- 208000028017 Psychotic disease Diseases 0.000 description 1
- 206010064911 Pulmonary arterial hypertension Diseases 0.000 description 1
- WTKZEGDFNFYCGP-UHFFFAOYSA-N Pyrazole Chemical class C=1C=NNC=1 WTKZEGDFNFYCGP-UHFFFAOYSA-N 0.000 description 1
- AAWZDTNXLSGCEK-ZHQZDSKASA-N Quinic acid Natural products O[C@H]1CC(O)(C(O)=O)C[C@H](O)C1O AAWZDTNXLSGCEK-ZHQZDSKASA-N 0.000 description 1
- 102000017143 RNA Polymerase I Human genes 0.000 description 1
- 108010013845 RNA Polymerase I Proteins 0.000 description 1
- 102000014450 RNA Polymerase III Human genes 0.000 description 1
- 108010078067 RNA Polymerase III Proteins 0.000 description 1
- 206010063837 Reperfusion injury Diseases 0.000 description 1
- 108700025701 Retinoblastoma Genes Proteins 0.000 description 1
- 206010038979 Retroperitoneal fibrosis Diseases 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- 241000607142 Salmonella Species 0.000 description 1
- 206010039491 Sarcoma Diseases 0.000 description 1
- 206010039710 Scleroderma Diseases 0.000 description 1
- 206010040070 Septic Shock Diseases 0.000 description 1
- 241000607768 Shigella Species 0.000 description 1
- 206010058141 Skin graft rejection Diseases 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- 241000194017 Streptococcus Species 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- 208000007271 Substance Withdrawal Syndrome Diseases 0.000 description 1
- 102000019197 Superoxide Dismutase Human genes 0.000 description 1
- 108010012715 Superoxide dismutase Proteins 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- FEWJPZIEWOKRBE-UHFFFAOYSA-N Tartaric acid Natural products [H+].[H+].[O-]C(=O)C(O)C(O)C([O-])=O FEWJPZIEWOKRBE-UHFFFAOYSA-N 0.000 description 1
- 108010017842 Telomerase Proteins 0.000 description 1
- FZWLAAWBMGSTSO-UHFFFAOYSA-N Thiazole Chemical class C1=CSC=N1 FZWLAAWBMGSTSO-UHFFFAOYSA-N 0.000 description 1
- 102000004408 Transcription factor TFIIB Human genes 0.000 description 1
- 108090000941 Transcription factor TFIIB Proteins 0.000 description 1
- 101710195626 Transcriptional activator protein Proteins 0.000 description 1
- 108010009583 Transforming Growth Factors Proteins 0.000 description 1
- 102000009618 Transforming Growth Factors Human genes 0.000 description 1
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 1
- 208000026911 Tuberous sclerosis complex Diseases 0.000 description 1
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 1
- VGQOVCHZGQWAOI-UHFFFAOYSA-N UNPD55612 Natural products N1C(O)C2CC(C=CC(N)=O)=CN2C(=O)C2=CC=C(C)C(O)=C12 VGQOVCHZGQWAOI-UHFFFAOYSA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 208000014070 Vestibular schwannoma Diseases 0.000 description 1
- 241000607598 Vibrio Species 0.000 description 1
- 238000002441 X-ray diffraction Methods 0.000 description 1
- TVXBFESIOXBWNM-UHFFFAOYSA-N Xylitol Natural products OCCC(O)C(O)C(O)CCO TVXBFESIOXBWNM-UHFFFAOYSA-N 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- DILVWDXOASDRPN-MROZADKFSA-N [(2r,3r,4r)-3,4,5-trihydroxy-1-oxopentan-2-yl] dihydrogen phosphate Chemical group OC[C@@H](O)[C@@H](O)[C@H](C=O)OP(O)(O)=O DILVWDXOASDRPN-MROZADKFSA-N 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 235000011054 acetic acid Nutrition 0.000 description 1
- 150000001242 acetic acid derivatives Chemical class 0.000 description 1
- 125000002777 acetyl group Chemical group [H]C([H])([H])C(*)=O 0.000 description 1
- 208000004064 acoustic neuroma Diseases 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 125000002252 acyl group Chemical group 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 239000000853 adhesive Substances 0.000 description 1
- 230000001070 adhesive effect Effects 0.000 description 1
- 150000001299 aldehydes Chemical class 0.000 description 1
- 150000001338 aliphatic hydrocarbons Chemical class 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- 208000026935 allergic disease Diseases 0.000 description 1
- HDTRYLNUVZCQOY-LIZSDCNHSA-N alpha,alpha-trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 1
- 150000001371 alpha-amino acids Chemical class 0.000 description 1
- 235000008206 alpha-amino acids Nutrition 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 150000001408 amides Chemical class 0.000 description 1
- 150000001409 amidines Chemical class 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 239000003708 ampul Substances 0.000 description 1
- VGQOVCHZGQWAOI-HYUHUPJXSA-N anthramycin Chemical compound N1[C@@H](O)[C@@H]2CC(\C=C\C(N)=O)=CN2C(=O)C2=CC=C(C)C(O)=C12 VGQOVCHZGQWAOI-HYUHUPJXSA-N 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000000844 anti-bacterial effect Effects 0.000 description 1
- 230000000843 anti-fungal effect Effects 0.000 description 1
- 230000002924 anti-infective effect Effects 0.000 description 1
- 229940124599 anti-inflammatory drug Drugs 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 238000012458 antifungal assay Methods 0.000 description 1
- 230000001640 apoptogenic effect Effects 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- 239000008365 aqueous carrier Substances 0.000 description 1
- 239000013011 aqueous formulation Substances 0.000 description 1
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 1
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 235000010323 ascorbic acid Nutrition 0.000 description 1
- 125000003289 ascorbyl group Chemical class [H]O[C@@]([H])(C([H])([H])O*)[C@@]1([H])OC(=O)C(O*)=C1O* 0.000 description 1
- 244000052616 bacterial pathogen Species 0.000 description 1
- 230000004888 barrier function Effects 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 229940092714 benzenesulfonic acid Drugs 0.000 description 1
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 1
- 238000002306 biochemical method Methods 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- KGBXLFKZBHKPEV-UHFFFAOYSA-N boric acid Chemical compound OB(O)O KGBXLFKZBHKPEV-UHFFFAOYSA-N 0.000 description 1
- 239000004327 boric acid Substances 0.000 description 1
- 208000011803 breast fibrocystic disease Diseases 0.000 description 1
- 150000001649 bromium compounds Chemical class 0.000 description 1
- 238000002815 broth microdilution Methods 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 230000003139 buffering effect Effects 0.000 description 1
- 239000004067 bulking agent Substances 0.000 description 1
- 244000309464 bull Species 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 229910000019 calcium carbonate Inorganic materials 0.000 description 1
- 239000007894 caplet Substances 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 150000001732 carboxylic acid derivatives Chemical class 0.000 description 1
- 210000004671 cell-free system Anatomy 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000036755 cellular response Effects 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 230000003196 chaotropic effect Effects 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 150000001805 chlorine compounds Chemical class 0.000 description 1
- 210000003763 chloroplast Anatomy 0.000 description 1
- 230000001684 chronic effect Effects 0.000 description 1
- 230000007882 cirrhosis Effects 0.000 description 1
- 208000019425 cirrhosis of liver Diseases 0.000 description 1
- DQLATGHUWYMOKM-UHFFFAOYSA-L cisplatin Chemical compound N[Pt](N)(Cl)Cl DQLATGHUWYMOKM-UHFFFAOYSA-L 0.000 description 1
- 229960004316 cisplatin Drugs 0.000 description 1
- 235000015165 citric acid Nutrition 0.000 description 1
- 150000001860 citric acid derivatives Chemical class 0.000 description 1
- 229940125904 compound 1 Drugs 0.000 description 1
- 229940125797 compound 12 Drugs 0.000 description 1
- 238000013329 compounding Methods 0.000 description 1
- 230000001010 compromised effect Effects 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000013270 controlled release Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000009146 cooperative binding Effects 0.000 description 1
- 230000009133 cooperative interaction Effects 0.000 description 1
- 229920001577 copolymer Polymers 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 150000001923 cyclic compounds Chemical class 0.000 description 1
- ZHGASCUQXLPSDT-UHFFFAOYSA-N cyclohexanesulfonic acid Chemical class OS(=O)(=O)C1CCCCC1 ZHGASCUQXLPSDT-UHFFFAOYSA-N 0.000 description 1
- HCAJEUSONLESMK-UHFFFAOYSA-N cyclohexylsulfamic acid Chemical compound OS(=O)(=O)NC1CCCCC1 HCAJEUSONLESMK-UHFFFAOYSA-N 0.000 description 1
- 208000024389 cytopenia Diseases 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- STQGQHZAVUOBTE-VGBVRHCVSA-N daunorubicin Chemical compound O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(C)=O)[C@H]1C[C@H](N)[C@H](O)[C@H](C)O1 STQGQHZAVUOBTE-VGBVRHCVSA-N 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000003413 degradative effect Effects 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 239000000412 dendrimer Substances 0.000 description 1
- 229920000736 dendritic polymer Polymers 0.000 description 1
- 238000003795 desorption Methods 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 206010012601 diabetes mellitus Diseases 0.000 description 1
- 239000010432 diamond Substances 0.000 description 1
- 235000014113 dietary fatty acids Nutrition 0.000 description 1
- 230000000447 dimerizing effect Effects 0.000 description 1
- IUNMPGNGSSIWFP-UHFFFAOYSA-N dimethylaminopropylamine Chemical compound CN(C)CCCN IUNMPGNGSSIWFP-UHFFFAOYSA-N 0.000 description 1
- 230000003292 diminished effect Effects 0.000 description 1
- 230000003467 diminishing effect Effects 0.000 description 1
- 238000002224 dissection Methods 0.000 description 1
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 1
- 230000003828 downregulation Effects 0.000 description 1
- 229960004679 doxorubicin Drugs 0.000 description 1
- 229940112141 dry powder inhaler Drugs 0.000 description 1
- VQNATVDKACXKTF-XELLLNAOSA-N duocarmycin Chemical compound COC1=C(OC)C(OC)=C2NC(C(=O)N3C4=CC(=O)C5=C([C@@]64C[C@@H]6C3)C=C(N5)C(=O)OC)=CC2=C1 VQNATVDKACXKTF-XELLLNAOSA-N 0.000 description 1
- 229960005501 duocarmycin Drugs 0.000 description 1
- 229930184221 duocarmycin Natural products 0.000 description 1
- 230000002526 effect on cardiovascular system Effects 0.000 description 1
- 239000012039 electrophile Substances 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 210000002889 endothelial cell Anatomy 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 239000002702 enteric coating Substances 0.000 description 1
- 238000009505 enteric coating Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 210000003979 eosinophil Anatomy 0.000 description 1
- 238000011067 equilibration Methods 0.000 description 1
- 125000004185 ester group Chemical group 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- CCIVGXIOQKPBKL-UHFFFAOYSA-M ethanesulfonate Chemical compound CCS([O-])(=O)=O CCIVGXIOQKPBKL-UHFFFAOYSA-M 0.000 description 1
- 125000001033 ether group Chemical group 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 210000002744 extracellular matrix Anatomy 0.000 description 1
- 229930195729 fatty acid Natural products 0.000 description 1
- 239000000194 fatty acid Substances 0.000 description 1
- 150000004665 fatty acids Chemical class 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 229940126864 fibroblast growth factor Drugs 0.000 description 1
- 238000011049 filling Methods 0.000 description 1
- 230000037406 food intake Effects 0.000 description 1
- 239000012458 free base Substances 0.000 description 1
- 238000004108 freeze drying Methods 0.000 description 1
- VZCYOOQTPOCHFL-OWOJBTEDSA-L fumarate(2-) Chemical class [O-]C(=O)\C=C\C([O-])=O VZCYOOQTPOCHFL-OWOJBTEDSA-L 0.000 description 1
- 238000007306 functionalization reaction Methods 0.000 description 1
- JKFAIQOWCVVSKC-UHFFFAOYSA-N furazan Chemical class C=1C=NON=1 JKFAIQOWCVVSKC-UHFFFAOYSA-N 0.000 description 1
- 229930182830 galactose Natural products 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 239000007897 gelcap Substances 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 150000002306 glutamic acid derivatives Chemical class 0.000 description 1
- 229960005150 glycerol Drugs 0.000 description 1
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 231100000753 hepatic injury Toxicity 0.000 description 1
- 125000000623 heterocyclic group Chemical group 0.000 description 1
- 210000003630 histaminocyte Anatomy 0.000 description 1
- 229930195733 hydrocarbon Natural products 0.000 description 1
- 150000002430 hydrocarbons Chemical class 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 125000004356 hydroxy functional group Chemical group O* 0.000 description 1
- 125000002883 imidazolyl group Chemical group 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 229940060367 inert ingredients Drugs 0.000 description 1
- 208000027866 inflammatory disease Diseases 0.000 description 1
- 230000028709 inflammatory response Effects 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 238000009830 intercalation Methods 0.000 description 1
- 230000002687 intercalation Effects 0.000 description 1
- 229940047124 interferons Drugs 0.000 description 1
- 229940047122 interleukins Drugs 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 239000002563 ionic surfactant Substances 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- ZLTPDFXIESTBQG-UHFFFAOYSA-N isothiazole Chemical class C=1C=NSC=1 ZLTPDFXIESTBQG-UHFFFAOYSA-N 0.000 description 1
- CTAPFRYPJLPFDF-UHFFFAOYSA-N isoxazole Chemical class C=1C=NOC=1 CTAPFRYPJLPFDF-UHFFFAOYSA-N 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 210000001117 keloid Anatomy 0.000 description 1
- 210000002510 keratinocyte Anatomy 0.000 description 1
- 150000002576 ketones Chemical class 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 150000003893 lactate salts Chemical class 0.000 description 1
- 239000004310 lactic acid Substances 0.000 description 1
- 235000014655 lactic acid Nutrition 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 150000002611 lead compounds Chemical class 0.000 description 1
- 208000032839 leukemia Diseases 0.000 description 1
- 210000000265 leukocyte Anatomy 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 210000004072 lung Anatomy 0.000 description 1
- 206010025135 lupus erythematosus Diseases 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 150000002688 maleic acid derivatives Chemical class 0.000 description 1
- 150000004701 malic acid derivatives Chemical class 0.000 description 1
- 150000002690 malonic acid derivatives Chemical class 0.000 description 1
- 229960001855 mannitol Drugs 0.000 description 1
- 230000000873 masking effect Effects 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- HEBKCHPVOIAQTA-UHFFFAOYSA-N meso ribitol Natural products OCC(O)C(O)C(O)CO HEBKCHPVOIAQTA-UHFFFAOYSA-N 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 229940071648 metered dose inhaler Drugs 0.000 description 1
- AFVFQIVMOAPDHO-UHFFFAOYSA-M methanesulfonate group Chemical class CS(=O)(=O)[O-] AFVFQIVMOAPDHO-UHFFFAOYSA-M 0.000 description 1
- 229940098779 methanesulfonic acid Drugs 0.000 description 1
- 229920000609 methyl cellulose Polymers 0.000 description 1
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 239000001923 methylcellulose Substances 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- 229960004857 mitomycin Drugs 0.000 description 1
- 108091005573 modified proteins Proteins 0.000 description 1
- 102000035118 modified proteins Human genes 0.000 description 1
- 125000000896 monocarboxylic acid group Chemical group 0.000 description 1
- 201000006417 multiple sclerosis Diseases 0.000 description 1
- 206010028417 myasthenia gravis Diseases 0.000 description 1
- LNOPIUAQISRISI-UHFFFAOYSA-N n'-hydroxy-2-propan-2-ylsulfonylethanimidamide Chemical compound CC(C)S(=O)(=O)CC(N)=NO LNOPIUAQISRISI-UHFFFAOYSA-N 0.000 description 1
- OWWASHBXVFOTDF-UHFFFAOYSA-N n-methyl-1h-imidazole-2-carboxamide Chemical compound CNC(=O)C1=NC=CN1 OWWASHBXVFOTDF-UHFFFAOYSA-N 0.000 description 1
- LBZQAEQLSRWNLM-UHFFFAOYSA-N n-methyl-1h-pyrrole-2-carboxamide Chemical compound CNC(=O)C1=CC=CN1 LBZQAEQLSRWNLM-UHFFFAOYSA-N 0.000 description 1
- 230000008693 nausea Effects 0.000 description 1
- MRWXACSTFXYYMV-FDDDBJFASA-N nebularine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC=C2N=C1 MRWXACSTFXYYMV-FDDDBJFASA-N 0.000 description 1
- 239000006199 nebulizer Substances 0.000 description 1
- 229940053128 nerve growth factor Drugs 0.000 description 1
- 201000004931 neurofibromatosis Diseases 0.000 description 1
- 230000005015 neuronal process Effects 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 239000002736 nonionic surfactant Substances 0.000 description 1
- 231100000252 nontoxic Toxicity 0.000 description 1
- 230000003000 nontoxic effect Effects 0.000 description 1
- 230000030147 nuclear export Effects 0.000 description 1
- 108020004017 nuclear receptors Proteins 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 239000006179 pH buffering agent Substances 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 238000010647 peptide synthesis reaction Methods 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 239000000575 pesticide Substances 0.000 description 1
- 239000008194 pharmaceutical composition Substances 0.000 description 1
- 230000000144 pharmacologic effect Effects 0.000 description 1
- 235000021317 phosphate Nutrition 0.000 description 1
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 1
- 150000003013 phosphoric acid derivatives Chemical class 0.000 description 1
- 150000003014 phosphoric acid esters Chemical class 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- 239000011574 phosphorus Substances 0.000 description 1
- 230000000865 phosphorylative effect Effects 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 238000007539 photo-oxidation reaction Methods 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 208000030761 polycystic kidney disease Diseases 0.000 description 1
- 201000010065 polycystic ovary syndrome Diseases 0.000 description 1
- 229920000573 polyethylene Polymers 0.000 description 1
- 229920005862 polyol Polymers 0.000 description 1
- 150000003077 polyols Chemical class 0.000 description 1
- 229920001155 polypropylene Polymers 0.000 description 1
- 229920000136 polysorbate Polymers 0.000 description 1
- 229920005990 polystyrene resin Polymers 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- SCVFZCLFOSHCOH-UHFFFAOYSA-M potassium acetate Chemical compound [K+].CC([O-])=O SCVFZCLFOSHCOH-UHFFFAOYSA-M 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 230000002335 preservative effect Effects 0.000 description 1
- 201000008312 primary pulmonary hypertension Diseases 0.000 description 1
- 238000011027 product recovery Methods 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 229940043274 prophylactic drug Drugs 0.000 description 1
- 230000000069 prophylactic effect Effects 0.000 description 1
- 235000004252 protein component Nutrition 0.000 description 1
- 230000004853 protein function Effects 0.000 description 1
- 230000006916 protein interaction Effects 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 230000002797 proteolythic effect Effects 0.000 description 1
- 208000005069 pulmonary fibrosis Diseases 0.000 description 1
- 125000000561 purinyl group Chemical group N1=C(N=C2N=CNC2=C1)* 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000007430 reference method Methods 0.000 description 1
- 230000037425 regulation of transcription Effects 0.000 description 1
- 210000003289 regulatory T cell Anatomy 0.000 description 1
- 230000007363 regulatory process Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000001718 repressive effect Effects 0.000 description 1
- 230000008672 reprogramming Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 208000037803 restenosis Diseases 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 229920002477 rna polymer Polymers 0.000 description 1
- 150000003873 salicylate salts Chemical class 0.000 description 1
- 239000000523 sample Substances 0.000 description 1
- 208000010157 sclerosing cholangitis Diseases 0.000 description 1
- 238000007789 sealing Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 230000036303 septic shock Effects 0.000 description 1
- 102000023888 sequence-specific DNA binding proteins Human genes 0.000 description 1
- 108091008420 sequence-specific DNA binding proteins Proteins 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 239000001632 sodium acetate Substances 0.000 description 1
- 235000017281 sodium acetate Nutrition 0.000 description 1
- 229910001415 sodium ion Inorganic materials 0.000 description 1
- 229910052979 sodium sulfide Inorganic materials 0.000 description 1
- GRVFOGOEDUUMBP-UHFFFAOYSA-N sodium sulfide (anhydrous) Chemical compound [Na+].[Na+].[S-2] GRVFOGOEDUUMBP-UHFFFAOYSA-N 0.000 description 1
- 229960002920 sorbitol Drugs 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 150000003431 steroids Chemical class 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 238000010254 subcutaneous injection Methods 0.000 description 1
- 239000007929 subcutaneous injection Substances 0.000 description 1
- 208000011117 substance-related disease Diseases 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 150000003890 succinate salts Chemical class 0.000 description 1
- 229940014800 succinic anhydride Drugs 0.000 description 1
- IIACRCGMVDHOTQ-UHFFFAOYSA-N sulfamic acid Chemical class NS(O)(=O)=O IIACRCGMVDHOTQ-UHFFFAOYSA-N 0.000 description 1
- 150000003871 sulfonates Chemical class 0.000 description 1
- 150000003467 sulfuric acid derivatives Chemical class 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 238000013268 sustained release Methods 0.000 description 1
- 239000012730 sustained-release form Substances 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 230000002195 synergetic effect Effects 0.000 description 1
- 239000011975 tartaric acid Substances 0.000 description 1
- 235000002906 tartaric acid Nutrition 0.000 description 1
- 229940095064 tartrate Drugs 0.000 description 1
- 150000003892 tartrate salts Chemical class 0.000 description 1
- 125000005931 tert-butyloxycarbonyl group Chemical group [H]C([H])([H])C(OC(*)=O)(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 229930192474 thiophene Chemical class 0.000 description 1
- JOXIMZWYDAKGHI-UHFFFAOYSA-N toluene-4-sulfonic acid Chemical class CC1=CC=C(S(O)(=O)=O)C=C1 JOXIMZWYDAKGHI-UHFFFAOYSA-N 0.000 description 1
- 238000005809 transesterification reaction Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
- GPRLSGONYQIRFK-MNYXATJNSA-N triton Chemical compound [3H+] GPRLSGONYQIRFK-MNYXATJNSA-N 0.000 description 1
- 208000009999 tuberous sclerosis Diseases 0.000 description 1
- 102000003390 tumor necrosis factor Human genes 0.000 description 1
- 231100000588 tumorigenic Toxicity 0.000 description 1
- 230000000381 tumorigenic effect Effects 0.000 description 1
- 238000003160 two-hybrid assay Methods 0.000 description 1
- 230000009452 underexpressoin Effects 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 108010064245 urinary gonadotropin fragment Proteins 0.000 description 1
- 235000015112 vegetable and seed oil Nutrition 0.000 description 1
- 239000008158 vegetable oil Substances 0.000 description 1
- 230000017260 vegetative to reproductive phase transition of meristem Effects 0.000 description 1
- 210000003462 vein Anatomy 0.000 description 1
- 239000000273 veterinary drug Substances 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 238000011179 visual inspection Methods 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 239000001993 wax Substances 0.000 description 1
- 238000002424 x-ray crystallography Methods 0.000 description 1
- 239000000811 xylitol Substances 0.000 description 1
- HEBKCHPVOIAQTA-SCDXWVJYSA-N xylitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)CO HEBKCHPVOIAQTA-SCDXWVJYSA-N 0.000 description 1
- 235000010447 xylitol Nutrition 0.000 description 1
- 229960002675 xylitol Drugs 0.000 description 1
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K49/00—Preparations for testing in vivo
- A61K49/0004—Screening or testing of compounds for diagnosis of disorders, assessment of conditions, e.g. renal clearance, gastric emptying, testing for diabetes, allergy, rheuma, pancreas functions
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K47/00—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
- A61K47/50—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates
- A61K47/51—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent
- A61K47/62—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being a protein, peptide or polyamino acid
- A61K47/64—Drug-peptide, drug-protein or drug-polyamino acid conjugates, i.e. the modifying agent being a peptide, protein or polyamino acid which is covalently bonded or complexed to a therapeutically active agent
- A61K47/645—Polycationic or polyanionic oligopeptides, polypeptides or polyamino acids, e.g. polylysine, polyarginine, polyglutamic acid or peptide TAT
Definitions
- This invention relates to synthetic regulatory compounds comprising a double-stranded nucleic acid binding moiety, a linker, and a regulatory moiety, the synthesis and testing of such compounds, and applications therefor.
- the regulation of gene expression is critical to the growth, development, proliferation, and maintenance of all living cells and organisms.
- the positive or negative regulation of genes is under the control of signal transduction cascades that transmit information from the cell surface to the nucleus.
- Signal- transduction cascades are generally triggered by ligands which may be small molecules, soluble peptides, extracellular matrix, adhesive proteins attached to cell surfaces of neighboring or migrating cells, and even metabolic intermediates.
- ligands interact with a membrane bound, or sometimes soluble intracellular, receptor, thus triggering a cascade of events that ultimately either stimulate or inhibit the activity of the mRNA-synthesizing machinery at one or more genes.
- PNAs protein nucleic acid
- oligomers that contain the standard purine and pyramidine bases of an oligonucleotide but contain a simple amide-based backbone as opposed to the sugar-phosphate backbone found in nucleic acids. Nielsen (1997) Chem. Eur. J. 3:505-508. PNAs have been shown to bind with very high affinity to single-stranded DNAs and RNAs.
- Another class of such molecules is oligonucleotides that are capable of promoting "triple helix" formation.
- Yet another class of molecules are the so-called “polyamides” developed by Dervan and co-workers. See, e.g., Dervan et al. (1999) Curr. Opin. Chem. Biol. 3:688-693. Polyamides having high sequence specificity and association constants have been developed based on a "code” by which a given pair of substituted or unsubstituted imidazole /pyrrole pair can be selecting to bind to particular nucleotide base pairs in the minor groove of double stranded DNA. Sequence specific polyamides that adhere with proteins that bind via the major- groove of double stranded DNA have also been developed. Bremer et al. (1998) Chem. Biol. 5:119-133.
- the invention concerns synthetic regulatory compounds, each of which comprises at least one nucleic acid binding moiety, at least one regulatory moiety, and at least one linker connecting the nucleic acid binding moiety(ies) to the regulatory moiety(ies).
- synthetic is meant any compound wherein the at least one of the particular nucleic acid binding moiety(ies) and at least one of the particular regulatory moiety(ies) are not found in the same molecule in nature, i.e., in a wild-type animal or plant.
- non-natural is meant that the compound (e.g., a peptide) is not naturally occurring.
- nucleic acid binding moiety refers to any compound that binds to a nucleic acid molecule, be it single- or double-stranded DNA or RNA (with double- stranded DNA being preferred), under desired conditions. What constitutes “desired conditions” will vary depending upon application, but in general refers to reaction conditions such as temperature, pH, solvent, ionic strength, the presence or absence of chaotropic agents, reactant concentrations, etc. to be encountered in the eventual intended application of the compound. In the context of synthetic regulatory compounds to be used as drugs, for example, preferred desired conditions are physiological conditions and, again, these will vary depending upon the particular animal, plant, and environment being considered, but in any event may be determined by one ordinarily skilled in the art.
- the particular conditions used need not be equivalent to those actually found in a cell for example; however, such conditions should sufficiently represent or approximate those likely to be encountered in the eventual application (e.g., the conditions in a human cancer cell, when the compound is a pharmaceutical intended to treat or prevent such cancer) such that meaningful data can be generated.
- a nucleic acid binding moiety of a compound according to the invention will specifically interact with a target nucleotide sequence.
- a "target nucleotide sequence” refers to a specific sequence of nucleotides, and is typically represented in the 5' to 3' direction using standard single letter notation, where "A” represents adenine, "G” represents guanine, “T” represents thymine, and “C” represents cytosine, and "U” represents uracil.
- a “target nucleotide sequence” within a double-stranded nucleic acid molecule preferably comprises a sequence greater than 3 but preferably less than about 20 nucleotides in length.
- the target sequence in general, is defined by the nucleotide sequence on one of the strands of the double-stranded nucleic acid. Such sequences are pre-selected or pre-determined, and are thus referred to as targets. While such sequences comprise a specific sequence of nucleotides, it will be appreciated that such sequence can include a different nucleotide at the same position, i.e., is degenerate at that position, with respect to one or more positions in the particular sequence. Degenerate bases can be represented by any suitable nomenclature, for example, that which is described in World Intellectual Property Organization Standard ST.25 (1998), Appendix 2.
- nucleic acid binding moiety of a synthetic regulatory compound of the invention specifically binds to its target nucleotide sequence, it does so in a manner that does not compete for binding site interaction with endogenous compounds.
- This can be accomplished in any suitable manner, for example, by selecting a target nucleotide sequence that is adjacent or proximal to (e.g., within about 500 bases, preferably about 200 bases, and even more preferably within about 100 or fewer bases of) the binding site for a DNA-binding protein of the transcriptional machinery.
- regulatory element any cis element of defined nucleotide sequence that can be identified in a nucleic acid molecule and which associates with a DNA-binding protein of the transcriptional machinery or a protein involved in providing chromatin structure. Such elements include promoters and enhancers.
- a “promoter” is the minimum sequence necessary to initiate transcription of a target gene.
- An “enhancer” is a cis-acting sequence that increases the utilization of a eukaryotic promoter.
- Preferred cis elements to be targeted by the nucleic acid binding moieties of the invention's synthetic regulatory compounds are those that occur endogenously in association with the gene whose transcription is to be regulated.
- RNA polymerase I which transcribes ribosomal RNA (rRNA) in eukaryotic cells
- RNA polymerase II which transcribes messenger RNA (mRNA) in eukaryotic cells
- RNA polymerase III which transcribes transfer RNA (tRNA) in eukaryotic cells.
- the interaction between the nucleic acid binding moiety can occur at different regions in the nucleic acid molecule.
- the nucleic acid binding moiety preferably interacts with the DNA via minor and/or major groove interactions.
- the nucleic acid binding moiety is a molecular " scaffold that spatially arrays a plurality of hydrogen bond donors and acceptors in a manner that allows the formation of hydrogen bonds with the corresponding hydrogen bond acceptors and donors in the target nucleotide sequence when the nucleic acid binding moiety interacts with a nucleic acid molecule (e.g., dsDNA) containing the target nucleotide sequence.
- the molecular scaffold can provide moieties that allow specific electrostatic and/or van der Waals interactions between units of the scaffold and atoms in the target nucleic acid molecule.
- Preferred nucleic acid binding moieties that can act as such molecular scaffolds include protein nucleic acids (PNAs) and oligonucleotides.
- PNAs protein nucleic acids
- Another preferred class of nucleic acid binding moiety useful in the practice of this invention can be represented as follows: - Qi — Z ⁇ - Q 2 - Z 2 - ... - Q m - Z m -, wherein each of Q , Q , ...
- Q m is independently selected from a heteroaromatic moiety and (CH 2 ) P , wherein p is an integer between 1 and 10, inclusive, and is preferably 1, 2, or 3; wherein each of Z ⁇ , Z 2 , ..., Z m is independently selected from the group consisting ofa covalent bond and a linking group; and m is between 1 and 20, inclusive, with 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, or 16 being preferred.
- at least one of Q , Q 2 , ... Q m is a heteroaromatic moiety, for example, a substituted or unsubstituted imidazole or pyrrole moiety.
- At least about 50%, 60%, 70%, 80%, or more of Qi, Q 2 , ... Q m are a heteroaromatic moieties, particularly substituted or unsubstituted imidazole or pyrrole moieties.
- at least one of Z ⁇ , Z 2 , ..., Z m is a linking group having between 1 and 10, inclusive, and preferably 2, 3, 4, or 5, backbone atoms.
- each of Z ls Z 2 , ..., Z m is a carboxamide group.
- SUBSTITUTE SHEET (RULE 2 such as substituted or unsubstituted imidazoles or pyrroles) linked by carboxamide bonds, are referred to as polyamide nucleic acid binding moieties.
- Polyamides can be designed to assume one of several alternative conformations upon base-specific interaction with a nucleic acid molecule, including hairpin, H-pin, slipped, overlapped, and cyclic conformations.
- Such conformations include intermolecular 2:1 binding motifs between dsDNA molecules comprising a corresponding target sequence, as well as intramolecular 2:1 binding motifs between dsDNA molecules comprising a corresponding target sequence.
- a nucleic acid binding moiety included in a synthetic regulatory compound of the invention has, under desired conditions, a binding specificity for its corresponding target nucleotide sequence of at least about two, and preferably at least about 3, 4, 5, 6, 7, 8, 9, 10, 20, 50, 100 or more, as compared to a mismatch target sequence.
- a "mismatch" target sequence refers to a target nucleotide sequence in which one nucleotide is different at a particular position, as compared to the target nucleotide sequence.
- a nucleic acid binding moiety included in a synthetic regulatory compound of the invention also has, under desired conditions, at least about submicromolar, and preferably at least about nanomolar or picomolar binding affinity for its target nucleotide sequence. Binding affinity can be determined by any suitable technique including, but not limited to, quantitative DNase I footprint analysis. Association constants (K a ) for preferred nucleic acid bindmg moieties are at least about 10° M "1 , IO 7 M ' IO 8 M “1 , IO 9 M “1 , 10 10 M *1 , IO 11 M "1 , or IO 12 M “1 .
- nucleic acid binding moieties associated in a manner that allows each to retain its nucleic acid binding function, for example, through the use of a flexible linker.
- One representative class of synthetic regulatory compounds comprising a plurality (e.g., 2, 3, 4, or more ) of nucleic acid binding moieties include those wherein two such moieties are tethered to one another via a linker, and the linker is attached to another linker (or, alternatively, is a dendrimeric linker having three or more functional sites) that is attached to a regulatory moiety.
- nucleic acid binding moieties are each separately attached via a linker to the regulatory moiety.
- nucleic acid binding moieties are tethered via a linker and one of them is also linked via a separate linker to a regulatory moiety.
- Many other configurations of this sort are encompassed herein. These include, but are not limited to, compounds containing multiple regulatory moieties (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, or more), and compounds containing multiple regulatory moieties and a plurality (e.g., 2, 3, 4, or more) of nucleic acid binding moieties.
- Synthetic regulatory compounds that comprise a plurality of nucleic acid binding moieties typically will have greater target sequence specificity than a synthetic regulatory compound containing fewer nucleic acid binding moieties.
- the compounds bind to both in the minor and major grooves of dsDNA, and provide groups that can be modified by other molecules, for example, by linkers to which one or more regulatory moiety can be attached.
- synthetic regulatory compounds that comprise a plurality of regulatory moieties can interact with more than one component of the transcription or chromatin structure machinery.
- a synthetic regulatory compound according to the invention also contains, in addition to a nucleic acid binding moiety, a regulatory moiety.
- a regulatory moiety refers to an ability to alter the level of expression of a particular gene above (i.e., up-regulate or activate) or below (i.e., down-regulate or repress) the basal level of expression that would occur in the particular system (for example, an in vitro transcription system or a cell) in the absence of the compound under the same conditions.
- a regulatory moiety that activates transcription is referred to herein as an “activation moiety” or “activator”
- a regulatory moiety that represses transcription is referred to as a "repressor moiety” or "repressor.”
- a regulatory moiety is any compound that can positively or negatively effect, by either a direct mechanism (i.e., by direct interaction with one or more components of the transcription complex) or and indirect mechanism (i.e., by (i) direct interaction with a repressor protein or (ii) direct interaction with a protein involved in chromatin or nucleosome structure), transcription of a target gene, other than by direct electrostatic interaction with double-stranded DNA.
- direct interaction is meant direct, non-covalent association between two components.
- regulatory moieties include peptides, polypeptides, lipids, carbohydrates, and any combination thereof.
- a "peptide” is a polymer (i.e., a linear chain of two or more identical or non-identical subunits joined by covalent bonds) made up of naturally occurring or synthetic D- or L-, or D- and L-, amino acids joined by peptide bonds.
- peptides contain at least two amino acid residues (i.e., the molecules resulting from the formation of a peptide bond between two amino acids, or between an amino acid residue and another amino acid) but fewer than about 50 amino acid residues.
- a “polypeptide” is a also a polymer of amino acid residues linked by peptide bonds, but typically contains at least about 50 amino acid residues.
- peptide is used to refer to a regulatory moiety that is less than about 50 amino acid residues in length, and “polypeptide” refers to larger polymers of amino acid residues linked by peptide bonds.
- a “lipid” is a substantially water-insoluble molecule that contains as a major constituent an aliphatic hydrocarbon. Lipids include fatty acids, neutral fats, waxes, and steroids. The hydrocarbon portions of the molecule can be of any length, can be saturated or unsaturated, and can be straight- or branched-chain.
- Carbohydrate refers to any aldehyde or ketone derivative of a polyhydric alcohol, and includes starches, sugars, celluloses, and gums.
- a regulator moiety can be naturally occurring or be derived from or an analog of a naturally occurring molecule. Alternatively, it can be synthetic.
- Preferred regulatory moieties are peptides and small organic molecules.
- “Peptide”, in this context, refers to an amino acid polymer comprised of between one to about fifty, inclusive, residues of amino acids (i.e., any molecule that contains an amino group and a carboxylic group linked by peptide, or carboxamide, bonds.
- Such peptides can be duplicative of amino acid sequences that occur naturally (e.g., an activation or repressor domain of a transcriptional activator or repressor, respectively).
- such peptides are comprised of the twenty L amino acids commonly found in proteins in nature. Alternatively, they can comprise one or more D enantiomers of such amino acids, or other rarely observed amino acids.
- such peptides are comprised of amino acids not typically found in naturally occurring proteins.
- Peptidic regulatory moieties can be synthesized by any suitable method, including recombinant techniques or solid state or in-solution synthetic methods. After synthesis, they can be linked to a suitable nucleic acid binding moiety. If it is unknown whether a particular peptide possesses regulatory function, it can be screened in a suitable system. Indeed, large numbers of such peptides (as can, for example, result from combinatorially generating peptides) can be screened in high throughput formats.
- the regulatory moieties are small organic molecules that have been identified as having regulatory function.
- small organic molecule is meant any water soluble organic molecule having a molecular weight of less than about 10 kDa (kilo Dalton), preferably less than about 5 kDa, more preferably less than about 2.5 kDa, even more preferably less than about 1.5 kDa, and even more preferably less than about 1 kDa.
- a synthetic regulatory compound comprises two, three, four, or more regulatory moieties linked to the nucleic acid binding moiety (ies).
- the regulatory moieties can be the same or different, and can serve to recruit or retard the same or different transcription factors.
- linker can be any molecule that can be used to link at least one nucleic acid binding moiety to at least one regulatory moiety in a manner that allows the nucleic acid binding moiety(ies) to retain its(their) intended nucleic acid binding function(s) and the regulatory moiety(ies) to retain its(their) ability to influence transcription of the target gene.
- the linker should provide adequate spacing between and/or orientation with respect to the nucleic acid binding moiety and the regulatory moiety so as to allow each to retain its respective function.
- the backbone moieties preferably are heteroatoms including, but not limited to, carbon, nitrogen, oxygen, sulfur, and phosphorus, with the majority of such atoms being carbon.
- One or more different side chain groups can also appended to the backbone. Spacing between multiple side chain unit groups can be variable or
- SUBSTITUTE SHEET (RULE 26 consistent along the backbone.
- linkers include polyethylene glycol, alkyl chains, and peptides.
- the synthetic regulatory compounds are cell permeable. In other words, they are able to come into contact with and be internalized by a cell.
- Such cells include animal (both * prokaryotic and eukaryotic) and plant cells.
- Preferred animal cells include vertebrate cells, particularly arachnid, avian, fish, insect, and mammalian cells.
- Particularly preferred are mammalian cells, for example, bovine, avian, canine, equine, feline, human, murine, ovine, porcine, and primate cells.
- Preferred plant cells include those of commercially important grains (e.g., alfalfa, barley, corn, rice, soy, sorghum, wheat) and ornamental plants.
- Synthetic regulatory compounds of the invention can be prepared as pharmaceutical salts by any method known in the art depending upon the intended application.
- a related aspect of the invention concerns compositions comprising a synthetic regulatory compound according to the invention and a carrier, for example, a pharmaceutically acceptable carrier.
- Such compositions can be dry or liquid formulations, and can optionally include one or more excipients, stabilizers, bulking agents, etc. Dry formulations include those that have been lyophilized or freeze dried, which formulations can be reconstituted in a solvent prior to use.
- Liquid formulations include aqueous formulations, oils, emulsions, and suspensions.
- compositions according to the invention can be delivered to a subject by any suitable route.
- a composition in the context of compositions intended for agricultural use, can be applied by spraying a liquid or broadcasting a solid.
- such compositions can be injected (e.g., via subcutaneous, intramuscular, intravenous, and other parenteral routes), inhaled, eaten, or delivered topically.
- kits containing a synthetic regulatory compound or composition according to the invention typically comprise one or more synthetic regulatory compounds of the invention in a suitable composition and packaged in an appropriate storage container, for example, a vial or ampule.
- Kits often further comprise external packaging, such as a box or other container, to protect and support the storage container containing the composition.
- the packaging also contains directions for use, package inserts, etc.
- Still another related aspect of the invention concerns complexes comprising a synthetic regulatory compound of the invention complexed with a nucleic acid molecule, e.g., a DNA, particularly a dsDNA.
- a nucleic acid molecule e.g., a DNA, particularly a dsDNA.
- Such complexes can be formed, for example, by exposing a composition (for example, a cell or an in vitro transcription reaction) containing a nucleic acid (e.g., dsDNA) comprising a target nucleotide sequence to a synthetic regulatory compound according to the invention.
- a related aspect concerns cells containing a synthetic regulatory compound of the invention.
- Such cells include animal cells and plant cells.
- Preferred animal cells include avian, bovine, canine, equine, feline, fish, human, murine, ovine, porcine, and primate cells. Such cells can be in vivo or in vitro (including ex vivo).
- Another aspect of the invention relates to methods for regulating expression of a target gene by exposing the target gene and its associated regulatory elements to a synthetic regulatory compound according to the invention. Such methods can be carried out in vitro and in vivo.
- Still another aspect of the invention concerns the use of a synthetic regulatory compound of the invention to prevent or treat a disease.
- Such methods can be accomplished by delivering an effective amount of the synthetic regulatory compound to an animal or plant.
- An "effective amount” refers to an amount of a synthetic regulatory compound sufficient to induce or effectuate a detectable change in transcription ofa desired gene in a cell-free in vitro system or a cell, be it in an organism or in culture. What constitutes an effective amount will depend on a variety of factors that the skilled artisan will take into account in arriving at the desired delivery regimen.
- Another aspect of the invention concerns methods of screening for synthetic regulatory compounds from amongst one or more test compounds.
- These screening methods include both in vitro and in vivo screening methods, and can include methods involving an in vitro screen followed by an in vivo screen (e.g., a cell-based screen).
- Such methods can be performed, for example, by exposing, under transcription conditions, a dsDNA encoding a regulatable gene to a test compound comprising a nucleic acid binding moiety targeted to a transcription-associated regulatory element of the regulatable gene conjugated via a linker to a regulatory moiety, and determining whether the test compound regulates expression of the regulatable gene.
- the regulatable gene is a marker gene, such as a gene encoding a luciferase or green fluorescent protein.
- the various moieties of the invention can be synthetic or natural products. Synthetic moieties can be synthesized by solution or solid phase methods. Two or more moieties can also be synthesized together.
- Compounds according to the invention can be in unpurified, substantially purified, and purified forms. The compounds can be present with any additional component(s) such as a solvent, reactant, or by-product that is present during compound synthesis or purification, and any additional component(s) that is present during the use or manufacture ofa compound or that is added during formulation or compounding ofa compound. Another aspect relates to methods for synthesizing the compounds of the invention.
- such methods typically comprise separately obtaining each of the nucleic acid binding moiety, the linker, and the regulatory moiety to be linked together to form the synthetic regulatory compound. Two or more of these moieties are then linked by a suitable chemistry, followed by linkage to the third moiety.
- the particular order of addition of moieties can vary, and its selection is within the skill of the ordinary artisan.
- the linker and regulatory moiety, or linker and nucleic acid binding moiety can be synthesized as a single unit. In other embodiments, all three elements are synthesized together.
- the compound, or its intermediary compounds in intermediary steps is preferably purified.
- FIGURES Figure 1 illustrates a double-stranded DNA (dsDNA) molecule comprising an inverted tandem repeat of the target nucleotide sequence of an eight ring hairpin polyamide molecule (SEQ ID NO: 1).
- dsDNA double-stranded DNA
- SEQ ID NO: 1 illustrates a double-stranded DNA (dsDNA) molecule comprising an inverted tandem repeat of the target nucleotide sequence of an eight ring hairpin polyamide molecule (SEQ ID NO: 1).
- imidazole moieties are represented by blackened circles and pyrroles are represented by open circles.
- Diamonds represent ⁇ -alanine.
- Amino acids are represented by the standard one letter code. This nomenclature is used consistently throughout the specification unless otherwise expressly indicated.
- Two synthetic regulatory compounds are illustrated as being bound to their respective target sites in the dsDNA.
- Linked to each polyamide through a linker is one of three activator peptides.
- the compounds are designated 7 (SEQ ID NO: 2), 8, and 9 (SEQ ID NO: 3).
- the linker comprises of a Cys residue linked to an XL peptide.
- the linker is larger and contains a Cys residue at its amino-terminus, and an XL peptide at its carboxy-terminus.
- Compound 9 comprises a 28 amino acid residue peptide, additionally including a Cys residue at the N-terminus and a XL peptide at its C- terminus.
- Figure 2 contains 3 panels, A, B, and C.
- Panels A and B show dsDNA comprising inverted tandem repeats of target nucleotide sequences for a specific polyamide.
- the polyamide has an activating peptide (SEQ ID NOs: 4 and 5) attached to its C-terminus via a linker.
- the polyamide has the linker-activating peptide conjugate attached via the C-terminal pyrrole residue.
- the polyamide contains conventional C-terminal moieties, for example, ⁇ -Dp.
- Panel C depicts a single hairpin polyamide to which two linker- activating peptide domains are attached via internal pyrrole moieties and SEQ ID NOs: 6 and 7). Also described are compounds 10-20, specifically the portions of the compounds comprising the linker and activating domains.
- Figure 3 is a graphic representation of the template used for in vitro transcription assays.
- the template contains three inverted tandem repeat target nucleotide sequences (hatched region) to be targeted by nucleic acid binding moiety ofa synthetic regulatory compound.
- the 3 '-terminus of the downstream most of these sequences is 50 based pairs upstream of the adenovirus major late promoter TATA box that drives the expression of a "g-less" transcript.
- the number of nucleotides between the target nucleotide sequences is 3, 5, or 7.
- Figure 4 depicts a graphic illustration of a double stranded nucleic acid molecule (SEQ ID NOs: 8 and 9) containing an inverted tandem repeat target nucleotide sequence for an eight ring polyamide which provides DNA sequence target specificity. Also depicted are the chemical structures of N-methylpyrrole and N-methylimidazole sub-units that may be incorporated into nucleic acid targeting moieties, e.g., polyamides, for use in conjunction with the present invention. Also shown are two representative linker molecules that may be used to join nucleic acid binding moiety and a regulatory moiety in order to generate a synthetic regulatory compound according to the invention. The figure also lists the compounds that were generated upon the ligation of the respective peptides to the polyamide.
- Figure 5 depicts synthesis of synthetic activators.
- the synthesis of polyamide 1 was accomplished according to established protocols, (a) Treatment of compound 1 with thiolane-2,5-dione followed by benzyl bromide provided thioester compound 2 in good yield (53%). (b) Combination of thioester-containing compound 2 with each peptide (SEQ ID NOs: 10 and 11) in denaturing buffer then provided the targeted conjugates via the native ligation reaction.
- Figure 6 depicts substitution of the dimerization module with a flexible ethylene gly col-derived linker. The synthesis of hairpin polyamides 6 and 7 and compounds 8 and 9 was carried out as described.
- Figure 7 illustrates a synthetic regulatory compound comprising a hairpin polyamide (linked head to tail by a short linker (e.g., ⁇ -aminobutyric acid)) attached to a linker domain (LD) to which an activation domain (AD) is attached.
- the polyamide is associated with the minor groove of a DNA double helix via its DNA recognition domain.
- a dsDNA comprising a target nucleotide sequence with which is associated a hairpin polyamide targeted to that sequence.
- the chemical structure of pyrrole moieties open circles
- an imidazole residue shadedazole residue
- Figure 8 depicts compound 3 (PA-Gcn4-AH) bound to its cognate palindromic DNA site and activated transcription in vitro when its predetermined DNA binding sites are present (SEQ ID NOs: 12 and 13).
- A (Upper) Storage phosphor autoradiogram ofa quantitative DNase I footprinting titration of compound 3 on the 3- P-labeled 271-bp pPT7 EcoRI/Ev ⁇ /11 restriction fragment carried out according to established protocols. Pre-equilibration of compound 3 with the DNA fragment was carried out for 75 min., before initiation of the cleavage reactions.
- the lanes are: the A sequencing lane; DNase I digestion products in the presence of compound 3 at concentrations of 100 nM, 50 nM, 25 nM, 10 nM, 5 nM, 2.5 nM, 1 nM, 0.5 nM, and 0.25 nM, respectively; DNase I digestion products with no compound 3 present; undigested DNA.
- (Lower) Data for compound 3 in complex with the 19-bp palindromic site. The curve through the data points is the best-fit cooperative Langmuir binding titration isotherm (n ⁇ ) obtained from a nonlinear least-squares algorithm.
- Figure 9 Depicts dependence of activation level upon time and activating region.
- A (Upper) Storage phosphor autoradiogram showing in vitro transcription time course experiment with compounds 3 and 4 present at 300 nM concentration. Aliquots were processed at 10, 20, 30, 40, and 60 min. (Lower) Comparison of the amount of transcript obtained at each time point relative to basal transcription levels in which no conjugate was present (fold activation).
- B (Upper) Storage phosphor autoradiogram showing the effect of increasing concentrations of untethered 21-aa AH peptide on transcription reactions containing either compounds 4 or 3 at 300 nM concentration. Transcription reactions were performed for 30 min, in the presence
- UT ⁇ SHEET (RULE 26) of 0, 0.2 :M, 2 :M, and 10 :M concentrations of AH peptide. (Lower)
- the amount of transcript obtained was compared with basal transcription levels to give the respective fold activation value. These values are displayed as percent activation compared with the results from the reaction containing compound 3 (lane 5), which is defined as 100%.
- Figure 10 depicts the activation levels for compounds 3, 5, 8, and 9 were determined by comparison with the amount of transcript obtained from reactions containing the relevant parent hairpin polyamides.
- Figure 11 describes the structure of three polyamides, 1, 5, 9.
- compound 9 shows a hairpin polyamide linked to a VP2 peptide through a linker attached to the C-terminal-most pyrrole of the polyamide.
- Figure 12 shows the ability of various conjugates to activate transcription in vitro. Background levels of transcription in the absence of any polyamides are shown in lane 1. Addition of the polyamide alone does not alter the levels of transcription appreciably (lane 2). Lane 3 shows the levels of transcription elicited by conjugate 2; lane 4 shows activation elicited by conjugate 4 and lane 5 demonstrates the activity of conjugate 3. Lane 6 shows activation elicited by conjugate 9. Lane 7 represents the activation due to conjugate 6; lane 8 corresponds to conjugate 8 and lane 9 is due to conjugate 7.
- Figure 13 shows the structures of compounds 10 and 11, as well as a dsDNA molecule (SEQ ID NOs: 18 and 19) containing a target nucleotide sequence for the polyamide, wherein the polyamide is attached via the PEG linker (shaded hexagon) to the VP2 regulatory peptide.
- SEQ ID NOs: 18 and 19 a dsDNA molecule
- Figure 14 shows the modular nature of the synthetic regulator.
- In vitro transcription reactions show that substituting the imidazole polyamide (conjugate " 1 of Figure 11) for that described in Figure 13 (conjugate 10) alters the template recognition properties in a specific manner.
- Lanes 1-7 represent reactions performed with the template designed to bind polyamide 10 and its various conjugates (#11) and lanes 8-14 show reactions which were performed with template 1 described in Figure 4.
- Lanes 2 & 3 using template 2 and lanes 9&10 on template 1 show the inability of either polyamide (at 400nM) to influence transcription levels.
- Lanes 4&5 on template 2 and lanes 13 &14 on template 1 show that the corresponding polyamide conjugate elicits transcription on the appropriate template.
- Lanes 6&7 for conjugate 7 and lanes 11& 12 for conjugate 11 show that on the non- conjugate sites the regulatory moiety bearing conjugates do not elicit transcription above background levels.
- the synthetic regulatory compounds of the invention represent a novel class of nucleic acid binding ligands in which a nucleic acid binding moiety is tethered through a linker to a second functional moiety, viz, a regulatory moiety (e.g., an activator), that interacts directly with elements of the endogenous transcriptional machinery or indirectly through interaction with a repressor protein or components involved in chromatin structure (e.g., nucleosomes).
- a regulatory moiety e.g., an activator
- Cell-permeable members of this class can be targeted to designated sites in the genome, and can be used, for example, as tools to study mechanistic aspects of transcriptional regulation and to correct the ectopic gene expression that often occurs in disease.
- a nucleic acid binding moiety is any compound or chemical that can provide site-specific recognition of a target nucleotide sequence in a nucleic acid molecule preferably a dsDNA.
- specificity is provided by an ability to recognize specific based pairs within the major groove and/or minor groove of a double-stranded nucleic acid molecule. Interactions of this sort are typically mediated by electrostatic forces, hydrogen bonding, van der Waals forces, and steric considerations. In natural systems, such binding specificity is typically mediated by peptides that are comprised within protein-based transcription factors and other proteins that bind to nucleic acids within cells and viruses.
- such peptides are referred to as "natural DNA binding ligands," and are not within the scope of the instant invention as it relates to embodiments that comprise but one nucleic acid binding moiety.
- one or more such natural peptides can be included in the synthetic regulatory compound, provided that at least one of such nucleic acid binding moieties is a non-natural nucleic acid binding moiety.
- any such peptides whether now known or later developed, can be included in synthetic regulatory compounds that comprise a plurality of nucleic acid binding elements.
- the nucleic acid binding moiety comprises a molecular scaffold.
- a molecular scaffold is any compound that spatially arises a plurality of hydrogen bond donors and acceptors in a manner that allows the formation of hydrogen bonds with the corresponding hydrogen bond acceptors and donors in a target nucleotide sequence in a nucleic acid molecule.
- the structures of single- and double-stranded nucleic acid molecules at high resolution, such as the three-dimensional coordinates of each individual atom within the nucleic acid molecule are known to a resolution of less than about 2 A. Such structures can be determined by techniques known in the art, for example, x-ray crystallography or NMR spectroscopy.
- a high-resolution model of any particular nucleotide sequence within a nucleic acid molecule can be determined and used for modeling purposes. From such models, it is possible to deduce the positions of ⁇ " various hydrogen bond donor and acceptor elements within a nucleic acid molecule. The distance ranges over which such interactions occur are also known in the art, as are such disinteractions for other forces involved in nucleic acid molecule/nucleic acid binding ligand interactions. From such data, it is possible to design synthetic ligands that correspondingly array hydrogen bond acceptors and donors in a manner that allows formation of hydrogen bonding upon interaction of the particular ligand with a particular nucleotide sequence in such nucleic acid molecule.
- This invention encompasses designing a molecular scaffold directed toward a particular target nucleotide sequence. Such molecules can then be screened in vitro and/or in vivo to ascertain if the desired interaction occurs. Those molecules exhibiting the desired interaction, particularly those that do so at submacromolar and preferably subnanomolar binding affinities can be selected for further use. Alternatively, such compounds can be used as a basis for lead optimization in order to generate additional analogs that exhibit the desired interactions. Preferably, any such molecule selected for further use will also exhibit cell-permeability.
- the nucleic acid binding moiety comprises the structure h- Ql— Z l — Q2— Zz — . . . — Q m — Zm ⁇ where each of Qi, Q 2 , ... , Q m , is a heteroaromatic moiety or (CH ) P (where p is an integer between 1 and 3, inclusive); each of Zi, Z , ..., Z m is a covalent bond or a linking group; and m is an integer between 1 and 9 (preferably between 2 and 4), inclusive.
- Q is a heteroaromatic moiety, it is preferably selected from optionally substituted imidazole, pyrrole, pyrazole, furan, isothiazole, oxazole, isoxazole, thiazole, thiophene, furazan, 1,2,3-thiadiazole, 1,2,4-thiadiazole, 1,2,5- thiadiazole, 1,3,4-thiadiazole, 1,2,3-triazole, 1,2,4-triazole, 1,3,4-oxadiazole, 1,2,4- oxadiazole, and thiophene moieties.
- substituents include Cl, F, CH 3 (e.g., as in N-methylpyrrole or N-methylimidazole), hydroxy (e.g., as in 3- hydroxypyrrole).
- Linking groups Zj, Z 2 , ... Z m can be between 2 and 5 (preferably 2) backbone atoms long.
- Exemplary linking groups include carboxamide, amidine ' and ester groups, with carboxamide groups being preferred.
- PNAs Protein nucleic acids
- the PNA backbone comprises N-(2-aminoethyl)glycine units to which the nucleobases are attached.
- PNAs containing all four natural nucleobases hybridize to complementary oligonucleotides according to Watson-Crick base-pairing rules, and thus represent true DNA mimics in terms of base-pair recognition. Egholm et al. (1993) Nature 65:566-568.
- PNAs Since a PNA backbone is uncharged, PNA/DNA and PNA/RNA duplexes exhibit greater thermal stability, as compared to DNA/DNA, DNA/RNA, or RNA/RNA duplexes. PNAs have the additional advantage in not being recognized by nucleases or proteases. In addition, PNAs can be synthesized on an automated solid state synthesizer using standard t-Boc chemistry. The design and synthesis of PNA molecules are described, for example, in U.S. Patent Nos. 5,539,083; 5,864,010; 5,977,296; and 5,985,563. PNA-based non-natural nucleic acid binding moieties can be designed, synthesized, and incorporated into compounds according to the invention. Chemistries suitable for attaching linkers (or linkers already conjugated to one or regulatory moieties) to such PNAs are known in the art and can be used for such purposes. C. Triplex-Forming Oligonucleotides
- nucleic acid binding moiety is an oligonucleotide capable of base-pair specific recognition with a double-stranded nucleic acid, such that the oligonucleotide can form a triple helix.
- Oligonucleotide-directed triple helix formation is one of the most effective methods for accomplishing the sequence specific recognition of double helical DNA. See, e.g., U.S. Patent No. 5,847,555; Moser et al. (1987) Science 238:645; Le Doan et al. (1987) NuclTAcids Res. "" 15:7749; Maher et al.
- Triple helices form as the result of hydrogen bonding between bases in a third strand of DNA and duplex base pairs in the double stranded DNA, via Hoogsteen base pairs.
- oligonucleotides typically comprise between about 10 to about 200, preferably about 10 to about 50, nucleotides.
- Such oligonucleotides can be synthesized by any suitable method, for example, by conventional automated solid state techniques.
- nucleotide sequences targeted by triplex-forming oligonucleotides comprise purine rich tracts on one of the strands of the double- stranded, double-helical nucleic acid.
- the triple helix so formed contains the oligonucleotide bound in either a parallel or anti-parallel orientation with respect to the target sequence depending on the nucleotide sequences used in the oligonucleotide.
- a parallel orientation occurs when the oligonucleotide is a pyrimidine-rich oligonucleotide.
- the pyrimidine-rich oligonucleotide contains a thymine containing nucleotide (T) when the nucleotide at the complementary position in the purine-rich target sequence is an adenosine containing nucleotide (A) and a cytosine containing nucleotide (C) when the nucleotide at the complementary position in the purine-rich target sequence is a guanine containing nucleotide (G).
- T thymine containing nucleotide
- A adenosine containing nucleotide
- C cytosine containing nucleotide
- G guanine containing nucleotide
- An anti-parallel orientation occurs when a purine-rich oligonucleotide is used.
- anti-parallel orientation is obtained when the purine-rich oligonucleotide contains a guanine containing nucleotide (G) when the nucleotide at the complementary position in the purine-rich target sequence is a guanine containing nucleotide (G) and an adenosine containing nucleotide (A) when the nucleotide at the complementary position in the purine-rich target sequence is an adenosine containing nucleotide (A).
- G guanine containing nucleotide
- A adenosine containing nucleotide
- Synthetic triple helix-forming oligonucleotides can also bind in an anti- parallel orientation to a purine-rich target sequence.
- triple helix-forming oligonucleotides contain a G when the nucleotide in the complementary position of the purine-rich target sequence is G, T or A when the nucleotide in the complementary position in the purine-rich target sequence is an A and the nucleotide nebularine when the complementary position in the purine-rich target sequence is C. Because such moieties are capable of targeting specific nucleotide sequences in double-helical nucleic acids, they represent suitable nucleic acid moieties for use in developing candidate synthetic regulatory compounds.
- such oligonucleotides will be designed to target a sequence that will allow a regulatory moiety attached thereto through a linker to be brought into proximity of the promoter of the gene, the expression of which is desired to be regulated.
- a linker to be brought into proximity of the promoter of the gene, the expression of which is desired to be regulated.
- one or more oligonucleotides can be designed that will possess the desired triplex-forming function.
- the oligonucleotides used in the invention to form triple helices can be made synthetically by well-known synthetic techniques to contain a structure corresponding to the naturally occurring polyribonucleic or polydeoxyribonucleic acids. See, e.g., U.S. Patent No. 5,847,555.
- the phosphoribose backbone of such oligonucleotides can be modified such that the thus formed oligonucleotide has greater chemical and/or biological stability. Biological stability of the oligonucleotide is desirable when the oligonucleotides are used in vivo.
- Such modified oligonucleotides can be synthesized with a structure that is stable under physiological conditions, including enhanced resistance to nuclease degradation. Further, when used in vivo, such nucleotides preferably have a minimal length that permits targeted triple helix formation so as to facilitate the transport of the oligonucleotide across the membranes of the cytoplasm and nucleus.
- nucleic acid binding moieties intended for use in a synthetic regulatory compound of the invention are preferably tested against the target nucleotide sequence to identify those which bind with the highest affinity and greatest specificity. Binding affinity can be assessed by any suitable method, for example, DNase I footprinting. Specificity can be determined by using nucleic acids containing target sequences and others containing nucleotide sequences that differ from the target sequence by one or more nucleotides.
- such compounds have binding affinities (in terms of association constants) of at least about IO 6 M “1 , more preferably at least about IO 9 M “1 , and even more preferably, at least about 10 10 M “1 , 10 11 M “1 , IO 12 M “1 , or higher.
- binding affinities in terms of association constants
- Specifities of least about 2, preferably at least about 3, 4, or 5, even more preferably at least about 10, 50, 100, or more, for target versus single base pair "mismatch" sites are preferred.
- Such molecules can then serve as the basis for the synthesis of the synthetic regulatory compounds according to the invention by attaching one or more regulatory moieties thereto through one or more linkers.
- the resulting compounds are then preferably tested in cell-based assays (e.g., where the cells carry a reporter construct that comprises a reporter gene under the control of a promoter region that is targeted by the nucleic acid binding moiety of the compound) to determine if the desired regulatory function is achieved by the particular synthetic regulatory compound. If so, the compound typically will represent a lead compound which can then be used to develop potential therapeutic or prophylactic drugs or other compounds that can be administered to cells to achieve the desired regulatory function.
- D. Polyamides e.g., where the cells carry a reporter construct that comprises a reporter gene under the control of a promoter region that is targeted by the nucleic acid binding moiety of the compound
- the nucleic acid binding moiety includes those that are dsDNA intercalators, dsDNA minor groove binding moieties, and dsDNA major groove binding moieties. It is to be understood that, where the nucleic acid binding moiety is referred to as a "minor groove binder" (or words to that effect), it does not mean that such moiety has binding interactions exclusively with the minor groove; the moiety also can have binding interactions with other parts of the dsDNA, for example, with adjacent base pairs by intercalation, with backbone phosphate groups, or with the major groove.
- the nucleic acid binding moiety is a minor groove binder, which typically (but not necessarily) has an elongate crescent shape, topologically complementary to the shape of the minor groove.
- the minor groove binder can be a residue of a naturally-occurring compound, such as doxorubicin, daunomycin, anthramycin, calicheamycin, mitomycin, duocarmycin, distarnycin, and netropsin, or an analog or a derivative thereof.
- the nucleic acid binding moiety can be a residue ofa synthetic minor groove binder.
- the nucleic acid binding moiety is a synthetic polyamide unit comprising N-methylpyrrole carboxamide ("Py”) units and optionally one or more of N-methylimidazole carboxamide ("Im”), N-methyl-3- hydroxypyrrole carboxamide (“Hp”), glycine carboxamide, P-alanine carboxamide, ⁇ -aminobutyric acid carboxamide, 5 -amino valeric acid carboxamide, and ⁇ -2,4- diaminobutyric acid carboxamide units.
- Such synthetic polyamides are minor groove binders, binding with high binding constants, often greater than 10 M " . The design and synthesis of such polyamides are described for instance in Baird et al.
- Glycyl or ⁇ -alanyl carboxamides can serve as "spacer" groups for adjusting the position of the heteroaromatic carboxamide residues in relation to the nucleotide base pairs of a polyamide-based nucleic acid binding moiety's binding site.
- a ⁇ - aminobutyric acid carboxamide, 5-aminovaleric acid carboxamide, or ⁇ -2,4- diaminobutyric acid carboxamide unit (or other moieties that produce a substantially equivalent structural effect) provides the potential for formation of a "hairpin" conformation, wherein some or all of the heteroaromatic carboxamide units from a portion of the polyamide to one side of the turn-providing moiety bind side-by-side to the heteroaromatic carboxamide units from the portion the polyamide to the other side of the turn-providing moiety. See Figure 2C for a representative diagrammatic illustration of polyamide hairpin conformation.
- Hairpin polyamides are capable of targeting predetermined DNA sequences with affinities and specificities comparable to DNA binding proteins in accordance with a simple set of pairing rules dictated by the side-by-side binding of the aromatic amino acids. Mrksich et al. (1992) Proc. Natl. Acad. Sci. USA 89:7586-7590; Wade et al. (1992) J. Am. Chem. Soc. 114:8784-8794; and Trauger et al. (1996) Nature 382:559-561. These synthetic DNA binding ligands are cell permeable, and one such compound was shown to specifically interfere with gene expression in mammalian cell culture. Gottesfeld et al. (1997) Nature 387:202-205; and Dickinson et al. (1998) Proc. Natl. Acad. Sci. USA 95:12890-12895.
- the unpaired heteroaromatic carboxamide units from one polyamide can be available to form cooperative side-by-side pairings with unpaired heteroaromatic carboxamide units from another polyamide, for example another hairpin or straight-chain polyamide. Such cooperative interaction can serve to increase DNA binding specificity.
- nucleic acid binding moieties having other conformations e.g., cyclic or "H-pin" conformations, respectively.
- the 2-amino group of ⁇ -2,4-diaminobutyric acid provides, among other locations, an attachment point for tandem-linked polyamide units, as well as providing a moiety that can be used to introduce chirality into the polyamide.
- a Py, Hp, or Py equivalent heteroaromatic carboxamide can be replaced with a ⁇ carboxamide to form pairs such as ⁇ / ⁇ ⁇ /Py, or Py/ ⁇ .
- Compounds of this invention are useful because they are strong nucleic acid binders, often as nanobinders (i.e., association constant (K a ) of IO 9 M “1 ) or even as picobinders (K a of 10 12 M “1 ). It is especially noteworthy that some compounds of the invention are nanobinders while having relatively few heteroaromatic moieties (3-5), while previously described nanobinders have generally required a larger number of heteroaromatic moieties.
- compounds of this invention will have anti-fungal (e.g., yeast, filamentous fungi) and/or anti-bacterial (Gram-positive, Gram-negative, aerobic, anaerobic) properties and therefore can be used for combating (i.e., preventing and/or treating) infections by such pathogens.
- Other pathogens against which compounds of this invention can be used include protozoa and viruses.
- a compound of this invention can be used in combination with a pharmaceutically acceptable carrier.
- the composition can be dry, or it can be a solution. Treatment can be reactive, for example, to combat an existing infection, or prophylactic, for preventing infection in an organism susceptible to infection.
- Host organisms that can be treated include eukaryotic organisms, in particular plants and animals.
- the plant can be an agriculturally important crop, such as wheat, rice, corn, soybean, sorghum, and alfalfa.
- Animals of interest include mammals such as bovines, canine, equines, felines, ovines, porcines, and primates (including humans).
- SUBSTrr ⁇ TE SHEET (RULE 26) While not wishing to be bound by any particular theory, it is believed that the synthetic regulatory compounds of this invention derive their biological activity by binding to double stranded nucleic acid, in particular double stranded DNA, and recruiting (in the case of activators) transcriptional machinery to the gene to be expressed or, in the case of repressors, inhibiting such recruitment
- the matching of a synthetic regulatory compound of this invention against a particular gene can be accomplished by rational design if the desired target dsDNA base pair sequence - e.g., a sequence in a gene (particularly a regulatory region thereof, e.g., a promoter, enhancer) is known.
- a nucleic acid binding moiety that binds to the target base pair sequence with the desired degree of specificity is preferably used.
- the NABM can be a residue of a naturally occurring dsDNA binder with known specificity for the target sequence, or can be a synthetic dsDNA binder synthesized according to the base pair recognition rules discussed hereinabove.
- the matching can be accomplished by a suitable screening method. i. Polyamide Synthesis.
- U.S. Patent Nos. 6,090,947 and 5,998,140 detail solid state synthetic processes for use in synthesizing polyamides useful in the practice of this invention.
- a solid support preferably a polystyrene resin
- amino acids To begin, the appropriate amino acid monomer or dimer is protected at its amino (NH 2 ) group with a Boc-group or an Fmoc-group and activated at the carboxylic acid (COOH) group by formation of an -OBt ester.
- the protected and activated amino acids are then sequentially added to the solid support, beginning with the carboxy terminal amino acid.
- the amino acids are deprotected and the peptide is cleaved from the resin and purified.
- the reactions were stopped by the addition of 50 ⁇ L ofa solution containing 2.25 M NaCl, 150 mM EDTA, 23 ⁇ M base pair calf thymus DNA, and 0.6 mg/ml glycogen, and ethanol precipitated.
- the reactions were resuspended in 1 x TBE/ 80% formamide loading buffer, denatured by heating at 85°C for 15 minutes, and placed on ice.
- the reaction products were separated by electrophoresis on an 8% polyacrylamide gel (5% crosslinking, 7 M urea) in 1 x TBE at 2000 V for 1.5 h. Gels were dried on a slab dryer and then exposed to a storage phosphor screen at 22°C. iii. Quantitative DNase I Footprint Titration Data Analysis.
- I S j te 0 and I ref ° are the site and reference intensities, respectively, from a DNase I control lane to which no polyamide was added.
- the major players in the regulation of gene expression within the nucleus are: the genes and their regulatory sequences that are complexed with structural proteins (e.g., histones) in chromatin; chromatin remodeling activities that allow access to a gene and its regulatory regions; regulatory proteins that instruct the transcription machinery to express (or, as in the case of repressors, prevent the expression of) the relevant genes; and the RNA-synthesizing machinery that decodes the genes.
- a host of other activities play a role in this process, for instance, those that facilitate elongation of paused transcripts, or those that lead to the processing of nascent transcripts and those that play a role in release of full-length transcripts.
- Activators Positive regulation (stimulation) of gene expression requires factors called transcriptional activators.
- An economical 'recruitment' model posits that activator proteins bind to DNA and recruit the transcriptional machinery to the promoter of the gene, thereby stimulating gene expression.
- Most activators comprise three functional modules. Of these, specificity in targeting genes is achieved by the DNA recognition module which binds to cognate DNA sequences near a promoter of a gene and in most cases DNA binding specificity is further enhanced by dimerization.
- a key functional module, the activating region is thought to bind one or more components of the transcriptional machinery. While not wishing to be bound to a particular theory, it is believed that weak interactions between an activating region and several components of the machinery result in high avidity 'multi-dentate' binding.
- the typical activating region e.g., those used here
- Repressors These proteins appear to function to inhibit gene expression at several levels. Some repressors function in part by blocking the activity of activators directly, for example, by binding to an activation domain on an activating protein in order to prevent its interaction with a component of the transcriptional machinery. Another example includes MDM-2, which not only binds to the activating region of p53, but also indirectly attenuates transcriptional activity by stimulating p53 's degradation via a proteolytic pathway.
- repressors are recruited to promoters where they serve to inhibit the ability of transcriptional machinery to utilize the proximal promoter by either directly interacting with the machinery and inactivating it, or indirectly by mediating changes in chromatin structure so as to prevent the components of a transcriptional apparatus from interacting with DNA.
- RNA polymerase II (12 subunits), several general transcription factors (TF ⁇ -A, B, D, E, F, H), mediator complex (-20 Srb and Med subunits), elongator complex, co- activator proteins and several additional polypeptides, some of which remain to be defined. Most of these proteins are conserved through evolution and occur in species from yeast to humans.
- RNA polymerase II holoenzyme Many of the components of the transcription machinery exist in large multi- subunit complexes which associate with the RNA polymerase II, and are known as the RNA polymerase II holoenzyme.
- the RNA-polymerase II holoenzyme can be
- STiTUTE SHEET (RULE 26 i broadly described as containing two functional parts. One part is the "catalytic core” that is required for synthesizing mRNA while the other is the mediator (Bjorkland et al. (1996) Trends Biochem. Sci. 21:335-337), a complex of approximately twenty proteins that is required for the holoenzyme to respond to activators. It is believed that the holoenzyme, along with additional factors that " do not associate tightly (such as TBP/TFIID and a class of proteins known as co- activators (Thompson et al. (1993) Cell 73:1361-1375; and Koleske et al.
- TFIID (Burley et al. (1996) Ann. Rev. Biochem. 65:769-799), an essential component of the transcriptional machinery, is not typically found associated with the holoenzyme, and is a target of activators and some repressors as well. It is a protein complex containing about thirteen components, including TBP and TBP- associated factors (TAFs). Kim et al. (1993) Nature 365:520-527; Kim et al. (1993) Nature 365:512-520; and Dynlacht et al. (1991) Cell 66:563-576.
- TBP is a sequence-specific DNA-binding protein that recognizes and binds via the minor groove to a sequence known as the TATA box (consensus: S'-TATAAAA-S') that exists in the promoters of many genes. Hoopes et al. (1992) J. Biol. Chem. 267:11539-1154; and Coleman et al. (1995) J. Biol. Chem. 270:13850-13859. TFIID associates with TFIIA, which is comprised of three polypeptides. TFIIA helps TFIID bind to DNA perhaps by competing with repressors as well as displacing inhibitory domains within TAFs away from TBP. Geiger et al.
- TFIIB a holoenzyme component
- Chromatin Remodeling Machinery In order for a gene sequestered in chromatin to become available for transcription, the chromatin structure must be remodeled. Felsenfeld (1992) Nature 355:219-224; Springfield et al. (1996) Genes Dev. 10:905-92; and Kadonaga (1998) Cell 92:307-313. Chromatin remodeling occurs through activator-mediated recruitment of at least two types of chromatin remodeling complexes. One of which comprise the histone acetyl transferases that contain proteins that acetylate certain lysine residues in the amino-terminal tails of histone proteins (Brownell et al. (1996) Curr. Opin. Genet. Dev.
- Swi Snf uses energy derived from ATP hydrolysis to facilitate binding of the transcriptional machinery to a particular promoter. Burns et al. (1997) Mol. Cell. Biol. 17:4811-4819; Quinn et al. (1996) Nature 379:844-847; Kwon et al. (1994) Nature 370:477-481; and Cote et al. (1994) Science 265:65-68. Activators can recruit chromatin remodeling complexes through direct binding.
- the viral activator VPl 6 has been shown to bind to components of both the multi-protein histone acetyl transferase (HAT) complex (Berger et al. (1992) Cell 70:251-265; and Candau et al. (1997) EMBO J. 16:555- 565), as well as the Swi/Snf complex.
- HAT multi-protein histone acetyl transferase
- TFIID another target of VPl 6, was observed to display a weak HAT activity. Mizzen et al. ( 1996) Cell 87 : 1261 - 1270; and Wilson et al. (1996) Cell 84:235-244.
- an activator bound to a promoter or enhancer recruits the chromatin remodeling machinery to the adjacent promoter. It then recruits the transcriptional machinery to form a pre-initiation complex at the promoter. It appears that assembly of a pre-initiation complex can require two synchronized steps: TFIID/TBP -TATA binding in concert with the association of the holoenzyme with the complex at the promoter.
- TFIID/TBP -TATA binding in concert with the association of the holoenzyme with the complex at the promoter.
- the complex must open (melt) the double helix to expose the template strands.
- RNA polymerase Once mRNA initiation occurs and after a certain length of transcript is synthesized the polymerase must move away from the promoter to continue mRNA synthesis.
- Certain activators such as HSF and Tat function to stimulate this stage of transcription process, possibly by recruiting the pTEFB complex which contains a kinase (Cdk9) capable of phosphorylating the largest of the 12 subunits of the polymerase. It has been reported that promoter escape appears to involve hyperphosphorylation of the carboxy-terminal domain of the largest subunit of the RNA polymerase II.
- This hyperphosphorylation achieves two goals: first, it can provide the signal to detach the mediator complex from the catalytic core; and second, it can permit the association of RNA processing and elongator complexes with the rapidly elongating polymerase.
- the release of the mediator and TFIID during promoter escape by the polymerase would provide a mechanistic basis for a re-initiation event by another polymerase catalytic core. Svejstrup et al. (1997) Proc. Natl. Acad. Sci. USA 94:6075-6078; and Zawel et al. (1995) Genes Dev. 9:1479-1490. It has been found that mediator complexes are limiting, whereas the catalytic machinery is more abundant.
- activators directly interact with both the mediator as well as TBP/TFIID, thus, they can play a major role in helping to retain the mediator and/or TFIID at the promoter. Therefore, the next transcription complex can be reassembled rapidly by only recruiting the core fragment of the RNA polymerase II holoenzyme. It is postulated that re-initiation is much more likely than initiation alone to contribute significantly to rapid stimulation of gene expression. And activators must clearly play a role in (Ho et al. (1996) Nature 382:822-826) facilitating multiple rounds of transcription re-initiation.
- Repression requires the opposite series of events.
- a repressor can first directly engage an activator and mask its activating surface thereby preventing its interactions with the transcriptional and chromatin remodeling machinery.
- the repressor can also directly interrupt the low-level activator-independent assembly of the transcriptional machinery at the exposed promoter.
- the repressor such as Retinoblastoma gene product (Rb)
- Rb Retinoblastoma gene product
- deacetylated histone H3 tails are then methylated by methyl transferases, which are also recruited by repressors.
- the methylated histone tails bind to chromatin compacting proteins such as HP-1.
- HP-1 chromatin compacting proteins
- a regulatory moiety is any molecule that can positively or negatively effect transcription of a target gene, other than by direct electrostatic interaction with double-stranded DNA.
- Representative embodiments of regulatory moieties include peptides, polypeptides, lipids, carbohydrates, and any combination thereof.
- a regulator moiety can be naturally occurring or be derived from or an analog of a naturally occurring molecule. Alternatively, it can be synthetic.
- Preferred regulatory moieties are peptides and small organic molecules.
- B. Activators An activator is any molecule that can activate transcription of a target gene. According to current understanding, an activator binds to its cognate sites in the genome and recruits the transcriptional machinery to nearby promoters; initiation of transcription then follows. Ptashne et al.
- Activators can be small molecules or peptides. For example, many protein activators are known. The peptides within such proteins that provide activation activity can be identified. Peptidomimetics of such peptides can also be generated, as can other analogs and derivatives. The key is whether the peptide or other molecule retains the desired activation function. Small molecules can be developed from rational design approaches modeled after known activators. Alternatively, or in addition, they can be identified by screening combinatorial libraries, natural product libraries, and/or libraries of already synthesized small organic molecules. Such molecules can be tested for activator function is a suitable test system.
- candidate molecules can be screened in an in vitro or cell-based reporter system comprising a reporter construct that carries a reporter gene the expression of which is under the control of a regulatable promoter.
- Candidate compounds that are found to activate transcription of the reporter gene can be retained for further study.
- the particular regulatory moiety can be identified as an activator. The activator can also then be tested for activity in other compounds that employ different linkers and/or nucleic acid binding moieties, and even different regulatory moieties, or multiples of the same regulatory moiety. In this way, it will be possible to develop a library of activators, the members of which can be used for various, and even multiple, applications.
- Repressors A repressor is any molecule that can inhibit or prevent transcription ofa target gene. Repressors can be small molecules or peptides. For example, many protein repressors are known. The peptides within such proteins that provide repressor activity can be identified. Peptidomimetics of such peptides can also be generated, as can other analogs and derivatives. The key is whether the peptide or other molecule retains the desired repressor function. Repressors can be obtained in ways analogous to those used to identify activators. 3 Linkers
- a linker is any molecule that can be used to link at least one nucleic acid binding moiety to at least one regulatory moiety in a manner that allows the nucleic acid binding moiety(ies) to retain its(their) intended nucleic acid binding function(s) and the regulatory moiety(ies) to retain its(their) ability to influence transcription of the target gene.
- the linker should provide adequate spacing between and/or orientation with respect to the nucleic acid binding moiety and the regulatory moiety so as to allow each to retain its respective function.
- a linker is a polymer of backbone units e.g. methylene, propyl, and ether groups. Other backbone units include, but are not limited to, five or six membered rings. One or more backbone units can include a heteroatom including, but not limited to, sulfur, nitrogen or oxygen.
- Linkers used in the practice of this invention can be branched, although straight chain molecules are preferred. They can be amphipathic or aliphatic, with molecules in the latter class being preferred. Representative examples of suitable linkers include polyethylene glycol, alkyl chains, and peptides.
- a linker can contain one or more first reactive moieties for conjugation by a suitable chemistry to a nucleic acid binding moiety.
- a linker can contain one or more second reactive moieties for conjugation by a suitable chemistry to a regulatory moiety.
- the chemistry used to conjugate the first reactive moiety(ies) to the nucleic acid binding moietyries) can be the same or different as the chemistry used to conjugate the second reactive moiety (ies) of the linker to the regulatory moiety(ies).
- the linker is a dendrimer with respect to second reactive moieties, in that it contains two or more such moieties and thus can be useful in linking two or more regulatory moieties having the same or different regulatory activities to a nucleic acid binding moiety.
- Linkers can be conjugated to a nucleic acid binding moiety at any suitable location. Such locations include at the one or both ends of the molecule, or at an internal position. Preferably, when conjugated to the nucleic acid binding moiety, the linker is oriented such that upon interaction with the nucleic acid molecule (as mediated by the nucleic acid binding moiety) it is projected away from the nucleic acid molecule so as to avoid steric hindrance or other interaction with the nucleic acid.
- nucleic acid binding moiety is a polyamide
- preferred locations for conjugating a linker thereto include the carboxy terminus, the amino terminus, and an internal amino acid residue (e.g., a ⁇ -alanine residue, a substituted pyrrole residue, an unsubstituted pyrrole residue, a substituted imidazole residue, and an unsubstituted imidazole residue).
- an internal amino acid residue e.g., a ⁇ -alanine residue, a substituted pyrrole residue, an unsubstituted pyrrole residue, a substituted imidazole residue, and an unsubstituted imidazole residue.
- certain preferred embodiments involve conjugation of the linker to the internal amino acid residue that mediates hairpin formation, for example, ⁇ -aminobutyric acid.
- Linkers useful in the practice of this invention can be cleavable, for example, by an enzyme or chemical action (e.g., photooxidation). In this way, for example, the activity of a synthetic regulatory compound can be controlled by endogenous degradative processes.
- Suitable linkers can be identified from among candidate linkers by any of a number of suitable approaches.
- An example of one such in vitro system is as follows: a first reactive moiety on the candidate linker is used to attach the linker to a nucleic acid binding moiety known to specifically bind to a target nucleotide sequence in dsDNA in proximity to a regulatable promoter from which high level
- the first reactive moiety is used to attach the linker to a portion of the nucleic acid binding moiety in a manner that is not anticipated to substantially disrupt the nucleic acid binding moiety's ability to bind to its target nucleotide sequence. Whether such disruption occurs, or the extent of any such disruption, can be independently assessed by determining the binding affinity of the nucleic acid binding moiety for its target nucleotide sequence before and after linker attachment.
- the intermediate can be linked to a regulatory moiety via a second reactive moiety contained in the linker.
- the linker-regulatory element linkage not disrupt the biological activity of the moiety to which it is attached via the second reactive group, namely the activator or repressor.
- Whether such disruption also occurs, or the extent of any such disruption can be independently assessed by determining the biological activity of a regulatory moiety before and after linker attachment.
- a regulatory moiety-linker intermediate that retains some (at least about 10%, preferably at least 25%, and more preferably at least about 50%), substantially all (at least about 70%, preferably at least about 85%, and more preferably at least about 90-95%), or all (more than 95%, preferably more than 98%) of the regulatory moiety's activity, that linkage mechanism is preferably selected for use generating a synthetic regulatory compound according to the invention.
- the order of moiety conjugation to the linker can vary, and can depend on the particular nucleic acid binding moiety(ies) and/or regulatory moiety(ies) being employed. 4. Screens
- the synthetic regulatory compounds of the present invention comprise at least three elements: a non-natural nucleic acid binding moiety; a linker; and a regulatory moiety.
- a synthetic regulatory compound according to the invention that is used in an in vitro system need not exhibit cell permeability.
- such compound in order to be a synthetic regulatory compound within the scope of the invention, such compound must enter cells, and then exert the intended desired regulatory effect.
- each of the moieties involved in constructing synthetic regulatory compounds of the invention are preferably independently tested, or screened, for their ability to provide the desired function (targeted nucleic acid binding or regulatory function in the case of nucleic acid binding and regulatory moieties, respectively) or structure (in the case of linkers).
- a nucleic acid binding moiety will generally be tested to determine if it binds the desired target sequence with the requisite affinity and specificity prior to being incorporated into a conjugate including a linker and/or a regulatory moiety.
- a regulatory moiety is preferably determined to have the desired regulatory effect prior to its inclusion in a conjugate comprising a nucleic acid binding moiety and a linker.
- nucleic acid binding moiety is exposed to nucleic acid molecules which include the intended target sequence, preferably under conditions that at least approximate those expected to be encountered when the molecule is put to its intended use. It is then ascertained by any of a number of conventional methods whether the desired binding events take place, and if so, at what affinities, etc.
- any suitable method can be employed. Such methods include the synthesis ofa single compound by traditional methods, up through a massively parallel combinatorial approach. For example, a number of combinatorial synthetic methods are known in the art. For example, Thompson & Elman ((1996) Chem. Rev. 9.6:555) recognized at least five different general approaches for preparing combinatorial libraries on solid supports. These were: (1) synthesis of discrete compounds; (2) split synthesis (split and pool); (3) soluble library deconvolution; (4) structural determination by analytical methods; and (5) encoding strategies in which the chemical compositions of active candidate are determined by unique labels, after testing positive for biological activity.
- Synthesis in libraries in solution includes at least spatially separate synthesis, and synthesis pools. Additional descriptions of combinatorial methods are known in the art. See, e.g., Lam et al. (1997) Chem. Rev. 97:4111. These approaches can be readily adapted to prepare moieties for use in accordance with the present invention, including suitable protection schemes, as necessary. After synthesis and testing of the various moieties required to make a particular compound according to the invention, they can be assembled into a synthetic regulatory molecule. After a putative synthetic regulatory compound according to the invention has been generated, it can be tested in any number of in vitro or cell-based assays that elicit detectable signals in proportion with the activity of the compound. Preferably, such assays are conducted in a high throughput format.
- a plurality of microtiter plates allows the simultaneous testing of large numbers of candidate compounds, be they in vitro or cell-based assays.
- High throughput formats are often partially or fully automated, and allow 100, 1,000, 10,000, or more candidate compounds to be screened at one time.
- the synthetic regulatory compounds of the invention regulate transcription such assays typically will employ systems in which one or more reporter genes can be expressed under the control of a promoter that can be influenced by the compound.
- suitable reporter constructs include those that encode genes such as luciferase, green fluorescent protein, CAT, etc., although any gene, the expression of which can be readily detected, can be employed. 5.
- synthetic regulatory compounds of the invention are preferably cell permeable, i.e., they can traverse a cell's plasma membrane and thus be internalized.
- the compounds inherently have this ability.
- they can be formulated in a composition that facilitates cell entry, for example, a liposome.
- such compositions can further comprise a cell targeting element to direct the composition to a particular cell type, for example, a cell expressing a specific antigen (e.g., a disease-associated antigen) not expressed on the surface of other cells.
- Cell permeability enables a compound according to the invention to enter into a cell's cytoplasm, from which it then moves into the nucleus to exert its intended regulatory effect.
- plasma membranes present a barrier to many molecules that, if they could enter a cell, could exert a useful effect.
- a compound that effects an intracellular function or activity for example, transcription of one or more particular genes, is soluble in both the aqueous compartments of the cells and organism, and preferably also in the lipid bilayers through which it must pass in order to enter cells or organelles, for example, mitochondria and chloroplasts.
- Nuclei on the other hand, have large pores that generally do not prevent ingress ofa synthetic regulatory compound according to the invention.
- Cell permeability can be assessed in a number of ways. For example, the intracellular concentration ofa compound can be determined, and compared to the extracellular concentration ofa compound. This can also be done over time, so that the rate at which a compound is taken up can be determined.
- a synthetic regulatory compound of the invention will typically be formulated into a suitable composition.
- suitable compositions include liquid and solid, or dry, compositions.
- the particular composition selected will depend on a variety of factors, including the particular synthetic regulatory compound(s) to be formulated, the intended use (e.g., for an agricultural purpose (for instance, as a pesticide or as a molecular switch, such as to induce flowering) or a medical application (e.g., to treat or prevent a disease)), the method of delivery (for example, in an agricultural context, spraying or broadcasting, and in a medical context, injection or oral delivery), regulatory requirements, etc.
- Formulations suitable for human or non-human animal uses are among the preferred embodiments of this aspect of the invention.
- one or more synthetic regulatory compounds preferably are suspended in an aqueous carrier, for example, in an isotonic buffer solution at a suitable pH.
- compositions can be sterilized by conventional sterilization techniques, or can be sterile filtered.
- the compositions can contain pharmaceutically or veterinarily, as the case can be, acceptable auxiliary substances as required to approximate physiological conditions, such as pH buffering agents.
- Useful buffers include for example, sodium acetate/acetic acid buffers.
- the desired isotonicity can be accomplished using sodium chloride or other pharmaceutically acceptable agents such as dextrose, boric acid, s ⁇ Uium tartrate, ⁇ ' propylene glycol, polyols (such as mannitol and sorbitol), or other inorganic or organic solutes.
- Sodium chloride is preferred particularly for buffers containing sodium ions.
- the synthetic regulatory compounds of the invention can also be formulated as pharmaceutically acceptable salts and/or complexes thereof.
- Pharmaceutically acceptable salts are non-toxic salts at the concentration at which they are administered. The preparation of such salts, including addition salts, can facilitate pharmacological or other use by altering the physical-chemical characteristics of the active ingredient without preventing the synthetic regulatory compound from exerting its intended physiological effect.
- Pharmaceutically acceptable salts of compounds of this invention include salts of their conjugate acids or bases.
- Suitable counterions for conjugate acid salts include the chlorides, bromides, phosphates, sulfates, maleates, malonates, salicylates, fumarates, ascorbates, benzenesulfonates, methanesulfonates, p-toluenesulfonates, cyclohexylsulfonates, lactates, malates, citrates, acetates, tartrates, succinates, glutamates, sulfamates, quinates, and the like, in particularly those salts which are FDA acceptable.
- a conjugate acid salt can be formed by contacting a synthetic regulatory compound in the free acid or base form with a sufficient amount of the desired base or acid, for example, hydrochloric acid, sulfuric acid, phosphoric acid, sulfamic acid, acetic acid, citric acid, lactic acid, tartaric acid, malonic acid, methanesulfonic acid, ethane sulfonic acid, benzene sulfonic acid, /.-toluenesulfonic acid, cyclohexyl sulfamic acid, and quinic acid.
- a synthetic regulatory compound in the free acid or base form for example, hydrochloric acid, sulfuric acid, phosphoric acid, sulfamic acid, acetic acid, citric acid, lactic acid, tartaric acid, malonic acid, methanesulfonic acid, ethane sulfonic acid, benzene sulfonic acid, /.-toluenesulf
- Such contacting can typically involve reacting the free acid or base forms ofa synthetic regulatory compound with one or more equivalents of the appropriate base or acid in a solvent or medium in which the salt is insoluble, or in a solvent such as water that is then removed, for example, in vacuo, by freeze-drying, or by ion exchange.
- a conjugate acid or base form of a compound of this invention is considered equivalent to the free base form (or the free acid form, as the case may be) for the purposes of the claims of this invention.
- Carriers and/or excipients can also be included in compositions of the invention.
- Representative examples of carriers and excipients include calcium carbonate, calcium phosphate, various sugars such as lactose, or types of starch, cellulose derivatives, gelatin, vegetable oils, polyethylene glycols, and physiologically compatible solvents.
- Excipients such as polyhydric alcohols and carbohydrates share the same feature in their backbones, i.e., -CHOH-CHOH-.
- Useful polyhydric alcohols include such straight-chain molecules as sorbitol, mannitol, inositol, glycerol, xylitol, and polypropylene/ethylene glycol copolymer, as well as various polyethylene glycols (PEG) of various molecular weights, including molecular weights of 200, 400, 1450, 3350, 4000, 6000, and 8000.
- PEG polyethylene glycols
- Carbohydrates for example, mannose, ribose, trehalose, maltose, glycerol, inositol, glucose, galactose, arabinose, can also be included.
- liquid compositions can be thickened with a thickening agent such as methylcellulose.
- a thickening agent such as methylcellulose.
- They can be prepared in emulsified form, either water in oil or oil in water.
- suitable emulsifying agents including pharmaceutically acceptable emulsifying agents, can be employed including, for example, acacia powder, a non-ionic surfactant (such as a Tween), or an ionic surfactant (such as alkali polyether alcohol sulfates or sulfonates, e.g., a Triton).
- compositions of the invention are prepared by mixing the ingredients following generally accepted procedures.
- compositions of the invention will typically be provided in a dosage form or formulation. Any suitable dosage form can be employed, and different dosage forms can be used for different applications. Exemplary formulations within the scope of the invention include a parenteral liquid dosage form, a lyophilized or freeze-dried unit-dosage form, controlled or sustained release formulations, wherein an effective amount of the active ingredient is released over time, and modifications of these dosage forms that are useful in practicing certain aspects of the invention.
- dosage forms can be administered to a patient via a variety oi routes, including oral, nasal, buccal, sublingual, intratracheal, ocular, transdermal, and pulmonary delivery.
- Formulations that support a parenteral liquid dosage form are those in which the active ingredient(s) are stable, and typically the solvent has adequate buffering capacity to maintain the pH of the solution over the intended shelf life of the product, and can optionally include a preservative.
- the dosage form should be either an isotonic and/or an iso-osmolar solution to either facilitate stability of the active ingredient or lessen the pain on injection or both.
- Oral delivery can be accomplished in a variety of ways, for example, by liquid (including gel caps) or solid dosage forms, and certain preferred embodiments concern pharmaceutical formulations intended for oral administration. Such formulations can be prepared as solid dosage forms.
- Solid dosage forms are pills, e.g., capsules, tablets, caplets, or the like suitable for oral ingestion.
- Solid dosage forms typically contain inert ingredients (e.g., carriers and excipients, as described above) along with the active ingredient to facilitate tablet formation.
- inert ingredients e.g., carriers and excipients, as described above
- Numerous capsule manufacturing, filling, and sealing systems are well- known in the art, and can be used to make pills of any desired size.
- Preferred capsule dosage forms can be prepared from gelatin or starch. After making a capsule dosage form, if desired, it can be coated with one or more suitable materials. For example, one or more enteric coatings can be applied to prevent gastric irritation, nausea, or to prevent the active ingredient from being destroyed by acid or gastric enzymes, or to target a particular gastrointestinal region.
- Formulations that support pulmonary and/or intra-tracheal dosage forms include preserved or unpreserved liquid formulations and/or dry powder formulations, as can be used in any suitable device for delivering the composition to the lung, e.g., a metered dose inhaler, nebulizer, or dry powder inhaler.
- Dissolvable gels and/or patches are useful to facilitate buccal delivery, and can be prepared from various types of starch and/qr cellulose derivatives.
- Sublingual delivery can be supported by liquid formulations or by solid dosage forms that dissolve under the tongue. After a dosage form is prepared, it is typically packaged in " a * suitable material. For pill or tablet dosage forms, the dosage forms can be packaged individually or bottled en masse.
- the effective dosage of a synthetic regulatory compound of the invention will vary depending upon a variety of factors, including the compound itself, its intended application, etc.
- the particular dose and dosing regimen can be determined by the attending clinician, and can be further dependent upon such factors as the age, weight, and condition of the patient.
- nucleic acid binding moieties employed in the compounds of the invention provide a level of promoter- or other regulatory element selectivity that can not be afforded by natural or chimeric transcriptional factors.
- compositions comprising a synthetic regulatory compound can be used to modulate physiological processes in vivo.
- animals particularly humans, primates, and domestic animals, one can affect development by controlling the expression of particular genes, modify physiological processes, such as accumulation of fat, growth, response to stimuli, etc., and treat or prevent disease.
- domestic animals include bovine, canine, equine, feline, murine, ovine, and.porcine animals, as well as fish and birds.
- the compositions of the invention can also be used agriculturally, for example, to control plant and animal pests, to affect gene regulation in plants, particularly commercially important crops.
- the synthetic regulatory compounds of the invention can be used for the treatment or prevention of various disease states.
- the subject compositions can be used, for example, therapeutically, to inhibit or activate the expression of one or more genes, which can change the phenotype of cells, either endogenous or exogenous to the host or patient, where the phenotype is detrimental.
- synthetic regulatory compounds can be developed that contain a nucleic acid binding moiety that specifically binds to nucleic acids (e.g., double-stranded genomic DNA) in pathogens (i.e., viruses and pathogens of either eukaryotic or prokaryotic origin) that are involved in the regulation of expression of certain pathogenic genes, for example, genes required for replication or virulence.
- the nucleic acid binding moiety can be chosen to target a synthetic regulatory compound to a unique sequence of the pathogen that is not found in the genome of the pathogen's host.
- a synthetic regulatory compound to a unique sequence of the pathogen that is not found in the genome of the pathogen's host.
- the subject compositions can be used to inhibit expression of the gene by inhibiting transcription or chromatin remodeling thereof.
- a disease phenotype is caused by under-expression ofa gene
- its expression can be activated by providing a compound according to the invention.
- An important application of the invention's synthetic regulatory compounds is in the prevention and treatment of cancer. For example, expression of specific oncogenes can be inhibited or prevented. Similarly, expression of genes inappropriately up-regulated in cancer cells can also be inhibited or prevented.
- other genes whose expression is essential to the maintenance of an immortal phenotype e.g., the genes coding for the RNA and protein components of telomerase, can be down-regulated using a
- TE SHEET (RULE 26) synthetic regulatory compound.
- An alternative strategy involves the activation of genes that give rise to an apoptotic cascade or whose expression is down-regulated as part of the tumorigenic process. These approaches to regulating transcription will find application in situations such as cancers, such as sarcomas, carcinomas and leukemias, restenosis, psoriasis, lymphopoiesis, atherosclerosis, pulmonary fibrosis, primary pulmonary hypertension, neurofibromatosis, acoustic neuroma, tuberous sclerosis, keloid, fibrocystic breast, polycystic ovary and kidney, scleroderma, inflammatory diseases such as rheumatoid arthritis, ankylosing spondilitis, myelodysplasia, cirrhosis, esophageal stricture, sclerosing cholangitis, retroperitoneal fibrosis, etc.
- Inhibition or activation can be associated with one or more specific growth factors, such as the families of platelet- derived growth factors, epidermal growth factors, transforming growth factor, nerve growth factor, fibroblast growth factors, e.g., basic and acidic, keratinocyte fibroblast growth factor, tumor necrosis factors, interleukins, particularly interleukin 1, interferons, etc.
- specific growth factors such as the families of platelet- derived growth factors, epidermal growth factors, transforming growth factor, nerve growth factor, fibroblast growth factors, e.g., basic and acidic, keratinocyte fibroblast growth factor, tumor necrosis factors, interleukins, particularly interleukin 1, interferons, etc.
- transcription factors such as ras, myb, myc, sis, src, yes, fps/fes, erbA, erbB, ski, jun, crk, sea, rel, fins, abl, met, trk, mos, Rb-1, etc.
- Other conditions of interest for treatment with the subject compositions include inflammatory responses, skin graft rejection, allergic response, psychosis, sleep regulation, immune response, mucosal ulceration, withdrawal symptoms associated with termination of substance use, pathogenesis of liver injury, cardiovascular processes, neuronal processes, and, in particular, where specific T-cell receptors are associated with autoimmune diseases, such as multiple sclerosis, diabetes, lupus erythematosus, myasthenia gravis, Hashimoto's disease, cytopenia, rheumatoid arthritis, etc., the expression of the undesired T-cell receptors can be diminished, so as to inhibit the activity of the disease-associated T-cells.
- autoimmune diseases such as multiple sclerosis, diabetes, lupus erythematosus, myasthenia gravis, Hashimoto's disease, cytopenia, rheumatoid arthritis, etc.
- TNF enzymes that produce singlet oxygen, such as peroxidases and superoxide dismutase, proteases, such as elastase, INF- ⁇ , IL-2, factors that induce proliferation of mast cells, eosinophils, IgG, IgE, regulatory T cells, etc., or modulate expression of adhesion molecules in leukocytes and endothelial cells.
- NCCLS National Committee for ' Clinical Laboratory Standards
- MIC minimal inhibition concentration
- Staphylococcus spp. Streptococcus spp., Enterococcus spp., Coryne bacterium spp., Listeria spp., Bacillus spp., Micrococcus spp., Peptostreptococcus spp, Clostridium spp., Propionibacterium spp., Escherichia spp., Pseudomonas spp., Haemophilus spp., Candida spp., Cryptococcus spp., Aspergillus spp., Trichophyto spp., Paecilomyces spp., Saccharomyces spp.
- Acinetobacter spp. Alcaligenes spp., Campylobacter spp., Citrobacter spp., Enterobacter spp., Proteus spp., Salmonella spp., Shigella spp., Helicobacter. spp., Neisseria spp., Vibrio spp., Bacteroides spp., Prevotella spp., Mycoplasma spp., Mycobacteria spp., and Clamydia SPP.
- Other opportunities for use of the subject synthetic regulatory compounds include modulating the level of expression of genes coding for receptors, ligands, enzymes, changing phenotype of cells, modifying the response of cells to drugs or other stimuli, e.g., enhancing or diminishing the response, and inhibiting or activating the expression of one of two or more alleles.
- individual and multiple compounds can be employed, directed to the same dsDNA region, but different target sequences, contiguous or distal, or different DNA regions, depending upon the number of genes which one wishes to modulate the expression of.
- the subject compositions can also be used as a sole therapeutic agenfor in combination with other therapeutic agents.
- other drugs can also be used, such as antibiotics, antisera, monoclonal antibodies, cytokines, anti-inflammatory drugs, and the like.
- the subject compositions can be used for acute or chronic situations, where a particular regimen is devised for the treatment of the patient.
- hairpin polyamide 1, containing a primary amine was prepared by solid-phase synthesis. Trauger et al. (1996; Baird et al.; and U.S. Patent No. 6,090,947. Polyamide 1 and thiolane-2,5-dione were combined in N-methylpyrrolidinone (NMP) with N,N-diisopropylethylamine (DIPEA) at ambient temperature and reaction progress monitored by analytical reversed-phase HPLC (Scheme 1). Polyamide 1 was rapidly consumed and thioacid 2 isolated by ether precipitation.
- NMP N-methylpyrrolidinone
- DIPEA N,N-diisopropylethylamine
- thioester 3 was produced in a "one- pot" conversion by lowering the pH of the initial reaction mixture with pH 5.2 NaO Ac buffer and adding benzyl bromide. As monitored by analytical HPLC, the conversion from 1 to 3 was virtually quantitative.
- a typical reaction procedure was conducted as follows: to a solution of 18.6 ⁇ moles (23.5 mg) polyamide 1 in 0.600 mL NMP was added thiolane-2,5-dione (25.0 ⁇ L ofa 1.00 M solution, 25.0 ⁇ moles), followed by 9.70 ⁇ L (55.6 ⁇ mol) of DIPEA. After 10 min., conversion to thioacid 2 appeared complete.
- the reaction mixture was diluted with 900 ⁇ L 100 mM NaO Ac buffer (pH 5.2) and cooled to 4°C, necessary to prevent the formation of dialkylation products in the subsequent step.
- Benzyl bromide 55.8 ⁇ mol, 6.70 ⁇ L was added with thorough mixing and after an additional 10 min., the thioester 3 was isolated by semi-preparatory reversed-phase HPLC as a pale yellow powder in 52% overall yield (14.2 mg, 9.66 ⁇ mol).
- Thioester 3 underwent reaction under standard "native chemical ligation" conditions with peptides having an amino-terminal cysteine (such as compound 4, below) to produce compound 5. Due to the hydrophobicity of thioester 3, it was necessary to dissolve the thioester in NMP (5-10% of reaction volume) prior to addition of aqueous denaturing buffer. Compound 5 was recovered in 29% yield (after reversed-phase HPLC purification) under these reaction conditions (MALDI- TOF MS [M+H] (average mass) calculated 6935.0, observed 6934.8).
- This Example describes the combination of a weakly dimerizing Gal4 domain (residues 73-100 of Gal4) as a linker to connect the polyamide and activator moieties of Example 3 (YLLPTCIP ("XL" SEQ ID NO: 2)) in order to make synthetic regulatory compounds of the invention, as well as such compounds that instead comprise a flexible polyethylene glycol linker.
- XL is described in Lu et al. (2000) Proc. Natl. Acad. Sci. USA 97:1989-1992.
- the synthetic process described in Example 1 was used to generate each of three compounds (7-9, Figure 1 SEQ ID NOs: 1, 2 and 3) tested in the course of the experiments described below.
- compounds 7, 8, 9 had the same structure, except that the linker domain of each compound was different.
- Compound 7 did not contain a linker per se; instead, the activator peptide was linked directly the C- terminus via a Cys residue during the native ligation procedure.
- Conjugate 8 contained a PEG linker
- compound 9 contained as a linker the ' weak dimerization domain comprising residues 73-100 of Gal4.
- Each of the compounds was generated in moderate to good yield using the synthetic process of Example 2, and each was sufficiently water soluble to be purified and characterized, although to a lesser degree than the polyamide moiety alone.
- Each conjugate had a binding affinity for the target nucleotide sequence (the same site for each compound) that was at least about 10-fold less than that of the polyamide moiety alone for its cognate target sequence.
- Compound 9 bound inverted target dimer site ( Figure IB) with an approximately 4-fold greater affinity (K a 9 x lO ⁇ "1 vs. 2 x 10 7 M "1 ), indicating that a favorable interaction occurs between adjacent compounds.
- K a 9 x lO ⁇ "1 vs. 2 x 10 7 M "1 )
- Example 3 Compounds Comprising AH Activators and Gcn4 Linkers Attached Via Internal Residues
- This example describes the synthesis and testing of several synthetic regulatory compounds according to the invention that also employ polyamides as the nucleic acid binding moiety.
- the linker is attached at an internal pyrrole unit rather than at the C-terminus of the polyamide. See Figure 2.
- an acidic activator peptide PEFPGIELQELQELQALLQQ ("AH", SEQ ID NO: 5) is described.
- the AH activator increases water solubility, and also contacts different components of the transcriptional machinery as compared to the small hydrophobic XL peptide.
- dimerization domain (Gcn4 251- 281) is described (compounds 10 (SEQ ID NO: 4), 11, 13 and 14, Figure 2).
- This dimerization domain is derived from the leucine zipper region of the yeast protein Gcn4. It is relatively small (about 30 residues) and soluble.
- Gcn4 conjugates containing XL conjugates 10 and 13 also included amino acid residues 90-100 of Gal4 believed to be necessary for full function.
- Each of compounds 10-20 was prepared by joining a linker-activator peptide conjugate to a polyamide via native ligation, as described in Example 1, Scheme 1. All of the compounds were prepared in low (5%) to good (50%) yfe d, and their " respective identities confirmed by MALDI-TOF mass spectrometry.
- DNase I footprint titrations were performed to determine DNA binding affinities.
- the DNA fragments used for the titrations were 32 P-labeled restriction fragments containing an inverted dimer binding site separated by 5 or 7 base pairs, as well as monomer sites.
- Each of the compounds bound DNA with reasonable (K a >10 7 M ) affinity.
- the conjugates containing the Gcn-4 dimerization domain exhibited much improved solubility characteristics and bound DNA with greater affinity than the corresponding Gal4 conjugates.
- Compounds containing a Gcn4(251-281) linker (compounds 10, 11, 13, and 14) bound a palindromic dimer site with about 5 fold higher affinity than the corresponding monomer site.
- each of the conjugates was separately tested in an in vitro transcription assay derived from yeast nuclear extracts to assess transcription activation function.
- the template DNA used in the reactions contained three palindromic binding sites 50 base pairs upstream of the transcription start site. See Figure 3. In these assays, compounds 9 and 17-20 did not activatei'transcription, and in some cases, inhibition or levels lower than basal were observed.
- Example 4 Compounds Comprising Non-Dimerizing Linkers This Example describes the design, synthesis, and testing ofa series of compounds (21-28 see Figure 4) in which the Gcn4-derived linker portion (Gcn4(251 -28 l))of compound 11 (see Example 5) was replaced with a "scrambled" peptide in which the amino acid residues of the linker were re-ordered so as to disrupt the helix-forming propensity of the coiled-coil motif of native peptide (compound 21) believed to be participate in dimerization of Gcn4 molecules; the polyamide and activating domains were the same as those in compound 11.
- the compounds described in this Example demonstrate that a discrete dimerization domain is not essential for activator function. That compound 11, which comprises a leucine zipper region derived from Gcn4, exhibits stronger activating potential than compounds containing a flexible linker leads to the conclusion that projection from DNA is the crucial function of the linker region. Furthermore, when the capacity to project AH from DNA via coiled-coil formation was removed (compound 21), the function of the conjugate as an activator was also severely compromised. Finally, transcription activation by these compounds demonstrates that the DNA binding domain of an activator plays no fundamental role in activation other than specifying the location of DNA binding.
- EXAMPLE 5 Activation of gene expression by small molecule transcription factors
- naturally occurring eukaryotic transcriptional activators typically, at a minimum, comprise a dsDNA binding domain and a separable activation domain, and most activator proteins also contain a dimerization module between the DNA binding and activation domains.
- This example describes preferred embodiments of a class of synthetic regulatory compounds according to the invention that mimic natural transcription factors, namely compounds comprising a dsDNA binding domain comprised of a polyamide, a peptide-based activation domain derived from a designed or naturally occurring transcription activating protein, and either a peptidic or polyethylene glycol linker (diglycolic anhydride).
- such compounds can mediate high levels of DNA site-specific transcriptional activation in vitro.
- a representative example of such molecules had a molecular weight of about 4.2 kDa, and contained a sequence- specific DNA binding polyamide to serve as the DNA binding region, a non-peptide linker instead of a dimerization peptide, and an activating region (here, a designed peptide) designated "AH", that comprised the following amino acid sequence: PEFPGIELQELQELQALLQQ (SEQ ID NO: 5, Giniger et al. supra). Because synthetic polyamides can be designed to recognize any specific double-stranded nucleic acid sequence, these results demonstrate that synthetic regulatory compounds can be designed to up-regulate the expression of any specified gene.
- the compounds described in this Example incorporate non-natural or synthetic counterparts for each of the functional modules typically found in naturally occurring activator proteins.
- the protein-based DNA binding module was replaced with a hairpin polyamide composed of N-methylpyrrole (Py) and N-methylimidazole (Im) amino acids that binds in the minor groove of D ⁇ A ( Figure 4).
- the hairpin polyamide selected for the present study ImPyPyPy- ⁇ - PyPyPyPy- ⁇ -Dp (where ⁇ is ⁇ -aminobutyric acid, ⁇ is ⁇ -alanine, and Dp is dimethylaminopropylamide; polyamide 1), binds the target nucleotide sequence 5'- TGTTAT-3' with a dissociation constant (K D ) of 1J nM. Initially, a palindromic
- HEET RUL 2 ⁇ binding site containing this sequence as an inverted repeat separated by 7 bp was targeted.
- Conjugates 3, 4, and 5 Figure 5).
- Polyamide 1 (Example 1) was prepared according to established protocols as described above and then was combined with 1.2 equivalent (eq.) thiolane-2,5-dione (Kates et al.
- Synthetic peptides 3 (SEQ ID NO: 10), 4 (SEQ ID NO: 11), and 5 ( Figure 5A) were synthesized using established peptide synthesis protocols.
- Peptide 3 contained both the Gcn4 dimerization domain and the AH activation domain at the carboxy-terminus of the peptide.
- Peptide 4 contained the dimerization domain from Gcn4 without the AH peptide; and peptide 5 contained the AH domain alone.
- each of peptides 3, 4, and 5 when synthesized included an amino-terminal Cys residue to facilitate linkage to compound 2.
- polyamide 2 (1 ⁇ mol) was combined with either peptide 3, 4, or 5, (0.8-1 ⁇ mol) in 5% 1-methyl- 2-pyrrolidinone in 6 M Gn HC1, 100 mM potassium phosphate buffer (pH 7.3), and 10% (vol/vol) thiophenol was added to this solution. Reaction progress was monitored by analytical HPLC and upon completion (3-5 days), purification of the mixture by reversed-phase HPLC resulted in isolation of the desired conjugates.
- D ⁇ ase I Footprinting Titrations Quantitative D ⁇ ase I Footprinting titrations were carried out in accordance with established protocols on a 3- 32 P- labeled 271-bp pPT7 EcoRI/EvwII restriction fragment. Trauger et al. supra; and Senear et al. (1986) Biochem. 25:7344-7354.
- the template plasmid was constructed by closing a 78-bp oligomer bearing three cognate palindromic sequences into a BgHl site 30 bp upstream of the TATA box of pML 53.
- This plasmid has the AdML TATA box 30 bp upstream of a 277-bp G-less cassette.
- the "mismatch" template (containing a substitution of a T/A base pair with a G/C base pair, as compared to the "match," or target site) was constructed by cloning a 78-bp oligomer containing three palindromic "mismatch" sites into a Bglil site 30 bp upstream of the TATA box of pML 53. Gu et al. (1999) Mol. Cell. 3:97-108. For each reaction, 20 ng of plasmid (30 fmol of palindromic sites) was preincubated with conjugate for 75 min before the addition of 90 ng of yeast nuclear extract in a 25 ⁇ L reaction volume under standard conditions. Lue et al. (1991) Met. ⁇ nzymol.
- Conjugate 3 contained the eight- ring hairpin polyamide as the DNA binding module in addition to residues 251-281 of the yeast protein Gcn4 as a dimerization element and the designed peptide AH as the activating region.
- the two control compounds 4 (PA-Gcn4) and 5 (PA- AH) each lacked one or another of components critical for activator function.
- Conjugate 3 activated transcription in yeast nuclear extracts on a DNA template containing three palindromic binding sites upstream of the start site (Fig. 8B). Lue (1991); and Lue (1989). Thus, inclusion of compound 3 at 200 nM concentration in the reactions resulted in 13-fold levels of activated transcription over basal levels.
- polyamide 1 alone or polyamide coupled to the Gcn4 dimerization domain did not stimulate transcription.
- activation depended on the presence of cognate polyamide binding sites upstream of the transcription initiation site.
- compound 3 failed to significantly activate transcription (Fig. 8C).
- Example 5 describes the construction and testing of synthetic regulatory compounds that comprise non-natural nucleic acid binding moieties and various linkers in combination with the designed amphipathic AH activator peptide, a motif later found to occur in many native activation proteins.
- One of those synthetic regulatory activators, 4.2 kDa in size comprised an eight-ring DNA binding hairpin polyamide tethered through a flexible ethylene glycol-based polyether linker to the 20 residue AH activating peptide stimulated high levels of promoter-specific transcription.
- a dimerization domain was unnecessary for function of the activation moiety.
- This example further demonstrates the utility of the nucleic acid binding moiety-linker-regulatory moiety motif by showing that even smaller synthetic ligands that mimic the activities of naturally occurring regulatory proteins can be successfully assembled and have regulatory function. Small synthetic regulatory compounds are preferred in order to increase the probability of membrane permeability without appreciable loss of specific gene-regulating activity.
- synthetic regulatory compounds are provided that comprise sequence-specific polyamides attached via a linker to an activation peptide comprised of 8 or 16 residues derived from the activation domain of the potent viral activator VPl 6.
- the 16 residue activation moiety coupled to the polyamide via a linker activated transcription three fold better than the analogous AH conjugate described in Example 5. Altering the site-of-attachment of the activation moiety on the polyamide allowed reduction of the intervening linker from 36 atoms to eight without significant diminution of the activation potential. Also provided are
- conjugates 1, 5, and 10 were synthesized and transformed.
- Polyamides 1, 5, and 10 were prepared according to established solid phase synthesis protocols and subsequently transformed into conjugates 2, 3, 4, 6, 7, 8, and 11 by the previously reported methods.
- Conjugate 9 was prepared in an analogous fashion. The identity of all conjugates was verified by MALDI-TOF mass spectrometry.
- DNase I Footprinting Titrations A 363-bp 5'- 32 P-labeled PCR fragment was generated from template plasmid pAZA812 in accordance with standard protocols and isolated by nondenaturing gel electrophoresis. All DNase I footprinting reactions were carried out in a volume of 40 ⁇ L. 0.8 ng/ ⁇ L of plasmid pPT7 was used in these reactions as unlabeled carrier DNA.
- a polyamide stock "" ' solution or water (for reference lanes) was added to an assay buffer where the final concentrations were 50 mM HEPES, 100 mM KOAc, 15 mM Mg(OAc) 2 , 5 mM CaCl 2 , 6.5% glycerol, 1 mM DTT, pH 7.0 and 15 kcpm 5'-radiolabeled DNA.
- the solutions were equilibrated for 75 min. at 22°C. Cleavage was initiated by the addition of 4 ⁇ L of a DNase I stock solution and was allowed to proceed for 7 min. at 22°C.
- the reactions were stopped by adding 10 ⁇ L of a solution containing 2.25 M NaCl, 150 mM EDTA, 0.6 mg/mL glycogen, and 30 ⁇ M base pair calf thymus DNA and then ethanol-precipitated.
- the cleavage products were resuspended in 100 mM Tris-borate-EDT A/80% formamide loading buffer, denatured at 85°C for 10 min, and immediately loaded onto an 8% denaturing polyacrylamide gel (5% crosslink, 7 M urea) at 2000 V for 2 h. 15 min.
- the gels were dried under vacuum at 80°C and quantitated using storage phosphor technology.
- Template plasmid pAZA812 was constructed by cloning a 78 bp oligomer bearing three cognate palindromic sequences for conjugates 2, 3, 4, 6, 7, 8, and 9 into a Eg/II site 30 bp upstream of the TATA box of pML ⁇ 53. This plasmid has the AdML TATA box 30 bp upstream of a 277 bp G-less cassette.
- Template pAZA813 was constructed by cloning a 78 bp oligomer containing three palindromic cognate sites for compound 11 into a Eg II site 30 bp upstream of the TATA box of pML ⁇ 53 (22).
- Synthesis of Synthetic Activators The activation domain (residues 411- 490) of VPl 6 fused to a DNA binding protein yielded a synthetic regulatory compound that activated transcription with a potency comparable to strong natural activator proteins. Dissection of this and other activation domains " has identifie minimal units that activate transcription, these modules are often surreptitiously iterated in natural activating regions. In VPl 6, one such minimal moiety comprises eight amino acids. When iterated, the activation potential of the consequent peptide increased in a synergistic rather than an additive manner. This property of activating regions was adopted in order to design artificial activators of varying strengths.
- a series of polyamide-linker-(VP16 minimal module)-, compounds (where x is 1 or 2) was designed, each of which comprises: a hairpin polyamide designed to target the cognate sequence 5'-TGTTAT-3'; a flexible tether of varying length (12 or 36 atoms); and one of three different activating regions (see Figure 11).
- the three activating regions were AH and one (VPl)or two (VP2) tandem repeats of the eight amino acid sequence derived from VPl 6 (SEQ ID NOs: 16 and 17).
- linker a critical role of the linker is to facilitate projection of the activating moiety away from the DNA for productive interaction with the transcriptional machinery. Initially, this was achieved by conjugating a polyether linker to the C-terminus of hairpin polyamides. Conjugation via linkage at an internal pyrrole residue also appeared attractive, as solution studies and x-ray crystallographic data demonstrated that the N-methyl group of the pyrrole residues is directed outward from the minor groove when a polyamide binding in the 2: 1 motif forms a sequence-specific complex with dsDNA.
- HEET RULE conditions for those experiments were substantially different than those of the typical in vitro transcription assays.
- concentration of DNA preferably is at least about a 10-fold excess of polyamide relative to DNA.
- in vitro transcription assay ' s employ a relatively high concentration of plasmid DNA (0.8 ng/ ⁇ L), which can cause a decrease in the occupancy of a target nucleotide sequence by 10- to 100-fold.
- Activating potential corresponds to size and site-of-attachment of the activation moiety.
- the in vitro transcription experiments reveal that the activation strength of a synthetic regulatory compound is proportional to the size of the activating region, and that projection of the activating region away from the DNA enhances its functionality.
- compound 8 PA-VP1
- PA-VP1 which lacked a linker
- substituting VPl with VP2 increased the activation strength of the resulting compound.
- projecting the VPl or VP2 module via a longer linker further improved the activation potential the synthetic regulatory compounds.
- the synthetic regulatory compounds used in this example employed polyamide molecules as non-natural nucleic acid binding moieties.
- the compounds were tested for cell permeability against two cell lines, SKOV (a cisplatin resistant human ovarian cancer cell line) and 293T (a human renal cell line) cells.
- the compounds were also tested for their ability to activate a transiently transfected reporter gene the expression of which could be up-regulated by activation of a promoter functionally associated with one DR5 target site.
- Other cells including, without limitation, COS, Cho, Jurkat, and HeLa cells, are also suitable for use with routine modifications in the experiments reported herein.
- the reporter gene carried by the cells comprised a minimal HS V TK promoter driving the expression ofa luciferase gene.
- Promoter activity was regulated by a consensus DR5 site approximately 50 bp upstream of the TK promoter.
- the DR5 site comprises a direct repeat of two consensus hexameric sequences separated by five nucleotides, the identity of which was irrelevant.
- UTE SHEET (RULE 26) of the consensus hexameric nucleotide sequences in the DR5 site was 5'-AGGTCA- 3', which also corresponds to an estrogen receptor half-site.
- the polyamide moieties of the synthetic regulatory compounds used in this example targeted the nucleotide sequence 5'-WGGWCA-3', and were fused via a PEG linker at the C-terminal tail to either the L- or the D- form o a VP2 activating region (AMyl54 and AMyl55, respectively).
- AMyl54 and AMyl55 Prior to testing these two conjugates on reporter cells, these compounds were tested for their ability to activate transcription in standard cell-free yeast system that comprised a reporter construct having three tandem 5'-AGGTCA-3' sites located 40 bp upstream of the AdML TATA:G-less cassette.
- the in vitro transcription experiments were performed as reported above. As expected, the AMyl51 polyamide alone did not stimulate transcription over basal conditions.
- the synthetic regulatory compound AM ⁇ l54 (PA-1L-VP2) activated transcription about 12 fold
- AM V 155 (PA-1L-D-VP2) activated transcription about ten-fold in the in vitro assays.
- the observed levels of activation were consistent with the fact that only three half-sites, rather than three complete palindromes, were used in the reporter construct employed in the in vitro transcription assays.
- the cells used for the cell permeability and cell-based reporter assays were passaged twice and then grown overnight in 6-well plates to sub-confluency in CO incubators. Two micrograms of DR5-Luc DNA and 0.5 micrograms of CMV- ⁇ gal were transfected in each of the wells. One well on each plate contained no reporter construct, and one contained a strong CMV promoter driving the Luc reporter gene to check for transfection efficiency.
- SKOV cells were transfected using DOTAP (a cationic lipid transfection system), and the 293 cells were transfected by the standard calcium phosphate precipitation method.
- activating regions typically are unstructured in solution, and they are fairly hydrophobic. Such peptides are typically rapidly degraded in cells. Thus, the higher activity of the D-form in activating reporting expression in 293 cells can reflect ah ability to resist proteolysis.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Animal Behavior & Ethology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Veterinary Medicine (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Public Health (AREA)
- General Health & Medical Sciences (AREA)
- Epidemiology (AREA)
- Chemical & Material Sciences (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Diabetes (AREA)
- Endocrinology (AREA)
- Pharmacology & Pharmacy (AREA)
- Pathology (AREA)
- Rheumatology (AREA)
- Toxicology (AREA)
- Urology & Nephrology (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
- Peptides Or Proteins (AREA)
Abstract
This invention provides novel synthetic regulatory compounds that comprise a nucleic acid binding moiety, a linker, and a regulatory moiety, compositions comprising such compounds, methods of designing and synthesizing such compounds, methods of screening such compounds to identify those having the desired regulatory activity, and methods of using such compounds to prevent or treat disease in plants and animals, including humans. These compounds, and compositions containing them, have multiple applications, including use in human and animal medicine and in agriculture.
Description
TITLE OF THE INVENTION
SYNTHETIC REGULATORY COMPOUNDS
CROSS-REFERENCE TO RELATED APPLICATIONS This application claims the benefit of and priority to United States provisional patent application serial number 60/161,545, filed October 26, 1999. STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH
The United States government may have certain rights to this invention pursuant to National Institute of Health grant no. GM-27681. FIELD OF THE INVENTION
This invention relates to synthetic regulatory compounds comprising a double-stranded nucleic acid binding moiety, a linker, and a regulatory moiety, the synthesis and testing of such compounds, and applications therefor. BACKGROUND OF THE INVENTION The following description of the background of the invention is provided to aid in understanding the invention, but is not admitted to be or to describe prior art to the invention.
The regulation of gene expression is critical to the growth, development, proliferation, and maintenance of all living cells and organisms. In most cases, the positive or negative regulation of genes is under the control of signal transduction cascades that transmit information from the cell surface to the nucleus. Signal- transduction cascades are generally triggered by ligands which may be small molecules, soluble peptides, extracellular matrix, adhesive proteins attached to cell surfaces of neighboring or migrating cells, and even metabolic intermediates. In most cases, ligands interact with a membrane bound, or sometimes soluble intracellular, receptor, thus triggering a cascade of events that ultimately either stimulate or inhibit the activity of the mRNA-synthesizing machinery at one or more genes. Such reprogramming of gene-expression leads to an appropriate cellular response to the stimuli. Based on current understanding, almost all such signals converge and mediate their function through transcriptional activators and/or repressors, although some environmental stimuli, such as nutrient deprivation, directly affect the stability of certain components of the transcriptional machinery.
Because of the importance of gene transcription or gene expression to living cells, manipulating the process is of extreme interest. Compounds that fundamentally alter the activity of the transcriptional machinery itself, for example, by inhibiting the elongation process, would be potent transcriptional modulators, and it is gene-specific regulation that is the goal of many development 'programs. Oiie approach to gene-specific transcriptional regulation has been to develop molecules that block activator-DNA or repressor-DNA interactions and thereby regulate transcription artificially. Several approaches in this vein are being investigated. One such approach involves protein nucleic acid (PNAs) which are oligomers that contain the standard purine and pyramidine bases of an oligonucleotide but contain a simple amide-based backbone as opposed to the sugar-phosphate backbone found in nucleic acids. Nielsen (1997) Chem. Eur. J. 3:505-508. PNAs have been shown to bind with very high affinity to single-stranded DNAs and RNAs. It has also been shown that a PNA complementary to one strand to a DNA duplex will invade the double helix, pair with its complementary strand. Footer et al. (1996) Biochem. 35:10673-10679. PNA binding can abolish PNA protein interactions in the same region, and additionally PNAs have been employed as antisense agents.
Another class of such molecules is oligonucleotides that are capable of promoting "triple helix" formation. Yet another class of molecules are the so-called "polyamides" developed by Dervan and co-workers. See, e.g., Dervan et al. (1999) Curr. Opin. Chem. Biol. 3:688-693. Polyamides having high sequence specificity and association constants have been developed based on a "code" by which a given pair of substituted or unsubstituted imidazole /pyrrole pair can be selecting to bind to particular nucleotide base pairs in the minor groove of double stranded DNA. Sequence specific polyamides that adhere with proteins that bind via the major- groove of double stranded DNA have also been developed. Bremer et al. (1998) Chem. Biol. 5:119-133.
In addition to developing molecules that interfere with the association of activators and repressors with the cognate target sequences, another approach involves small molecules that modulate (positively or negatively) interactions between proteins involved in the regulation of transcription. A second approach to regulating transcription ofa desired gene may involve mediating protein-protein
interaction. To date, efforts in this area have involved cell-based genetic approaches.
For example, the so-called "two-hybrid assay" (Fields et al. (1989) Nature
340:245-246) is based on the observation that in many promoter contexts, the DNA binding and activation domains of an activator protein function more or less independently of one another Brent, R. & Ptashne, M. (1985) Cell 43:729-736;
Keegan, L. et al (1986) Science 231:691-704.
Nonetheless, these domains require functional association in proximity of the promoter. For instance, if the activation and DNA binding domains of the yeast Gal4 protein are severed and expressed in a yeast strain deleted for wild-type GAL4, no transcription of genes under the control of the GAL4 promoter occurs. However, in genes encoding two other proteins that interact with one another are fused to the
DNAs encoding the severed GAL4 domains, activator activity is reconstituted and the target gene can be transcribed. Ma, J. & Ptashne, M. (1988) Cell 55:443-446. Other similar systems, each of which requires the intracellular expression of chimeric gene constructs, are known. Vidal et al. (1996) Proc. Natl. Acad. Sci. USA
93:10321-10326; Leanna et al. (1996) Nucl. Acids Res. 24:3341-3347; Huang et al.
(1997) Proc. Natl. Acad. Sci. USA 94:13396-13401; and Hu et al. (1990) Science
250:1400-1403. Despite these approaches, however, at present there exists no class of synthetic, cell-permeable compounds that can regulate the expression ofa specific gene.
BRIEF SUMMARY OF THE INVENTION
It is the object of this invention to provide a novel class of synthetic regulatory compounds, which are preferably cell permeable, as well as compositions comprising such compounds, methods of synthesizing such compounds, methods of screening to identify such compounds, and methods of using such compounds.
Thus, in one aspect, the invention concerns synthetic regulatory compounds, each of which comprises at least one nucleic acid binding moiety, at least one regulatory moiety, and at least one linker connecting the nucleic acid binding moiety(ies) to the regulatory moiety(ies). By "synthetic" is meant any compound wherein the at least one of the particular nucleic acid binding moiety(ies) and at least
one of the particular regulatory moiety(ies) are not found in the same molecule in nature, i.e., in a wild-type animal or plant. By "non-natural" is meant that the compound (e.g., a peptide) is not naturally occurring.
A "nucleic acid binding moiety" refers to any compound that binds to a nucleic acid molecule, be it single- or double-stranded DNA or RNA (with double- stranded DNA being preferred), under desired conditions. What constitutes "desired conditions" will vary depending upon application, but in general refers to reaction conditions such as temperature, pH, solvent, ionic strength, the presence or absence of chaotropic agents, reactant concentrations, etc. to be encountered in the eventual intended application of the compound. In the context of synthetic regulatory compounds to be used as drugs, for example, preferred desired conditions are physiological conditions and, again, these will vary depending upon the particular animal, plant, and environment being considered, but in any event may be determined by one ordinarily skilled in the art. The particular conditions used (for example, the reactions conditions used to conduct in vitro screening or other testing) need not be equivalent to those actually found in a cell for example; however, such conditions should sufficiently represent or approximate those likely to be encountered in the eventual application (e.g., the conditions in a human cancer cell, when the compound is a pharmaceutical intended to treat or prevent such cancer) such that meaningful data can be generated.
Preferably, a nucleic acid binding moiety of a compound according to the invention will specifically interact with a target nucleotide sequence. A "target nucleotide sequence" refers to a specific sequence of nucleotides, and is typically represented in the 5' to 3' direction using standard single letter notation, where "A" represents adenine, "G" represents guanine, "T" represents thymine, and "C" represents cytosine, and "U" represents uracil. As used herein, a "target nucleotide sequence" within a double-stranded nucleic acid molecule preferably comprises a sequence greater than 3 but preferably less than about 20 nucleotides in length.
The target sequence, in general, is defined by the nucleotide sequence on one of the strands of the double-stranded nucleic acid. Such sequences are pre-selected or pre-determined, and are thus referred to as targets. While such sequences comprise a specific sequence of nucleotides, it will be appreciated that such
sequence can include a different nucleotide at the same position, i.e., is degenerate at that position, with respect to one or more positions in the particular sequence. Degenerate bases can be represented by any suitable nomenclature, for example, that which is described in World Intellectual Property Organization Standard ST.25 (1998), Appendix 2. Typically, when a nucleic acid binding moiety ofa synthetic regulatory compound of the invention specifically binds to its target nucleotide sequence, it does so in a manner that does not compete for binding site interaction with endogenous compounds. This can be accomplished in any suitable manner, for example, by selecting a target nucleotide sequence that is adjacent or proximal to (e.g., within about 500 bases, preferably about 200 bases, and even more preferably within about 100 or fewer bases of) the binding site for a DNA-binding protein of the transcriptional machinery.
By "regulatory element" is meant any cis element of defined nucleotide sequence that can be identified in a nucleic acid molecule and which associates with a DNA-binding protein of the transcriptional machinery or a protein involved in providing chromatin structure. Such elements include promoters and enhancers. A "promoter" is the minimum sequence necessary to initiate transcription of a target gene. An "enhancer" is a cis-acting sequence that increases the utilization of a eukaryotic promoter. Preferred cis elements to be targeted by the nucleic acid binding moieties of the invention's synthetic regulatory compounds are those that occur endogenously in association with the gene whose transcription is to be regulated.
Other embodiments include chimeric reporter constructs that comprise a promoter or other regulatory element not naturally associated with a particular gene. Transcription from any promoter can be regulated by a synthetic regulatory compound. As such, promoters from which transcription can be initiated by any RNA polymerase can be targeted. Suitable RNA polymerases include, but are not limited to, RNA polymerase I (which transcribes ribosomal RNA (rRNA) in eukaryotic cells), RNA polymerase II (which transcribes messenger RNA (mRNA) in eukaryotic cells), and RNA polymerase III (which transcribes transfer RNA (tRNA) in eukaryotic cells.
SHEET (RULE 26
The interaction between the nucleic acid binding moiety can occur at different regions in the nucleic acid molecule. For example, when dsDNA in the B form contains the target nucleotide sequence, the nucleic acid binding moiety preferably interacts with the DNA via minor and/or major groove interactions. In preferred embodiments, the nucleic acid binding moiety is a molecular " scaffold that spatially arrays a plurality of hydrogen bond donors and acceptors in a manner that allows the formation of hydrogen bonds with the corresponding hydrogen bond acceptors and donors in the target nucleotide sequence when the nucleic acid binding moiety interacts with a nucleic acid molecule (e.g., dsDNA) containing the target nucleotide sequence. In addition, or alternatively, the molecular scaffold can provide moieties that allow specific electrostatic and/or van der Waals interactions between units of the scaffold and atoms in the target nucleic acid molecule. Preferred nucleic acid binding moieties that can act as such molecular scaffolds include protein nucleic acids (PNAs) and oligonucleotides. Another preferred class of nucleic acid binding moiety useful in the practice of this invention can be represented as follows: - Qi — Z\ - Q2 - Z2 - ... - Qm - Zm -, wherein each of Q , Q , ... Qm is independently selected from a heteroaromatic moiety and (CH2)P, wherein p is an integer between 1 and 10, inclusive, and is preferably 1, 2, or 3; wherein each of Z\, Z2, ..., Zm is independently selected from the group consisting ofa covalent bond and a linking group; and m is between 1 and 20, inclusive, with 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, or 16 being preferred. In certain preferred embodiments, at least one of Q , Q2, ... Qm is a heteroaromatic moiety, for example, a substituted or unsubstituted imidazole or pyrrole moiety. In particularly preferred embodiments, at least about 50%, 60%, 70%, 80%, or more of Qi, Q2, ... Qm are a heteroaromatic moieties, particularly substituted or unsubstituted imidazole or pyrrole moieties. In additional or alternative preferred embodiments, at least one of Z\, Z2, ..., Zm is a linking group having between 1 and 10, inclusive, and preferably 2, 3, 4, or 5, backbone atoms. In particularly preferred embodiments, each of Zls Z2, ..., Zm is a carboxamide group. Such nucleic acid binding moieties, wherein the vast majority of the hydrogen bond donors and acceptors are contained within moieties (e.g., heteroaromatic moieties
SUBSTITUTE SHEET (RULE 2
such as substituted or unsubstituted imidazoles or pyrroles) linked by carboxamide bonds, are referred to as polyamide nucleic acid binding moieties.
Polyamides can be designed to assume one of several alternative conformations upon base-specific interaction with a nucleic acid molecule, including hairpin, H-pin, slipped, overlapped, and cyclic conformations. Such conformations include intermolecular 2:1 binding motifs between dsDNA molecules comprising a corresponding target sequence, as well as intramolecular 2:1 binding motifs between dsDNA molecules comprising a corresponding target sequence.
Preferably, a nucleic acid binding moiety included in a synthetic regulatory compound of the invention has, under desired conditions, a binding specificity for its corresponding target nucleotide sequence of at least about two, and preferably at least about 3, 4, 5, 6, 7, 8, 9, 10, 20, 50, 100 or more, as compared to a mismatch target sequence. A "mismatch" target sequence refers to a target nucleotide sequence in which one nucleotide is different at a particular position, as compared to the target nucleotide sequence.
Preferably, a nucleic acid binding moiety included in a synthetic regulatory compound of the invention also has, under desired conditions, at least about submicromolar, and preferably at least about nanomolar or picomolar binding affinity for its target nucleotide sequence. Binding affinity can be determined by any suitable technique including, but not limited to, quantitative DNase I footprint analysis. Association constants (Ka) for preferred nucleic acid bindmg moieties are at least about 10° M"1, IO7 M' IO8 M"1, IO9 M"1, 1010 M*1, IO11 M"1, or IO12 M"1.
Other embodiments of this aspect concern synthetic regulatory compounds that comprise two or more nucleic acid binding moieties associated in a manner that allows each to retain its nucleic acid binding function, for example, through the use of a flexible linker. One representative class of synthetic regulatory compounds comprising a plurality (e.g., 2, 3, 4, or more ) of nucleic acid binding moieties include those wherein two such moieties are tethered to one another via a linker, and the linker is attached to another linker (or, alternatively, is a dendrimeric linker having three or more functional sites) that is attached to a regulatory moiety.
Another such class concerns compounds wherein the nucleic acid binding moieties are each separately attached via a linker to the regulatory moiety. Yet another class
concerns compounds wherein the nucleic acid binding moieties are tethered via a linker and one of them is also linked via a separate linker to a regulatory moiety. Many other configurations of this sort are encompassed herein. These include, but are not limited to, compounds containing multiple regulatory moieties (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, or more), and compounds containing multiple regulatory moieties and a plurality (e.g., 2, 3, 4, or more) of nucleic acid binding moieties.
Synthetic regulatory compounds that comprise a plurality of nucleic acid binding moieties typically will have greater target sequence specificity than a synthetic regulatory compound containing fewer nucleic acid binding moieties. In addition, the compounds bind to both in the minor and major grooves of dsDNA, and provide groups that can be modified by other molecules, for example, by linkers to which one or more regulatory moiety can be attached. Similarly, synthetic regulatory compounds that comprise a plurality of regulatory moieties can interact with more than one component of the transcription or chromatin structure machinery.
In order to regulate, or modulate, expression of a target gene, a synthetic regulatory compound according to the invention also contains, in addition to a nucleic acid binding moiety, a regulatory moiety. As used herein, "regulate" or "modulate" refers to an ability to alter the level of expression of a particular gene above (i.e., up-regulate or activate) or below (i.e., down-regulate or repress) the basal level of expression that would occur in the particular system (for example, an in vitro transcription system or a cell) in the absence of the compound under the same conditions. A regulatory moiety that activates transcription is referred to herein as an "activation moiety" or "activator", whereas a regulatory moiety that represses transcription is referred to as a "repressor moiety" or "repressor."
In general, a regulatory moiety is any compound that can positively or negatively effect, by either a direct mechanism (i.e., by direct interaction with one or more components of the transcription complex) or and indirect mechanism (i.e., by (i) direct interaction with a repressor protein or (ii) direct interaction with a protein involved in chromatin or nucleosome structure), transcription of a target gene, other than by direct electrostatic interaction with double-stranded DNA. By "direct interaction" is meant direct, non-covalent association between two components.
EET RULE 26)
Representative embodiments of regulatory moieties include peptides, polypeptides, lipids, carbohydrates, and any combination thereof. A "peptide" is a polymer (i.e., a linear chain of two or more identical or non-identical subunits joined by covalent bonds) made up of naturally occurring or synthetic D- or L-, or D- and L-, amino acids joined by peptide bonds. Generally, peptides contain at least two amino acid residues (i.e., the molecules resulting from the formation of a peptide bond between two amino acids, or between an amino acid residue and another amino acid) but fewer than about 50 amino acid residues. A "polypeptide" is a also a polymer of amino acid residues linked by peptide bonds, but typically contains at least about 50 amino acid residues. Thus, herein "peptide" is used to refer to a regulatory moiety that is less than about 50 amino acid residues in length, and "polypeptide" refers to larger polymers of amino acid residues linked by peptide bonds. A "lipid" is a substantially water-insoluble molecule that contains as a major constituent an aliphatic hydrocarbon. Lipids include fatty acids, neutral fats, waxes, and steroids. The hydrocarbon portions of the molecule can be of any length, can be saturated or unsaturated, and can be straight- or branched-chain. "Carbohydrate" refers to any aldehyde or ketone derivative of a polyhydric alcohol, and includes starches, sugars, celluloses, and gums.
A regulator moiety can be naturally occurring or be derived from or an analog of a naturally occurring molecule. Alternatively, it can be synthetic.
Preferred regulatory moieties are peptides and small organic molecules. "Peptide", in this context, refers to an amino acid polymer comprised of between one to about fifty, inclusive, residues of amino acids (i.e., any molecule that contains an amino group and a carboxylic group linked by peptide, or carboxamide, bonds. Such peptides can be duplicative of amino acid sequences that occur naturally (e.g., an activation or repressor domain of a transcriptional activator or repressor, respectively). Typically, such peptides are comprised of the twenty L amino acids commonly found in proteins in nature. Alternatively, they can comprise one or more D enantiomers of such amino acids, or other rarely observed amino acids. In other embodiments, such peptides are comprised of amino acids not typically found in naturally occurring proteins. Peptidic regulatory moieties can be synthesized by any suitable method, including recombinant techniques or solid state or in-solution
synthetic methods. After synthesis, they can be linked to a suitable nucleic acid binding moiety. If it is unknown whether a particular peptide possesses regulatory function, it can be screened in a suitable system. Indeed, large numbers of such peptides (as can, for example, result from combinatorially generating peptides) can be screened in high throughput formats.
In certain preferred embodiments, the regulatory moieties are small organic molecules that have been identified as having regulatory function. By "small organic molecule" is meant any water soluble organic molecule having a molecular weight of less than about 10 kDa (kilo Dalton), preferably less than about 5 kDa, more preferably less than about 2.5 kDa, even more preferably less than about 1.5 kDa, and even more preferably less than about 1 kDa.
In some embodiments, a synthetic regulatory compound comprises two, three, four, or more regulatory moieties linked to the nucleic acid binding moiety (ies). In such instances, the regulatory moieties can be the same or different, and can serve to recruit or retard the same or different transcription factors.
Another component of the synthetic regulatory compounds of the invention is a linker, which can be any molecule that can be used to link at least one nucleic acid binding moiety to at least one regulatory moiety in a manner that allows the nucleic acid binding moiety(ies) to retain its(their) intended nucleic acid binding function(s) and the regulatory moiety(ies) to retain its(their) ability to influence transcription of the target gene. The linker should provide adequate spacing between and/or orientation with respect to the nucleic acid binding moiety and the regulatory moiety so as to allow each to retain its respective function.
Linkers used in the practice of this invention can be branched, although straight chain molecules are preferred. They can be amphipathic or aliphatic, with molecules in the latter class being preferred. Typically, a linker will contain from about one to about 200 "spacing" or "backbone" moieties (e.g., -CH2-, -CH=, and -C≡). The backbone moieties preferably are heteroatoms including, but not limited to, carbon, nitrogen, oxygen, sulfur, and phosphorus, with the majority of such atoms being carbon. One or more different side chain groups can also appended to the backbone. Spacing between multiple side chain unit groups can be variable or
SUBSTITUTE SHEET (RULE 26
consistent along the backbone. Representative examples of suitable linkers include polyethylene glycol, alkyl chains, and peptides.
In particularly preferred embodiments of this aspect, the synthetic regulatory compounds are cell permeable. In other words, they are able to come into contact with and be internalized by a cell. Such cells include animal (both* prokaryotic and eukaryotic) and plant cells. Preferred animal cells include vertebrate cells, particularly arachnid, avian, fish, insect, and mammalian cells. Particularly preferred are mammalian cells, for example, bovine, avian, canine, equine, feline, human, murine, ovine, porcine, and primate cells. Preferred plant cells include those of commercially important grains (e.g., alfalfa, barley, corn, rice, soy, sorghum, wheat) and ornamental plants.
Synthetic regulatory compounds of the invention can be prepared as pharmaceutical salts by any method known in the art depending upon the intended application. A related aspect of the invention concerns compositions comprising a synthetic regulatory compound according to the invention and a carrier, for example, a pharmaceutically acceptable carrier. Such compositions can be dry or liquid formulations, and can optionally include one or more excipients, stabilizers, bulking agents, etc. Dry formulations include those that have been lyophilized or freeze dried, which formulations can be reconstituted in a solvent prior to use. Liquid formulations include aqueous formulations, oils, emulsions, and suspensions.
Compositions according to the invention can be delivered to a subject by any suitable route. For example, in the context of compositions intended for agricultural use, a composition can be applied by spraying a liquid or broadcasting a solid. For administration to animals, such compositions can be injected (e.g., via subcutaneous, intramuscular, intravenous, and other parenteral routes), inhaled, eaten, or delivered topically.
Another related aspect concerns kits containing a synthetic regulatory compound or composition according to the invention. Kits typically comprise one or more synthetic regulatory compounds of the invention in a suitable composition and packaged in an appropriate storage container, for example, a vial or ampule.
Kits often further comprise external packaging, such as a box or other container, to
protect and support the storage container containing the composition. In some embodiments, the packaging also contains directions for use, package inserts, etc.
Still another related aspect of the invention concerns complexes comprising a synthetic regulatory compound of the invention complexed with a nucleic acid molecule, e.g., a DNA, particularly a dsDNA. Such complexes can be formed, for example, by exposing a composition (for example, a cell or an in vitro transcription reaction) containing a nucleic acid (e.g., dsDNA) comprising a target nucleotide sequence to a synthetic regulatory compound according to the invention.
Thus, a related aspect concerns cells containing a synthetic regulatory compound of the invention. Such cells include animal cells and plant cells.
Preferred animal cells include avian, bovine, canine, equine, feline, fish, human, murine, ovine, porcine, and primate cells. Such cells can be in vivo or in vitro (including ex vivo).
Another aspect of the invention relates to methods for regulating expression of a target gene by exposing the target gene and its associated regulatory elements to a synthetic regulatory compound according to the invention. Such methods can be carried out in vitro and in vivo.
Still another aspect of the invention concerns the use ofa synthetic regulatory compound of the invention to prevent or treat a disease. Such methods can be accomplished by delivering an effective amount of the synthetic regulatory compound to an animal or plant. An "effective amount" refers to an amount of a synthetic regulatory compound sufficient to induce or effectuate a detectable change in transcription ofa desired gene in a cell-free in vitro system or a cell, be it in an organism or in culture. What constitutes an effective amount will depend on a variety of factors that the skilled artisan will take into account in arriving at the desired delivery regimen.
Another aspect of the invention concerns methods of screening for synthetic regulatory compounds from amongst one or more test compounds. These screening methods include both in vitro and in vivo screening methods, and can include methods involving an in vitro screen followed by an in vivo screen (e.g., a cell-based screen). Such methods can be performed, for example, by exposing, under transcription conditions, a dsDNA encoding a regulatable gene to a test compound
comprising a nucleic acid binding moiety targeted to a transcription-associated regulatory element of the regulatable gene conjugated via a linker to a regulatory moiety, and determining whether the test compound regulates expression of the regulatable gene. Preferably, such methods are performed in vitro, preferably in a high throughput format, meaning that more than about 10, preferably, more than about 100, 1,000, or 10,000 compounds are screened at once. Preferably, the regulatable gene is a marker gene, such as a gene encoding a luciferase or green fluorescent protein.
The various moieties of the invention can be synthetic or natural products. Synthetic moieties can be synthesized by solution or solid phase methods. Two or more moieties can also be synthesized together. Compounds according to the invention can be in unpurified, substantially purified, and purified forms. The compounds can be present with any additional component(s) such as a solvent, reactant, or by-product that is present during compound synthesis or purification, and any additional component(s) that is present during the use or manufacture ofa compound or that is added during formulation or compounding ofa compound. Another aspect relates to methods for synthesizing the compounds of the invention. Broadly, such methods typically comprise separately obtaining each of the nucleic acid binding moiety, the linker, and the regulatory moiety to be linked together to form the synthetic regulatory compound. Two or more of these moieties are then linked by a suitable chemistry, followed by linkage to the third moiety. The particular order of addition of moieties can vary, and its selection is within the skill of the ordinary artisan. Alternatively, the linker and regulatory moiety, or linker and nucleic acid binding moiety, can be synthesized as a single unit. In other embodiments, all three elements are synthesized together. Following synthesis, the compound, or its intermediary compounds in intermediary steps, is preferably purified.
The above summary of the invention is not limiting and other features and advantages of the invention will be apparent from the following Brief Description of the Figures and Detailed Description, as well as from the appended Claims and Abstract. BRIEF DESCRIPTION OF THE FIGURES
Figure 1 illustrates a double-stranded DNA (dsDNA) molecule comprising an inverted tandem repeat of the target nucleotide sequence of an eight ring hairpin polyamide molecule (SEQ ID NO: 1). In the polyamide portion of the synthetic regulatory compound, imidazole moieties are represented by blackened circles and pyrroles are represented by open circles. Diamonds represent β-alanine. Amino acids are represented by the standard one letter code. This nomenclature is used consistently throughout the specification unless otherwise expressly indicated. Two synthetic regulatory compounds, each comprised of the same components, are illustrated as being bound to their respective target sites in the dsDNA. Linked to each polyamide through a linker is one of three activator peptides. The compounds are designated 7 (SEQ ID NO: 2), 8, and 9 (SEQ ID NO: 3). In compound 7, the linker comprises ofa Cys residue linked to an XL peptide. In compound 8, the linker is larger and contains a Cys residue at its amino-terminus, and an XL peptide at its carboxy-terminus. Compound 9 comprises a 28 amino acid residue peptide, additionally including a Cys residue at the N-terminus and a XL peptide at its C- terminus.
Figure 2 contains 3 panels, A, B, and C. Panels A and B show dsDNA comprising inverted tandem repeats of target nucleotide sequences for a specific polyamide. In panel A, the polyamide has an activating peptide (SEQ ID NOs: 4 and 5) attached to its C-terminus via a linker. In panel B, the polyamide has the linker-activating peptide conjugate attached via the C-terminal pyrrole residue. In such circumstances, the polyamide contains conventional C-terminal moieties, for example, β-Dp. Panel C depicts a single hairpin polyamide to which two linker- activating peptide domains are attached via internal pyrrole moieties and SEQ ID NOs: 6 and 7). Also described are compounds 10-20, specifically the portions of the compounds comprising the linker and activating domains.
Figure 3 is a graphic representation of the template used for in vitro transcription assays. As shown, the template contains three inverted tandem repeat target nucleotide sequences (hatched region) to be targeted by nucleic acid binding moiety ofa synthetic regulatory compound. The 3 '-terminus of the downstream most of these sequences is 50 based pairs upstream of the adenovirus major late
promoter TATA box that drives the expression of a "g-less" transcript. In each inverted tandem repeat sequence, the number of nucleotides between the target nucleotide sequences is 3, 5, or 7.
Figure 4 depicts a graphic illustration ofa double stranded nucleic acid molecule (SEQ ID NOs: 8 and 9) containing an inverted tandem repeat target nucleotide sequence for an eight ring polyamide which provides DNA sequence target specificity. Also depicted are the chemical structures of N-methylpyrrole and N-methylimidazole sub-units that may be incorporated into nucleic acid targeting moieties, e.g., polyamides, for use in conjunction with the present invention. Also shown are two representative linker molecules that may be used to join nucleic acid binding moiety and a regulatory moiety in order to generate a synthetic regulatory compound according to the invention. The figure also lists the compounds that were generated upon the ligation of the respective peptides to the polyamide.
Figure 5 depicts synthesis of synthetic activators. The synthesis of polyamide 1 was accomplished according to established protocols, (a) Treatment of compound 1 with thiolane-2,5-dione followed by benzyl bromide provided thioester compound 2 in good yield (53%). (b) Combination of thioester-containing compound 2 with each peptide (SEQ ID NOs: 10 and 11) in denaturing buffer then provided the targeted conjugates via the native ligation reaction. Figure 6 depicts substitution of the dimerization module with a flexible ethylene gly col-derived linker. The synthesis of hairpin polyamides 6 and 7 and compounds 8 and 9 was carried out as described.
Figure 7 illustrates a synthetic regulatory compound comprising a hairpin polyamide (linked head to tail by a short linker (e.g., γ-aminobutyric acid)) attached to a linker domain (LD) to which an activation domain (AD) is attached. The polyamide is associated with the minor groove of a DNA double helix via its DNA recognition domain. Also illustrated is a dsDNA comprising a target nucleotide sequence with which is associated a hairpin polyamide targeted to that sequence. The chemical structure of pyrrole moieties (open circles) are shown, as is the structure of an imidazole residue (shaded circle).
Figure 8 depicts compound 3 (PA-Gcn4-AH) bound to its cognate palindromic DNA site and activated transcription in vitro when its predetermined
DNA binding sites are present (SEQ ID NOs: 12 and 13). (A) (Upper) Storage phosphor autoradiogram ofa quantitative DNase I footprinting titration of compound 3 on the 3- P-labeled 271-bp pPT7 EcoRI/Evι/11 restriction fragment carried out according to established protocols. Pre-equilibration of compound 3 with the DNA fragment was carried out for 75 min., before initiation of the cleavage reactions. From left to right the lanes are: the A sequencing lane; DNase I digestion products in the presence of compound 3 at concentrations of 100 nM, 50 nM, 25 nM, 10 nM, 5 nM, 2.5 nM, 1 nM, 0.5 nM, and 0.25 nM, respectively; DNase I digestion products with no compound 3 present; undigested DNA. (Lower) Data for compound 3 in complex with the 19-bp palindromic site. The curve through the data points is the best-fit cooperative Langmuir binding titration isotherm (n∑) obtained from a nonlinear least-squares algorithm. (B) An in vitro transcription reaction containing PA-Gcn4-AH (compound 3) at 200 nM shows enhanced expression of a 277-nt transcript relative to basal levels whereas a reaction containing conjugate 4, lacking the activating region, does not. Inclusion of polyamide 1 (lane 2) in the reaction did not impair basal transcription (lane 1) match template (SΕQ ID NO: 14), mismatch template (SΕQ ID NO: 15). The variation in transcript position for lane 4 was found to be caused by curvature of the gel. (C) In vitro transcription reactions containing 3 (PA-Gcn4-AH) with templates bearing either the cognate palindromic binding sites (match template) or palindromic sites in which a G:C base pair has replaced a T:A base pair in each half site (mismatch template) upstream of the core promoter. The concentrations of compound 3 used were 0 (basal), 10 nM, 100 nM, and 500 nM.
Figure 9 Depicts dependence of activation level upon time and activating region. (A) (Upper) Storage phosphor autoradiogram showing in vitro transcription time course experiment with compounds 3 and 4 present at 300 nM concentration. Aliquots were processed at 10, 20, 30, 40, and 60 min. (Lower) Comparison of the amount of transcript obtained at each time point relative to basal transcription levels in which no conjugate was present (fold activation). (B) (Upper) Storage phosphor autoradiogram showing the effect of increasing concentrations of untethered 21-aa AH peptide on transcription reactions containing either compounds 4 or 3 at 300 nM concentration. Transcription reactions were performed for 30 min, in the presence
UTΕ SHEET (RULE 26)
of 0, 0.2 :M, 2 :M, and 10 :M concentrations of AH peptide. (Lower) For each reaction, the amount of transcript obtained was compared with basal transcription levels to give the respective fold activation value. These values are displayed as percent activation compared with the results from the reaction containing compound 3 (lane 5), which is defined as 100%.
Figure 10 depicts the activation levels for compounds 3, 5, 8, and 9 were determined by comparison with the amount of transcript obtained from reactions containing the relevant parent hairpin polyamides. A) The fold activation values thus obtained are displayed as percentages relative to the fold activation mediated by compound 3, defined as 100%. (B) Data from DNase I footprinting titrations with compounds 5 and 8. The curve through each data set is the best-fit Langmuir binding titration isotherm (n = 1) obtained from a nonlinear least-squares algorithm. Figure 11 describes the structure of three polyamides, 1, 5, 9. With respect to compounds 1 and 5, they are separately derivatized with either of three peptides, designated AH, VP2 (SEQ ID NO: 16), and VPl (SEQ ID NO: 17), the amino acid sequences of which are shown and the corresponding conjugate is numbered. In contrast, compound 9 shows a hairpin polyamide linked to a VP2 peptide through a linker attached to the C-terminal-most pyrrole of the polyamide.
Figure 12 shows the ability of various conjugates to activate transcription in vitro. Background levels of transcription in the absence of any polyamides are shown in lane 1. Addition of the polyamide alone does not alter the levels of transcription appreciably (lane 2). Lane 3 shows the levels of transcription elicited by conjugate 2; lane 4 shows activation elicited by conjugate 4 and lane 5 demonstrates the activity of conjugate 3. Lane 6 shows activation elicited by conjugate 9. Lane 7 represents the activation due to conjugate 6; lane 8 corresponds to conjugate 8 and lane 9 is due to conjugate 7. Reactions in lanes 1-9 were performed with on a template bearing three cognate palindromic sites (first described in Figure 4) whereas lanes 10-18 were performed in the same order as 1-9 except a template bearing non-cognate sites upstream of the promoter was used. In each of the reactions 400nM concentrations of the relevant conjugate were used.
Figure 13 shows the structures of compounds 10 and 11, as well as a dsDNA molecule (SEQ ID NOs: 18 and 19) containing a target nucleotide sequence for the
polyamide, wherein the polyamide is attached via the PEG linker (shaded hexagon) to the VP2 regulatory peptide. These conjugates recognize a different dsDNA sequence.
Figure 14 shows the modular nature of the synthetic regulator. In vitro transcription reactions show that substituting the imidazole polyamide (conjugate" 1 of Figure 11) for that described in Figure 13 (conjugate 10) alters the template recognition properties in a specific manner. Lanes 1-7 represent reactions performed with the template designed to bind polyamide 10 and its various conjugates (#11) and lanes 8-14 show reactions which were performed with template 1 described in Figure 4. Lanes 2 & 3 using template 2 and lanes 9&10 on template 1 show the inability of either polyamide (at 400nM) to influence transcription levels. Lanes 4&5 on template 2 and lanes 13 &14 on template 1 show that the corresponding polyamide conjugate elicits transcription on the appropriate template. Lanes 6&7 for conjugate 7 and lanes 11& 12 for conjugate 11 show that on the non- conjugate sites the regulatory moiety bearing conjugates do not elicit transcription above background levels. DETAILED DESCRIPTION OF THE INVENTION
The synthetic regulatory compounds of the invention represent a novel class of nucleic acid binding ligands in which a nucleic acid binding moiety is tethered through a linker to a second functional moiety, viz, a regulatory moiety (e.g., an activator), that interacts directly with elements of the endogenous transcriptional machinery or indirectly through interaction with a repressor protein or components involved in chromatin structure (e.g., nucleosomes). Cell-permeable members of this class can be targeted to designated sites in the genome, and can be used, for example, as tools to study mechanistic aspects of transcriptional regulation and to correct the ectopic gene expression that often occurs in disease.
The invention now will be discussed with reference to particular preferred embodiments, that, for convenience, will be in the context of dsDNA as the nucleic acid, but it is to be understood that the invention is not limited to such context and can be applicable to other nucleic acid, i.e., single-stranded DNA or single- or double-stranded ribonucleic acid. •Synthetic Regulatory Compounds
Synthetic regulatory compounds of this invention are now discussed in greater detail, especially with reference to preferred embodiments thereof. 1. Nucleic Acid Binding Moieties. One component of the synthetic regulatory compounds of the invention concerns nucleic acid binding moieties. In the context of this invention, a nucleic acid binding moiety is any compound or chemical that can provide site-specific recognition of a target nucleotide sequence in a nucleic acid molecule preferably a dsDNA. In preferred embodiments, such specificity is provided by an ability to recognize specific based pairs within the major groove and/or minor groove of a double-stranded nucleic acid molecule. Interactions of this sort are typically mediated by electrostatic forces, hydrogen bonding, van der Waals forces, and steric considerations. In natural systems, such binding specificity is typically mediated by peptides that are comprised within protein-based transcription factors and other proteins that bind to nucleic acids within cells and viruses. Herein, such peptides are referred to as "natural DNA binding ligands," and are not within the scope of the instant invention as it relates to embodiments that comprise but one nucleic acid binding moiety. However, in embodiments that comprise two or more nucleic acid binding moieties, one or more such natural peptides can be included in the synthetic regulatory compound, provided that at least one of such nucleic acid binding moieties is a non-natural nucleic acid binding moiety. Thus, any such peptides, whether now known or later developed, can be included in synthetic regulatory compounds that comprise a plurality of nucleic acid binding elements.
Below, various preferred embodiments of nucleic acid binding moieties useful in the practice of the present invention are provided. A. Molecular Scaffolds
In certain preferred embodiments of the invention, the nucleic acid binding moiety comprises a molecular scaffold. A molecular scaffold is any compound that spatially arises a plurality of hydrogen bond donors and acceptors in a manner that allows the formation of hydrogen bonds with the corresponding hydrogen bond acceptors and donors in a target nucleotide sequence in a nucleic acid molecule. The structures of single- and double-stranded nucleic acid molecules at high resolution, such as the three-dimensional coordinates of each individual atom within the nucleic
acid molecule are known to a resolution of less than about 2 A. Such structures can be determined by techniques known in the art, for example, x-ray crystallography or NMR spectroscopy. Accordingly, a high-resolution model of any particular nucleotide sequence within a nucleic acid molecule can be determined and used for modeling purposes. From such models, it is possible to deduce the positions of ~ " various hydrogen bond donor and acceptor elements within a nucleic acid molecule. The distance ranges over which such interactions occur are also known in the art, as are such disinteractions for other forces involved in nucleic acid molecule/nucleic acid binding ligand interactions. From such data, it is possible to design synthetic ligands that correspondingly array hydrogen bond acceptors and donors in a manner that allows formation of hydrogen bonding upon interaction of the particular ligand with a particular nucleotide sequence in such nucleic acid molecule. This invention encompasses designing a molecular scaffold directed toward a particular target nucleotide sequence. Such molecules can then be screened in vitro and/or in vivo to ascertain if the desired interaction occurs. Those molecules exhibiting the desired interaction, particularly those that do so at submacromolar and preferably subnanomolar binding affinities can be selected for further use. Alternatively, such compounds can be used as a basis for lead optimization in order to generate additional analogs that exhibit the desired interactions. Preferably, any such molecule selected for further use will also exhibit cell-permeability.
In yet other embodiments, the nucleic acid binding moiety comprises the structure h- Ql— Zl— Q2— Zz — . . . — Qm— Zm \ where each of Qi, Q2, ... , Qm, is a heteroaromatic moiety or (CH )P (where p is an integer between 1 and 3, inclusive); each of Zi, Z , ..., Zm is a covalent bond or a linking group; and m is an integer between 1 and 9 (preferably between 2 and 4), inclusive. Where Q is a heteroaromatic moiety, it is preferably selected from optionally substituted imidazole, pyrrole, pyrazole, furan, isothiazole, oxazole, isoxazole, thiazole, thiophene, furazan, 1,2,3-thiadiazole, 1,2,4-thiadiazole, 1,2,5- thiadiazole, 1,3,4-thiadiazole, 1,2,3-triazole, 1,2,4-triazole, 1,3,4-oxadiazole, 1,2,4-
oxadiazole, and thiophene moieties. Exemplary substituents include Cl, F, CH3 (e.g., as in N-methylpyrrole or N-methylimidazole), hydroxy (e.g., as in 3- hydroxypyrrole).
Linking groups Zj, Z2, ... Zm, can be between 2 and 5 (preferably 2) backbone atoms long. Exemplary linking groups include carboxamide, amidine' and ester groups, with carboxamide groups being preferred.
Any suitable synthetic method can be used to link the various elements of the compounds of this invention. In addition to those described in the examples furt er below, other suitable methods now known in the art or later developed can be used. B. PNAs
Protein nucleic acids (PNAs) are analogs of DNA in which the backbone is structurally homomorphous with a deoxyribose backbone, except that the backbone linkages are peptide bonds rather than phosphate esters. The PNA backbone comprises N-(2-aminoethyl)glycine units to which the nucleobases are attached. PNAs containing all four natural nucleobases hybridize to complementary oligonucleotides according to Watson-Crick base-pairing rules, and thus represent true DNA mimics in terms of base-pair recognition. Egholm et al. (1993) Nature 65:566-568. Since a PNA backbone is uncharged, PNA/DNA and PNA/RNA duplexes exhibit greater thermal stability, as compared to DNA/DNA, DNA/RNA, or RNA/RNA duplexes. PNAs have the additional advantage in not being recognized by nucleases or proteases. In addition, PNAs can be synthesized on an automated solid state synthesizer using standard t-Boc chemistry. The design and synthesis of PNA molecules are described, for example, in U.S. Patent Nos. 5,539,083; 5,864,010; 5,977,296; and 5,985,563. PNA-based non-natural nucleic acid binding moieties can be designed, synthesized, and incorporated into compounds according to the invention. Chemistries suitable for attaching linkers (or linkers already conjugated to one or regulatory moieties) to such PNAs are known in the art and can be used for such purposes. C. Triplex-Forming Oligonucleotides
Other preferred embodiments of the invention concern synthetic regulatory compounds wherein the nucleic acid binding moiety is an oligonucleotide capable of
base-pair specific recognition with a double-stranded nucleic acid, such that the oligonucleotide can form a triple helix. Oligonucleotide-directed triple helix formation is one of the most effective methods for accomplishing the sequence specific recognition of double helical DNA. See, e.g., U.S. Patent No. 5,847,555; Moser et al. (1987) Science 238:645; Le Doan et al. (1987) NuclTAcids Res. "" 15:7749; Maher et al. (1989) Science 245:725; Beal et al. (1991) Science 251 J360; Strobel et al. (1991) Science 254:1639; and Maher et al. (1992) Biochem. 31:70. Triple helices form as the result of hydrogen bonding between bases in a third strand of DNA and duplex base pairs in the double stranded DNA, via Hoogsteen base pairs.
Briefly, such oligonucleotides typically comprise between about 10 to about 200, preferably about 10 to about 50, nucleotides. Such oligonucleotides can be synthesized by any suitable method, for example, by conventional automated solid state techniques. Typically, nucleotide sequences targeted by triplex-forming oligonucleotides comprise purine rich tracts on one of the strands of the double- stranded, double-helical nucleic acid. The triple helix so formed contains the oligonucleotide bound in either a parallel or anti-parallel orientation with respect to the target sequence depending on the nucleotide sequences used in the oligonucleotide. A parallel orientation occurs when the oligonucleotide is a pyrimidine-rich oligonucleotide. In particular, the pyrimidine-rich oligonucleotide contains a thymine containing nucleotide (T) when the nucleotide at the complementary position in the purine-rich target sequence is an adenosine containing nucleotide (A) and a cytosine containing nucleotide (C) when the nucleotide at the complementary position in the purine-rich target sequence is a guanine containing nucleotide (G). An anti-parallel orientation occurs when a purine-rich oligonucleotide is used. In particular, anti-parallel orientation is obtained when the purine-rich oligonucleotide contains a guanine containing nucleotide (G) when the nucleotide at the complementary position in the purine-rich target sequence is a guanine containing nucleotide (G) and an adenosine containing nucleotide (A) when the nucleotide at the complementary position in the purine-rich target sequence is an adenosine containing nucleotide (A).
Synthetic triple helix-forming oligonucleotides can also bind in an anti- parallel orientation to a purine-rich target sequence. Such triple helix-forming oligonucleotides contain a G when the nucleotide in the complementary position of the purine-rich target sequence is G, T or A when the nucleotide in the complementary position in the purine-rich target sequence is an A and the nucleotide nebularine when the complementary position in the purine-rich target sequence is C. Because such moieties are capable of targeting specific nucleotide sequences in double-helical nucleic acids, they represent suitable nucleic acid moieties for use in developing candidate synthetic regulatory compounds. Preferably, such oligonucleotides (as with other nucleic acid binding moieties of the invention) will be designed to target a sequence that will allow a regulatory moiety attached thereto through a linker to be brought into proximity of the promoter of the gene, the expression of which is desired to be regulated. Using the nucleotide sequence of the region proximal to the target gene's promoter, one or more oligonucleotides can be designed that will possess the desired triplex-forming function.
The oligonucleotides used in the invention to form triple helices can be made synthetically by well-known synthetic techniques to contain a structure corresponding to the naturally occurring polyribonucleic or polydeoxyribonucleic acids. See, e.g., U.S. Patent No. 5,847,555. Alternatively, the phosphoribose backbone of such oligonucleotides can be modified such that the thus formed oligonucleotide has greater chemical and/or biological stability. Biological stability of the oligonucleotide is desirable when the oligonucleotides are used in vivo. Such modified oligonucleotides can be synthesized with a structure that is stable under physiological conditions, including enhanced resistance to nuclease degradation. Further, when used in vivo, such nucleotides preferably have a minimal length that permits targeted triple helix formation so as to facilitate the transport of the oligonucleotide across the membranes of the cytoplasm and nucleus.
These and other nucleic acid binding moieties intended for use in a synthetic regulatory compound of the invention are preferably tested against the target nucleotide sequence to identify those which bind with the highest affinity and greatest specificity. Binding affinity can be assessed by any suitable method, for
example, DNase I footprinting. Specificity can be determined by using nucleic acids containing target sequences and others containing nucleotide sequences that differ from the target sequence by one or more nucleotides. Preferably, such compounds have binding affinities (in terms of association constants) of at least about IO6 M"1, more preferably at least about IO9 M"1, and even more preferably, at least about 1010 M"1, 1011 M"1, IO12 M"1, or higher. Specifities of least about 2, preferably at least about 3, 4, or 5, even more preferably at least about 10, 50, 100, or more, for target versus single base pair "mismatch" sites are preferred.
Such molecules can then serve as the basis for the synthesis of the synthetic regulatory compounds according to the invention by attaching one or more regulatory moieties thereto through one or more linkers. The resulting compounds are then preferably tested in cell-based assays (e.g., where the cells carry a reporter construct that comprises a reporter gene under the control of a promoter region that is targeted by the nucleic acid binding moiety of the compound) to determine if the desired regulatory function is achieved by the particular synthetic regulatory compound. If so, the compound typically will represent a lead compound which can then be used to develop potential therapeutic or prophylactic drugs or other compounds that can be administered to cells to achieve the desired regulatory function. D. Polyamides
As described above, the nucleic acid binding moiety includes those that are dsDNA intercalators, dsDNA minor groove binding moieties, and dsDNA major groove binding moieties. It is to be understood that, where the nucleic acid binding moiety is referred to as a "minor groove binder" (or words to that effect), it does not mean that such moiety has binding interactions exclusively with the minor groove; the moiety also can have binding interactions with other parts of the dsDNA, for example, with adjacent base pairs by intercalation, with backbone phosphate groups, or with the major groove.
In certain preferred embodiments, the nucleic acid binding moiety is a minor groove binder, which typically (but not necessarily) has an elongate crescent shape, topologically complementary to the shape of the minor groove. The minor groove binder can be a residue of a naturally-occurring compound, such as doxorubicin,
daunomycin, anthramycin, calicheamycin, mitomycin, duocarmycin, distarnycin, and netropsin, or an analog or a derivative thereof. Alternatively, the nucleic acid binding moiety can be a residue ofa synthetic minor groove binder.
In particularly preferred embodiments, the nucleic acid binding moiety is a synthetic polyamide unit comprising N-methylpyrrole carboxamide ("Py") units and optionally one or more of N-methylimidazole carboxamide ("Im"), N-methyl-3- hydroxypyrrole carboxamide ("Hp"), glycine carboxamide, P-alanine carboxamide, γ-aminobutyric acid carboxamide, 5 -amino valeric acid carboxamide, and γ-2,4- diaminobutyric acid carboxamide units. Such synthetic polyamides are minor groove binders, binding with high binding constants, often greater than 10 M" . The design and synthesis of such polyamides are described for instance in Baird et al. (1996) J. Am. Chem. Soc. 118:6141 6146; and U.S. Applications 08/607,078 (filed Feb. 26, 1996); 09/374,702 (filed Aug. 12, 1999); 09/372,473 (filed Aug. 11, 1999); 09/372,474 (filed Aug. 11, 1999); 09/414,611 (filed Oct. 8, 1999); and 60/115,232 (filed Jan. 6, 1999, the benefit and priority of which is claimed by international application PCT/US00/00298, entitled "Compositions and Methods Relating to Cyclic Compounds that Undergo Nucleotide Base Pair Specific Interactions with Double Stranded Nucleic Acids"), the disclosures of which are incorporated herein by reference. It has been further discovered that such polyamides can bind to dsDNA with two heteroaromatic carboxamide moieties fitting side-by-side within the minor groove and that such side-by-side heteroaromatic carboxamide pairs recognize specific dsDNA base pairs, giving rise to a set of "pairing rules" correlating heteroaromatic carboxamide pairs and the DNA base pairs recognized:
Heteroaromatic Pair dsDNA Base Pair(s) Recognized
Im/Py G/C
Py Im C/G
Py/Py A/T, T/A
Hp/Py T/A
Py/Hp A T
Where it is desired to synthesize a polyamide-based nucleic acid binding moiety that binds to dsDNA with specificity for particular base pair sequences, resort can be had to the above pairing rules.
Glycyl or β-alanyl carboxamides can serve as "spacer" groups for adjusting the position of the heteroaromatic carboxamide residues in relation to the nucleotide base pairs of a polyamide-based nucleic acid binding moiety's binding site. A γ- aminobutyric acid carboxamide, 5-aminovaleric acid carboxamide, or γ-2,4- diaminobutyric acid carboxamide unit (or other moieties that produce a substantially equivalent structural effect) provides the potential for formation of a "hairpin" conformation, wherein some or all of the heteroaromatic carboxamide units from a portion of the polyamide to one side of the turn-providing moiety bind side-by-side to the heteroaromatic carboxamide units from the portion the polyamide to the other side of the turn-providing moiety. See Figure 2C for a representative diagrammatic illustration of polyamide hairpin conformation. Hairpin polyamides are capable of targeting predetermined DNA sequences with affinities and specificities comparable to DNA binding proteins in accordance with a simple set of pairing rules dictated by the side-by-side binding of the aromatic amino acids. Mrksich et al. (1992) Proc. Natl. Acad. Sci. USA 89:7586-7590; Wade et al. (1992) J. Am. Chem. Soc. 114:8784-8794; and Trauger et al. (1996) Nature 382:559-561. These synthetic DNA binding ligands are cell permeable, and one such compound was shown to specifically interfere with gene expression in mammalian cell culture. Gottesfeld et al. (1997) Nature 387:202-205; and Dickinson et al. (1998) Proc. Natl. Acad. Sci. USA 95:12890-12895.
When fewer than all of the heteroaromatic carboxamide units from one end of the polyamide associate side-by-side with heteroaromatic carboxamide units from the other side of the polyamide, the unpaired heteroaromatic carboxamide units from one polyamide can be available to form cooperative side-by-side pairings with unpaired heteroaromatic carboxamide units from another polyamide, for example another hairpin or straight-chain polyamide. Such cooperative interaction can serve to increase DNA binding specificity. Use of two turn-providing (or other non- recognition moieties), for example, at each end of the nucleic acid binding moiety or
at one end and at an internal position, allows the formation of nucleic acid binding moieties having other conformations (e.g., cyclic or "H-pin" conformations, respectively). The 2-amino group of γ-2,4-diaminobutyric acid provides, among other locations, an attachment point for tandem-linked polyamide units, as well as providing a moiety that can be used to introduce chirality into the polyamide. A Py, Hp, or Py equivalent heteroaromatic carboxamide can be replaced with a β carboxamide to form pairs such as β/β β/Py, or Py/β. These and other molecular design principles disclosed in the aforementioned references can be used in the design of preferred examples of polyamide-based nucleic acid binding moieties of the synthetic regulatory compounds of this invention.
Compounds of this invention are useful because they are strong nucleic acid binders, often as nanobinders (i.e., association constant (Ka) of IO9 M"1) or even as picobinders (Ka of 1012 M"1). It is especially noteworthy that some compounds of the invention are nanobinders while having relatively few heteroaromatic moieties (3-5), while previously described nanobinders have generally required a larger number of heteroaromatic moieties.
Additionally, compounds of this invention will have anti-fungal (e.g., yeast, filamentous fungi) and/or anti-bacterial (Gram-positive, Gram-negative, aerobic, anaerobic) properties and therefore can be used for combating (i.e., preventing and/or treating) infections by such pathogens. Other pathogens against which compounds of this invention can be used include protozoa and viruses. For human anti-infective applications, a compound of this invention can be used in combination with a pharmaceutically acceptable carrier. The composition can be dry, or it can be a solution. Treatment can be reactive, for example, to combat an existing infection, or prophylactic, for preventing infection in an organism susceptible to infection. Host organisms that can be treated include eukaryotic organisms, in particular plants and animals. The plant can be an agriculturally important crop, such as wheat, rice, corn, soybean, sorghum, and alfalfa. Animals of interest include mammals such as bovines, canine, equines, felines, ovines, porcines, and primates (including humans).
SUBSTrrϋTE SHEET (RULE 26)
While not wishing to be bound by any particular theory, it is believed that the synthetic regulatory compounds of this invention derive their biological activity by binding to double stranded nucleic acid, in particular double stranded DNA, and recruiting (in the case of activators) transcriptional machinery to the gene to be expressed or, in the case of repressors, inhibiting such recruitment
The matching ofa synthetic regulatory compound of this invention against a particular gene can be accomplished by rational design if the desired target dsDNA base pair sequence - e.g., a sequence in a gene (particularly a regulatory region thereof, e.g., a promoter, enhancer) is known. In such circumstances, a nucleic acid binding moiety that binds to the target base pair sequence with the desired degree of specificity is preferably used. The NABM can be a residue of a naturally occurring dsDNA binder with known specificity for the target sequence, or can be a synthetic dsDNA binder synthesized according to the base pair recognition rules discussed hereinabove. Alternatively, the matching can be accomplished by a suitable screening method. i. Polyamide Synthesis.
There are two basic methods for synthesizing peptides and polyamides: the chemistry is either carried out in solution (solution phase) or on a solid support (solid phase). A major disadvantage of solution phase synthesis is the poor solubility of protected intermediates in organic solvents. Additionally, solution phase synthesis requires difficult purification methods. Solid phase synthesis avoids these problems, and thus is the preferred method in synthesizing peptides and polyamides.
U.S. Patent Nos. 6,090,947 and 5,998,140 detail solid state synthetic processes for use in synthesizing polyamides useful in the practice of this invention. Briefly, to make polyamides, such methods involve providing a solid support, preferably a polystyrene resin, for the stepwise addition of amino acids. To begin, the appropriate amino acid monomer or dimer is protected at its amino (NH2) group with a Boc-group or an Fmoc-group and activated at the carboxylic acid (COOH) group by formation of an -OBt ester. The protected and activated amino acids are then sequentially added to the solid support, beginning with the carboxy terminal
amino acid. When the desired polyamide has been prepared, the amino acids are deprotected and the peptide is cleaved from the resin and purified.
See also application no. PCT/US97/12722, filed 7/21/97; application no. PCT/US00/00298, filed 01/06/00, publication no. WO 00/40605, publication date 07/13/00; application no. PCT/US98/01714, filed 01/29/98, publication no. WO~ 98/37067, publication date 08/27/98; application no. 09/372,474, filed 08/11/99; application no. PCT/US98/06997, filed 04/08/98, publication no. WO 98/49142, publication date 11/05/98; application no. 08/837,524, filed 04/21/97; application no. 09/181,306, filed 10/28/98; application no. 09/374,702, filed 08/12/99; application no. PCT/US 98/03829, filed 01/29/98, publication no. WO 98/45284, publication date 10/15/98; application no. 08/853,522, filed 05/08/97; application no. 09/372,473, filed 08/11/99; application no. PCT/US98/01006, filed 01/21/98, publication no. WO 98/37066, publication date 08/27/98; application no. PCT/US98/02684, filed 02/13/98, publication no. WO 98/37087, publication date 08/27/98; application no. 09/374,704, filed 08/12/99; application no. 8/853,525, filed 05/08/97; application no. 09/367,513, filed 02/11/98; application no. PCT/US98/02444, filed 02/11/98; application no. 09/434,290, filed 11/05/99; application no. 09/359,921, filed 07/22/99; application no. 09/360,840, filed 07/22/99; application no. 09/360,344, filed 07/22/99; application no. 09/360,345, filed 07/22/99; application no. PCT/US99/20971 , filed 09/10/99, publication no. WO 00/15242, publication date 03/23/00; application no. PCT/US99/20489, filed 09/10/99, publication no. WO 00/15773, publication date 03/23/00; application no. 09/414,611, filed 10/08/99; application no. 60/161,545, filed 10/26/99; application no. 09/479,279, filed 1/6/00; application no. 60/178,821, filed 01/28/00. ii. Exemplary protocol for DNasel footprint titration experiments
All reactions were executed in a total volume of 400 μL. A polyamide stock solution or H20 (for reference lanes) was added to an assay buffer containing 3'- P radiolabeled restriction fragment (20,000 cpm), affording final solution conditions of 10 mM Tris.HCl, 10 mM KC1, 10 mM MgCl2, 5 mM CaCl2, pH 7.0. The solutions were allowed to equilibrate for at least 12 hours at 22°C. Footprinting reactions were initiated by the addition of 10 μL of a stock solution of DNase I (at the
appropriate concentration to give -55% intact DNA) containing 1 mM dithiothreitol and allowed to proceed for 7 minutes at 22°C. The reactions were stopped by the addition of 50 μL ofa solution containing 2.25 M NaCl, 150 mM EDTA, 23 μM base pair calf thymus DNA, and 0.6 mg/ml glycogen, and ethanol precipitated. The reactions were resuspended in 1 x TBE/ 80% formamide loading buffer, denatured by heating at 85°C for 15 minutes, and placed on ice. The reaction products were separated by electrophoresis on an 8% polyacrylamide gel (5% crosslinking, 7 M urea) in 1 x TBE at 2000 V for 1.5 h. Gels were dried on a slab dryer and then exposed to a storage phosphor screen at 22°C. iii. Quantitative DNase I Footprint Titration Data Analysis.
Background-corrected volume integration of rectangles encompassing the footprint sites and a reference site at which DNase I reactivity was invariant across the titration generated values for the site intensities (Isue,) and the reference intensity (Iref). The apparent fractional occupancy (0app) of the sites were calculated using the equation:
/ site // ref θ app = 1 -
/ site0// ref °
(1)
where ISjte 0 and Iref° are the site and reference intensities, respectively, from a DNase I control lane to which no polyamide was added.
The ([L]tot, 0app) data were fit to a Langmuir binding isotherm (eq. 2, n=l) by minimizing the difference between 0app and 0flt, using the modified Hill equation:
where [Ltot]is the total polyamide concentration, Ka is the equilibrium association constant, and 0m;n and 0max, are the experimentally determined site saturation values when the site is unoccupied or saturated, respectively. The data were fit using a nonlinear least-squares fitting procedure of KaleidaGraph software (v. 3.0.1 ,
UBSτiTUTE SHEET (RULE 26)
Abelbeck Software) with Ka, 0maχ, and 0m.n, as the adjustable parameters. The goodness of fit of the binding curve to the data points is evaluated by the correlation coefficient, with R > 0.97 as the criterion for an acceptable fit. All lanes from a gel were used unless a visual inspection revealed a data point to be obviously flawed relative to neighboring points. The data were normalized using the following " equation:
. θ app - θ min θ norm = θ max - θ min
(3) 2. Regulatory Moieties
A. Overview of Transcription
The major players in the regulation of gene expression within the nucleus are: the genes and their regulatory sequences that are complexed with structural proteins (e.g., histones) in chromatin; chromatin remodeling activities that allow access to a gene and its regulatory regions; regulatory proteins that instruct the transcription machinery to express (or, as in the case of repressors, prevent the expression of) the relevant genes; and the RNA-synthesizing machinery that decodes the genes. A host of other activities play a role in this process, for instance, those that facilitate elongation of paused transcripts, or those that lead to the processing of nascent transcripts and those that play a role in release of full-length transcripts.
The primary players and the events that lead to regulation of gene expression are described below. Other components involved in gene expression, such as mRNA elongation, processing, termination, or nuclear export, can also be targeted by designing synthetic regulators based on the synthetic regulatory compound motif presented herein.
Activators: Positive regulation (stimulation) of gene expression requires factors called transcriptional activators. An economical 'recruitment' model posits that activator proteins bind to DNA and recruit the transcriptional machinery to the promoter of the gene, thereby stimulating gene expression. Most activators comprise three functional modules. Of these, specificity in targeting genes is achieved by the DNA recognition module which binds to cognate DNA sequences
near a promoter of a gene and in most cases DNA binding specificity is further enhanced by dimerization. A key functional module, the activating region, is thought to bind one or more components of the transcriptional machinery. While not wishing to be bound to a particular theory, it is believed that weak interactions between an activating region and several components of the machinery result in high avidity 'multi-dentate' binding. In addition, the typical activating region (e.g., those used here) are also believed to contact and recruit nucleosome modifying activities to promoters.
Repressors: These proteins appear to function to inhibit gene expression at several levels. Some repressors function in part by blocking the activity of activators directly, for example, by binding to an activation domain on an activating protein in order to prevent its interaction with a component of the transcriptional machinery. Another example includes MDM-2, which not only binds to the activating region of p53, but also indirectly attenuates transcriptional activity by stimulating p53 's degradation via a proteolytic pathway. More recently it has been proposed that repressors are recruited to promoters where they serve to inhibit the ability of transcriptional machinery to utilize the proximal promoter by either directly interacting with the machinery and inactivating it, or indirectly by mediating changes in chromatin structure so as to prevent the components of a transcriptional apparatus from interacting with DNA.
Transcriptional Machinery: The general components of the eukaryotic transcription apparatus have been described. Orphanides et al. (1996) Genes Dev. 10:2657-2683; and Conaway et al. (1993) Annu. Rev. Biochem. 62:161-190. Briefly, the transcriptional machinery for mRNA comprises the catalytic core RNA polymerase II (12 subunits), several general transcription factors (TFπ -A, B, D, E, F, H), mediator complex (-20 Srb and Med subunits), elongator complex, co- activator proteins and several additional polypeptides, some of which remain to be defined. Most of these proteins are conserved through evolution and occur in species from yeast to humans. Many of the components of the transcription machinery exist in large multi- subunit complexes which associate with the RNA polymerase II, and are known as the RNA polymerase II holoenzyme. The RNA-polymerase II holoenzyme can be
STiTUTE SHEET (RULE 26 i
broadly described as containing two functional parts. One part is the "catalytic core" that is required for synthesizing mRNA while the other is the mediator (Bjorkland et al. (1996) Trends Biochem. Sci. 21:335-337), a complex of approximately twenty proteins that is required for the holoenzyme to respond to activators. It is believed that the holoenzyme, along with additional factors that "do not associate tightly (such as TBP/TFIID and a class of proteins known as co- activators (Thompson et al. (1993) Cell 73:1361-1375; and Koleske et al. (1994) Nature 368:466-469), constitute the minimal transcriptional machinery recruited by activators to most promoters in vivo. Conversely, as described above, repressors function to inhibit holoenzyme activity, and in some instances they recruit co- repressor proteins.
TFIID (Burley et al. (1996) Ann. Rev. Biochem. 65:769-799), an essential component of the transcriptional machinery, is not typically found associated with the holoenzyme, and is a target of activators and some repressors as well. It is a protein complex containing about thirteen components, including TBP and TBP- associated factors (TAFs). Kim et al. (1993) Nature 365:520-527; Kim et al. (1993) Nature 365:512-520; and Dynlacht et al. (1991) Cell 66:563-576. TBP is a sequence-specific DNA-binding protein that recognizes and binds via the minor groove to a sequence known as the TATA box (consensus: S'-TATAAAA-S') that exists in the promoters of many genes. Hoopes et al. (1992) J. Biol. Chem. 267:11539-1154; and Coleman et al. (1995) J. Biol. Chem. 270:13850-13859. TFIID associates with TFIIA, which is comprised of three polypeptides. TFIIA helps TFIID bind to DNA perhaps by competing with repressors as well as displacing inhibitory domains within TAFs away from TBP. Geiger et al. (1996) Science 272:830-836; and Thompson et al. (1993) Cell 73:1361-1375. TFIIB, a holoenzyme component, also interacts with the promoter DNA and binds to TBP (Nikolov et al. (1995) Nature 377:119-128; and Burley (1996) Nature 381:112-113) and it is proposed to hold the entire complex together as a single unit.
Chromatin Remodeling Machinery: In order for a gene sequestered in chromatin to become available for transcription, the chromatin structure must be remodeled. Felsenfeld (1992) Nature 355:219-224; Kingston et al. (1996) Genes Dev. 10:905-92; and Kadonaga (1998) Cell 92:307-313. Chromatin remodeling
occurs through activator-mediated recruitment of at least two types of chromatin remodeling complexes. One of which comprise the histone acetyl transferases that contain proteins that acetylate certain lysine residues in the amino-terminal tails of histone proteins (Brownell et al. (1996) Curr. Opin. Genet. Dev. 6:176-184), thereby rendering DNA in a nucleosome more accessible to DNA-binding "transcription ~ factors. The second type of chromatin remodeling complex, Swi Snf uses energy derived from ATP hydrolysis to facilitate binding of the transcriptional machinery to a particular promoter. Burns et al. (1997) Mol. Cell. Biol. 17:4811-4819; Quinn et al. (1996) Nature 379:844-847; Kwon et al. (1994) Nature 370:477-481; and Cote et al. (1994) Science 265:65-68. Activators can recruit chromatin remodeling complexes through direct binding. The viral activator VPl 6 has been shown to bind to components of both the multi-protein histone acetyl transferase (HAT) complex (Berger et al. (1992) Cell 70:251-265; and Candau et al. (1997) EMBO J. 16:555- 565), as well as the Swi/Snf complex. In fact, TFIID, another target of VPl 6, was observed to display a weak HAT activity. Mizzen et al. ( 1996) Cell 87 : 1261 - 1270; and Wilson et al. (1996) Cell 84:235-244.
As a corollary it has been shown that certain gene-specific transcriptional repressors mediate their repressive function by recruiting histone deacetylase complexes to a target promoter. Brehm et al. (1998) Nature 391:597-601; and Magnaghi-Jaulin et al. (1998) Nature 391 :601-605. Other repressors are suggested to directly bind histones and/or other similar proteins and these interactions lead to compact chromatin structures that occlude the transcriptional machinery.
The Regulatory Process: Based on current understanding, upon receipt of a signal, an activator bound to a promoter or enhancer recruits the chromatin remodeling machinery to the adjacent promoter. It then recruits the transcriptional machinery to form a pre-initiation complex at the promoter. It appears that assembly of a pre-initiation complex can require two synchronized steps: TFIID/TBP -TATA binding in concert with the association of the holoenzyme with the complex at the promoter. Stargell et al. (1996) Trends Genet. 12:311-315. For mRNA synthesis to be initiated at a particular gene, the complex must open (melt) the double helix to expose the template strands. Once mRNA initiation occurs and after a certain length of transcript is synthesized the polymerase must move away
from the promoter to continue mRNA synthesis. Certain activators such as HSF and Tat function to stimulate this stage of transcription process, possibly by recruiting the pTEFB complex which contains a kinase (Cdk9) capable of phosphorylating the largest of the 12 subunits of the polymerase. It has been reported that promoter escape appears to involve hyperphosphorylation of the carboxy-terminal domain of the largest subunit of the RNA polymerase II. This hyperphosphorylation achieves two goals: first, it can provide the signal to detach the mediator complex from the catalytic core; and second, it can permit the association of RNA processing and elongator complexes with the rapidly elongating polymerase. The release of the mediator and TFIID during promoter escape by the polymerase would provide a mechanistic basis for a re-initiation event by another polymerase catalytic core. Svejstrup et al. (1997) Proc. Natl. Acad. Sci. USA 94:6075-6078; and Zawel et al. (1995) Genes Dev. 9:1479-1490. It has been found that mediator complexes are limiting, whereas the catalytic machinery is more abundant. Moreover, activators directly interact with both the mediator as well as TBP/TFIID, thus, they can play a major role in helping to retain the mediator and/or TFIID at the promoter. Therefore, the next transcription complex can be reassembled rapidly by only recruiting the core fragment of the RNA polymerase II holoenzyme. It is postulated that re-initiation is much more likely than initiation alone to contribute significantly to rapid stimulation of gene expression. And activators must clearly play a role in (Ho et al. (1996) Nature 382:822-826) facilitating multiple rounds of transcription re-initiation.
Repression, on the other hand, requires the opposite series of events. A repressor can first directly engage an activator and mask its activating surface thereby preventing its interactions with the transcriptional and chromatin remodeling machinery. As in the case of MDM-2, after masking the activating region the repressor can also directly interrupt the low-level activator-independent assembly of the transcriptional machinery at the exposed promoter. In the next set of events the repressor, such as Retinoblastoma gene product (Rb), can directly recruit histone deacetylases, which then strip the acetyl groups off the lysine residues on histone tails. It is now believed that deacetylated histone H3 tails are then methylated by methyl transferases, which are also recruited by repressors. The methylated histone
tails bind to chromatin compacting proteins such as HP-1. Thus, in a sequential manner the gene is silenced. Additional components that participate in stimulation as well as repression of a gene will no doubt be discovered in the future, and they shall also be amenable to manipulation by compounds within the scope of this invention.
A regulatory moiety is any molecule that can positively or negatively effect transcription of a target gene, other than by direct electrostatic interaction with double-stranded DNA. Representative embodiments of regulatory moieties include peptides, polypeptides, lipids, carbohydrates, and any combination thereof. A regulator moiety can be naturally occurring or be derived from or an analog of a naturally occurring molecule. Alternatively, it can be synthetic. Preferred regulatory moieties are peptides and small organic molecules. B. Activators An activator is any molecule that can activate transcription of a target gene. According to current understanding, an activator binds to its cognate sites in the genome and recruits the transcriptional machinery to nearby promoters; initiation of transcription then follows. Ptashne et al. (1997) Nature 386:569-577. Activators can be small molecules or peptides. For example, many protein activators are known. The peptides within such proteins that provide activation activity can be identified. Peptidomimetics of such peptides can also be generated, as can other analogs and derivatives. The key is whether the peptide or other molecule retains the desired activation function. Small molecules can be developed from rational design approaches modeled after known activators. Alternatively, or in addition, they can be identified by screening combinatorial libraries, natural product libraries, and/or libraries of already synthesized small organic molecules. Such molecules can be tested for activator function is a suitable test system. For example, one can substitute the regulatory moiety of synthetic regulatory compound already known to provide regulatory activity with a molecule of unknown activity. Such candidate molecules can be screened in an in vitro or cell-based reporter system comprising a reporter construct that carries a reporter gene the expression of which is under the control of a regulatable promoter. Candidate compounds that are found to activate transcription of the reporter gene can be retained for further study. Also, the
particular regulatory moiety can be identified as an activator. The activator can also then be tested for activity in other compounds that employ different linkers and/or nucleic acid binding moieties, and even different regulatory moieties, or multiples of the same regulatory moiety. In this way, it will be possible to develop a library of activators, the members of which can be used for various, and even multiple, applications.
C. Repressors A repressor is any molecule that can inhibit or prevent transcription ofa target gene. Repressors can be small molecules or peptides. For example, many protein repressors are known. The peptides within such proteins that provide repressor activity can be identified. Peptidomimetics of such peptides can also be generated, as can other analogs and derivatives. The key is whether the peptide or other molecule retains the desired repressor function. Repressors can be obtained in ways analogous to those used to identify activators. 3 Linkers
For purposes of this invention, a linker is any molecule that can be used to link at least one nucleic acid binding moiety to at least one regulatory moiety in a manner that allows the nucleic acid binding moiety(ies) to retain its(their) intended nucleic acid binding function(s) and the regulatory moiety(ies) to retain its(their) ability to influence transcription of the target gene. Thus, the linker should provide adequate spacing between and/or orientation with respect to the nucleic acid binding moiety and the regulatory moiety so as to allow each to retain its respective function. A linker is a polymer of backbone units e.g. methylene, propyl, and ether groups. Other backbone units include, but are not limited to, five or six membered rings. One or more backbone units can include a heteroatom including, but not limited to, sulfur, nitrogen or oxygen.
Linkers used in the practice of this invention can be branched, although straight chain molecules are preferred. They can be amphipathic or aliphatic, with molecules in the latter class being preferred. Representative examples of suitable linkers include polyethylene glycol, alkyl chains, and peptides.
A linker can contain one or more first reactive moieties for conjugation by a suitable chemistry to a nucleic acid binding moiety. Similarly, a linker can contain
one or more second reactive moieties for conjugation by a suitable chemistry to a regulatory moiety. The chemistry used to conjugate the first reactive moiety(ies) to the nucleic acid binding moietyries) can be the same or different as the chemistry used to conjugate the second reactive moiety (ies) of the linker to the regulatory moiety(ies). In certain preferred embodiments, the linker is a dendrimer with respect to second reactive moieties, in that it contains two or more such moieties and thus can be useful in linking two or more regulatory moieties having the same or different regulatory activities to a nucleic acid binding moiety.
Linkers can be conjugated to a nucleic acid binding moiety at any suitable location. Such locations include at the one or both ends of the molecule, or at an internal position. Preferably, when conjugated to the nucleic acid binding moiety, the linker is oriented such that upon interaction with the nucleic acid molecule (as mediated by the nucleic acid binding moiety) it is projected away from the nucleic acid molecule so as to avoid steric hindrance or other interaction with the nucleic acid. When the nucleic acid binding moiety is a polyamide, preferred locations for conjugating a linker thereto include the carboxy terminus, the amino terminus, and an internal amino acid residue (e.g., a β-alanine residue, a substituted pyrrole residue, an unsubstituted pyrrole residue, a substituted imidazole residue, and an unsubstituted imidazole residue). With respect to polyamides capable of assuming a hairpin conformation in the minor groove of dsDNA, certain preferred embodiments involve conjugation of the linker to the internal amino acid residue that mediates hairpin formation, for example, γ-aminobutyric acid.
Linkers useful in the practice of this invention can be cleavable, for example, by an enzyme or chemical action (e.g., photooxidation). In this way, for example, the activity of a synthetic regulatory compound can be controlled by endogenous degradative processes.
Suitable linkers can be identified from among candidate linkers by any ofa number of suitable approaches. An example of one such in vitro system is as follows: a first reactive moiety on the candidate linker is used to attach the linker to a nucleic acid binding moiety known to specifically bind to a target nucleotide sequence in dsDNA in proximity to a regulatable promoter from which high level
BSTlTUTE SHEET (RULE 26)
SU
transcription of a reporter gene can be initiated. In this example, the first reactive moiety is used to attach the linker to a portion of the nucleic acid binding moiety in a manner that is not anticipated to substantially disrupt the nucleic acid binding moiety's ability to bind to its target nucleotide sequence. Whether such disruption occurs, or the extent of any such disruption, can be independently assessed by determining the binding affinity of the nucleic acid binding moiety for its target nucleotide sequence before and after linker attachment. After obtaining a nucleic acid binding moiety-linker intermediate that retains some (at least about 10%, preferably at least 25%, and more preferably at least about 50%), substantially all (at least about 70%, preferably at least about 85%, and more preferably at least about 90-95%), or all (more than 95%, preferably more than 98%) of the nucleic acid binding moiety's binding affinity and specificity for its target nucleotide sequence, the intermediate can be linked to a regulatory moiety via a second reactive moiety contained in the linker. Again, it is desirable that the linker-regulatory element linkage not disrupt the biological activity of the moiety to which it is attached via the second reactive group, namely the activator or repressor. Whether such disruption also occurs, or the extent of any such disruption, can be independently assessed by determining the biological activity of a regulatory moiety before and after linker attachment. After obtaining a regulatory moiety-linker intermediate that retains some (at least about 10%, preferably at least 25%, and more preferably at least about 50%), substantially all (at least about 70%, preferably at least about 85%, and more preferably at least about 90-95%), or all (more than 95%, preferably more than 98%) of the regulatory moiety's activity, that linkage mechanism is preferably selected for use generating a synthetic regulatory compound according to the invention. The order of moiety conjugation to the linker can vary, and can depend on the particular nucleic acid binding moiety(ies) and/or regulatory moiety(ies) being employed. 4. Screens
The synthetic regulatory compounds of the present invention comprise at least three elements: a non-natural nucleic acid binding moiety; a linker; and a regulatory moiety. As most applications the compounds of the invention concern use in cells, it is desirable to assay compounds to ensure that they are able to enter into cells and exert the desired effect. As the foregoing suggests, a synthetic
regulatory compound according to the invention that is used in an in vitro system need not exhibit cell permeability. However, in the context of cells, in order to be a synthetic regulatory compound within the scope of the invention, such compound must enter cells, and then exert the intended desired regulatory effect. In general, each of the moieties involved in constructing synthetic regulatory compounds of the invention are preferably independently tested, or screened, for their ability to provide the desired function (targeted nucleic acid binding or regulatory function in the case of nucleic acid binding and regulatory moieties, respectively) or structure (in the case of linkers). For example, a nucleic acid binding moiety will generally be tested to determine if it binds the desired target sequence with the requisite affinity and specificity prior to being incorporated into a conjugate including a linker and/or a regulatory moiety. Similarly, a regulatory moiety is preferably determined to have the desired regulatory effect prior to its inclusion in a conjugate comprising a nucleic acid binding moiety and a linker. Screens to identify such moieties can be conducted as follows. After synthesis, whether by solid state, solution phase, or recombinant techniques, as the case may be, the particular moiety is typically tested in an in vitro format. For example, in the context of nucleic acid moieties, the nucleic acid binding moiety is exposed to nucleic acid molecules which include the intended target sequence, preferably under conditions that at least approximate those expected to be encountered when the molecule is put to its intended use. It is then ascertained by any of a number of conventional methods whether the desired binding events take place, and if so, at what affinities, etc.
In order to synthesize a moiety for testing in the first instance, any suitable method can be employed. Such methods include the synthesis ofa single compound by traditional methods, up through a massively parallel combinatorial approach. For example, a number of combinatorial synthetic methods are known in the art. For example, Thompson & Elman ((1996) Chem. Rev. 9.6:555) recognized at least five different general approaches for preparing combinatorial libraries on solid supports. These were: (1) synthesis of discrete compounds; (2) split synthesis (split and pool); (3) soluble library deconvolution; (4) structural determination by analytical methods; and (5) encoding strategies in which the chemical compositions of active candidate
are determined by unique labels, after testing positive for biological activity. Synthesis in libraries in solution includes at least spatially separate synthesis, and synthesis pools. Additional descriptions of combinatorial methods are known in the art. See, e.g., Lam et al. (1997) Chem. Rev. 97:4111. These approaches can be readily adapted to prepare moieties for use in accordance with the present invention, including suitable protection schemes, as necessary. After synthesis and testing of the various moieties required to make a particular compound according to the invention, they can be assembled into a synthetic regulatory molecule. After a putative synthetic regulatory compound according to the invention has been generated, it can be tested in any number of in vitro or cell-based assays that elicit detectable signals in proportion with the activity of the compound. Preferably, such assays are conducted in a high throughput format. For example, use of a plurality of microtiter plates allows the simultaneous testing of large numbers of candidate compounds, be they in vitro or cell-based assays. High throughput formats are often partially or fully automated, and allow 100, 1,000, 10,000, or more candidate compounds to be screened at one time. As the synthetic regulatory compounds of the invention regulate transcription, such assays typically will employ systems in which one or more reporter genes can be expressed under the control of a promoter that can be influenced by the compound. Examples of suitable reporter constructs include those that encode genes such as luciferase, green fluorescent protein, CAT, etc., although any gene, the expression of which can be readily detected, can be employed. 5. Cell Permeability
In order to be useful in cellular contexts, synthetic regulatory compounds of the invention are preferably cell permeable, i.e., they can traverse a cell's plasma membrane and thus be internalized. Preferably, the compounds inherently have this ability. Alternatively, or in addition, they can be formulated in a composition that facilitates cell entry, for example, a liposome. When so formulated, such compositions can further comprise a cell targeting element to direct the composition to a particular cell type, for example, a cell expressing a specific antigen (e.g., a disease-associated antigen) not expressed on the surface of other cells.
Cell permeability enables a compound according to the invention to enter into a cell's cytoplasm, from which it then moves into the nucleus to exert its intended regulatory effect. As is known in the art, plasma membranes present a barrier to many molecules that, if they could enter a cell, could exert a useful effect. Preferably, a compound that effects an intracellular function or activity, for example, transcription of one or more particular genes, is soluble in both the aqueous compartments of the cells and organism, and preferably also in the lipid bilayers through which it must pass in order to enter cells or organelles, for example, mitochondria and chloroplasts. Nuclei, on the other hand, have large pores that generally do not prevent ingress ofa synthetic regulatory compound according to the invention.
Cell permeability can be assessed in a number of ways. For example, the intracellular concentration ofa compound can be determined, and compared to the extracellular concentration ofa compound. This can also be done over time, so that the rate at which a compound is taken up can be determined. Compositions
To be used for an agricultural or medicinal application, a synthetic regulatory compound of the invention will typically be formulated into a suitable composition. Such compositions include liquid and solid, or dry, compositions. The particular composition selected will depend on a variety of factors, including the particular synthetic regulatory compound(s) to be formulated, the intended use (e.g., for an agricultural purpose (for instance, as a pesticide or as a molecular switch, such as to induce flowering) or a medical application (e.g., to treat or prevent a disease)), the method of delivery (for example, in an agricultural context, spraying or broadcasting, and in a medical context, injection or oral delivery), regulatory requirements, etc.
Formulations suitable for human or non-human animal uses are among the preferred embodiments of this aspect of the invention. With regard to liquid formulations, one or more synthetic regulatory compounds preferably are suspended in an aqueous carrier, for example, in an isotonic buffer solution at a suitable pH.
These compositions can be sterilized by conventional sterilization techniques, or can be sterile filtered. For human or non-human animal use, the compositions can
contain pharmaceutically or veterinarily, as the case can be, acceptable auxiliary substances as required to approximate physiological conditions, such as pH buffering agents. Useful buffers include for example, sodium acetate/acetic acid buffers. The desired isotonicity can be accomplished using sodium chloride or other pharmaceutically acceptable agents such as dextrose, boric acid, sόUium tartrate,~ ' propylene glycol, polyols (such as mannitol and sorbitol), or other inorganic or organic solutes. Sodium chloride is preferred particularly for buffers containing sodium ions. Many pharmaceutically acceptable carriers and their formulation are described in standard formulation treatises, e^ ., Remington's Pharmaceutical Sciences by E.W. Martin. See also, Wang, et al. (1988) J. Parenteral Sci. Tech., Technical Report No. 10, Supp. 42:2S.
The synthetic regulatory compounds of the invention can also be formulated as pharmaceutically acceptable salts and/or complexes thereof. Pharmaceutically acceptable salts are non-toxic salts at the concentration at which they are administered. The preparation of such salts, including addition salts, can facilitate pharmacological or other use by altering the physical-chemical characteristics of the active ingredient without preventing the synthetic regulatory compound from exerting its intended physiological effect. Pharmaceutically acceptable salts of compounds of this invention include salts of their conjugate acids or bases. Exemplary suitable counterions for conjugate acid salts include the chlorides, bromides, phosphates, sulfates, maleates, malonates, salicylates, fumarates, ascorbates, benzenesulfonates, methanesulfonates, p-toluenesulfonates, cyclohexylsulfonates, lactates, malates, citrates, acetates, tartrates, succinates, glutamates, sulfamates, quinates, and the like, in particularly those salts which are FDA acceptable. A conjugate acid salt can be formed by contacting a synthetic regulatory compound in the free acid or base form with a sufficient amount of the desired base or acid, for example, hydrochloric acid, sulfuric acid, phosphoric acid, sulfamic acid, acetic acid, citric acid, lactic acid, tartaric acid, malonic acid, methanesulfonic acid, ethane sulfonic acid, benzene sulfonic acid, /.-toluenesulfonic acid, cyclohexyl sulfamic acid, and quinic acid. Such contacting can typically involve reacting the free acid or base forms ofa synthetic regulatory compound with one or more equivalents of the appropriate base or acid in a solvent or medium in
which the salt is insoluble, or in a solvent such as water that is then removed, for example, in vacuo, by freeze-drying, or by ion exchange. A conjugate acid or base form ofa compound of this invention is considered equivalent to the free base form (or the free acid form, as the case may be) for the purposes of the claims of this invention.
Carriers and/or excipients can also be included in compositions of the invention. Representative examples of carriers and excipients include calcium carbonate, calcium phosphate, various sugars such as lactose, or types of starch, cellulose derivatives, gelatin, vegetable oils, polyethylene glycols, and physiologically compatible solvents. Excipients such as polyhydric alcohols and carbohydrates share the same feature in their backbones, i.e., -CHOH-CHOH-. Useful polyhydric alcohols include such straight-chain molecules as sorbitol, mannitol, inositol, glycerol, xylitol, and polypropylene/ethylene glycol copolymer, as well as various polyethylene glycols (PEG) of various molecular weights, including molecular weights of 200, 400, 1450, 3350, 4000, 6000, and 8000.
Carbohydrates, for example, mannose, ribose, trehalose, maltose, glycerol, inositol, glucose, galactose, arabinose, can also be included.
If desired, liquid compositions can be thickened with a thickening agent such as methylcellulose. They can be prepared in emulsified form, either water in oil or oil in water. Any of a wide variety of suitable emulsifying agents, including pharmaceutically acceptable emulsifying agents, can be employed including, for example, acacia powder, a non-ionic surfactant (such as a Tween), or an ionic surfactant (such as alkali polyether alcohol sulfates or sulfonates, e.g., a Triton). In general, compositions of the invention are prepared by mixing the ingredients following generally accepted procedures. For example, the selected components can be simply mixed in a blender or other standard device to produce a concentrated mixture that can then be adjusted to the final concentration and viscosity by the addition of water or thickening agent and possibly a buffer to control pH or an additional solute to control tonicity. Compositions of the invention will typically be provided in a dosage form or formulation. Any suitable dosage form can be employed, and different dosage forms can be used for different applications. Exemplary formulations within the
scope of the invention include a parenteral liquid dosage form, a lyophilized or freeze-dried unit-dosage form, controlled or sustained release formulations, wherein an effective amount of the active ingredient is released over time, and modifications of these dosage forms that are useful in practicing certain aspects of the invention. Such dosage forms can be administered to a patient via a variety oi routes, including oral, nasal, buccal, sublingual, intratracheal, ocular, transdermal, and pulmonary delivery.
Formulations that support a parenteral liquid dosage form (for intravenous, intramuscular, interperitoneal, peripheral, or subcutaneous injection or infusion, for example) are those in which the active ingredient(s) are stable, and typically the solvent has adequate buffering capacity to maintain the pH of the solution over the intended shelf life of the product, and can optionally include a preservative. The dosage form should be either an isotonic and/or an iso-osmolar solution to either facilitate stability of the active ingredient or lessen the pain on injection or both. Oral delivery can be accomplished in a variety of ways, for example, by liquid (including gel caps) or solid dosage forms, and certain preferred embodiments concern pharmaceutical formulations intended for oral administration. Such formulations can be prepared as solid dosage forms. Particularly preferred solid dosage forms are pills, e.g., capsules, tablets, caplets, or the like suitable for oral ingestion. Solid dosage forms typically contain inert ingredients (e.g., carriers and excipients, as described above) along with the active ingredient to facilitate tablet formation. Numerous capsule manufacturing, filling, and sealing systems are well- known in the art, and can be used to make pills of any desired size. Preferred capsule dosage forms can be prepared from gelatin or starch. After making a capsule dosage form, if desired, it can be coated with one or more suitable materials. For example, one or more enteric coatings can be applied to prevent gastric irritation, nausea, or to prevent the active ingredient from being destroyed by acid or gastric enzymes, or to target a particular gastrointestinal region.
Formulations that support pulmonary and/or intra-tracheal dosage forms include preserved or unpreserved liquid formulations and/or dry powder formulations, as can be used in any suitable device for delivering the composition to the lung, e.g., a metered dose inhaler, nebulizer, or dry powder inhaler. Dissolvable
gels and/or patches are useful to facilitate buccal delivery, and can be prepared from various types of starch and/qr cellulose derivatives. Sublingual delivery can be supported by liquid formulations or by solid dosage forms that dissolve under the tongue. After a dosage form is prepared, it is typically packaged in" a* suitable material. For pill or tablet dosage forms, the dosage forms can be packaged individually or bottled en masse.
The effective dosage of a synthetic regulatory compound of the invention will vary depending upon a variety of factors, including the compound itself, its intended application, etc. When the compound is a pharmaceutical or veterinary drug, the particular dose and dosing regimen can be determined by the attending clinician, and can be further dependent upon such factors as the age, weight, and condition of the patient.
Those skilled in the art will be able to use the preceding information to prepare appropriate formulations for delivery of compositions comprising a synthetic regulatory compound of the invention. Other necessary information is known in the art and can be utilized to prepare appropriate formulations. Applications
Misregulation of transcriptional cascades is often the cause of disease; thus, gene-specific, particularly regulatory element-specific, modulation of gene expression mediated by, for example, cell permeable synthetic regulatory compounds offer the opportunity to treat or prevent disease that arise due to ectopic gene expression. An important application of the compounds of the invention is in the study of gene expression at a mechanistic level in cell-free systems and in living cells. Nucleic acid binding moieties (e.g., polyamides) employed in the compounds of the invention provide a level of promoter- or other regulatory element selectivity that can not be afforded by natural or chimeric transcriptional factors.
In certain preferred embodiments of the invention, compositions comprising a synthetic regulatory compound can be used to modulate physiological processes in vivo. In animals, particularly humans, primates, and domestic animals, one can affect development by controlling the expression of particular genes, modify physiological processes, such as accumulation of fat, growth, response to stimuli,
etc., and treat or prevent disease. Domestic animals include bovine, canine, equine, feline, murine, ovine, and.porcine animals, as well as fish and birds. The compositions of the invention can also be used agriculturally, for example, to control plant and animal pests, to affect gene regulation in plants, particularly commercially important crops.
The synthetic regulatory compounds of the invention can be used for the treatment or prevention of various disease states. The subject compositions can used, for example, therapeutically, to inhibit or activate the expression of one or more genes, which can change the phenotype of cells, either endogenous or exogenous to the host or patient, where the phenotype is detrimental. For example, synthetic regulatory compounds can be developed that contain a nucleic acid binding moiety that specifically binds to nucleic acids (e.g., double-stranded genomic DNA) in pathogens (i.e., viruses and pathogens of either eukaryotic or prokaryotic origin) that are involved in the regulation of expression of certain pathogenic genes, for example, genes required for replication or virulence. Alternatively, the nucleic acid binding moiety can be chosen to target a synthetic regulatory compound to a unique sequence of the pathogen that is not found in the genome of the pathogen's host. Thus, by inhibiting the expression of housekeeping or other genes of bacteria or other pathogens, particularly genes specific to the pathogen, one can provide for inhibition of proliferation and/or virulence of the particular pathogen.
Similarly, where a gene may be essential to proliferation or protect a cell from apoptosis, where such cell exhibits undesired proliferation, the subject compositions can be used to inhibit expression of the gene by inhibiting transcription or chromatin remodeling thereof. Alternatively, where a disease phenotype is caused by under-expression ofa gene, its expression can be activated by providing a compound according to the invention. An important application of the invention's synthetic regulatory compounds is in the prevention and treatment of cancer. For example, expression of specific oncogenes can be inhibited or prevented. Similarly, expression of genes inappropriately up-regulated in cancer cells can also be inhibited or prevented. Also, other genes whose expression is essential to the maintenance of an immortal phenotype, e.g., the genes coding for the RNA and protein components of telomerase, can be down-regulated using a
TE SHEET (RULE 26)
synthetic regulatory compound. An alternative strategy involves the activation of genes that give rise to an apoptotic cascade or whose expression is down-regulated as part of the tumorigenic process. These approaches to regulating transcription will find application in situations such as cancers, such as sarcomas, carcinomas and leukemias, restenosis, psoriasis, lymphopoiesis, atherosclerosis, pulmonary fibrosis, primary pulmonary hypertension, neurofibromatosis, acoustic neuroma, tuberous sclerosis, keloid, fibrocystic breast, polycystic ovary and kidney, scleroderma, inflammatory diseases such as rheumatoid arthritis, ankylosing spondilitis, myelodysplasia, cirrhosis, esophageal stricture, sclerosing cholangitis, retroperitoneal fibrosis, etc. Inhibition or activation, as the case may be, can be associated with one or more specific growth factors, such as the families of platelet- derived growth factors, epidermal growth factors, transforming growth factor, nerve growth factor, fibroblast growth factors, e.g., basic and acidic, keratinocyte fibroblast growth factor, tumor necrosis factors, interleukins, particularly interleukin 1, interferons, etc. In other situations, one can wish to inhibit a specific gene that is associated with a disease state, such as mutant receptors associated with cancer, or inhibit the arachidonic cascade, expression of various oncogenes, including transcription factors, such as ras, myb, myc, sis, src, yes, fps/fes, erbA, erbB, ski, jun, crk, sea, rel, fins, abl, met, trk, mos, Rb-1, etc. Other conditions of interest for treatment with the subject compositions include inflammatory responses, skin graft rejection, allergic response, psychosis, sleep regulation, immune response, mucosal ulceration, withdrawal symptoms associated with termination of substance use, pathogenesis of liver injury, cardiovascular processes, neuronal processes, and, in particular, where specific T-cell receptors are associated with autoimmune diseases, such as multiple sclerosis, diabetes, lupus erythematosus, myasthenia gravis, Hashimoto's disease, cytopenia, rheumatoid arthritis, etc., the expression of the undesired T-cell receptors can be diminished, so as to inhibit the activity of the disease-associated T-cells. In cases of reperfusion injury or other inflammatory insult, one can provide for inhibition of enzymes associated with the production of various factors associated with the inflammatory state and/or septic shock, such as TNF, enzymes that produce singlet oxygen, such as peroxidases and superoxide dismutase, proteases, such as elastase, INF-γ, IL-2, factors that induce proliferation
of mast cells, eosinophils, IgG, IgE, regulatory T cells, etc., or modulate expression of adhesion molecules in leukocytes and endothelial cells.
Compounds of this invention can be screened for their in vitro activities against different species of bacteria and fungi. The minimal inhibition concentration (MIC) of these compounds was determined using the National Committee for' Clinical Laboratory Standards (NCCLS) broth microdilution assay in microtiter plates, as set forth in: (1) the guidelines of the National Committee for Clinical Laboratory Standards (NCCLS) Document M7-A4 (NCCLS, 1997); (2) the guidelines of the National Committee for Clinical Laboratory Standards (NCCLS) Document MI I -A4 (NCCLS, 1997); and (3) the guidelines and reference method of the National Committee for Clinical Laboratory Standards (NCCLS) Document M27-T (NCCLS, 1995). For antifungal assays, the method recommended in Murray, PR., 1995 Manual of clinical microbiology (ASM Press, Washington, DC), was employed. A variety of Gram positive and Gram-negative bacteria (aerobes and anaerobes) as well as yeasts and filamentous fungi were tested. These organisms included Staphylococcus spp., Streptococcus spp., Enterococcus spp., Coryne bacterium spp., Listeria spp., Bacillus spp., Micrococcus spp., Peptostreptococcus spp, Clostridium spp., Propionibacterium spp., Escherichia spp., Pseudomonas spp., Haemophilus spp., Candida spp., Cryptococcus spp., Aspergillus spp., Trichophyto spp., Paecilomyces spp., Saccharomyces spp. and Fusarium spp. In addition, some drug resistant microbes were also evaluated with this assay. Other pathogenic bacteria against which compounds of this invention can be effective include Acinetobacter spp., Alcaligenes spp., Campylobacter spp., Citrobacter spp., Enterobacter spp., Proteus spp., Salmonella spp., Shigella spp., Helicobacter. spp., Neisseria spp., Vibrio spp., Bacteroides spp., Prevotella spp., Mycoplasma spp., Mycobacteria spp., and Clamydia SPP.
Other opportunities for use of the subject synthetic regulatory compounds include modulating the level of expression of genes coding for receptors, ligands, enzymes, changing phenotype of cells, modifying the response of cells to drugs or other stimuli, e.g., enhancing or diminishing the response, and inhibiting or activating the expression of one of two or more alleles.
WTTrUTESHE
To accomplish the above representative treatments and therapies, individual and multiple compounds can be employed, directed to the same dsDNA region, but different target sequences, contiguous or distal, or different DNA regions, depending upon the number of genes which one wishes to modulate the expression of. The subject compositions can also be used as a sole therapeutic agenfor in combination with other therapeutic agents. Depending upon the particular indication, other drugs can also be used, such as antibiotics, antisera, monoclonal antibodies, cytokines, anti-inflammatory drugs, and the like. The subject compositions can be used for acute or chronic situations, where a particular regimen is devised for the treatment of the patient.
The following examples are provided to assist in understanding the present invention. The examples and experiments described below should not, of course, be construed as specifically limiting the invention and such variations of the invention, now known or later developed, which would be within the purview of one skilled in the art in view of the description provided herein.
EXAMPLE 1 Preparation of Thioesters for Ligating Peptides With Non-Native Substrates This example provides a one-pot procedure for the synthesis of thioesters from primary amines. Polyamides containing one or more primary amines were prepared by solid-phase synthesis using standard methods (see U.S. Patent Nos. 6,090,947 and 5,998,140) and reacted with thiolane-2,5-dione followed by alkylation with benzyl bromide to produce the target thioesters in good yield. The thioesters thus prepared were conjugated to peptides to produce synthetic regulatory compounds according to the invention using the "native chemical ligation" method. This flexible synthetic procedure provides a ready route to both natural and unnatural substrates for chemical ligation reactions.
In order to prepare large numbers of putative synthetic regulatory compounds, a streamlined synthetic process was desired. Of the many coupling techniques available to prepare peptide conjugates, the most versatile was the "native chemical ligation" procedure originally developed for the synthesis of proteins too large to be accessed by standard solid-phase synthesis approaches. Dawson et al. (1994) Science 266:776-779. In this reaction, a peptide containing a
SUBSTIT
carboxy-terminal thioester and a peptide with an amino-terminal cysteine are combined in denaturing buffer. Upon transesterification of the thioester with the cysteine thiol, an S-»N acyl shift takes place to generate a ligated product in which the two halves are now connected by an amide bond. The product recovery of this sequence is generally good, and the facility of the reaction appears sequence independent. Total syntheses of many natural and modified proteins have been reported using this method. Cotton et al. (1999) Chem. Biol. 6:R247-R256.
To adapt this powerful reaction for present purposes, it was necessary to prepare polyamides containing the requisite thioester functional group. Available methods for the preparation of suitable thioesters range from biochemical approaches (Muir et al. (1998) Proc. Natl. Acad. Sci. USA 95:6705-6710; and Welker et al. (1999) Biochem. Biophys. Res. Commun. 254:141-151) to solid-phase synthesis using modified resins. See, e.g., Camarero et al. (2000) Lett. Peptide Sci. 7:17-21; Shin et al. (1999) J. Am. Chem. Soc. 121:11684-11689; Ingenito et al. (1999) Tetrahedron Lett. 121:11369-11374; Schwabacher et al. (1993) Tetrahedron Lett. 34:1269-1270; Clippingdale et al. (2000) J. Pept. Sci. 6:225-234; Hojo et al. (1991) Bull. Chem. Soc. Jpn. 64:111-117; Yamashiro et al. (1988) Int. J. Peptide Protein Res. 31:322-334; and Yamashiro et al. (1981) Int. J. Peptide Protein Res. 18:385-392. However, all previous methods provided products with thioesters at the carboxy-terminal position, severely restricting their utility in the context needed to generate the desired range of compounds useful in the practice of this invention, as, in certain embodiments, it was desired to prepare compounds with multiple peptides attached to one polyamide. Accordingly, it was necessary to develop a synthetic approach that would allow such flexibility. Polyamides were prepared by solid-phase synthesis (Baird et al. (1996) J.
Am. Chem. Soc. 118:6141-6146; and U.S. Patent No. 6,090,947), and a functional group that is straightforward to introduce at one or more positions is a primary amine. Therefore, an electrophile was needed that would produce a thioester or thioacid when reacted with an amine. Thiolane-2,5-dione was readily available from succinic anhydride by reaction with sodium sulfide (Kates et al. (1995) J. Heterocyclic Chem. 32:971-978), and thus was tested. Scheme I
Conditions: (a) DIPEA, NMP, rt. (b) 100 mM NaOAc (pH 5.2), benzyl bromide, 0 "C (52%, 2 steps), (c) 100 mM potassium phosphate buffer (pH 7.3), 6 M GrvHCI, 5% NMP, 5% PhSH, 4 d, rt (29%).
Referring to Scheme I, to test this approach, hairpin polyamide 1, containing a primary amine, was prepared by solid-phase synthesis. Trauger et al. (1996; Baird et al.; and U.S. Patent No. 6,090,947. Polyamide 1 and thiolane-2,5-dione were combined in N-methylpyrrolidinone (NMP) with N,N-diisopropylethylamine (DIPEA) at ambient temperature and reaction progress monitored by analytical reversed-phase HPLC (Scheme 1). Polyamide 1 was rapidly consumed and thioacid 2 isolated by ether precipitation. Alternatively, thioester 3 was produced in a "one- pot" conversion by lowering the pH of the initial reaction mixture with pH 5.2 NaO Ac buffer and adding benzyl bromide. As monitored by analytical HPLC, the conversion from 1 to 3 was virtually quantitative. A typical reaction procedure was conducted as follows: to a solution of 18.6 μmoles (23.5 mg) polyamide 1 in 0.600 mL NMP was added thiolane-2,5-dione (25.0 μL ofa 1.00 M solution, 25.0 μmoles), followed by 9.70 μL (55.6 μmol) of DIPEA. After 10 min., conversion to thioacid 2 appeared complete. The reaction mixture was diluted with 900 μL 100 mM NaO Ac buffer (pH 5.2) and cooled to 4°C, necessary to prevent the formation of dialkylation products in the subsequent step. Benzyl bromide (55.8 μmol, 6.70 μL) was added with thorough mixing and after an additional 10 min., the thioester 3 was isolated by semi-preparatory reversed-phase HPLC as a pale yellow powder in
52% overall yield (14.2 mg, 9.66 μmol).
Data for thioester 3: 1H NMR (500 MHz, DMSO-d6): δ 1.58-1.68 (m, 2), 1.72-1.82 (m, 4), 2.28-2.36 (m, 4), 2.41 (t, 2, J= 6.6 Hz), 2.72 (d, 2, J= 4.9 Hz), 2.84 (t, 2, J= 6.8 Hz), 2.94-3.0 (m, 2), 3.04-3J0 (m, 4), 3J6 (tt, 2, J = 6.9, 13.9), 3.21 (m, 2), 3.80 (s, 3), 3.81 (s, 3), 3.83 (s, 3), 3.84 (s, 3), 3.84 (sTS 3.85 (br s, 5), 3.91 (d, 3, J= 2.0 Hz), 3.99 (s, 3), 4J0 (s, 3), 6.84 (d, 1, J= 1.7 Hz), 6.88 (d, 1, J= 2.0 Hz), 6.91 (d, 1, J= 2.0 Hz), 7.05 (br s, 3), 7J6-7J8 (m, 3), 7.21-7.30 (m, 7), 7.39 (s, 1), 7.86 (t, 1, J= 5.6 Hz,), 7.99-8J0 (m, 6), 9.09 (br s, 1), 9.84 (s, 1), 9.89 (s, 1), 9.90 (s, 1), 9.91 (s, 1), 9.93 (s, 1), 9.96 (s, 1), 10.46 (s, 1). MALDI-TOF MS [M+H] (monoisotopic) calculated 1470.7, observed 1470.7.
Thioester 3 underwent reaction under standard "native chemical ligation" conditions with peptides having an amino-terminal cysteine (such as compound 4, below) to produce compound 5. Due to the hydrophobicity of thioester 3, it was necessary to dissolve the thioester in NMP (5-10% of reaction volume) prior to addition of aqueous denaturing buffer. Compound 5 was recovered in 29% yield (after reversed-phase HPLC purification) under these reaction conditions (MALDI- TOF MS [M+H] (average mass) calculated 6935.0, observed 6934.8).
To gauge the generality of this method for preparing polyamides with multiple thioesters at diverse positions within the structure, compounds containing one or more internal primary amine(s) were used to provide the requisite
functional group handles. The above reaction sequence was applied to afford thioesters such as compounds 6 and 7. MALDI-TOF MS [M+H] (monoisotopic) 6: calculated 1470.7, observed 1470.7; 7: calculated 1719.7, observed 1719.8.
Compound 6 was isolated in 55% yield after HPLC purification. Incorporation of
two thioesters was as facile as one, with conversion to product nearly quantitative as determined by analytical HPLC to furnish thioester 7 in 30% yield. In all cases, subsequent conjugation via "native chemical ligation" to peptides containing amino- terminal cysteine residues proceeded readily (5-45% yields of isolated conjugate).
In summary, the one-pot reaction sequence described herein allows rapid access to thioester-modified products suitable for "native chemical ligation" reactions (Scheme 2).
SH .peptide XS s ) HΛ —"
substrate = any 1° amine-containing natural or unnatural molecule x is greater than or equal to 1
Most notably, this procedure requires only a primary amine and can be used to install multiple functional groups. This sequence works equally well for the functionalization of α-amino acids. While this example describes the use of this method to prepare polyamide-linker-regulatory compounds, it is extendable to the preparation of other chimeric target molecules. Finally, although the thioester products function well in the native chemical ligation reaction, other ligation approaches are equally applicable such as the Staudinger ligation or alkylation of thioacids. Schnolzer et al. (1992) Science 256:221-225.
Example 2 Gal4 dimerization domain
This Example describes the combination of a weakly dimerizing Gal4 domain (residues 73-100 of Gal4) as a linker to connect the polyamide and activator moieties of Example 3 (YLLPTCIP ("XL" SEQ ID NO: 2)) in order to make synthetic regulatory compounds of the invention, as well as such compounds that instead comprise a flexible polyethylene glycol linker. XL is described in Lu et al. (2000) Proc. Natl. Acad. Sci. USA 97:1989-1992. The synthetic process described in Example 1 was used to generate each of three compounds (7-9, Figure 1 SEQ ID NOs: 1, 2 and 3) tested in the course of the experiments described below.
As shown in Figure 1 A, compounds 7, 8, 9 had the same structure, except that the linker domain of each compound was different. Compound 7 did not contain a linker per se; instead, the activator peptide was linked directly the C- terminus via a Cys residue during the native ligation procedure. Conjugate 8 contained a PEG linker, and compound 9 contained as a linker the'weak dimerization domain comprising residues 73-100 of Gal4. Each of the compounds was generated in moderate to good yield using the synthetic process of Example 2, and each was sufficiently water soluble to be purified and characterized, although to a lesser degree than the polyamide moiety alone. Each conjugate had a binding affinity for the target nucleotide sequence (the same site for each compound) that was at least about 10-fold less than that of the polyamide moiety alone for its cognate target sequence. Compound 9 bound inverted target dimer site (Figure IB) with an approximately 4-fold greater affinity (Ka = 9 x lO^"1 vs. 2 x 107 M"1), indicating that a favorable interaction occurs between adjacent compounds. The use of organic solvents in DNase I footprinting experiments was necessary.
Example 3 Compounds Comprising AH Activators and Gcn4 Linkers Attached Via Internal Residues This example describes the synthesis and testing of several synthetic regulatory compounds according to the invention that also employ polyamides as the nucleic acid binding moiety. In several of these compounds, the linker is attached at an internal pyrrole unit rather than at the C-terminus of the polyamide. See Figure 2. Also, the use of an acidic activator peptide (PEFPGIELQELQELQALLQQ ("AH", SEQ ID NO: 5) is described. The AH activator increases water solubility, and also contacts different components of the transcriptional machinery as compared to the small hydrophobic XL peptide. Finally, alternative dimerization domain (Gcn4 251- 281) is described (compounds 10 (SEQ ID NO: 4), 11, 13 and 14, Figure 2). This dimerization domain is derived from the leucine zipper region of the yeast protein Gcn4. It is relatively small (about 30 residues) and soluble. Gcn4 conjugates containing XL (conjugates 10 and 13) also included amino acid residues 90-100 of Gal4 believed to be necessary for full function. Finally, a series of compounds comprising two activator peptides linked to a single non-natural DNA binding
BSTITUTE SHEET (RULE 26)
SU1
moiety, thus an intramolecular dimerization domain, were also synthesized (compounds 18-20, Figure 2).
Each of compounds 10-20 was prepared by joining a linker-activator peptide conjugate to a polyamide via native ligation, as described in Example 1, Scheme 1. All of the compounds were prepared in low (5%) to good (50%) yfe d, and their" respective identities confirmed by MALDI-TOF mass spectrometry.
DNase I footprint titrations were performed to determine DNA binding affinities. The DNA fragments used for the titrations were 32P-labeled restriction fragments containing an inverted dimer binding site separated by 5 or 7 base pairs, as well as monomer sites. Each of the compounds bound DNA with reasonable (Ka>107M ) affinity. The conjugates containing the Gcn-4 dimerization domain exhibited much improved solubility characteristics and bound DNA with greater affinity than the corresponding Gal4 conjugates. Compounds containing a Gcn4(251-281) linker (compounds 10, 11, 13, and 14) bound a palindromic dimer site with about 5 fold higher affinity than the corresponding monomer site.
Each of the conjugates was separately tested in an in vitro transcription assay derived from yeast nuclear extracts to assess transcription activation function. The template DNA used in the reactions contained three palindromic binding sites 50 base pairs upstream of the transcription start site. See Figure 3. In these assays, compounds 9 and 17-20 did not activatei'transcription, and in some cases, inhibition or levels lower than basal were observed.
Assays to which compounds 9, 15, and 16 were added exhibited measurable (2-3 fold) levels of transcription above basal levels. Compounds 10, 11, 13, and 14 activated transcription efficiently (5-10 fold) on three different templates that contained palindromic binding sites, and the size of the palindromic binding sites appeared to affect transcription levels only slightly.
Preliminary experiments with other compounds indicated that a dimerization region could be dispensed with. Compound 11 was the most active, giving consistently high levels of activation. Minor effects of binding site size were also observed, particularly for compounds 10 and 13, which each contain XL as the activator peptide. Compound 12 was a weak activator in these assays, presumably due to an interruption of the helix proposed to form at residues 86-96 by attachment
SUBSTITUTE SHEET PLE 26)
of the Gcn4(251-281) region which could significantly disrupt presentation of XL. Matching the phasing of the two helices can therefore improve the activity of such compounds (i.e., compounds wherein the linker and activator moieties are both peptides). The order of addition of the various assay components has 'some effect upon the levels of transcription observed. When conjugates were pre-incubated with nuclear extracts prior to the addition to the template DNA, activated transcription was detected at conjugate concentrations of 5-50 nM. When conjugates were pre- incubated with template DNA for 1.5 hr. (as in the DNase I footprinting experiments) before nuclear extract was added, activation was not observed at concentrations lower than 50 nM, and higher overall levels (7-15-fold) of activated transcription were obtained at higher concentrations (500 nM).
Example 4 Compounds Comprising Non-Dimerizing Linkers This Example describes the design, synthesis, and testing ofa series of compounds (21-28 see Figure 4) in which the Gcn4-derived linker portion (Gcn4(251 -28 l))of compound 11 (see Example 5) was replaced with a "scrambled" peptide in which the amino acid residues of the linker were re-ordered so as to disrupt the helix-forming propensity of the coiled-coil motif of native peptide (compound 21) believed to be participate in dimerization of Gcn4 molecules; the polyamide and activating domains were the same as those in compound 11. To further probe the relative importance of dimerization versus projection from DNA, compounds in which Gcn4(251-281) were replaced by one polyethylene glycol linker (compound 22), two polyethylene glycol linkers (compound 23), or no linker (compound 24) were constructed. Compound 22 was predicted to orient the AH activation domain away from the double helix and project approximately one-half the distance provided by the Gcn4(251-281) linker, whereas compound 23, which comprised two polyethylene linkers, was predicted to position the AH activation domain away from the DNA at approximately the same distance as the Gcn4(251- 281) linker. Control compounds (compounds 25-28; see Figure 4) were also generated.
Each of the compounds was prepared by methods described in the previous examples, and their respective identities were confirmed by MALDI-TOF mass spectrometry. In vitro transcription assays were performed using a reporter construct that separated the inverted repeat of the nucleotide sequence targeted by the polyamide (the same hairpin polyamide used in the compouridl'described in~ " Example 3) by seven nucleotides. The inverted repeat was replicated three times, with the 3 '-most repeat being spaced 50 base pairs from the TATA box of the AdML promoter, which was used to regulate expression of luciferase in a g-less expression cassette. Compound 11 was observed to activate expression of the reporter gene 25-
40-fold over basal levels at compound concentrations of 250-350 nM and transcription times of 30-40 minutes. Control compounds 25-28 did not produce detectable levels of expression of the reporter gene product under comparable conditions, as expected. Compound 21 activated transcription 1.5-2-fold over basal levels. In contrast, when AH was attached directly to the polyamide with no intervening linker (compound 24), only minimal (2-3-fold) activation was observed. On the other hand, inclusion ofa single polyethylene glycol flexible linker (compound 22) resulted in activation levels (10-fold) approximately half those generated by compound 11. Finally, on a template containing three "mismatch" palindromic binding sites, the strongest activator (compound 11) activated transcription minimally (less than 2-fold), and only at high concentration (500 nM).
The compounds described in this Example demonstrate that a discrete dimerization domain is not essential for activator function. That compound 11, which comprises a leucine zipper region derived from Gcn4, exhibits stronger activating potential than compounds containing a flexible linker leads to the conclusion that projection from DNA is the crucial function of the linker region. Furthermore, when the capacity to project AH from DNA via coiled-coil formation was removed (compound 21), the function of the conjugate as an activator was also severely compromised. Finally, transcription activation by these compounds demonstrates that the DNA binding domain of an activator plays no fundamental role in activation other than specifying the location of DNA binding.
EXAMPLE 5 Activation of gene expression by small molecule transcription factors As discussed elsewhere herein, naturally occurring eukaryotic transcriptional activators typically, at a minimum, comprise a dsDNA binding domain and a separable activation domain, and most activator proteins also contain a dimerization module between the DNA binding and activation domains. This example describes preferred embodiments of a class of synthetic regulatory compounds according to the invention that mimic natural transcription factors, namely compounds comprising a dsDNA binding domain comprised of a polyamide, a peptide-based activation domain derived from a designed or naturally occurring transcription activating protein, and either a peptidic or polyethylene glycol linker (diglycolic anhydride).
As shown in this example, such compounds can mediate high levels of DNA site-specific transcriptional activation in vitro. A representative example of such molecules had a molecular weight of about 4.2 kDa, and contained a sequence- specific DNA binding polyamide to serve as the DNA binding region, a non-peptide linker instead ofa dimerization peptide, and an activating region (here, a designed peptide) designated "AH", that comprised the following amino acid sequence: PEFPGIELQELQELQALLQQ (SEQ ID NO: 5, Giniger et al. supra). Because synthetic polyamides can be designed to recognize any specific double-stranded nucleic acid sequence, these results demonstrate that synthetic regulatory compounds can be designed to up-regulate the expression of any specified gene.
The compounds described in this Example incorporate non-natural or synthetic counterparts for each of the functional modules typically found in naturally occurring activator proteins. In the compounds, the protein-based DNA binding module was replaced with a hairpin polyamide composed of N-methylpyrrole (Py) and N-methylimidazole (Im) amino acids that binds in the minor groove of DΝA (Figure 4). The hairpin polyamide selected for the present study, ImPyPyPy-γ- PyPyPyPy-β-Dp (where γ is γ-aminobutyric acid, β is β-alanine, and Dp is dimethylaminopropylamide; polyamide 1), binds the target nucleotide sequence 5'- TGTTAT-3' with a dissociation constant (KD) of 1J nM. Initially, a palindromic
HEET RUL 2δ)
binding site containing this sequence as an inverted repeat separated by 7 bp was targeted. A peptidic dimerization element known to form a coiled-coil, residues 251-281 of the yeast protein Gcn4 (Ellenberger et al. (1992) Cell 71:1223-1237), was used to link the polyamide to the AH activation domain. Synthesis of Conjugates 3, 4, and 5 (Figure 5). Polyamide 1 (Example 1) was prepared according to established protocols as described above and then was combined with 1.2 equivalent (eq.) thiolane-2,5-dione (Kates et al. (1995)) and 3 eq 7-Pr2EtN in l-methyl-2-pyrrolidinone at a final concentration of 10 mM. After 15 min., 1.5 vol. of 100 mM NaOAc (pH 3.2) were added followed by 3 eq. of benzyl bromide. After an additional 15 min., the reaction mixture was subjected to purification by reversed-phase HPLC, and the appropriate fractions were concentrated to isolate compound 2 (Example 1) (53%) as a white powder. The matrix-assisted laser desorption/ionization time of flight (MALDI-TOF) MS analysis of compound 2 revealed the following: [M+H] calculated (monoisotopic) 1470.7, observed 1470.7.
Synthetic peptides 3 (SEQ ID NO: 10), 4 (SEQ ID NO: 11), and 5 (Figure 5A) were synthesized using established peptide synthesis protocols. Peptide 3 contained both the Gcn4 dimerization domain and the AH activation domain at the carboxy-terminus of the peptide. Peptide 4 contained the dimerization domain from Gcn4 without the AH peptide; and peptide 5 contained the AH domain alone. In addition, each of peptides 3, 4, and 5 when synthesized included an amino-terminal Cys residue to facilitate linkage to compound 2. In separate reactions, polyamide 2 (1 μmol) was combined with either peptide 3, 4, or 5, (0.8-1 μmol) in 5% 1-methyl- 2-pyrrolidinone in 6 M Gn HC1, 100 mM potassium phosphate buffer (pH 7.3), and 10% (vol/vol) thiophenol was added to this solution. Reaction progress was monitored by analytical HPLC and upon completion (3-5 days), purification of the mixture by reversed-phase HPLC resulted in isolation of the desired conjugates. Yields and characterization were as follows: compound 3 (PA-Gcn4-AH): 11%, MALDI-TOF [M+H] (average mass) calculated 7465.4, observed 7465.4; compound 4 (PA-Gcn4): 21%, MALDI-TOF: [M+H] (average mass) calculated
TlTϋTESHEE ^
5159.8, observed 5159.9; compound 5 (PA-AH): 22%, MALDI-TOF: [M+H] (average mass) calculated 3774.2, observed 3774.5.
Synthesis of Conjugates 8 and 9 (Figure 6). The ethylene gly col-derived linker was prepared as the N-t-butoxycarbonyl amino acid for use in solid-phase synthesis from 4,7J0-trioxa-l,13-tridecanediamine by monoproteclion with N-t- ' butoxycarbonyl anhydride followed by reaction with diglycolic anhydride and incorporated into polyamides 6 and 7 according to established protocols. Baird et al. (1996). Transformation into conjugates 8 (PA-1L-AH) and 9 (PA-2L-AH) was accomplished as described above. Yields and characterization: compound 8: 12%, MALDI-TOF MS [M+H] (average mass) calculated 4164.7, observed 4164.6; compound 9: 11%, MALDI-TOF MS [M+H] (average mass) calculated 4482.0, observed 4482.2.
DΝase I Footprinting Titrations. Quantitative DΝase I footprinting titrations were carried out in accordance with established protocols on a 3-32P- labeled 271-bp pPT7 EcoRI/EvwII restriction fragment. Trauger et al. supra; and Senear et al. (1986) Biochem. 25:7344-7354.
In Vitro Transcription Assays. The template plasmid was constructed by closing a 78-bp oligomer bearing three cognate palindromic sequences into a BgHl site 30 bp upstream of the TATA box of pML 53. This plasmid has the AdML TATA box 30 bp upstream of a 277-bp G-less cassette. The "mismatch" template (containing a substitution of a T/A base pair with a G/C base pair, as compared to the "match," or target site) was constructed by cloning a 78-bp oligomer containing three palindromic "mismatch" sites into a Bglil site 30 bp upstream of the TATA box of pML 53. Gu et al. (1999) Mol. Cell. 3:97-108. For each reaction, 20 ng of plasmid (30 fmol of palindromic sites) was preincubated with conjugate for 75 min before the addition of 90 ng of yeast nuclear extract in a 25 μL reaction volume under standard conditions. Lue et al. (1991) Met. Εnzymol. 194:545-55; and Lue et al. (1989) Science 246:661-664. The reactions were processed as described (Lue et al. (1991); and Lue et al. (1989)) and resolved on 8% 30:1 polyacrylamide gels containing 8 M urea. Gels were dried and exposed to photostimulatable phosphorimaging plates (Fuji). Data were visualized by using a Fuji Phosphorlmager followed by quantification using MACBAS software (Fuji).
Results and Discussion
Synthesis of Artificial Activators. The hairpin polyamides and peptides were synthesized by solid-phase protocols, and the peptides each contained an N- terminal cysteine for subsequent attachment to the polyamide. Polyamide 1 then was treated with thiolane-2,5-dione followed by benzyl bromide to produce thioester 2 functionalized for use in the native ligation procedure described by Kent and colleagues (Fig. 5). Dawson et al. (1994). Three polyamide-peptide conjugates were prepared by this sequence. Conjugate 3 (PA-Gcn4-AH) contained the eight- ring hairpin polyamide as the DNA binding module in addition to residues 251-281 of the yeast protein Gcn4 as a dimerization element and the designed peptide AH as the activating region. The two control compounds 4 (PA-Gcn4) and 5 (PA- AH) each lacked one or another of components critical for activator function.
Based on available crystal structures (Ellenberger et al. (1992); and Dawson et al. (1994)), it was expected that the target site specificity of the polyamide would target the synthetic regulatory compounds to the palindromic binding site, as shown in Fig 7. The data from a quantitative DNase I footprinting titration between conjugate 3 and DNA containing the 19-bp site were fit by a cooperative binding isotherm (KD 11 nM) (Fig. 8A) (Senear et al. supra; and Brenowitz et al. (1986) Met. Enzymol, 130:132-181), confirming this expectation. The decrease in overall binding affinity of conjugate 3 relative to the parent hairpin polyamide can be attributed to the attachment of the peptide at the C-terminal position of the polyamide, known to have a deleterious effect.
Activation of Transcription. Conjugate 3 (PA-Gcn4-AH) activated transcription in yeast nuclear extracts on a DNA template containing three palindromic binding sites upstream of the start site (Fig. 8B). Lue (1991); and Lue (1989). Thus, inclusion of compound 3 at 200 nM concentration in the reactions resulted in 13-fold levels of activated transcription over basal levels. In control experiments, polyamide 1 alone or polyamide coupled to the Gcn4 dimerization domain (polyamide 4) but lacking AH did not stimulate transcription. Furthermore, activation depended on the presence of cognate polyamide binding sites upstream of the transcription initiation site. Thus, on a template with palindromic sites
containing a single base pair mismatch at each half site, compound 3 failed to significantly activate transcription (Fig. 8C).
Time Dependence of Activation. The time course experiment (the results of which appear in Figure 9 A) revealed that the activation profile of conjugate 3 (PA-Gcn4-AH) was consistent with that previously determined for protein transcriptional activators. Wootner et al. (1990) J. Biol. Chem. 265:8979-8982. At 20 min., the level of transcription was 40-fold above the basal level. Figure 9B shows that at high concentrations of free AH peptide, the activation elicited by compound 3 was decreased by about 50%. AH peptide thus competes with the DNA-bound compound 3 for binding to the transcriptional machinery in a phenomenon referred to as squelching. Gill et al. (1988) Nature 334:721-724; and Tasset et al. (1990) Cell 62:1177-1187. This demonstrates that DNA-tethered AH recruits the transcriptional machinery to the nearby promoter.
Functional Role of Dimerization Element. The functional necessity ofa dimerization element was investigated by the evaluation of conjugates containing the activator peptide AH separated by flexible straight-chain linkers of 12 atoms (compound 5), 36 atoms (compound 8), and 55 atoms (compound 9) (Fig. 6). As shown in Figure 10A, compound 8 (PA-1L-AH) activated transcription at approximately 50% the level of compound 3 (PA-Gcn4-AH). Increasing the linker length to 55 atoms (compound 9) did not result in a further increase in activation levels; this is likely because of the flexibility of the linker moiety, which can not project AH fully away from DNA. The use ofa shorter linker (compound 5, PA- AH) provided a compound that activated transcription 25% as well as conjugate 3 (PA-Gcn4-AH), confirming that spatial separation of the activator moiety from DNA plays a role in the efficiency of activation. "'
Two observations also demonstrate that compounds 5, 8, and 9 do not dimerize. As shown in Figure 10B, data from quantitative DNase I footprinting titrations were fit by noncooperative isotherms (KD for compound 5 = 19 nM; KD for compound 8= 32 nM). Furthermore, in contrast to titrations containing compound 3 (PA-Gcn4-AH), DNase I-mediated cleavage was observed at positions between the monomeric binding sites within the palindrome.
These data demonstrate that synthetic regulatory compounds according to the invention can regulate transcription of a desired gene. These results also show that dimerization per se between two compounds is not required for activator function, nor is a natural DNA binding domain essential for activation. EXAMPLE 6
Altering size, composition, site of attachment of activating regions on the polyamide As described above, naturally occurring transcriptional activator proteins minimally comprise two functions, one for DNA binding and the other for activation. Example 5 describes the construction and testing of synthetic regulatory compounds that comprise non-natural nucleic acid binding moieties and various linkers in combination with the designed amphipathic AH activator peptide, a motif later found to occur in many native activation proteins. One of those synthetic regulatory activators, 4.2 kDa in size, comprised an eight-ring DNA binding hairpin polyamide tethered through a flexible ethylene glycol-based polyether linker to the 20 residue AH activating peptide stimulated high levels of promoter-specific transcription. One surprise from that study was that a dimerization domain was unnecessary for function of the activation moiety.
This example further demonstrates the utility of the nucleic acid binding moiety-linker-regulatory moiety motif by showing that even smaller synthetic ligands that mimic the activities of naturally occurring regulatory proteins can be successfully assembled and have regulatory function. Small synthetic regulatory compounds are preferred in order to increase the probability of membrane permeability without appreciable loss of specific gene-regulating activity. In this example, synthetic regulatory compounds are provided that comprise sequence-specific polyamides attached via a linker to an activation peptide comprised of 8 or 16 residues derived from the activation domain of the potent viral activator VPl 6. The 16 residue activation moiety coupled to the polyamide via a linker activated transcription three fold better than the analogous AH conjugate described in Example 5. Altering the site-of-attachment of the activation moiety on the polyamide allowed reduction of the intervening linker from 36 atoms to eight without significant diminution of the activation potential. Also provided are
røSTITUTE SHEET RULE 26
synthetic regulatory compounds containing different polyamides to target a different sequence without compromising activator function, further emphasizing the generality of this motif.
These synthetic regulatory compounds have tunable potency effected by the size and identity of the activating moiety as well as the site of attachment to the " ' polyamide. In particular, these results reveal a potency two to three times greater than PA-1L-AH conjugate 8 in Example 5) with concomitant decreases in molecular weight of 21%) and 12%, respectively, and that these synthetic polyamide-linker- activator regulatory compounds bind to their cognate target sequences upstream of the AdML promoter. These results also show that levels of compounds required for full promoter occupancy correspond to the levels required to elicit maximal activation in vitro, and further confirm that changing the DNA binding domain can be accomplished while retaining significant transcription-activating function. Materials and Methods Referring to Figure 11, conjugates 1, 5, and 10 were synthesized and transformed. Polyamides 1, 5, and 10 were prepared according to established solid phase synthesis protocols and subsequently transformed into conjugates 2, 3, 4, 6, 7, 8, and 11 by the previously reported methods. Conjugate 9 was prepared in an analogous fashion. The identity of all conjugates was verified by MALDI-TOF mass spectrometry.
Characterization: conjugate 2 (PA-1L-AH): MALDI-TOF [M+H] (average mass) calculated 4164.7, observed 4164.6; conjugate 3 (PA-1L-VP2): MALDI-TOF: [M-H] (average mass) calculated 3670J, observed 3670J; conjugate 4 (PA-1L- VP1) MALDI-TOF [M+H] (monoisotopic mass) calculated 2763.25, observed 2763.53; conjugate 6 (PA-AH): MALDI-TOF: [M+H] (average mass) calculated 3774.2, observed 3774.5; conjugate 7 (PA-VP2): MALDI-TOF [M+H] (monoisotopic mass) calculated 3280.4, observed 3280.5; conjugate 8 (PA- VPl): MALDI-TOF [M+H] (monoisotopic mass) calculated 2374.0, observed 2374.0; conjugate 9 (PA-(py)-VP2): MALDI-TOF [M+H] (monoisotopic mass) calculated 3280.4, observed 3280.7; conjugate 11: MALDI-TOF [M+H] (monoisotopic mass) calculated 3670.6, observed 3670.7.
DNase I Footprinting Titrations. A 363-bp 5'-32P-labeled PCR fragment was generated from template plasmid pAZA812 in accordance with standard protocols and isolated by nondenaturing gel electrophoresis. All DNase I footprinting reactions were carried out in a volume of 40 μL. 0.8 ng/μL of plasmid pPT7 was used in these reactions as unlabeled carrier DNA. A polyamide stock"" ' solution or water (for reference lanes) was added to an assay buffer where the final concentrations were 50 mM HEPES, 100 mM KOAc, 15 mM Mg(OAc)2, 5 mM CaCl2, 6.5% glycerol, 1 mM DTT, pH 7.0 and 15 kcpm 5'-radiolabeled DNA. The solutions were equilibrated for 75 min. at 22°C. Cleavage was initiated by the addition of 4 μL of a DNase I stock solution and was allowed to proceed for 7 min. at 22°C. The reactions were stopped by adding 10 μL of a solution containing 2.25 M NaCl, 150 mM EDTA, 0.6 mg/mL glycogen, and 30 μM base pair calf thymus DNA and then ethanol-precipitated. The cleavage products were resuspended in 100 mM Tris-borate-EDT A/80% formamide loading buffer, denatured at 85°C for 10 min, and immediately loaded onto an 8% denaturing polyacrylamide gel (5% crosslink, 7 M urea) at 2000 V for 2 h. 15 min. The gels were dried under vacuum at 80°C and quantitated using storage phosphor technology.
In Vitro Transcription Assays. Template plasmid pAZA812 was constructed by cloning a 78 bp oligomer bearing three cognate palindromic sequences for conjugates 2, 3, 4, 6, 7, 8, and 9 into a Eg/II site 30 bp upstream of the TATA box of pMLΔ53. This plasmid has the AdML TATA box 30 bp upstream of a 277 bp G-less cassette. Template pAZA813 was constructed by cloning a 78 bp oligomer containing three palindromic cognate sites for compound 11 into a Eg II site 30 bp upstream of the TATA box of pMLΔ53 (22). For each reaction, 20 ng of plasmid (30 femtomoles of palindromic sites) was preincubated with conjugate for 75 minutes prior to the addition of 90 ng of yeast nuclear extract in a 25 μl reaction volume under standard conditions. The reactions were processed as previously described and resolved on 8% 30:1 polyacrylamide gels containing 8 M urea (see Example 5). Gels were dried and exposed to photostimulatable phosphorimaging plates (Fuji Photo Film Co.). Data were visualized using a Fuji phosphorimager followed by quantitation using MacBAS software (Fuji Photo Film Co.).
SU Bi SΠTUTE SHEET PULE 26)
Results and Discussion
Synthesis of Synthetic Activators. The activation domain (residues 411- 490) of VPl 6 fused to a DNA binding protein yielded a synthetic regulatory compound that activated transcription with a potency comparable to strong natural activator proteins. Dissection of this and other activation domains "has identifie minimal units that activate transcription, these modules are often surreptitiously iterated in natural activating regions. In VPl 6, one such minimal moiety comprises eight amino acids. When iterated, the activation potential of the consequent peptide increased in a synergistic rather than an additive manner. This property of activating regions was adopted in order to design artificial activators of varying strengths. A series of polyamide-linker-(VP16 minimal module)-, compounds (where x is 1 or 2) was designed, each of which comprises: a hairpin polyamide designed to target the cognate sequence 5'-TGTTAT-3'; a flexible tether of varying length (12 or 36 atoms); and one of three different activating regions (see Figure 11). The three activating regions were AH and one (VPl)or two (VP2) tandem repeats of the eight amino acid sequence derived from VPl 6 (SEQ ID NOs: 16 and 17).
Also, it was determined that a critical role of the linker is to facilitate projection of the activating moiety away from the DNA for productive interaction with the transcriptional machinery. Initially, this was achieved by conjugating a polyether linker to the C-terminus of hairpin polyamides. Conjugation via linkage at an internal pyrrole residue also appeared attractive, as solution studies and x-ray crystallographic data demonstrated that the N-methyl group of the pyrrole residues is directed outward from the minor groove when a polyamide binding in the 2: 1 motif forms a sequence-specific complex with dsDNA. Compounds 1 and 5 and conjugates 2-4 and 6-9 were prepared by previously reported methods, and their identity was verified by matrix-assisted laser-desorption ionization time-of-flight (MALDI-TOF) mass spectrometry. Conjugates 2, 3, and 4 included a 36 atom polyether tether as the linker. Conjugates 6, 7, and 8 utilized a shorter 12 atom linking region. Promoter occupancy under transcriptional conditions. The dissociation constant (KD) for compound 2 binding to its cognate site (5'-TGTTAT-3') was measured as 32 nM using quantitative DNase I footprinting titrations. However, the
HEET RULE
conditions for those experiments were substantially different than those of the typical in vitro transcription assays. For example, in quantitative DNase I footprinting experiments, for accurate determination of KD for the ligand - DNA complex the concentration of DNA preferably is at least about a 10-fold excess of polyamide relative to DNA. In contrast, in vitro transcription assay's employ a relatively high concentration of plasmid DNA (0.8 ng/μL), which can cause a decrease in the occupancy of a target nucleotide sequence by 10- to 100-fold.
To investigate the binding behavior of compound 2 under such conditions, a 5'-32P-labeled 363 bp DNA fragment containing both the promoter region and 140 bp of the G-less cassette reporter was generated. To each footprinting reaction, unlabeled plasmid DNA was added to bring the total concentration of polyamide binding sites to a level equivalent to those utilized in transcription assays. As anticipated, DNase I footprinting titrations run under the conditions of the in vitro transcription assays revealed that 50% occupancy of the three dimeric binding sites occurs at a concentration of 215 nM, approximately a 7-fold increase over the measured KD (Figure 12). Moreover, compound 2 binds specifically to the three palindromic sites with no binding observed at concentrations of up to 1 μM in the G- less cassette region or at single base pair mismatch sites present elsewhere in the DNA fragment. Promoter occupancy and the level of activation. An in vitro transcription titration experiment with compound 2 demonstrated that full occupancy of the promoter is necessary for maximal activator function are presented in Table 1. Table 1
At a concentration of 100 n-M, detectable levels (>4-fold) of transcriptional activation were observed, and as the concentration was increased to provide full saturation of the dimeric sites, activation levels rose to 20-fold over basal. These data show that optimal synthetic regulatory compound concentration for in vitro " ' transcription assays can be predicted using data from DNase I footprinting titrations under such conditions. Therefore, additional DNase I footprinting titration experiments were carried out, revealing that the compounds used in this example required concentrations of 75 to 215 nM to attain 50% target site occupancy. Based upon these data, all subsequent in vitro transcription experiments were carried out at synthetic regulatory compound concentrations of 200-400 nM.
Activating potential corresponds to size and site-of-attachment of the activation moiety. The in vitro transcription experiments reveal that the activation strength of a synthetic regulatory compound is proportional to the size of the activating region, and that projection of the activating region away from the DNA enhances its functionality. When compared with compound 2 (PA-1L-AH), compound 8 (PA-VP1), which lacked a linker, is a poor activator, and that substituting VPl with VP2 increased the activation strength of the resulting compound. Consistent with the requirement for the activating moiety to access targets in the transcriptional machinery, projecting the VPl or VP2 module via a longer linker further improved the activation potential the synthetic regulatory compounds. From Figure 12, it is apparent that a VP2 peptide attached to the C-terminus of a polyamide via a 36 atom linker (PA-1L-VP2; compound 3) was the most potent of the polyamide-linker-activating moiety compounds tested in this example.
However, compound 9, where VP2 was attached to an internal pyrrole residue via an eight atom linker, activated transcription robustly despite the absence ofa long linker, demonstrating that projection of the activating region from this position is particularly effective. Transcriptional stimulation by each compound was dependent on the presence of cognate binding sites upstream of the promoter. Thus, on a template bearing mismatched binding sites, compounds 3, 7, and 9 did not stimulate transcription effectively.
Replacing the DNA binding moiety alters specificity. To demonstrate the generality of synthetic regulatory compound motif of the invention, the identity of the DNA recognition domain was changed and did not compromise function. Specifically, compound 11 targeted the sequence 5'-AGGTCA-3', and incorporated the polyether linker as the linking domain and VP2 as the activating region. This compound had proven to be the most active of all tested in the experiments reported this example (Figure 13). Control polyamide 10 and compound 11 were prepared by established protocols. As shown in Figure 14, a template bearing three binding sites 40bp upstream of the AdML G-less cassette reporter was constructed. The substitution of the nucleic acid binding moiety resulted in, as expected, a synthetic activator that specifically targeted the template bearing its cognate DNA binding sites and not the template bearing sites for the previous polyamide (Figure 14).
EXAMPLE 8 Metazoan Cell Culture Experiments This example describes experiments that can be used to assess whether a synthetic regulatory compound according to the invention is cell permeable, and cell permeability for members of this class of compounds is demonstrated for the first time.
The synthetic regulatory compounds used in this example employed polyamide molecules as non-natural nucleic acid binding moieties. The compounds were tested for cell permeability against two cell lines, SKOV (a cisplatin resistant human ovarian cancer cell line) and 293T (a human renal cell line) cells. The compounds were also tested for their ability to activate a transiently transfected reporter gene the expression of which could be up-regulated by activation of a promoter functionally associated with one DR5 target site. Other cells, including, without limitation, COS, Cho, Jurkat, and HeLa cells, are also suitable for use with routine modifications in the experiments reported herein.
The reporter gene carried by the cells comprised a minimal HS V TK promoter driving the expression ofa luciferase gene. Promoter activity was regulated by a consensus DR5 site approximately 50 bp upstream of the TK promoter. The DR5 site comprises a direct repeat of two consensus hexameric sequences separated by five nucleotides, the identity of which was irrelevant. One
UTE SHEET (RULE 26)
of the consensus hexameric nucleotide sequences in the DR5 site was 5'-AGGTCA- 3', which also corresponds to an estrogen receptor half-site.
The polyamide moieties of the synthetic regulatory compounds used in this example targeted the nucleotide sequence 5'-WGGWCA-3', and were fused via a PEG linker at the C-terminal tail to either the L- or the D- form o a VP2 activating region (AMyl54 and AMyl55, respectively). Prior to testing these two conjugates on reporter cells, these compounds were tested for their ability to activate transcription in standard cell-free yeast system that comprised a reporter construct having three tandem 5'-AGGTCA-3' sites located 40 bp upstream of the AdML TATA:G-less cassette. The in vitro transcription experiments were performed as reported above. As expected, the AMyl51 polyamide alone did not stimulate transcription over basal conditions. The synthetic regulatory compound AMγl54 (PA-1L-VP2) activated transcription about 12 fold, and AMV155 (PA-1L-D-VP2) activated transcription about ten-fold in the in vitro assays. The observed levels of activation were consistent with the fact that only three half-sites, rather than three complete palindromes, were used in the reporter construct employed in the in vitro transcription assays.
The cells used for the cell permeability and cell-based reporter assays were passaged twice and then grown overnight in 6-well plates to sub-confluency in CO incubators. Two micrograms of DR5-Luc DNA and 0.5 micrograms of CMV-βgal were transfected in each of the wells. One well on each plate contained no reporter construct, and one contained a strong CMV promoter driving the Luc reporter gene to check for transfection efficiency. SKOV cells were transfected using DOTAP (a cationic lipid transfection system), and the 293 cells were transfected by the standard calcium phosphate precipitation method.
After DNA addition, the cells were incubated for 12 hours and then washed with fresh DMEM media containing 10% charcoal-stripped Fetal Bovine Serum (to remove ligands of nuclear receptors that might activate transcription of the reporter). Cells were then allowed to recover for 12 hours in 2 mL of DMEM+15% FBS (stripped). After washing out this media, cells were supplied with 1 mL of
DMEM+10% FBS(stripped) containing AMV151, AMV154, or AMV155 at 1 μM concentrations.
Luciferase activity was measured using standard luminomitor techniques. The results of these experiments appear in Tables 2 and 3.
Table 2 Activity in SKOV Cells
Table 3 Activity in 293T Cells
It is important to note that of the two cells lines, it was clear that the 293T cells were more readily transfected, as the stronger CMN-Luc reporter construct resulted in approximately 10-fold more luciferase activity as compared the DR5-TK- Luc (compare row 2 and 3). Despite the low transfection efficiency of the calcium phosphate technique, sufficient numbers of reporter constructs were taken up. Significantly, the synthetic regulatory compounds were taken up by each of the cell types tested, and they stimulated luciferase expression. Surprisingly, compounds containing the D-enantiomer of the activator stimulated more reporter gene
expression than the activator made using amino acid forms found naturally in proteins. While not wishing to be bound to a particular theory, it is believed that activating regions typically are unstructured in solution, and they are fairly hydrophobic. Such peptides are typically rapidly degraded in cells. Thus, the higher activity of the D-form in activating reporting expression in 293 cells can reflect ah ability to resist proteolysis.
The contents of the articles, patents, and patent applications, and all other documents and electronically available information mentioned or cited herein, are hereby incorporated by reference in their entirety to the same extent as if each individual publication was specifically and individually indicated to be incorporated by reference. Applicants reserve the right to physically incorporate into this application any and all materials and information from any such articles, patents, patent applications, or other documents.
The inventions illustratively described herein can suitably be practiced in the absence of any element or elements, limitation or limitations, not specifically disclosed herein. Thus, for example, the terms "comprising", "including," containing", etc. shall be read expansively and without limitation. Additionally, the terms and expressions employed herein have been used as terms of description and not of limitation, and there is no intention in the use of such terms and expressions of excluding any equivalents of the features shown and described or portions thereof, but it is recognized that various modifications are possible within the scope of the invention claimed. Thus, it should be understood that although the present invention has been specifically disclosed by preferred embodiments and optional features, modification and variation of the inventions embodied therein herein disclosed can be resorted to by those skilled in the art, and that such modifications and variations are considered to be within the scope of the inventions disclosed herein.
The inventions have been described broadly and generically herein. Each of the narrower species and subgeneric groupings falling within the generic disclosure also form part of these inventions. This includes the generic description of each invention with a proviso or negative limitation removing any subject matter from the
genus, regardless of whether or not the excised material is specifically recited herein.
Other embodiments are within the following claims. In addition, where features or aspects of an invention are described in terms of a Markush group, those skilled in the art will recognize that the invention is also thereby " scribed in terms of any individual member or subgroup of members of the Markush group.
UBSTfTUTE SHEET (RULE 26)
Claims
1. A cell permeable synthetic regulatory compound comprising:
(a) a non-natural nucleic acid binding moiety;
(b) a regulatory moiety; and _Λ. (c) a linker connecting the non-natural nucleic acid binding element to the regulatory moiety, or a pharmaceutically acceptable salt of such synthetic regulatory compound.
2. The synthetic regulatory compound according to claim 1 wherein the non- natural nucleic acid binding moiety is a molecular that binds to double-stranded DNA via hydrogen bonds, van der Waals forces and/or electrostatic interactions.
3. The synthetic regulatory compound according to claim 1 wherein the non- natural nucleic acid binding moiety is selected from the group consisting of a polyamide, a peptide nucleic acid, and an oligonucleotide.
4. The synthetic regulatory compound according to claim 1 wherein the non- natural nucleic acid binding moiety comprises a structure
- Qι -Z, - Q2 - Z2 - ... - Ojn - Zm -
wherein each of Qi, Q , ... Qm is independently selected from a heteroaromatic moiety and (CH2)P, wherein p is an integer between 1 and 3, inclusive; wherein each of Zi, Z2, ..., Zm is independently selected from the group consisting ofa covalent bond and a linking group; and m is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, or 16.
5. A synthetic regulatory compound according to claim 4 wherein, with respect to the non-natural nucleic acid binding moiety, at least one of Ql3 Q , ... Qm is a heteroaromatic moiety.
6. A synthetic regulatory compound according to claim 5 wherein, with respect to the non-natural nucleic acid binding moiety, the heteroaromatic moiety is selected from the group consisting of substituted and unsubstituted imidazole and pyrrole moieties.
7. A synthetic regulatory compound according to claim 4 wherein, with respect to the non-natural nucleic acid binding moiety, at least one of Zl5 Z2, ... , Zm is a linking group having 2, 3, 4, or 5 backbone atoms.
8. A synthetic regulatory compound according to claim 7 wherein, with respect to the non-natural nucleic acid binding moiety, each of Zi, Z , ..., Zm is a carboxamide group.
9. A synthetic regulatory compound according to claim 5 wherein, with respect to the non-natural nucleic acid binding moiety, at least 60% of Qi, Q , ... Qm are heteroaromatic moieties independently selected from the group consisting of substituted and unsubstituted imidazole and pyrrole moieties, and wherein each of Zi, Z2, ..., Zm is a carboxamide group.
10. A synthetic regulatory compound according to claim 9 wherein the non- natural nucleic acid binding moiety is cyclic.
11. A synthetic regulatory compound according to claim 9 wherein the non- natural nucleic acid binding moiety is capable of forming an intermolecular 2:1 binding motif under physiological conditions in the presence of a double-stranded DNA molecule comprising a corresponding target sequence.
12. A synthetic regulatory compound according to claim 9 wherein the non- natural nucleic acid binding moiety is capable of forming an intramolecular 2:1 binding motif under physiological conditions in the presence of a double-stranded DNA molecule comprising a corresponding target sequence.
13. A synthetic regulatory compound according to claim 9 wherein the non- natural nucleic acid binding moiety has, under physiological conditions, a binding specificity for its corresponding target sequence in a double-stranded DNA of at least about two as compared to a mismatch target sequence.
14. A synthetic regulatory compound according to claim 9 wherein the non- natural nucleic acid binding moiety has, under physiological conditions, a binding specificity for its corresponding target sequence in a double-stranded DNA of at least about ten as compared a mismatch target sequence.
15. A synthetic regulatory compound according to claim 9 wherein the non- natural nucleic acid binding moiety has at least about submicromolar binding affinity for its corresponding target sequence in a double-stranded DNA under physiological conditions.
16. A synthetic regulatory compound according to claim 9 wherein the non- natural nucleic acid binding moiety has at least about subnanomolar binding affinity for its corresponding target sequence in a double-stranded DNA under physiological conditions.
17. A synthetic regulatory compound according to claim 1 wherein the regulatory moiety is selected from the group consisting of a small organic molecule, a lipid, a peptide, a carbohydrate, a nucleic acid and a peptide nucleic acid.
18. A synthetic regulatory compound according to claim 1 wherein the regulatory moiety that decreases expression of a target gene.
19. A synthetic regulatory compound according to claim 1 wherein the regulatory moiety that increases expression ofa target gene.
20. A synthetic regulatory compound according to claim 1 wherein the linker is covalently linked to each of the nucleic acid binding moiety and the regulatory moiety.
21. A synthetic regulatory compound according to claim 20 wherein the linker comprises from 1 to about 200 spacing moieties.
22. A synthetic regulatory compound according to claim 21 wherein at least one of the spacing moieties of the linker is -(CH2)-.
23. A synthetic regulatory compound according to claim 21 wherein the linker is attached to a terminal moiety of the non-natural nucleic acid binding moiety.
24. A synthetic regulatory compound according to claim 21 wherein the linker is attached to an internal moiety of the non-natural nucleic acid binding moiety.
25. A synthetic regulatory compound according to claim 24 wherein the internal moiety of the non-natural nucleic acid binding moiety is selected -rρm the group consisting of a γ-aminobutyric acid, β-alanine, a substituted pyrrole, an unsubstituted pyrrole, a substituted imidazole, and an unsubstituted imidazole.
26. A composition comprising a synthetic regulatory compound according to claim 1 and a carrier.
27. A composition according to claim 26 wherein the carrier is a pharmaceutically acceptable carrier.
28. A composition according to claim 26 that is a liquid composition.
29. A composition according to claim 26 that is a dry composition.
30. A cell containing a synthetic regulatory compound according to claim 1.
31. A cell according to claim 30 selected from the group consisting of an animal cell and a plant cell.
32. A cell according to claim 30 selected from the group of cells consisting of bovine, canine, equine, feline, murine, ovine, porcine, and primate cells.
33. A cell according to claim 29 that is a human cell.
34. A cell according to claim 29 that is in vitro.
35. A cell according to claim 29 that is in vivo.
36. A complex comprising a synthetic regulatory compound according to claim 1 complexed with a double-stranded DNA.
37. A method of forming a complex according to claim 36, comprising exposing a composition containing a double-stranded DNA comprising a target sequence to the synthetic regulatory compound.
38. A method according to claim 37 wherein the compositiorus-a cell.
39. A method of regulating transcription of a regulatable gene, comprising exposing a double-stranded DNA encoding the regulatable gene to a synthetic regulatory compound according to claim 1 capable of regulating transcription thereof under transcription conditions.
40. A method according to claim 39 performed in vitro.
41. A method according to claim 39 performed in vivo.
42. A method of screening for a synthetic regulatory compound according to claim 1, comprising exposing, under transcription conditions, a double-stranded DNA encoding a regulatable gene to a plurality of test compounds, each of which comprises: (a) a non-natural nucleic acid binding moiety targeted to a transcription- associated regulatory element of the regulatable gene;
(b) an activation element; and
(c) a linker connecting the non-natural nucleic acid binding element to the activation element, and determining whether any of the test compounds regulates expression of the regulatable gene.
43. A method according to claim 42 performed in vitro.
44. A method according to claim 42 performed in vivo.
45. A method according to claim 42 wherein the regulatable gene is a marker gene.
46. A synthetic regulatory compound comprising:
SUBSTITUTE SHEET (RULE 26; (a) a first nucleic acid binding moiety and a second nucleic acid binding moiety, wherein at least one of the first or second nucleic acid binding moieties is a non-natural nucleic acid binding moiety;
(b) a regulatory moiety; and (c) at least one linker connecting the first or the secoήfnucleic acid " binding moiety to the regulatory moiety, or a pharmaceutically acceptable salt of such synthetic regulatory compound.
47. A synthetic regulatory compound comprising:
(a) a non-natural nucleic acid binding moiety; (b) a plurality of regulatory moieties; and
(c) a linker connecting the non-natural nucleic acid binding moiety to the regulatory moieties, or a pharmaceutically acceptable salt of such synthetic regulatory compound.
48. A synthetic regulatory compound comprising: (a) a non-natural nucleic acid binding moiety other than a hairpin polyamide;
(b) a regulatory moiety; and
(c) a linker connecting the non-natural nucleic acid binding moiety to the regulatory moiety, or a pharmaceutically acceptable salt of such synthetic regulatory compound.
49. A synthetic regulatory compound comprising:
(a) a non-natural nucleic acid binding moiety;
(b) a regulatory moiety other than one that solely recruits a mediator complex; and (c) a linker connecting the non-natural nucleic acid binding moiety to the regulatory moiety, or a pharmaceutically acceptable salt of such synthetic regulatory compound.
50. A synthetic regulatory compound comprising: (a) a non-natural nucleic acid binding moiety; (b) a regulatory moiety other than a small molecule; and
(c) a linker connecting the non-natural nucleic acid binding moiety to the regulatory moiety, or a pharmaceutically acceptable salt of such synthetic regulatory compound.
51. The synthetic regulatory compound according to claim 1 wherein the regulatory moiety targets chromatin remodeling activities.
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/US2000/029617 WO2002034295A1 (en) | 2000-10-27 | 2000-10-27 | Synthetic regulatory compounds |
| AU2001213481A AU2001213481A1 (en) | 2000-10-27 | 2000-10-27 | Synthetic regulatory compounds |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/US2000/029617 WO2002034295A1 (en) | 2000-10-27 | 2000-10-27 | Synthetic regulatory compounds |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2002034295A1 true WO2002034295A1 (en) | 2002-05-02 |
Family
ID=21741930
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2000/029617 WO2002034295A1 (en) | 2000-10-27 | 2000-10-27 | Synthetic regulatory compounds |
Country Status (2)
| Country | Link |
|---|---|
| AU (1) | AU2001213481A1 (en) |
| WO (1) | WO2002034295A1 (en) |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2002000262A3 (en) * | 2000-06-26 | 2002-10-31 | Univ Geneve | Modulation of chromosome function by chromatin remodeling agents |
| WO2002031166A3 (en) * | 2000-10-13 | 2003-09-04 | Crosslink Genetics Corp | Artificial transcriptional factors and methods of use |
| EP2199281A1 (en) * | 2008-12-19 | 2010-06-23 | DSM IP Assets B.V. | Thioic acids as building block for polythioesters |
| US8017755B2 (en) | 2003-05-23 | 2011-09-13 | President And Fellows Of Harvard College | RNA-based transcriptional regulators |
| US10517877B2 (en) | 2016-03-30 | 2019-12-31 | Wisconsin Alumni Research Foundation | Compounds and methods for modulating frataxin expression |
| US11759533B2 (en) | 2017-03-29 | 2023-09-19 | Wisconsin Alumni Research Foundation | Methods and compositions for modulating gene expression |
-
2000
- 2000-10-27 WO PCT/US2000/029617 patent/WO2002034295A1/en active Application Filing
- 2000-10-27 AU AU2001213481A patent/AU2001213481A1/en not_active Abandoned
Non-Patent Citations (5)
| Title |
|---|
| GOTTESFELD, JOEL M. ET AL: "Regulation of gene expression by small molecules", NATURE (LONDON) (1997), 387(6629), 202-205, XP002182603 * |
| LU, XIANGYANG ET AL: "An artificial transcriptional activating region with unusual properties", PROC. NATL. ACAD. SCI. U. S. A. (2000), 97(5), 1988-1992, XP002182592 * |
| MAPP, A. K. ET AL: "Preparation of thioesters for the ligation of peptides with non-native substrates", TETRAHEDRON LETT. (2000), 41(49), 9451-9454, XP002182590 * |
| MAPP, ANNA K. ET AL: "Activation of gene expression by small molecule transcription factors", PROC. NATL. ACAD. SCI. U. S. A. (2000), 97(8), 3930-3935, - 11 April 2000 (2000-04-11), XP002182589 * |
| OAKLEY, MARTHA G. ET AL: "Evidence that a minor groove-binding peptide and a major groove-binding protein can simultaneously occupy a common site on DNA", BIOCHEMISTRY (1992), 31(45), 10969-75, XP002182591 * |
Cited By (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2002000262A3 (en) * | 2000-06-26 | 2002-10-31 | Univ Geneve | Modulation of chromosome function by chromatin remodeling agents |
| WO2002031166A3 (en) * | 2000-10-13 | 2003-09-04 | Crosslink Genetics Corp | Artificial transcriptional factors and methods of use |
| US8017755B2 (en) | 2003-05-23 | 2011-09-13 | President And Fellows Of Harvard College | RNA-based transcriptional regulators |
| EP2199281A1 (en) * | 2008-12-19 | 2010-06-23 | DSM IP Assets B.V. | Thioic acids as building block for polythioesters |
| WO2010070132A1 (en) * | 2008-12-19 | 2010-06-24 | Dsm Ip Assets B.V. | Thioic acids as building block for polythioesters |
| CN102256938A (en) * | 2008-12-19 | 2011-11-23 | 帝斯曼知识产权资产管理有限公司 | Thioacids as structural units of polythioesters |
| US10517877B2 (en) | 2016-03-30 | 2019-12-31 | Wisconsin Alumni Research Foundation | Compounds and methods for modulating frataxin expression |
| US11759533B2 (en) | 2017-03-29 | 2023-09-19 | Wisconsin Alumni Research Foundation | Methods and compositions for modulating gene expression |
Also Published As
| Publication number | Publication date |
|---|---|
| AU2001213481A1 (en) | 2002-05-06 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| AU747998B2 (en) | Complex formation between dsDNA and oligomer of heterocycles | |
| US6143901A (en) | Complex formation between dsDNA and pyrrole imidazole polyamides | |
| JP6997177B2 (en) | SCN9A antisense oligonucleotide | |
| US20140038281A1 (en) | System for cargo delivery into the cells | |
| CA2288806A1 (en) | Complex formation between dsdna and oligomer of heterocycles | |
| EP1599474A2 (en) | Alternative heterocycles for dna recognition | |
| JP2020525462A (en) | Compounds containing staple or stitch peptides for improved drug delivery | |
| Wu et al. | Programmable pyrrole-imidazole polyamides: A potent tool for DNA targeting | |
| DE4331011A1 (en) | C-branched oligomers that bind nucleic acids for therapy and diagnostics | |
| US6559125B1 (en) | Polyamide-alkylator conjugates and related products and method | |
| US6673940B1 (en) | Compositions and methods relating to cyclic compounds that undergo nucleotide base pair-specific interactions with double-stranded nucleic acids | |
| WO2002034295A1 (en) | Synthetic regulatory compounds | |
| JP2020072716A (en) | Novel cell-penetrating compositions and methods using the same | |
| KR20240082344A (en) | Compounds and methods for skipping exon 44 in Duchenne muscular dystrophy | |
| US9403869B2 (en) | α5β1 integrin binding RGD-lipopeptides with gene transfer activities | |
| EP1144414B1 (en) | Compositions and methods relating to cyclic compounds that undergo nucleotide base pair-specific interactions with double-stranded nucleic acids | |
| US7935530B2 (en) | Polyamides with tail structures capable of binding DNA | |
| KR100780405B1 (en) | NR-specific binding peptide search method using alpha helical peptide | |
| Mapp et al. | Towards a Minimal Motif for Artificial Transcriptional Activators | |
| WO2023032726A1 (en) | Cyclic peptide, salt of cyclic peptide, pharmaceutical composition, and biological tissue calcification inhibitor | |
| US20030207400A1 (en) | Vectors for dna delivery | |
| JP5422832B2 (en) | Inactivated gene reactivation peptide | |
| CN119060138A (en) | Nectin-4 targeted cyclic peptide and its preparation method and application | |
| Phillips | Biological Activity of a Py-Im Polyamide Androgen Receptor Antagonist | |
| JP2003289871A (en) | Compound comprising peptide nucleic acid and oligopeptide, method for producing the same and application of the same |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
| REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
| 122 | Ep: pct application non-entry in european phase | ||
| NENP | Non-entry into the national phase |
Ref country code: JP |







