WO2018118585A1 - Antiviral compositions - Google Patents
Antiviral compositions Download PDFInfo
- Publication number
- WO2018118585A1 WO2018118585A1 PCT/US2017/066108 US2017066108W WO2018118585A1 WO 2018118585 A1 WO2018118585 A1 WO 2018118585A1 US 2017066108 W US2017066108 W US 2017066108W WO 2018118585 A1 WO2018118585 A1 WO 2018118585A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- composition
- target
- dna
- binding domain
- cas endonuclease
- Prior art date
Links
- 239000000203 mixture Substances 0.000 title claims abstract description 104
- 230000000840 anti-viral effect Effects 0.000 title description 13
- 230000003612 virological effect Effects 0.000 claims abstract description 105
- 108020005004 Guide RNA Proteins 0.000 claims abstract description 90
- 230000008685 targeting Effects 0.000 claims abstract description 83
- 108010042407 Endonucleases Proteins 0.000 claims abstract description 77
- 230000004568 DNA-binding Effects 0.000 claims abstract description 71
- 230000027455 binding Effects 0.000 claims abstract description 61
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 46
- 230000000295 complement effect Effects 0.000 claims abstract description 41
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 33
- 108020004414 DNA Proteins 0.000 claims abstract description 31
- 102000004533 Endonucleases Human genes 0.000 claims abstract description 17
- 108010073062 Transcription Activator-Like Effectors Proteins 0.000 claims description 76
- 108091034117 Oligonucleotide Proteins 0.000 claims description 38
- 241000701044 Human gammaherpesvirus 4 Species 0.000 claims description 24
- 208000036142 Viral infection Diseases 0.000 claims description 20
- 230000009385 viral infection Effects 0.000 claims description 20
- 235000018102 proteins Nutrition 0.000 claims description 19
- 241000701806 Human papillomavirus Species 0.000 claims description 17
- 241000700584 Simplexvirus Species 0.000 claims description 14
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 claims description 14
- 239000011701 zinc Substances 0.000 claims description 14
- 229910052725 zinc Inorganic materials 0.000 claims description 14
- 241000701022 Cytomegalovirus Species 0.000 claims description 13
- 241001502974 Human gammaherpesvirus 8 Species 0.000 claims description 12
- 108010017070 Zinc Finger Nucleases Proteins 0.000 claims description 12
- 150000001413 amino acids Chemical class 0.000 claims description 11
- 241000701085 Human alphaherpesvirus 3 Species 0.000 claims description 9
- 235000001014 amino acid Nutrition 0.000 claims description 9
- 241000700721 Hepatitis B virus Species 0.000 claims description 8
- 241000701041 Human betaherpesvirus 7 Species 0.000 claims description 8
- 241000701027 Human herpesvirus 6 Species 0.000 claims description 8
- 239000002773 nucleotide Substances 0.000 claims description 8
- 125000003729 nucleotide group Chemical group 0.000 claims description 8
- OAKJQQAXSVQMHS-UHFFFAOYSA-N Hydrazine Chemical compound NN OAKJQQAXSVQMHS-UHFFFAOYSA-N 0.000 claims description 6
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 claims description 6
- 241000973051 Paraburkholderia rhizoxinica Species 0.000 claims description 6
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 claims description 5
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 claims description 5
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 claims description 4
- 239000004472 Lysine Substances 0.000 claims description 4
- 241000579048 Merkel cell polyomavirus Species 0.000 claims description 4
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 claims description 4
- 239000004473 Threonine Substances 0.000 claims description 4
- 235000003704 aspartic acid Nutrition 0.000 claims description 4
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 claims description 4
- 235000013922 glutamic acid Nutrition 0.000 claims description 4
- 239000004220 glutamic acid Substances 0.000 claims description 4
- 235000018977 lysine Nutrition 0.000 claims description 4
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 claims description 3
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 claims description 3
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 claims description 3
- PEEHTFAAVSWFBL-UHFFFAOYSA-N Maleimide Chemical compound O=C1NC(=O)C=C1 PEEHTFAAVSWFBL-UHFFFAOYSA-N 0.000 claims description 3
- 235000018417 cysteine Nutrition 0.000 claims description 3
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims description 3
- 150000002463 imidates Chemical class 0.000 claims description 3
- 239000000126 substance Substances 0.000 claims description 3
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 claims description 2
- 125000001500 prolyl group Chemical group [H]N1C([H])(C(=O)[*])C([H])([H])C([H])([H])C1([H])[H] 0.000 claims description 2
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 claims description 2
- 101710163270 Nuclease Proteins 0.000 abstract description 98
- 108091033409 CRISPR Proteins 0.000 abstract description 22
- 238000000034 method Methods 0.000 abstract description 16
- 238000010354 CRISPR gene editing Methods 0.000 abstract description 8
- 230000009438 off-target cleavage Effects 0.000 abstract description 8
- 230000007246 mechanism Effects 0.000 abstract description 2
- 102100031780 Endonuclease Human genes 0.000 description 60
- 241000700605 Viruses Species 0.000 description 34
- 102000039446 nucleic acids Human genes 0.000 description 25
- 108020004707 nucleic acids Proteins 0.000 description 25
- 150000007523 nucleic acids Chemical class 0.000 description 25
- 239000002105 nanoparticle Substances 0.000 description 24
- 102000053602 DNA Human genes 0.000 description 21
- 108020004682 Single-Stranded DNA Proteins 0.000 description 20
- 108010081734 Ribonucleoproteins Proteins 0.000 description 17
- 102000004389 Ribonucleoproteins Human genes 0.000 description 17
- 239000002202 Polyethylene glycol Substances 0.000 description 16
- 229920001223 polyethylene glycol Polymers 0.000 description 16
- 210000004027 cell Anatomy 0.000 description 15
- 241000282414 Homo sapiens Species 0.000 description 14
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 14
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 10
- 108020004999 messenger RNA Proteins 0.000 description 10
- 229940024606 amino acid Drugs 0.000 description 8
- 238000003776 cleavage reaction Methods 0.000 description 8
- 230000006870 function Effects 0.000 description 8
- 230000007017 scission Effects 0.000 description 8
- 125000006850 spacer group Chemical group 0.000 description 8
- 239000013612 plasmid Substances 0.000 description 7
- 230000010076 replication Effects 0.000 description 7
- 210000001519 tissue Anatomy 0.000 description 7
- 101150059079 EBNA1 gene Proteins 0.000 description 6
- 238000012650 click reaction Methods 0.000 description 6
- 150000002632 lipids Chemical class 0.000 description 6
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 5
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 5
- 108091028113 Trans-activating crRNA Proteins 0.000 description 5
- 208000015181 infectious disease Diseases 0.000 description 5
- 102000004196 processed proteins & peptides Human genes 0.000 description 5
- 108090000765 processed proteins & peptides Proteins 0.000 description 5
- 230000001225 therapeutic effect Effects 0.000 description 5
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- 239000004471 Glycine Substances 0.000 description 4
- 238000006736 Huisgen cycloaddition reaction Methods 0.000 description 4
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 4
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 4
- 239000007795 chemical reaction product Substances 0.000 description 4
- 239000003795 chemical substances by application Substances 0.000 description 4
- 238000013461 design Methods 0.000 description 4
- 210000000981 epithelium Anatomy 0.000 description 4
- 229920001184 polypeptide Polymers 0.000 description 4
- 230000000699 topical effect Effects 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 3
- 239000004971 Cross linker Substances 0.000 description 3
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- 208000032420 Latent Infection Diseases 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- 238000010459 TALEN Methods 0.000 description 3
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 3
- 108020005202 Viral DNA Proteins 0.000 description 3
- 125000002015 acyclic group Chemical group 0.000 description 3
- 150000001345 alkine derivatives Chemical class 0.000 description 3
- 150000001412 amines Chemical class 0.000 description 3
- 230000003466 anti-cipated effect Effects 0.000 description 3
- 150000001540 azides Chemical class 0.000 description 3
- 125000002091 cationic group Chemical group 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 239000003814 drug Substances 0.000 description 3
- 208000006454 hepatitis Diseases 0.000 description 3
- 231100000283 hepatitis Toxicity 0.000 description 3
- 229910052739 hydrogen Inorganic materials 0.000 description 3
- 239000001257 hydrogen Substances 0.000 description 3
- 239000002502 liposome Substances 0.000 description 3
- 230000009437 off-target effect Effects 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 230000009897 systematic effect Effects 0.000 description 3
- 125000003396 thiol group Chemical group [H]S* 0.000 description 3
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- 108700024394 Exon Proteins 0.000 description 2
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 208000037581 Persistent Infection Diseases 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- 108091027544 Subgenomic mRNA Proteins 0.000 description 2
- -1 amide hydrogen Chemical class 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 235000020958 biotin Nutrition 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 239000006227 byproduct Substances 0.000 description 2
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 208000037771 disease arising from reactivation of latent virus Diseases 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 210000005260 human cell Anatomy 0.000 description 2
- 125000001183 hydrocarbyl group Chemical group 0.000 description 2
- 230000014726 immortalization of host cell Effects 0.000 description 2
- 238000005304 joining Methods 0.000 description 2
- 230000009871 nonspecific binding Effects 0.000 description 2
- 230000002085 persistent effect Effects 0.000 description 2
- 229920002401 polyacrylamide Polymers 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 238000011282 treatment Methods 0.000 description 2
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- XQUPVDVFXZDTLT-UHFFFAOYSA-N 1-[4-[[4-(2,5-dioxopyrrol-1-yl)phenyl]methyl]phenyl]pyrrole-2,5-dione Chemical compound O=C1C=CC(=O)N1C(C=C1)=CC=C1CC1=CC=C(N2C(C=CC2=O)=O)C=C1 XQUPVDVFXZDTLT-UHFFFAOYSA-N 0.000 description 1
- QWENRTYMTSOGBR-UHFFFAOYSA-N 1H-1,2,3-Triazole Chemical compound C=1C=NNN=1 QWENRTYMTSOGBR-UHFFFAOYSA-N 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- RJUHZPRQRQLCFL-IMJSIDKUSA-N Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O RJUHZPRQRQLCFL-IMJSIDKUSA-N 0.000 description 1
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 1
- MQLZLIYPFDIDMZ-HAFWLYHUSA-N Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O MQLZLIYPFDIDMZ-HAFWLYHUSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241001302512 Banna virus Species 0.000 description 1
- 241001453380 Burkholderia Species 0.000 description 1
- 108091079001 CRISPR RNA Proteins 0.000 description 1
- 206010008342 Cervix carcinoma Diseases 0.000 description 1
- 201000006082 Chickenpox Diseases 0.000 description 1
- 108020004638 Circular DNA Proteins 0.000 description 1
- 241000702669 Coltivirus Species 0.000 description 1
- 241000709687 Coxsackievirus Species 0.000 description 1
- 241000150230 Crimean-Congo hemorrhagic fever orthonairovirus Species 0.000 description 1
- VMQMZMRVKUZKQL-UHFFFAOYSA-N Cu+ Chemical compound [Cu+] VMQMZMRVKUZKQL-UHFFFAOYSA-N 0.000 description 1
- 230000007018 DNA scission Effects 0.000 description 1
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 1
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 1
- 241000725619 Dengue virus Species 0.000 description 1
- 241001115402 Ebolavirus Species 0.000 description 1
- 241000709661 Enterovirus Species 0.000 description 1
- 241000991587 Enterovirus C Species 0.000 description 1
- 241000190708 Guanarito mammarenavirus Species 0.000 description 1
- 241000893570 Hendra henipavirus Species 0.000 description 1
- 241000711549 Hepacivirus C Species 0.000 description 1
- 241000724675 Hepatitis E virus Species 0.000 description 1
- 208000037262 Hepatitis delta Diseases 0.000 description 1
- 241000724709 Hepatitis delta virus Species 0.000 description 1
- 241000709721 Hepatovirus A Species 0.000 description 1
- 208000007514 Herpes zoster Diseases 0.000 description 1
- MDCTVRUPVLZSPG-BQBZGAKWSA-N His-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 MDCTVRUPVLZSPG-BQBZGAKWSA-N 0.000 description 1
- 238000005873 Huisgen reaction Methods 0.000 description 1
- 241001479210 Human astrovirus Species 0.000 description 1
- 241000701024 Human betaherpesvirus 5 Species 0.000 description 1
- 241000046923 Human bocavirus Species 0.000 description 1
- 241000725303 Human immunodeficiency virus Species 0.000 description 1
- 241000829111 Human polyomavirus 1 Species 0.000 description 1
- 241000701460 JC polyomavirus Species 0.000 description 1
- 241000712890 Junin mammarenavirus Species 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- 241000712902 Lassa mammarenavirus Species 0.000 description 1
- 241000712898 Machupo mammarenavirus Species 0.000 description 1
- 241001115401 Marburgvirus Species 0.000 description 1
- 241000712079 Measles morbillivirus Species 0.000 description 1
- 241000711386 Mumps virus Species 0.000 description 1
- 241000526636 Nipah henipavirus Species 0.000 description 1
- 241000714209 Norwalk virus Species 0.000 description 1
- 241000702259 Orbivirus Species 0.000 description 1
- 208000002606 Paramyxoviridae Infections Diseases 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 241000125945 Protoparvovirus Species 0.000 description 1
- 206010037742 Rabies Diseases 0.000 description 1
- 241000711798 Rabies lyssavirus Species 0.000 description 1
- 241000725643 Respiratory syncytial virus Species 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 241000702670 Rotavirus Species 0.000 description 1
- 241000710799 Rubella virus Species 0.000 description 1
- 241000192617 Sabia mammarenavirus Species 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 description 1
- 206010046980 Varicella Diseases 0.000 description 1
- 241000700647 Variola virus Species 0.000 description 1
- 241000710886 West Nile virus Species 0.000 description 1
- 241000710772 Yellow fever virus Species 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 239000000611 antibody drug conjugate Substances 0.000 description 1
- 229940049595 antibody-drug conjugate Drugs 0.000 description 1
- 229940121357 antivirals Drugs 0.000 description 1
- 239000003125 aqueous solvent Substances 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 238000010462 azide-alkyne Huisgen cycloaddition reaction Methods 0.000 description 1
- 238000010461 azide-alkyne cycloaddition reaction Methods 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 201000010881 cervical cancer Diseases 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 239000000562 conjugate Substances 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 239000006071 cream Substances 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 230000037433 frameshift Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 210000001035 gastrointestinal tract Anatomy 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000010362 genome editing Methods 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 125000000623 heterocyclic group Chemical group 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 206010022000 influenza Diseases 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- 230000003902 lesion Effects 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 230000002101 lytic effect Effects 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 108091005763 multidomain proteins Proteins 0.000 description 1
- 210000000653 nervous system Anatomy 0.000 description 1
- 231100000252 nontoxic Toxicity 0.000 description 1
- 230000003000 nontoxic effect Effects 0.000 description 1
- 230000012223 nuclear import Effects 0.000 description 1
- 238000005580 one pot reaction Methods 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 230000000149 penetrating effect Effects 0.000 description 1
- 230000035515 penetration Effects 0.000 description 1
- 230000002688 persistence Effects 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 229920003192 poly(bis maleimide) Polymers 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 230000000241 respiratory effect Effects 0.000 description 1
- 210000002345 respiratory system Anatomy 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 230000000707 stereoselective effect Effects 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 125000001273 sulfonato group Chemical group [O-]S(*)(=O)=O 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 208000011580 syndromic disease Diseases 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
- 150000003568 thioethers Chemical class 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241000712461 unidentified influenza virus Species 0.000 description 1
- 210000000605 viral structure Anatomy 0.000 description 1
- 229940051021 yellow-fever virus Drugs 0.000 description 1
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K47/00—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
- A61K47/50—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates
- A61K47/51—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent
- A61K47/54—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being an organic compound
- A61K47/549—Sugars, nucleosides, nucleotides or nucleic acids
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
- A61K38/16—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- A61K38/43—Enzymes; Proenzymes; Derivatives thereof
- A61K38/46—Hydrolases (3)
- A61K38/465—Hydrolases (3) acting on ester bonds (3.1), e.g. lipases, ribonucleases
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K47/00—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient
- A61K47/50—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates
- A61K47/51—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent
- A61K47/62—Medicinal preparations characterised by the non-active ingredients used, e.g. carriers or inert additives; Targeting or modifying agents chemically bound to the active ingredient the non-active ingredient being chemically bound to the active ingredient, e.g. polymer-drug conjugates the non-active ingredient being a modifying agent the modifying agent being a protein, peptide or polyamino acid
- A61K47/64—Drug-peptide, drug-protein or drug-polyamino acid conjugates, i.e. the modifying agent being a peptide, protein or polyamino acid which is covalently bonded or complexed to a therapeutically active agent
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
- C12N15/1131—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing against viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases [RNase]; Deoxyribonucleases [DNase]
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K9/00—Medicinal preparations characterised by special physical form
- A61K9/10—Dispersions; Emulsions
- A61K9/127—Synthetic bilayered vehicles, e.g. liposomes or liposomes with cholesterol as the only non-phosphatidyl surfactant
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPR]
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A50/00—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
- Y02A50/30—Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change
Definitions
- the disclosure relates to antiviral therapeutics.
- Viral infections cause problems ranging from social embarrassment and discomfort to severe pain or death. Rabies, chicken pox, the flu, shingles, hepatitis, and cancer are examples of painful or fatal conditions that may arise as a consequence of viral infection. Some of the most common targets of viral infection include the respiratory system, the gastrointestinal tract, the liver, the nervous system, and skin— all systems that are important to a healthy and productive life. Thus viral infections pose significant problems to human health and welfare.
- viruses have the ability to go into a latent stage or a persistent stage of infection, in which the virus does not replicate itself as it does in the active stage.
- the viral genome is maintained in the host cell as an episome, such as a closed circular DNA molecule that replicates independently of the host chromosomes.
- latent or persistent infections are difficult to treat at least because the virus is not exhibiting the proteins that are targeted by some antiviral therapeutics.
- the only viral targets available may be the DNA genome.
- the viral episome is typically resident in human host cells which are also home to the approximately 3.2 billion base pair human genome. Any attempt to digest the viral episome that also does widespread damage to the human genome may be of limited clinical value.
- the invention includes a programmable nuclease such as a CRISPR-associated (Cas) endonuclease linked to a DNA-binding domain.
- the DNA-binding domain recognizes and binds to one site on a target viral genome, while the programmable nuclease recognizes and binds to another.
- the Cas endonuclease associated guide RNA of the invention comprises a shorter protospacer sequence that has minimal binding affinity for the target DNA.
- the DNA-binding domain is also designed to have minimal binding affinity for the target DNA. Accordingly, neither the guide RNA or the DNA-binding domain alone is sufficient to bind the target DNA sequence. Only when both components have bound to their respective targets does the nuclease have enough DNA binding affinity to cleave the viral genome.
- a DNA-binding domain such as a transcription-activator like effector (TALE) binding domain
- the nuclease is a Cas endonuclease and is complexed with a guide RNA (gRNA) that has a recognition sequence complementary to the target in the viral genome.
- gRNA guide RNA
- the gRNA and the TALE DNA-binding domain target different sites on a viral genome for cleavage.
- the two targeting mechanisms individually may have reduced binding affinity and specificity compared to a larger TALE binding domain or a gRNA as would naturally be associated with Cas endonuclease but in combination, the requirement for a Cas target proximal to a TALE target increases specificity of binding and reduces off-target cleavage.
- a Cas endonuclease complexed with a gRNA with a recognition sequence of minimal length and linked to a minimal TALE domain provides an antiviral therapeutic with good target specificity.
- a composition that include the nuclease linked to the TALE domain, with both being designed to bind to targets determined to be proximal to each other in a viral genome, may be delivered to a site of a viral infection.
- the TALE-linked nuclease will differentiate the target viral genome from the host genome due to the increased specificity.
- the increased specificity provided by the combined minimal TALE DNA-binding domain and short gRNA with minimal targeting sequence is particularly suited to targeting viral genetic material in infected host cells, and particularly for targeting the genomes of latent or persistent viruses in infected human cells.
- DNA-binding domains may be a TALE domain or other binding domains and can be similarly designed to have require components to have bound to their respective targets before the nuclease has enough DNA binding affinity to cleave the viral genome.
- the DNA-binding domain may be a zinc finger DNA-binding domain of a zinc finger nuclease (ZFN).
- Standard zinc finger DNA-binding domains may contain between three and six individual zinc finger repeats and recognize between 9 and 18 basepairs and zinc finger DNA- binding domains of the invention may be shortened as described above to reduce binding affinity.
- the DNA-binding domain may be a second, catalytically inactive Cas
- the DNA-binding domain may be from a Burkholderia rhizoxinica protein (Bat protein). Burkholderia rhizoxinica is an endosymbiotic bacterium that contains TALE-like proteins. Bat protein binding domains function similarly to TALE binding domains and may be similarly programmed to recognize target sequences. Bat protein binding domains generally require less sequence identity than TALE binding domains in order to bind.
- Bat protein Burkholderia rhizoxinica protein
- Bat protein binding domains function similarly to TALE binding domains and may be similarly programmed to recognize target sequences. Bat protein binding domains generally require less sequence identity than TALE binding domains in order to bind.
- compositions of the invention are particularly useful where the Cas endonuclease target plus protospacer adjacent motif (PAM) is found in both the viral genome and the host genome. Or stated differently, where the viral genome does not offer any sequence adjacent to a PAM that cannot also be found in the host genome.
- PAM protospacer adjacent motif
- the Cas recognition sequence can be shorter than would otherwise be permissible, which prevents the Cas endonuclease from binding to the host genome.
- the Cas endonuclease is linked to a TALE domain that is designed to recognize a nearby target found only in the viral genome. Thus, using a TALE domain allows an antiviral therapeutic
- composition to require an additional targeting sequence that may be spaced away from the PAM at a point where the viral and host genomes diverge.
- the gRNA may include a minimal recognition sequence, e.g., between about 8 to about 16 base pairs in length while the TALE binding domain may include 7 repeat variable diresidues (RVDs). Allowing these targeting portions to be shorter than they otherwise might be means that each targeting portion, taken alone, would have poor binding affinity and specificity. As result, in the presence of only one of the two targets, the nuclease may have insufficient target binding to cleave the target. However, in the presence of both targets, binding affinity and specificity is increased to a degree that the TALE-linked nuclease can be designed to specifically cleave the genome of a virus without any off-target cleavage.
- RVDs repeat variable diresidues
- binding specificity is increased by delivering a Cas endonuclease along with short oligonucleotides, such as single- stranded DNA (ssNDA) oligonucleotides, that are substantially complementary to a portion of the targeting sequence of a guide RNA of the Cas endonuclease.
- ssNDA single- stranded DNA
- the oligonucleotides hybridize to the guide RNA targeting sequence unless and until the true target, which is a better match to the targeting sequence and therefore higher affinity than the oligonucleotides, is present. In the presence of the viral genome, the true target within the viral genome displaces the oligonucleotides from the guide RNA.
- the oligonucleotides prevent the guide RNA from mediating any off-target binding (e.g., of the Cas endonuclease to a similar target within a human genome). That is, ssDNA oligos may be used to compete with the target viral DNA sequence such that only perfect or near perfect matches between the gRNA and the target viral nucleic acid will result in the target viral nucleic acid displacing the ssDNA, allowing the gRNA to bind and the endonuclease to cleave the bound nucleic acid. Accordingly, off-target cleavage of genetic material is avoided.
- the gRNA contains a targeting region at least substantially complementary to a portion of a target viral genome.
- the ssDNA oligos are shorter than the complementary portion of the targeting region and may comprise between about 10 and about 15 nucleotides, with a melting temperature of 40°C to 50°C.
- Off-target cleavage reducing compositions and methods of the invention are particularly suited to anti- viral treatments where viral genetic material must be specifically targeted within a living host cell without cleaving the hosts own genetic material.
- Preferred viral targets include a hepatitis virus such as a hepatitis B virus (HBV), an Epstein-Barr virus, a Kaposi's sarcoma-associated herpesvirus (KSHV), a herpes-simplex virus (HSV), a cytomegalovirus (CMV), human papilloma virus (HPV), or Merkel cell polyomavirus.
- HBV hepatitis B virus
- KSHV Kaposi's sarcoma-associated herpesvirus
- HSV herpes-simplex virus
- CMV cytomegalovirus
- HPV human papilloma virus
- Merkel cell polyomavirus Merkel cell polyomavirus
- the Cas endonuclease— or a messenger RNA (mRNA) encoding the Cas endonuclease— is preferably delivered via a nanoparticle such as a lipid nanoparticle that includes cationic lipids to encourage tissue and cellular penetration.
- Cas endonuclease and gRNA may be delivered as a plasmid encoding the same, however, delivery of the active, ribonucleoprotein (RNP) form of the Cas endonuclease, or the mRNA encoding the Cas endonuclease, avoids the requirement for nuclear import and transcription.
- Topical delivery of RNP or mRNA to tissue such vaginal or anal tissue is used for preferred targets such as HPV to treat warts, lesions, or even cancers such as cervical cancer.
- compositions for treating a viral infection include a programmable nuclease linked to a DNA-binding domain, in which the programmable nuclease hybridizes to a first target in a viral genome and the DNA-binding domain binds to a second target in the viral genome.
- the DNA-binding domain may be a transcription activator-like effector (TALE) DNA-binding domain having about 7 repeat variable diresidues (RVDs) corresponding to the second target in the viral genome.
- TALE transcription activator-like effector
- the DNA-binding domain may be a zinc finger DNA-binding domain of a zinc finger nuclease (ZFN).
- ZFN zinc finger nuclease
- the DNA-binding domain can be a second, catalytically inactive Cas endonuclease and gRNA complex.
- the DNA-binding domain may be from an endosymbiotic bacterium Burkholderia rhizoxinica protein (Bat protein).
- the programmable nuclease may be a Cas endonuclease and guide RNA having a targeting region substantially complementary to the first target in a viral genome. Upon binding to both the first target and the second target, the nuclease cleaves the viral genome.
- the targeting region of the guide RNA can include an about 8-16 (preferably about 15-16) base pair sequence complementary to the first target in the viral genome.
- Any suitable virus may be targeted including, for example, Herpes Simplex virus (HSV), human Herpes virus 6 (HHV6), human Herpes virus 7 (HHV7), Kaposi's sarcoma-associated herpesvirus (KSHV), Cytomegalovirus (CMV), Epstein Barr virus (EBV), Varicella zoster virus (VZV), human papillomavirus (HPV), and hepatitis b virus (HBV).
- HSV Herpes Simplex virus
- HHV6 human Herpes virus 6
- HHV7 human Herpes virus 7
- KSHV Kaposi's sarcoma-associated herpesvirus
- CMV Cytomegalovirus
- EBV Epstein Barr virus
- VZV Varicella zoster virus
- HPV human papillomavirus
- HBV hepatitis b virus
- compositions of the invention include a Cas endonuclease with a guide RNA that has a targeting sequence complementary to a target in a viral genome, and the composition further includes short oligonucleotides at least partially complementary to the targeting sequence of the guide RNA.
- the short oligonucleotides may be provided as a plurality of ssDNA molecules that are co-delivered with the Cas endonuclease.
- the oligonucleotides hybridize to the guide RNA unless and until the true target in the viral genome is present.
- the composition may be used to cleave viral targets even where somewhat similar sequences may be present in the host human genome.
- the ssDNA may be provided as a plurality of ssDNA molecules that are co-delivered with the Cas endonuclease.
- the oligonucleotides hybridize to the guide RNA unless and until the true target in the viral genome is present.
- the composition may be used to
- the targeting sequence of the guide RNA may be at least partially complementary to a sequence in a host genome and the oligonucleotides will prevent off-target activity.
- the ssDNA oligonucleotides may be used in conjunction with embodiments in which a TALE domain, specific to a second target in a viral genome, is linked to the Cas endonuclease.
- the Cas endonuclease may be covalently or non-covalently linked to the TALE DNA-binding domain.
- the Cas endonuclease may be linked to the TALE DNA-binding domain through a linker.
- the linker can be of a length chosen to correspond to a distance between the first target and the second target in the viral genome.
- the linker may be protein and may include one or more of proline, glycine, threonine, serine, or combinations thereof (e.g., glycine -rich for flexibility, or proline-rich for rigidity).
- a non-protein linker may be included.
- a non-protein linker may include, for example, a disulfide bond; a thioether bond; an amine bond; a hydrazine linkage; an amide bond; an imidoester; maleimide; PEG; and BM(PEG)n with 1 ⁇ n ⁇ 9.
- the linker may be attached to the Cas endonuclease at an amino acid selected from the group consisting lysine, cysteine, aspartic acid, and glutamic acid.
- the invention provides a composition for treating a viral infection.
- the composition includes a Cas endonuclease, or mRNA encoding the Cas endonuclease, and a guide RNA (gRNA) comprising a targeting region complementary to a target in a viral genome.
- the composition also includes at least one short oligonucleotide at least partially complementary to the targeting region.
- the targeting region has a first binding affinity for the oligonucleotide that is lower than a perfect match. However, the first binding affinity may be greater than a binding affinity between the targeting region and a mismatched target sequence.
- the mismatched target sequence can be at least 75% complementary to the targeting region.
- the mismatched target sequence may be at least 80% complementary to the targeting region.
- the mismatched target sequence may be at least 90% complementary to the targeting region. In preferred embodiments, the mismatched target sequence may be at least 95% complementary to the targeting region.
- the targeting region can include about 20 nucleotides.
- the ssDNA oligonucleotide may comprise between about 10 and about 15 nucleotides.
- FIG. 1 shows a composition for treating a viral infection comprising a Cas endonuclease linked to a TALE binding domain.
- FIG. 2 shows a composition for treating a viral infection, the composition including a Cas endonuclease-gRNA complex with a ssDNA oligo complementary to the targeting sequence of the gRNA.
- FIG. 3 shows a targeting region of a Cas endonuclease-gRNA complex that has preferentially bound a viral nucleic acid over the ssDNA oligo.
- FIG. 4 shows a targeting region of a Cas endonuclease-gRNA complex and a mismatched viral nucleic acid where the ssDNA oligo has preferentially bound the targeting region over the mismatched viral nucleic acid.
- FIG. 5 shows a Cas endonuclease linked to a TALE binding domain in a nanoparticle.
- FIG. 6 shows a Cas endonuclease and a ssDNA oligonucleotide in a nanoparticle.
- FIG. 7 shows a programmable nuclease in a nanoparticle.
- FIG. 8 is a map of the Epstein Barr genome to illustrate target categories.
- FIG. 9 shows a composition that includes a messenger RNA.
- FIG. 10 shows an antiviral composition with a DNA vector encoding a nuclease.
- FIG. 11 shows a method of preparing an antiviral composition.
- FIG. 12 diagrams a method for treating a viral infection.
- FIG. 1 shows a composition 101 for treating a viral infection.
- the composition 101 includes a programmable nuclease 107 linked to a TALE domain 157, in which the
- the programmable nuclease 107 binds to a first target in a viral genome 175 and the TALE domain 157 is designed to bind to a second target in the viral genome, and wherein the first target and the second target are not both found in a human genome.
- the programmable nuclease 107 is a Cas endonuclease and is complexed with a guide RNA 121 that includes a targeting region 127 that is substantially (e.g., at least 60%) complementary to the first target in viral nucleic acid 175.
- a linker 161 joins the programmable nuclease 107 to the TALE domain 157.
- the TALE domain 157 includes series of repeat variable diresidue (RVD) designed to bind the second target in the viral genome 175.
- RVD repeat variable diresidue
- the targeting region 127 of the guide RNA 121 hybridizes to the first target in the viral genome 175 and the TALE domain 157 binds to the second target.
- Both the TALE binding domain and the targeting region may be short enough in length that neither alone provide sufficient binding affinity to allow the programmable nuclease to efficiently cleave the viral nucleic acid. Instead, both the targeting region and the linked TALE binding domain binding to their recognition sites along the viral nucleic acid provide strong enough recognition for the programmable nuclease to function by cleaving the viral nucleic acid. By using two separate binding sites and binding modalities, off-target cleavage is avoided.
- composition 101 is designed to require specific first and second targets a distance apart from each other on the viral genome, the composition may be particularly useful for targeting the genome of one organism within the cells of an infected host, and thus for digest genomes of latent viruses within the cells of the infected human host.
- compositions and methods of the invention are particularly suited to applications where a PAM and its adjacent sequence are found in both the viral genome and the human genome. That PAM and its adjacent sequence in the viral genome may still be targeted specifically using compositions and methods of the disclosure by having the DNA-binding domain linked to the Cas endonuclease be designed to bind to a sequence found only in the viral genome. By requiring binding of a second, separate target sequence that may be spaced away from the first target sequence in order to allow cleavage, compositions of the invention may provide the ability to use a conserved first target sequence between the host and the target virus. The second target sequence of the DNA-binding domain may be selected to be removed from the conserved section of sequence.
- the DNA-binding domain may be linked to a programmable nuclease, such as a Cas endonuclease at, for example, a side chain of an amino acid of the Cas endonuclease, wherein the side chain may present an amine, a carboxyl, a sulfhydryl, or a carbonyl.
- a programmable nuclease such as a Cas endonuclease at, for example, a side chain of an amino acid of the Cas endonuclease, wherein the side chain may present an amine, a carboxyl, a sulfhydryl, or a carbonyl.
- the DNA- binding domain can be attached to the side chain through a linker 161, which may include one or more of a disulfide bond, a thioether, an amine bond, a hydrazine linkage, an amide bond, an imidoester, a peptide bond, maleimide; polyethylene glycol (PEG), BM(PEG)n with 1 ⁇ n ⁇ 9, biotin, or other molecule or agent.
- the linker 161 may be configured in length to space the DNA-binding domain apart from the gRNA's targeting region by a distance corresponding to the anticipated distance between the target sequences of the DNA-binding domain and the targeting region in the viral genome.
- compositions and methods of the invention recognize that target DNA may not be encountered by the programmable nuclease in a perfect line and that, because of this, sequences that are separated by a distance within the target genome may, in fact, be quite close in physical space. Accordingly, the anticipated distance may be based upon the nucleotide length separating the two target sequences or spatial relationships based on known conformation of the viral genetic material.
- compositions and methods of the invention may include a targeting region 127 that is shorter in length than 20-bp.
- the substantially complementary targeting region may be, for example, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, or 19 bp in length.
- the targeting region may include a 10 to 16 base sequence substantially complimentary to a target sequence in a viral genome.
- the DNA-binding domains may be derived from TALE DNA-binding domains, zinc finger DNA-binding domains from a zinc finger nuclease (ZFN), a second, catalytically inactive Cas endonuclease/gRNA complex, or a DNA-binding domain from a Bat protein for example.
- ZFN zinc finger nuclease
- a second, catalytically inactive Cas endonuclease/gRNA complex or a DNA-binding domain from a Bat protein for example.
- TALENs Transcription activator-like effector nucleases
- TALENs are nucleases that include a DNA-binding domain and a cleavage domain.
- TALENs contain a Fokl nuclease domain and a DNA-binding domain known as a transcription activator-like effector (TALE).
- TALE transcription activator-like effector
- the TALE is composed of tandem arrays of amino acid repeats, each of which recognizes a single base-pair in the major groove of target viral DNA.
- the nucleotide specificity of a domain comes from repeat variable diresidues (RVD), the two amino acids at positions 12 and 13 where Asn- Asn, Asn-Ile, His-Asp and Asn-Gly recognize guanine, adenine, cytosine and thymine, respectively. That pattern allows one to design a TALE domain to be a sequence-specific DNA- binding domain.
- Compositions and methods of the invention may include a TALE DNA-binding domain comprising, for example, 3, 4, 5, 6, 7, 8, 9, or 10 RVD programmed to bind a target sequence in a viral genome. In preferred embodiments, the TALE DNA-binding domain comprises about 7 RVD.
- the DNA-binding domain may be a zinc finger DNA-binding domain of a zinc finger nuclease (ZFN).
- ZFNs Zinc-finger nucleases
- Zinc finger DNA-binding domains can be engineered to target specific desired DNA sequences.
- Standard zinc finger DNA- binding domains may contain between three and six individual zinc finger repeats and recognize between 9 and 18 basepairs and zinc finger DNA-binding domains of the invention may be shortened to reduce binding affinity in order to require binding of both the zinc finger binding domain and the coupled gRNA/Cas endonuclease complex before the Cas endonuclease can cleave the target viral genome.
- the DNA-binding domain may be a second, catalytically inactive Cas endonuclease/gRNA complex of the same or different species as the first cleavage-active Cas endonuclease/gRNA complex. See Guilinger, et al., 2014, Fusion of catalytically inactive Cas9 to Fokl nuclease improves the specificity of genome modification, Nature Biotechnology 32, 577-582, incorporated herein by reference.
- the Cas endonuclease/gRNA complex linked DNA- binding domain may be from a Burkholderia rhizoxinica protein (Bat protein).
- Burkholderia rhizoxinica is an endosymbiotic bacterium that contains TALE-like proteins.
- Bat protein binding domains function similarly to TALE binding domains and may be similarly programmed to recognize target sequences.
- Bat repeat domains have been shown to mediate sequence- specific DNA binding similarly to TALEs while requiring less than 40% sequence identity. See de Lange, et al., 2014, Programmable DNA-binding proteins from Burkholderia provide a fresh perspective on the TALE-like repeat domain, Nucleic Acids Res. 42(11): 7436-7449, incorporated herein by reference.
- the core repeats of each Bat protein are highly polymorphic, allowing for the creation of Bat derivatives with programmable specificity. Id.
- compositions that includes a Cas endonuclease linked to a TALE or other DNA-binding domain will bind to a viral genome with good specificity and can be designed to not exhibit any off-target binding or cleavage.
- compositions of the invention have good specificity and may be used to digest a viral genome within a human cell. Additionally, compositions of the invention may include other features that even further increase the target specificity.
- FIG. 2 shows a composition 201 of the invention including a programmable nuclease 207 complexed with a guide RNA 221 comprising a targeting region 227 that is substantially (e.g., at least 60%) complementary to a target in viral nucleic acid.
- the composition 201 includes a short oligonucleotide 225 such as a single-stranded DNA (ssDNA) oligonucleotide that is co-delivered with the programmable nuclease 207.
- the oligonucleotide 225 is complementary to at least a portion of the targeting region 227 of the guide RNA 221.
- the oligonucleotide 225 may be shorter than the targeting region 227 or may simply contain a sequence complementary to the targeting region 227 that is shorter than the viral genome-complementary portion of the targeting region 227.
- the oligonucleotide 225 hybridizes to the targeting sequence 227 of the guide RNA 221 with an affinity greater than a mis-matched target would hybridize to the guide RNA. Thus the oligonucleotide 225 prevents off-target binding and activity. However, the oligonucleotide 225 binds to the guide RNA 221 less well than would the true target. Accordingly, a perfectly matched target sequences in the viral genome will displace the oligonucleotide 225 and allow for cleavage by the programmable nuclease 207.
- FIG. 3 illustrates a strong match between a target sequence on the viral genome 175 and the targeting region 227.
- the targeting region 227 has preferentially bound to the viral genome 175 and displaced the oligonucleotide 225.
- the programmable nuclease 107 will accordingly cleave the viral genome 175, disrupting the virus.
- FIG. 4 illustrates a mismatch between the viral genome 175 and the targeting region 227. Accordingly, the ssDNA oligo 225 remains preferentially bound to the targeting region 227 preventing the mismatch binding of the targeting region 227 to the viral genome 175 or potential mismatched binding sites in the host genome. Off-target cleavage by the programmable nuclease 207 is thereby prevented. Reducing off-target cleavage is of particular importance where a sequence in the host genome is similar to the target sequence in the viral genome in order to prevent unwanted disruption of the host genetic material.
- compositions of the invention use a programmable nuclease to digest viral nucleic acid and include features that prevent off-target activity.
- Compositions and methods of the disclosure may have particular benefit for the treatment of viruses that contain target sequences that are similar to the host's genetic sequence. Where the target in the viral genome is similar to a region within the host genome, the inclusion of a TALE domain (while using a shorter guide RNA recognition sequence), the use of ssDNA oligonucleotides, or both, can prevent the nuclease from binding to and cleaving the host genome.
- compositions and methods of the invention include, Programmable nucleases may be used in various embodiments to target viral genetic material within host cells.
- compositions include, as the programmable nuclease, an RNA-guided nuclease (e.g., Cas9) and at least one gRNA targeting the genome of the virus.
- Suitable targets in viral genomes include, but are not limited to, a portion of a genome or gene of adenovirus, herpes simplex virus, varicella-zoster virus, Epstein- Barr virus, human cytomegalovirus, human herpesvirus type 8, human papillomavirus, BK virus, JC virus, smallpox, hepatitis B virus, human bocavirus, parvovirus, B 19, human astrovirus, Norwalk virus, coxsackievirus, hepatitis A virus, poliovirus, rhinovirus, sever acute respiratory syndrome virus, hepatitis C virus, yellow fever virus, dengue virus, west nile virus, rubella virus, hepatitis E virus, human immunodeficiency virus, influenza virus, guanarito virus, Junin virus, Lassa virus, machupo virus, sabia virus, Crimean-Congo hemorrhagic fever virus, Ebola virus, Marburg virus, measles virus, m
- a hepatitis virus a hepatitis B virus (HBV), an Epstein-Barr virus, a Kaposi's sarcoma-associated herpesvirus (KSHV), a herpes-simplex virus (HSV), a cytomegalovirus (CMV), human papilloma virus (HPV), and Merkel cell polyomavirus.
- HBV hepatitis B virus
- KSHV Kaposi's sarcoma-associated herpesvirus
- HSV herpes-simplex virus
- CMV cytomegalovirus
- HPV human papilloma virus
- Merkel cell polyomavirus a hepatitis virus
- Methods and compositions of the disclosure use a programmable nuclease to digest nucleic acid of the virus, thereby rendering the virus incapable of replication or infection of the host patient.
- the composition may be delivered as a programmable nuclease such as Cas endonuclease or a nucleic acid encoding the programmable nuclease.
- the programmable nuclease may be complexed with a guide RNA (gRNA) to form a ribonucleoprotein.
- gRNA guide RNA
- the programmable nuclease may be delivered as mRNA that encodes a programmable nuclease, co-delivered with a gRNA.
- the programmable nuclease may be delivered in the form of a vector encoding for the gRNA and the programmable nuclease.
- the programmable nuclease may be any suitable programmable nuclease.
- programmable nuclease is a molecule that can be designed to, or "programmed” to, cleave a nuclease in a sequence- specific manner.
- Programmable nucleases include CRISPR-associated (Cas) nucleases, such as Cas9, Cpfl, C2cl, C2c3, and C2c2.
- methods and compositions of the disclosure use a Cas endonuclease.
- Cas endonucleases were first found as part of bacterial immune systems. The host bacteria capture small DNA fragments (-20 bp) from invading viruses and insert those sequences (termed targeting regions) into their own genome to form a CRISPR. Those CRISPR regions are transcribed as pre-CRISPR RNA(pre-crRNA) and processed to give rise to target- specific crRNA. Invariable target-independent trans-activating crRNA (tracrRNA) is also transcribed from the locus and contributes to the processing of pre-crRNA. The crRNA and tracrRNA have been shown to be combinable into a single guide RNA.
- guide RNA or gRNA refers to either format.
- Guide RNA and a Cas endonuclease form an active ribonucleoprotein (RNP) complex that cleaves the target nucleic acid.
- RNP active ribonucleoprotein
- a sgRNA forms the RNP with Cas endonuclease protein, and the RNP finds the target by hybridization of a targeting region to the intended target. The RNP will cleave when the target is found next to a sequence known as protospacer adjacent motif (PAM).
- PAM protospacer adjacent motif
- Cas endonucleases are generally programmed to target a specific viral nucleic acid by providing a gRNA that includes a ⁇ 20-bp targeting region that is substantially complementary to a target in viral nucleic acid.
- the length of the targeting region is reduced to about 15-16-bp to lower binding efficiency.
- the targetable sequences include, among others, 5 ' -X 15NGG-3 ' or 5 ' -X 15NAG-3 ' ; where X 15 is
- PAMs substantially complementary to the targeting region in the gRNA and NGG and NAG are PAMs. It will be appreciated that recognition sequences with lengths other than 15-16 bp and PAMs other than NGG and NAG are known and are included within the scope of the invention.
- CRISPR systems with single-subunit effectors are known as Class 2. These are then subdivided even further into type II (e.g., Cas9) and type V (e.g., Cpfl).
- Cas endonucleases include Cas9, Cpfl, C2cl, C2c3, and C2c2, and modified versions of Cas9, Cpfl, C2cl, C2c3, and C2c2, such as nuclease with an amino acid sequence that is different, but at least about 85% similar to, an amino acid sequence of wild-type Cas9, Cpfl, C2cl, C2c3, or C2c2, or a Cas9, Cpfl, C2cl, C2c3, or C2c2 protein with a linked to an accessory element such as another polypeptide or protein domain (e.g., within a recombinant fusion protein or linked via an amino acid side-chain) or other molecule or agent.
- C2cl (Class 2, candidate 1) is a type V-
- C2cl examples have been indicated to be functional in E. coli.
- tracrRNAs short RNAs that help separate the CRISPR array into individual spacers, or crRNAs
- the tracrRNA may be fused to the crRNA to make a single short guide, or sgRNA.
- C2cl targets DNA with a 5' PAM sequence TTN.
- C2c3 (Class 2, candidate 3) is a type V-C Cas endonuclease that clusters with C2cl and Cpf 1 within type V. C2c2 was found in metagenomic sequences, and the species is not known.
- C2c2 (Class 2, candidate 2) is a type VI Cas endonuclease. C2c2 has been indicated to make mature crRNAs in E. coli. See Shmakov, 2015, Discovery and functional characterization of diverse class 2 CRISPR-Cas systems, Mol Cell 60(3):385-397, incorporated by reference.
- a TALE DNA-binding domain is linked to a programmable nuclease through a linker.
- a linker may be chosen for its properties. For example, for a polypeptide linker (e.g., within a recombinant fusion protein) to be flexible it may be provided with a plurality of glycine resides (e.g., > 30% or > 50%). For a more rigid polypeptide linker, it may be desirable to include a plurality of proline residues.
- the linker may be biodegradable.
- the linker is cleavable.
- the linker may include an enzyme cleavage region.
- an enzyme cleave region can be the target of a protease.
- the TALE DNA-binding domain is non-covalently bound to the programmable nuclease.
- the programmable nuclease or the DNA-binding domain may be biotinylated and the DNA-binding domain may thus be non-covalently bound to the programmable nuclease through a biotin/streptavidin linkage.
- a composition for treating a viral infection may include a programmable nuclease covalently linked to a TALE DNA-binding domain through a protein linker.
- Some recombinant fusion proteins are composed of two or more functional domains joined by linker peptides.
- the linker serves to connect the proteins, and also provide many other functions, such as maintaining cooperative inter-domain interactions or preserving biological activity.
- the natural length of linkers in multi-domain proteins is about 6 to 10 residues on average.
- Preferred residues for linkers include threonine (Thr), serine (Ser), proline (Pro), glycine (Gly), aspartic acid (Asp), lysine (Lys), glutamine (Gin), asparagine (Asn), and alanine (Ala), arginine (Arg), phenylalanine (Phe), and glutamic acid.
- residues are polar (charged or uncharged).
- Proline may be included to give the linker rigidity. It is thought that the lack of an amide hydrogen, as well as the cyclic side chain, limit proline's ability to participate in promiscuous hydrogen bonding and restrict its flexibility.
- the small, polar amino acids, such as Thr, Ser, and Gly are thought to be favorable for providing good flexibility due to their small sizes, and also help maintain stability of the linker structure in the aqueous solvent through formation of hydrogen bonds with water.
- the linker may include a plurality of glycine residues. In some embodiments, the linker comprises a plurality of threonine and serine residues.
- a linker may provide functionality such as flexibility, rigidity (e.g., even a mixture of both flexibility and rigidity at different points along it), solubility, cleavage targets, binding targets, others, or combinations thereof.
- the linker is included to provide a spacer arm.
- the spacer arm is the chemical chain between two groups.
- the length of a spacer arm (e.g., in angstroms) determines how flexible a conjugate will be. Longer spacer arms have greater flexibility, reduced steric hindrance, and offer more sites for potential nonspecific binding. Spacer arms can range from zero length to > 100 angstroms.
- the molecular composition of a spacer arm can affect solubility and nonspecific binding.
- linkers have spacer arms that contain hydrocarbon chains or polyethylene glycol (PEG) chains. Hydrocarbon chains are not water soluble and typically require an organic solvent such as DMSO or DMF for suspension. Those crosslinkers are suited for penetrating the cell membrane and performing intercellular crosslinking because they are hydrophobic and uncharged. If a charged sulfonate group is added to the termini of such crosslinkers, a water soluble analogue is formed.
- PEG polyethylene glycol
- Certain exemplary categories of cross-linkers use bismaleimide-activated PEG
- BM(PEG)n bis(succinimidyl) PEG
- BS(PEG)n bis(succinimidyl) PEG
- BM(PEG)n cross-links sulfhydryls and BS(PEG)n cross-links amines, although variations will be understood by one of skill in the art.
- a TALE DNA-binding domain may be attached to a programmable nuclease at an amino acid with a side chain comprising an amine, a carboxyl, a sulfhydryl, or a carbonyl.
- the agent or the linker may be attached to the programmable nuclease at an amino acid in the nuclease such as lysine, cysteine, aspartic acid, or glutamic acid.
- a programmable nuclease is linked to a TALE DNA -binding domain through a click reaction product such as one more five-membered rings or acyclic derivatives thereof.
- Click chemistry includes a class of biocompatible reactions intended primarily to join substrates of choice with specific biomolecules. Click chemistry provides methods joining small modular units. In general, click reactions usually join a biomolecule and a TALE DNA-binding domain. Typical click reactions occur in one pot, are not disturbed by water, make unremarkable byproducts, and are driving quickly and irreversibly to high yield of a single click reaction product, with high reaction specificity (in some cases, with both regio- and stereo-specificity). Click reaction products are physiologically stable with only non-toxic byproducts.
- the Azide-Alkyne Huisgen Cycloaddition is a 1,3-dipolar cycloaddition between an azide and a terminal or internal alkyne to give a 1,2,3-triazole.
- the 1,3-dipolar cycloaddition is a chemical reaction between a 1,3-dipole and a dipolarophile to form a five-membered ring.
- Linking a programmable nuclease to an agent via click chemistry can create a linker that includes, as the click reaction product, one or more five-membered rings or acyclic derivatives thereof.
- This 1,3-dipolar cycloaddition is an important route to the regio- and stereoselective synthesis of five-membered heterocycles and their ring-opened acyclic derivatives.
- the 1,3-dipolar cycloaddition between organic azides and terminal alkynes may proceed by a copper(I) -catalyzed version of the Huisgen reaction, CuAAC (for Copper-catalyzed Azide-Alkyne Cycloaddition), which proceeds readily in mild conditions that can approximate physiological conditions.
- Click chemistry may be bioorthogonal: azides and alkynes are typically not found in biomolecules discussed herein and can be selectively reacted.
- FIG. 5 shows a composition 501 for treating a viral infection.
- the composition 501 includes a programmable nuclease 507 designed to bind to and cleave viral nucleic acid.
- the programmable nuclease 507 is linked by a linker 561 to a TALE domain 557.
- the composition preferably includes a nanoparticle 571 encapsulating at least the programmable nuclease 507 and the TALE domain 557.
- the programmable nuclease 507 linked to the TALE domain 557 is delivered using the nanoparticle 571.
- the nanoparticle 571 may be any suitable nanoparticle including, for example, a liposome.
- the nanoparticle 571 is a lipid nanoparticle and preferably includes cationic lipids, which may be found to be particularly well suited for topical delivery of compositions of the disclosure to tissues harboring a latent or persistent infection.
- FIG. 6 shows a composition 601 for treating a viral infection.
- the composition 601 includes a programmable nuclease 607 programmed to cleave viral nucleic acid or an RNA encoding the programmable nuclease and a short oligonucleotide 625, such as an ssDNA oligo.
- the composition preferably includes a nanoparticle 671 encapsulating at least the programmable nuclease 607, or the RNA encoding the programmable nuclease, and the oligonucleotide 625.
- a feature of the depicted composition 601 is that the nuclease 607 is complexed with a guide RNA 621.
- the guide RNA 621 has a targeting sequence that is complementary to a target sequence within a viral genome, and the oligonucleotide 625 is complementary, over a shorter extent than for the true target, to the targeting sequence of the guide RNA 621.
- the targeting sequence includes about a 16 to 20 base pair stretch that is complementary to a target in a viral genome (preferably about 20), and the oligonucleotide includes a stretch that is a few base pairs shorter (e.g., about 10 to 15) that is complementary to the targeting sequence.
- the short oligonucleotide 625 may be bound to the guide RNA 621 within the nanoparticle 671 at the time of delivery.
- any suitable nanoparticle 671 may be used in the composition 601.
- the nanoparticle is a lipid nanoparticle.
- FIG. 7 shows a composition 701 for treating a viral infection, in which the composition 701 includes a programmable nuclease 707 linked to a TALE domain 757 packaged within a lipid nanoparticle 767, e.g., a liposome.
- the TALE domain 757 binds to a first target in a viral genome while the programmable nuclease 707 binds to a second target in the viral genome.
- the programmable nuclease 707 is a Cas endonuclease and is complexed with a guide RNA 721 as an active
- ribonucleoprotein within the lipid nanoparticle 767.
- Preferred viral targets include those with episomal DNA genomes during persistence or latency such as human papillomavirus (HPV), Herpes Simplex virus (HSV), Epstein Barr virus (EBV), Kaposi's sarcoma-associated herpesvirus (KSHV), hepatitis b virus (HBV), human Herpes virus 6 (HHV6), human Herpes virus 7 (HHV7), Cytomegalovirus (CMV), or Varicella zoster virus (VZV).
- the first target, recognized by the TALE domain 757, and the second target, recognized by the guide RNA 1121 are both in the genome of a virus. This means that the first and second targets are selected from within the genome of the virus and that the TALE domain 757 and the guide RNA 721 are designed to bind to those targets.
- the targets are selected by scanning the genomes of the virus for suitable targets. Once suitable first and second targets are identified, the human genome may be scanned for the presence of both of those targets. Once first and second targets are selected that appear in the viral genome and preferably do not also appear in the human genome, the TALE domain and guide RNA may be designed accordingly. It may be preferable to select targets within the viral genome based on where within the genome those targets lie. For example, it may be preferable to select a target that will cause the nuclease to cleave a protein-coding gene and either leave it cleaved or introduce a frame shift.
- Targets may be selected by reading a reference genome (e.g., a GenBank file) and selection may include referencing annotations to identify genes by category or function. For example, it may be preferable to target cleavage within a gene that is annotated as being functional during latency.
- a reference genome e.g., a GenBank file
- FIG. 8 is a map of the Epstein Barr genome and is used to illustrate how the guide sequence may be designed.
- the map shown in FIG. 12 shows certain features in the EBV genome that may be targeted with a programmable nuclease.
- the marks "#", and "+” are used to indicate features that are related to viral structure, transformation, and latency, respectively.
- Guide RNAs that target the EBV genome are used in compositions according to certain embodiments. Within a genome of interest, such as EBV, selected regions, or genes are targeted. For example, six regions can be targeted with seven guide RNA designs for different genome editing purposes.
- EBNA1 is the only nuclear Epstein-Barr virus (EBV) protein expressed in both latent and lytic modes of infection.
- EBNA1 While EBNA1 is known to play several important roles in latent infection, EBNA1 is crucial for many EBV functions including gene regulation and latent genome replication. Therefore, guide RNAs sgEBV4 and sgEBV5 were selected to target both ends of the EBNA1 coding region in order to excise this whole region of the genome. These "structural" targets enable systematic digestion of the EBV genome into smaller pieces. EBNA3C and LMPl are essential for host cell transformation, and guide RNAs sgEBV3 and sgEBW were designed to target the 5' exons of these two proteins respectively.
- EBNA1 is crucial for many EBV functions including gene regulation and latent genome replication.
- Guide RNA sgEBV4 and sgEBV5 are targeted to both ends of the EBNA1 coding region in order to excise that region of the genome.
- Guide RNAs sgEBVl, 2 and 6 fall in repeat regions, so that the success rate of at least one CRISPR cut is multiplied.
- Those "structural" targets enable systematic digestion of the EBV genome into smaller pieces.
- EBNA3C and LMPl are essential for host cell transformation, and guide RNAs sgEBV3 and sgEBW are designed to target the 5' exons of these two proteins respectively.
- a Cas endonuclease will digest the genome of the virus.
- Suitable targets include viruses such as Herpes Simplex virus (HSV), human Herpes virus 6 (HHV6), human Herpes virus 7 (HHV7), Kaposi's sarcoma-associated herpesvirus (KSHV), Cytomegalovirus (CMV), Epstein Barr virus (EBV), Varicella zoster virus (VZV), human papillomavirus (HPV), or hepatitis b virus (HBV).
- HSV Herpes Simplex virus
- HHV6 human Herpes virus 6
- HHV7 human Herpes virus 7
- KSHV Kaposi's sarcoma-associated herpesvirus
- CMV Cytomegalovirus
- EBV Epstein Barr virus
- VZV Varicella zoster virus
- HPV human papillomavirus
- HBV hepatitis b virus
- FIG. 9 illustrates a preferred embodiment that includes a composition 901 for treating a viral infection.
- the composition 901 includes a messenger RNA 937 (mRNA) encoding a programmable nuclease linked to a TALE domain— here, as a transcript of a recombinant gene that will be translated in the infected cells to provide a recombinant fusion protein that includes a Cas endonuclease linked to a TALE domain.
- the composition 901 also includes a guide RNA 921 with a targeting sequence substantially complementary to a first target within a viral genome.
- the composition 901 further includes a nanoparticle 971 encapsulating at least the mRNA 937 encoding the programmable nuclease linked to the TALE domain.
- FIG. 10 diagrams a composition 1000 for treating a viral infection that includes a nucleic acid vector 1001 (e.g., a plasmid) encoding a programmable nuclease for delivery to viral- infected cells.
- the nucleic acid vector 1001 is a plasmid that includes a gene 1027, preferably under control of a promoter 1039.
- the gene 1027 is preferably a recombinant gene encoding a TALE domain linked to a programmable endonuclease.
- the plasmid may also include a viral origin of replication 1035 to support maintenance of the plasmid preferentially in viral-infected cells.
- the plasmid may also include a guide RNA segment 1055, which includes portions that correspond to targets in genetic material of a virus.
- the guide RNA segment 1055 is transcribed, the product is one gRNA with a portion substantially
- vector 1001 is a plasmid and the gene 1027 codes for a Cas endonuclease (e.g., Cas9 or a modified version of Cas9 that is at least a 95% match to Cas9) linked to a TALE domain.
- the guide RNA segment 1055 preferably includes a 15-16 nucleotide segment that is at least a 70% match to a segment in a genome of a virus adjacent to a protospacer adjacent motif (PAM) (e.g., NGG); and the viral origin of replication 1035 is an origin of replication from the genome of a virus.
- PAM protospacer adjacent motif
- the virus may be selected from Human papillomavirus (HPV), Hepatitis B virus, Cytomegalovirus, herpes simplex virus, Epstein Barr virus, for example. These certain embodiments may be preferred where the nucleic acid vector 901 is part of an antiviral therapeutic composition to be delivered to infected cells.
- HPV Human papillomavirus
- Hepatitis B virus Hepatitis B virus
- Cytomegalovirus Cytomegalovirus
- herpes simplex virus herpes simplex virus
- Epstein Barr virus for example.
- the programmable nuclease segment 1027 may preferably code for an RNA-guided nuclease such as Cas9, a modified Cas9 linked by a linker to a TALE DNA-binding domain, Cpfl, or a modified Cpfl .
- the guide RNA segment 1055 and the viral origin of replication 1035 are omitted. Any suitable promoter 1039 (e.g., U6 promoter) may be included. These embodiments may be preferred where the gene 1027 is to be expressed e.g., in culture (for example, in E. coli, yeast, or a Lactobacillus) to produce a nuclease for use in an antiviral therapeutic composition.
- the programmable nuclease segment 1027 codes for an RNA-guided nuclease
- the expressed protein is preferably complexed with a gRNA to form into an active ribonucleoprotein (RNP).
- FIG. 11 shows preparation of an antiviral composition.
- a programmable nuclease 1107 linked to a TALE domain 1157 and guide RNA 1121 are obtained. Those elements are formed in an RNP 1167.
- the guide RNA 1121 includes a targeting region substantially complementary to a target site within a viral genome and the TALE domain 1157 binds a second target site within a viral genome.
- the RNP 1167 is packaged with nanoparticles 1137 to form a composition 1101.
- the composition 1101 may further include any suitable carrier fluid, cream, or gel e.g., for topical delivery.
- the composition 1101 is delivered topically to a site in tissue in a patient.
- the nanoparticles penetrate tissue, preferably to the basal epithelium or mucosal epithelium where, for example, the virus is HPV.
- the nanoparticles 1137 deliver the TALE-linked RNP 1167 molecules to infected cells 1179, where the TALE-linked RNP 1167 then cleaves viral DNA 1175.
- FIG. 12 diagrams a method 1201 for treating a viral infection.
- the method 1201 includes providing 1205 a composition that includes a programmable nuclease programmed to cleave viral nucleic acid.
- the programmable nuclease may include a Cas endonuclease linked to a TALE DNA-binding domain.
- the programmable nuclease may be provided with a ssDNA oligo partially complementary to a targeting region sequence of a gRNA complexed with the programmable nuclease.
- the programmable nuclease or the RNA encoding the programmable nuclease are encapsulated in a nanoparticle such as a lipid nanoparticle that includes cationic lipids.
- the composition is delivered 1209 to cells infected by a virus. In the most preferred embodiments, the composition is delivered topically to avoid systemic distribution.
- the programmable nuclease is preferably a Cas endonuclease programmed to cleave genetic material of the virus. The composition is then used to cleave 1213 the genetic material of the virus.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Molecular Biology (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Zoology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biochemistry (AREA)
- Wood Science & Technology (AREA)
- Biomedical Technology (AREA)
- Epidemiology (AREA)
- General Engineering & Computer Science (AREA)
- Veterinary Medicine (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- Pharmacology & Pharmacy (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Gastroenterology & Hepatology (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Virology (AREA)
- Immunology (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
Compositions include programmable nucleases with targeting mechanisms to prevent off-target cleavage of genetic material. Compositions may include a programmable nuclease such as a CRISPR-associated (Cas) endonuclease complexed with a short guide RNA sequence and linked to a DNA-binding domain from, for example, a TALE, ZFN, or Bat protein such that the guide RNA and the DNA-binding domain target separate target sequences in a target viral genome and binding of both is required to the Cas endonuclease to cleave the bound genetic material. Compositions and methods may also include using a single-strand DNA oligo complementary to a targeting region sequence on a guide RNA-Cas endonuclease complex to competitively block mismatched binding of the targeting region sequence.
Description
ANTIVIRAL COMPOSITIONS
Cross -Reference to Related Applications
This application claims the benefit of priority of U.S. Provisional Application No.
62/438,074, filed December 22, 2016, the contents of which are incorporated by reference.
Technical Field
The disclosure relates to antiviral therapeutics.
Background
Viral infections cause problems ranging from social embarrassment and discomfort to severe pain or death. Rabies, chicken pox, the flu, shingles, hepatitis, and cancer are examples of painful or fatal conditions that may arise as a consequence of viral infection. Some of the most common targets of viral infection include the respiratory system, the gastrointestinal tract, the liver, the nervous system, and skin— all systems that are important to a healthy and productive life. Thus viral infections pose significant problems to human health and welfare.
Some viruses have the ability to go into a latent stage or a persistent stage of infection, in which the virus does not replicate itself as it does in the active stage. For some viruses, in the latent phase, the viral genome is maintained in the host cell as an episome, such as a closed circular DNA molecule that replicates independently of the host chromosomes.
Unfortunately, latent or persistent infections are difficult to treat at least because the virus is not exhibiting the proteins that are targeted by some antiviral therapeutics. During latency, the only viral targets available may be the DNA genome. While it may be possible to treat the infection by attempting to disrupt the viral genome— e.g., by digesting the genome with nucleases, there are potential difficulties associated with off-target activity. The viral episome is typically resident in human host cells which are also home to the approximately 3.2 billion base pair human genome. Any attempt to digest the viral episome that also does widespread damage to the human genome may be of limited clinical value.
Summary
The invention includes a programmable nuclease such as a CRISPR-associated (Cas) endonuclease linked to a DNA-binding domain. The DNA-binding domain recognizes and binds to one site on a target viral genome, while the programmable nuclease recognizes and binds to another. The Cas endonuclease associated guide RNA of the invention comprises a shorter protospacer sequence that has minimal binding affinity for the target DNA. The DNA-binding domain is also designed to have minimal binding affinity for the target DNA. Accordingly, neither the guide RNA or the DNA-binding domain alone is sufficient to bind the target DNA sequence. Only when both components have bound to their respective targets does the nuclease have enough DNA binding affinity to cleave the viral genome. The addition of a DNA-binding domain, such as a transcription-activator like effector (TALE) binding domain, to a
programmable nuclease increases the target specificity of the composition. In preferred embodiments, the nuclease is a Cas endonuclease and is complexed with a guide RNA (gRNA) that has a recognition sequence complementary to the target in the viral genome. The gRNA and the TALE DNA-binding domain target different sites on a viral genome for cleavage. The two targeting mechanisms individually may have reduced binding affinity and specificity compared to a larger TALE binding domain or a gRNA as would naturally be associated with Cas endonuclease but in combination, the requirement for a Cas target proximal to a TALE target increases specificity of binding and reduces off-target cleavage.
Thus a Cas endonuclease complexed with a gRNA with a recognition sequence of minimal length and linked to a minimal TALE domain provides an antiviral therapeutic with good target specificity. A composition that include the nuclease linked to the TALE domain, with both being designed to bind to targets determined to be proximal to each other in a viral genome, may be delivered to a site of a viral infection. The TALE-linked nuclease will differentiate the target viral genome from the host genome due to the increased specificity. Thus, the increased specificity provided by the combined minimal TALE DNA-binding domain and short gRNA with minimal targeting sequence is particularly suited to targeting viral genetic material in infected host cells, and particularly for targeting the genomes of latent or persistent viruses in infected human cells.
DNA-binding domains may be a TALE domain or other binding domains and can be similarly designed to have require components to have bound to their respective targets before the nuclease has enough DNA binding affinity to cleave the viral genome.
The DNA-binding domain may be a zinc finger DNA-binding domain of a zinc finger nuclease (ZFN). Standard zinc finger DNA-binding domains may contain between three and six individual zinc finger repeats and recognize between 9 and 18 basepairs and zinc finger DNA- binding domains of the invention may be shortened as described above to reduce binding affinity.
The DNA-binding domain may be a second, catalytically inactive Cas
endonuclease/gRNA complex of the same or different species as the first cleavage-active Cas endonuclease/gRNA complex.
The DNA-binding domain may be from a Burkholderia rhizoxinica protein (Bat protein). Burkholderia rhizoxinica is an endosymbiotic bacterium that contains TALE-like proteins. Bat protein binding domains function similarly to TALE binding domains and may be similarly programmed to recognize target sequences. Bat protein binding domains generally require less sequence identity than TALE binding domains in order to bind.
The length of the linker joining the TALE or other binding domain to the Cas
endonuclease may be selected based on the distance within the viral genome between the TALE target and the gRNA target. By targeting two specific and separate sequences separated by an anticipated distance, mismatch binding and off-target cleaving is reduced and specificity increased over single- sequence targeting. Compositions of the invention are particularly useful where the Cas endonuclease target plus protospacer adjacent motif (PAM) is found in both the viral genome and the host genome. Or stated differently, where the viral genome does not offer any sequence adjacent to a PAM that cannot also be found in the host genome. Using a composition of the invention, the Cas recognition sequence can be shorter than would otherwise be permissible, which prevents the Cas endonuclease from binding to the host genome. The Cas endonuclease is linked to a TALE domain that is designed to recognize a nearby target found only in the viral genome. Thus, using a TALE domain allows an antiviral therapeutic
composition to require an additional targeting sequence that may be spaced away from the PAM at a point where the viral and host genomes diverge.
In preferred embodiments, where the programmable nuclease is a Cas endonuclease, the gRNA may include a minimal recognition sequence, e.g., between about 8 to about 16 base pairs in length while the TALE binding domain may include 7 repeat variable diresidues (RVDs). Allowing these targeting portions to be shorter than they otherwise might be means that each
targeting portion, taken alone, would have poor binding affinity and specificity. As result, in the presence of only one of the two targets, the nuclease may have insufficient target binding to cleave the target. However, in the presence of both targets, binding affinity and specificity is increased to a degree that the TALE-linked nuclease can be designed to specifically cleave the genome of a virus without any off-target cleavage.
Additionally or alternatively, in certain embodiments, binding specificity is increased by delivering a Cas endonuclease along with short oligonucleotides, such as single- stranded DNA (ssNDA) oligonucleotides, that are substantially complementary to a portion of the targeting sequence of a guide RNA of the Cas endonuclease. The oligonucleotides hybridize to the guide RNA targeting sequence unless and until the true target, which is a better match to the targeting sequence and therefore higher affinity than the oligonucleotides, is present. In the presence of the viral genome, the true target within the viral genome displaces the oligonucleotides from the guide RNA. In the absence of the target viral genome, the oligonucleotides prevent the guide RNA from mediating any off-target binding (e.g., of the Cas endonuclease to a similar target within a human genome). That is, ssDNA oligos may be used to compete with the target viral DNA sequence such that only perfect or near perfect matches between the gRNA and the target viral nucleic acid will result in the target viral nucleic acid displacing the ssDNA, allowing the gRNA to bind and the endonuclease to cleave the bound nucleic acid. Accordingly, off-target cleavage of genetic material is avoided. The gRNA contains a targeting region at least substantially complementary to a portion of a target viral genome. The ssDNA oligos are shorter than the complementary portion of the targeting region and may comprise between about 10 and about 15 nucleotides, with a melting temperature of 40°C to 50°C.
Off-target cleavage reducing compositions and methods of the invention are particularly suited to anti- viral treatments where viral genetic material must be specifically targeted within a living host cell without cleaving the hosts own genetic material.
Preferred viral targets include a hepatitis virus such as a hepatitis B virus (HBV), an Epstein-Barr virus, a Kaposi's sarcoma-associated herpesvirus (KSHV), a herpes-simplex virus (HSV), a cytomegalovirus (CMV), human papilloma virus (HPV), or Merkel cell polyomavirus. Compositions of the preferred embodiments are formulated for topical delivery, i.e., so that the potential for systematic side effects may be reduced.
For delivery to tissue such as basal epithelium or mucosal epithelium, the Cas endonuclease— or a messenger RNA (mRNA) encoding the Cas endonuclease— is preferably delivered via a nanoparticle such as a lipid nanoparticle that includes cationic lipids to encourage tissue and cellular penetration. Cas endonuclease and gRNA may be delivered as a plasmid encoding the same, however, delivery of the active, ribonucleoprotein (RNP) form of the Cas endonuclease, or the mRNA encoding the Cas endonuclease, avoids the requirement for nuclear import and transcription. Topical delivery of RNP or mRNA to tissue such vaginal or anal tissue is used for preferred targets such as HPV to treat warts, lesions, or even cancers such as cervical cancer.
Aspects of the disclosure include a composition for treating a viral infection. The composition includes a programmable nuclease linked to a DNA-binding domain, in which the programmable nuclease hybridizes to a first target in a viral genome and the DNA-binding domain binds to a second target in the viral genome. The DNA-binding domain may be a transcription activator-like effector (TALE) DNA-binding domain having about 7 repeat variable diresidues (RVDs) corresponding to the second target in the viral genome. In certain
embodiments, the DNA-binding domain may be a zinc finger DNA-binding domain of a zinc finger nuclease (ZFN). The DNA-binding domain can be a second, catalytically inactive Cas endonuclease and gRNA complex. In various embodiments, the DNA-binding domain may be from an endosymbiotic bacterium Burkholderia rhizoxinica protein (Bat protein).
The programmable nuclease may be a Cas endonuclease and guide RNA having a targeting region substantially complementary to the first target in a viral genome. Upon binding to both the first target and the second target, the nuclease cleaves the viral genome. The targeting region of the guide RNA can include an about 8-16 (preferably about 15-16) base pair sequence complementary to the first target in the viral genome. Any suitable virus may be targeted including, for example, Herpes Simplex virus (HSV), human Herpes virus 6 (HHV6), human Herpes virus 7 (HHV7), Kaposi's sarcoma-associated herpesvirus (KSHV), Cytomegalovirus (CMV), Epstein Barr virus (EBV), Varicella zoster virus (VZV), human papillomavirus (HPV), and hepatitis b virus (HBV).
In certain embodiments, compositions of the invention include a Cas endonuclease with a guide RNA that has a targeting sequence complementary to a target in a viral genome, and the composition further includes short oligonucleotides at least partially complementary to the
targeting sequence of the guide RNA. The short oligonucleotides may be provided as a plurality of ssDNA molecules that are co-delivered with the Cas endonuclease. The oligonucleotides hybridize to the guide RNA unless and until the true target in the viral genome is present. Using the ssDNA oligonucleotides, the composition may be used to cleave viral targets even where somewhat similar sequences may be present in the host human genome. The ssDNA
oligonucleotides will prevent the Cas endonuclease from binding to the somewhat similar sequences in the host. Thus, the targeting sequence of the guide RNA may be at least partially complementary to a sequence in a host genome and the oligonucleotides will prevent off-target activity. The ssDNA oligonucleotides may be used in conjunction with embodiments in which a TALE domain, specific to a second target in a viral genome, is linked to the Cas endonuclease. The Cas endonuclease may be covalently or non-covalently linked to the TALE DNA-binding domain.
In certain embodiments, the Cas endonuclease may be linked to the TALE DNA-binding domain through a linker. The linker can be of a length chosen to correspond to a distance between the first target and the second target in the viral genome. The linker may be protein and may include one or more of proline, glycine, threonine, serine, or combinations thereof (e.g., glycine -rich for flexibility, or proline-rich for rigidity). A non-protein linker may be included. A non-protein linker may include, for example, a disulfide bond; a thioether bond; an amine bond; a hydrazine linkage; an amide bond; an imidoester; maleimide; PEG; and BM(PEG)n with 1 < n <9. The linker may be attached to the Cas endonuclease at an amino acid selected from the group consisting lysine, cysteine, aspartic acid, and glutamic acid.
In certain aspects, the invention provides a composition for treating a viral infection. The composition includes a Cas endonuclease, or mRNA encoding the Cas endonuclease, and a guide RNA (gRNA) comprising a targeting region complementary to a target in a viral genome. The composition also includes at least one short oligonucleotide at least partially complementary to the targeting region. The targeting region has a first binding affinity for the oligonucleotide that is lower than a perfect match. However, the first binding affinity may be greater than a binding affinity between the targeting region and a mismatched target sequence. The mismatched target sequence can be at least 75% complementary to the targeting region. The mismatched target sequence may be at least 80% complementary to the targeting region. In certain embodiments, the mismatched target sequence may be at least 90% complementary to the targeting region. In
preferred embodiments, the mismatched target sequence may be at least 95% complementary to the targeting region. The targeting region can include about 20 nucleotides. The ssDNA oligonucleotide may comprise between about 10 and about 15 nucleotides.
Brief Description of the Drawings
FIG. 1 shows a composition for treating a viral infection comprising a Cas endonuclease linked to a TALE binding domain.
FIG. 2 shows a composition for treating a viral infection, the composition including a Cas endonuclease-gRNA complex with a ssDNA oligo complementary to the targeting sequence of the gRNA.
FIG. 3 shows a targeting region of a Cas endonuclease-gRNA complex that has preferentially bound a viral nucleic acid over the ssDNA oligo.
FIG. 4 shows a targeting region of a Cas endonuclease-gRNA complex and a mismatched viral nucleic acid where the ssDNA oligo has preferentially bound the targeting region over the mismatched viral nucleic acid.
FIG. 5 shows a Cas endonuclease linked to a TALE binding domain in a nanoparticle.
FIG. 6 shows a Cas endonuclease and a ssDNA oligonucleotide in a nanoparticle.
FIG. 7 shows a programmable nuclease in a nanoparticle.
FIG. 8 is a map of the Epstein Barr genome to illustrate target categories.
FIG. 9 shows a composition that includes a messenger RNA.
FIG. 10 shows an antiviral composition with a DNA vector encoding a nuclease.
FIG. 11 shows a method of preparing an antiviral composition.
FIG. 12 diagrams a method for treating a viral infection.
Detailed Description
FIG. 1 shows a composition 101 for treating a viral infection. The composition 101 includes a programmable nuclease 107 linked to a TALE domain 157, in which the
programmable nuclease 107 binds to a first target in a viral genome 175 and the TALE domain 157 is designed to bind to a second target in the viral genome, and wherein the first target and the second target are not both found in a human genome. In the depicted embodiment, the programmable nuclease 107 is a Cas endonuclease and is complexed with a guide RNA 121 that
includes a targeting region 127 that is substantially (e.g., at least 60%) complementary to the first target in viral nucleic acid 175. A linker 161 joins the programmable nuclease 107 to the TALE domain 157. The TALE domain 157 includes series of repeat variable diresidue (RVD) designed to bind the second target in the viral genome 175.
The targeting region 127 of the guide RNA 121 hybridizes to the first target in the viral genome 175 and the TALE domain 157 binds to the second target. Both the TALE binding domain and the targeting region may be short enough in length that neither alone provide sufficient binding affinity to allow the programmable nuclease to efficiently cleave the viral nucleic acid. Instead, both the targeting region and the linked TALE binding domain binding to their recognition sites along the viral nucleic acid provide strong enough recognition for the programmable nuclease to function by cleaving the viral nucleic acid. By using two separate binding sites and binding modalities, off-target cleavage is avoided. Because the composition 101 is designed to require specific first and second targets a distance apart from each other on the viral genome, the composition may be particularly useful for targeting the genome of one organism within the cells of an infected host, and thus for digest genomes of latent viruses within the cells of the infected human host.
Compositions and methods of the invention are particularly suited to applications where a PAM and its adjacent sequence are found in both the viral genome and the human genome. That PAM and its adjacent sequence in the viral genome may still be targeted specifically using compositions and methods of the disclosure by having the DNA-binding domain linked to the Cas endonuclease be designed to bind to a sequence found only in the viral genome. By requiring binding of a second, separate target sequence that may be spaced away from the first target sequence in order to allow cleavage, compositions of the invention may provide the ability to use a conserved first target sequence between the host and the target virus. The second target sequence of the DNA-binding domain may be selected to be removed from the conserved section of sequence.
The DNA-binding domain may be linked to a programmable nuclease, such as a Cas endonuclease at, for example, a side chain of an amino acid of the Cas endonuclease, wherein the side chain may present an amine, a carboxyl, a sulfhydryl, or a carbonyl. Optionally, the DNA- binding domain can be attached to the side chain through a linker 161, which may include one or more of a disulfide bond, a thioether, an amine bond, a hydrazine linkage, an amide bond, an
imidoester, a peptide bond, maleimide; polyethylene glycol (PEG), BM(PEG)n with 1 < n <9, biotin, or other molecule or agent. The linker 161 may be configured in length to space the DNA-binding domain apart from the gRNA's targeting region by a distance corresponding to the anticipated distance between the target sequences of the DNA-binding domain and the targeting region in the viral genome. Compositions and methods of the invention recognize that target DNA may not be encountered by the programmable nuclease in a perfect line and that, because of this, sequences that are separated by a distance within the target genome may, in fact, be quite close in physical space. Accordingly, the anticipated distance may be based upon the nucleotide length separating the two target sequences or spatial relationships based on known conformation of the viral genetic material.
Certain Cas endonuclease are thought to function naturally by forming a complex with a guide RNA that includes a ~ 20-bp targeting region that is substantially complementary to a target in viral nucleic acid. Compositions and methods of the invention may include a targeting region 127 that is shorter in length than 20-bp. In various embodiments, the substantially complementary targeting region may be, for example, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, or 19 bp in length. In preferred embodiments, the targeting region may include a 10 to 16 base sequence substantially complimentary to a target sequence in a viral genome.
The DNA-binding domains may be derived from TALE DNA-binding domains, zinc finger DNA-binding domains from a zinc finger nuclease (ZFN), a second, catalytically inactive Cas endonuclease/gRNA complex, or a DNA-binding domain from a Bat protein for example.
Transcription activator-like effector nucleases (TALENs) are nucleases that include a DNA-binding domain and a cleavage domain. Specifically, TALENs contain a Fokl nuclease domain and a DNA-binding domain known as a transcription activator-like effector (TALE). The TALE is composed of tandem arrays of amino acid repeats, each of which recognizes a single base-pair in the major groove of target viral DNA. The nucleotide specificity of a domain comes from repeat variable diresidues (RVD), the two amino acids at positions 12 and 13 where Asn- Asn, Asn-Ile, His-Asp and Asn-Gly recognize guanine, adenine, cytosine and thymine, respectively. That pattern allows one to design a TALE domain to be a sequence-specific DNA- binding domain. Compositions and methods of the invention may include a TALE DNA-binding domain comprising, for example, 3, 4, 5, 6, 7, 8, 9, or 10 RVD programmed to bind a target
sequence in a viral genome. In preferred embodiments, the TALE DNA-binding domain comprises about 7 RVD.
The DNA-binding domain may be a zinc finger DNA-binding domain of a zinc finger nuclease (ZFN). Zinc-finger nucleases (ZFNs) are artificial restriction enzymes generated by fusing a zinc finger DNA-binding domain to a DNA-cleavage domain. Zinc finger DNA-binding domains can be engineered to target specific desired DNA sequences. Standard zinc finger DNA- binding domains may contain between three and six individual zinc finger repeats and recognize between 9 and 18 basepairs and zinc finger DNA-binding domains of the invention may be shortened to reduce binding affinity in order to require binding of both the zinc finger binding domain and the coupled gRNA/Cas endonuclease complex before the Cas endonuclease can cleave the target viral genome.
In certain embodiments, the DNA-binding domain may be a second, catalytically inactive Cas endonuclease/gRNA complex of the same or different species as the first cleavage-active Cas endonuclease/gRNA complex. See Guilinger, et al., 2014, Fusion of catalytically inactive Cas9 to Fokl nuclease improves the specificity of genome modification, Nature Biotechnology 32, 577-582, incorporated herein by reference.
According to various embodiments, the Cas endonuclease/gRNA complex linked DNA- binding domain may be from a Burkholderia rhizoxinica protein (Bat protein). Burkholderia rhizoxinica is an endosymbiotic bacterium that contains TALE-like proteins. Bat protein binding domains function similarly to TALE binding domains and may be similarly programmed to recognize target sequences. Bat repeat domains have been shown to mediate sequence- specific DNA binding similarly to TALEs while requiring less than 40% sequence identity. See de Lange, et al., 2014, Programmable DNA-binding proteins from Burkholderia provide a fresh perspective on the TALE-like repeat domain, Nucleic Acids Res. 42(11): 7436-7449, incorporated herein by reference. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic, allowing for the creation of Bat derivatives with programmable specificity. Id.
A composition that includes a Cas endonuclease linked to a TALE or other DNA-binding domain will bind to a viral genome with good specificity and can be designed to not exhibit any off-target binding or cleavage. Thus compositions of the invention have good specificity and may be used to digest a viral genome within a human cell. Additionally, compositions of the invention may include other features that even further increase the target specificity.
FIG. 2 shows a composition 201 of the invention including a programmable nuclease 207 complexed with a guide RNA 221 comprising a targeting region 227 that is substantially (e.g., at least 60%) complementary to a target in viral nucleic acid. The composition 201 includes a short oligonucleotide 225 such as a single-stranded DNA (ssDNA) oligonucleotide that is co-delivered with the programmable nuclease 207. The oligonucleotide 225 is complementary to at least a portion of the targeting region 227 of the guide RNA 221. The oligonucleotide 225 may be shorter than the targeting region 227 or may simply contain a sequence complementary to the targeting region 227 that is shorter than the viral genome-complementary portion of the targeting region 227. The oligonucleotide 225 hybridizes to the targeting sequence 227 of the guide RNA 221 with an affinity greater than a mis-matched target would hybridize to the guide RNA. Thus the oligonucleotide 225 prevents off-target binding and activity. However, the oligonucleotide 225 binds to the guide RNA 221 less well than would the true target. Accordingly, a perfectly matched target sequences in the viral genome will displace the oligonucleotide 225 and allow for cleavage by the programmable nuclease 207.
FIG. 3 illustrates a strong match between a target sequence on the viral genome 175 and the targeting region 227. The targeting region 227 has preferentially bound to the viral genome 175 and displaced the oligonucleotide 225. The programmable nuclease 107 will accordingly cleave the viral genome 175, disrupting the virus.
FIG. 4 illustrates a mismatch between the viral genome 175 and the targeting region 227. Accordingly, the ssDNA oligo 225 remains preferentially bound to the targeting region 227 preventing the mismatch binding of the targeting region 227 to the viral genome 175 or potential mismatched binding sites in the host genome. Off-target cleavage by the programmable nuclease 207 is thereby prevented. Reducing off-target cleavage is of particular importance where a sequence in the host genome is similar to the target sequence in the viral genome in order to prevent unwanted disruption of the host genetic material.
Compositions of the invention use a programmable nuclease to digest viral nucleic acid and include features that prevent off-target activity. Compositions and methods of the disclosure may have particular benefit for the treatment of viruses that contain target sequences that are similar to the host's genetic sequence. Where the target in the viral genome is similar to a region within the host genome, the inclusion of a TALE domain (while using a shorter guide RNA
recognition sequence), the use of ssDNA oligonucleotides, or both, can prevent the nuclease from binding to and cleaving the host genome.
Any suitable virus may be targeted using compositions and methods of the invention. Suitable viruses include, Programmable nucleases may be used in various embodiments to target viral genetic material within host cells. In certain embodiments, compositions include, as the programmable nuclease, an RNA-guided nuclease (e.g., Cas9) and at least one gRNA targeting the genome of the virus. Suitable targets in viral genomes include, but are not limited to, a portion of a genome or gene of adenovirus, herpes simplex virus, varicella-zoster virus, Epstein- Barr virus, human cytomegalovirus, human herpesvirus type 8, human papillomavirus, BK virus, JC virus, smallpox, hepatitis B virus, human bocavirus, parvovirus, B 19, human astrovirus, Norwalk virus, coxsackievirus, hepatitis A virus, poliovirus, rhinovirus, sever acute respiratory syndrome virus, hepatitis C virus, yellow fever virus, dengue virus, west nile virus, rubella virus, hepatitis E virus, human immunodeficiency virus, influenza virus, guanarito virus, Junin virus, Lassa virus, machupo virus, sabia virus, Crimean-Congo hemorrhagic fever virus, Ebola virus, Marburg virus, measles virus, mumps virus, parainfluenza virus, respiratory syncytial virus, human metapnemovirus, Hendra virus, Nipah virus, rabies virus, hepatitis D virus, rotavirus, orbivirus, Coltivirus, or Banna virus. In preferred embodiments, compositions of the invention are provided as antiviral therapeutics that include a modified programmable nuclease
programmed to treat an infection by a hepatitis virus, a hepatitis B virus (HBV), an Epstein-Barr virus, a Kaposi's sarcoma-associated herpesvirus (KSHV), a herpes-simplex virus (HSV), a cytomegalovirus (CMV), human papilloma virus (HPV), and Merkel cell polyomavirus.
Methods and compositions of the disclosure use a programmable nuclease to digest nucleic acid of the virus, thereby rendering the virus incapable of replication or infection of the host patient.
The composition may be delivered as a programmable nuclease such as Cas endonuclease or a nucleic acid encoding the programmable nuclease. The programmable nuclease may be complexed with a guide RNA (gRNA) to form a ribonucleoprotein. The programmable nuclease may be delivered as mRNA that encodes a programmable nuclease, co-delivered with a gRNA. The programmable nuclease may be delivered in the form of a vector encoding for the gRNA and the programmable nuclease.
The programmable nuclease may be any suitable programmable nuclease. A
programmable nuclease is a molecule that can be designed to, or "programmed" to, cleave a
nuclease in a sequence- specific manner. Programmable nucleases include CRISPR-associated (Cas) nucleases, such as Cas9, Cpfl, C2cl, C2c3, and C2c2.
In preferred embodiments, methods and compositions of the disclosure use a Cas endonuclease. Cas endonucleases were first found as part of bacterial immune systems. The host bacteria capture small DNA fragments (-20 bp) from invading viruses and insert those sequences (termed targeting regions) into their own genome to form a CRISPR. Those CRISPR regions are transcribed as pre-CRISPR RNA(pre-crRNA) and processed to give rise to target- specific crRNA. Invariable target-independent trans-activating crRNA (tracrRNA) is also transcribed from the locus and contributes to the processing of pre-crRNA. The crRNA and tracrRNA have been shown to be combinable into a single guide RNA. As used herein, "guide RNA" or gRNA refers to either format. Guide RNA and a Cas endonuclease form an active ribonucleoprotein (RNP) complex that cleaves the target nucleic acid. A sgRNA forms the RNP with Cas endonuclease protein, and the RNP finds the target by hybridization of a targeting region to the intended target. The RNP will cleave when the target is found next to a sequence known as protospacer adjacent motif (PAM).
Cas endonucleases are generally programmed to target a specific viral nucleic acid by providing a gRNA that includes a ~ 20-bp targeting region that is substantially complementary to a target in viral nucleic acid. In preferred embodiments of the invention, the length of the targeting region is reduced to about 15-16-bp to lower binding efficiency. The targetable sequences include, among others, 5'-X 15NGG-3 ' or 5'-X 15NAG-3 '; where X 15 is
substantially complementary to the targeting region in the gRNA and NGG and NAG are PAMs. It will be appreciated that recognition sequences with lengths other than 15-16 bp and PAMs other than NGG and NAG are known and are included within the scope of the invention.
CRISPR systems with single-subunit effectors are known as Class 2. These are then subdivided even further into type II (e.g., Cas9) and type V (e.g., Cpfl). Cas endonucleases include Cas9, Cpfl, C2cl, C2c3, and C2c2, and modified versions of Cas9, Cpfl, C2cl, C2c3, and C2c2, such as nuclease with an amino acid sequence that is different, but at least about 85% similar to, an amino acid sequence of wild-type Cas9, Cpfl, C2cl, C2c3, or C2c2, or a Cas9, Cpfl, C2cl, C2c3, or C2c2 protein with a linked to an accessory element such as another polypeptide or protein domain (e.g., within a recombinant fusion protein or linked via an amino acid side-chain) or other molecule or agent.
C2cl (Class 2, candidate 1) is a type V-B Cas endonuclease that has been found.
Examples of C2cl have been indicated to be functional in E. coli. tracrRNAs (short RNAs that help separate the CRISPR array into individual spacers, or crRNAs) were required. As is the case for Cas9, with C2cl, the tracrRNA may be fused to the crRNA to make a single short guide, or sgRNA. C2cl targets DNA with a 5' PAM sequence TTN.
C2c3 (Class 2, candidate 3) is a type V-C Cas endonuclease that clusters with C2cl and Cpf 1 within type V. C2c2 was found in metagenomic sequences, and the species is not known.
C2c2 (Class 2, candidate 2) is a type VI Cas endonuclease. C2c2 has been indicated to make mature crRNAs in E. coli. See Shmakov, 2015, Discovery and functional characterization of diverse class 2 CRISPR-Cas systems, Mol Cell 60(3):385-397, incorporated by reference.
In embodiments of the invention, a TALE DNA-binding domain is linked to a programmable nuclease through a linker. A linker may be chosen for its properties. For example, for a polypeptide linker (e.g., within a recombinant fusion protein) to be flexible it may be provided with a plurality of glycine resides (e.g., > 30% or > 50%). For a more rigid polypeptide linker, it may be desirable to include a plurality of proline residues. The linker may be biodegradable.
In some embodiments, the linker is cleavable. For example, the linker may include an enzyme cleavage region. Where a polypeptide linker is used, an enzyme cleave region can be the target of a protease.
In certain embodiments, the TALE DNA-binding domain is non-covalently bound to the programmable nuclease. For example, either the programmable nuclease or the DNA-binding domain may be biotinylated and the DNA-binding domain may thus be non-covalently bound to the programmable nuclease through a biotin/streptavidin linkage.
A composition for treating a viral infection may include a programmable nuclease covalently linked to a TALE DNA-binding domain through a protein linker. Some recombinant fusion proteins are composed of two or more functional domains joined by linker peptides. The linker serves to connect the proteins, and also provide many other functions, such as maintaining cooperative inter-domain interactions or preserving biological activity. The natural length of linkers in multi-domain proteins is about 6 to 10 residues on average. Preferred residues for linkers include threonine (Thr), serine (Ser), proline (Pro), glycine (Gly), aspartic acid (Asp),
lysine (Lys), glutamine (Gin), asparagine (Asn), and alanine (Ala), arginine (Arg), phenylalanine (Phe), and glutamic acid. I.e., preferably residues are polar (charged or uncharged).
Proline may be included to give the linker rigidity. It is thought that the lack of an amide hydrogen, as well as the cyclic side chain, limit proline's ability to participate in promiscuous hydrogen bonding and restrict its flexibility. The small, polar amino acids, such as Thr, Ser, and Gly are thought to be favorable for providing good flexibility due to their small sizes, and also help maintain stability of the linker structure in the aqueous solvent through formation of hydrogen bonds with water. For flexibility, the linker may include a plurality of glycine residues. In some embodiments, the linker comprises a plurality of threonine and serine residues.
A linker may provide functionality such as flexibility, rigidity (e.g., even a mixture of both flexibility and rigidity at different points along it), solubility, cleavage targets, binding targets, others, or combinations thereof. In some embodiments, the linker is included to provide a spacer arm. The spacer arm is the chemical chain between two groups. The length of a spacer arm (e.g., in angstroms) determines how flexible a conjugate will be. Longer spacer arms have greater flexibility, reduced steric hindrance, and offer more sites for potential nonspecific binding. Spacer arms can range from zero length to > 100 angstroms. The molecular composition of a spacer arm can affect solubility and nonspecific binding. Some linkers have spacer arms that contain hydrocarbon chains or polyethylene glycol (PEG) chains. Hydrocarbon chains are not water soluble and typically require an organic solvent such as DMSO or DMF for suspension. Those crosslinkers are suited for penetrating the cell membrane and performing intercellular crosslinking because they are hydrophobic and uncharged. If a charged sulfonate group is added to the termini of such crosslinkers, a water soluble analogue is formed.
Certain exemplary categories of cross-linkers use bismaleimide-activated PEG
(BM(PEG)n) or bis(succinimidyl) PEG (BS(PEG)n). Canonically, BM(PEG)n cross-links sulfhydryls and BS(PEG)n cross-links amines, although variations will be understood by one of skill in the art. It may be preferable for most technological and therapeutic applications to use BS(PEG)n or BM(PEG)n with 1 < n <9, although related PEG-based chemistries will be understood by one of skill in the art and are included in the invention.
Optionally using a suitable linker, a TALE DNA-binding domain may be attached to a programmable nuclease at an amino acid with a side chain comprising an amine, a carboxyl, a sulfhydryl, or a carbonyl. For example, the agent or the linker may be attached to the
programmable nuclease at an amino acid in the nuclease such as lysine, cysteine, aspartic acid, or glutamic acid.
In some embodiments, a programmable nuclease is linked to a TALE DNA -binding domain through a click reaction product such as one more five-membered rings or acyclic derivatives thereof. Click chemistry includes a class of biocompatible reactions intended primarily to join substrates of choice with specific biomolecules. Click chemistry provides methods joining small modular units. In general, click reactions usually join a biomolecule and a TALE DNA-binding domain. Typical click reactions occur in one pot, are not disturbed by water, make unremarkable byproducts, and are driving quickly and irreversibly to high yield of a single click reaction product, with high reaction specificity (in some cases, with both regio- and stereo-specificity). Click reaction products are physiologically stable with only non-toxic byproducts. In one example, the Azide-Alkyne Huisgen Cycloaddition is a 1,3-dipolar cycloaddition between an azide and a terminal or internal alkyne to give a 1,2,3-triazole. The 1,3-dipolar cycloaddition is a chemical reaction between a 1,3-dipole and a dipolarophile to form a five-membered ring. Linking a programmable nuclease to an agent via click chemistry can create a linker that includes, as the click reaction product, one or more five-membered rings or acyclic derivatives thereof. This 1,3-dipolar cycloaddition is an important route to the regio- and stereoselective synthesis of five-membered heterocycles and their ring-opened acyclic derivatives.
The 1,3-dipolar cycloaddition between organic azides and terminal alkynes, e.g., for bioconjugation, may proceed by a copper(I) -catalyzed version of the Huisgen reaction, CuAAC (for Copper-catalyzed Azide-Alkyne Cycloaddition), which proceeds readily in mild conditions that can approximate physiological conditions. Click chemistry may be bioorthogonal: azides and alkynes are typically not found in biomolecules discussed herein and can be selectively reacted. For discussion see Hein et al., 2009, Click chemistry, a powerful tool for pharmaceutical sciences, Pharm Res 25(10):2216-2230 and McCombs & Owen, 2015, Antibody drug conjugates: design and selection of linker, payload and conjugation chemistry, AAPS J
17(2):339-51, both incorporated by reference.
FIG. 5 shows a composition 501 for treating a viral infection. The composition 501 includes a programmable nuclease 507 designed to bind to and cleave viral nucleic acid. The programmable nuclease 507 is linked by a linker 561 to a TALE domain 557. The composition
preferably includes a nanoparticle 571 encapsulating at least the programmable nuclease 507 and the TALE domain 557. The programmable nuclease 507 linked to the TALE domain 557 is delivered using the nanoparticle 571. The nanoparticle 571 may be any suitable nanoparticle including, for example, a liposome. In preferred embodiments, the nanoparticle 571 is a lipid nanoparticle and preferably includes cationic lipids, which may be found to be particularly well suited for topical delivery of compositions of the disclosure to tissues harboring a latent or persistent infection.
FIG. 6 shows a composition 601 for treating a viral infection. The composition 601 includes a programmable nuclease 607 programmed to cleave viral nucleic acid or an RNA encoding the programmable nuclease and a short oligonucleotide 625, such as an ssDNA oligo. The composition preferably includes a nanoparticle 671 encapsulating at least the programmable nuclease 607, or the RNA encoding the programmable nuclease, and the oligonucleotide 625. A feature of the depicted composition 601 is that the nuclease 607 is complexed with a guide RNA 621. The guide RNA 621 has a targeting sequence that is complementary to a target sequence within a viral genome, and the oligonucleotide 625 is complementary, over a shorter extent than for the true target, to the targeting sequence of the guide RNA 621. I.e., in some embodiments, the targeting sequence includes about a 16 to 20 base pair stretch that is complementary to a target in a viral genome (preferably about 20), and the oligonucleotide includes a stretch that is a few base pairs shorter (e.g., about 10 to 15) that is complementary to the targeting sequence. In certain embodiments, the short oligonucleotide 625 may be bound to the guide RNA 621 within the nanoparticle 671 at the time of delivery.
Any suitable nanoparticle 671 may be used in the composition 601. In certain embodiment, the nanoparticle is a lipid nanoparticle.
FIG. 7 shows a composition 701 for treating a viral infection, in which the composition 701 includes a programmable nuclease 707 linked to a TALE domain 757 packaged within a lipid nanoparticle 767, e.g., a liposome. Once delivered to cells in vivo in a patient, the TALE domain 757 binds to a first target in a viral genome while the programmable nuclease 707 binds to a second target in the viral genome. In preferred embodiments, the programmable nuclease 707 is a Cas endonuclease and is complexed with a guide RNA 721 as an active
ribonucleoprotein (RNP) within the lipid nanoparticle 767. Preferred viral targets include those with episomal DNA genomes during persistence or latency such as human papillomavirus
(HPV), Herpes Simplex virus (HSV), Epstein Barr virus (EBV), Kaposi's sarcoma-associated herpesvirus (KSHV), hepatitis b virus (HBV), human Herpes virus 6 (HHV6), human Herpes virus 7 (HHV7), Cytomegalovirus (CMV), or Varicella zoster virus (VZV). The first target, recognized by the TALE domain 757, and the second target, recognized by the guide RNA 1121, are both in the genome of a virus. This means that the first and second targets are selected from within the genome of the virus and that the TALE domain 757 and the guide RNA 721 are designed to bind to those targets.
In preferred embodiments, the targets are selected by scanning the genomes of the virus for suitable targets. Once suitable first and second targets are identified, the human genome may be scanned for the presence of both of those targets. Once first and second targets are selected that appear in the viral genome and preferably do not also appear in the human genome, the TALE domain and guide RNA may be designed accordingly. It may be preferable to select targets within the viral genome based on where within the genome those targets lie. For example, it may be preferable to select a target that will cause the nuclease to cleave a protein-coding gene and either leave it cleaved or introduce a frame shift. Targets may be selected by reading a reference genome (e.g., a GenBank file) and selection may include referencing annotations to identify genes by category or function. For example, it may be preferable to target cleavage within a gene that is annotated as being functional during latency.
FIG. 8 is a map of the Epstein Barr genome and is used to illustrate how the guide sequence may be designed. The map shown in FIG. 12 shows certain features in the EBV genome that may be targeted with a programmable nuclease. The marks "#", and "+" are used to indicate features that are related to viral structure, transformation, and latency, respectively. Guide RNAs that target the EBV genome are used in compositions according to certain embodiments. Within a genome of interest, such as EBV, selected regions, or genes are targeted. For example, six regions can be targeted with seven guide RNA designs for different genome editing purposes. In relation to EBV, EBNA1 is the only nuclear Epstein-Barr virus (EBV) protein expressed in both latent and lytic modes of infection. While EBNA1 is known to play several important roles in latent infection, EBNA1 is crucial for many EBV functions including gene regulation and latent genome replication. Therefore, guide RNAs sgEBV4 and sgEBV5 were selected to target both ends of the EBNA1 coding region in order to excise this whole region of the genome. These "structural" targets enable systematic digestion of the EBV
genome into smaller pieces. EBNA3C and LMPl are essential for host cell transformation, and guide RNAs sgEBV3 and sgEBW were designed to target the 5' exons of these two proteins respectively.
To design guide RNA targeting the EBV genome, the EBV reference genome is referred to. EBNA1 is crucial for many EBV functions including gene regulation and latent genome replication. Guide RNA sgEBV4 and sgEBV5 are targeted to both ends of the EBNA1 coding region in order to excise that region of the genome. Guide RNAs sgEBVl, 2 and 6 fall in repeat regions, so that the success rate of at least one CRISPR cut is multiplied. Those "structural" targets enable systematic digestion of the EBV genome into smaller pieces. EBNA3C and LMPl are essential for host cell transformation, and guide RNAs sgEBV3 and sgEBW are designed to target the 5' exons of these two proteins respectively. Using one or more guide RNA designed to target a selected location within a viral genome, a Cas endonuclease will digest the genome of the virus. Suitable targets include viruses such as Herpes Simplex virus (HSV), human Herpes virus 6 (HHV6), human Herpes virus 7 (HHV7), Kaposi's sarcoma-associated herpesvirus (KSHV), Cytomegalovirus (CMV), Epstein Barr virus (EBV), Varicella zoster virus (VZV), human papillomavirus (HPV), or hepatitis b virus (HBV).
FIG. 9 illustrates a preferred embodiment that includes a composition 901 for treating a viral infection. The composition 901 includes a messenger RNA 937 (mRNA) encoding a programmable nuclease linked to a TALE domain— here, as a transcript of a recombinant gene that will be translated in the infected cells to provide a recombinant fusion protein that includes a Cas endonuclease linked to a TALE domain. The composition 901 also includes a guide RNA 921 with a targeting sequence substantially complementary to a first target within a viral genome. The composition 901 further includes a nanoparticle 971 encapsulating at least the mRNA 937 encoding the programmable nuclease linked to the TALE domain.
FIG. 10 diagrams a composition 1000 for treating a viral infection that includes a nucleic acid vector 1001 (e.g., a plasmid) encoding a programmable nuclease for delivery to viral- infected cells. In the depicted embodiment, the nucleic acid vector 1001 is a plasmid that includes a gene 1027, preferably under control of a promoter 1039. The gene 1027 is preferably a recombinant gene encoding a TALE domain linked to a programmable endonuclease. The plasmid may also include a viral origin of replication 1035 to support maintenance of the plasmid preferentially in viral-infected cells.
Where the programmable nuclease gene 1027 codes for Cas endonuclease linked to a TALE DNA-binding domain, the plasmid may also include a guide RNA segment 1055, which includes portions that correspond to targets in genetic material of a virus. When the guide RNA segment 1055 is transcribed, the product is one gRNA with a portion substantially
complementary to a target in viral genetic material, preferably with no match in a human genome.
In certain embodiments, vector 1001 is a plasmid and the gene 1027 codes for a Cas endonuclease (e.g., Cas9 or a modified version of Cas9 that is at least a 95% match to Cas9) linked to a TALE domain. The guide RNA segment 1055 preferably includes a 15-16 nucleotide segment that is at least a 70% match to a segment in a genome of a virus adjacent to a protospacer adjacent motif (PAM) (e.g., NGG); and the viral origin of replication 1035 is an origin of replication from the genome of a virus. The virus may be selected from Human papillomavirus (HPV), Hepatitis B virus, Cytomegalovirus, herpes simplex virus, Epstein Barr virus, for example. These certain embodiments may be preferred where the nucleic acid vector 901 is part of an antiviral therapeutic composition to be delivered to infected cells.
The programmable nuclease segment 1027 may preferably code for an RNA-guided nuclease such as Cas9, a modified Cas9 linked by a linker to a TALE DNA-binding domain, Cpfl, or a modified Cpfl . In some embodiments, the guide RNA segment 1055 and the viral origin of replication 1035 are omitted. Any suitable promoter 1039 (e.g., U6 promoter) may be included. These embodiments may be preferred where the gene 1027 is to be expressed e.g., in culture (for example, in E. coli, yeast, or a Lactobacillus) to produce a nuclease for use in an antiviral therapeutic composition. Where the programmable nuclease segment 1027 codes for an RNA-guided nuclease, the expressed protein is preferably complexed with a gRNA to form into an active ribonucleoprotein (RNP).
FIG. 11 shows preparation of an antiviral composition. A programmable nuclease 1107 linked to a TALE domain 1157 and guide RNA 1121 are obtained. Those elements are formed in an RNP 1167. In a preferred embodiment, the guide RNA 1121 includes a targeting region substantially complementary to a target site within a viral genome and the TALE domain 1157 binds a second target site within a viral genome. The RNP 1167 is packaged with nanoparticles 1137 to form a composition 1101. The composition 1101 may further include any suitable carrier fluid, cream, or gel e.g., for topical delivery. The composition 1101 is delivered topically to a site
in tissue in a patient. The nanoparticles (e.g., liposomes) penetrate tissue, preferably to the basal epithelium or mucosal epithelium where, for example, the virus is HPV. The nanoparticles 1137 deliver the TALE-linked RNP 1167 molecules to infected cells 1179, where the TALE-linked RNP 1167 then cleaves viral DNA 1175.
FIG. 12 diagrams a method 1201 for treating a viral infection. The method 1201 includes providing 1205 a composition that includes a programmable nuclease programmed to cleave viral nucleic acid. The programmable nuclease may include a Cas endonuclease linked to a TALE DNA-binding domain. The programmable nuclease may be provided with a ssDNA oligo partially complementary to a targeting region sequence of a gRNA complexed with the programmable nuclease. Preferably, the programmable nuclease or the RNA encoding the programmable nuclease are encapsulated in a nanoparticle such as a lipid nanoparticle that includes cationic lipids. The composition is delivered 1209 to cells infected by a virus. In the most preferred embodiments, the composition is delivered topically to avoid systemic distribution. The programmable nuclease is preferably a Cas endonuclease programmed to cleave genetic material of the virus. The composition is then used to cleave 1213 the genetic material of the virus.
Claims
1. A composition for treating a viral infection, the composition comprising:
a Cas endonuclease complexed with a guide RNA (gRNA) comprising a targeting region at least substantially complementary to a first target in a viral genome; and
a DNA-binding domain linked to the Cas endonuclease, the DNA-binding domain programmed to bind a second target in the viral genome,
wherein binding of both the first target and the second target is required for the Cas endonuclease to cleave the viral genome.
2. The composition of claim 1, wherein the DNA-binding domain is a transcription activator- like effector (TALE) DNA-binding domain comprising about 7 repeat variable diresidues (RVDs) corresponding to the second target in the viral genome.
3. The composition of claim 1, wherein the DNA-binding domain is a zinc finger DNA-binding domain of a zinc finger nuclease (ZFN).
4. The composition of claim 1, wherein the DNA-binding domain is a second, catalytically inactive Cas endonuclease and gRNA complex.
5. The composition of claim 1, wherein the DNA-binding domain is from an endosymbiotic bacterium Burkholderia rhizoxinica protein (Bat protein).
6. The composition of claim 1, wherein the targeting region comprises an about 15-16 base pair sequence complementary to the first target in the viral genome.
7. The composition of claim 1, wherein the viral genome is selected from the group consisting of Herpes Simplex virus (HSV), human Herpes virus 6 (HHV6), human Herpes virus 7 (HHV7), Kaposi's sarcoma-associated herpesvirus (KSHV), Cytomegalovirus (CMV), Epstein Barr virus (EBV), Varicella zoster virus (VZV), human papillomavirus (HPV), Merkel cell polyomavirus (MCV), and hepatitis b virus (HBV).
8. The composition of claim 1, further comprising a single-strand DNA oligonucleotide at least partially complementary to the targeting region.
9. The composition of claim 1, wherein the targeting region is at least partially complementary to a sequence in a host genome.
10. The composition of claim 1, wherein the TALE DNA-binding domain is non-covalently bound to the Cas endonuclease.
11. The composition of claim 1, wherein the Cas endonuclease is covalently linked to the TALE DNA-binding domain .
12. The composition of claim 1, wherein the Cas endonuclease is linked to the TALE DNA- binding domain through a linker.
13. The composition of claim 12, wherein the linker is of a length approximately equivalent to a conformational distance between the first target and the second target in the viral genome.
14. The composition of claim 12, wherein the linker comprises protein.
15. The composition of claim 14, wherein the linker comprises at least one selected from the group consisting of: a plurality of proline residues; a plurality of glycine residues; and a plurality of threonine and serine residues.
16. The composition of claim 13, wherein the linker comprises a non-protein chemical linker.
17. The composition of claim 16, wherein the non-protein chemical linker comprises one selected from the group consisting of: a disulfide bond; a thioether bond; an amine bond; a hydrazine linkage; an amide bond; an imidoester; maleimide; PEG; and BM(PEG)n with 1 < n <9.
18. The composition of claim 12, wherein the linker is attached to the Cas endonuclease at an amino acid selected from the group consisting lysine, cysteine, aspartic acid, and glutamic acid.
19. A composition for treating a viral infection, the composition comprising:
a Cas endonuclease complexed with a guide RNA (gRNA) comprising a targeting region at least substantially complementary to a first target in a viral genome; and
a single-strand DNA (ssDNA) oligonucleotide at least partially complementary to the targeting region,
wherein the targeting region has a first binding affinity for the ssDNA oligonucleotide that is less than a binding affinity between the targeting region and a target sequence that is 100% complementary to the targeting region, and
wherein the first binding affinity is greater than a binding affinity between the targeting region and a mismatched target sequence.
20. The composition of claim 19, wherein the mismatched target sequence is at least 95% complementary to the targeting region.
21. The composition of claim 19, wherein the targeting region comprises about 20 nucleotides.
22. The composition of claim 21, wherein the ssDNA oligonucleotide comprises between about 10 and about 15 nucleotides.
23. The composition of claim 19, further comprising a transcription activator-like effector (TALE) DNA-binding domain linked to the Cas endonuclease, the TALE DNA-binding domain programmed to bind a second target in the viral genome,
wherein binding of both the first target and the second target is required for the Cas endonuclease to cleave the viral genome.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201662438074P | 2016-12-22 | 2016-12-22 | |
US62/438,074 | 2016-12-22 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2018118585A1 true WO2018118585A1 (en) | 2018-06-28 |
Family
ID=62627037
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2017/066108 WO2018118585A1 (en) | 2016-12-22 | 2017-12-13 | Antiviral compositions |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2018118585A1 (en) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150067900A1 (en) * | 2012-03-15 | 2015-03-05 | Cellectis, S.A. | New repeat variable diresidues for targeting nucleotides |
WO2016007604A1 (en) * | 2014-07-09 | 2016-01-14 | Gen9, Inc. | Compositions and methods for site-directed dna nicking and cleaving |
WO2016057961A1 (en) * | 2014-10-10 | 2016-04-14 | Editas Medicine, Inc. | Compositions and methods for promoting homology directed repair |
GB2537000A (en) * | 2012-05-25 | 2016-10-05 | Univ California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US20160346360A1 (en) * | 2015-05-29 | 2016-12-01 | Agenovir Corporation | Compositions and methods for cell targeted hpv treatment |
US20160369268A1 (en) * | 2013-07-01 | 2016-12-22 | The Board Of Regents Of The University Of Texas System | Transcription activator-like effector (tale) libraries and methods of synthesis and use |
-
2017
- 2017-12-13 WO PCT/US2017/066108 patent/WO2018118585A1/en active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150067900A1 (en) * | 2012-03-15 | 2015-03-05 | Cellectis, S.A. | New repeat variable diresidues for targeting nucleotides |
GB2537000A (en) * | 2012-05-25 | 2016-10-05 | Univ California | Methods and compositions for RNA-directed target DNA modification and for RNA-directed modulation of transcription |
US20160369268A1 (en) * | 2013-07-01 | 2016-12-22 | The Board Of Regents Of The University Of Texas System | Transcription activator-like effector (tale) libraries and methods of synthesis and use |
WO2016007604A1 (en) * | 2014-07-09 | 2016-01-14 | Gen9, Inc. | Compositions and methods for site-directed dna nicking and cleaving |
WO2016057961A1 (en) * | 2014-10-10 | 2016-04-14 | Editas Medicine, Inc. | Compositions and methods for promoting homology directed repair |
US20160346360A1 (en) * | 2015-05-29 | 2016-12-01 | Agenovir Corporation | Compositions and methods for cell targeted hpv treatment |
Non-Patent Citations (4)
Title |
---|
DELANGE, O ET AL.: "Programmable DNA-binding proteins from Burkholderia provide a fresh perspective on the TALE-like repeat domain", NUCLEIC ACID RESEARCH, vol. 42, no. 11, 1 July 2014 (2014-07-01), pages 7436 - 7449 * |
GOURLAY, EM ET AL.: "Strategies for human genome modification using engineered nucleases and transcription factors", DOCTORAL THESIS, December 2014 (2014-12-01), Retrieved from the Internet <URL:http://theses.gla.ac.uk/5875> [retrieved on 20180314] * |
KENNEDY, EM ET AL.: "Bacterial CRISPR/Cas DNA endonucleases: A revolutionary technology that could dramatically impact viral research and treatment", VIROLOGY, vol. 479, no. 480, May 2015 (2015-05-01), pages 213 - 220, XP029216538, DOI: doi:10.1016/j.virol.2015.02.024 * |
NAISER, T ET AL.: "Impact of point-mutations on the hybridization affinity of surface-bound , DNA/DNA and RNA/DNA oligonucleotide-duplexes: Comparison of single base mismatches and base bulges", BMC BIOTECHNOLOGY, vol. 8, no. 48, 13 May 2008 (2008-05-13), pages 1 - 23 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2020233745B2 (en) | Delivery system for functional nucleases | |
JP6840796B2 (en) | Composition for linking DNA binding domain and cleavage domain | |
AU2021201257B2 (en) | Cas9 variants and uses thereof | |
US20230167426A1 (en) | Anti-crispr compounds and methods of use | |
US11920169B2 (en) | Compositions for linking DNA-binding domains and cleavage domains | |
US20170246260A1 (en) | Modified antiviral nuclease | |
AU2019326408A1 (en) | Engineered target specific base editors | |
JP2022526695A (en) | Inhibition of unintentional mutations in gene editing | |
JP2016534116A (en) | Engineered transcription activator-like effector (TALE) domains and uses thereof | |
JP2020515258A (en) | Antiviral therapeutic agent | |
EP3589330A1 (en) | Methods and systems for modifying dna | |
US20240254464A1 (en) | Cleavage-inactive cas12f1, cleavage-inactive cas12f1-based fusion protein, crispr gene-editing system comprising same, and preparation method and use thereof | |
CN117925585A (en) | Adenosine deaminase, base editor fusion protein, base editor system and use thereof | |
JP2024539652A (en) | Compositions and methods for genome editing of neonatal FC receptors | |
JP2002533088A (en) | Nonpathogenic transfection vector | |
CN116438302A (en) | Systems and methods for translocating cargo nucleotide sequences | |
WO2018118585A1 (en) | Antiviral compositions | |
WO2018118567A1 (en) | Delivery of antiviral therapies | |
JP2022551931A (en) | Modified endonucleases and related methods | |
WO2024229342A1 (en) | Piggybac transposase engineering | |
KR20240141024A (en) | Gene editing system using E1347A DddAtox mutant-TnpB fusion protein | |
US20040137626A1 (en) | AAV ITR-mediated modulation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 17885365 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205N DATED 06/08/2019) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 17885365 Country of ref document: EP Kind code of ref document: A1 |