WO2022256414A1 - RNA recognition complex and uses thereof - Google Patents
RNA recognition complex and uses thereof
- Publication number
- WO2022256414A1 (PCT/US2022/031780)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- rna
- protein
- targeting
- recognition complex
- targeting agent
Links
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 283
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 203
- 238000000034 method Methods 0.000 claims abstract description 108
- 230000014509 gene expression Effects 0.000 claims abstract description 91
- 241000711573 Coronaviridae Species 0.000 claims abstract description 47
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims abstract description 45
- 108091033409 CRISPR Proteins 0.000 claims abstract description 38
- 238000010354 CRISPR gene editing Methods 0.000 claims abstract description 12
- 239000003795 chemical substances by application Substances 0.000 claims description 70
- 239000012636 effector Substances 0.000 claims description 40
- 108091005774 SARS-CoV-2 proteins Proteins 0.000 claims description 33
- 101000992423 Severe acute respiratory syndrome coronavirus 2 Putative ORF9c protein Proteins 0.000 claims description 33
- 108020005004 Guide RNA Proteins 0.000 claims description 23
- 230000000295 complement effect Effects 0.000 claims description 17
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 13
- 101000596375 Severe acute respiratory syndrome coronavirus 2 ORF7b protein Proteins 0.000 claims description 13
- 101001086079 Severe acute respiratory syndrome coronavirus 2 Putative ORF3b protein Proteins 0.000 claims description 12
- 238000001114 immunoprecipitation Methods 0.000 claims description 12
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 11
- 230000008685 targeting Effects 0.000 claims description 10
- 238000004132 cross linking Methods 0.000 claims description 9
- 201000010099 disease Diseases 0.000 claims description 8
- 230000002829 reductive effect Effects 0.000 claims description 8
- 238000010195 expression analysis Methods 0.000 claims description 3
- 238000011222 transcriptome analysis Methods 0.000 claims description 3
- 229920002477 rna polymer Polymers 0.000 description 161
- 235000018102 proteins Nutrition 0.000 description 157
- 210000004027 cell Anatomy 0.000 description 106
- 101800001554 RNA-directed RNA polymerase Proteins 0.000 description 90
- 101800001768 Exoribonuclease Proteins 0.000 description 66
- 239000013598 vector Substances 0.000 description 61
- 101800000935 Non-structural protein 12 Proteins 0.000 description 59
- 101800004575 RNA-directed RNA polymerase nsp12 Proteins 0.000 description 59
- 108020004999 messenger RNA Proteins 0.000 description 49
- 150000007523 nucleic acids Chemical class 0.000 description 39
- 239000000203 mixture Substances 0.000 description 37
- 241001678559 COVID-19 virus Species 0.000 description 29
- 101800000482 Non-structural protein 9 Proteins 0.000 description 29
- 102000039446 nucleic acids Human genes 0.000 description 28
- 108020004707 nucleic acids Proteins 0.000 description 28
- 230000003993 interaction Effects 0.000 description 27
- 230000027455 binding Effects 0.000 description 25
- 108090000765 processed proteins & peptides Proteins 0.000 description 25
- 239000000523 sample Substances 0.000 description 25
- 230000014616 translation Effects 0.000 description 24
- 230000003612 virological effect Effects 0.000 description 24
- 238000013519 translation Methods 0.000 description 23
- 241000700605 Viruses Species 0.000 description 21
- 239000002502 liposome Substances 0.000 description 21
- 102000040430 polynucleotide Human genes 0.000 description 21
- 108091033319 polynucleotide Proteins 0.000 description 21
- 239000002157 polynucleotide Substances 0.000 description 21
- 102000004196 processed proteins & peptides Human genes 0.000 description 20
- 239000013603 viral vector Substances 0.000 description 20
- 108020004414 DNA Proteins 0.000 description 18
- 102000053602 DNA Human genes 0.000 description 18
- 229920001184 polypeptide Polymers 0.000 description 18
- 230000004570 RNA-binding Effects 0.000 description 17
- 230000000694 effects Effects 0.000 description 17
- 108020003589 5' Untranslated Regions Proteins 0.000 description 16
- 108091026890 Coding region Proteins 0.000 description 16
- 238000009472 formulation Methods 0.000 description 16
- 230000008569 process Effects 0.000 description 16
- -1 threose nucleic acids Chemical class 0.000 description 16
- 101800000509 Non-structural protein 8 Proteins 0.000 description 15
- 101710144111 Non-structural protein 3 Proteins 0.000 description 14
- 102100021798 SH2 domain-containing protein 3C Human genes 0.000 description 14
- 108010067390 Viral Proteins Proteins 0.000 description 13
- 102000044126 RNA-Binding Proteins Human genes 0.000 description 12
- 125000005647 linker group Chemical group 0.000 description 12
- 239000008194 pharmaceutical composition Substances 0.000 description 12
- 101710144127 Non-structural protein 1 Proteins 0.000 description 11
- 102100031776 SH2 domain-containing protein 3A Human genes 0.000 description 11
- 125000003729 nucleotide group Chemical group 0.000 description 11
- 230000006798 recombination Effects 0.000 description 11
- 238000005215 recombination Methods 0.000 description 11
- 239000002773 nucleotide Substances 0.000 description 10
- 108020004705 Codon Proteins 0.000 description 9
- 241000725303 Human immunodeficiency virus Species 0.000 description 9
- 101710144128 Non-structural protein 2 Proteins 0.000 description 9
- 102100022648 Reticulon-2 Human genes 0.000 description 9
- 101800000578 Uridylate-specific endoribonuclease Proteins 0.000 description 9
- 239000012472 biological sample Substances 0.000 description 9
- 210000001519 tissue Anatomy 0.000 description 9
- 230000003827 upregulation Effects 0.000 description 9
- 150000002632 lipids Chemical class 0.000 description 8
- 239000013612 plasmid Substances 0.000 description 8
- 239000000243 solution Substances 0.000 description 8
- 101800001704 Guanine-N7 methyltransferase Proteins 0.000 description 7
- 101800001862 Proofreading exoribonuclease Proteins 0.000 description 7
- 101800002929 Proofreading exoribonuclease nsp14 Proteins 0.000 description 7
- 101710172711 Structural protein Proteins 0.000 description 7
- 235000001014 amino acid Nutrition 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 7
- 238000013518 transcription Methods 0.000 description 7
- 230000035897 transcription Effects 0.000 description 7
- 108020004463 18S ribosomal RNA Proteins 0.000 description 6
- 101710159080 Aconitate hydratase A Proteins 0.000 description 6
- 101710159078 Aconitate hydratase B Proteins 0.000 description 6
- 102100023949 Cytochrome c oxidase subunit NDUFA4 Human genes 0.000 description 6
- 102000004127 Cytokines Human genes 0.000 description 6
- 108090000695 Cytokines Proteins 0.000 description 6
- 102100038284 Cytospin-B Human genes 0.000 description 6
- 101000672024 Homo sapiens UDP-glucose:glycoprotein glucosyltransferase 1 Proteins 0.000 description 6
- 101710144118 Non-structural protein 6 Proteins 0.000 description 6
- 108700020471 RNA-Binding Proteins Proteins 0.000 description 6
- 101710105008 RNA-binding protein Proteins 0.000 description 6
- 238000011529 RT qPCR Methods 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- 102100040363 UDP-glucose:glycoprotein glucosyltransferase 1 Human genes 0.000 description 6
- 229940024606 amino acid Drugs 0.000 description 6
- 238000003556 assay Methods 0.000 description 6
- 230000033228 biological regulation Effects 0.000 description 6
- 230000008859 change Effects 0.000 description 6
- 230000001086 cytosolic effect Effects 0.000 description 6
- 108020001507 fusion proteins Proteins 0.000 description 6
- 102000037865 fusion proteins Human genes 0.000 description 6
- 238000004519 manufacturing process Methods 0.000 description 6
- 238000013507 mapping Methods 0.000 description 6
- 230000002438 mitochondrial effect Effects 0.000 description 6
- 238000003762 quantitative reverse transcription PCR Methods 0.000 description 6
- 230000010076 replication Effects 0.000 description 6
- 239000004055 small Interfering RNA Substances 0.000 description 6
- 238000001890 transfection Methods 0.000 description 6
- 108020005345 3' Untranslated Regions Proteins 0.000 description 5
- 241000282552 Chlorocebus aethiops Species 0.000 description 5
- 101001111225 Homo sapiens Cytochrome c oxidase subunit NDUFA4 Proteins 0.000 description 5
- 101000665882 Homo sapiens Retinol-binding protein 4 Proteins 0.000 description 5
- 108060001084 Luciferase Proteins 0.000 description 5
- 239000005089 Luciferase Substances 0.000 description 5
- 230000004988 N-glycosylation Effects 0.000 description 5
- 101710144121 Non-structural protein 5 Proteins 0.000 description 5
- 108090001074 Nucleocapsid Proteins Proteins 0.000 description 5
- 201000003176 Severe Acute Respiratory Syndrome Diseases 0.000 description 5
- 108020004566 Transfer RNA Proteins 0.000 description 5
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 5
- 102000000852 Tumor Necrosis Factor-alpha Human genes 0.000 description 5
- 108020000999 Viral RNA Proteins 0.000 description 5
- 150000001413 amino acids Chemical class 0.000 description 5
- 239000007859 condensation product Substances 0.000 description 5
- 235000014113 dietary fatty acids Nutrition 0.000 description 5
- 239000003814 drug Substances 0.000 description 5
- 210000002919 epithelial cell Anatomy 0.000 description 5
- 150000002148 esters Chemical class 0.000 description 5
- 239000000194 fatty acid Substances 0.000 description 5
- 229930195729 fatty acid Natural products 0.000 description 5
- 150000004665 fatty acids Chemical class 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 238000001476 gene delivery Methods 0.000 description 5
- 230000002068 genetic effect Effects 0.000 description 5
- 208000015181 infectious disease Diseases 0.000 description 5
- 230000002401 inhibitory effect Effects 0.000 description 5
- 210000004072 lung Anatomy 0.000 description 5
- 210000004962 mammalian cell Anatomy 0.000 description 5
- 239000000463 material Substances 0.000 description 5
- 239000003755 preservative agent Substances 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 230000001105 regulatory effect Effects 0.000 description 5
- 108020004418 ribosomal RNA Proteins 0.000 description 5
- 230000001225 therapeutic effect Effects 0.000 description 5
- 241000713704 Bovine immunodeficiency virus Species 0.000 description 4
- 241000713756 Caprine arthritis encephalitis virus Species 0.000 description 4
- 108700010070 Codon Usage Proteins 0.000 description 4
- 241000713730 Equine infectious anemia virus Species 0.000 description 4
- IAYPIBMASNFSPL-UHFFFAOYSA-N Ethylene oxide Chemical compound C1CO1 IAYPIBMASNFSPL-UHFFFAOYSA-N 0.000 description 4
- 241000713800 Feline immunodeficiency virus Species 0.000 description 4
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 4
- 101710125418 Major capsid protein Proteins 0.000 description 4
- 101710141454 Nucleoprotein Proteins 0.000 description 4
- 241000283966 Pholidota <mammal> Species 0.000 description 4
- 239000002202 Polyethylene glycol Substances 0.000 description 4
- 101800001255 Putative 2'-O-methyl transferase Proteins 0.000 description 4
- 108020005067 RNA Splice Sites Proteins 0.000 description 4
- 108091081021 Sense strand Proteins 0.000 description 4
- 241000713311 Simian immunodeficiency virus Species 0.000 description 4
- 239000013543 active substance Substances 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- 150000001875 compounds Chemical class 0.000 description 4
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 4
- 230000007423 decrease Effects 0.000 description 4
- 239000002552 dosage form Substances 0.000 description 4
- 230000003828 downregulation Effects 0.000 description 4
- 239000000839 emulsion Substances 0.000 description 4
- 239000003623 enhancer Substances 0.000 description 4
- 230000002708 enhancing effect Effects 0.000 description 4
- 210000003527 eukaryotic cell Anatomy 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 4
- 230000006698 induction Effects 0.000 description 4
- 238000001990 intravenous administration Methods 0.000 description 4
- 239000012528 membrane Substances 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000036961 partial effect Effects 0.000 description 4
- 239000000546 pharmaceutical excipient Substances 0.000 description 4
- 229920001223 polyethylene glycol Polymers 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 230000004044 response Effects 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 239000012096 transfection reagent Substances 0.000 description 4
- 239000003981 vehicle Substances 0.000 description 4
- 230000009385 viral infection Effects 0.000 description 4
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 3
- ZORQXIQZAOLNGE-UHFFFAOYSA-N 1,1-difluorocyclohexane Chemical compound FC1(F)CCCCC1 ZORQXIQZAOLNGE-UHFFFAOYSA-N 0.000 description 3
- 239000013607 AAV vector Substances 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 241000008904 Betacoronavirus Species 0.000 description 3
- 208000025721 COVID-19 Diseases 0.000 description 3
- 238000010453 CRISPR/Cas method Methods 0.000 description 3
- 241000494545 Cordyline virus 2 Species 0.000 description 3
- 241000702421 Dependoparvovirus Species 0.000 description 3
- 102100034583 Dolichyl-diphosphooligosaccharide-protein glycosyltransferase subunit 1 Human genes 0.000 description 3
- 241000196324 Embryophyta Species 0.000 description 3
- 108090000331 Firefly luciferases Proteins 0.000 description 3
- 101800003471 Helicase Proteins 0.000 description 3
- 101001118493 Homo sapiens Nuclear pore glycoprotein p62 Proteins 0.000 description 3
- 238000001276 Kolmogorov–Smirnov test Methods 0.000 description 3
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 3
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 3
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 3
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 3
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 3
- 241000254158 Lampyridae Species 0.000 description 3
- 108700011259 MicroRNAs Proteins 0.000 description 3
- 206010028980 Neoplasm Diseases 0.000 description 3
- 101800000510 Non-structural protein 7 Proteins 0.000 description 3
- 108090000163 Nuclear pore complex proteins Proteins 0.000 description 3
- 102000003789 Nuclear pore complex proteins Human genes 0.000 description 3
- 102100024057 Nuclear pore glycoprotein p62 Human genes 0.000 description 3
- 101710163270 Nuclease Proteins 0.000 description 3
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 3
- 208000037847 SARS-CoV-2-infection Diseases 0.000 description 3
- 229930006000 Sucrose Natural products 0.000 description 3
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 3
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 3
- 108091023045 Untranslated Region Proteins 0.000 description 3
- 230000004913 activation Effects 0.000 description 3
- 125000003275 alpha amino acid group Chemical group 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 239000007900 aqueous suspension Substances 0.000 description 3
- 210000004369 blood Anatomy 0.000 description 3
- 239000008280 blood Substances 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 3
- 239000003086 colorant Substances 0.000 description 3
- 208000035475 disorder Diseases 0.000 description 3
- 239000003995 emulsifying agent Substances 0.000 description 3
- LYCAIKOWRPUZTN-UHFFFAOYSA-N ethylene glycol Natural products OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 3
- 239000000796 flavoring agent Substances 0.000 description 3
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 3
- 210000005260 human cell Anatomy 0.000 description 3
- 230000028993 immune response Effects 0.000 description 3
- 239000004005 microsphere Substances 0.000 description 3
- 210000003470 mitochondria Anatomy 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 239000002105 nanoparticle Substances 0.000 description 3
- 230000037361 pathway Effects 0.000 description 3
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 3
- 229920000642 polymer Polymers 0.000 description 3
- 230000001124 posttranscriptional effect Effects 0.000 description 3
- 238000003259 recombinant expression Methods 0.000 description 3
- 108010074916 ribophorin Proteins 0.000 description 3
- 210000004708 ribosome subunit Anatomy 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 235000002639 sodium chloride Nutrition 0.000 description 3
- 239000002904 solvent Substances 0.000 description 3
- 239000005720 sucrose Substances 0.000 description 3
- 239000000725 suspension Substances 0.000 description 3
- 239000003765 sweetening agent Substances 0.000 description 3
- 230000009897 systematic effect Effects 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 230000014621 translational initiation Effects 0.000 description 3
- 230000032258 transport Effects 0.000 description 3
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 3
- 230000029812 viral genome replication Effects 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- 239000000080 wetting agent Substances 0.000 description 3
- 108091005957 yellow fluorescent proteins Proteins 0.000 description 3
- PUPZLCDOIYMWBV-UHFFFAOYSA-N (+/-)-1,3-Butanediol Chemical compound CC(O)CCO PUPZLCDOIYMWBV-UHFFFAOYSA-N 0.000 description 2
- 101800001779 2'-O-methyltransferase Proteins 0.000 description 2
- 101800003073 2'-O-methyltransferase nsp16 Proteins 0.000 description 2
- 102100026926 60S ribosomal protein L4 Human genes 0.000 description 2
- 102100035841 60S ribosomal protein L7 Human genes 0.000 description 2
- 244000215068 Acacia senegal Species 0.000 description 2
- 235000006491 Acacia senegal Nutrition 0.000 description 2
- 229930024421 Adenine Natural products 0.000 description 2
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 2
- 102000053723 Angiotensin-converting enzyme 2 Human genes 0.000 description 2
- 108090000975 Angiotensin-converting enzyme 2 Proteins 0.000 description 2
- 102100034613 Annexin A2 Human genes 0.000 description 2
- 108020005544 Antisense RNA Proteins 0.000 description 2
- 241000416162 Astragalus gummifer Species 0.000 description 2
- 208000023275 Autoimmune disease Diseases 0.000 description 2
- 102100032985 CCR4-NOT transcription complex subunit 7 Human genes 0.000 description 2
- 108091079001 CRISPR RNA Proteins 0.000 description 2
- 241000282556 Cercocebus atys Species 0.000 description 2
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 2
- 241000206602 Eukaryota Species 0.000 description 2
- XZWYTXMRWQJBGX-VXBMVYAYSA-N FLAG peptide Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 XZWYTXMRWQJBGX-VXBMVYAYSA-N 0.000 description 2
- 108010020195 FLAG peptide Proteins 0.000 description 2
- 241000282326 Felis catus Species 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 229920000084 Gum arabic Polymers 0.000 description 2
- 101800000355 Helicase nsp10 Proteins 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- 101000691203 Homo sapiens 60S ribosomal protein L4 Proteins 0.000 description 2
- 101000853617 Homo sapiens 60S ribosomal protein L7 Proteins 0.000 description 2
- 101000924474 Homo sapiens Annexin A2 Proteins 0.000 description 2
- 101000942580 Homo sapiens CCR4-NOT transcription complex subunit 7 Proteins 0.000 description 2
- 101000996563 Homo sapiens Nuclear pore complex protein Nup214 Proteins 0.000 description 2
- 101001007901 Homo sapiens Nuclear pore complex protein Nup88 Proteins 0.000 description 2
- 101000644174 Homo sapiens Uridine phosphorylase 1 Proteins 0.000 description 2
- 102000015696 Interleukins Human genes 0.000 description 2
- 108010063738 Interleukins Proteins 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 2
- 101710085938 Matrix protein Proteins 0.000 description 2
- 101710127721 Membrane protein Proteins 0.000 description 2
- 101800000933 Non-structural protein 10 Proteins 0.000 description 2
- 102100033819 Nuclear pore complex protein Nup214 Human genes 0.000 description 2
- 102100027586 Nuclear pore complex protein Nup88 Human genes 0.000 description 2
- 108091093037 Peptide nucleic acid Proteins 0.000 description 2
- 108091007412 Piwi-interacting RNA Proteins 0.000 description 2
- WCUXLLCKKVVCTQ-UHFFFAOYSA-M Potassium chloride Chemical compound [Cl-].[K+] WCUXLLCKKVVCTQ-UHFFFAOYSA-M 0.000 description 2
- 238000003559 RNA-seq method Methods 0.000 description 2
- 108010052090 Renilla Luciferases Proteins 0.000 description 2
- 241000283984 Rodentia Species 0.000 description 2
- 101000596353 Severe acute respiratory syndrome coronavirus 2 ORF7a protein Proteins 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- 102000039471 Small Nuclear RNA Human genes 0.000 description 2
- 108020003224 Small Nucleolar RNA Proteins 0.000 description 2
- 102000042773 Small Nucleolar RNA Human genes 0.000 description 2
- 108091027967 Small hairpin RNA Proteins 0.000 description 2
- 108020004459 Small interfering RNA Proteins 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- 229920001615 Tragacanth Polymers 0.000 description 2
- 108091028113 Trans-activating crRNA Proteins 0.000 description 2
- 108700019146 Transgenes Proteins 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 102100020892 Uridine phosphorylase 1 Human genes 0.000 description 2
- 101800001927 Uridylate-specific endoribonuclease nsp15 Proteins 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 208000036142 Viral infection Diseases 0.000 description 2
- 241000713325 Visna/maedi virus Species 0.000 description 2
- 235000010489 acacia gum Nutrition 0.000 description 2
- 239000004480 active ingredient Substances 0.000 description 2
- 229960000643 adenine Drugs 0.000 description 2
- 108091007433 antigens Proteins 0.000 description 2
- 102000036639 antigens Human genes 0.000 description 2
- 125000003118 aryl group Chemical group 0.000 description 2
- 230000008436 biogenesis Effects 0.000 description 2
- 238000001574 biopsy Methods 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 239000012876 carrier material Substances 0.000 description 2
- 125000002091 cationic group Chemical group 0.000 description 2
- 239000003184 complementary RNA Substances 0.000 description 2
- 229940104302 cytosine Drugs 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 239000003085 diluting agent Substances 0.000 description 2
- 239000002270 dispersing agent Substances 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 239000003937 drug carrier Substances 0.000 description 2
- 230000009977 dual effect Effects 0.000 description 2
- 230000002124 endocrine Effects 0.000 description 2
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 235000013355 food flavoring agent Nutrition 0.000 description 2
- 235000003599 food sweetener Nutrition 0.000 description 2
- 238000007306 functionalization reaction Methods 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- 150000004676 glycans Chemical class 0.000 description 2
- 239000008187 granular material Substances 0.000 description 2
- 208000019622 heart disease Diseases 0.000 description 2
- 230000006658 host protein synthesis Effects 0.000 description 2
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 2
- 238000010166 immunofluorescence Methods 0.000 description 2
- 230000002458 infectious effect Effects 0.000 description 2
- 230000002757 inflammatory effect Effects 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 230000002452 interceptive effect Effects 0.000 description 2
- 208000028774 intestinal disease Diseases 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 208000019423 liver disease Diseases 0.000 description 2
- 210000005265 lung cell Anatomy 0.000 description 2
- 239000006166 lysate Substances 0.000 description 2
- 108010026228 mRNA guanylyltransferase Proteins 0.000 description 2
- 229920002521 macromolecule Polymers 0.000 description 2
- HQKMJHAJHXVSDF-UHFFFAOYSA-L magnesium stearate Chemical compound [Mg+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O HQKMJHAJHXVSDF-UHFFFAOYSA-L 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 210000004779 membrane envelope Anatomy 0.000 description 2
- 239000000693 micelle Substances 0.000 description 2
- 239000002679 microRNA Substances 0.000 description 2
- 230000004770 neurodegeneration Effects 0.000 description 2
- 230000000926 neurological effect Effects 0.000 description 2
- 230000004134 neutrophil mediated immunity Effects 0.000 description 2
- 231100000252 nontoxic Toxicity 0.000 description 2
- 230000003000 nontoxic effect Effects 0.000 description 2
- 239000000346 nonvolatile oil Substances 0.000 description 2
- 210000004492 nuclear pore Anatomy 0.000 description 2
- 239000003921 oil Substances 0.000 description 2
- 235000019198 oils Nutrition 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 210000003463 organelle Anatomy 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 238000013081 phylogenetic analysis Methods 0.000 description 2
- 239000013600 plasmid vector Substances 0.000 description 2
- 238000003752 polymerase chain reaction Methods 0.000 description 2
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 description 2
- 239000000244 polyoxyethylene sorbitan monooleate Substances 0.000 description 2
- 229920001451 polypropylene glycol Polymers 0.000 description 2
- 229920001282 polysaccharide Polymers 0.000 description 2
- 239000005017 polysaccharide Substances 0.000 description 2
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 2
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- QELSKZZBTMNZEB-UHFFFAOYSA-N propylparaben Chemical compound CCCOC(=O)C1=CC=C(O)C=C1 QELSKZZBTMNZEB-UHFFFAOYSA-N 0.000 description 2
- 230000013587 protein N-linked glycosylation Effects 0.000 description 2
- 108020003175 receptors Proteins 0.000 description 2
- 102000005962 receptors Human genes 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000001177 retroviral effect Effects 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 108091029842 small nuclear ribonucleic acid Proteins 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 235000011069 sorbitan monooleate Nutrition 0.000 description 2
- 239000001593 sorbitan monooleate Substances 0.000 description 2
- 229940035049 sorbitan monooleate Drugs 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 230000000087 stabilizing effect Effects 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 239000000375 suspending agent Substances 0.000 description 2
- 208000024891 symptom Diseases 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- 241001430294 unidentified retrovirus Species 0.000 description 2
- 238000010200 validation analysis Methods 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 description 1
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropanoic acid Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- KIUKXJAPPMFGSW-DNGZLQJQSA-N (2S,3S,4S,5R,6R)-6-[(2S,3R,4R,5S,6R)-3-Acetamido-2-[(2S,3S,4R,5R,6R)-6-[(2R,3R,4R,5S,6R)-3-acetamido-2,5-dihydroxy-6-(hydroxymethyl)oxan-4-yl]oxy-2-carboxy-4,5-dihydroxyoxan-3-yl]oxy-5-hydroxy-6-(hydroxymethyl)oxan-4-yl]oxy-3,4,5-trihydroxyoxane-2-carboxylic acid Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](O[C@H]3[C@@H]([C@@H](O)[C@H](O)[C@H](O3)C(O)=O)O)[C@H](O)[C@@H](CO)O2)NC(C)=O)[C@@H](C(O)=O)O1 KIUKXJAPPMFGSW-DNGZLQJQSA-N 0.000 description 1
- LNAZSHAWQACDHT-XIYTZBAFSA-N (2r,3r,4s,5r,6s)-4,5-dimethoxy-2-(methoxymethyl)-3-[(2s,3r,4s,5r,6r)-3,4,5-trimethoxy-6-(methoxymethyl)oxan-2-yl]oxy-6-[(2r,3r,4s,5r,6r)-4,5,6-trimethoxy-2-(methoxymethyl)oxan-3-yl]oxyoxane Chemical compound CO[C@@H]1[C@@H](OC)[C@H](OC)[C@@H](COC)O[C@H]1O[C@H]1[C@H](OC)[C@@H](OC)[C@H](O[C@H]2[C@@H]([C@@H](OC)[C@H](OC)O[C@@H]2COC)OC)O[C@@H]1COC LNAZSHAWQACDHT-XIYTZBAFSA-N 0.000 description 1
- WRIDQFICGBMAFQ-UHFFFAOYSA-N (E)-8-Octadecenoic acid Natural products CCCCCCCCCC=CCCCCCCC(O)=O WRIDQFICGBMAFQ-UHFFFAOYSA-N 0.000 description 1
- DNIAPMSPPWPWGF-GSVOUGTGSA-N (R)-(-)-Propylene glycol Chemical compound C[C@@H](O)CO DNIAPMSPPWPWGF-GSVOUGTGSA-N 0.000 description 1
- JLPULHDHAOZNQI-ZTIMHPMXSA-N 1-hexadecanoyl-2-(9Z,12Z-octadecadienoyl)-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCC\C=C/C\C=C/CCCCC JLPULHDHAOZNQI-ZTIMHPMXSA-N 0.000 description 1
- IXPNQXFRVYWDDI-UHFFFAOYSA-N 1-methyl-2,4-dioxo-1,3-diazinane-5-carboximidamide Chemical compound CN1CC(C(N)=N)C(=O)NC1=O IXPNQXFRVYWDDI-UHFFFAOYSA-N 0.000 description 1
- IIZPXYDJLKNOIY-JXPKJXOSSA-N 1-palmitoyl-2-arachidonoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCC\C=C/C\C=C/C\C=C/C\C=C/CCCCC IIZPXYDJLKNOIY-JXPKJXOSSA-N 0.000 description 1
- 101150072531 10 gene Proteins 0.000 description 1
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical compound OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 description 1
- LQJBNNIYVWPHFW-UHFFFAOYSA-N 20:1omega9c fatty acid Natural products CCCCCCCCCCC=CCCCCCCCC(O)=O LQJBNNIYVWPHFW-UHFFFAOYSA-N 0.000 description 1
- CYDQOEWLBCCFJZ-UHFFFAOYSA-N 4-(4-fluorophenyl)oxane-4-carboxylic acid Chemical compound C=1C=C(F)C=CC=1C1(C(=O)O)CCOCC1 CYDQOEWLBCCFJZ-UHFFFAOYSA-N 0.000 description 1
- 102100033714 40S ribosomal protein S6 Human genes 0.000 description 1
- 102100037663 40S ribosomal protein S8 Human genes 0.000 description 1
- 102100021206 60S ribosomal protein L19 Human genes 0.000 description 1
- 102100035322 60S ribosomal protein L24 Human genes 0.000 description 1
- QSBYPNXLFMSGKH-UHFFFAOYSA-N 9-Heptadecensaeure Natural products CCCCCCCC=CCCCCCCCC(O)=O QSBYPNXLFMSGKH-UHFFFAOYSA-N 0.000 description 1
- 230000002407 ATP formation Effects 0.000 description 1
- 230000005607 ATP synthesis coupled electron transport Effects 0.000 description 1
- 108020005176 AU Rich Elements Proteins 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 241001655883 Adeno-associated virus - 1 Species 0.000 description 1
- 241000702423 Adeno-associated virus - 2 Species 0.000 description 1
- 241000202702 Adeno-associated virus - 3 Species 0.000 description 1
- 241000580270 Adeno-associated virus - 4 Species 0.000 description 1
- 241001634120 Adeno-associated virus - 5 Species 0.000 description 1
- 241000972680 Adeno-associated virus - 6 Species 0.000 description 1
- 241001164823 Adeno-associated virus - 7 Species 0.000 description 1
- 241000649045 Adeno-associated virus 10 Species 0.000 description 1
- 241000649046 Adeno-associated virus 11 Species 0.000 description 1
- 108091023037 Aptamer Proteins 0.000 description 1
- 241000239223 Arachnida Species 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 108010011485 Aspartame Proteins 0.000 description 1
- 241000910635 Bat betacoronavirus Species 0.000 description 1
- 102100026031 Beta-glucuronidase Human genes 0.000 description 1
- 102100036150 C-X-C motif chemokine 5 Human genes 0.000 description 1
- 108010040467 CRISPR-Associated Proteins Proteins 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 102000004657 Calcium-Calmodulin-Dependent Protein Kinase Type 2 Human genes 0.000 description 1
- 108010003721 Calcium-Calmodulin-Dependent Protein Kinase Type 2 Proteins 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 101710132601 Capsid protein Proteins 0.000 description 1
- 229920002101 Chitin Polymers 0.000 description 1
- 101710094648 Coat protein Proteins 0.000 description 1
- 102100031673 Corneodesmosin Human genes 0.000 description 1
- 101710139375 Corneodesmosin Proteins 0.000 description 1
- 102000000634 Cytochrome c oxidase subunit IV Human genes 0.000 description 1
- 101710153216 Cytochrome c oxidase subunit NDUFA4 Proteins 0.000 description 1
- 108090000365 Cytochrome-c oxidases Proteins 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 241000252212 Danio rerio Species 0.000 description 1
- 102100036912 Desmin Human genes 0.000 description 1
- 108010044052 Desmin Proteins 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 108010089072 Dolichyl-diphosphooligosaccharide-protein glycotransferase Proteins 0.000 description 1
- 108700040192 Drosophila pum Proteins 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 101710204837 Envelope small membrane protein Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 102100031562 Excitatory amino acid transporter 2 Human genes 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 102100039289 Glial fibrillary acidic protein Human genes 0.000 description 1
- 101710193519 Glial fibrillary acidic protein Proteins 0.000 description 1
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 229930186217 Glycolipid Natural products 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 description 1
- 101800002870 Helicase nsp13 Proteins 0.000 description 1
- 241000711549 Hepacivirus C Species 0.000 description 1
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical compound OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 description 1
- 208000005176 Hepatitis C Diseases 0.000 description 1
- 208000009889 Herpes Simplex Diseases 0.000 description 1
- 102100022823 Histone RNA hairpin-binding protein Human genes 0.000 description 1
- 101000656896 Homo sapiens 40S ribosomal protein S6 Proteins 0.000 description 1
- 101001097439 Homo sapiens 40S ribosomal protein S8 Proteins 0.000 description 1
- 101001105789 Homo sapiens 60S ribosomal protein L19 Proteins 0.000 description 1
- 101000660926 Homo sapiens 60S ribosomal protein L24 Proteins 0.000 description 1
- 101000933465 Homo sapiens Beta-glucuronidase Proteins 0.000 description 1
- 101000947186 Homo sapiens C-X-C motif chemokine 5 Proteins 0.000 description 1
- 101000866287 Homo sapiens Excitatory amino acid transporter 2 Proteins 0.000 description 1
- 101000825762 Homo sapiens Histone RNA hairpin-binding protein Proteins 0.000 description 1
- 101001111338 Homo sapiens Neurofilament heavy polypeptide Proteins 0.000 description 1
- 101000979333 Homo sapiens Neurofilament light polypeptide Proteins 0.000 description 1
- 101001121642 Homo sapiens Nucleoporin p54 Proteins 0.000 description 1
- 101001121636 Homo sapiens Nucleoporin p58/p45 Proteins 0.000 description 1
- 101000711369 Homo sapiens Probable ribosome biogenesis protein RLP24 Proteins 0.000 description 1
- 101000665452 Homo sapiens RNA binding protein fox-1 homolog 2 Proteins 0.000 description 1
- 101000893689 Homo sapiens Ras GTPase-activating protein-binding protein 1 Proteins 0.000 description 1
- 101000762128 Homo sapiens Tumor suppressor candidate 3 Proteins 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- 102000019223 Interleukin-1 receptor Human genes 0.000 description 1
- 108050006617 Interleukin-1 receptor Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 108090001030 Lipoproteins Proteins 0.000 description 1
- 102000004895 Lipoproteins Human genes 0.000 description 1
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 1
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 101710145006 Lysis protein Proteins 0.000 description 1
- 108091007767 MALAT1 Proteins 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- 238000000585 Mann–Whitney U test Methods 0.000 description 1
- 235000014435 Mentha Nutrition 0.000 description 1
- 241001072983 Mentha Species 0.000 description 1
- 101100136101 Mesocricetus auratus PENK gene Proteins 0.000 description 1
- 102100036837 Metabotropic glutamate receptor 2 Human genes 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 108060004795 Methyltransferase Proteins 0.000 description 1
- 241000202934 Mycoplasma pneumoniae Species 0.000 description 1
- HSHXDCVZWHOWCS-UHFFFAOYSA-N N'-hexadecylthiophene-2-carbohydrazide Chemical compound CCCCCCCCCCCCCCCCNNC(=O)c1cccs1 HSHXDCVZWHOWCS-UHFFFAOYSA-N 0.000 description 1
- 241000244206 Nematoda Species 0.000 description 1
- 102100024007 Neurofilament heavy polypeptide Human genes 0.000 description 1
- 102100023057 Neurofilament light polypeptide Human genes 0.000 description 1
- 101100473185 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) rpn-1 gene Proteins 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 101710138767 Non-structural glycoprotein 4 Proteins 0.000 description 1
- 101800000934 Non-structural protein 13 Proteins 0.000 description 1
- 108010066154 Nuclear Export Signals Proteins 0.000 description 1
- 102000043141 Nuclear RNA Human genes 0.000 description 1
- 108020003217 Nuclear RNA Proteins 0.000 description 1
- 102100025453 Nucleoporin p54 Human genes 0.000 description 1
- 102100025794 Nucleoporin p58/p45 Human genes 0.000 description 1
- 101710087110 ORF6 protein Proteins 0.000 description 1
- 239000005642 Oleic acid Substances 0.000 description 1
- ZQPPMHVWECSIRJ-UHFFFAOYSA-N Oleic acid Natural products CCCCCCCCC=CCCCCCCCC(O)=O ZQPPMHVWECSIRJ-UHFFFAOYSA-N 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 239000005662 Paraffin oil Substances 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 206010035664 Pneumonia Diseases 0.000 description 1
- 239000004372 Polyvinyl alcohol Substances 0.000 description 1
- 101710083689 Probable capsid protein Proteins 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 229940096437 Protein S Drugs 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 102000017742 Pumilio homology domains Human genes 0.000 description 1
- 108050005947 Pumilio homology domains Proteins 0.000 description 1
- 238000010357 RNA editing Methods 0.000 description 1
- 238000002123 RNA extraction Methods 0.000 description 1
- 238000012228 RNA interference-mediated gene silencing Methods 0.000 description 1
- 230000026279 RNA modification Effects 0.000 description 1
- 230000021839 RNA stabilization Effects 0.000 description 1
- 102100040854 Ras GTPase-activating protein-binding protein 1 Human genes 0.000 description 1
- 241000242739 Renilla Species 0.000 description 1
- 206010038802 Reticuloendothelial system stimulated Diseases 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- 108060007030 Ribulose-phosphate 3-epimerase Proteins 0.000 description 1
- 108091006197 SARS-CoV-2 Nucleocapsid Protein Proteins 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 102100031056 Serine protease 57 Human genes 0.000 description 1
- 101710197596 Serine protease 57 Proteins 0.000 description 1
- 101500023576 Severe acute respiratory syndrome coronavirus 2 Non-structural protein 8 Proteins 0.000 description 1
- 101000779242 Severe acute respiratory syndrome coronavirus 2 ORF3a protein Proteins 0.000 description 1
- 108091061750 Signal recognition particle RNA Proteins 0.000 description 1
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 1
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 1
- PRXRUNOAOLTIEF-ADSICKODSA-N Sorbitan trioleate Chemical class CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@@H](OC(=O)CCCCCCC\C=C/CCCCCCCC)[C@H]1OC[C@H](O)[C@H]1OC(=O)CCCCCCC\C=C/CCCCCCCC PRXRUNOAOLTIEF-ADSICKODSA-N 0.000 description 1
- 101710198474 Spike protein Proteins 0.000 description 1
- 241000295644 Staphylococcaceae Species 0.000 description 1
- 108091027544 Subgenomic mRNA Proteins 0.000 description 1
- 108050009621 Synapsin Proteins 0.000 description 1
- 102000001435 Synapsin Human genes 0.000 description 1
- 108091046869 Telomeric non-coding RNA Proteins 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 1
- 102100024248 Tumor suppressor candidate 3 Human genes 0.000 description 1
- 241000008908 Tylonycteris bat coronavirus HKU4 Species 0.000 description 1
- 101710198378 Uncharacterized 10.8 kDa protein in cox-rep intergenic region Proteins 0.000 description 1
- 101710095001 Uncharacterized protein in nifU 5'region Proteins 0.000 description 1
- 206010046865 Vaccinia virus infection Diseases 0.000 description 1
- 108010003533 Viral Envelope Proteins Proteins 0.000 description 1
- 241000726445 Viroids Species 0.000 description 1
- 108091093126 WHP Posttranscriptional Response Element Proteins 0.000 description 1
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 1
- WERKSKAQRVDLDW-ANOHMWSOSA-N [(2s,3r,4r,5r)-2,3,4,5,6-pentahydroxyhexyl] (z)-octadec-9-enoate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO WERKSKAQRVDLDW-ANOHMWSOSA-N 0.000 description 1
- DPXJVFZANSGRMM-UHFFFAOYSA-N acetic acid;2,3,4,5,6-pentahydroxyhexanal;sodium Chemical compound [Na].CC(O)=O.OCC(O)C(O)C(O)C(O)C=O DPXJVFZANSGRMM-UHFFFAOYSA-N 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 239000000443 aerosol Substances 0.000 description 1
- 230000004931 aggregating effect Effects 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 125000002947 alkylene group Chemical group 0.000 description 1
- HDTRYLNUVZCQOY-LIZSDCNHSA-N alpha,alpha-trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- 230000003466 anti-cipated effect Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 230000000840 anti-viral effect Effects 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 230000005735 apoptotic response Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 125000000613 asparagine group Chemical group N[C@@H](CC(N)=O)C(=O)* 0.000 description 1
- 239000000605 aspartame Substances 0.000 description 1
- IAOZJIPTCAWIRG-QWRGUYRKSA-N aspartame Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)OC)CC1=CC=CC=C1 IAOZJIPTCAWIRG-QWRGUYRKSA-N 0.000 description 1
- 235000010357 aspartame Nutrition 0.000 description 1
- 229960003438 aspartame Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 229920000249 biocompatible polymer Polymers 0.000 description 1
- 230000002715 bioenergetic effect Effects 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 239000012620 biological material Substances 0.000 description 1
- 210000000601 blood cell Anatomy 0.000 description 1
- 210000004204 blood vessel Anatomy 0.000 description 1
- 230000037396 body weight Effects 0.000 description 1
- 239000006172 buffering agent Substances 0.000 description 1
- 239000004067 bulking agent Substances 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 235000011148 calcium chloride Nutrition 0.000 description 1
- 210000000234 capsid Anatomy 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 239000001768 carboxy methyl cellulose Substances 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 230000033077 cellular process Effects 0.000 description 1
- 230000011994 cellular protein metabolic process Effects 0.000 description 1
- 210000003763 chloroplast Anatomy 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 230000001886 ciliary effect Effects 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 210000001072 colon Anatomy 0.000 description 1
- 238000002856 computational phylogenetic analysis Methods 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 235000008504 concentrate Nutrition 0.000 description 1
- 238000013270 controlled release Methods 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 230000001186 cumulative effect Effects 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 239000000412 dendrimer Substances 0.000 description 1
- 229920000736 dendritic polymer Polymers 0.000 description 1
- 210000005045 desmin Anatomy 0.000 description 1
- 239000010432 diamond Substances 0.000 description 1
- 238000012172 direct RNA sequencing Methods 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 239000003974 emollient agent Substances 0.000 description 1
- 239000002158 endotoxin Substances 0.000 description 1
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 101150014310 fem-3 gene Proteins 0.000 description 1
- 238000000684 flow cytometry Methods 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 230000009368 gene silencing by RNA Effects 0.000 description 1
- 238000001415 gene therapy Methods 0.000 description 1
- 108091006104 gene-regulatory proteins Proteins 0.000 description 1
- 102000034356 gene-regulatory proteins Human genes 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 210000005046 glial fibrillary acidic protein Anatomy 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 235000004554 glutamine Nutrition 0.000 description 1
- 229920000669 heparin Polymers 0.000 description 1
- 229960002897 heparin Drugs 0.000 description 1
- FBPFZTCFMRRESA-UHFFFAOYSA-N hexane-1,2,3,4,5,6-hexol Chemical compound OCC(O)C(O)C(O)C(O)CO FBPFZTCFMRRESA-UHFFFAOYSA-N 0.000 description 1
- 229920002674 hyaluronan Polymers 0.000 description 1
- 229960003160 hyaluronic acid Drugs 0.000 description 1
- 229920001477 hydrophilic polymer Polymers 0.000 description 1
- 235000010979 hydroxypropyl methyl cellulose Nutrition 0.000 description 1
- 239000001866 hydroxypropyl methyl cellulose Substances 0.000 description 1
- 229920003088 hydroxypropyl methyl cellulose Polymers 0.000 description 1
- 210000002865 immune cell Anatomy 0.000 description 1
- 239000007943 implant Substances 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 230000028709 inflammatory response Effects 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000002743 insertional mutagenesis Methods 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000034184 interaction with host Effects 0.000 description 1
- 229940079322 interferon Drugs 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- QXJSBBXBKPUZAA-UHFFFAOYSA-N isooleic acid Natural products CCCCCCCC=CCCCCCCCCC(O)=O QXJSBBXBKPUZAA-UHFFFAOYSA-N 0.000 description 1
- 238000005304 joining Methods 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 235000010445 lecithin Nutrition 0.000 description 1
- 239000000787 lecithin Substances 0.000 description 1
- 229940067606 lecithin Drugs 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 239000008206 lipophilic material Substances 0.000 description 1
- 239000002479 lipoplex Substances 0.000 description 1
- 229920006008 lipopolysaccharide Polymers 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 231100000053 low toxicity Toxicity 0.000 description 1
- 239000000314 lubricant Substances 0.000 description 1
- 238000003670 luciferase enzyme activity assay Methods 0.000 description 1
- 201000005296 lung carcinoma Diseases 0.000 description 1
- 239000012931 lyophilized formulation Substances 0.000 description 1
- 239000008176 lyophilized powder Substances 0.000 description 1
- 230000012976 mRNA stabilization Effects 0.000 description 1
- 235000019359 magnesium stearate Nutrition 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 230000028018 membrane docking Effects 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 108010038421 metabotropic glutamate receptor 2 Proteins 0.000 description 1
- 239000002923 metal particle Substances 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 229920000609 methyl cellulose Polymers 0.000 description 1
- 235000010981 methylcellulose Nutrition 0.000 description 1
- 239000001923 methylcellulose Substances 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- 239000002480 mineral oil Substances 0.000 description 1
- 235000010446 mineral oil Nutrition 0.000 description 1
- 230000005787 mitochondrial ATP synthesis coupled electron transport Effects 0.000 description 1
- 230000026326 mitochondrial transport Effects 0.000 description 1
- 238000002887 multiple sequence alignment Methods 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 229920005615 natural polymer Polymers 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 108091027963 non-coding RNA Proteins 0.000 description 1
- 102000042567 non-coding RNA Human genes 0.000 description 1
- 230000030147 nuclear export Effects 0.000 description 1
- 238000001668 nucleic acid synthesis Methods 0.000 description 1
- 230000006849 nucleocytoplasmic transport Effects 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- GYCKQBWUSACYIF-UHFFFAOYSA-N o-hydroxybenzoic acid ethyl ester Natural products CCOC(=O)C1=CC=CC=C1O GYCKQBWUSACYIF-UHFFFAOYSA-N 0.000 description 1
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N oleic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 description 1
- 229920001542 oligosaccharide Polymers 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 210000002220 organoid Anatomy 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 239000003002 pH adjusting agent Substances 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 230000002023 papillomaviral effect Effects 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 239000002304 perfume Substances 0.000 description 1
- 230000003094 perturbing effect Effects 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 230000018127 platelet degranulation Effects 0.000 description 1
- 229920002627 poly(phosphazenes) Polymers 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 229920000058 polyacrylate Polymers 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920002721 polycyanoacrylate Polymers 0.000 description 1
- 229920000575 polymersome Polymers 0.000 description 1
- 229920002635 polyurethane Polymers 0.000 description 1
- 239000004814 polyurethane Substances 0.000 description 1
- 229920002451 polyvinyl alcohol Polymers 0.000 description 1
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 1
- 239000001103 potassium chloride Substances 0.000 description 1
- 235000011164 potassium chloride Nutrition 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 230000001376 precipitating effect Effects 0.000 description 1
- 230000002335 preservative effect Effects 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 230000000069 prophylactic effect Effects 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- 230000006916 protein interaction Effects 0.000 description 1
- 230000009145 protein modification Effects 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 230000009711 regulatory function Effects 0.000 description 1
- 230000027756 respiratory electron transport chain Effects 0.000 description 1
- 230000031070 response to heat Effects 0.000 description 1
- 230000003938 response to stress Effects 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- CVHZOJJKTDOEJC-UHFFFAOYSA-N saccharin Chemical compound C1=CC=C2C(=O)NS(=O)(=O)C2=C1 CVHZOJJKTDOEJC-UHFFFAOYSA-N 0.000 description 1
- 235000019204 saccharin Nutrition 0.000 description 1
- 229940081974 saccharin Drugs 0.000 description 1
- 239000000901 saccharin and its Na,K and Ca salt Substances 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 239000000377 silicon dioxide Substances 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000001632 sodium acetate Substances 0.000 description 1
- 235000017281 sodium acetate Nutrition 0.000 description 1
- 235000010413 sodium alginate Nutrition 0.000 description 1
- 239000000661 sodium alginate Substances 0.000 description 1
- 229940005550 sodium alginate Drugs 0.000 description 1
- 235000019812 sodium carboxymethyl cellulose Nutrition 0.000 description 1
- 229920001027 sodium carboxymethylcellulose Polymers 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 239000001540 sodium lactate Substances 0.000 description 1
- 229940005581 sodium lactate Drugs 0.000 description 1
- 235000011088 sodium lactate Nutrition 0.000 description 1
- 235000019333 sodium laurylsulphate Nutrition 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 229940083466 soybean lecithin Drugs 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 230000005531 stress granule assembly Effects 0.000 description 1
- 238000000547 structure data Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 229960004793 sucrose Drugs 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 229920001059 synthetic polymer Polymers 0.000 description 1
- 235000020357 syrup Nutrition 0.000 description 1
- 239000006188 syrup Substances 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 230000002463 transducing effect Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000011282 treatment Methods 0.000 description 1
- 230000007306 turnover Effects 0.000 description 1
- 238000001521 two-tailed test Methods 0.000 description 1
- 238000009281 ultraviolet germicidal irradiation Methods 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 208000007089 vaccinia Diseases 0.000 description 1
- 235000015112 vegetable and seed oil Nutrition 0.000 description 1
- 239000008158 vegetable oil Substances 0.000 description 1
- 229920002554 vinyl polymer Polymers 0.000 description 1
- 230000007502 viral entry Effects 0.000 description 1
- 230000010464 virion assembly Effects 0.000 description 1
- 230000006394 virus-host interaction Effects 0.000 description 1
- 239000012130 whole-cell lysate Substances 0.000 description 1
- 229910052725 zinc Inorganic materials 0.000 description 1
- 239000011701 zinc Substances 0.000 description 1
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/20011—Coronaviridae
- C12N2770/20022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
Definitions
- SARS-CoV-2 nucleocapsid protein interactome comprises many host RNA processing machinery proteins and stress granule proteins, suggesting a potential role in interfering with host RNA processing and driving stress granule formation.
- a majority of the viral proteins were found to associate with host RNA binding proteins (RBPs), suggesting a possibility that SARS-CoV-2 proteins interact with the host transcriptome to a greater degree than previously anticipated.
- RBPs host RNA binding proteins
- a comprehensive interrogation of SARS-CoV-2 viral protein-host RNA interactions and how the virus hijacks host cellular machinery for its replication while it suppresses host gene expression is still lacking.
- RNA recognition complexes comprising: (a) an RNA-targeting agent; and (b) a coronavirus-derived protein.
- the RNA recognition complex further comprises a linker.
- the RNA-targeting agent comprises CRISPR/Cas9 components. In some embodiments, the RNA-targeting agent comprises an RNA-targeting Cas effector. In some embodiments, the RNA-targeting Cas effector comprises a Cas9 protein, a Cas13b protein, or a Cas13d protein. In some embodiments, the RNA-targeting Cas effector comprises a nuclease dead Cas9 (dCas9) protein. In some embodiments, the RNA-targeting Cas effector comprises a Cas13b protein. In some embodiments, the RNA-targeting Cas effector comprises a Cas13d protein.
- the RNA-targeting agent comprises a PUF protein. In some embodiments, the RNA-targeting agent comprises a pentatricopeptide repeat (PPR) protein.
- PPR pentatricopeptide repeat
- the RNA-targeting agent further comprises a single guide RNA (sgRNA), wherein the sgRNA is targeted to an individual gene of a cell.
- sgRNA single guide RNA
- the sgRNA is selected from a group consisting of SEQ ID NOs: 1-7.
- the coronavirus-derived protein comprises a SARS-CoV-2 protein. In some embodiments, the coronavirus-derived protein comprises an NSP1, an NSP2, an NSP3, an NSP6, an NSP12, an NSP14, an ORF3b, an ORF7b, or an ORF9c protein.
- Also provided herein are methods of upregulating gene expression of a target RNA comprising: delivering an RNA recognition complex into a cell, wherein the RNA recognition complex comprises an RNA-targeting agent and a coronavirus-derived protein, and wherein the RNA recognition complex binds to the target RNA and upregulates gene expression of the target RNA in the cell.
- Also provided herein are methods of modulating gene expression of a target RNA comprising: delivering an RNA recognition complex into a cell, wherein the RNA recognition complex comprises an RNA-targeting agent and a coronavirus-derived protein, and wherein the RNA recognition complex binds to the target RNA and modulates gene expression of the target RNA in the cell.
- the method further comprises profiling the gene expression of the target RNA in the cell, wherein the gene expression is upregulated.
- the coronavirus-derived protein comprises a SARS-CoV-2 protein. In some embodiments, the coronavirus-derived protein comprises an NSP1, an NSP2, an NSP3, an NSP6, an NSP12, an NSP14, an ORF3b, an ORF7b, or an ORF9c protein. In some embodiments, the method further comprises profiling the gene expression of the target RNA in the cell, wherein the gene expression is downregulated. In some embodiments, the coronavirus-derived protein comprises an NSP9 protein.
- the profiling comprises transcriptome analysis or gene expression analysis. In some embodiments, the profiling comprises enhanced cross-linking immunoprecipitation (eCLIP).
- eCLIP enhanced cross-linking immunoprecipitation
- the RNA-targeting agent comprises CRISPR/Cas9 components. In some embodiments, the RNA-targeting agent comprises an RNA-targeting Cas effector. In some embodiments, the RNA-targeting Cas effector comprises a Cas9 protein, a Cas13b protein, or a Cas13d protein. In some embodiments, the RNA-targeting Cas effector comprises a nuclease dead Cas9 (dCas9) protein. In some embodiments, the RNA-targeting Cas effector comprises a Cas13b protein. In some embodiments, the RNA-targeting Cas effector comprises a Cas13d protein.
- the RNA-targeting agent comprises a PUF protein. In some embodiments, the RNA-targeting agent comprises a pentatricopeptide repeat (PPR) protein.
- the RNA-targeting agent further comprises a single guide RNA (sgRNA), wherein the sgRNA is targeted to the target RNA in the cell.
- the sgRNA is selected from a group consisting of SEQ ID NOs: 1-7.
- the RNA-targeting agent comprises CRISPR/Cas9 components. In some embodiments, the RNA-targeting agent comprises an RNA-targeting Cas effector. In some embodiments, the RNA-targeting Cas effector comprises a Cas9 protein, a Cas13b protein, or a Cas13d protein. In some embodiments, the RNA-targeting Cas effector comprises a nuclease dead Cas9 (dCas9) protein. In some embodiments, the RNA-targeting Cas effector comprises a Cas13b protein. In some embodiments, the RNA-targeting Cas effector comprises a Cas13d protein.
- the RNA-targeting agent comprises a PUF protein. In some embodiments, the RNA-targeting agent comprises a pentatricopeptide repeat (PPR) protein. In some embodiments, the RNA-targeting agent further comprises a single guide RNA (sgRNA), wherein the sgRNA is targeted to the target RNA in the cell. In some embodiments, the sgRNA is selected from a group consisting of SEQ ID NOs: 1-7.
- the coronavirus-derived protein comprises a SARS-CoV-2 protein. In some embodiments, the coronavirus-derived protein comprises an NSP1, an NSP2, an NSP3, an NSP6, an NSP12, an NSP14, an ORF3b, an ORF7b, or an ORF9c protein.
- the RNA-targeting agent comprises a sequence which is complementary to a target RNA sequence. In some embodiments, the RNA-targeting agent complementary sequence is at least 98% complementary to a target RNA sequence. In some embodiments, the RNA-targeting agent complementary sequence is at least 95% complementary to a target RNA sequence.
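The 98% and 95% complementarity thresholds can be checked with a short calculation. The following is a minimal sketch, assuming equal-length sequences written 5' to 3', strict Watson-Crick pairing (G-U wobble pairs are not counted), and hypothetical example sequences; the function names are illustrative and do not come from the disclosure.

```python
# Watson-Crick pairing partners for RNA bases (assumption: wobble pairs ignored).
PAIRS = {"A": "U", "U": "A", "G": "C", "C": "G"}


def percent_complementarity(guide: str, target: str) -> float:
    """Return the percentage of guide positions that pair with the target.

    Both sequences are written 5'->3' and must have equal length. The guide
    is compared against the target read backwards because paired strands
    run antiparallel.
    """
    guide, target = guide.upper(), target.upper()
    if not guide or len(guide) != len(target):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(PAIRS.get(g) == t for g, t in zip(guide, reversed(target)))
    return 100.0 * matches / len(guide)


def meets_threshold(guide: str, target: str, minimum: float = 95.0) -> bool:
    """Check a guide against a minimum percent-complementarity cutoff."""
    return percent_complementarity(guide, target) >= minimum


# Hypothetical 20-nt target site, its perfect complement, and a guide
# whose final base is mismatched.
target_site = "AUGGCUAGCUAGGCUAACGU"
perfect_guide = "ACGUUAGCCUAGCUAGCCAU"
one_mismatch = "ACGUUAGCCUAGCUAGCCAA"
```

With a 20-nt sequence, a single mismatch leaves 19 of 20 positions paired, which satisfies the 95% threshold but not the 98% threshold.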
- FIG. 1a shows a schematic of eCLIP performed on SARS-CoV-2 proteins in virus-infected Vero E6 cells.
- Proteins in infected cells are UV crosslinked to bound transcripts, which are immunoprecipitated (IP) with antibodies that recognize NSP8, NSP12 and N proteins.
- IP immunoprecipitated
- Protein-RNA IP product and Input lysate are resolved by SDS-PAGE and membrane transferred, followed by band excision at the estimated protein size to 75kDa above in both IP and Input lanes.
- Excised bands are subsequently purified, and library barcoded for Illumina sequencing. Sequenced reads are mapped to the hg19 human genome (GCF_000001405.13).
- FIG. 1c is a stacked bar plot showing TPM of reads mapped to the Vero E6 genome or SARS-CoV-2 genome in each of NSP12, NSP8 and N eCLIP.
- FIG. 1d is a Venn diagram showing number of African Green Monkey (host) genes targeted by NSP8 and NSP12.
- FIG. 1e shows eCLIP read density mapped to the SARS-CoV-2 genome on both the positive (top) and negative (bottom) sense strand.
- FIG. 1f shows predicted secondary structure of the sequence from the NSP12 peak mapped to the C-terminal of NSP3.
- FIG. 1g shows RNA-seq read density plot from SARS-CoV-2 infected A549-ACE2 cells mapping sequenced reads to the positive (top, blue) and negative (bottom, light blue) sense strand of SARS-CoV-2 genome.
- FIG. 1h shows phylogenetic tree analysis of complete genomes of representative betacoronavirus from NCBI reference sequences and bat and pangolin coronavirus sequences from GISAID.
- FIG. 1i shows predicted recombination events of SARS-CoV-2 from phylogenetic analysis, with line plot indicating significance (-log10(P-value)) of predicted recombination breakpoints across the SARS-CoV-2 genome.
- FIG. 2a shows a schematic showing SARS-CoV-2 proteins individually tagged and expressed in human lung epithelial cells BEAS-2B to assay with eCLIP.
- ENCODE eCLIP data for example human RNA-binding proteins (hRBPs) are included for comparison.
- FIG. 2c is a Venn diagram showing the number of coding genes expressed at TPM>1.0 in Vero E6 and BEAS-2B cells as targeted by NSP12 with significant peaks (p < 0.001, >4-fold enrichment).
- FIG. 2d shows Circos plot mapping SARS-CoV-2 proteins to top five enriched Gene Ontology terms of host transcripts.
- FIG. 2e shows example sequence logos generated from all IDR peak reads for each SARS-CoV-2 eCLIP, with p-value indicated above each logo.
- FIG. 2f shows example genome browser tracks for NSP3, NSP12, N and NSP2 mapping to DYNCH1, TUSC3, CXCL5 and NAP1L4, respectively.
- FIG. 3a shows stacked bar plot showing fraction of reproducible peaks (by IDR14) mapping to different regions of coding genes.
- FIG. 3b shows example metagene profiles for NSP3, NSP12 and N. Mean of read density for each replicate data is shown as a solid line, with shaded regions indicating the 95% confidence interval.
- FIG. 3c shows a schematic showing the Renilla-MS2 and Firefly dual luciferase reporter constructs, where individual SARS-CoV-2 proteins fused to MCP are recruited to the Renilla-MS2 mRNA.
- FIGs. 3d and 3e show bar plots showing luciferase reporter activity ratios (FIG. 3d) and reporter RT-qPCR ratios (FIG. 3e) for the indicated coexpressed SARS-CoV-2 protein, known human regulators of RNA stability (CNOT7, BOLL) and negative control (FLAG peptide).
- FIG. 3f shows a bar plot showing the fold change of luciferase activity ratio and RT-qPCR ratio.
- FIGs. 3g and 3h show line plots of the fold enrichment of eCLIP read coverage at each position on rRNAs for NSP1 (FIG. 3g, blue) and ORF9c (FIG. 3h, blue), and the mean of 446 other RBPs deposited in the ENCODE consortium (grey; https://www.encodeproject.org/, accession code ENCSR456FVU) on 18S and 28S rRNAs (lightly shaded areas indicate 10-90% confidence intervals).
- FIGs. 3i and 3j show quantitative flow cytometry reporter assay for targeted translation activation using RCas9-fused ORF9c.
- FIG. 4a shows a cumulative distribution function (CDF) plot showing the distribution of proteomics data from Bojkova et al2 of log2(fold change) of host genes in SARS-CoV-2 infected vs. uninfected cells, for genes whose RNAs are not interacting with SARS-CoV-2 proteins, all eCLIP target genes (peak p < 10^-3, >8-fold enrichment), genes targeted by NSP12 (peak p < 10^-3, >8-fold enrichment), and genes targeted by NSP12 with highly significant peaks (peak p < 10^-7, >8-fold enrichment).
- P values are from KS test of the equality of log2(fold change) of each subset of eCLIP target genes to the untargeted genes.
- FIG. 4b shows top 10 Gene Ontology terms of NSP12 target genes.
- FIG. 4c shows a map of NSP12 target genes (blue boxes connected by red edges to yellow box at center), clustered by top GO terms. Grey edges are human protein-protein interaction data from Mentha. Dark blue frames indicate genes used in subsequent validation.
- FIG. 4d shows box plot showing quartiles of log2(fold change) protein levels of NSP12 target genes from proteomics data grouped by the GO term classification. Mann-Whitney U-test p- values indicated above each box compares the log2(fold change) of each subset of NSP12 target genes to all NSP12 target genes (red). Diamonds represent outliers, dots represent individual proteins.
- FIG. 4e shows a schematic illustrating the hypothesis of NSP12 interacting with host mRNAs to upregulate the expression of target genes in mitochondrial and N-linked glycosylation processes.
- FIG. 4f shows genome browser tracks of NSP12 eCLIP enriched RNA mapped to UGGT1, NDUFA4 and RPN1.
- FIG. 4g shows western blots showing expression levels of UGGT1, NDUFA4 and RPN1, with β-actin as loading control, from GFP or NSP12 transfected HEK293T cells.
- FIG. 4h shows immunofluorescence images (40X) of SARS-CoV-2 infected A549-ACE2 cells stained for SARS-CoV-2 NSP8 (red), endogenous genes (green), DNA content (blue).
- FIG. 4i shows a bar plot showing mean relative fluorescence intensities of cells from FIG. 4h, dots represent segmented individual cells.
- FIG. 5a shows a schematic illustrating NSP9 interacting with nuclear pore complex proteins NUP62, NUP214, NUP58, NUP88 and NUP541.
- FIG. 5b shows a schematic showing the hypothesis of NSP9 inhibiting mRNA nucleocytoplasmic transport.
- FIG. 5c shows genome browser tracks of NSP9 eCLIP target RNA mapped to IL-1α, IL-1β, ANXA2 and UPP1.
- FIG. 5e shows a bar plot showing mean concentration of IL-1α in culture media from WT and NSP9 expressing BEAS-2B cells, 48h after induction by cytokines indicated on the x-axis.
- FIG. 5f shows a bar plot showing mean concentration of IL-1α in culture media from WT and NSP9 expressing BEAS-2B cells, 48h after induction by different levels of TNFα.
- FIG. 6 shows a schematic illustrating the complex host-viral relationship.
- Flat-ended arrows indicate downregulation, pointed arrows indicate upregulation.
- Blue arrows are newly proposed interactions.
- This disclosure describes RNA recognition complexes and methods of modulating gene expression of a target RNA by delivering the RNA recognition complex into a cell.
- biological sample can refer to a sample generally including cells and/or other biological material.
- a biological sample can be obtained from non-mammalian organisms (e.g., a plant, an insect, an arachnid, a nematode), a fungus, an amphibian, or a fish (e.g., zebrafish).
- a biological sample can be obtained from a prokaryote such as a bacterium, e.g., Escherichia coli, Staphylococci or Mycoplasma pneumoniae; an archaeon; a virus such as Hepatitis C virus or human immunodeficiency virus; or a viroid.
- a biological sample can be obtained from a eukaryote, such as a patient derived organoid (PDO) or patient derived xenograft (PDX).
- Biological samples can be derived from a homogeneous culture or population of organisms or alternatively from a collection of several different organisms, for example, in a community or ecosystem.
- the biological sample can include any number of macromolecules, for example, cellular macromolecules and organelles (e.g., mitochondria and nuclei).
- the biological sample can be a nucleic acid sample and/or protein sample.
- the biological sample can be a carbohydrate sample or a lipid sample.
- the biological sample can be obtained as a tissue sample, such as a tissue section, biopsy, a core biopsy, needle aspirate, or fine needle aspirate.
- the sample can be a fluid sample, such as a blood sample, urine sample, or saliva sample.
- the sample can be a skin sample, a colon sample, a cheek swab, a histology sample, a histopathology sample, a plasma or serum sample, a tumor sample, living cells, cultured cells, a clinical sample such as, for example, whole blood or blood-derived products, blood cells, or cultured tissues or cells, including cell suspensions.
- a “cell” can refer to either a prokaryotic or eukaryotic cell, optionally obtained from a subject or a commercially available source.
- delivering can refer to the introduction of an exogenous polynucleotide into a host cell, irrespective of the method used for the introduction.
- Such methods include a variety of well-known techniques such as vector-mediated gene transfer (e.g., viral infection/transfection, or various other protein-based or lipid-based gene delivery complexes) as well as techniques facilitating the delivery of “naked” polynucleotides (e.g., electroporation, “gene gun” delivery and various other techniques used for the introduction of polynucleotides).
- the introduced polynucleotide may be stably or transiently maintained in the host cell.
- Stable maintenance typically requires that the introduced polynucleotide either contains an origin of replication compatible with the host cell or integrates into a replicon of the host cell such as an extrachromosomal replicon (e.g., a plasmid) or a nuclear or mitochondrial chromosome.
- a polynucleotide can be inserted into a host cell by a gene delivery molecule.
- gene delivery molecules can include, but are not limited to, liposomes; micelles; biocompatible polymers, including natural polymers and synthetic polymers; lipoproteins; polypeptides; polysaccharides; lipopolysaccharides; artificial viral envelopes; metal particles; and bacteria, or viruses, such as baculovirus, adenovirus and retrovirus, bacteriophage, cosmid, plasmid, fungal vectors and other recombination vehicles typically used in the art which have been described for expression in a variety of eukaryotic and prokaryotic hosts, and may be used for gene therapy as well as for simple protein expression.
- encode refers to a polynucleotide which is said to “encode” a polypeptide if, in its native state or when manipulated by methods well known to those skilled in the art, it can be transcribed and/or translated to produce the mRNA for the polypeptide and/or a fragment thereof.
- the antisense strand is the complement of such a nucleic acid, and the encoding sequence can be deduced therefrom.
- exogenous refers to any material introduced from or originating from outside a cell, a tissue or an organism that is not produced by or does not originate from the same cell, tissue, or organism in which it is being introduced.
- expression refers to the process by which polynucleotides are transcribed into mRNA and/or the process by which the transcribed mRNA is subsequently translated into peptides, polypeptides, or proteins.
- expression may include splicing of the mRNA in a eukaryotic cell.
- the expression level of a gene may be determined by measuring the amount of mRNA or protein in a cell or tissue sample; further, the expression level of multiple genes can be determined to establish an expression profile for a particular sample.
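An expression profile of this kind is often reported in transcripts per million (TPM), the unit used in the figure descriptions above (for example, the TPM>1.0 cutoff in FIG. 2c). The following is a minimal sketch of the standard TPM calculation; the gene names, read counts and transcript lengths are hypothetical and are not data from the disclosure.

```python
from typing import Dict, List


def tpm(read_counts: Dict[str, float], lengths_kb: Dict[str, float]) -> Dict[str, float]:
    """Convert raw read counts to transcripts per million (TPM)."""
    # Dividing by length corrects for longer transcripts collecting more reads.
    rates = {gene: read_counts[gene] / lengths_kb[gene] for gene in read_counts}
    total = sum(rates.values())
    if total == 0:
        raise ValueError("no reads were assigned to any gene")
    # Scaling so that all values sum to one million makes samples comparable.
    return {gene: rate / total * 1e6 for gene, rate in rates.items()}


def expressed_genes(profile: Dict[str, float], threshold: float = 1.0) -> List[str]:
    """Return the sorted names of genes whose TPM exceeds the threshold."""
    return sorted(gene for gene, value in profile.items() if value > threshold)


# Hypothetical counts and transcript lengths (in kilobases).
counts = {"geneA": 100, "geneB": 300, "geneC": 0}
lengths = {"geneA": 1.0, "geneB": 3.0, "geneC": 2.0}
profile = tpm(counts, lengths)
```

Here geneB has three times the reads of geneA but is three times longer, so both receive the same TPM, while geneC falls below the expression threshold.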
- nucleic acid is used to include any compound and/or substance that comprises a polymer of nucleotides.
- polymers of nucleotides are referred to as polynucleotides.
- Exemplary nucleic acids or polynucleotides can include, but are not limited to, ribonucleic acids (RNAs), deoxyribonucleic acids (DNAs), threose nucleic acids (TNAs), glycol nucleic acids (GNAs), peptide nucleic acids (PNAs), locked nucleic acids (LNAs, including LNA having a β-D-ribo configuration, α-LNA having an α-L-ribo configuration (a diastereomer of LNA), 2’-amino-LNA having a 2’-amino functionalization, and 2’-amino-α-LNA having a 2’-amino functionalization) or hybrids thereof.
- Naturally occurring nucleic acids generally have a deoxyribose sugar (e.g., found in deoxyribonucleic acid (DNA)) or a ribose sugar (e.g., found in ribonucleic acid (RNA)).
- a nucleic acid can contain nucleotides having any of a variety of analogs of these sugar moieties that are known in the art.
- a deoxyribonucleic acid (DNA) can have one or more bases selected from the group consisting of adenine (A), thymine (T), cytosine (C), or guanine (G), and a ribonucleic acid (RNA) can have one or more bases selected from the group consisting of uracil (U), adenine (A), cytosine (C), or guanine (G).
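The two base alphabets described above can be used to label a sequence and to rewrite a DNA sequence as RNA. The following is a minimal sketch; the function names are illustrative, and modified or analog bases are deliberately not handled.

```python
DNA_BASES = frozenset("ATCG")
RNA_BASES = frozenset("UACG")


def classify_sequence(seq: str) -> str:
    """Label a sequence 'DNA', 'RNA', 'DNA or RNA', or 'unknown' by its bases."""
    letters = set(seq.upper())
    if not letters:
        return "unknown"
    is_dna = letters <= DNA_BASES
    is_rna = letters <= RNA_BASES
    if is_dna and is_rna:
        # Neither T nor U is present, so the alphabet alone cannot decide.
        return "DNA or RNA"
    if is_dna:
        return "DNA"
    if is_rna:
        return "RNA"
    return "unknown"


def dna_to_rna(seq: str) -> str:
    """Rewrite a DNA sequence as RNA by replacing thymine (T) with uracil (U)."""
    seq = seq.upper()
    if not set(seq) <= DNA_BASES:
        raise ValueError("sequence contains non-DNA characters")
    return seq.replace("T", "U")
```

For example, `classify_sequence("AUGC")` returns `"RNA"` and `dna_to_rna("ATGGCT")` returns `"AUGGCU"`.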
- nucleic acid refers to a deoxyribonucleic acid (DNA) or ribonucleic acid (RNA), or a combination thereof, in either a single- or double-stranded form. Unless specifically limited, the term encompasses nucleic acids containing known analogues of natural nucleotides that have similar binding properties as the reference nucleotides. Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses complementary sequences as well as the sequence explicitly indicated. In some embodiments of any of the isolated nucleic acids described herein, the isolated nucleic acid is DNA. In some embodiments of any of the isolated nucleic acids described herein, the isolated nucleic acid is RNA.
- Modifications can be introduced into a nucleotide sequence by standard techniques known in the art, such as site-directed mutagenesis and polymerase chain reaction (PCR)-mediated mutagenesis.
- Conservative amino acid substitutions are ones in which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art.
- amino acids with basic side chains (e.g., arginine, lysine and histidine)
- acidic side chains (e.g., aspartic acid and glutamic acid)
- uncharged polar side chains (e.g., asparagine, cysteine, glutamine, glycine, serine, threonine, tyrosine, and tryptophan)
- nonpolar side chains (e.g., alanine, isoleucine, leucine, methionine, phenylalanine, proline, and valine)
- beta-branched side chains (e.g., isoleucine, threonine, and valine)
- aromatic side chains (e.g., histidine, phenylalanine, tryptophan, and tyrosine)
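The side-chain families listed above can be encoded as a lookup table to test whether a given substitution is conservative. The following is a minimal sketch using one-letter amino acid codes; the family membership follows the lists above, while the function names and the rule that any shared family counts as conservative are assumptions made for illustration.

```python
from typing import List

# One-letter codes grouped by the side-chain families listed above.
SIDE_CHAIN_FAMILIES = {
    "basic": set("RKH"),
    "acidic": set("DE"),
    "uncharged polar": set("NCQGSTYW"),
    "nonpolar": set("AILMFPV"),
    "beta-branched": set("ITV"),
    "aromatic": set("HFWY"),
}


def shared_families(original: str, replacement: str) -> List[str]:
    """Return the families that contain both residues."""
    a, b = original.upper(), replacement.upper()
    return [
        name
        for name, members in SIDE_CHAIN_FAMILIES.items()
        if a in members and b in members
    ]


def is_conservative(original: str, replacement: str) -> bool:
    """Treat a change as conservative if the two residues share a family."""
    if original.upper() == replacement.upper():
        return False  # not a substitution at all
    return bool(shared_families(original, replacement))
```

Under this sketch, lysine to arginine (`K` to `R`) is conservative because both are basic, whereas aspartic acid to lysine (`D` to `K`) is not.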
- nucleotide sequence encoding a protein includes all nucleotide sequences that are degenerate versions of each other and thus encode the same amino acid sequence.
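Degeneracy can be illustrated by enumerating the nucleotide sequences that encode a given peptide. The following is a minimal sketch, assuming the standard genetic code (the text above does not specify a codon table) and a hypothetical two-residue peptide; the function names are illustrative.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G codon order
# (assumption: no alternative codon table is intended).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence (DNA or RNA letters) into one-letter residues."""
    seq = seq.upper().replace("U", "T")
    if len(seq) % 3:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def degenerate_versions(protein: str):
    """Yield every DNA sequence that encodes the given amino acid sequence."""
    choices = [
        [codon for codon, aa in CODON_TABLE.items() if aa == residue]
        for residue in protein.upper()
    ]
    for combination in product(*choices):
        yield "".join(combination)


# Hypothetical peptide: methionine followed by lysine.
versions = sorted(degenerate_versions("MK"))
```

Methionine has one codon and lysine has two, so `versions` holds two sequences, `ATGAAA` and `ATGAAG`, and both translate back to `MK`.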
- the term “plurality” can refer to a state of having a plural (e.g., more than one) number of different types of things (e.g., a cell, a genomic sequence, a subject, a system, or a protein).
- a plurality of nucleic acid sequences can be more than one nucleic acid sequence wherein each nucleic acid sequence is different from each other.
- “plurality” can refer to a state of having a plural number of the same thing (e.g., a cell, a genomic sequence, a subject, a system, or a protein).
- a plurality of nucleic acid sequences are identical to each other.
- a plurality of cells are cellular clones (e.g., identical cells).
- the term “subject” is intended to include any mammal.
- the subject is a cat, a dog, a goat, a human, a non-human primate, a rodent (e.g., a mouse or a rat), a pig, or a sheep.
- transduced refers to a process by which exogenous nucleic acid is introduced or transferred into a cell.
- a “transduced,” “transfected,” or “transformed” mammalian cell is one that has been transduced, transfected or transformed with exogenous nucleic acid (e.g., a gene delivery vector) that includes an exogenous nucleic acid encoding an RNA-binding zinc finger domain.
- treating means a reduction in the number, frequency, severity, or duration of one or more (e.g., two, three, four, five, or six) symptoms of a disease or disorder in a subject (e.g., any of the subjects described herein), and/or results in a decrease in the development and/or worsening of one or more symptoms of a disease or disorder in a subject.
- RNA recognition complex can refer to a system that can recognize specific mRNA transcripts and modulate protein expression.
- an RNA recognition complex comprises an RNA-targeting agent and a coronavirus-derived protein.
- the RNA-targeting agent can be fused or tethered to the coronavirus-derived protein.
- RNA-targeting agent can refer to an agent that can target and bind to a specific sequence in DNA or RNA.
- an RNA-targeting agent comprises CRISPR/Cas9 components.
- CRISPR refers to a technique of sequence specific genetic manipulation relying on the clustered regularly interspaced short palindromic repeats pathway, which unlike RNA interference regulates gene expression at a transcriptional level.
- the RNA-targeting agent comprises a PUF protein.
- the RNA-targeting agent comprises a pentatricopeptide repeat (PPR) protein.
- the RNA-targeting agent comprises a protein that has an RNA binding domain.
- coronavirus-derived protein can refer to a SARS-CoV-2 protein, and/or any variant thereof.
- the coronavirus-derived protein includes an NSP1, an NSP2, an NSP3, an NSP6, an NSP12, an NSP14, an ORF3b, an ORF7b, or an ORF9c protein.
- the coronavirus-derived protein includes an NSP9 protein.
- the RNA recognition complex further comprises a nuclear export signal and a coronavirus translation activation protein.
- an RNA recognition complex modulates protein expression in a temporal manner.
- the RNA recognition complex can activate protein expression.
- the RNA recognition complex can upregulate protein expression.
- the RNA recognition complex can downregulate protein expression.
- an RNA-targeting agent is an RNA-guided target RNA-binding fusion protein.
- RNA-guided target RNA-binding fusion proteins comprise at least one RNA-binding polypeptide which corresponds to a gRNA which guides the RNA-binding polypeptide to target RNA.
- RNA-guided target RNA-binding fusion proteins include without limitation, RNA-binding polypeptides which are CRISPR/Cas-based RNA-binding polypeptides or portions thereof.
- the RNA-targeting agent comprises an RNA-targeting Cas effector.
- a “Cas effector” or “CRISPR-associated protein” can refer to an enzyme or protein that uses CRISPR sequences as a guide to recognize and cleave specific nucleic acid strands that are complementary to the CRISPR sequence.
- An RNA-targeting Cas effector can associate with a CRISPR RNA sequence to bind to and alter DNA or RNA target sequences.
- an RNA-targeting Cas effector can be a Cas9 endonuclease that makes a double-stranded break in a target DNA sequence.
- an RNA-targeting Cas effector can be a Cas12a nuclease that also makes a double-stranded break in a target DNA sequence.
- an RNA-targeting Cas effector can be a Cas13 nuclease which targets RNA.
- the RNA-targeting Cas effector comprises a Cas9 protein, a Cas13b protein, or a Cas13d protein.
- the RNA-targeting Cas effector comprises a nuclease dead Cas9 (dCas9) protein.
- the RNA-targeting Cas effector comprises a Cas13b protein.
- the RNA-targeting Cas effector comprises a Cas13d protein.
- the RNA-targeting agent further comprises a single guide RNA (sgRNA), wherein the sgRNA is targeted to an individual gene of a cell.
- the term “single guide RNA” or “sgRNA” is a specific type of gRNA that combines tracrRNA (transactivating RNA), which binds to Cas9 to activate the complex to create the necessary strand breaks, and crRNA (CRISPR RNA), comprising complementary nucleotides to the tracrRNA, into a single RNA construct. Exemplary methods of employing the CRISPR technique are described in WO 2017/091630, which is incorporated by reference in its entirety.
- the single guide RNA can recognize a target RNA, for example, by hybridizing to the target RNA.
- the single guide RNA comprises a sequence that is complementary to the target RNA.
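Because the guide hybridizes to a complementary stretch of the target RNA, candidate binding sites can be located by searching a transcript for the reverse complement of the guide's targeting sequence. The following is a minimal sketch that reports only perfect matches; the transcript and guide are hypothetical, and the function names are illustrative.

```python
from typing import List

# Translation table mapping each RNA base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("AUGC", "UACG")


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA sequence written 5'->3'."""
    return rna.upper().translate(COMPLEMENT)[::-1]


def find_target_sites(guide: str, transcript: str) -> List[int]:
    """Return 0-based start positions where the guide can pair perfectly."""
    site = reverse_complement(guide)
    transcript = transcript.upper()
    positions = []
    start = transcript.find(site)
    while start != -1:
        positions.append(start)
        # Resume one base later so overlapping sites are also reported.
        start = transcript.find(site, start + 1)
    return positions


# Hypothetical transcript containing one site for a hypothetical 20-nt guide.
transcript = "GGAUGGCUAGCUAGGCUAACGUCC"
guide = "ACGUUAGCCUAGCUAGCCAU"
sites = find_target_sites(guide, transcript)
```

In this example the guide's reverse complement occurs once, starting at position 2 of the transcript.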
- the sgRNA can include one or more modified nucleotides.
- the sgRNA has a length that is about 10 nt (e.g., about 20 nt, about 30 nt, about 40 nt, about 50 nt, about 60 nt, about 70 nt, about 80 nt, about 90 nt, about 100 nt, about 120 nt, about 140 nt, about 160 nt, about 180 nt, about 200 nt, about 300 nt, about 400 nt, about 500 nt, about 600 nt, about 700 nt, about 800 nt, about 900 nt, about 1000 nt, or about 2000 nt).
- the sgRNA can include a sequence from SEQ ID NOs: 1-7 (Table 1).
- a single guide RNA can recognize a variety of RNA targets.
- a target RNA can be messenger RNA (mRNA), ribosomal RNA (rRNA), signal recognition particle RNA (SRP RNA), transfer RNA (tRNA), small nuclear RNA (snRNA), small nucleolar RNA (snoRNA), antisense RNA (aRNA), long noncoding RNA (lncRNA), microRNA (miRNA), piwi-interacting RNA (piRNA), small interfering RNA (siRNA), short hairpin RNA (shRNA), retrotransposon RNA, viral genome RNA, or viral noncoding RNA.
- a target RNA can be an RNA involved in pathogenesis of conditions such as cancers, neurodegeneration, cutaneous conditions, endocrine conditions, intestinal diseases, infectious conditions, neurological conditions, liver diseases, heart disorders, or autoimmune diseases.
- a target RNA can be a therapeutic target for conditions such as cancers, neurodegeneration, cutaneous conditions, endocrine conditions, intestinal diseases, infectious conditions, neurological conditions, liver diseases, heart disorders, or autoimmune diseases.
- the sgRNA can be driven by a promoter.
- the promoter can be a U6 polymerase III promoter.
- an RNA-targeting agent is not an RNA-guided target RNA-binding fusion protein and as such comprises at least one RNA-binding polypeptide which is capable of binding a target RNA without a corresponding gRNA sequence.
- Such non-guided RNA-binding polypeptides include, without limitation, at least one RNA-binding protein or RNA-binding portion thereof which is a PUF (Pumilio and FBF homology family) protein. This type of RNA-binding polypeptide can be used in place of a gRNA-guided RNA binding protein such as CRISPR/Cas.
- the unique RNA recognition mode of PUF proteins (named for Drosophila Pumilio and C.
- the PUF domain of human Pumilio1, also known in the art, binds tightly to cognate RNA sequences, and its specificity can be modified. It contains eight PUF repeats that recognize eight consecutive RNA bases, with each repeat recognizing a single base. Since two amino acid side chains in each repeat recognize the Watson-Crick edge of the corresponding base and determine the specificity of that repeat, a PUF domain can be designed to specifically bind most 8-nt RNA sequences. Wang et al., Nat Methods. 2009; 6(11): 825-830. See WO2012/068627, which is incorporated by reference herein in its entirety, for additional disclosure regarding PUF proteins.
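- The one-repeat-per-base recognition described for the PUF domain can be sketched as a simple mapping. The antiparallel orientation used here (repeat R8 reading the 5' base and R1 the 3' base) is an assumption drawn from the general PUF literature rather than from this disclosure, and the function name and example sequence are hypothetical.

```python
def assign_puf_repeats(target: str) -> list[tuple[str, str]]:
    """Pair each base of an 8-nt RNA with the PUF repeat assumed to read it."""
    target = target.upper()
    if len(target) != 8 or set(target) - set("ACGU"):
        raise ValueError("expected an 8-nt RNA sequence over A, C, G, U")
    # Assumption: binding is antiparallel, so R8 contacts the 5' base
    # and R1 contacts the 3' base.
    return [(f"R{8 - i}", base) for i, base in enumerate(target)]


# Number of distinct 8-nt RNA sequences a designer could in principle target.
n_possible_targets = 4 ** 8

# Illustrative 8-nt sequence, used only as an example input.
for repeat, base in assign_puf_repeats("UGUAUAUA"):
    print(repeat, "->", base)
```

- Each repeat-to-base pair marks where the two specificity-determining side chains would be chosen; the sketch deliberately does not encode which residues select which base.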
- the fusion protein comprises at least one RNA-binding protein or RNA-binding portion thereof which is a PUMBY (Pumilio-based assembly) protein.
- RNA-binding protein PumHD (Pumilio homology domain), a member of the PUF family
- Pumby for Pumilio-based assembly
- these modules can be concatenated in chains of varying composition and length, to bind desired target RNAs.
- the RNA-targeting agent comprises a Pumilio and FBF (PUF) protein. In some embodiments, the RNA-targeting agent comprises a Pumilio-based assembly (PUMBY) protein.
- the RNA-binding protein or RNA-binding portion thereof is a PPR protein (a protein with pentatricopeptide repeat (PPR) motifs, derived from plants).
- PPR proteins are nuclear-encoded and act exclusively at the RNA level in organelles (chloroplasts and mitochondria), where they function gene-specifically in RNA cleavage, translation, splicing, RNA editing, and RNA stability.
- PPR proteins typically contain a motif of 35 amino acids and have a structure in which about 10 PPR motifs are arranged contiguously. The combination of PPR motifs can be used for sequence-selective binding to RNA.
- PPR proteins often comprise about 10 repeated PPR motifs.
- PPR domains or RNA-binding domains may be configured to be catalytically inactive. See WO 2013/058404, which is incorporated herein by reference in its entirety for additional disclosure regarding PPR proteins.
- Coronaviruses contain a positive-sense, single-stranded RNA genome, and the viral genome consists of more than 29,000 bases and encodes 29 proteins.
- SARS-CoV-2 has four structural proteins: the E and M proteins, which form the viral envelope; the N protein, which binds to the virus’s RNA genome; and the S protein, which binds to human receptors.
- coronavirus-derived protein can refer to a protein that is encoded from the coronavirus viral genome.
- the coronavirus-derived protein can be a non-structural protein (NSP).
- the non-structural protein can comprise a NSP1, a NSP2, a NSP3, a NSP4, a NSP5, a NSP6, a NSP7, a NSP8, a NSP9, a NSP10, a NSP12, a NSP13, a NSP14, a NSP15, or a NSP16 protein.
- the coronavirus-derived protein can be an accessory protein.
- the accessory protein can comprise a ORF3a, a ORF6, a ORF7a, a ORF7b, a ORF8, or a ORF10 protein.
- the coronavirus-derived protein can be a structural protein.
- the structural protein can comprise a spike (S) protein, a nucleocapsid (N) protein, a membrane (M) protein, or an envelope (E) protein.
- the coronavirus-derived protein comprises a NSP1, a NSP2, a NSP3, a NSP6, a NSP12, a NSP14, a ORF3b, a ORF7b, or a ORF9c protein.
- the coronavirus-derived protein comprises a NSP9 protein.
- the RNA recognition complex disclosed herein comprises a linker between the RNA-targeting agent and the coronavirus-derived protein.
- the linkers or linker motifs can be any flexible peptides that connect two protein domains or motifs without interfering with their functions.
- the linker is a peptide linker.
- the peptide linker comprises one or more repeats of the tri-peptide GGS. In other embodiments, the linker is a non-peptide linker.
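- A peptide linker built from repeats of the tri-peptide GGS can be written out programmatically. The helper names and the placeholder domain sequences below are hypothetical, and the repeat count is a free parameter because the text does not fix one.

```python
def build_ggs_linker(repeats: int) -> str:
    """Return a peptide linker made of `repeats` copies of Gly-Gly-Ser."""
    if repeats < 1:
        raise ValueError("at least one GGS repeat is required")
    return "GGS" * repeats


def join_domains(n_terminal: str, c_terminal: str, repeats: int) -> str:
    """Concatenate two protein domains with a GGS-repeat linker between them."""
    return n_terminal + build_ggs_linker(repeats) + c_terminal


# Placeholder sequences standing in for the RNA-targeting agent and the
# coronavirus-derived protein (not real sequences).
fusion = join_domains("MAAA", "KKKK", repeats=3)
print(fusion)
```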
- the non-peptide linker comprises polyethylene glycol (PEG), polypropylene glycol (PPG), co-poly (ethylene/propylene) glycol, polyoxyethylene (POE), polyurethane, polyphosphazene, polysaccharides, dextran, polyvinyl alcohol, polyvinylpyrrolidones, polyvinyl ethyl ether, polyacryl amide, polyacrylate, polycyanoacrylates, lipid polymers, chitins, hyaluronic acid, heparin, or an alkyl linker.
- nucleic acid sequences encoding the RNA recognition complexes disclosed herein for use in gene transfer and expression techniques described herein. It should be understood, although not always explicitly stated, that the sequences provided herein can be used to provide the expression product as well as substantially identical sequences that produce a protein that has the same biological properties. These “biologically equivalent” or “biologically active” or “equivalent” polypeptides are encoded by equivalent polynucleotides as described herein.
- polypeptides may possess at least 60%, or alternatively, at least 65%, or alternatively, at least 70%, or alternatively, at least 75%, or alternatively, at least 80%, or alternatively at least 85%, or alternatively at least 90%, or alternatively at least 95% or alternatively at least 98%, identical primary amino acid sequence to the reference polypeptide when compared using sequence identity methods run under default conditions.
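- The percent-identity thresholds listed above can be made concrete with a small calculation. This is a simplified sketch that assumes the two sequences have already been aligned to equal length (with "-" marking gaps); it is not a substitute for an alignment tool run under default conditions, and the sequences are made up.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns in which both sequences carry the same residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be pre-aligned to the same length")
    # Ignore columns that are gaps in both sequences.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no comparable columns")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float) -> bool:
    """True if identity is at least `threshold` percent (e.g. 60, 85, 98)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


reference = "MKTAYIAKQR"   # hypothetical reference polypeptide
candidate = "MKTAYLAKQR"   # hypothetical variant with one substitution
print(percent_identity(reference, candidate))
```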
- Specific polypeptide sequences are provided as examples of particular embodiments. Modifications to the amino acid sequences can include alternate amino acids that have similar charge.
- an equivalent polynucleotide is one that hybridizes under stringent conditions to the reference polynucleotide or its complement or in reference to a polypeptide, a polypeptide encoded by a polynucleotide that hybridizes to the reference encoding polynucleotide under stringent conditions or its complementary strand.
- an equivalent polypeptide or protein is one that is expressed from an equivalent polynucleotide.
- Codon optimization refers to the fact that different cells differ in their usage of particular codons.
- This codon bias corresponds to a bias in the relative abundance of particular tRNAs in the cell type.
- By altering the codons in the sequence to match with the relative abundance of corresponding tRNAs it is possible to increase expression. It is also possible to decrease expression by deliberately choosing codons for which the corresponding tRNAs are known to be rare in a particular cell type. Codon usage tables are known in the art for mammalian cells, as well as for a variety of other organisms. Based on the genetic code, nucleic acid sequences coding for, e.g., a Cas protein, can be generated.
- such a sequence is optimized for expression in a host or target cell, such as a host cell used to express the Cas protein or a cell in which the disclosed methods are practiced (such as in a mammalian cell, e.g., a human cell).
- Codon preferences and codon usage tables for a particular species can be used to engineer isolated nucleic acid molecules encoding a Cas protein (such as one encoding a protein having at least 80%, at least 85%, at least 90%, at least 92%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to its corresponding wild-type protein) that takes advantage of the codon usage preferences of that particular species.
- an isolated nucleic acid molecule encoding at least one Cas protein (which can be part of a vector) includes at least one Cas protein coding sequence that is codon optimized for expression in a eukaryotic cell, or at least one Cas protein coding sequence codon optimized for expression in a human cell.
- a codon optimized Cas coding sequence has at least 80%, at least 85%, at least 90%, at least 92%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to its corresponding wild-type or originating sequence.
- a eukaryotic cell codon optimized nucleic acid sequence encodes a Cas protein having at least 85%, at least 90%, at least 92%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to its corresponding wild-type or originating protein.
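- The codon-selection idea described above can be sketched with a toy lookup. The usage table below holds illustrative placeholder frequencies for a handful of amino acids only; it is not a real codon usage table for any species, and the function name is hypothetical.

```python
# Illustrative placeholder table: relative codon frequencies per amino acid.
# These numbers are assumptions for demonstration, not measured values.
TOY_CODON_USAGE = {
    "M": {"ATG": 1.0},
    "K": {"AAG": 0.6, "AAA": 0.4},
    "L": {"CTG": 0.5, "CTC": 0.3, "TTA": 0.2},
    "S": {"AGC": 0.5, "TCC": 0.3, "TCG": 0.2},
    "*": {"TGA": 0.5, "TAA": 0.3, "TAG": 0.2},
}


def codon_optimize(protein: str, usage=TOY_CODON_USAGE, prefer_rare: bool = False) -> str:
    """Back-translate `protein`, picking the most (or least) used codon per residue."""
    pick = min if prefer_rare else max
    # max/min over a dict iterates its codons; `key` ranks them by frequency.
    return "".join(pick(usage[aa], key=usage[aa].get) for aa in protein)


high_expression = codon_optimize("MKLS*")
low_expression = codon_optimize("MKLS*", prefer_rare=True)
print(high_expression, low_expression)
```

- Setting `prefer_rare=True` mirrors the option of deliberately choosing rare codons to decrease expression; both outputs encode the same protein.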
- a vector comprises a guide RNA of the disclosure. In some embodiments, the vector comprises at least one guide RNA of the disclosure. In some embodiments, the vector comprises one or more guide RNA(s) of the disclosure. In some embodiments, the vector comprises two or more guide RNAs of the disclosure. In some embodiments, the vector further comprises a nucleic acid corresponding to an RNA recognition complex of the disclosure. In some embodiments, the RNA recognition complex comprises a RNA targeting agent and a coronavirus-derived protein.
- a first vector comprises a guide RNA of the disclosure and a second vector comprises a RNA recognition complex of the disclosure.
- the first vector comprises at least one guide RNA of the disclosure.
- the first vector comprises one or more guide RNA(s) of the disclosure.
- the first vector comprises two or more guide RNA(s) of the disclosure.
- the RNA recognition complex comprises a RNA targeting agent and a coronavirus-derived protein.
- the first vector and the second vector are identical. In some embodiments, the first vector and the second vector are not identical.
- a vector of the disclosure is a viral vector.
- the viral vector includes a sequence isolated or derived from a retrovirus.
- the viral vector includes a sequence isolated or derived from a lentivirus.
- the viral vector includes a sequence isolated or derived from an adenovirus.
- the viral vector includes a sequence isolated or derived from an adeno-associated virus (AAV).
- the viral vector is replication incompetent.
- the viral vector is isolated or recombinant.
- the viral vector is self-complementary.
- the viral vector includes an inverted terminal repeat sequence or a capsid sequence that is isolated or derived from an AAV of serotype AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, AAV.rh32/33, AAV.rh43, AAV.rh64R1, and any combinations or equivalents thereof.
- the viral vector is replication incompetent.
- the viral vector is isolated or recombinant (rAAV).
- the viral vector is self-complementary (scAAV).
- the AAV vector has low toxicity.
- the AAV vector does not incorporate into the host genome, thereby having a low probability of causing insertional mutagenesis.
- the AAV vector can encode a range of total polynucleotides from 4.5 kb to 4.75 kb.
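- The stated size range can be used as a quick design check on a candidate cassette. In this sketch the upper bound of 4.75 kb is read as a packaging limit, which is an interpretation of the sentence above, and every element length in the example cassette is a hypothetical placeholder.

```python
AAV_MAX_NT = 4750  # upper end of the 4.5 kb to 4.75 kb range stated in the text


def total_length(elements: dict[str, int]) -> int:
    """Sum the lengths (in nucleotides) of all cassette elements."""
    return sum(elements.values())


def fits_aav(elements: dict[str, int], limit: int = AAV_MAX_NT) -> bool:
    """True if the whole cassette is no longer than the assumed limit."""
    return total_length(elements) <= limit


# Hypothetical cassette; lengths are placeholders, not measured values.
hypothetical_cassette = {
    "5' ITR": 145,
    "promoter": 600,
    "coding sequence": 3300,
    "polyA signal": 200,
    "3' ITR": 145,
}
print(total_length(hypothetical_cassette), fits_aav(hypothetical_cassette))
```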
- a vector of the disclosure is a non-viral vector.
- the vector comprises or consists of a nanoparticle, a micelle, a liposome or lipoplex, a polymersome, a polyplex or a dendrimer.
- the vector is an expression vector or recombinant expression system.
- the term “recombinant expression system” refers to a genetic construct for the expression of certain genetic material formed by recombination.
- an expression vector, viral vector or non-viral vector provided herein includes without limitation, an expression control element.
- An “expression control element” as used herein refers to any sequence that regulates the expression of a coding sequence, such as a gene.
- Exemplary expression control elements include but are not limited to promoters, enhancers, microRNAs, post-transcriptional regulatory elements, polyadenylation signal sequences, and introns. Expression control elements may be constitutive, inducible, repressible, or tissue-specific, for example.
- a “promoter” is a control sequence that is a region of a polynucleotide sequence at which initiation and rate of transcription are controlled.
- Non-limiting exemplary promoters include CMV, CBA, CAG, Cbh, EF-1a, PGK, UBC, GUSB, UCOE, hAAT, TBG, Desmin, MCK, C5-12, NSE, Synapsin, PDGF, MecP2, CaMKII, mGluR2, NFL, NFH, hb2, PPE, ENK, EAAT2, GFAP, MBP, and U6 promoters.
- An “enhancer” is a region of DNA that can be bound by activating proteins to increase the likelihood or frequency of transcription.
- Non-limiting exemplary enhancers and posttranscriptional regulatory elements include the CMV enhancer and WPRE.
- the vector is a viral vector.
- the vector is an adenoviral vector, an adeno-associated viral (AAV) vector, or a lentiviral vector.
- the vector is a retroviral vector, an adenoviral/retroviral chimera vector, a herpes simplex viral I or II vector, a parvoviral vector, a reticuloendotheliosis viral vector, a polioviral vector, a papillomaviral vector, a vaccinia viral vector, or any hybrid or chimeric vector incorporating favorable aspects of two or more viral vectors.
- the vector further comprises one or more expression control elements operably linked to the polynucleotide. In some embodiments, the vector further comprises one or more selectable markers.
- the lentiviral vector is an integrase-competent lentiviral vector (ICLV). In some embodiments, the lentiviral vector can refer to the transgene plasmid vector as well as the transgene plasmid vector in conjunction with related plasmids (e.g., a packaging plasmid, a rev expressing plasmid, an envelope plasmid) as well as a lentiviral-based particle capable of introducing exogenous nucleic acid into a cell through a viral or viral-like entry mechanism.
- Lentiviral vectors are well-known in the art (see, e.g., Trono D. (2002) Lentiviral vectors, New York: Springer-Verlag Berlin Heidelberg and Durand et al. (2011) Viruses 3(2): 132-159 doi: 10.3390/v3020132).
- exemplary lentiviral vectors that may be used in any of the herein described compositions, systems, methods, and kits can include a human immunodeficiency virus (HIV) 1 vector, a modified human immunodeficiency virus (HIV) 1 vector, a human immunodeficiency virus (HIV) 2 vector, a modified human immunodeficiency virus (HIV) 2 vector, a sooty mangabey simian immunodeficiency virus (SIVsM) vector, a modified sooty mangabey simian immunodeficiency virus (SIVsM) vector, an African green monkey simian immunodeficiency virus (SIVAGm) vector, a modified African green monkey simian immunodeficiency virus (SIVAGm) vector, an equine infectious anemia virus (EIAV) vector, a modified equine infectious anemia virus (EIAV) vector, a feline immunodeficiency virus (FIV) vector, a
- compositions and formulations including vectors delivering an RNA recognition complex including an RNA-targeting agent and a coronavirus-derived protein.
- the compositions are formulated with a pharmaceutically acceptable carrier.
- the pharmaceutical compositions and formulations can be administered parenterally, topically, orally or by local administration, such as by aerosol or transdermally.
- the pharmaceutical compositions can be formulated in any way and can be administered in a variety of unit dosage forms depending upon the condition or disease and the degree of illness, the general medical condition of each patient, the resulting preferred method of administration and the like. Details on techniques for formulation and administration of pharmaceuticals are well described in the scientific and patent literature, see, e.g., Remington: The Science and Practice of Pharmacy, 21st ed., 2005.
- the RNA recognition complex can be administered alone or as a component of a pharmaceutical formulation (composition).
- the compounds may be formulated for administration, in any convenient way for use in human or veterinary medicine.
- the compositions may conveniently be presented in unit dosage form and may be prepared by any methods well known in the art of pharmacy.
- the amount of active ingredient which can be combined with a carrier material to produce a single dosage form can vary depending upon the host being treated and the particular mode of administration.
- the amount of active ingredient which can be combined with a carrier material to produce a single dosage form will generally be that amount of the compound which produces a therapeutic effect.
- compositions described herein can be prepared according to any method known to the art for the manufacture of pharmaceuticals. Such compositions can contain, for example, preserving agents.
- a composition can be admixed with nontoxic pharmaceutically acceptable excipients which are suitable for manufacture.
- Compositions may comprise one or more diluents, emulsifiers, preservatives, buffers, excipients, etc. and may be provided in such forms as liquids, powders, emulsions, lyophilized powders, controlled release formulations, on patches, in implants, etc.
- wetting agents such as sodium lauryl sulfate and magnesium stearate, as well as coloring agents, release agents, coating agents, sweetening, flavoring and perfuming agents, preservatives and antioxidants can also be present in the compositions.
- Aqueous suspensions can contain an active agent (e.g., nucleic acid sequences of the invention) in admixture with excipients suitable for the manufacture of aqueous suspensions, e.g., for aqueous intradermal injections.
- Such excipients include a suspending agent, such as sodium carboxymethylcellulose, methylcellulose, hydroxypropylmethylcellulose, sodium alginate, polyvinylpyrrolidone, gum tragacanth and gum acacia, and dispersing or wetting agents such as a naturally occurring phosphatide (e.g., lecithin), a condensation product of an alkylene oxide with a fatty acid (e.g., polyoxyethylene stearate), a condensation product of ethylene oxide with a long chain aliphatic alcohol (e.g., heptadecaethylene oxycetanol), a condensation product of ethylene oxide with a partial ester derived from a fatty acid and a hexitol (e.g., polyoxyethylene sorbitol mono-oleate), or a condensation product of ethylene oxide with a partial ester derived from fatty acid and a hexitol anhydride (e.g., polyoxyethylene sorbitan mono
- the aqueous suspension can also contain one or more preservatives such as ethyl or n-propyl p-hydroxybenzoate, one or more coloring agents, one or more flavoring agents and one or more sweetening agents, such as sucrose, aspartame or saccharin.
- Formulations can be adjusted for osmolarity.
- oil-based pharmaceuticals are used for administration of nucleic acid sequences as described herein.
- for an injectable oil vehicle, see Minto (1997) J. Pharmacol. Exp. Ther. 281:93-102.
- compositions can also be in the form of oil-in-water emulsions.
- the oily phase can be a vegetable oil or a mineral oil, described above, or a mixture of these.
- Suitable emulsifying agents include naturally-occurring gums, such as gum acacia and gum tragacanth, naturally occurring phosphatides, such as soybean lecithin, esters or partial esters derived from fatty acids and hexitol anhydrides, such as sorbitan mono-oleate, and condensation products of these partial esters with ethylene oxide, such as polyoxyethylene sorbitan mono-oleate.
- the emulsion can also contain sweetening agents and flavoring agents, as in the formulation of syrups and elixirs.
- Such formulations can also contain a demulcent, a preservative, or a coloring agent.
- these injectable oil-in-water emulsions of the invention comprise a paraffin oil, a sorbitan monooleate, an ethoxylated sorbitan monooleate and/or an ethoxylated sorbitan trioleate.
- the pharmaceutical compositions can also be delivered as microspheres for slow release in the body.
- microspheres can be administered via intradermal injection of a drug that is slowly released subcutaneously, see Rao (1995) J. Biomater Sci. Polym. Ed. 7:623-645; as biodegradable and injectable gel formulations, see, e.g., Gao (1995) Pharm. Res. 12:857-863; or as microspheres for oral administration, see, e.g., Eyles (1997) J. Pharm. Pharmacol. 49:669-674.
- the pharmaceutical compositions can be parenterally administered, such as by intravenous (IV) administration or administration into a body cavity or lumen of an organ.
- These formulations can comprise a solution of active agent dissolved in a pharmaceutically acceptable carrier.
- Acceptable vehicles and solvents that can be employed are water and Ringer's solution, an isotonic sodium chloride solution.
- sterile fixed oils can be employed as a solvent or suspending medium.
- any bland fixed oil can be employed including synthetic mono- or diglycerides.
- fatty acids such as oleic acid can likewise be used in the preparation of injectables. These solutions are sterile and generally free of undesirable matter.
- These formulations may be sterilized by conventional, well known sterilization techniques.
- the formulations may contain pharmaceutically acceptable auxiliary substances as required to approximate physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents, e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, sodium lactate and the like.
- concentration of active agent in these formulations can vary widely, and will be selected primarily based on fluid volumes, viscosities, body weight, and the like, in accordance with the particular mode of administration selected and the patient's needs.
- the formulation can be a sterile injectable preparation, such as a sterile injectable aqueous or oleaginous suspension. This suspension can be formulated using those suitable dispersing or wetting agents and suspending agents.
- the sterile injectable preparation can also be a suspension in a nontoxic parenterally-acceptable diluent or solvent, such as a solution of 1,3-butanediol.
- the administration can be by bolus or continuous infusion (e.g., substantially uninterrupted introduction into a blood vessel for a specified period of time).
- the pharmaceutical compounds and formulations can be lyophilized.
- Stable lyophilized formulations comprising an inhibitory nucleic acid can be made by lyophilizing a solution comprising a pharmaceutical of the invention and a bulking agent, e.g., mannitol, trehalose, raffinose, and sucrose or mixtures thereof.
- a process for preparing a stable lyophilized formulation can include lyophilizing a solution of about 2.5 mg/mL protein, about 15 mg/mL sucrose, about 19 mg/mL NaCl, and a sodium citrate buffer having a pH greater than 5.5 but less than 6.5. See, e.g., U.S. 20040028670.
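- The concentrations given for the pre-lyophilization solution translate directly into component masses for a chosen batch volume. The sketch below treats the "about" values as exact for the arithmetic, and the batch volume and function names are hypothetical.

```python
# Nominal concentrations from the text, in mg per mL of solution.
CONCENTRATIONS_MG_PER_ML = {"protein": 2.5, "sucrose": 15.0, "NaCl": 19.0}


def masses_for_volume(volume_ml: float) -> dict[str, float]:
    """Mass of each component (mg) needed for `volume_ml` of solution."""
    if volume_ml <= 0:
        raise ValueError("volume must be positive")
    return {name: conc * volume_ml for name, conc in CONCENTRATIONS_MG_PER_ML.items()}


def ph_in_range(ph: float) -> bool:
    """True if the buffer pH is greater than 5.5 but less than 6.5."""
    return 5.5 < ph < 6.5


batch = masses_for_volume(10.0)  # hypothetical 10 mL batch
print(batch, ph_in_range(6.0))
```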
- compositions and formulations can be delivered by the use of liposomes.
- by using liposomes, particularly where the liposome surface carries ligands specific for target cells or the liposomes are otherwise preferentially directed to a specific organ, one can focus the delivery of the active agent into target cells in vivo. See, e.g., U.S. Patent Nos. 6,063,400; 6,007,839; Al-Muhammed (1996) J. Microencapsul. 13:293-306; Chonn (1995) Curr. Opin. Biotechnol. 6:698-708; Ostro (1989) Am. J. Hosp. Pharm. 46:1576-1587.
- liposome means a vesicle composed of amphiphilic lipids arranged in a bilayer or bilayers. Liposomes are unilamellar or multilamellar vesicles that have a membrane formed from a lipophilic material and an aqueous interior that contains the composition to be delivered. Cationic liposomes are positively charged liposomes that are believed to interact with negatively charged DNA molecules to form a stable complex. Liposomes that are pH-sensitive or negatively-charged are believed to entrap DNA rather than complex with it. Both cationic and noncationic liposomes have been used to deliver DNA to cells.
- Liposomes can also include “sterically stabilized” liposomes, i.e., liposomes comprising one or more specialized lipids. When incorporated into liposomes, these specialized lipids result in liposomes with enhanced circulation lifetimes relative to liposomes lacking such specialized lipids.
- sterically stabilized liposomes are those in which part of the vesicle-forming lipid portion of the liposome comprises one or more glycolipids or is derivatized with one or more hydrophilic polymers, such as a polyethylene glycol (PEG) moiety.
- compositions disclosed herein can be administered for prophylactic and/or therapeutic treatments.
- compositions are administered to a subject who is infected or at risk of infection with SARS-CoV-2, in an amount sufficient to cure, alleviate or partially arrest the clinical manifestations of the disorder or its complications; this can be called a therapeutically effective amount.
- pharmaceutical compositions of the invention are administered in an amount sufficient to decrease the number of lung cells infected with SARS-CoV-2.
- inhibitory nucleic acids used to practice the methods described herein can be isolated from a variety of sources, genetically engineered, amplified, and/or expressed/ generated recombinantly.
- Recombinant nucleic acid sequences can be individually isolated or cloned and tested for a desired activity. Any recombinant expression system can be used, including e.g. in vitro, bacterial, fungal, mammalian, yeast, insect, or plant cell expression systems.
- Modulating gene expression of a target RNA
- a method of upregulating gene expression of a target RNA can include delivering a RNA recognition complex into a cell, wherein the RNA recognition complex comprises a RNA-targeting agent, and a coronavirus-derived protein, and wherein the RNA recognition complex binds to the target RNA and upregulates gene expression of the target RNA in the cell.
- a method of modulating gene expression of a target RNA can include delivering a RNA recognition complex into a cell, wherein the RNA recognition complex comprises a RNA-targeting agent, and a coronavirus-derived protein, and wherein the RNA recognition complex binds to the target RNA and modulates gene expression of the target RNA in the cell.
- the RNA recognition complex is present in a delivery system.
- the delivery system comprises a delivery vehicle selected from the group consisting of an adeno-associated virus, a nanoparticle, and a liposome.
- the RNA recognition complex can be introduced into any cell, e.g., a mammalian cell.
- examples of a mammalian cell include: a human cell, a rodent cell (e.g., a rat cell or a mouse cell), a rabbit cell, a dog cell, a cat cell, a porcine cell, or a non-human primate cell.
- the RNA recognition complex can be delivered into the cytoplasm of a cell.
- the RNA recognition complex can be delivered into the cell by chemical transfection, non-chemical transfection, particle-based transfection, or viral transfection.
- the RNA recognition complex can be delivered with a transfection reagent.
- the transfection reagent can be lipofectamine.
- the transfection reagent can be FuGENE transfection reagent.
- the method further includes profiling the gene expression of the target RNA in the cell, wherein the gene expression is upregulated.
- targeting a target RNA through an RNA-targeting agent’s association with a coronavirus-derived protein drives upregulation of the target RNA within a cell.
- the coronavirus-derived protein comprises a NSP1, a NSP2, a NSP3, a NSP6, a NSP12, a NSP14, a ORF3b, a ORF7b, or a ORF9c protein.
- the method further includes profiling the gene expression of the target RNA in the cell, wherein the gene expression is downregulated.
- targeting a target RNA through an RNA-targeting agent’s association with a coronavirus-derived protein drives downregulation of the target RNA within a cell.
- the coronavirus-derived protein comprises a NSP9 protein.
- profiling can refer to the measurement of activity (e.g., expression) of one or more genes, to create a global picture of cellular function.
- profiling includes sequencing of a nucleic acid (e.g., DNA or RNA), wherein the gene expression profile includes information of active translation at a point in time.
- the profiling comprises transcriptome analysis or gene expression analysis.
- the profiling comprises enhanced cross-linking immunoprecipitation (eCLIP).
- eCLIP can be modified and used to profile RNAs bound by specific ribosomal subunit proteins.
- enhanced crosslinking and immunoprecipitation (eCLIP) recovers protein-coding mRNAs (with a particular enrichment for coding sequence regions).
- immunoprecipitation is the technique of precipitating a protein antigen out of solution using an antibody that specifically binds to that particular protein.
- the solution containing the protein antigen is in the form of a crude lysate of an animal tissue.
- Immunoprecipitation can be used to isolate and concentrate a particular protein from a sample containing many different proteins. Also, this technique requires that the antibody be coupled to a solid substrate (e.g., immunoprecipitation beads) while performing the procedure.
- eCLIP is a method to profile RNAs bound by an RNA binding protein of interest.
- a method of treating a disease of reduced gene expression in a subject in need thereof can include administering a RNA recognition complex to the subject, wherein the RNA recognition complex comprises a RNA-targeting agent, and a coronavirus-derived protein, and wherein the RNA recognition complex binds to the target RNA and upregulates gene expression of the target RNA in the cell.
- Example 1 - eCLIP elucidates SARS-CoV-2 protein-RNA interactions in virus infected cells
- RNA interactome profiling of SARS-CoV-2 proteins was performed on SARS-CoV-2 infected African Green Monkey kidney (Vero E6) cells (Fig. 1a). Cells were infected at a multiplicity of infection (MOI) of 0.01 for 48 hours before UV irradiation of the cells, which covalently crosslinks interacting proteins to RNAs. This was followed by immunoprecipitation of the NSP8, NSP12 (also known as the RNA-dependent RNA polymerase) and N (nucleocapsid) proteins using protein-specific antibodies to isolate the bound RNA.
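- The MOI of 0.01 can be related to inoculum size and to the fraction of cells initially infected. This sketch assumes the usual definition of MOI as infectious units per cell and a Poisson model of infection, neither of which is spelled out in this example; the cell count is a hypothetical value.

```python
import math


def infectious_units_needed(n_cells: float, moi: float) -> float:
    """Infectious units required to infect `n_cells` at the given MOI."""
    return n_cells * moi


def expected_fraction_infected(moi: float) -> float:
    """Fraction of cells receiving at least one infectious unit (Poisson model)."""
    return 1.0 - math.exp(-moi)


hypothetical_cells = 1_000_000  # placeholder number of plated cells
units = infectious_units_needed(hypothetical_cells, moi=0.01)
fraction = expected_fraction_infected(0.01)
print(units, fraction)
```

- At a low MOI, only about 1% of cells are expected to be infected in the first round, so the 48-hour incubation allows the infection to spread before crosslinking.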
- MOI multiplicity of infection
- RNA-bound proteins were resolved via SDS-PAGE and transferred to nitrocellulose membranes such that only the region spanning the expected protein size and 75 kDa larger were excised and purified in subsequent steps.
- the same size region of a non-immunoprecipitated input whole cell lysate was included as size-matched input to identify enriched sequences.
- RNA was converted to libraries and sequenced to an average depth of ~25 million reads, and mapped to the SARS-CoV-2 viral genome and African Green Monkey genome to determine SARS-CoV-2 protein RNA interactions.
- Targeted transcripts were determined by having one or more peaks that meet the stringent IDR (irreproducible discovery rate) threshold of overlapping peaks between two replicates for every protein, and satisfy statistical cutoffs of p < 0.001, and more than 8-fold enrichment in the immunoprecipitated sample (IP) over the size-matched input sample.
- IDR irreproducible discovery rate
- the eCLIP results provide the first viral RNA genome map of interactions with NSP8, NSP12 and N proteins.
- NSP12 enrichment was seen on the negative strand at all transcription-regulatory sequences (TRSs) of the viral genome, implying that it may play a role in the transcription of subgenomic RNAs, which results in the expression of accessory protein products.
- TRSs transcription-regulatory sequences
- the eCLIP read density showed a sharp drop in reads at position 7481 on both strands, which may correspond to reverse transcription termination during eCLIP library preparation at a UV crosslinking site (Fig. 1f).
- the sequence at position 7470-7510 forms a stable hairpin from RNA secondary structure prediction.
- the high read density in the hairpin region suggests a potential stalling of NSP12 polymerase elongation, which may result in aborted transcripts.
- RNA-seq of A549-ACE2 cells infected with SARS-CoV-2 was performed and a steep decrease in transcript read density at the site of NSP12 eCLIP peak was observed (Fig. 1g).
- Aborted transcripts were also confirmed in a direct RNA-sequencing study using the Oxford Nanopore platform (ref. 18). Furthermore, some of these aborted transcripts join up with the downstream sequences, forming deletion products.
- Polymerase stalling may play a role in generating genetic diversity of viruses via recombination, which has been shown to contribute to the evolution of SARS-CoV-2.
- a multiple sequence alignment and phylogenetic analysis of the reference sequences of the complete genomes of betacoronaviruses from NCBI and the complete genomes of bat and pangolin coronaviruses from GISAID was performed (Fig. 1h).
- the multiple sequence analysis shows the peak region sequence to be highly conserved among the analyzed betacoronavirus sequences.
- the hairpin structure also appears conserved among bat and pangolin sequences.
- Recombination breakpoints are predicted from this sequence alignment, using a pairwise scanning approach that identifies regions with greater similarity among phylogenetically distant sequences. The prediction found a likely breakpoint ~250 nt downstream of the peak in region 7450-7550. This breakpoint was predicted to be a recombination event between SARS-CoV-2 and the Tylonycteris bat coronavirus HKU4 (Fig. 1i). While there are several other breakpoints that did not coincide with NSP12 eCLIP peaks, the presence of the NSP12 eCLIP peak in the 7470-7510 region proximal to a potential recombination site suggests a possible contribution to recombination in ancestral sequences of SARS-CoV-2.
- RNA secondary structure appears conserved in the region containing the 7470 - 7510 peak among the closely related pangolin and bat betacoronaviruses, suggesting a potential function associated with NSP12 binding to this region that may be important for virus replication.
- eCLIP was performed on the 29 proteins encoded in the SARS-CoV-2 genome and one mutant (Fig. 2a). Due to the lack of antibodies specific for most of the viral proteins, the individual proteins were overexpressed in a lung epithelial cell line BEAS-2B, which is an immortalized primary bronchial cell line representative of normal lung physiology. Each protein was either fused with a 2xStrep tag and expressed stably via lentiviral transduction or fused with a 3xFLAG tag and expressed transiently via transfection. Following UV crosslinking, the tagged proteins were immunoprecipitated using anti-FLAG or anti-Strep antibodies.
- SARS-CoV-2 proteins interacted with RNA represented by 4,821 coding genes, which is about a third of the transcriptome of BEAS-2B cells.
- Nucleocapsid and non-structural proteins NSP2, NSP3, NSP5, NSP9 and NSP12 were found to target the greatest number of unique genes at 1339, 1647, 1199, 902, 863, and 865, respectively (Fig. 2b).
- the large number of genes targeted by the viral proteins is consistent with the non-structural proteins from the replicase (ORF1ab) having a high affinity for their own RNA, though their potential for widespread interaction with host RNA has not been shown previously.
- target genes (400/518) in the NSP12 eCLIP in virus infected Vero E6 cells are represented in the eCLIP assay from exogenous expression in the BEAS-2B cells (Fig. 2c). Only transcripts that are expressed at a TPM of >1.0 in both cell lines are used in this comparison, and target genes are considered if bound by one or more peaks that satisfy statistical cutoffs of -log10(p-value) > 3, and more than 4-fold enrichment over size-matched input. This suggests that NSP12 bound genes are similar in the context of the virus infected cells and in the context of NSP12 expressed in isolation.
- GO gene ontology
- Distinct processes related to viral replication and host response are targeted by the viral proteins as shown by gene ontology (GO) analysis (Fig. 2d). Many of the enriched GO terms are related to nucleic acid and protein synthesis, modification and transport, which is consistent with the primary objective of the virus hijacking host resources for its own biosynthesis and replication.
- a few stress response processes are enriched, including response to heat, as targeted by ORF7b.
- Immune response processes are also enriched, including neutrophil mediated immunity targeted by NSP12 and platelet degranulation targeted by ORF9c. This supports the choice of lung epithelial cells as a model system that expresses the relevant cytokines for recruiting immune cells.
- ciliary basal body plasma membrane docking genes are enriched, which may be related to ciliated lung cells as the site of viral entry. While the enriched GO terms are highly relevant to viral and host response processes, further analysis of binding patterns is needed to determine if there are any functional implications of viral proteins interacting with these genes.
- sequence logos were generated from 6-mers of the bound RNA reads. While some of the proteins display strong sequence preferences (Fig. 2d) other proteins appear to bind more non-specifically. Some motifs resemble enrichments observed for human RBPs, where M, ORF7a and NSP10 appear to favor G-rich or GU rich motifs, and NSP5 has a motif (GNAUG). Other motifs may result from regional binding preferences (Fig.
- NSP2 and NSP9 have a strong preference for UC-rich polypyrimidine motifs (p-values of 10^-96 and 10^-41 respectively), which may be a result of their binding to polypyrimidine tracts in intronic regions, whereas N has an AU-rich motif likely because it preferentially binds to 3' UTRs, which contain AU-rich elements.
- NSP12 primarily binds in the 5' UTR, and a weakly enriched GUCCCG motif that resembles terminal oligopyrimidine (TOP) motifs hints at a possible role in translation regulation.
- TOP terminal oligopyrimidine
- SARS-CoV-2 protein-host RNA interactions demonstrates that a majority of SARS-CoV-2 viral proteins are RNA binding proteins that target a third of the human transcriptome. The analysis implies that these viral proteins may be involved in perturbing many essential cellular processes of the host.
- SARS-CoV-2 protein specific antibodies enabled confirming the large number of interactions between viral proteins NSP12 and NSP8 and host RNAs in the context of the intact and live virus. As eCLIP in virus infected cells is limited by the availability of IP-grade antibodies, focus was placed on the data obtained from the exogenous expression of individual proteins in BEAS-2B cells for systematic analysis of potential functional implications.
- Example 3 Select SARS-CoV-2 proteins upregulate protein expression of target transcripts
- SARS-CoV-2 proteins are enriched at distinct regions of target mRNAs, which imply different regulatory functions because of the protein-RNA interaction. Aggregating the analysis of all targeted peaks for each SARS-CoV-2 protein identifies RNA regions that are preferentially bound (Fig. 3a).
- NSP12, ORF3b, ORF7b and ORF9c show the highest proportion of peaks in the 5' UTR
- NSP2, NSP3, NSP6 and NSP14 show the highest proportion of peaks in the coding region (CDS)
- NSP5, NSP7 and NSP9 display a high proportion of peaks in intronic regions
- N and NSP15 show the largest proportion of peaks in the 3' UTR.
- A finer-grained metagene analysis of read density across all target mRNA transcripts was also performed, where each of the 5' UTR, CDS and 3' UTR regions in an mRNA are scaled to standardized lengths (Fig. 3b).
- Although NSP2 has a similar number and proportion of peaks in the CDS as NSP3, it mainly targets the region spanning the 5' UTR and coding start.
- NSP3 reads, along with those of NSP6 and NSP14, coat the entire CDS, with a slight bias towards the start of the coding sequence.
- the individual proteins were fused with an MS2 phage coat protein (MCP), which localizes the tagged protein to MS2 aptamer hairpins inserted in the 3' UTR of Renilla luciferase.
- MCP MS2 phage coat protein
- a firefly luciferase without MS2 hairpins is included as a control for non-specific effects of the viral protein.
- Plasmids encoding the MCP-tagged proteins and reporter constructs are co-transfected into HEK293T cells. Changes in Renilla luciferase activity normalized to firefly luciferase activity measure up- or downregulation of protein expression via either translation or mRNA stability because of positioning the MCP-tagged protein in the vicinity of the Renilla mRNA. The luciferase readout does not by itself distinguish between translational and mRNA stabilizing effects.
- Although NSP1 was found to bind to very few host mRNAs and its peaks are not mapped to the 5' UTR and CDS, the results for NSP1 are consistent with its ability to enhance the transcription and translation of its own mRNA via interacting with the 5' UTR of the genomic viral mRNA.
- NSP5, NSP16 and N display slight (but not significant) down-regulation effects (0.73-fold to 0.58-fold) compared to the FLAG peptide control, but to a lesser extent than that of the known translation repressor CNOT7 (0.16-fold).
- NSP7 and NSP9 appear to have no effect on the targeted expression of the Renilla reporter.
- RT-qPCR was performed to measure the ratio of Renilla-MS2 to Firefly mRNAs.
- the Renilla-MS2/Firefly mRNA ratio is significantly increased (p < 0.05) compared to wildtype, albeit to different extents for different proteins (Fig. 3e).
- ORF9c shows the greatest enhancing effect (3.5-fold) in the dual luciferase assay, but its effect on the reporter RNAs is middling (1.5-fold).
- ORF9c displays the greatest extent of upregulation at the protein level compared to RNA (2.3-fold) (Fig. 3f), followed by NSP2 and ORF3b (1.6 and 1.7 fold respectively).
- the rest of the proteins range from 1.1-fold (NSP6) to 1.5-fold (NSP14), compared to 1.0-fold of BOLL, suggesting that upregulation likely occurs at both the RNA and protein level.
- eCLIP reads were mapped to the 18S and 28S ribosomal subunits to determine if there are any specific interactions with the ribosome.
- Fold enrichment was determined directly from comparing read coverage in IP to size-matched input. It was found that enrichment peaks (>5-fold) of NSP1 reads are mostly mapped to the mRNA entry channel of the 40S ribosome corresponding to helix 16 (peak 2) and 18 (peak 3) of 18S rRNA, which is consistent with several cryo-EM structures showing that NSP1 blocks the mRNA entry channel to inhibit host translation (Fig. 3g).
- NSP1 hepatitis C viral internal ribosome entry site
- ORF9c shows enrichment at both 28S and 18S rRNA.
- One of the major enriched regions of ORF9c on 28S rRNA is above the surface of 60S ribosome. This region consists of two ORF9c binding peaks (28S peak 1 and 2) that correspond to two helices, which are connected by their interactions with RPL4 and interact with RPL27a and RPL7 respectively.
- RPL4 has been shown to interact with RPL7 and further protrude into the core of 60S ribosome and associate with the peptide exit tunnel.
- ORF9c binding to the ribosome is at the intersubunit interface which comprises a helix H63/ES27 (28S peak 3) of 28S rRNA, and two helices, helix 10 (18S peak 2) and 44 (18S peak 5), of 18S rRNA. These helices interact with RPL19, RPL24, RPS6, and RPS8, and have been shown to contribute to establishing eukaryote-specific intersubunit bridges.
- the interactions of ORF9c at the above two regions suggest that ORF9c may play a role in joining two ribosomal subunits to optimize ribosome function.
- ORF9c binding region is around the mRNA entry channel of 18S rRNA corresponding to helix 16 (18S peak 3), and two nearby helices, helix 1 (18S peak 1), and helix 26/26a (18S peak 4). Due to the relatively small size of ORF9c, its binding at helix 16 suggests it may play a role in regulating translation initiation by altering the position of helix 16.
- the metagene density plot for ORF9c shows binding mainly in the 5' UTR of target mRNAs. By stabilizing the ribosomal complex, ORF9c may enhance translation efficiency of its target mRNAs at the start of translation.
- ORF9c may be involved in optimizing ribosome structure and regulating translation initiation.
- ORF9c was fused to RNA-targeting Cas9 (RCas9) and its effect on mRNA translation of a reporter substrate was assessed. It was previously shown that regional binding preferences were not captured by the MS2-tethering assay, as human RBPs that bind to all three regions were found to regulate the expression of the targeted reporter, which was brought into proximity. Using 7 guide RNAs that tiled across the mRNA encoding yellow fluorescent protein (YFP) (Table 1), it was found that RCas9-ORF9c fusions upregulated the expression of YFP mRNA when targeted to its 5' UTR.
- YFP yellow fluorescent protein
- Example 4 NSP12 upregulates genes in mitochondria and N-linked glycosylation processes
- SARS-CoV-2 proteins that bind to the 5' UTR and CDS of their target genes upregulate gene expression.
- eCLIP target genes were mapped to existing proteomics datasets from SARS-CoV-2 infected cells and it was found that of the differentially expressed proteins (p < 0.05, 24 hours post infection), proteins that are eCLIP targets with IDR reproducible peaks are expressed at higher levels than the non-targeted genes (p < 10^-12 by Kolmogorov-Smirnov (KS) test) (Fig. 4a).
- NSP12 targeted genes also appear to be less downregulated (p < 10^-4, KS test) due to SARS-CoV-2 infection, with genes bound by more significant peaks showing a greater difference (p < 10^-5) (Fig. 4a).
- transcriptomics data from SARS-CoV-2 infected cells.
- eCLIP target genes show decreased RNA abundance (p < 10^-8), with NSP12 targeted genes appearing even more downregulated (p < 10^-27). This may need to be understood in the complex context of regulation and counter regulation in viral-host relationships. There may be certain processes that are downregulated due to global transcription shutdown, but post transcriptional upregulation as exerted by NSP12 may upregulate specific genes to the advantage of the virus.
- the GO processes enriched by the genes targeted by NSP12 include those related to neutrophil mediated immunity, mitochondrial processes (transport, translation elongation, ATP synthesis coupled electron transport), protein N-linked glycosylation and other cellular protein metabolic process (Fig. 4c, d).
- NSP12 targeted mitochondrial transport genes are the most significantly upregulated (p < 0.03, KS test) compared to non-eCLIP target genes (Fig. 4e).
- genes from the top GO terms that are targeted by NSP12 in the 5' UTR region were selected, which are representative of the metagene profile for NSP12 (Fig. 3b, Fig. 3f).
- Ribophorin I is part of an N-oligosaccharyl transferase complex that links high mannose oligosaccharides to asparagine residues found in the Asn-X-Ser/Thr consensus motif of nascent polypeptide chains.
- UDP-Glucose Glycoprotein Glucosyltransferase 1 is a soluble protein of the endoplasmic reticulum (ER) that selectively reglucosylates unfolded glycoproteins.
- NDUFA4 is part of the enzyme cytochrome-c oxidase (or complex IV) and is important for its activity and biogenesis.
- NSP12 was exogenously introduced by transiently transfecting HEK293T cells, and by comparing to a control where a GFP plasmid was transfected, it was found by Western blotting that UGGT1, RPN1 and NDUFA4 are expressed at higher levels (Fig. 4g).
- NSP12 N-linked glycosylation related genes
- UGGT1 and RPN1 mitochondrial cytochrome c oxidase subunit NDUFA4.
- As N-linked glycosylation of the host ACE2 receptor and the virus Spike protein is important for their interactions and virus entry, the results suggest that the SARS-CoV-2 infection could activate the N-linked glycosylation pathway to facilitate the viral-host interaction and virus entry through NSP12.
- Upregulation of NDUFA4 by NSP12 may also imply a role in modulating mitochondrial bioenergetics during virus infection, as viral biogenesis depends on energy and metabolic resources provided by the host.
- Example 5 - NSP9 associates with the nuclear pore to block mRNA export
- NSP9 interacts with several nuclear pore complex proteins, including NUP62, NUP214, NUP88, NUP54 and NUP58 (ref. 1) (Fig. 5a). It was confirmed that NUP62 indeed co-immunoprecipitated with NSP9, which led to the hypothesis that NSP9 may interfere with mRNA export by associating with the nuclear pore (Fig. 5b). To determine if NSP9 inhibits mRNA export activity, the mRNA levels of NSP9 target genes in cytoplasmic and nuclear fractions were assayed.
- NSP9 expressing BEAS-2B cells and the parental or wild type BEAS-2B cells were fractionated into nuclear and cytoplasmic fractions, followed by RNA extraction and RT-qPCR of target genes.
- NSP9 target genes were observed to have significant peaks near the 3' splice site, which may suggest interference of splicing-coupled export (Fig. 5c). It was found that target genes IL-1α, ANXA2 and UPP1 had lower cytosolic to total mRNA ratios in NSP9-expressing versus parental cells, whereas the cytosolic mRNA levels of non-targeted control genes MALAT1 and UBC were not significantly lowered (Fig. 5d).
- Interleukin 1α is an important inflammatory cytokine constitutively produced in epithelial cells and plays a central role in regulating immune responses, including being a master cytokine in acute lung inflammation induced by silica micro- and nanoparticles.
- Interleukin 1β binds to the same IL1 receptor as IL-1α, and its mRNA is bound by NSP9 even though it does not pass the IDR threshold.
- NSP9 inhibiting the nucleocytoplasmic export of the mRNA of IL-1α has any impact on the production of this cytokine
- an ELISA was performed on the growth media of BEAS-2B wild type and NSP9 expressing cells 48 hours after induction by several common cytokines.
- Interferon α, β and γ resulted in lowered IL-1α levels in NSP9 cells compared to wild type, though tumor necrosis factor alpha (TNFα) resulted in the greatest reduction (~30%) (Fig. 5e).
- TNFα tumor necrosis factor alpha
- NSP9 association with the nuclear pore complex proteins aligns with the observation of decreased cytoplasmic abundance of NSP9 target mRNAs, suggesting that NSP9 interaction may directly inhibit nuclear export.
- NSP9 reduced the production of its target gene IL-1α, which suggests that the export inhibition mechanism may be a strategy that SARS-CoV-2 employs to dampen inflammatory host response.
- Example 6 - SARS-CoV-2 protein-host RNA interactions identify potential therapeutic targets
- the host-viral interactions underlying SARS-CoV-2 infection are broadly understood in terms of the virus hijacking the host cell by globally shutting down the expression of host genes that are irrelevant or hostile to its replication, while the host attempts to fight off the virus by mounting apoptotic and inflammatory responses.
- viral proteins interact with host RNAs to activate a subset of host genes for its own survival through targeted translation activation or mRNA stabilization
- NSP12 specifically upregulates genes in the processes of protein N-linked glycosylation and mitochondrial ATP synthesis and transport. While it has been shown that NSP1 is a global repressor of host cell transcription and translation, it was also proposed that NSP9 contributes another layer to dampening host gene expression by inhibiting mRNA export. Understanding specifically upregulated processes and genes will enable the development of new antiviral strategies.
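The reporter readouts described in the list above reduce to simple ratios. The following Python sketch is illustrative only: the function names and the raw readings are hypothetical and are not taken from the disclosure. It normalizes a Renilla signal to the firefly control, expresses the result as a fold change over a control condition, and then divides a luciferase fold change by an mRNA fold change. With the values quoted for ORF9c (3.5-fold luciferase, 1.5-fold reporter RNA), that last ratio comes to roughly 2.3, which appears consistent with the 2.3-fold protein-over-RNA figure stated above.

```python
def normalized_reporter(renilla: float, firefly: float) -> float:
    """Renilla signal divided by the firefly control signal from the same sample."""
    if firefly <= 0:
        raise ValueError("firefly signal must be positive")
    return renilla / firefly


def fold_change(test_ratio: float, control_ratio: float) -> float:
    """Normalized reporter level relative to a control condition."""
    if control_ratio <= 0:
        raise ValueError("control ratio must be positive")
    return test_ratio / control_ratio


def protein_over_rna(luciferase_fold: float, mrna_fold: float) -> float:
    """Portion of a reporter fold change not accounted for by mRNA abundance."""
    if mrna_fold <= 0:
        raise ValueError("mRNA fold change must be positive")
    return luciferase_fold / mrna_fold


# Hypothetical raw readings in arbitrary units (assumed, not measured data).
test_ratio = normalized_reporter(renilla=7000.0, firefly=1000.0)
control_ratio = normalized_reporter(renilla=2000.0, firefly=1000.0)
reporter_fold = fold_change(test_ratio, control_ratio)

# Fold changes quoted in the text for ORF9c.
orf9c_ratio = protein_over_rna(luciferase_fold=3.5, mrna_fold=1.5)
print(reporter_fold, round(orf9c_ratio, 1))
```

Keeping the three ratios as separate functions mirrors the point made above that the luciferase readout alone does not separate translational effects from mRNA stabilizing effects; only the last ratio uses the RT-qPCR measurement.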
Abstract
Provided are RNA recognition complexes that include an RNA-targeting agent and a coronavirus-derived protein. In some embodiments, the RNA recognition complex further includes a linker. In some embodiments, the RNA-targeting agent includes CRISPR/Cas9 components (e.g., a Cas9 protein, a Cas13b protein, or a Cas13d protein). Also provided herein are methods of upregulating gene expression of a target RNA that include delivering an RNA recognition complex into a cell, wherein the RNA recognition complex comprises an RNA-targeting agent, and a coronavirus-derived protein, and wherein the RNA recognition complex binds to the target RNA and upregulates gene expression of the target RNA in the cell.
Description
RNA RECOGNITION COMPLEX AND USES THEREOF
CROSS-REFERENCE TO RELATED APPLICATIONS
This application claims priority to U.S. Provisional Patent Application No. 63/195,980, filed on June 2, 2021. The disclosure of the prior application is considered part of the disclosure of this application, and is incorporated herein by reference in its entirety.
SEQUENCE LISTING
This application contains a Sequence Listing that has been submitted electronically as an ASCII text file named 156700352W01_ST25. The ASCII text file, created on June 1, 2022, is 1.46 kilobytes in size. The material in the ASCII text file is hereby incorporated by reference in its entirety.
BACKGROUND
Recent transcriptome-wide and proteome-wide studies in viral protein-host protein interactions, viral protein and RNA interactions with host proteins, and viral RNA-host RNA interactions contribute to the understanding of host-virus interactions that are important to the SARS-CoV-2 virus life cycle and host response. However, the understanding of the RNA interactome of viral proteins remains limited.
It has been shown that the SARS-CoV-2 nucleocapsid protein interactome comprises many host RNA processing machinery proteins and stress granule proteins, suggesting a potential role in interfering with host RNA processing and driving stress granule formation. A majority of the viral proteins were found to associate with host RNA binding proteins (RBPs), suggesting a possibility that SARS-CoV-2 proteins interact with the host transcriptome to a greater degree than previously anticipated. However, a comprehensive interrogation of SARS-CoV-2 viral protein-host RNA interactions and how the virus hijacks host cellular machinery for its replication while it suppresses host gene expression is still lacking.
SUMMARY
The present disclosure is based, at least in part, on RNA recognition complexes and methods of modulating gene expression of a target RNA using the RNA recognition complexes.
Provided herein are RNA recognition complexes comprising: (a) an RNA-targeting agent; and (b) a coronavirus-derived protein. In some embodiments, the RNA recognition complex further comprises a linker.
In some embodiments, the RNA-targeting agent comprises CRISPR/Cas9 components. In some embodiments, the RNA-targeting agent comprises an RNA-targeting Cas effector. In some embodiments, the RNA-targeting Cas effector comprises a Cas9 protein, a Cas13b protein, or a Cas13d protein. In some embodiments, the RNA-targeting Cas effector comprises a nuclease-dead Cas9 (dCas9) protein. In some embodiments, the RNA-targeting Cas effector comprises a Cas13b protein. In some embodiments, the RNA-targeting Cas effector comprises a Cas13d protein.
In some embodiments, the RNA-targeting agent comprises a PUF protein. In some embodiments, the RNA-targeting agent comprises a pentatricopeptide repeat (PPR) protein.
In some embodiments, the RNA-targeting agent further comprises a single guide RNA (sgRNA), wherein the sgRNA is targeted to an individual gene of a cell. In some embodiments, the sgRNA is selected from a group consisting of SEQ ID NOs: 1-7.
In some embodiments, the coronavirus-derived protein comprises a SARS-CoV-2 protein. In some embodiments, the coronavirus-derived protein comprises an NSP1, an NSP2, an NSP3, an NSP6, an NSP12, an NSP14, an ORF3b, an ORF7b, or an ORF9c protein.
Also provided herein are methods of upregulating gene expression of a target RNA comprising: delivering an RNA recognition complex into a cell, wherein the RNA recognition complex comprises an RNA-targeting agent, and a coronavirus-derived protein, and wherein the RNA recognition complex binds to the target RNA and upregulates gene expression of the target RNA in the cell.
Also provided herein are methods of modulating gene expression of a target RNA comprising: delivering an RNA recognition complex into a cell, wherein the RNA recognition complex comprises an RNA-targeting agent, and a coronavirus-derived protein, and wherein the RNA recognition complex binds to the target RNA and modulates gene expression of the target RNA in the cell. In some embodiments, the method further comprises profiling the gene expression of the target RNA in the cell, wherein the gene expression is upregulated.
In some embodiments, the coronavirus-derived protein comprises a SARS-CoV-2 protein. In some embodiments, the coronavirus-derived protein comprises an NSP1, an NSP2, an NSP3, an NSP6, an NSP12, an NSP14, an ORF3b, an ORF7b, or an ORF9c protein.
In some embodiments, the method further comprises profiling the gene expression of the target RNA in the cell, wherein the gene expression is downregulated. In some embodiments, the coronavirus-derived protein comprises an NSP9 protein.
In some embodiments, the profiling comprises transcriptome analysis or gene expression analysis. In some embodiments, the profiling comprises enhanced cross-linking immunoprecipitation (eCLIP).
In some embodiments, the RNA-targeting agent comprises CRISPR/Cas9 components. In some embodiments, the RNA-targeting agent comprises an RNA-targeting Cas effector. In some embodiments, the RNA-targeting Cas effector comprises a Cas9 protein, a Cas13b protein, or a Cas13d protein. In some embodiments, the RNA-targeting Cas effector comprises a nuclease-dead Cas9 (dCas9) protein. In some embodiments, the RNA-targeting Cas effector comprises a Cas13b protein. In some embodiments, the RNA-targeting Cas effector comprises a Cas13d protein.
In some embodiments, the RNA-targeting agent comprises a PUF protein. In some embodiments, the RNA-targeting agent comprises a pentatricopeptide repeat (PPR) protein.
In some embodiments, the RNA-targeting agent further comprises a single guide RNA (sgRNA), wherein the sgRNA is targeted to the target RNA in the cell. In some embodiments, the sgRNA is selected from a group consisting of SEQ ID NOs: 1-7.
Also provided herein are methods of treating a disease associated with reduced gene expression in a subject in need thereof, the method comprising: administering an RNA recognition complex to the subject, wherein the RNA recognition complex comprises an RNA-targeting agent, and a coronavirus-derived protein, and wherein the RNA recognition complex binds to the target RNA and upregulates gene expression of the target RNA in the cell, thereby treating the disease associated with reduced gene expression.
In some embodiments, the RNA-targeting agent comprises CRISPR/Cas9 components. In some embodiments, the RNA-targeting agent comprises an RNA-targeting Cas effector. In some embodiments, the RNA-targeting Cas effector comprises a Cas9 protein, a Cas13b protein, or a Cas13d protein. In some embodiments, the RNA-targeting Cas effector comprises a nuclease-dead Cas9 (dCas9) protein. In some embodiments, the RNA-targeting Cas effector comprises a Cas13b protein. In some embodiments, the RNA-targeting Cas effector comprises a Cas13d protein.
In some embodiments, the RNA-targeting agent comprises a PUF protein. In some embodiments, the RNA-targeting agent comprises a pentatricopeptide repeat (PPR) protein.
In some embodiments, the RNA-targeting agent further comprises a single guide RNA (sgRNA), wherein the sgRNA is targeted to the target RNA in the cell. In some embodiments, the sgRNA is selected from a group consisting of SEQ ID NOs: 1-7.
In some embodiments, the coronavirus-derived protein comprises a SARS-CoV-2 protein. In some embodiments, the coronavirus-derived protein comprises an NSP1, an NSP2, an NSP3, an NSP6, an NSP12, an NSP14, an ORF3b, an ORF7b, or an ORF9c protein.
In some embodiments, the RNA-targeting agent comprises a sequence which is complementary to a target RNA sequence. In some embodiments, the RNA-targeting agent complementary sequence is at least 98% complementary to a target RNA sequence. In some embodiments, the RNA-targeting agent complementary sequence is at least 95% complementary to a target RNA sequence.
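As a rough illustration of the complementarity thresholds recited above, the following Python sketch computes percent complementarity between a guide sequence and a target RNA site of equal length. It is a simplified sketch under stated assumptions, not a definition used by the disclosure: the function name and the example sequences are hypothetical, only Watson-Crick pairs (A-U, G-C) are counted, G-U wobble pairs are treated as mismatches, and no gaps are allowed.

```python
# Watson-Crick partners for RNA bases (assumption: G-U wobble pairs are not counted).
PAIR = {"A": "U", "U": "A", "G": "C", "C": "G"}


def percent_complementarity(guide: str, target: str) -> float:
    """Percent of guide positions that base-pair with the target site.

    Both sequences are written 5'->3'. The guide is compared against the
    reversed target because the two strands pair in antiparallel orientation.
    """
    guide, target = guide.upper(), target.upper()
    if not guide or len(guide) != len(target):
        raise ValueError("sequences must be non-empty and of equal length")
    paired = sum(PAIR.get(g) == t for g, t in zip(guide, reversed(target)))
    return 100.0 * paired / len(guide)


# Hypothetical 20-nt target site (not one of SEQ ID NOs: 1-7).
target = "AUGGCUAGCUAGGAUCCGAU"

# A perfect reverse complement, and a copy with its first base changed.
perfect_guide = "".join(PAIR[base] for base in reversed(target))
one_mismatch_guide = "C" + perfect_guide[1:]

print(percent_complementarity(perfect_guide, target))
print(percent_complementarity(one_mismatch_guide, target))
```

Under these assumptions, a single mismatch in a 20-nt guide leaves 19 of 20 positions paired, which meets an "at least 95%" threshold but not an "at least 98%" threshold.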
Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Methods and materials are described herein for use in the present invention; other, suitable methods and materials known in the art can also be used. The materials, methods, and examples are illustrative only and not intended to be limiting. All publications, patent applications, patents, sequences, database entries, and other references mentioned herein are incorporated by reference in their entirety. In case of conflict, the present specification, including definitions, will control.
Other features and advantages of the invention will be apparent from the following detailed description and figures, and from the claims.
BRIEF DESCRIPTION OF DRAWINGS
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
FIG. 1a shows a schematic of eCLIP performed on SARS-CoV-2 proteins in virus infected Vero E6 cells. Proteins in infected cells are UV crosslinked to bound transcripts, which are immunoprecipitated (IP) with antibodies that recognize NSP8, NSP12 and N proteins. Protein-RNA IP product and Input lysate are resolved by SDS-PAGE and membrane transferred, followed by band excision at the estimated protein size to 75 kDa above in both IP and Input lanes. Excised bands are subsequently purified, and library barcoded for Illumina sequencing. Sequenced reads are mapped to the hg19 human genome (GCF_000001405.13).
FIG. 1b is a bar plot showing number of all genes, number of all peaks, number of coding genes and number of peaks mapping to coding genes from n = 2 biologically independent replicates of NSP12, NSP8 and N eCLIP of SARS-CoV-2 infected cells.
FIG. 1c is a stacked bar plot showing TPM of reads mapped to the Vero E6 genome or SARS-CoV-2 genome in each of NSP12, NSP8 and N eCLIP.
FIG. 1d is a Venn diagram showing number of African Green Monkey (host) genes targeted by NSP8 and NSP12.
FIG. 1e shows eCLIP read density mapped to the SARS-CoV-2 genome on both the positive (top) and negative (bottom) sense strand.
FIG. 1f shows predicted secondary structure of the sequence from the NSP12 peak mapped to the C-terminal of NSP3.
FIG. 1g shows RNA-seq read density plot from SARS-CoV-2 infected A549-ACE2 cells mapping sequenced reads to the positive (top, blue) and negative (bottom, light blue) sense strand of SARS-CoV-2 genome.
FIG. 1h shows phylogenetic tree analysis of complete genomes of representative betacoronavirus from NCBI reference sequences and bat and pangolin coronavirus sequences from GISAID.
FIG. 1i shows predicted recombination events of SARS-CoV-2 from phylogenetic analysis, with line plot indicating significance (-log10(P-value)) of predicted recombination breakpoints across the SARS-CoV-2 genome.
FIG. 2a shows a schematic showing SARS-CoV-2 proteins individually tagged and expressed in human lung epithelial cells BEAS-2B to assay with eCLIP.
FIG. 2b is a bar plot indicating number of all genes, number of all peaks, number of coding genes and number of coding peaks found to interact with each protein from n = 2 biologically independent experiments. In addition to SARS-CoV-2 proteins, ENCODE eCLIP data for example human RNA-binding proteins (hRBPs) are included for comparison.
FIG. 2c is a Venn diagram showing the number of coding genes expressed at TPM>1.0 in Vero E6 and BEAS-2B cells as targeted by NSP12 with significant peaks (p<0.001, >4-fold enrichment).
FIG. 2d shows Circos plot mapping SARS-CoV-2 proteins to top five enriched Gene Ontology terms of host transcripts.
FIG. 2e shows example sequence logos generated from all IDR peak reads for each SARS-CoV-2 eCLIP, with p-value indicated above each logo.
FIG. 2f shows example genome browser tracks for NSP3, NSP12, N and NSP2 mapping to DYNCH1, TUSC3, CXCL5 and NAP1L4 respectively.
FIG. 3a shows a stacked bar plot showing fraction of reproducible peaks (by IDR14) mapping to different regions of coding genes. 3ss, 3' splice site; 3utr, 3' untranslated region (UTR); 5ss, 5' splice site; 5utr, 5' UTR; CDS, coding sequence.
FIG. 3b shows example metagene profiles for NSP3, NSP12 and N. Mean of read density for each replicate data is shown as a solid line, with shaded regions indicating the 95% confidence interval.
FIG. 3c shows a schematic showing the Renilla-MS2 and Firefly dual luciferase reporter constructs, where individual SARS-CoV-2 proteins fused to MCP are recruited to the Renilla-MS2 mRNA.
FIGs. 3d and 3e show bar plots showing luciferase reporter activity ratios (FIG. 3d) and reporter RT-qPCR ratios (FIG. 3e) for the indicated coexpressed SARS-CoV-2 protein, known human regulators of RNA stability (CNOT7, BOLL) and negative control (FLAG peptide).
FIG. 3f shows a bar plot showing the fold change of luciferase activity ratio and RT-qPCR ratio.
FIGs. 3g and 3h show line plots of the fold enrichment of eCLIP read coverage at each position on rRNAs for NSP1 (FIG. 3g, blue) and ORF9c (FIG. 3h, blue), and the mean of 446 other RBPs deposited in the ENCODE consortium (grey; https://www.encodeproject.org/, accession code ENCSR456FVU) on 18S and 28S rRNAs (lightly shaded areas indicate 10-90% confidence intervals).
FIGs. 3i and 3j show quantitative flow cytometry reporter assay for targeted translation activation using RCas9-fused ORF9c.
FIG. 4a shows a cumulative distribution plot (CDF) showing distribution of proteomics data from Bojkova et al. of log2(fold change) of host genes in SARS-CoV-2 infected vs. uninfected cells, for genes whose RNAs are not interacting with SARS-CoV-2 proteins, all eCLIP target genes (peak p<10^-3, >8-fold enrichment), genes targeted by NSP12 (peak p<10^-3, >8-fold enrichment), and genes targeted by NSP12 with highly significant peaks (peak p<10^-7, >8-fold enrichment). P values are from KS test of the equality of log2(fold change) of each subset of eCLIP target genes to the untargeted genes.
FIG. 4b shows top 10 Gene Ontology terms of NSP12 target genes.
FIG. 4c shows a map of NSP12 target genes (blue boxes connected by red edges to yellow box at center), clustered by top GO terms. Grey edges are human protein-protein interaction data from Mentha. Dark blue frames indicate genes used in subsequent validation.
FIG. 4d shows a box plot showing quartiles of log2(fold change) protein levels of NSP12 target genes from proteomics data grouped by the GO term classification. Mann-Whitney U-test p-values indicated above each box compare the log2(fold change) of each subset of NSP12 target genes to all NSP12 target genes (red). Diamonds represent outliers, dots represent individual proteins.
FIG. 4e shows a schematic illustrating the hypothesis of NSP12 interacting with host mRNAs to upregulate the expression of target genes in mitochondrial and N-linked glycosylation processes.
FIG. 4f shows genome browser tracks of NSP12 eCLIP enriched RNA mapped to UGGT1, NDUFA4 and RPN1.
FIG. 4g shows western blots showing expression levels of UGGT1, NDUFA4 and RPN1, with β-actin as loading control, from GFP or NSP12 transfected HEK293T cells.
FIG. 4h shows immunofluorescence images (40X) of SARS-CoV-2 infected A549-ACE2 cells stained for SARS-CoV-2 NSP8 (red), endogenous genes (green), DNA content (blue).
FIG. 4i shows a bar plot showing mean relative fluorescence intensities of cells from FIG. 4h, dots represent segmented individual cells.
FIG. 5a shows a schematic illustrating NSP9 interacting with nuclear pore complex proteins NUP62, NUP214, NUP58, NUP88 and NUP54.
FIG. 5b shows a schematic showing the hypothesis of NSP9 inhibiting mRNA nucleocytoplasmic transport.
FIG. 5c shows genome browser tracks of NSP9 eCLIP target RNA mapped to IL-1α, IL-1β, ANXA2 and UPP1.
FIG. 5d shows a bar plot showing ratios of cytosolic to total fraction of mRNA levels measured by RT-qPCR, in wild type (WT) BEAS-2B cells, and BEAS-2B cells transduced to express NSP9 (*p<0.05, **p<0.0005, two-tailed multiple t-test with pooled variance, n = 2 biologically independent replicates).
FIG. 5e shows a bar plot showing mean concentration of IL-1α in culture media from WT and NSP9 expressing BEAS-2B cells, 48 h after induction by cytokines indicated on the x-axis.
FIG. 5f shows a bar plot showing mean concentration of IL-1α in culture media from WT and NSP9 expressing BEAS-2B cells, 48 h after induction by different levels of TNFα.
FIG. 5g shows a bar plot showing mean concentration of IL-1β in culture media from WT and NSP9 expressing BEAS-2B cells, 48 h after induction by 0 or 100 ng/ml TNFα (mean ± s.e.m, n = 3 biologically independent replicates, *p<0.05, **p<0.005, two-tailed test).
FIG. 6 shows a schematic illustrating the complex host-viral relationship. Flat-ended arrows indicate downregulation, pointed arrows indicate upregulation. Blue arrows are newly proposed interactions.
DETAILED DESCRIPTION
This disclosure describes RNA recognition complexes and methods of modulating gene expression of a target RNA by delivering the RNA recognition complex into a cell.
Various non-limiting aspects of these methods are described herein, and can be used in any combination without limitation. Additional aspects of various components of methods for modulating gene expression are known in the art.
It must be noted that, as used in the specification and the appended claims, the singular forms “a,” “an” and “the” include plural referents unless the context clearly dictates otherwise.
As used herein, the terms “about” and “approximately,” when used to modify an amount specified in a numeric value or range, indicate that the numeric value as well as reasonable deviations from the value known to the skilled person in the art, for example ± 20%, ± 10%, or ± 5%, are within the intended meaning of the recited value.
As used herein, “biological sample” can refer to a sample generally including cells and/or other biological material. A biological sample can be obtained from a non-mammalian organism (e.g., a plant, an insect, an arachnid, a nematode, a fungus, an amphibian, or a fish (e.g., zebrafish)). A biological sample can be obtained from a prokaryote such as a bacterium, e.g., Escherichia coli, Staphylococci or Mycoplasma pneumoniae; an archaeon; a virus such as Hepatitis C virus or human immunodeficiency virus; or a viroid. A biological sample can be obtained from a eukaryote, such as a patient derived organoid (PDO) or patient derived xenograft (PDX). Biological samples can be derived from a homogeneous culture or population of organisms or alternatively from a collection of several different organisms, for example, in a community or ecosystem.
The biological sample can include any number of macromolecules, for example, cellular macromolecules and organelles (e.g., mitochondria and nuclei). The biological sample can be a nucleic acid sample and/or protein sample. The biological sample can be a carbohydrate sample or a lipid sample. The biological sample can be obtained as a tissue sample, such as a tissue section, biopsy, a core biopsy, needle aspirate, or fine needle aspirate. The sample can be a fluid sample, such as a blood sample, urine sample, or saliva sample. The sample can be a skin sample, a colon sample, a cheek swab, a histology sample, a histopathology sample, a plasma or serum sample, a tumor sample, living cells, cultured cells, a clinical sample such as, for example, whole blood or blood-derived products, blood cells, or cultured tissues or cells, including cell suspensions.
As used herein, a “cell” can refer to either a prokaryotic or eukaryotic cell, optionally obtained from a subject or a commercially available source.
As used herein, “delivering”, “gene delivery”, “gene transfer”, “transducing” can refer to the introduction of an exogenous polynucleotide into a host cell, irrespective of the method used for the introduction. Such methods include a variety of well-known techniques such as vector-mediated gene transfer (e.g., viral infection/transfection, or various other protein-based or lipid-based gene delivery complexes) as well as techniques facilitating the delivery of “naked” polynucleotides (e.g., electroporation, “gene gun” delivery and various other techniques used for the introduction of polynucleotides). The introduced polynucleotide may be stably or transiently maintained in the host cell. Stable maintenance typically requires that the introduced polynucleotide either contains an origin of replication compatible with the host cell or integrates into a replicon of the host cell such as an extrachromosomal replicon (e.g., a plasmid) or a nuclear or mitochondrial chromosome.
In some embodiments, a polynucleotide can be inserted into a host cell by a gene delivery molecule. Examples of gene delivery molecules can include, but are not limited to, liposomes; micelles; biocompatible polymers, including natural polymers and synthetic polymers; lipoproteins; polypeptides; polysaccharides; lipopolysaccharides; artificial viral envelopes; metal particles; bacteria; viruses, such as baculovirus, adenovirus and retrovirus; bacteriophage; cosmids; plasmids; fungal vectors; and other recombination vehicles typically used in the art which have been described for expression in a variety of eukaryotic and prokaryotic hosts, and may be used for gene therapy as well as for simple protein expression.
As used herein, the term “encode” as it is applied to nucleic acid sequences refers to a polynucleotide which is said to “encode” a polypeptide if, in its native state or when manipulated by methods well known to those skilled in the art, it can be transcribed and/or translated to produce the mRNA for the polypeptide and/or a fragment thereof. The antisense strand is the complement of such a nucleic acid, and the encoding sequence can be deduced therefrom.
As used herein, the term “exogenous” refers to any material introduced from or originating from outside a cell, a tissue or an organism that is not produced by or does not originate from the same cell, tissue, or organism in which it is being introduced.
As used herein, the term “expression” refers to the process by which polynucleotides are transcribed into mRNA and/or the process by which the transcribed mRNA is subsequently translated into peptides, polypeptides, or proteins. In some embodiments, if the polynucleotide is derived from genomic DNA, expression may include splicing of the mRNA in a eukaryotic cell. The expression level of a gene may be determined by measuring the amount of mRNA or protein in a cell or tissue sample; further, the expression level of multiple genes can be determined to establish an expression profile for a particular sample.
As used herein, “nucleic acid” is used to include any compound and/or substance that comprises a polymer of nucleotides. In some embodiments, polymers of nucleotides are referred to as polynucleotides. Exemplary nucleic acids or polynucleotides can include, but are not limited to, ribonucleic acids (RNAs), deoxyribonucleic acids (DNAs), threose nucleic acids (TNAs), glycol nucleic acids (GNAs), peptide nucleic acids (PNAs), locked nucleic acids (LNAs, including LNA having a β-D-ribo configuration, α-LNA having an α-L-ribo configuration (a diastereomer of LNA), 2’-amino-LNA having a 2’-amino functionalization, and 2’-amino-α-LNA having a 2’-amino functionalization) or hybrids thereof. Naturally-occurring nucleic acids generally have a deoxyribose sugar (e.g., found in deoxyribonucleic acid (DNA)) or a ribose sugar (e.g., found in ribonucleic acid (RNA)).
A nucleic acid can contain nucleotides having any of a variety of analogs of these sugar moieties that are known in the art. A deoxyribonucleic acid (DNA) can have one or more bases selected from the group consisting of adenine (A), thymine (T), cytosine (C), and guanine (G), and a ribonucleic acid (RNA) can have one or more bases selected from the group consisting of uracil (U), adenine (A), cytosine (C), and guanine (G).
In some embodiments, the term “nucleic acid” refers to a deoxyribonucleic acid (DNA) or ribonucleic acid (RNA), or a combination thereof, in either a single- or double-stranded form. Unless specifically limited, the term encompasses nucleic acids containing known analogues of natural nucleotides that have similar binding properties as the reference nucleotides. Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses complementary sequences as well as the sequence explicitly indicated. In some embodiments of any of the isolated nucleic acids described herein, the isolated nucleic acid is DNA. In some embodiments of any of the isolated nucleic acids described herein, the isolated nucleic acid is RNA.
Modifications can be introduced into a nucleotide sequence by standard techniques known in the art, such as site-directed mutagenesis and polymerase chain reaction (PCR)-mediated mutagenesis. Conservative amino acid substitutions are ones in which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art. These families include amino acids with basic side chains (e.g., arginine, lysine and histidine), acidic side chains (e.g., aspartic acid and glutamic acid), uncharged polar side chains (e.g., asparagine, cysteine, glutamine, glycine, serine, threonine, tyrosine, and tryptophan), nonpolar side chains (e.g., alanine, isoleucine, leucine, methionine, phenylalanine, proline, and valine), beta-branched side chains (e.g., isoleucine, threonine, and valine), and aromatic side chains (e.g., histidine, phenylalanine, tryptophan, and tyrosine).
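By way of a non-limiting sketch, the side-chain families listed above can be encoded as a lookup so that a proposed substitution can be classified as conservative when both residues share at least one family. The family memberships are taken from the paragraph above; the dictionary and function names are hypothetical and are used for illustration only.

```python
# Side-chain families as enumerated in the specification (one-letter codes).
SIDE_CHAIN_FAMILIES = {
    "basic": set("RKH"),
    "acidic": set("DE"),
    "uncharged_polar": set("NCQGSTYW"),
    "nonpolar": set("AILMFPV"),
    "beta_branched": set("ITV"),
    "aromatic": set("HFWY"),
}


def shared_families(residue_a: str, residue_b: str) -> list:
    """Return the sorted names of families containing both residues."""
    a, b = residue_a.upper(), residue_b.upper()
    return sorted(
        name
        for name, members in SIDE_CHAIN_FAMILIES.items()
        if a in members and b in members
    )


def is_conservative(original: str, replacement: str) -> bool:
    """True if the residue is replaced by a different residue of a shared family."""
    if original.upper() == replacement.upper():
        return False  # not a substitution at all
    return bool(shared_families(original, replacement))
```

Because some residues belong to more than one family (e.g., isoleucine is both nonpolar and beta-branched), the sketch treats a single shared family as sufficient; a stricter criterion could be substituted.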
Unless otherwise specified, a “nucleotide sequence encoding a protein” includes all nucleotide sequences that are degenerate versions of each other and thus encode the same amino acid sequence.
As used herein, the term “plurality” can refer to a state of having a plural (e.g., more than one) number of different types of things (e.g., a cell, a genomic sequence, a subject, a system, or a protein). In some embodiments, a plurality of nucleic acid sequences can be more than one nucleic acid sequence wherein each nucleic acid sequence is different from each other. In other embodiments, “plurality” can refer to a state of having a plural number of the same thing (e.g., a cell, a genomic sequence, a subject, a system, or a protein). In some embodiments, a plurality of nucleic acid sequences are identical to each other. In some embodiments, a plurality of cells are cellular clones (e.g., identical cells).
As used herein, the term “subject” is intended to include any mammal. In some embodiments, the subject is a cat, a dog, a goat, a human, a non-human primate, a rodent (e.g., a mouse or a rat), a pig, or a sheep.
As used herein, the term “transduced”, “transfected”, or “transformed” refers to a process by which exogenous nucleic acid is introduced or transferred into a cell. A “transduced,” “transfected,” or “transformed” mammalian cell is one that has been transduced, transfected or transformed with exogenous nucleic acid (e.g., a gene delivery vector) that includes an exogenous nucleic acid encoding an RNA-binding zinc finger domain.
As used herein, the term “treating” means a reduction in the number, frequency, severity, or duration of one or more (e.g., two, three, four, five, or six) symptoms of a disease or disorder in a subject (e.g., any of the subjects described herein), and/or results in a decrease in the development and/or worsening of one or more symptoms of a disease or disorder in a subject.
RNA Recognition Complex
As used herein, “RNA recognition complex” can refer to a system that can recognize specific mRNA transcripts and modulate protein expression. In some embodiments, an RNA recognition complex comprises an RNA-targeting agent and a coronavirus-derived protein. In some embodiments, the RNA-targeting agent can be fused or tethered to the coronavirus-derived protein.
As used herein, “RNA-targeting agent” can refer to an agent that can target and bind to a specific sequence in DNA or RNA. In some embodiments, an RNA-targeting agent comprises CRISPR/Cas9 components. As used herein, the term “CRISPR” refers to a technique of sequence specific genetic manipulation relying on the clustered regularly interspaced short palindromic repeats pathway, which unlike RNA interference regulates gene expression at a transcriptional level. In some embodiments, the RNA-targeting agent comprises a PUF protein. In some embodiments, the RNA-targeting agent comprises a pentatricopeptide repeat (PPR) protein. In some embodiments, the RNA-targeting agent comprises a protein that has an RNA binding domain.
As used herein, “coronavirus-derived protein” can refer to a SARS-CoV-2 protein, and/or any variant thereof. In some embodiments, the coronavirus-derived protein includes a NSP1, a NSP2, a NSP3, a NSP6, a NSP12, a NSP14, a ORF3b, a ORF7b, or a ORF9c protein. In some embodiments, the coronavirus-derived protein includes a NSP9 protein.
In some embodiments, the RNA recognition complex further comprises a nuclear export signal and a coronavirus translation activation protein.
In some embodiments, an RNA recognition complex modulates protein expression in a temporal manner. In some embodiments, the RNA recognition complex can activate protein expression. In some embodiments, the RNA recognition complex can upregulate protein expression. In some embodiments, the RNA recognition complex can downregulate protein expression.
RNA-Targeting Agents
CRISPR/Cas Systems
In some embodiments, an RNA-targeting agent is an RNA-guided target RNA-binding fusion protein. RNA-guided target RNA-binding fusion proteins comprise at least one RNA-binding polypeptide which corresponds to a gRNA which guides the RNA-binding polypeptide to target RNA. RNA-guided target RNA-binding fusion proteins include, without limitation, RNA-binding polypeptides which are CRISPR/Cas-based RNA-binding polypeptides or portions thereof.
In some embodiments, the RNA-targeting agent comprises an RNA-targeting Cas effector. As used herein, a “Cas effector” or “CRISPR-associated protein” can refer to an enzyme or protein that uses CRISPR sequences as a guide to recognize and cleave specific nucleic acid strands that are complementary to the CRISPR sequence. An RNA-targeting Cas effector can associate with a CRISPR RNA sequence to bind to and alter DNA or RNA target sequences. In some embodiments, an RNA-targeting Cas effector can be a Cas9 endonuclease that makes a double-stranded break in a target DNA sequence. In some embodiments, an RNA-targeting Cas effector can be a Cas12a nuclease that also makes a double-stranded break in a target DNA sequence. In some embodiments, an RNA-targeting Cas effector can be a Cas13 nuclease which targets RNA. In some embodiments, the RNA-targeting Cas effector comprises a Cas9 protein, a Cas13b protein, or a Cas13d protein. In some embodiments, the RNA-targeting Cas effector comprises a nuclease dead Cas9 (dCas9) protein. In some embodiments, the RNA-targeting Cas effector comprises a Cas13b protein. In some embodiments, the RNA-targeting Cas effector comprises a Cas13d protein.
In some embodiments, the RNA-targeting agent further comprises a single guide RNA (sgRNA), wherein the sgRNA is targeted to an individual gene of a cell. The term “single guide RNA” or “sgRNA” refers to a specific type of gRNA that combines tracrRNA (transactivating RNA), which binds to Cas9 to activate the complex to create the necessary strand breaks, and crRNA (CRISPR RNA), comprising nucleotides complementary to the tracrRNA, into a single RNA construct. Exemplary methods of employing the CRISPR technique are described in WO 2017/091630, which is incorporated by reference in its entirety.
In some embodiments, the single guide RNA can recognize a target RNA, for example, by hybridizing to the target RNA. In some embodiments, the single guide RNA comprises a sequence that is complementary to the target RNA. In some embodiments, the sgRNA can include one or more modified nucleotides. In some embodiments, the sgRNA has a length that is about 10 nt (e.g., about 20 nt, about 30 nt, about 40 nt, about 50 nt, about 60 nt, about 70 nt, about 80 nt, about 90 nt, about 100 nt, about 120 nt, about 140 nt, about 160 nt, about 180 nt, about 200 nt, about 300 nt, about 400 nt, about 500 nt, about 600 nt, about 700 nt, about 800 nt, about 900 nt, about 1000 nt, or about 2000 nt). In some embodiments, the sgRNA can include a sequence from SEQ ID NOs: 1-7 (Table 1).
[Table 1]
In some embodiments, a single guide RNA can recognize a variety of RNA targets. For example, a target RNA can be messenger RNA (mRNA), ribosomal RNA (rRNA), signal recognition particle RNA (SRP RNA), transfer RNA (tRNA), small nuclear RNA (snRNA), small nucleolar RNA (snoRNA), antisense RNA (aRNA), long noncoding RNA (lncRNA), microRNA (miRNA), piwi-interacting RNA (piRNA), small interfering RNA (siRNA), short hairpin RNA (shRNA), retrotransposon RNA, viral genome RNA, or viral noncoding RNA. In some embodiments, a target RNA can be an RNA involved in pathogenesis of conditions such as cancers, neurodegeneration, cutaneous conditions, endocrine conditions, intestinal diseases, infectious conditions, neurological conditions, liver diseases, heart disorders, or autoimmune diseases. In some embodiments, a target RNA can be a therapeutic target for conditions such as cancers, neurodegeneration, cutaneous conditions, endocrine conditions, intestinal diseases, infectious conditions, neurological conditions, liver diseases, heart disorders, or autoimmune diseases. In some embodiments, the sgRNA can be driven by a promoter. In some embodiments, the promoter can be a U6 polymerase III promoter.
PUF Proteins
In some embodiments, an RNA-targeting agent is not an RNA-guided target RNA-binding fusion protein and as such comprises at least one RNA-binding polypeptide which is capable of binding a target RNA without a corresponding gRNA sequence. Such non-guided RNA-binding polypeptides include, without limitation, at least one RNA-binding protein or RNA-binding portion thereof which is a PUF (Pumilio and FBF homology family) protein. This type of RNA-binding polypeptide can be used in place of a gRNA-guided RNA binding protein such as CRISPR/Cas. The unique RNA recognition mode of PUF proteins (named for Drosophila Pumilio and C. elegans fem-3 binding factor), which are involved in mediating mRNA stability and translation, is well known in the art. The PUF domain of human Pumilio1, also known in the art, binds tightly to cognate RNA sequences and its specificity can be modified. It contains eight PUF repeats that recognize eight consecutive RNA bases with each repeat recognizing a single base. Since two amino acid side chains in each repeat recognize the Watson-Crick edge of the corresponding base and determine the specificity of that repeat, a PUF domain can be designed to specifically bind most 8-nt RNA. Wang et al., Nat Methods. 2009; 6(11): 825-830. See WO2012/068627, which is incorporated by reference herein in its entirety, for additional disclosure regarding PUF proteins.
In some embodiments of the non-guided RNA-binding fusion proteins of the disclosure, the fusion protein comprises at least one RNA-binding protein or RNA-binding portion thereof which is a PUMBY (Pumilio-based assembly) protein. RNA-binding protein PumHD (Pumilio homology domain, a member of the PUF family), which has been widely used in native and modified form for targeting RNA, has been engineered to yield a set of four canonical protein modules, each of which targets one RNA base. These modules (i.e., Pumby, for Pumilio-based assembly) can be concatenated in chains of varying composition and length, to bind desired target RNAs. The specificity of such Pumby-RNA interactions is high, with undetectable binding of a Pumby chain to RNA sequences that bear three or more mismatches from the target sequence. Adamala et al., PNAS, 2016; 113(19): E2579-E2588. See also US 2016/0238593, which is incorporated by reference herein in its entirety, for additional disclosure regarding PUMBY proteins.
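As a non-limiting illustration of the mismatch tolerance described above, candidate off-target sites for a Pumby chain can be screened by sliding the intended target sequence along a transcript and counting mismatches at each window. The sketch below assumes that windows with fewer than three mismatches are the ones worth flagging, since binding is reported as undetectable at three or more; the function names, the default cutoff parameter, and the example sequences in the usage note are hypothetical.

```python
def count_mismatches(site: str, target: str) -> int:
    """Number of positions at which two equal-length sequences differ."""
    if len(site) != len(target):
        raise ValueError("site and target must have equal length")
    return sum(a != b for a, b in zip(site, target))


def scan_sites(transcript: str, target: str, max_mismatches: int = 2) -> list:
    """Return (position, site, mismatches) for every window of the transcript
    that differs from the target at no more than max_mismatches positions.

    Positions are 0-based offsets from the 5' end of the transcript.
    """
    rna = transcript.upper().replace("T", "U")
    tgt = target.upper().replace("T", "U")
    hits = []
    for start in range(len(rna) - len(tgt) + 1):
        site = rna[start:start + len(tgt)]
        mismatches = count_mismatches(site, tgt)
        if mismatches <= max_mismatches:
            hits.append((start, site, mismatches))
    return hits
```

For example, scanning the illustrative transcript CCUGUACAUAGGGUGAACAUACC for the 8-nt target UGUACAUA with max_mismatches=0 returns only the exact site at offset 2, while the default cutoff also reports the single-mismatch site UGAACAUA at offset 13.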
In some embodiments of the compositions of the disclosure, the RNA-targeting agent comprises a Pumilio and FBF (PUF) protein. In some embodiments, the RNA-targeting agent comprises a Pumilio-based assembly (PUMBY) protein.
PPR Proteins
In some embodiments of the compositions of the disclosure, at least one of the RNA-binding proteins or RNA-binding portions thereof is a PPR protein (proteins with pentatricopeptide repeat (PPR) motifs derived from plants). PPR proteins are nuclear-encoded and act exclusively at the RNA level in organelles (chloroplasts and mitochondria), where they act specifically on RNA cleavage, translation, splicing, RNA editing, and RNA stability. A PPR motif is typically 35 amino acids long, and PPR proteins typically have a structure in which about 10 PPR motifs are arranged contiguously. The combination of PPR motifs can be used for sequence-selective binding to RNA. PPR domains or RNA-binding domains may be configured to be catalytically inactive. See WO 2013/058404, which is incorporated herein by reference in its entirety, for additional disclosure regarding PPR proteins.
Coronavirus-Derived Protein
Coronaviruses contain a positive-sense, single-stranded RNA genome, and the viral genome consists of more than 29,000 bases and encodes 29 proteins. SARS-CoV-2 has four structural proteins: the E and M proteins, which form the viral envelope; the N protein, which binds to the virus’s RNA genome; and the S protein, which binds to human receptors. As used herein, “coronavirus-derived protein” can refer to a protein that is encoded by the coronavirus viral genome. In some embodiments, the coronavirus-derived protein can be a non-structural protein (NSP). In some embodiments, the non-structural protein can comprise a NSP1, a NSP2, a NSP3, a NSP4, a NSP5, a NSP6, a NSP7, a NSP8, a NSP9, a NSP10, a NSP12, a NSP13, a NSP14, a NSP15, or a NSP16 protein. In some embodiments, the coronavirus-derived protein can be an accessory protein. In some embodiments, the accessory protein can comprise a ORF3a, a ORF6, a ORF7a, a ORF7b, a ORF8, or a ORF10 protein. In some embodiments, the coronavirus-derived protein can be a structural protein. In some embodiments, the structural protein can comprise a spike (S) protein, a nucleocapsid (N) protein, a membrane (M) protein, or an envelope (E) protein. In some embodiments, the coronavirus-derived protein comprises a NSP1, a NSP2, a NSP3, a NSP6, a NSP12, a NSP14, a ORF3b, a ORF7b, or a ORF9c protein. In some embodiments, the coronavirus-derived protein comprises a NSP9 protein.
Linker
In some embodiments, the RNA recognition complex disclosed herein comprises a linker between the RNA-targeting agent and the coronavirus-derived protein. In some embodiments, the linkers or linker motifs can be any flexible peptides that connect two protein domains or motifs without interfering with their functions. In some embodiments, the linker is a peptide linker. In some embodiments, the peptide linker comprises one or more repeats of the tri-peptide GGS. In other embodiments, the linker is a non-peptide linker. In some embodiments, the non-peptide linker comprises polyethylene glycol (PEG), polypropylene glycol (PPG), co-poly (ethylene/propylene) glycol, polyoxyethylene (POE), polyurethane, polyphosphazene, polysaccharides, dextran, polyvinyl alcohol, polyvinylpyrrolidones, polyvinyl ethyl ether, polyacryl amide, polyacrylate, polycyanoacrylates, lipid polymers, chitins, hyaluronic acid, heparin, or an alkyl linker. See WO2017/192434, WO2019/089817, and WO2019/241483, each of which is herein incorporated by reference in its entirety, for more disclosure regarding using linkers.
Nucleic Acids
Provided herein are the nucleic acid sequences encoding the RNA recognition complexes disclosed herein for use in gene transfer and expression techniques described herein. It should be understood, although not always explicitly stated that the sequences provided herein can be used to provide the expression product as well as substantially identical sequences that produce a protein that has the same biological properties. These “biologically equivalent” or “biologically active” or “equivalent” polypeptides are encoded by equivalent polynucleotides as described herein. They may possess at least 60%, or alternatively, at least 65%, or alternatively, at least 70%, or alternatively, at least 75%, or alternatively, at least 80%, or alternatively at least 85%, or alternatively at least 90%, or alternatively at least 95% or alternatively at least 98%, identical primary amino acid sequence to the reference polypeptide when compared using sequence identity methods run under default conditions. Specific polypeptide sequences are provided as examples of particular embodiments. Modifications to the sequences to amino acids can include alternate amino acids that have similar charge. Additionally, an equivalent polynucleotide is one that hybridizes under stringent conditions to the reference polynucleotide or its complement or in reference to a polypeptide, a polypeptide encoded by a polynucleotide that hybridizes to the reference encoding polynucleotide under stringent conditions or its complementary strand. Alternatively, an equivalent polypeptide or protein is one that is expressed from an equivalent polynucleotide.
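As a non-limiting illustration of the percent-identity comparison described above, the sketch below aligns two sequences globally and reports identity as matched columns divided by alignment length. The disclosure does not specify an alignment program or scoring scheme; the Needleman-Wunsch scoring used here (match +1, mismatch -1, gap -1), the choice of denominator, and the function names are assumptions made only for illustration.

```python
def global_align(a: str, b: str, match=1, mismatch=-1, gap=-1):
    """Needleman-Wunsch global alignment; returns the two gapped strings."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + sub,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,      # gap in b
                score[i][j - 1] + gap,      # gap in a
            )
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a: str, b: str) -> float:
    """Identical aligned columns as a percentage of alignment length."""
    aligned_a, aligned_b = global_align(a.upper(), b.upper())
    if not aligned_a:
        raise ValueError("sequences must not both be empty")
    same = sum(x == y and x != "-" for x, y in zip(aligned_a, aligned_b))
    return 100.0 * same / len(aligned_a)
```

Under these assumptions, a 10-residue polypeptide with one substitution is 90% identical to its reference and would satisfy the "at least 90%" embodiment but not the "at least 95%" embodiment. Established alignment tools may use different scoring and a different identity denominator, so values are not interchangeable across methods.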
The nucleic acid sequences (e.g., polynucleotide sequences) disclosed herein may be codon-optimized which is a technique well known in the art. Codon optimization refers to the fact that different cells differ in their usage of particular codons. This codon bias corresponds to a bias in the relative abundance of particular tRNAs in the cell type. By altering the codons in the sequence to match with the relative abundance of corresponding tRNAs, it is possible to increase expression. It is also possible to decrease expression by deliberately choosing codons for which the corresponding tRNAs are known to be rare in a particular cell type. Codon usage tables are known in the art for mammalian cells, as well as for a variety of other organisms. Based on the genetic code, nucleic acid sequences coding for, e.g., a Cas protein, can be generated. In some embodiments, such a sequence is optimized for expression in a host or target cell, such as a host cell used to express the Cas protein or a cell in which the disclosed methods are practiced (such as in a mammalian cell, e.g., a human cell). Codon preferences and codon usage tables for a particular species can be used to engineer isolated nucleic acid molecules encoding a Cas protein (such as one encoding a protein having at least 80%, at least 85%, at least 90%, at least 92%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to its corresponding wild-type protein) that takes advantage of the codon usage preferences of that particular species. In some embodiments, an isolated nucleic acid molecule encoding at least one Cas protein (which can be part of a vector) includes at least one Cas protein coding sequence that is codon optimized for expression in a eukaryotic cell, or at least one Cas protein coding sequence codon optimized for expression in a human cell. 
In one embodiment, such a codon optimized Cas coding sequence has at least 80%, at least 85%, at least 90%, at least 92%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to its corresponding wild-type or originating sequence. In another embodiment, a eukaryotic cell codon optimized nucleic acid sequence encodes a Cas protein having at least 85%, at least 90%, at least 92%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to its corresponding wild-type or originating protein.
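As a non-limiting sketch of the codon selection described above, the following back-translates a protein sequence by choosing, for each amino acid, either the most frequent codon (to favor expression) or the least frequent codon (to reduce expression). The codon usage table shown is a truncated, hypothetical placeholder with made-up frequencies; a real table for the intended host cell would be substituted. Practical codon optimization tools also weigh other factors (e.g., GC content and repeats) that are not modeled here.

```python
# Hypothetical, truncated codon-usage table: amino acid -> {codon: relative frequency}.
# The frequencies are illustrative placeholders, not measured values.
CODON_USAGE = {
    "M": {"ATG": 1.0},
    "K": {"AAG": 0.6, "AAA": 0.4},
    "L": {"CTG": 0.5, "CTC": 0.3, "TTA": 0.2},
    "*": {"TGA": 0.5, "TAA": 0.3, "TAG": 0.2},
}


def back_translate(protein: str, usage: dict, prefer_rare: bool = False) -> str:
    """Pick one codon per residue: the most frequent by default, the rarest if prefer_rare."""
    pick = min if prefer_rare else max
    codons = []
    for residue in protein:
        table = usage[residue]  # raises KeyError for residues missing from the table
        codons.append(pick(table, key=table.get))
    return "".join(codons)


optimized = back_translate("MKL*", CODON_USAGE)
deoptimized = back_translate("MKL*", CODON_USAGE, prefer_rare=True)
```

Both outputs encode the same polypeptide; only the synonymous codon choices differ, which is why the nucleic acid identity to the originating sequence can fall below 100% while the encoded protein is unchanged.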
Vectors
In some embodiments of the compositions and methods of the disclosure, a vector comprises a guide RNA of the disclosure. In some embodiments, the vector comprises at least one guide RNA of the disclosure. In some embodiments, the vector comprises one or more guide RNA(s) of the disclosure. In some embodiments, the vector comprises two or more guide RNAs of the disclosure. In some embodiments, the vector further comprises a nucleic acid corresponding to an RNA recognition complex of the disclosure. In some embodiments, the RNA recognition complex comprises an RNA-targeting agent and a coronavirus-derived protein.
In some embodiments of the compositions and methods of the disclosure, a first vector comprises a guide RNA of the disclosure and a second vector comprises an RNA recognition complex of the disclosure. In some embodiments, the first vector comprises at least one guide RNA of the disclosure. In some embodiments, the first vector comprises one or more guide RNA(s) of the disclosure. In some embodiments, the first vector comprises two or more guide RNA(s) of the disclosure. In some embodiments, the RNA recognition complex comprises an RNA-targeting agent and a coronavirus-derived protein. In some embodiments, the first vector and the second vector are identical. In some embodiments, the first vector and the second vector are not identical.
In some embodiments of the compositions and methods of the disclosure, a vector of the disclosure is a viral vector. In some embodiments, the viral vector includes a sequence isolated or derived from a retrovirus. In some embodiments, the viral vector includes a sequence isolated or derived from a lentivirus. In some embodiments, the viral vector includes a sequence isolated or derived from an adenovirus. In some embodiments, the viral vector includes a sequence isolated or derived from an adeno-associated virus (AAV). In some embodiments, the viral vector is replication incompetent. In some embodiments, the viral vector is isolated or recombinant. In some embodiments, the viral vector is self-complementary.
In some embodiments of the compositions and methods of the disclosure, the viral vector includes a sequence isolated or derived from an adeno-associated virus (AAV). In some embodiments, the viral vector includes an inverted terminal repeat sequence or a capsid sequence that is isolated or derived from an AAV of serotype AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, AAV.rh32/33, AAV.rh43, AAV.rh64R1, and any combinations or equivalents thereof. In some embodiments, the viral vector is replication incompetent. In some embodiments, the viral vector is isolated or recombinant (rAAV). In some embodiments, the viral vector is self-complementary (scAAV). In some embodiments, the AAV vector has low toxicity. In some embodiments, the AAV vector does not incorporate into the host genome, thereby having a low probability of causing insertional mutagenesis. In some embodiments, the AAV vector can encode a range of total polynucleotides from 4.5 kb to 4.75 kb.
In some embodiments of the compositions and methods of the disclosure, a vector of the disclosure is a non-viral vector. In some embodiments, the vector comprises or consists of a nanoparticle, a micelle, a liposome or lipoplex, a polymersome, a polyplex or a dendrimer. In some embodiments, the vector is an expression vector or recombinant expression system. As used herein, the term “recombinant expression system” refers to a genetic construct for the expression of certain genetic material formed by recombination.
In some embodiments of the compositions and methods of the disclosure, an expression vector, viral vector or non-viral vector provided herein includes, without limitation, an expression control element. An “expression control element” as used herein refers to any sequence that regulates the expression of a coding sequence, such as a gene. Exemplary expression control elements include but are not limited to promoters, enhancers, microRNAs, post-transcriptional regulatory elements, polyadenylation signal sequences, and introns. Expression control elements may be constitutive, inducible, repressible, or tissue-specific, for example. A “promoter” is a control sequence that is a region of a polynucleotide sequence at which initiation and rate of transcription are controlled. It may contain genetic elements at which regulatory proteins and molecules may bind such as RNA polymerase and other transcription factors. In some embodiments, expression control by a promoter is tissue-specific. Non-limiting exemplary promoters include CMV, CBA, CAG, Cbh, EF-1α, PGK, UBC, GUSB, UCOE, hAAT, TBG, Desmin, MCK, C5-12, NSE, Synapsin, PDGF, MecP2, CaMKII, mGluR2, NFL, NFH, hb2, PPE, ENK, EAAT2, GFAP, MBP, and U6 promoters. An “enhancer” is a region of DNA that can be bound by activating proteins to increase the likelihood or frequency of transcription. Non-limiting exemplary enhancers and post-transcriptional regulatory elements include the CMV enhancer and WPRE.
In some embodiments, the vector is a viral vector. In some embodiments, the vector is an adenoviral vector, an adeno-associated viral (AAV) vector, or a lentiviral vector. In some embodiments, the vector is a retroviral vector, an adenoviral/retroviral chimera vector, a herpes simplex viral I or II vector, a parvoviral vector, a reticuloendotheliosis viral vector, a polioviral vector, a papillomaviral vector, a vaccinia viral vector, or any hybrid or chimeric vector incorporating favorable aspects of two or more viral vectors. In some embodiments, the vector further comprises one or more expression control elements operably linked to the polynucleotide. In some embodiments, the vector further comprises one or more selectable markers. In some embodiments, the lentiviral vector is an integrase-competent lentiviral vector (ICLV). In some embodiments, the lentiviral vector can refer to the transgene plasmid vector as well as the transgene plasmid vector in conjunction with related plasmids (e.g., a packaging plasmid, a rev expressing plasmid, an envelope plasmid) as well as a lentiviral-based particle capable of introducing exogenous nucleic acid into a cell through a viral or viral-like entry mechanism. Lentiviral vectors are well-known in the art (see, e.g., Trono D. (2002) Lentiviral vectors, New York: Springer-Verlag Berlin Heidelberg and Durand et al. (2011) Viruses 3(2): 132-159 doi: 10.3390/v3020132). In some embodiments, exemplary lentiviral vectors that may be used in any of the herein described compositions, systems, methods, and kits can include a human immunodeficiency virus (HIV) 1 vector, a modified human immunodeficiency virus (HIV) 1 vector, a human immunodeficiency virus (HIV) 2 vector, a modified human immunodeficiency virus (HIV) 2 vector, a sooty mangabey simian immunodeficiency virus (SIVsM) vector, a modified sooty mangabey simian immunodeficiency virus (SIVsM) vector, an African green monkey simian immunodeficiency virus (SIVAGm) vector, a modified African green monkey simian immunodeficiency virus (SIVAGm) vector, an equine infectious anemia virus (EIAV) vector, a modified equine infectious anemia virus (EIAV) vector, a feline immunodeficiency virus (FIV) vector, a modified feline immunodeficiency virus (FIV) vector, a Visna/maedi virus (VNV/VMV) vector, a modified Visna/maedi virus (VNV/VMV) vector, a caprine arthritis-encephalitis virus (CAEV) vector, a modified caprine arthritis-encephalitis virus (CAEV) vector, a bovine immunodeficiency virus (BIV) vector, or a modified bovine immunodeficiency virus (BIV) vector.
Pharmaceutical Compositions
The methods described herein can include the administration of pharmaceutical compositions and formulations including vectors delivering an RNA recognition complex including an RNA-targeting agent and a coronavirus-derived protein.
In some embodiments, the compositions are formulated with a pharmaceutically acceptable carrier. The pharmaceutical compositions and formulations can be administered parenterally, topically, orally or by local administration, such as by aerosol or transdermally. The pharmaceutical compositions can be formulated in any way and can be administered in a variety of unit dosage forms depending upon the condition or disease and the degree of illness, the general medical condition of each patient, the resulting preferred method of administration and the like. Details on techniques for formulation and administration of pharmaceuticals are well described in the scientific and patent literature, see, e.g., Remington: The Science and Practice of Pharmacy, 21st ed., 2005.
The RNA recognition complex can be administered alone or as a component of a pharmaceutical formulation (composition). The compounds may be formulated for administration in any convenient way for use in human or veterinary medicine. The compositions may conveniently be presented in unit dosage form and may be prepared by any methods well known in the art of pharmacy. The amount of active ingredient which can be combined with a carrier material to produce a single dosage form can vary depending upon the host being treated and the particular mode of administration. The amount of active ingredient which can be combined with a carrier material to produce a single dosage form will generally be that amount of the compound which produces a therapeutic effect.
Pharmaceutical compositions described herein can be prepared according to any method known to the art for the manufacture of pharmaceuticals. Such compositions can contain, for example, preserving agents. A composition can be admixed with nontoxic pharmaceutically acceptable excipients which are suitable for manufacture. Compositions may comprise one or more diluents, emulsifiers, preservatives, buffers, excipients, etc. and may be provided in such forms as liquids, powders, emulsions, lyophilized powders, controlled release formulations, on patches, in implants, etc. Wetting agents, emulsifiers, and lubricants, such as sodium lauryl sulfate and magnesium stearate, as well as coloring agents, release agents, coating agents, sweetening, flavoring and perfuming agents, preservatives and antioxidants can also be present in the compositions.
Aqueous suspensions can contain an active agent (e.g., nucleic acid sequences of the invention) in admixture with excipients suitable for the manufacture of aqueous suspensions, e.g., for aqueous intradermal injections. Such excipients include a suspending agent, such as sodium carboxymethylcellulose, methylcellulose, hydroxypropylmethylcellulose, sodium alginate, polyvinylpyrrolidone, gum tragacanth and gum acacia, and dispersing or wetting agents such as a naturally occurring phosphatide (e.g., lecithin), a condensation product of an alkylene oxide with a fatty acid (e.g., polyoxyethylene stearate), a condensation product of ethylene oxide with a long chain aliphatic alcohol (e.g., heptadecaethylene oxycetanol), a condensation product of ethylene oxide with a partial ester derived from a fatty acid and a hexitol (e.g., polyoxyethylene sorbitol mono-oleate), or a condensation product of ethylene oxide with a partial ester derived from fatty acid and a hexitol anhydride (e.g., polyoxyethylene sorbitan mono-oleate). The aqueous suspension can also contain one or more preservatives such as ethyl or n-propyl p-hydroxybenzoate, one or more coloring agents, one or more flavoring agents and one or more sweetening agents, such as sucrose, aspartame or saccharin. Formulations can be adjusted for osmolarity.
In some embodiments, oil-based pharmaceuticals are used for administration of nucleic acid sequences as described herein. As an example of an injectable oil vehicle, see Minto (1997) J. Pharmacol. Exp. Ther. 281:93-102.
Pharmaceutical compositions can also be in the form of oil-in-water emulsions. The oily phase can be a vegetable oil or a mineral oil, described above, or a mixture of these. Suitable emulsifying agents include naturally-occurring gums, such as gum acacia and gum tragacanth, naturally occurring phosphatides, such as soybean lecithin, esters or partial esters derived from fatty acids and hexitol anhydrides, such as sorbitan mono-oleate, and condensation products of these partial esters with ethylene oxide, such as polyoxyethylene sorbitan mono-oleate. The emulsion can also contain sweetening agents and flavoring agents, as in the formulation of syrups and elixirs. Such formulations can also contain a demulcent, a preservative, or a coloring agent. In alternative embodiments, these injectable oil-in-water emulsions of the invention comprise a paraffin oil, a sorbitan monooleate, an ethoxylated sorbitan monooleate and/or an ethoxylated sorbitan trioleate.
In some embodiments, the pharmaceutical compositions can also be delivered as microspheres for slow release in the body. For example, microspheres can be administered via intradermal injection of a drug that is slowly released subcutaneously, see Rao (1995) J. Biomater Sci. Polym. Ed. 7:623-645; as biodegradable and injectable gel formulations, see, e.g., Gao (1995) Pharm. Res. 12:857-863; or as microspheres for oral administration, see, e.g., Eyles (1997) J. Pharm. Pharmacol. 49:669-674.
In some embodiments, the pharmaceutical compositions can be parenterally administered, such as by intravenous (IV) administration or administration into a body cavity or lumen of an organ. These formulations can comprise a solution of active agent dissolved in a pharmaceutically acceptable carrier. Acceptable vehicles and solvents that can be employed are water and Ringer's solution, an isotonic sodium chloride. In addition, sterile fixed oils can be employed as a solvent or suspending medium. For this purpose any bland fixed oil can be employed including synthetic mono- or diglycerides. In addition, fatty acids such as oleic acid can likewise be used in the preparation of injectables. These solutions are sterile and generally free of undesirable matter. These formulations may be sterilized by conventional, well known sterilization techniques. The formulations may contain pharmaceutically acceptable auxiliary substances as required to approximate physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents, e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride, sodium lactate and the like. The concentration of active agent in these formulations can vary widely, and will be selected primarily based on fluid volumes, viscosities, body weight, and the like, in accordance with the particular mode of administration selected and the patient's needs. For IV administration, the formulation can be a sterile injectable preparation, such as a sterile injectable aqueous or oleaginous suspension. This suspension can be formulated using those suitable dispersing or wetting agents and suspending agents. The sterile injectable preparation can also be a suspension in a nontoxic parenterally-acceptable diluent or solvent, such as a solution of 1,3-butanediol. The administration can be by bolus or continuous infusion (e.g., substantially uninterrupted introduction into a blood vessel for a specified period of time).
In some embodiments, the pharmaceutical compounds and formulations can be lyophilized. Stable lyophilized formulations comprising an inhibitory nucleic acid can be made by lyophilizing a solution comprising a pharmaceutical of the invention and a bulking agent, e.g., mannitol, trehalose, raffinose, and sucrose or mixtures thereof. A process for preparing a stable lyophilized formulation can include lyophilizing a solution of about 2.5 mg/mL protein, about 15 mg/mL sucrose, about 19 mg/mL NaCl, and a sodium citrate buffer having a pH greater than 5.5 but less than 6.5. See, e.g., U.S. 20040028670.
The compositions and formulations can be delivered by the use of liposomes. By using liposomes, particularly where the liposome surface carries ligands specific for target cells, or are otherwise preferentially directed to a specific organ, one can focus the delivery of the active agent into target cells in vivo. See, e.g., U.S. Patent Nos. 6,063,400; 6,007,839; Al-Muhammed (1996) J. Microencapsul. 13:293-306; Chonn (1995) Curr. Opin. Biotechnol. 6:698-708; Ostro (1989) Am. J. Hosp. Pharm. 46:1576-1587. As used in the present invention, the term “liposome” means a vesicle composed of amphiphilic lipids arranged in a bilayer or bilayers. Liposomes are unilamellar or multilamellar vesicles that have a membrane formed from a lipophilic material and an aqueous interior that contains the composition to be delivered. Cationic liposomes are positively charged liposomes that are believed to interact with negatively charged DNA molecules to form a stable complex. Liposomes that are pH-sensitive or negatively-charged are believed to entrap DNA rather than complex with it. Both cationic and noncationic liposomes have been used to deliver DNA to cells.
Liposomes can also include “sterically stabilized” liposomes, i.e., liposomes comprising one or more specialized lipids. When incorporated into liposomes, these specialized lipids result in liposomes with enhanced circulation lifetimes relative to liposomes lacking such specialized lipids. Examples of sterically stabilized liposomes are those in which part of the vesicle-forming lipid portion of the liposome comprises one or more glycolipids or is derivatized with one or more hydrophilic polymers, such as a polyethylene glycol (PEG) moiety. Liposomes and their uses are further described in U.S. Pat. No. 6,287,860.
Compositions disclosed herein can be administered for prophylactic and/or therapeutic treatments. In some embodiments, for therapeutic applications, compositions are administered to a subject who is infected or at risk of infection with SARS-CoV-2, in an amount sufficient to cure, alleviate or partially arrest the clinical manifestations of the disorder or its complications; this can be called a therapeutically effective amount. For example, in some embodiments, pharmaceutical compositions of the invention are administered in an amount sufficient to decrease the number of lung cells infected with SARS-CoV-2.
The inhibitory nucleic acids used to practice the methods described herein can be isolated from a variety of sources, genetically engineered, amplified, and/or expressed/generated recombinantly. Recombinant nucleic acid sequences can be individually isolated or cloned and tested for a desired activity. Any recombinant expression system can be used, including e.g. in vitro, bacterial, fungal, mammalian, yeast, insect, or plant cell expression systems.
Modulating gene expression of a target RNA
In some embodiments, a method of upregulating gene expression of a target RNA can include delivering an RNA recognition complex into a cell, wherein the RNA recognition complex comprises an RNA-targeting agent and a coronavirus-derived protein, and wherein the RNA recognition complex binds to the target RNA and upregulates gene expression of the target RNA in the cell.
In some embodiments, a method of modulating gene expression of a target RNA can include delivering an RNA recognition complex into a cell, wherein the RNA recognition complex comprises an RNA-targeting agent and a coronavirus-derived protein, and wherein the RNA recognition complex binds to the target RNA and modulates gene expression of the target RNA in the cell.
In some embodiments, the RNA recognition complex is present in a delivery system. In some embodiments, the delivery system comprises a delivery vehicle selected from the group consisting of an adeno-associated virus, a nanoparticle, and a liposome.
In some embodiments, the RNA recognition complex can be introduced into any cell, e.g., a mammalian cell. Non-limiting examples of a mammalian cell include: a human cell, a rodent cell (e.g., a rat cell or a mouse cell), a rabbit cell, a dog cell, a cat cell, a porcine cell, or a non-human primate cell. In some embodiments, the RNA recognition complex can be delivered into the cytoplasm of a cell. In some embodiments, the RNA recognition complex can be delivered into the cell by chemical transfection, non-chemical transfection, particle-based transfection, or viral transfection. In some embodiments, the RNA recognition complex can be delivered with a transfection reagent. In some embodiments, the transfection reagent can be lipofectamine. In some embodiments, the transfection reagent can be FuGENE transfection reagent.
In some embodiments, the method further includes profiling the gene expression of the target RNA in the cell, wherein the gene expression is upregulated. In some embodiments, a target RNA, through an RNA-targeting agent’s association with a coronavirus-derived protein, drives upregulation of the target RNA within a cell. In some embodiments, the coronavirus-derived protein comprises an NSP1, an NSP2, an NSP3, an NSP6, an NSP12, an NSP14, an ORF3b, an ORF7b, or an ORF9c protein.
In some embodiments, the method further includes profiling the gene expression of the target RNA in the cell, wherein the gene expression is downregulated. In some embodiments, a target RNA, through an RNA-targeting agent’s association with a coronavirus-derived protein, drives downregulation of the target RNA within a cell. In some embodiments, the coronavirus-derived protein comprises an NSP9 protein.
As used herein, “profiling” can refer to the measurement of activity (e.g., expression) of one or more genes, to create a global picture of cellular function. In some embodiments, profiling includes sequencing of a nucleic acid (e.g., DNA or RNA), wherein the gene expression profile includes information of active translation at a point in time. In some embodiments, the profiling comprises transcriptome analysis or gene expression analysis. In some embodiments, the profiling comprises enhanced cross-linking immunoprecipitation (eCLIP). As used herein, “enhanced crosslinking and immunoprecipitation (eCLIP)” refers to a method to profile RNAs bound by an RNA binding protein of interest. In some embodiments, eCLIP can be modified and used to profile RNAs bound by specific ribosomal subunit proteins. In some embodiments, enhanced crosslinking and immunoprecipitation (eCLIP) recovers protein-coding mRNAs (with a particular enrichment for coding sequence regions).
As used herein, “immunoprecipitation” is the technique of precipitating a protein antigen out of solution using an antibody that specifically binds to that particular protein. In some embodiments, the solution containing the protein antigen is in the form of a crude lysate of an animal tissue. Immunoprecipitation can be used to isolate and concentrate a particular protein from a sample containing many different proteins. Also, this technique requires that the antibody be coupled to a solid substrate (e.g., immunoprecipitation beads) while performing the procedure. Existing crosslinking and immunoprecipitation (CLIP) methods also identify RNA nucleotides that bind proteins of interest, but typically deliver regions up to hundreds of nucleotides in length that are the approximate binding sites of the given protein. Enhanced crosslinking and immunoprecipitation (eCLIP) is a method to profile RNAs bound by an RNA binding protein of interest.
Methods of Treating
In some embodiments, a method of treating a disease of reduced gene expression in a subject in need thereof can include administering an RNA recognition complex to the subject, wherein the RNA recognition complex comprises an RNA-targeting agent and a coronavirus-derived protein, and wherein the RNA recognition complex binds to the target RNA and upregulates gene expression of the target RNA in the cell.
EXAMPLES
The disclosure is further described in the following examples, which do not limit the scope of the disclosure described in the claims.
Example 1 - eCLIP elucidates SARS-CoV-2 protein-RNA interactions in virus infected cells
To investigate the RNA interactome of SARS-CoV-2 proteins, eCLIP was performed on SARS-CoV-2 infected African Green Monkey kidney (Vero E6) cells (Fig. 1a). Cells were infected at a multiplicity of infection (MOI) of 0.01 for 48 hours before UV irradiation of the cells to covalently crosslink interacting proteins to RNAs. This was followed by immunoprecipitation of the NSP8, NSP12 (also known as the RNA dependent RNA polymerase) and N (nucleocapsid) proteins using protein-specific antibodies to isolate the bound RNA. The RNA-bound proteins were resolved via SDS-PAGE and transferred to nitrocellulose membranes such that only the region spanning the expected protein size and 75 kDa larger was excised and purified in subsequent steps. The same size region of a non-immunoprecipitated input whole cell lysate was included as size-matched input to identify enriched sequences. RNA was converted to libraries and sequenced to an average depth of ~25 million reads, and mapped to the SARS-CoV-2 viral genome and African Green Monkey genome to determine SARS-CoV-2 protein RNA interactions. Targeted transcripts were determined by having one or more peaks that meet the stringent IDR (irreproducible discovery rate) threshold of overlapping peaks between two replicates for every protein, and satisfy statistical cutoffs of p<0.001, and more than 8-fold enrichment in the immunoprecipitated sample (IP) over the size-matched input sample.
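The statistical cutoffs described above (p<0.001 and more than 8-fold enrichment of IP over size-matched input) can be sketched as a simple filter. This is an illustrative sketch only: the `Peak` record, the function names and the example values are hypothetical, and the IDR reproducibility step between replicates is assumed to have been applied upstream and is not implemented here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Peak:
    gene: str
    p_value: float
    fold_enrichment: float  # IP signal divided by size-matched input signal


def significant_peaks(peaks, max_p=1e-3, min_fold=8.0):
    """Keep peaks with p < max_p and enrichment strictly greater than min_fold."""
    return [p for p in peaks if p.p_value < max_p and p.fold_enrichment > min_fold]


def targeted_genes(peaks, max_p=1e-3, min_fold=8.0):
    """A gene counts as targeted if it carries at least one significant peak."""
    return {p.gene for p in significant_peaks(peaks, max_p, min_fold)}


# Hypothetical peaks, for illustration only.
example_peaks = [
    Peak("GENE_A", 1e-5, 12.0),  # passes both cutoffs
    Peak("GENE_A", 1e-2, 20.0),  # fails the p-value cutoff
    Peak("GENE_B", 1e-4, 6.0),   # fails the enrichment cutoff
    Peak("GENE_C", 5e-4, 8.5),   # passes both cutoffs
]
targets = targeted_genes(example_peaks)
```

Passing `min_fold=4.0` would give a more permissive 4-fold variant of the same filter.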
It was found that NSP8, NSP12 and N interact with 457, 703 and 24 genes with 658, 1457 and 39 significant peaks, respectively (Fig. 1b). RNA reads, measured in Transcripts Per Kilobase Million (TPM), from both the NSP8 and NSP12 immunoprecipitation (IP) samples mapped more frequently to host transcripts than to viral RNA (Fig. 1c). In contrast, a majority of N immunoprecipitated RNA reads were mapped to viral RNA, consistent with its role in enclosing the viral genome during virion assembly. All three proteins bound to viral RNA with peaks that were highly statistically significant (p-values < 10^-400), although the large number of peaks (2137) that map to the host genes suggests a potential role in their regulation (Fig. 1d).
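For reference, a TPM value of the kind used above is conventionally obtained by dividing each transcript's read count by its length in kilobases and rescaling so that the values sum to one million. The sketch below follows that conventional definition; the function name, transcript labels and counts are hypothetical, and the normalization actually used to produce the figures is not specified in this disclosure.

```python
def tpm(read_counts: dict, lengths_bp: dict) -> dict:
    """Conventional TPM: length-normalize the counts, then scale to sum to 1e6."""
    # Reads per kilobase of transcript.
    rpk = {t: read_counts[t] / (lengths_bp[t] / 1000.0) for t in read_counts}
    total = sum(rpk.values())
    if total == 0:
        raise ValueError("no reads to normalize")
    return {t: value / total * 1e6 for t, value in rpk.items()}


# Hypothetical counts: equal read density on a host and a viral transcript.
counts = {"host_transcript": 300, "viral_rna": 100}
lengths = {"host_transcript": 3000, "viral_rna": 1000}
values = tpm(counts, lengths)
```

Because the host transcript here is three times longer and carries three times as many reads, both transcripts receive the same TPM, which illustrates why length normalization matters when comparing host and viral read fractions.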
The eCLIP results provide the first viral RNA genome map of interactions with NSP8, NSP12 and N proteins. We observed strong NSP8 and NSP12 eCLIP peaks at the 5' untranslated region (UTR) and 3' UTR of both positive and negative strand viral transcripts (Fig. 1e). This is consistent with the role of replicase proteins NSP8 and NSP12 in viral genome replication. Furthermore, NSP12 enrichment was seen on the negative strand at all transcription-regulatory sequences (TRSs) of the viral genome, implying that it may play a role in the transcription of subgenomic RNAs, which results in the expression of accessory protein products. However, no enrichment of eCLIP reads was observed on the positive sense strand for TRSs in eCLIP of NSP12, and NSP8 and N eCLIP reads were not enriched at the TRSs of either strand. In fact, very few distinct peaks were identified from the eCLIP results of N, as the eCLIP reads were nonspecifically distributed across the genome, indistinguishable from the input sample (Fig. 1e). This is consistent with the nucleocapsid encapsulating the entire viral genome in the packaged viral particles.
Unexpectedly, a distinct NSP12 eCLIP peak at the region around position 7450 - 7550 in the positive sense strand was observed, near the 3' end of the gene encoding NSP3. Upon closer inspection, the eCLIP read density showed a sharp drop in reads at position 7481 on both strands, which may correspond to reverse transcription termination during eCLIP library preparation at a UV crosslinking site (Fig. 1f). Within this region, the sequence at position 7470-7510 is predicted by RNA secondary structure prediction to form a stable hairpin. The high read density in the hairpin region suggests a potential stalling of NSP12 polymerase elongation, which may result in aborted transcripts. In support of this hypothesis, RNA-seq of A549-ACE2 cells infected with SARS-CoV-2 was performed and a steep decrease in transcript read density at the site of the NSP12 eCLIP peak was observed (Fig. 1g). Aborted transcripts were also confirmed in a direct RNA-sequencing study using Oxford Nanopore sequencing (ref. 18). Furthermore, some of these aborted transcripts join up with the downstream sequences, forming deletion products.
Polymerase stalling may play a role in generating genetic diversity of viruses via recombination, which has been shown to contribute to the evolution of SARS-CoV-2. To determine the likelihood of recombination across the viral genome, a multiple sequence alignment and phylogenetic analysis of the reference sequences of the complete genomes of betacoronaviruses from NCBI and the complete genomes of bat and pangolin coronaviruses from GISAID was performed (Fig. 1h). The multiple sequence analysis shows the peak region sequence to be highly conserved among the analyzed betacoronavirus sequences. The hairpin structure also appears conserved among bat and pangolin sequences. Recombination breakpoints are predicted from this sequence alignment, using a pairwise scanning approach that identifies regions with greater similarity among phylogenetically distant sequences. The prediction found a likely breakpoint ~250 nt downstream of the peak in region 7450-7550. This breakpoint was predicted to be a recombination event between SARS-CoV-2 and the Tylonycteris bat coronavirus HKU4 (Fig. 1i). While there are several other breakpoints that did not coincide with NSP12 eCLIP peaks, the presence of the NSP12 eCLIP peak in the 7470 - 7510 region proximal to a potential recombination site suggests a possible contribution to recombination in ancestral sequences of SARS-CoV-2. In addition to a high degree of sequence conservation, the RNA secondary structure appears conserved in the region containing the 7470 - 7510 peak among the closely related pangolin and bat betacoronaviruses, suggesting a potential function associated with NSP12 binding to this region that may be important for virus replication.
Taken together, the first eCLIP data showing the interaction of SARS-CoV-2 proteins NSP8, NSP12 and N bound to the viral genome is presented. These findings suggest that NSP12 may be involved in transcription stalling and contribute to viral genetic diversity via recombination. The large number of host RNAs bound by NSP12 prompted a systematic investigation of SARS-CoV-2 protein-host RNA interactions.
Example 2 - SARS-CoV-2 proteins interact with one third of the transcriptome in lung epithelial cells
To investigate whether SARS-CoV-2 proteins directly interact with the human host transcriptome, eCLIP was performed on the 29 proteins encoded in the SARS-CoV-2 genome and one mutant (Fig. 2a). Due to the lack of antibodies specific for most of the viral proteins, the individual proteins were overexpressed in a lung epithelial cell line BEAS-2B, which is an immortalized primary bronchial cell line representative of normal lung physiology. Each protein was either fused with a 2xStrep tag and expressed stably via lentiviral transduction or fused with a 3xFLAG tag and expressed transiently via transfection. Following UV crosslinking, the tagged proteins were immunoprecipitated using anti-FLAG or anti-Strep antibodies.
From the SARS-CoV-2 proteome-wide eCLIP results, SARS-CoV-2 proteins interacted with RNA represented by 4,821 coding genes, which is about a third of the transcriptome of BEAS-2B cells. Nucleocapsid and non-structural proteins NSP2, NSP3, NSP5, NSP9 and NSP12 were found to target the greatest number of unique genes at 1339, 1647, 1199, 902, 863, and 865, respectively (Fig. 2b). The large number of genes targeted by the viral proteins is consistent with the non-structural proteins from the replicase (ORF1ab) having a high affinity for the virus's own RNA, though their potential for widespread interaction with host RNA has not been shown previously. The widespread interaction of Nucleocapsid with host RNAs when expressed in isolation is consistent with its capacity for nonspecific RNA binding, whereas its targeting of the virus genome during RNA assembly occurs via interaction with the M protein. For comparison, the extensively studied splicing factor RBFOX2 binds to 958 genes in HepG2 cells and 471 genes in K562 cells, the stress granule assembly factor G3BP1 binds to 561 genes in HepG2 cells, and the histone RNA hairpin-binding protein SLBP binds to 19 genes in K562 cells (Fig. 2b). This suggests that viral proteins have the same capacity for interacting with RNA as endogenous human RBPs. Most of the target genes (400/518) in the NSP12 eCLIP in virus-infected Vero E6 cells are represented in the eCLIP assay from exogenous expression in the BEAS-2B cells (Fig. 2c). Only transcripts that are expressed at a TPM of >1.0 in both cell lines are used in this comparison, and target genes are considered if bound by one or more peaks that satisfy statistical cutoffs of -log10(p-value) > 3 and more than 4-fold enrichment over size-matched input. This suggests that NSP12-bound genes are similar in the context of the virus-infected cells and in the context of NSP12 expressed in isolation.
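The target-gene comparison described above can be sketched in a few lines. This is a minimal illustration only, assuming each peak is available as a (gene, p-value, fold enrichment) record and that per-gene TPM values are held in dictionaries; the names `Peak`, `target_genes` and `overlap_fraction` are hypothetical and are not taken from the eCLIP pipeline.

```python
import math
from typing import Dict, Iterable, NamedTuple, Set


class Peak(NamedTuple):
    gene: str
    p_value: float          # peak p-value versus size-matched input
    fold_enrichment: float  # IP signal over size-matched input


def target_genes(peaks: Iterable[Peak],
                 tpm_a: Dict[str, float],
                 tpm_b: Dict[str, float],
                 min_neg_log10_p: float = 3.0,
                 min_fold: float = 4.0,
                 min_tpm: float = 1.0) -> Set[str]:
    """Genes bound by at least one peak passing both cutoffs, restricted to
    genes expressed above min_tpm in both cell lines."""
    genes = set()
    for peak in peaks:
        expressed = (tpm_a.get(peak.gene, 0.0) > min_tpm
                     and tpm_b.get(peak.gene, 0.0) > min_tpm)
        # A p-value of exactly zero is treated as infinitely significant.
        neg_log10_p = math.inf if peak.p_value == 0 else -math.log10(peak.p_value)
        if expressed and neg_log10_p > min_neg_log10_p and peak.fold_enrichment > min_fold:
            genes.add(peak.gene)
    return genes


def overlap_fraction(reference: Set[str], other: Set[str]) -> float:
    """Fraction of the reference target genes that are also found in other."""
    return len(reference & other) / len(reference)
```

With the reported counts, 400 of 518 genes corresponds to an overlap of roughly 77%.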
Distinct processes related to viral replication and host response are targeted by the viral proteins as shown by gene ontology (GO) analysis (Fig. 2d). Many of the enriched GO terms are related to nucleic acid and protein synthesis, modification and transport, which is consistent with the primary objective of the virus hijacking host resources for its own biosynthesis and replication. A few stress response processes are enriched, including response to heat, as targeted by ORF7b. Immune response processes are also enriched, including neutrophil mediated immunity targeted by NSP12 and platelet degranulation targeted by ORF9c. This supports the choice of lung epithelial cells as a model system that expresses the relevant cytokines for recruiting immune cells. In addition to immune response, ciliary basal body plasma membrane docking genes are enriched, which may be related to ciliated lung cells as the site of viral entry. While the enriched GO terms are highly relevant to viral and host response processes, further analysis of binding patterns is needed to determine if there are any functional implications of viral proteins interacting with these genes.
To determine if there are sequence features that the viral proteins recognize, sequence logos were generated from 6-mers of the bound RNA reads. While some of the proteins display strong sequence preferences (Fig. 2d), other proteins appear to bind more non-specifically. Some motifs resemble enrichments observed for human RBPs, where M, ORF7a and NSP10 appear to favor G-rich or GU-rich motifs, and NSP5 has a motif (GNAUG). Other motifs may result from regional binding preferences (Fig. 2e), as NSP2 and NSP9 have a strong preference for UC-rich polypyrimidine motifs (p-values of 10^-96 and 10^-41, respectively), which may be a result of their binding to polypyrimidine tracts in intronic regions, whereas N has an AU-rich motif, likely because it preferentially binds to 3' UTRs, which contain AU-rich elements. NSP3, a large multifunctional protein, appears to coat entire transcripts and may not have a meaningful sequence motif. NSP12 primarily binds in the 5' UTR, and a weakly enriched GUCCCG motif that resembles terminal oligopyrimidine (TOP) motifs hints at a possible role in translation regulation.
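As a rough illustration of the first step behind such a motif analysis, the sketch below counts 6-mers in a set of read sequences. It is not the motif-finding method used for the figures, which is not specified here; the function name `count_kmers` and the choice to convert T to U and to skip windows containing ambiguous bases are assumptions made for this example.

```python
from collections import Counter
from typing import Iterable


def count_kmers(reads: Iterable[str], k: int = 6) -> Counter:
    """Count every k-mer across the reads, written in the RNA alphabet."""
    counts = Counter()
    for read in reads:
        seq = read.upper().replace("T", "U")
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            # Skip windows containing ambiguous bases such as N.
            if set(kmer) <= set("ACGU"):
                counts[kmer] += 1
    return counts
```

Calling `count_kmers(reads).most_common(10)` on bound reads and on size-matched input reads gives two ranked lists whose comparison is the starting point for an enrichment calculation.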
The systematic interrogation of SARS-CoV-2 protein-host RNA interactions demonstrates that a majority of SARS-CoV-2 viral proteins are RNA binding proteins that together target about a third of the human transcriptome as expressed in BEAS-2B cells. The analysis implies that these viral proteins may be involved in perturbing many essential cellular processes of the host. In addition, SARS-CoV-2 protein-specific antibodies enabled confirmation of the large number of interactions between viral proteins NSP12 and NSP8 and host RNAs in the context of the intact and live virus. As eCLIP in virus-infected cells is limited by the availability of IP-grade antibodies, focus was placed on the data obtained from the exogenous expression of individual proteins in BEAS-2B cells for systematic analysis of potential functional implications.
Example 3 - Select SARS-CoV-2 proteins upregulate protein expression of target transcripts
By examining the regional binding preferences of each SARS-CoV-2 protein, it was found that SARS-CoV-2 proteins are enriched at distinct regions of target mRNAs, which implies different regulatory functions arising from the protein-RNA interactions. Aggregating the analysis of all targeted peaks for each SARS-CoV-2 protein identifies RNA regions that are preferentially bound (Fig. 3a). Of note, NSP12, ORF3b, ORF7b and ORF9c show the highest proportion of peaks in the 5' UTR; NSP2, NSP3, NSP6 and NSP14 show the highest proportion of peaks in the coding region (CDS); NSP5, NSP7 and NSP9 display a high proportion of peaks in intronic regions; and N and NSP15 show the largest proportion of peaks in the 3' UTR. A finer-grained metagene analysis of read density across all target mRNA transcripts was also performed, where each of the 5' UTR, CDS and 3' UTR regions in an mRNA is scaled to a standardized length (Fig. 3b). It was found that even though NSP2 has a similar number and proportion of peaks in the CDS as NSP3, it mainly targets the region spanning the 5' UTR and coding start. In contrast, NSP3 reads, along with those of NSP6 and NSP14, coat the entire CDS, with a slight bias towards the start of the coding sequence.
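The region scaling used in a metagene analysis can be sketched as follows. This is an illustrative sketch under stated assumptions, not the analysis code behind Fig. 3b: positions are taken to be 0-based transcript coordinates, and the bin counts per region (10, 50 and 20) and the function names are hypothetical.

```python
from typing import Iterable, List, Sequence


def metagene_bin(pos: int,
                 region_lengths: Sequence[int],
                 bins_per_region: Sequence[int]) -> int:
    """Map a 0-based transcript position to a bin on a standardized transcript.

    region_lengths gives the 5' UTR, CDS and 3' UTR lengths of one transcript;
    bins_per_region gives the fixed number of bins assigned to each region.
    """
    region_start = 0
    bin_offset = 0
    for length, n_bins in zip(region_lengths, bins_per_region):
        if region_start <= pos < region_start + length:
            # Integer arithmetic keeps the result strictly below n_bins.
            return bin_offset + (pos - region_start) * n_bins // length
        region_start += length
        bin_offset += n_bins
    raise ValueError("position lies outside the transcript")


def metagene_density(positions: Iterable[int],
                     region_lengths: Sequence[int],
                     bins_per_region: Sequence[int] = (10, 50, 20)) -> List[int]:
    """Accumulate read positions from one transcript into standardized bins."""
    density = [0] * sum(bins_per_region)
    for pos in positions:
        density[metagene_bin(pos, region_lengths, bins_per_region)] += 1
    return density
```

Summing the per-transcript densities over all target transcripts of one protein gives a profile in which the 5' UTR, CDS and 3' UTR always occupy the same bins, regardless of their lengths in individual mRNAs.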
Since 8 of the SARS-CoV-2 proteins - NSP2, NSP3, NSP6, NSP12, NSP14, ORF3b, ORF7b and ORF9c - have binding preferences at the 5' UTR and CDS, it was hypothesized that their protein-RNA interactions could affect expression of the target mRNAs at the level of RNA turnover or translation. To evaluate the functional role of the specific protein-RNA interactions of SARS-CoV-2 proteins and target transcripts, 14 of the proteins were characterized using tethered function reporter assays (Fig. 3c). The individual proteins were fused with an MS2 phage coat protein (MCP), which localizes the tagged protein to MS2 aptamer hairpins inserted in the 3' UTR of Renilla luciferase. A firefly luciferase without MS2 hairpins is included as a control for non-specific effects of the viral protein. Plasmids encoding the MCP-tagged proteins and reporter constructs are co-transfected into HEK293T cells. Changes in Renilla luciferase activity normalized to firefly luciferase activity measure up- or downregulation of protein expression, via either translation or mRNA stability, that results from positioning the MCP-tagged protein in the vicinity of the Renilla mRNA. The luciferase readout does not by itself distinguish between translational and mRNA-stabilizing effects.
From the tethering experiments, it was found that the ratio of Renilla-MS2 to firefly luciferase for 9 of the 14 SARS-CoV-2 proteins increased 1.9-fold (NSP6) to 3.5-fold (ORF9c) relative to the FLAG-MCP control (p-value < 0.002, two-tailed multiple t-test) (Fig. 3d). Interestingly, these SARS-CoV-2 proteins display a stronger effect on mRNA translation than the tethering of BOLL (1.5-fold), which is a human RBP previously characterized to be amongst the strongest upregulators from a screen of more than 700 human RBPs. Even though NSP1 was found to bind to very few host mRNAs and its peaks are not mapped to the 5' UTR and CDS, the results for NSP1 are consistent with its ability to enhance the transcription and translation of its own mRNA via interacting with the 5' UTR of the genomic viral mRNA. Of the remaining 5 SARS-CoV-2 proteins, only NSP5, NSP16 and N display slight (but not significant) down-regulation effects (0.73-fold to 0.58-fold) compared to the FLAG peptide control, but to a lesser extent than that of the known translation repressor CNOT7 (0.16-fold). NSP7 and NSP9 appear to have no effect on the targeted expression of the Renilla reporter. To understand if the upregulation is occurring at the RNA or protein level, RT-qPCR was performed to measure the ratio of Renilla-MS2 to firefly mRNAs. For all the enhancing proteins except for NSP2, the Renilla-MS2/firefly mRNA ratio is significantly increased (p < 0.05) compared to wildtype, albeit to different extents for different proteins (Fig. 3e). Of note, ORF9c shows the greatest enhancing effect (3.5-fold) in the dual luciferase assay, but its effect on the reporter RNAs is middling (1.5-fold). Taking the ratio of the fold change in luciferase activity ratio to the fold change in RNA ratio, ORF9c displays the greatest extent of upregulation at the protein level compared to RNA (2.3-fold) (Fig. 3f), followed by NSP2 and ORF3b (1.6- and 1.7-fold, respectively). The rest of the proteins range from 1.1-fold (NSP6) to 1.5-fold (NSP14), compared to 1.0-fold for BOLL, suggesting that upregulation likely occurs at both the RNA and protein level.
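The arithmetic behind these fold changes can be made explicit with a short sketch. It assumes replicate readings are supplied as (Renilla, firefly) pairs and that replicates are averaged before comparison to the control; the function names are hypothetical and the replicate handling is an assumption, not a description of the analysis actually performed.

```python
from typing import Sequence, Tuple

Reading = Tuple[float, float]  # (Renilla-MS2 signal, firefly signal)


def mean_reporter_ratio(readings: Sequence[Reading]) -> float:
    """Mean Renilla-MS2 / firefly ratio over replicate wells."""
    ratios = [renilla / firefly for renilla, firefly in readings]
    return sum(ratios) / len(ratios)


def fold_change(sample: Sequence[Reading], control: Sequence[Reading]) -> float:
    """Normalized reporter ratio of the tethered protein relative to control."""
    return mean_reporter_ratio(sample) / mean_reporter_ratio(control)


def protein_over_rna(luciferase_fold: float, rna_fold: float) -> float:
    """Part of the luciferase fold change not explained by the RNA fold change."""
    return luciferase_fold / rna_fold
```

For ORF9c, `protein_over_rna(3.5, 1.5)` evaluates to about 2.33, which matches the reported 2.3-fold upregulation at the protein level relative to RNA.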
To understand the origin of the increase in mRNA translation, eCLIP reads were mapped to the 18S and 28S ribosomal RNAs (rRNAs) to determine if there are any specific interactions with the ribosome. Fold enrichment was determined directly by comparing read coverage in IP to size-matched input. It was found that enrichment peaks (>5-fold) of NSP1 reads are mostly mapped to the mRNA entry channel of the 40S ribosomal subunit, corresponding to helix 16 (peak 2) and helix 18 (peak 3) of 18S rRNA, which is consistent with several cryo-EM structures showing that NSP1 blocks the mRNA entry channel to inhibit host translation (Fig. 3g). In addition, an NSP1-binding peak was also observed mapped to helix 26/26a (peak 4) of 18S rRNA, a location important for hepatitis C viral internal ribosome entry site (IRES) element binding to the ribosome. This provides further evidence that the function of NSP1 is not only to block host translation, but that it may also be involved in the regulation of viral RNA translation through mediating the interaction of the SARS-CoV-2 5' UTR/IRES with the ribosome. The impact of NSP1 enrichment at helix 10 (peak 1), an exposed flexible region of 18S rRNA, is unclear.
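A minimal sketch of calling enriched rRNA regions from per-position coverage is given below. It assumes the IP and size-matched input coverage tracks have already been normalized for sequencing depth and adds a pseudocount to avoid division by zero; both choices, and the function name, are assumptions for illustration rather than details of the analysis described above.

```python
from typing import List, Sequence, Tuple


def enriched_regions(ip_cov: Sequence[float],
                     input_cov: Sequence[float],
                     min_fold: float = 5.0,
                     pseudocount: float = 1.0) -> List[Tuple[int, int]]:
    """Return half-open (start, end) runs where IP/input exceeds min_fold."""
    if len(ip_cov) != len(input_cov):
        raise ValueError("coverage tracks must have the same length")
    regions = []
    start = None
    for i, (ip, inp) in enumerate(zip(ip_cov, input_cov)):
        fold = (ip + pseudocount) / (inp + pseudocount)
        if fold > min_fold:
            if start is None:
                start = i          # open a new enriched run
        elif start is not None:
            regions.append((start, i))  # close the current run
            start = None
    if start is not None:
        regions.append((start, len(ip_cov)))
    return regions
```

The returned coordinates can then be compared against annotated helix positions of 18S or 28S rRNA to assign each run to a structural element.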
Unlike NSP1, ORF9c shows enrichment at both 28S and 18S rRNA. One of the major enriched regions of ORF9c on 28S rRNA is above the surface of the 60S ribosomal subunit. This region consists of two ORF9c binding peaks (28S peaks 1 and 2) that correspond to two helices, which are connected by their interactions with RPL4 and interact with RPL27a and RPL7, respectively. RPL4 has been shown to interact with RPL7 and to protrude further into the core of the 60S subunit and associate with the peptide exit tunnel. The other major region of ORF9c binding to the ribosome is at the intersubunit interface, which comprises helix H63/ES27 (28S peak 3) of 28S rRNA and two helices, helix 10 (18S peak 2) and helix 44 (18S peak 5), of 18S rRNA. These helices interact with RPL19, RPL24, RPS6, and RPS8, and have been shown to contribute to establishing eukaryote-specific intersubunit bridges. The interactions of ORF9c at the above two regions suggest that ORF9c may play a role in joining the two ribosomal subunits to optimize ribosome function. The last ORF9c binding region is around the mRNA entry channel of 18S rRNA, corresponding to helix 16 (18S peak 3) and two nearby helices, helix 1 (18S peak 1) and helix 26/26a (18S peak 4). Due to the relatively small size of ORF9c, its binding at helix 16 suggests it may play a role in regulating translation initiation by altering the position of helix 16. The metagene density plot for ORF9c shows binding mainly in the 5' UTR of target mRNAs. By stabilizing the ribosomal complex, ORF9c may enhance translation efficiency of its target mRNAs at the start of translation. In addition, the binding of ORF9c at helix 1 and helix 26/26a implies it may mediate the interaction of the SARS-CoV-2 5' UTR/IRES with the host ribosome. Taken together, the results indicate ORF9c may be involved in optimizing ribosome structure and regulating translation initiation.
As an orthogonal validation and further evaluation of whether there is any regional effect in binding and upregulation of protein expression, ORF9c was fused to RNA-targeting Cas9 (RCas9) and its effect on mRNA translation of a reporter substrate was assessed. It was previously shown that regional binding preferences were not captured by the MS2-tethering assay, as human RBPs that bind to all three regions were found to regulate the expression of the targeted reporter, which was brought into proximity. Using 7 guide RNAs that tiled across the mRNA encoding yellow fluorescent protein (YFP) (Table 1), it was found that RCas9-ORF9c fusions upregulated the expression of YFP mRNA when targeted to its 5' UTR. This regional preference is supported by the metagene read density analysis as well (Fig. 3b). Since most translational regulation occurs at the translation initiation step, where the translational machinery assembles at the 5' UTR, ORF9c targeting the 5' UTR of mRNAs suggests a potential role in upregulating the protein expression of target transcripts.
Taken together, these results suggest that SARS-CoV-2 proteins with a preference for binding to 5' UTR and CDS regions have a capacity for upregulating the expression of target mRNAs. The increase in ultimate translation output was due to effects at both the RNA stabilization level and the translation enhancing level. Mapping eCLIP reads of ORF9c to 18S and 28S rRNA implies a role in enhancing translation and redirecting translation to target mRNAs. [Table 1]
Example 4 - NSP12 upregulates genes in mitochondria and N-linked glycosylation processes
Based on the results of the two reporter assays, it was conjectured that SARS-CoV-2 proteins that bind to the 5' UTR and CDS of their target genes upregulate gene expression. eCLIP target genes were mapped to existing proteomics datasets from SARS-CoV-2 infected cells, and it was found that of the differentially expressed proteins (p < 0.05, 24 hours post infection), proteins that are eCLIP targets with IDR reproducible peaks are expressed at higher levels than the non-targeted genes (p < 10^-12 by Kolmogorov-Smirnov (KS) test) (Fig. 4a). NSP12 targeted genes also appear to be less downregulated (p < 10^-4, KS test) due to SARS-CoV-2 infection, with genes bound by more significant peaks showing a greater difference (p < 10^-5) (Fig. 4a). However, the opposite is observed in transcriptomics data from SARS-CoV-2 infected cells. eCLIP target genes show decreased RNA abundance (p < 10^-8), with NSP12 targeted genes appearing even more downregulated (p < 10^-27). This may need to be understood in the complex context of regulation and counter-regulation in viral-host relationships. There may be certain processes that are downregulated due to global transcription shutdown, but post-transcriptional upregulation as exerted by NSP12 may upregulate specific genes to the advantage of the virus.
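The two-sample Kolmogorov-Smirnov comparison used here measures the largest vertical gap between the empirical cumulative distributions of two groups, for example the expression changes of eCLIP target genes and of non-target genes. The sketch below computes only the KS statistic with the standard library; it does not compute the p-value, for which a statistics package such as SciPy (`scipy.stats.ks_2samp`) would normally be used. The function name is hypothetical.

```python
from bisect import bisect_right
from typing import Sequence


def ks_statistic(sample_a: Sequence[float], sample_b: Sequence[float]) -> float:
    """Two-sample KS statistic: maximum gap between the two empirical CDFs."""
    a = sorted(sample_a)
    b = sorted(sample_b)
    if not a or not b:
        raise ValueError("both samples must be non-empty")
    d = 0.0
    # Both empirical CDFs are step functions that change only at observed
    # values, so evaluating the gap at every observation is sufficient.
    for x in a + b:
        cdf_a = bisect_right(a, x) / len(a)
        cdf_b = bisect_right(b, x) / len(b)
        d = max(d, abs(cdf_a - cdf_b))
    return d
```

A statistic near 0 indicates nearly identical distributions, while a value of 1 means the two groups do not overlap at all.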
The GO processes enriched by the genes targeted by NSP12 include those related to neutrophil mediated immunity, mitochondrial processes (transport, translation elongation, ATP synthesis coupled electron transport), protein N-linked glycosylation and other cellular protein metabolic processes (Fig. 4c, d). Among these processes, NSP12 targeted mitochondrial transport genes are the most significantly upregulated (p < 0.03, KS test) compared to non-eCLIP target genes (Fig. 4e). To confirm whether individual genes in these pathways are upregulated by NSP12, genes from the top GO terms that are targeted by NSP12 in the 5' UTR region were selected, which are representative of the metagene profile for NSP12 (Fig. 3b, Fig. 3f). Among the N-linked glycosylation GO term genes, Ribophorin I (RPN1) is part of an N-oligosaccharyl transferase complex that links high mannose oligosaccharides to asparagine residues found in the Asn-X-Ser/Thr consensus motif of nascent polypeptide chains, and UDP-Glucose Glycoprotein Glucosyltransferase 1 (UGGT1) is a soluble protein of the endoplasmic reticulum (ER) that selectively reglucosylates unfolded glycoproteins. Represented in the mitochondrial ATP synthesis coupled electron transport and the respiratory electron transport chain GO processes, NDUFA4 is part of the enzyme cytochrome-c oxidase (or complex IV) and is important for its activity and biogenesis. NSP12 was exogenously introduced by transiently transfecting HEK293T cells, and by comparing to a control where a GFP plasmid was transfected, it was found by Western blotting that UGGT1, RPN1 and NDUFA4 are expressed at higher levels (Fig. 4g). In a human lung carcinoma cell line clonally overexpressing ACE2 (A549-ACE2), it was found by immunofluorescence that all three proteins appear induced in SARS-CoV-2 infected cells (stained for NSP8) (Fig. 4h, i), confirming the relevance of these induced genes to the actual viral infection.
Here, it was demonstrated that overexpression of NSP12, as well as SARS-CoV-2 virus infection in cells, enhances the expression of N-linked glycosylation related genes, UGGT1 and RPN1, and the mitochondrial cytochrome c oxidase subunit NDUFA4. Since N-linked glycosylation of host ACE2 receptor and virus Spike protein are important for their interactions and virus entry, the results suggest that the SARS-CoV-2 infection could activate the N-linked glycosylation pathway to facilitate the viral-host interaction and virus entry through NSP12. Upregulation of NDUFA4 by NSP12 may also imply a role in modulating mitochondrial bioenergetics during virus infection, as viral biogenesis depends on energy and metabolic resources provided by the host.
Example 5 - NSP9 associates with the nuclear pore to block mRNA export
Using affinity mass-spectrometry, it was shown that NSP9 interacts with several nuclear pore complex proteins, including NUP62, NUP214, NUP88, NUP54 and NUP58 (Fig. 5a). It was confirmed that NUP62 indeed co-immunoprecipitated with NSP9, which led to the hypothesis that NSP9 may interfere with mRNA export by associating with the nuclear pore (Fig. 5b). To determine if NSP9 inhibits mRNA export activity, the mRNA levels of NSP9 target genes in cytoplasmic and nuclear fractions were assayed. Both NSP9-expressing BEAS-2B cells and the parental (wild type) BEAS-2B cells were fractionated into nuclear and cytoplasmic fractions, followed by RNA extraction and RT-qPCR of target genes. NSP9 target genes were observed to have significant peaks near the 3' splice site, which may suggest interference with splicing-coupled export (Fig. 5c). It was found that target genes IL-1α, ANXA2 and UPP1 had lower cytosolic to total mRNA ratios in NSP9-expressing versus parental cells, whereas the cytosolic mRNA levels of non-targeted control genes MALAT1 and UBC were not significantly lowered (Fig. 5d). Even though nuclear RNA fractions were purified at high yields (>1 µg/µl), the RT-qPCR CT values of the target genes were too high (>25 cycles) for accurate quantification. Interleukin 1α (IL-1α) is an important inflammatory cytokine constitutively produced in epithelial cells and plays a central role in regulating immune responses, including being a master cytokine in acute lung inflammation induced by silica micro- and nanoparticles. Interleukin 1β (IL-1β) binds to the same IL1 receptor as IL-1α, and its mRNA is bound by NSP9 even though it does not pass the IDR threshold. To determine if NSP9 inhibiting the nucleocytoplasmic export of the mRNA of IL-1α has any impact on the production of this cytokine, an ELISA was performed on the growth media of BEAS-2B wild type and NSP9-expressing cells 48 hours after induction by several common cytokines. Interferon α, β and γ resulted in lowered IL-1α levels in NSP9 cells compared to wild type, though tumor necrosis factor alpha (TNFα) resulted in the greatest reduction (~30%) (Fig. 5e). The observation of reduced IL-1α production was reproduced at different concentrations of TNFα (Fig. 5f). In addition, it was observed that less IL-1β was produced in NSP9-expressing cells than in wild type BEAS-2B cells (Fig. 5g). Thus, NSP9 association with the nuclear pore complex proteins aligns with the observation of decreased cytoplasmic abundance of NSP9 target mRNAs, suggesting that NSP9 interaction may directly inhibit nuclear export. Further, NSP9 reduced the production of its target gene IL-1α, which suggests that the export inhibition mechanism may be a strategy that SARS-CoV-2 employs to dampen the inflammatory host response.
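One common way to turn RT-qPCR cycle thresholds into a cytosolic to total mRNA ratio is the delta-Ct approximation sketched below. The source does not state how the ratios were calculated, so this is an illustration only: it assumes perfect doubling per cycle and equal RNA input for both fractions, and the function names are hypothetical.

```python
def cytosolic_to_total_ratio(ct_cytosolic: float, ct_total: float) -> float:
    """Relative abundance of the cytosolic fraction versus total RNA.

    Assumes 100% amplification efficiency, so that one extra cycle
    corresponds to half as much starting template.
    """
    return 2.0 ** -(ct_cytosolic - ct_total)


def relative_export(ratio_nsp9: float, ratio_parental: float) -> float:
    """Cytosolic/total ratio in NSP9-expressing cells relative to parental cells."""
    return ratio_nsp9 / ratio_parental
```

A `relative_export` value below 1 for a target gene, alongside a value near 1 for a non-targeted control gene, would be the pattern expected if export of the target mRNA is reduced.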
Example 6 - SARS-CoV-2 protein-host RNA interactions identify potential therapeutic targets
As with many viruses, the host-viral interactions underlying SARS-CoV-2 infection are broadly understood in terms of the virus hijacking the host cell by globally shutting down the expression of host genes that are irrelevant or hostile to its replication, while the host attempts to fight off the virus by mounting apoptotic and inflammatory responses. To add to this understanding, it was proposed that viral proteins interact with host RNAs to activate a subset of host genes for the virus's own survival through targeted translation activation or mRNA stabilization (Fig. 6). It was shown that NSP12 specifically upregulates genes in the processes of protein N-linked glycosylation and mitochondrial ATP synthesis and transport. While it has been shown that NSP1 is a global repressor of host cell transcription and translation, it was also proposed that NSP9 contributes another layer to dampening host gene expression by inhibiting mRNA export. Understanding specifically upregulated processes and genes will enable the development of new antiviral strategies.
OTHER EMBODIMENTS
It is to be understood that while the invention has been described in conjunction with the detailed description thereof, the foregoing description is intended to illustrate and not limit the scope of the invention, which is defined by the scope of the appended claims. Other aspects, advantages, and modifications are within the scope of the following claims.
Claims
1. An RNA recognition complex comprising:
(a) an RNA-targeting agent; and
(b) a coronavirus-derived protein.
2. The RNA recognition complex of claim 1, further comprising a linker.
3. The RNA recognition complex of claim 1, wherein the RNA-targeting agent comprises CRISPR/Cas9 components.
4. The RNA recognition complex of any one of claims 1-3, wherein the RNA-targeting agent comprises an RNA-targeting Cas effector.
5. The RNA recognition complex of claim 4, wherein the RNA-targeting Cas effector comprises a Cas9 protein, a Cas13b protein, or a Cas13d protein.
6. The RNA recognition complex of claim 4, wherein the RNA-targeting Cas effector comprises a nuclease dead Cas9 (dCas9) protein.
7. The RNA recognition complex of claim 4, wherein the RNA-targeting Cas effector comprises a Cas13b protein.
8. The RNA recognition complex of claim 4, wherein the RNA-targeting Cas effector comprises a Cas13d protein.
9. The RNA recognition complex of claim 1, wherein the RNA-targeting agent comprises a PUF protein.
10. The RNA recognition complex of claim 1, wherein the RNA-targeting agent comprises a pentatricopeptide repeat (PPR) protein.
11. The RNA recognition complex of any one of claims 1-8, wherein the RNA-targeting agent further comprises a single guide RNA (sgRNA), wherein the sgRNA is targeted to an individual gene of a cell.
12. The RNA recognition complex of claim 11, wherein the sgRNA is selected from a group consisting of SEQ ID NOs: 1-7.
13. The RNA recognition complex of any one of claims 1-12, wherein the coronavirus-derived protein comprises a SARS-CoV-2 protein.
14. The RNA recognition complex of claim 13, wherein the coronavirus-derived protein comprises a NSP1, a NSP2, a NSP3, a NSP6, a NSP12, a NSP14, a ORF3b, a ORF7b, or a ORF9c protein.
15. A method of upregulating gene expression of a target RNA comprising: delivering a RNA recognition complex into a cell, wherein the RNA recognition complex comprises a RNA-targeting agent, and a coronavirus-derived protein, and wherein the RNA recognition complex binds to the target RNA and upregulates gene expression of the target RNA in the cell.
16. A method of modulating gene expression of a target RNA comprising: delivering a RNA recognition complex into a cell, wherein the RNA recognition complex comprises a RNA-targeting agent, and a coronavirus-derived protein, and wherein the RNA recognition complex binds to the target RNA and modulates gene expression of the target RNA in the cell.
17. The method of claim 15 or 16, wherein the method further comprises profiling the gene expression of the target RNA in the cell, wherein the gene expression is upregulated.
18. The method of any one of claims 15-17, wherein the coronavirus-derived protein comprises a SARS-CoV-2 protein.
19. The method of claim 18, wherein the coronavirus-derived protein comprises a NSP1, a NSP2, a NSP3, a NSP6, a NSP12, a NSP14, a ORF3b, a ORF7b, or a ORF9c protein.
20. The method of claim 16, wherein the method further comprises profiling the gene expression of the target RNA in the cell, wherein the gene expression is downregulated.
21. The method of claim 20, wherein the coronavirus-derived protein comprises a NSP9 protein.
22. The method of any one of claims 17-21, wherein the profiling comprises transcriptome analysis or gene expression analysis.
23. The method of any one of claims 17-22, wherein the profiling comprises enhanced cross-linking immunoprecipitation (eCLIP).
24. The method of any one of claims 15-23, wherein the RNA-targeting agent comprises CRISPR/Cas9 components.
25. The method of any one of claims 15-24, wherein the RNA-targeting agent comprises an RNA-targeting Cas effector.
26. The method of claim 25, wherein the RNA-targeting Cas effector comprises a Cas9 protein, a Cas13b protein, or a Cas13d protein.
27. The method of claim 25, wherein the RNA-targeting Cas effector comprises a nuclease dead Cas9 (dCas9) protein.
28. The method of claim 25, wherein the RNA-targeting Cas effector comprises a Cas13b protein.
29. The method of claim 25, wherein the RNA-targeting Cas effector comprises a Cas13d protein.
30. The method of any one of claims 15-23, wherein the RNA-targeting agent comprises a PUF protein.
31. The method of any one of claims 15-23, wherein the RNA-targeting agent comprises a pentatricopeptide repeat (PPR) protein.
32. The method of any one of claims 15-29, wherein the RNA-targeting agent further comprises a single guide RNA (sgRNA), wherein the sgRNA is targeted to the target RNA in the cell.
33. The method of claim 32, wherein the sgRNA is selected from a group consisting of SEQ ID NOs: 1-7.
34. A method of treating a disease associated with reduced gene expression in a subject in need thereof, the method comprising: administering a RNA recognition complex to the subject, wherein the RNA recognition complex comprises a RNA-targeting agent, and a coronavirus-derived protein, and wherein the RNA recognition complex binds to the target RNA and upregulates gene expression of the target RNA in the cell, thereby treating the disease associated with reduced gene expression.
35. The method of claim 34, wherein the RNA-targeting agent comprises CRISPR/Cas9 components.
36. The method of claim 34 or 35, wherein the RNA-targeting agent comprises an RNA-targeting Cas effector.
37. The method of claim 36, wherein the RNA-targeting Cas effector comprises a Cas9 protein, a Cas13b protein, or a Cas13d protein.
38. The method of claim 36, wherein the RNA-targeting Cas effector comprises a nuclease dead Cas9 (dCas9) protein.
39. The method of claim 36, wherein the RNA-targeting Cas effector comprises a Cas13b protein.
40. The method of claim 36, wherein the RNA-targeting Cas effector comprises a Cas13d protein.
41. The method of claim 34, wherein the RNA-targeting agent comprises a PUF protein.
42. The method of claim 34, wherein the RNA-targeting agent comprises a pentatricopeptide repeat (PPR) protein.
43. The method of any one of claims 34-40, wherein the RNA-targeting agent further comprises a single guide RNA (sgRNA), wherein the sgRNA is targeted to the target RNA in the cell.
44. The method of claim 43, wherein the sgRNA is selected from a group consisting of SEQ ID NOs: 1-7.
45. The method of any one of claims 34-44, wherein the coronavirus-derived protein comprises a SARS-CoV-2 protein.
46. The method of any one of claims 34-45, wherein the coronavirus-derived protein comprises a NSP1, a NSP2, a NSP3, a NSP6, a NSP12, a NSP14, a ORF3b, a ORF7b, or a ORF9c protein.
47. The method of any one of claims 34-46, wherein the RNA-targeting agent comprises a sequence which is complementary to a target RNA sequence.
48. The method of any one of claims 34-46, wherein the RNA-targeting agent complementary sequence is at least 98% complementary to a target RNA sequence.
49. The method of any one of claims 34-46, wherein the RNA-targeting agent complementary sequence is at least 95% complementary to a target RNA sequence.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202163195980P | 2021-06-02 | 2021-06-02 | |
US63/195,980 | 2021-06-02 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022256414A1 (en) | 2022-12-08 |
Family
ID=84324520
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2022/031780 WO2022256414A1 (en) | 2021-06-02 | 2022-06-01 | Rna recognition complex and uses thereof |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2022256414A1 (en) |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20200123569A1 (en) * | 2018-06-08 | 2020-04-23 | Locana, Inc. | Rna-targeting fusion protein compositions and methods for use |
Non-Patent Citations (1)
Title |
---|
BANERJEE ABHIK K.; BLANCO MARIO R.; BRUCE EMILY A.; HONSON DREW D.; CHEN LINLIN M.; CHOW AMY; BHAT PRASHANT; OLLIKAINEN NOAH; QUIN: "SARS-CoV-2 Disrupts Splicing, Translation, and Protein Trafficking to Suppress Host Defenses", CELL, ELSEVIER, AMSTERDAM NL, vol. 183, no. 5, 8 October 2020 (2020-10-08), Amsterdam NL , pages 1325, XP086368291, ISSN: 0092-8674, DOI: 10.1016/j.cell.2020.10.004 * |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 22816779; Country of ref document: EP; Kind code of ref document: A1 |
| NENP | Non-entry into the national phase | Ref country code: DE |