WO2023197015A1 - Compositions and analysis of dephosphorylated oligoribonucleotides - Google Patents
Compositions and analysis of dephosphorylated oligoribonucleotides Download PDFInfo
- Publication number
- WO2023197015A1 WO2023197015A1 PCT/US2023/065602 US2023065602W WO2023197015A1 WO 2023197015 A1 WO2023197015 A1 WO 2023197015A1 US 2023065602 W US2023065602 W US 2023065602W WO 2023197015 A1 WO2023197015 A1 WO 2023197015A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- rna
- substrate
- endoribonuclease
- cleavage
- rnase
- Prior art date
Links
- 239000000203 mixture Substances 0.000 title claims abstract description 140
- 238000004458 analytical method Methods 0.000 title claims abstract description 81
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims abstract description 663
- 108010093099 Endoribonucleases Proteins 0.000 claims abstract description 257
- 102000004190 Enzymes Human genes 0.000 claims abstract description 194
- 108090000790 Enzymes Proteins 0.000 claims abstract description 194
- 230000008439 repair process Effects 0.000 claims abstract description 101
- 241000894007 species Species 0.000 claims abstract description 63
- 241000282414 Homo sapiens Species 0.000 claims abstract description 44
- 241000588724 Escherichia coli Species 0.000 claims abstract description 12
- 230000001580 bacterial effect Effects 0.000 claims abstract description 12
- 240000008067 Cucumis sativus Species 0.000 claims abstract description 8
- 235000009849 Cucumis sativus Nutrition 0.000 claims abstract description 8
- 240000006439 Aspergillus oryzae Species 0.000 claims abstract description 7
- 235000002247 Aspergillus oryzae Nutrition 0.000 claims abstract description 7
- 241000205156 Pyrococcus furiosus Species 0.000 claims abstract description 6
- 241001515965 unidentified phage Species 0.000 claims abstract description 6
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 12
- 239000000758 substrate Substances 0.000 claims description 300
- 238000003776 cleavage reaction Methods 0.000 claims description 251
- 230000007017 scission Effects 0.000 claims description 250
- 125000003729 nucleotide group Chemical group 0.000 claims description 219
- 239000002773 nucleotide Substances 0.000 claims description 207
- 238000000034 method Methods 0.000 claims description 171
- 108020004999 messenger RNA Proteins 0.000 claims description 158
- 108091034117 Oligonucleotide Proteins 0.000 claims description 142
- 102000002494 Endoribonucleases Human genes 0.000 claims description 110
- 102000006382 Ribonucleases Human genes 0.000 claims description 84
- 108010083644 Ribonucleases Proteins 0.000 claims description 84
- -1 poly(ethylene glycol) Polymers 0.000 claims description 71
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 claims description 69
- 239000003298 DNA probe Substances 0.000 claims description 61
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 claims description 61
- 108020003215 DNA Probes Proteins 0.000 claims description 52
- 230000000694 effects Effects 0.000 claims description 48
- 239000007787 solid Substances 0.000 claims description 46
- XSQUKJJJFZCRTK-UHFFFAOYSA-N urea group Chemical group NC(=O)N XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 claims description 46
- 102000040430 polynucleotide Human genes 0.000 claims description 43
- 108091033319 polynucleotide Proteins 0.000 claims description 43
- 239000002157 polynucleotide Substances 0.000 claims description 39
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 claims description 36
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 claims description 33
- 239000003795 chemical substances by application Substances 0.000 claims description 33
- 230000027455 binding Effects 0.000 claims description 32
- 239000012634 fragment Substances 0.000 claims description 29
- 238000004949 mass spectrometry Methods 0.000 claims description 28
- 108090001050 Phosphoric Diester Hydrolases Proteins 0.000 claims description 27
- 102000004861 Phosphoric Diester Hydrolases Human genes 0.000 claims description 27
- 239000004202 carbamide Substances 0.000 claims description 24
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 claims description 23
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 claims description 23
- ZMXDDKWLCZADIW-UHFFFAOYSA-N N,N-Dimethylformamide Chemical compound CN(C)C=O ZMXDDKWLCZADIW-UHFFFAOYSA-N 0.000 claims description 21
- 239000000463 material Substances 0.000 claims description 19
- 108020002230 Pancreatic Ribonuclease Proteins 0.000 claims description 18
- 102000005891 Pancreatic ribonuclease Human genes 0.000 claims description 18
- DNIAPMSPPWPWGF-UHFFFAOYSA-N Propylene glycol Chemical compound CC(O)CO DNIAPMSPPWPWGF-UHFFFAOYSA-N 0.000 claims description 18
- 230000000295 complement effect Effects 0.000 claims description 17
- 108010073254 Colicins Proteins 0.000 claims description 16
- 238000000338 in vitro Methods 0.000 claims description 16
- 239000006172 buffering agent Substances 0.000 claims description 15
- 239000002777 nucleoside Substances 0.000 claims description 15
- 229920001223 polyethylene glycol Polymers 0.000 claims description 15
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 claims description 14
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 claims description 14
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 claims description 14
- 101710086015 RNA ligase Proteins 0.000 claims description 14
- 108020004566 Transfer RNA Proteins 0.000 claims description 14
- 108010082351 cusativin Proteins 0.000 claims description 14
- 108020005403 ribonuclease U2 Proteins 0.000 claims description 14
- 239000012530 fluid Substances 0.000 claims description 13
- 108020004418 ribosomal RNA Proteins 0.000 claims description 12
- 150000003839 salts Chemical class 0.000 claims description 12
- 239000001226 triphosphate Substances 0.000 claims description 12
- 235000011178 triphosphate Nutrition 0.000 claims description 12
- 238000005859 coupling reaction Methods 0.000 claims description 11
- 229910052751 metal Inorganic materials 0.000 claims description 11
- 239000002184 metal Substances 0.000 claims description 11
- 239000002679 microRNA Substances 0.000 claims description 11
- 108091032955 Bacterial small RNA Proteins 0.000 claims description 10
- 108091028075 Circular RNA Proteins 0.000 claims description 10
- 108700011259 MicroRNAs Proteins 0.000 claims description 10
- 241000251539 Vertebrata <Metazoa> Species 0.000 claims description 9
- 108010066490 ribonuclease 4 Proteins 0.000 claims description 9
- 108020005004 Guide RNA Proteins 0.000 claims description 8
- 244000302512 Momordica charantia Species 0.000 claims description 8
- 238000001502 gel electrophoresis Methods 0.000 claims description 8
- 239000002924 silencing RNA Substances 0.000 claims description 8
- LZZYPRNAOMGNLH-UHFFFAOYSA-M Cetrimonium bromide Chemical compound [Br-].CCCCCCCCCCCCCCCC[N+](C)(C)C LZZYPRNAOMGNLH-UHFFFAOYSA-M 0.000 claims description 7
- 235000009811 Momordica charantia Nutrition 0.000 claims description 7
- 102100026411 Ribonuclease 4 Human genes 0.000 claims description 7
- 238000005251 capillar electrophoresis Methods 0.000 claims description 7
- ZJYYHGLJYGJLLN-UHFFFAOYSA-N guanidinium thiocyanate Chemical compound SC#N.NC(N)=N ZJYYHGLJYGJLLN-UHFFFAOYSA-N 0.000 claims description 7
- ABBQHOQBGMUPJH-UHFFFAOYSA-M Sodium salicylate Chemical compound [Na+].OC1=CC=CC=C1C([O-])=O ABBQHOQBGMUPJH-UHFFFAOYSA-M 0.000 claims description 6
- 238000003981 capillary liquid chromatography Methods 0.000 claims description 6
- 229960004025 sodium salicylate Drugs 0.000 claims description 6
- 210000001519 tissue Anatomy 0.000 claims description 6
- 241000588722 Escherichia Species 0.000 claims description 5
- WAEMQWOKJMHJLA-UHFFFAOYSA-N Manganese(2+) Chemical compound [Mn+2] WAEMQWOKJMHJLA-UHFFFAOYSA-N 0.000 claims description 5
- 206010036790 Productive cough Diseases 0.000 claims description 5
- 241000282898 Sus scrofa Species 0.000 claims description 5
- 108091046869 Telomeric non-coding RNA Proteins 0.000 claims description 5
- 238000001574 biopsy Methods 0.000 claims description 5
- 239000008280 blood Substances 0.000 claims description 5
- 210000004369 blood Anatomy 0.000 claims description 5
- 239000003184 complementary RNA Substances 0.000 claims description 5
- 210000003608 fece Anatomy 0.000 claims description 5
- 230000000155 isotopic effect Effects 0.000 claims description 5
- 210000002751 lymph Anatomy 0.000 claims description 5
- 210000003296 saliva Anatomy 0.000 claims description 5
- 210000000582 semen Anatomy 0.000 claims description 5
- 239000002689 soil Substances 0.000 claims description 5
- 210000003802 sputum Anatomy 0.000 claims description 5
- 208000024794 sputum Diseases 0.000 claims description 5
- 210000004243 sweat Anatomy 0.000 claims description 5
- 210000002700 urine Anatomy 0.000 claims description 5
- 238000005406 washing Methods 0.000 claims description 5
- 108020005544 Antisense RNA Proteins 0.000 claims description 4
- 108091023037 Aptamer Proteins 0.000 claims description 4
- 108020005174 Archaeal RNA Proteins 0.000 claims description 4
- JLVVSXFLKOJNIY-UHFFFAOYSA-N Magnesium ion Chemical compound [Mg+2] JLVVSXFLKOJNIY-UHFFFAOYSA-N 0.000 claims description 4
- VEQPNABPJHWNSG-UHFFFAOYSA-N Nickel(2+) Chemical compound [Ni+2] VEQPNABPJHWNSG-UHFFFAOYSA-N 0.000 claims description 4
- 108020000999 Viral RNA Proteins 0.000 claims description 4
- XLJKHNWPARRRJB-UHFFFAOYSA-N cobalt(2+) Chemical compound [Co+2] XLJKHNWPARRRJB-UHFFFAOYSA-N 0.000 claims description 4
- 239000010865 sewage Substances 0.000 claims description 4
- 239000010802 sludge Substances 0.000 claims description 4
- 238000004891 communication Methods 0.000 claims description 3
- 230000004570 RNA-binding Effects 0.000 claims description 2
- 238000007865 diluting Methods 0.000 claims description 2
- 238000004811 liquid chromatography Methods 0.000 claims description 2
- 102000040650 (ribonucleotides)n+m Human genes 0.000 abstract description 15
- 241000282890 Sus Species 0.000 abstract description 2
- 102100030011 Endoribonuclease Human genes 0.000 abstract 1
- 241000218984 Momordica Species 0.000 abstract 1
- 235000009815 Momordica Nutrition 0.000 abstract 1
- 239000000047 product Substances 0.000 description 148
- 102100030013 Endoribonuclease Human genes 0.000 description 147
- 101000650940 Autographa californica nuclear polyhedrosis virus RNA ligase Proteins 0.000 description 106
- 101001139028 Enterobacteria phage T4 Polynucleotide kinase Proteins 0.000 description 106
- 101001094809 Homo sapiens Polynucleotide 5'-hydroxyl-kinase Proteins 0.000 description 106
- 101001099586 Homo sapiens Pyridoxal kinase Proteins 0.000 description 106
- 102100035460 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 106
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 102
- 230000029087 digestion Effects 0.000 description 82
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 51
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 50
- 229940045145 uridine Drugs 0.000 description 50
- 238000006243 chemical reaction Methods 0.000 description 46
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 44
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 44
- 150000001413 amino acids Chemical group 0.000 description 34
- 238000012163 sequencing technique Methods 0.000 description 31
- 229910019142 PO4 Inorganic materials 0.000 description 29
- 239000010452 phosphate Substances 0.000 description 29
- 239000000523 sample Substances 0.000 description 29
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 27
- 102000039446 nucleic acids Human genes 0.000 description 26
- 108020004707 nucleic acids Proteins 0.000 description 26
- 108090000623 proteins and genes Proteins 0.000 description 26
- 102000004169 proteins and genes Human genes 0.000 description 26
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 25
- 108010046983 Ribonuclease T1 Proteins 0.000 description 25
- 229960005305 adenosine Drugs 0.000 description 25
- 235000018102 proteins Nutrition 0.000 description 25
- 108091027075 5S-rRNA precursor Proteins 0.000 description 24
- 238000002474 experimental method Methods 0.000 description 24
- 239000003161 ribonuclease inhibitor Substances 0.000 description 23
- 229940029575 guanosine Drugs 0.000 description 22
- 238000001294 liquid chromatography-tandem mass spectrometry Methods 0.000 description 22
- 229920000642 polymer Polymers 0.000 description 22
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 21
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 21
- 238000004873 anchoring Methods 0.000 description 21
- 108090000765 processed proteins & peptides Proteins 0.000 description 21
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 20
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 20
- 102100037968 Ribonuclease inhibitor Human genes 0.000 description 20
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 20
- 125000000524 functional group Chemical group 0.000 description 19
- 235000002639 sodium chloride Nutrition 0.000 description 18
- 102100034343 Integrase Human genes 0.000 description 17
- 239000000562 conjugate Substances 0.000 description 17
- 238000002372 labelling Methods 0.000 description 17
- 230000004048 modification Effects 0.000 description 17
- 238000012986 modification Methods 0.000 description 17
- 150000007523 nucleic acids Chemical class 0.000 description 17
- 101710203526 Integrase Proteins 0.000 description 16
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 16
- 239000000872 buffer Substances 0.000 description 16
- 238000009826 distribution Methods 0.000 description 16
- 229920001184 polypeptide Polymers 0.000 description 16
- 102000004196 processed proteins & peptides Human genes 0.000 description 16
- 235000001014 amino acid Nutrition 0.000 description 15
- 229940024606 amino acid Drugs 0.000 description 15
- 238000005520 cutting process Methods 0.000 description 15
- 238000013467 fragmentation Methods 0.000 description 15
- 238000006062 fragmentation reaction Methods 0.000 description 15
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 14
- 239000002585 base Substances 0.000 description 14
- 238000011534 incubation Methods 0.000 description 14
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical group N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 13
- 108020004414 DNA Proteins 0.000 description 13
- 108010016626 Dipeptides Proteins 0.000 description 13
- 230000000875 corresponding effect Effects 0.000 description 13
- UEZVMMHDMIWARA-UHFFFAOYSA-M phosphonate Chemical compound [O-]P(=O)=O UEZVMMHDMIWARA-UHFFFAOYSA-M 0.000 description 13
- 238000004885 tandem mass spectrometry Methods 0.000 description 13
- JGFZNNIVVJXRND-UHFFFAOYSA-N N,N-Diisopropylethylamine (DIPEA) Chemical compound CCN(C(C)C)C(C)C JGFZNNIVVJXRND-UHFFFAOYSA-N 0.000 description 12
- 239000011324 bead Substances 0.000 description 12
- 230000015572 biosynthetic process Effects 0.000 description 12
- 238000012512 characterization method Methods 0.000 description 12
- 238000001819 mass spectrum Methods 0.000 description 12
- ZUHQCDZJPTXVCU-UHFFFAOYSA-N C1#CCCC2=CC=CC=C2C2=CC=CC=C21 Chemical group C1#CCCC2=CC=CC=C2C2=CC=CC=C21 ZUHQCDZJPTXVCU-UHFFFAOYSA-N 0.000 description 11
- 239000003153 chemical reaction reagent Substances 0.000 description 11
- 238000011002 quantification Methods 0.000 description 11
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 10
- 230000026279 RNA modification Effects 0.000 description 10
- 101710141795 Ribonuclease inhibitor Proteins 0.000 description 10
- 229940122208 Ribonuclease inhibitor Drugs 0.000 description 10
- 150000001450 anions Chemical class 0.000 description 10
- 238000013459 approach Methods 0.000 description 10
- 230000004927 fusion Effects 0.000 description 10
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 10
- 230000026731 phosphorylation Effects 0.000 description 10
- 238000006366 phosphorylation reaction Methods 0.000 description 10
- 150000003212 purines Chemical class 0.000 description 10
- 150000003230 pyrimidines Chemical class 0.000 description 10
- 108091028043 Nucleic acid sequence Proteins 0.000 description 9
- 239000013614 RNA sample Substances 0.000 description 9
- 238000003556 assay Methods 0.000 description 9
- 150000001720 carbohydrates Chemical class 0.000 description 9
- 235000014633 carbohydrates Nutrition 0.000 description 9
- 108010024226 placental ribonuclease inhibitor Proteins 0.000 description 9
- 239000011541 reaction mixture Substances 0.000 description 9
- HTWSTKVLFZRAPM-QYYRPYCUSA-N (2r,3r,4s,5s)-2-(6-aminopurin-9-yl)-4-azido-5-(hydroxymethyl)oxolan-3-ol Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](N=[N+]=[N-])[C@H]1O HTWSTKVLFZRAPM-QYYRPYCUSA-N 0.000 description 8
- 238000010790 dilution Methods 0.000 description 8
- 239000012895 dilution Substances 0.000 description 8
- 239000003446 ligand Substances 0.000 description 8
- 230000005291 magnetic effect Effects 0.000 description 8
- 238000013507 mapping Methods 0.000 description 8
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 8
- 239000011780 sodium chloride Substances 0.000 description 8
- 239000000243 solution Substances 0.000 description 8
- ZNZYKNKBJPZETN-WELNAUFTSA-N Dialdehyde 11678 Chemical group N1C2=CC=CC=C2C2=C1[C@H](C[C@H](/C(=C/O)C(=O)OC)[C@@H](C=C)C=O)NCC2 ZNZYKNKBJPZETN-WELNAUFTSA-N 0.000 description 7
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 7
- 108010092408 Eosinophil Peroxidase Proteins 0.000 description 7
- 102100031939 Erythropoietin Human genes 0.000 description 7
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 7
- 230000007022 RNA scission Effects 0.000 description 7
- 238000007792 addition Methods 0.000 description 7
- 229960002685 biotin Drugs 0.000 description 7
- 235000020958 biotin Nutrition 0.000 description 7
- 239000011616 biotin Substances 0.000 description 7
- 229910052799 carbon Inorganic materials 0.000 description 7
- 239000003814 drug Substances 0.000 description 7
- 238000009396 hybridization Methods 0.000 description 7
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 7
- 125000004430 oxygen atom Chemical group O* 0.000 description 7
- OXCMYAYHXIHQOA-UHFFFAOYSA-N potassium;[2-butyl-5-chloro-3-[[4-[2-(1,2,4-triaza-3-azanidacyclopenta-1,4-dien-5-yl)phenyl]phenyl]methyl]imidazol-4-yl]methanol Chemical compound [K+].CCCCC1=NC(Cl)=C(CO)N1CC1=CC=C(C=2C(=CC=CC=2)C2=N[N-]N=N2)C=C1 OXCMYAYHXIHQOA-UHFFFAOYSA-N 0.000 description 7
- 230000008569 process Effects 0.000 description 7
- 238000004445 quantitative analysis Methods 0.000 description 7
- 239000000377 silicon dioxide Substances 0.000 description 7
- 238000003786 synthesis reaction Methods 0.000 description 7
- BYEAHWXPCBROCE-UHFFFAOYSA-N 1,1,1,3,3,3-hexafluoropropan-2-ol Chemical compound FC(F)(F)C(O)C(F)(F)F BYEAHWXPCBROCE-UHFFFAOYSA-N 0.000 description 6
- DVGKRPYUFRZAQW-UHFFFAOYSA-N 3 prime Natural products CC(=O)NC1OC(CC(O)C1C(O)C(O)CO)(OC2C(O)C(CO)OC(OC3C(O)C(O)C(O)OC3CO)C2O)C(=O)O DVGKRPYUFRZAQW-UHFFFAOYSA-N 0.000 description 6
- 241000203069 Archaea Species 0.000 description 6
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 6
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 6
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 6
- 150000001412 amines Chemical class 0.000 description 6
- 239000011521 glass Substances 0.000 description 6
- 239000012535 impurity Substances 0.000 description 6
- 150000002632 lipids Chemical class 0.000 description 6
- 150000003833 nucleoside derivatives Chemical class 0.000 description 6
- 239000002904 solvent Substances 0.000 description 6
- UVBYMVOUBXYSFV-XUTVFYLZSA-N 1-methylpseudouridine Chemical compound O=C1NC(=O)N(C)C=C1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 UVBYMVOUBXYSFV-XUTVFYLZSA-N 0.000 description 5
- UVBYMVOUBXYSFV-UHFFFAOYSA-N 1-methylpseudouridine Natural products O=C1NC(=O)N(C)C=C1C1C(O)C(O)C(CO)O1 UVBYMVOUBXYSFV-UHFFFAOYSA-N 0.000 description 5
- 102000004533 Endonucleases Human genes 0.000 description 5
- 108010042407 Endonucleases Proteins 0.000 description 5
- 101001010783 Homo sapiens Endoribonuclease Proteins 0.000 description 5
- 238000000738 capillary electrophoresis-mass spectrometry Methods 0.000 description 5
- 210000004027 cell Anatomy 0.000 description 5
- 239000007795 chemical reaction product Substances 0.000 description 5
- 230000001419 dependent effect Effects 0.000 description 5
- 230000002255 enzymatic effect Effects 0.000 description 5
- 108020001507 fusion proteins Proteins 0.000 description 5
- 102000037865 fusion proteins Human genes 0.000 description 5
- 238000010438 heat treatment Methods 0.000 description 5
- 229910052739 hydrogen Inorganic materials 0.000 description 5
- 239000001257 hydrogen Substances 0.000 description 5
- 238000010348 incorporation Methods 0.000 description 5
- 239000012528 membrane Substances 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 239000002336 ribonucleotide Substances 0.000 description 5
- 239000000126 substance Substances 0.000 description 5
- 238000006467 substitution reaction Methods 0.000 description 5
- 238000013518 transcription Methods 0.000 description 5
- 230000035897 transcription Effects 0.000 description 5
- 238000001195 ultra high performance liquid chromatography Methods 0.000 description 5
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 4
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 4
- 241000196324 Embryophyta Species 0.000 description 4
- 241000701533 Escherichia virus T4 Species 0.000 description 4
- 241000233866 Fungi Species 0.000 description 4
- 101001010787 Homo sapiens Endoribonuclease Proteins 0.000 description 4
- 108010093096 Immobilized Enzymes Proteins 0.000 description 4
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 4
- 102000003960 Ligases Human genes 0.000 description 4
- 108090000364 Ligases Proteins 0.000 description 4
- 229940026233 Pfizer-BioNTech COVID-19 vaccine Drugs 0.000 description 4
- 101710124239 Poly(A) polymerase Proteins 0.000 description 4
- 108091036407 Polyadenylation Proteins 0.000 description 4
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 4
- 108020004518 RNA Probes Proteins 0.000 description 4
- 239000003391 RNA probe Substances 0.000 description 4
- 108091028664 Ribonucleotide Proteins 0.000 description 4
- GWEVSGVZZGPLCZ-UHFFFAOYSA-N Titan oxide Chemical compound O=[Ti]=O GWEVSGVZZGPLCZ-UHFFFAOYSA-N 0.000 description 4
- 239000000654 additive Substances 0.000 description 4
- 235000004279 alanine Nutrition 0.000 description 4
- 150000001345 alkine derivatives Chemical class 0.000 description 4
- 150000001540 azides Chemical class 0.000 description 4
- 238000004364 calculation method Methods 0.000 description 4
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 4
- 125000004122 cyclic group Chemical group 0.000 description 4
- 238000010828 elution Methods 0.000 description 4
- 230000006870 function Effects 0.000 description 4
- 150000004676 glycans Chemical class 0.000 description 4
- 230000003301 hydrolyzing effect Effects 0.000 description 4
- 239000000543 intermediate Substances 0.000 description 4
- 150000002500 ions Chemical class 0.000 description 4
- 238000002955 isolation Methods 0.000 description 4
- 238000001948 isotopic labelling Methods 0.000 description 4
- 108700021021 mRNA Vaccine Proteins 0.000 description 4
- 229940126582 mRNA vaccine Drugs 0.000 description 4
- 229910052759 nickel Inorganic materials 0.000 description 4
- 230000036961 partial effect Effects 0.000 description 4
- 239000000863 peptide conjugate Substances 0.000 description 4
- 239000004033 plastic Substances 0.000 description 4
- 229920003023 plastic Polymers 0.000 description 4
- 229920001282 polysaccharide Polymers 0.000 description 4
- 239000005017 polysaccharide Substances 0.000 description 4
- 125000002652 ribonucleotide group Chemical group 0.000 description 4
- 230000003595 spectral effect Effects 0.000 description 4
- 230000001225 therapeutic effect Effects 0.000 description 4
- 150000003573 thiols Chemical class 0.000 description 4
- UNXRWKVEANCORM-UHFFFAOYSA-N triphosphoric acid Chemical compound OP(O)(=O)OP(O)(=O)OP(O)(O)=O UNXRWKVEANCORM-UHFFFAOYSA-N 0.000 description 4
- 229960005486 vaccine Drugs 0.000 description 4
- XWNJMSJGJFSGRY-UHFFFAOYSA-N 2-(benzylamino)-3,7-dihydropurin-6-one Chemical compound N1C=2N=CNC=2C(=O)N=C1NCC1=CC=CC=C1 XWNJMSJGJFSGRY-UHFFFAOYSA-N 0.000 description 3
- WHSOKGZCVSCOJM-UHFFFAOYSA-N 4-amino-1-benzylpyrimidin-2-one Chemical compound O=C1N=C(N)C=CN1CC1=CC=CC=C1 WHSOKGZCVSCOJM-UHFFFAOYSA-N 0.000 description 3
- ZXIATBNUWJBBGT-JXOAFFINSA-N 5-methoxyuridine Chemical compound O=C1NC(=O)C(OC)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 ZXIATBNUWJBBGT-JXOAFFINSA-N 0.000 description 3
- QGZKDVFQNNGYKY-OUBTZVSYSA-N Ammonia-15N Chemical compound [15NH3] QGZKDVFQNNGYKY-OUBTZVSYSA-N 0.000 description 3
- 102000008682 Argonaute Proteins Human genes 0.000 description 3
- 108010088141 Argonaute Proteins Proteins 0.000 description 3
- 208000025721 COVID-19 Diseases 0.000 description 3
- OKTJSMMVPCPJKN-OUBTZVSYSA-N Carbon-13 Chemical compound [13C] OKTJSMMVPCPJKN-OUBTZVSYSA-N 0.000 description 3
- 229920002101 Chitin Polymers 0.000 description 3
- 102000016911 Deoxyribonucleases Human genes 0.000 description 3
- 108010053770 Deoxyribonucleases Proteins 0.000 description 3
- YZCKVEUIGOORGS-OUBTZVSYSA-N Deuterium Chemical compound [2H] YZCKVEUIGOORGS-OUBTZVSYSA-N 0.000 description 3
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 3
- 108010070675 Glutathione transferase Proteins 0.000 description 3
- XKMLYUALXHKNFT-UUOKFMHZSA-N Guanosine-5'-triphosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O XKMLYUALXHKNFT-UUOKFMHZSA-N 0.000 description 3
- 102100029100 Hematopoietic prostaglandin D synthase Human genes 0.000 description 3
- OAKJQQAXSVQMHS-UHFFFAOYSA-N Hydrazine Chemical compound NN OAKJQQAXSVQMHS-UHFFFAOYSA-N 0.000 description 3
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 3
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 3
- PEEHTFAAVSWFBL-UHFFFAOYSA-N Maleimide Chemical compound O=C1NC(=O)C=C1 PEEHTFAAVSWFBL-UHFFFAOYSA-N 0.000 description 3
- 101100137042 Mus musculus Pnkp gene Proteins 0.000 description 3
- 101710163270 Nuclease Proteins 0.000 description 3
- 239000002202 Polyethylene glycol Substances 0.000 description 3
- BQCADISMDOOEFD-UHFFFAOYSA-N Silver Chemical compound [Ag] BQCADISMDOOEFD-UHFFFAOYSA-N 0.000 description 3
- 108010090804 Streptavidin Proteins 0.000 description 3
- 239000007983 Tris buffer Substances 0.000 description 3
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 3
- 150000001350 alkyl halides Chemical class 0.000 description 3
- 125000004429 atom Chemical group 0.000 description 3
- 238000010461 azide-alkyne cycloaddition reaction Methods 0.000 description 3
- 230000003139 buffering effect Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 239000011248 coating agent Substances 0.000 description 3
- 238000000576 coating method Methods 0.000 description 3
- 239000010949 copper Substances 0.000 description 3
- 238000004132 cross linking Methods 0.000 description 3
- 239000005549 deoxyribonucleoside Substances 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 229910052805 deuterium Inorganic materials 0.000 description 3
- 239000005546 dideoxynucleotide Substances 0.000 description 3
- 235000014113 dietary fatty acids Nutrition 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 229930195729 fatty acid Natural products 0.000 description 3
- 239000000194 fatty acid Substances 0.000 description 3
- 150000004665 fatty acids Chemical class 0.000 description 3
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 3
- 229910052737 gold Inorganic materials 0.000 description 3
- 239000010931 gold Substances 0.000 description 3
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 3
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 3
- 150000004715 keto acids Chemical class 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 230000012743 protein tagging Effects 0.000 description 3
- XKMLYUALXHKNFT-UHFFFAOYSA-N rGTP Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)C(O)C1O XKMLYUALXHKNFT-UHFFFAOYSA-N 0.000 description 3
- 239000011347 resin Substances 0.000 description 3
- 229920005989 resin Polymers 0.000 description 3
- 239000002342 ribonucleoside Substances 0.000 description 3
- 239000012266 salt solution Substances 0.000 description 3
- 239000004332 silver Substances 0.000 description 3
- 229910052709 silver Inorganic materials 0.000 description 3
- 238000001179 sorption measurement Methods 0.000 description 3
- 238000001228 spectrum Methods 0.000 description 3
- 238000003860 storage Methods 0.000 description 3
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical group [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 3
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 3
- 229910052725 zinc Inorganic materials 0.000 description 3
- 239000011701 zinc Substances 0.000 description 3
- WQBCHXWMHQMQKW-XVFCMESISA-N 1-[(2r,3r,4s,5s)-4-azido-3-hydroxy-5-(hydroxymethyl)oxolan-2-yl]pyrimidine-2,4-dione Chemical compound O[C@@H]1[C@H](N=[N+]=[N-])[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 WQBCHXWMHQMQKW-XVFCMESISA-N 0.000 description 2
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 2
- 108010022794 2',3'-Cyclic-Nucleotide Phosphodiesterases Proteins 0.000 description 2
- 102000012438 2',3'-Cyclic-Nucleotide Phosphodiesterases Human genes 0.000 description 2
- OFEZSBMBBKLLBJ-UHFFFAOYSA-N 2-(6-aminopurin-9-yl)-5-(hydroxymethyl)oxolan-3-ol Chemical compound C1=NC=2C(N)=NC=NC=2N1C1OC(CO)CC1O OFEZSBMBBKLLBJ-UHFFFAOYSA-N 0.000 description 2
- IZFJAICCKKWWNM-JXOAFFINSA-N 4-amino-1-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-5-methoxypyrimidin-2-one Chemical compound O=C1N=C(N)C(OC)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 IZFJAICCKKWWNM-JXOAFFINSA-N 0.000 description 2
- HKFJGAHHVULNMG-XVFCMESISA-N 4-amino-1-[(2r,3r,4s,5s)-4-azido-3-hydroxy-5-(hydroxymethyl)oxolan-2-yl]pyrimidin-2-one Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@H](N=[N+]=[N-])[C@@H](CO)O1 HKFJGAHHVULNMG-XVFCMESISA-N 0.000 description 2
- ZAYHVCMSTBRABG-JXOAFFINSA-N 5-methylcytidine Chemical class O=C1N=C(N)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 ZAYHVCMSTBRABG-JXOAFFINSA-N 0.000 description 2
- 101710159080 Aconitate hydratase A Proteins 0.000 description 2
- 101710159078 Aconitate hydratase B Proteins 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 108090001008 Avidin Proteins 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 2
- 101150014715 CAP2 gene Proteins 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- 229910004613 CdTe Inorganic materials 0.000 description 2
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 2
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- ULGZDMOVFRHVEP-RWJQBGPGSA-N Erythromycin Chemical compound O([C@@H]1[C@@H](C)C(=O)O[C@@H]([C@@]([C@H](O)[C@@H](C)C(=O)[C@H](C)C[C@@](C)(O)[C@H](O[C@H]2[C@@H]([C@H](C[C@@H](C)O2)N(C)C)O)[C@H]1C)(C)O)CC)[C@H]1C[C@@](C)(OC)[C@@H](O)[C@H](C)O1 ULGZDMOVFRHVEP-RWJQBGPGSA-N 0.000 description 2
- KRHYYFGTRYWZRS-UHFFFAOYSA-M Fluoride anion Chemical compound [F-] KRHYYFGTRYWZRS-UHFFFAOYSA-M 0.000 description 2
- 229910001218 Gallium arsenide Inorganic materials 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- HVLSXIKZNLPZJJ-TXZCQADKSA-N HA peptide Chemical compound C([C@@H](C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HVLSXIKZNLPZJJ-TXZCQADKSA-N 0.000 description 2
- 101000987586 Homo sapiens Eosinophil peroxidase Proteins 0.000 description 2
- 101000920686 Homo sapiens Erythropoietin Proteins 0.000 description 2
- 238000006736 Huisgen cycloaddition reaction Methods 0.000 description 2
- 229910000673 Indium arsenide Inorganic materials 0.000 description 2
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 2
- 229930010555 Inosine Natural products 0.000 description 2
- 241000235058 Komagataella pastoris Species 0.000 description 2
- 108090001090 Lectins Proteins 0.000 description 2
- 102000004856 Lectins Human genes 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 2
- 101100198353 Mus musculus Rnasel gene Proteins 0.000 description 2
- 101100260872 Mus musculus Tmprss4 gene Proteins 0.000 description 2
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 2
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 2
- VQAYFKKCNSOZKM-IOSLPCCCSA-N N(6)-methyladenosine Chemical compound C1=NC=2C(NC)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O VQAYFKKCNSOZKM-IOSLPCCCSA-N 0.000 description 2
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 2
- SEQKRHFRPICQDD-UHFFFAOYSA-N N-tris(hydroxymethyl)methylglycine Chemical compound OCC(CO)(CO)[NH2+]CC([O-])=O SEQKRHFRPICQDD-UHFFFAOYSA-N 0.000 description 2
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 2
- 108091093037 Peptide nucleic acid Proteins 0.000 description 2
- XYFCBTPGUUZFHI-UHFFFAOYSA-N Phosphine Chemical compound P XYFCBTPGUUZFHI-UHFFFAOYSA-N 0.000 description 2
- 239000004698 Polyethylene Substances 0.000 description 2
- 102000001253 Protein Kinase Human genes 0.000 description 2
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- 229930185560 Pseudouridine Natural products 0.000 description 2
- 230000006819 RNA synthesis Effects 0.000 description 2
- 102000044126 RNA-Binding Proteins Human genes 0.000 description 2
- 101710105008 RNA-binding protein Proteins 0.000 description 2
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- UIIMBOGNXHQVGW-UHFFFAOYSA-M Sodium bicarbonate Chemical compound [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 description 2
- DPOPAJRDYZGTIR-UHFFFAOYSA-N Tetrazine Chemical compound C1=CN=NN=N1 DPOPAJRDYZGTIR-UHFFFAOYSA-N 0.000 description 2
- 102000003929 Transaminases Human genes 0.000 description 2
- 108090000340 Transaminases Proteins 0.000 description 2
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 2
- 102000004357 Transferases Human genes 0.000 description 2
- 108090000992 Transferases Proteins 0.000 description 2
- 229910052770 Uranium Inorganic materials 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- 230000002776 aggregation Effects 0.000 description 2
- 238000004220 aggregation Methods 0.000 description 2
- 150000001336 alkenes Chemical class 0.000 description 2
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 2
- 239000004411 aluminium Substances 0.000 description 2
- 229910052782 aluminium Inorganic materials 0.000 description 2
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 2
- 150000001414 amino alcohols Chemical class 0.000 description 2
- 125000003277 amino group Chemical group 0.000 description 2
- 125000000129 anionic group Chemical group 0.000 description 2
- 239000012620 biological material Substances 0.000 description 2
- 230000000903 blocking effect Effects 0.000 description 2
- 239000006227 byproduct Substances 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- UHYPYGJEEGLRJD-UHFFFAOYSA-N cadmium(2+);selenium(2-) Chemical compound [Se-2].[Cd+2] UHYPYGJEEGLRJD-UHFFFAOYSA-N 0.000 description 2
- 238000011088 calibration curve Methods 0.000 description 2
- 125000002680 canonical nucleotide group Chemical group 0.000 description 2
- 230000003197 catalytic effect Effects 0.000 description 2
- 238000007385 chemical modification Methods 0.000 description 2
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 2
- 229910017052 cobalt Inorganic materials 0.000 description 2
- 239000010941 cobalt Substances 0.000 description 2
- GUTLYIVDDKVIGB-UHFFFAOYSA-N cobalt atom Chemical compound [Co] GUTLYIVDDKVIGB-UHFFFAOYSA-N 0.000 description 2
- 239000000356 contaminant Substances 0.000 description 2
- 229910052802 copper Inorganic materials 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 239000005547 deoxyribonucleotide Substances 0.000 description 2
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 2
- 239000003599 detergent Substances 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 238000000502 dialysis Methods 0.000 description 2
- 150000004985 diamines Chemical class 0.000 description 2
- FFYPMLJYZAEMQB-UHFFFAOYSA-N diethyl pyrocarbonate Chemical compound CCOC(=O)OC(=O)OCC FFYPMLJYZAEMQB-UHFFFAOYSA-N 0.000 description 2
- 239000001177 diphosphate Substances 0.000 description 2
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 2
- 235000011180 diphosphates Nutrition 0.000 description 2
- BNIILDVGGAEEIG-UHFFFAOYSA-L disodium hydrogen phosphate Chemical compound [Na+].[Na+].OP([O-])([O-])=O BNIILDVGGAEEIG-UHFFFAOYSA-L 0.000 description 2
- 229910000397 disodium phosphate Inorganic materials 0.000 description 2
- 238000010494 dissociation reaction Methods 0.000 description 2
- 230000005593 dissociations Effects 0.000 description 2
- VYFYYTLLBUKUHU-UHFFFAOYSA-N dopamine Chemical compound NCCC1=CC=C(O)C(O)=C1 VYFYYTLLBUKUHU-UHFFFAOYSA-N 0.000 description 2
- 239000000975 dye Substances 0.000 description 2
- 229950007919 egtazic acid Drugs 0.000 description 2
- 239000012149 elution buffer Substances 0.000 description 2
- DEFVIWRASFVYLL-UHFFFAOYSA-N ethylene glycol bis(2-aminoethyl)tetraacetic acid Chemical compound OC(=O)CN(CC(O)=O)CCOCCOCCN(CC(O)=O)CC(O)=O DEFVIWRASFVYLL-UHFFFAOYSA-N 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- RWSXRVCMGQZWBV-WDSKDSINSA-N glutathione Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@@H](CS)C(=O)NCC(O)=O RWSXRVCMGQZWBV-WDSKDSINSA-N 0.000 description 2
- KWIUHFFTVRNATP-UHFFFAOYSA-N glycine betaine Chemical compound C[N+](C)(C)CC([O-])=O KWIUHFFTVRNATP-UHFFFAOYSA-N 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 102000044890 human EPO Human genes 0.000 description 2
- 230000007062 hydrolysis Effects 0.000 description 2
- 238000006460 hydrolysis reaction Methods 0.000 description 2
- 150000003949 imides Chemical class 0.000 description 2
- 230000005847 immunogenicity Effects 0.000 description 2
- RPQDHPTXJYYUPQ-UHFFFAOYSA-N indium arsenide Chemical compound [In]#[As] RPQDHPTXJYYUPQ-UHFFFAOYSA-N 0.000 description 2
- 230000005764 inhibitory process Effects 0.000 description 2
- 229960003786 inosine Drugs 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- PGLTVOMIXTUURA-UHFFFAOYSA-N iodoacetamide Chemical compound NC(=O)CI PGLTVOMIXTUURA-UHFFFAOYSA-N 0.000 description 2
- 229910052742 iron Inorganic materials 0.000 description 2
- XEEYBQQBJWHFJM-UHFFFAOYSA-N iron Substances [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 2
- 239000012948 isocyanate Substances 0.000 description 2
- 150000002513 isocyanates Chemical class 0.000 description 2
- 150000002540 isothiocyanates Chemical class 0.000 description 2
- 239000002523 lectin Substances 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 101150094164 lysY gene Proteins 0.000 description 2
- 230000014759 maintenance of location Effects 0.000 description 2
- 229910052748 manganese Inorganic materials 0.000 description 2
- 239000011572 manganese Substances 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 238000012531 mass spectrometric analysis of intact mass Methods 0.000 description 2
- 238000002844 melting Methods 0.000 description 2
- 230000008018 melting Effects 0.000 description 2
- 150000002739 metals Chemical class 0.000 description 2
- 230000011987 methylation Effects 0.000 description 2
- 238000007069 methylation reaction Methods 0.000 description 2
- 230000037230 mobility Effects 0.000 description 2
- 239000002062 molecular scaffold Substances 0.000 description 2
- 239000002159 nanocrystal Substances 0.000 description 2
- 239000002086 nanomaterial Substances 0.000 description 2
- 229910000510 noble metal Inorganic materials 0.000 description 2
- 239000002853 nucleic acid probe Substances 0.000 description 2
- 229940124276 oligodeoxyribonucleotide Drugs 0.000 description 2
- BPUBBGLMJRNUCC-UHFFFAOYSA-N oxygen(2-);tantalum(5+) Chemical compound [O-2].[O-2].[O-2].[O-2].[O-2].[Ta+5].[Ta+5] BPUBBGLMJRNUCC-UHFFFAOYSA-N 0.000 description 2
- 238000004806 packaging method and process Methods 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 210000001322 periplasm Anatomy 0.000 description 2
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- 150000008300 phosphoramidites Chemical class 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 229920000573 polyethylene Polymers 0.000 description 2
- FGIUAXJPYTZDNR-UHFFFAOYSA-N potassium nitrate Chemical compound [K+].[O-][N+]([O-])=O FGIUAXJPYTZDNR-UHFFFAOYSA-N 0.000 description 2
- 239000002244 precipitate Substances 0.000 description 2
- 108060006633 protein kinase Proteins 0.000 description 2
- 239000002096 quantum dot Substances 0.000 description 2
- 230000009257 reactivity Effects 0.000 description 2
- 125000006853 reporter group Chemical group 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 239000004065 semiconductor Substances 0.000 description 2
- 235000012239 silicon dioxide Nutrition 0.000 description 2
- JQWHASGSAFIOCM-UHFFFAOYSA-M sodium periodate Chemical compound [Na+].[O-]I(=O)(=O)=O JQWHASGSAFIOCM-UHFFFAOYSA-M 0.000 description 2
- 125000006850 spacer group Chemical group 0.000 description 2
- 239000012536 storage buffer Substances 0.000 description 2
- 229910052717 sulfur Inorganic materials 0.000 description 2
- PBCFLUZVCVVTBY-UHFFFAOYSA-N tantalum pentoxide Inorganic materials O=[Ta](=O)O[Ta](=O)=O PBCFLUZVCVVTBY-UHFFFAOYSA-N 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- XXYIANZGUOSQHY-XLPZGREQSA-N thymidine 3'-monophosphate Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](OP(O)(O)=O)C1 XXYIANZGUOSQHY-XLPZGREQSA-N 0.000 description 2
- 239000004408 titanium dioxide Substances 0.000 description 2
- URYYVOIYTNXXBN-OWOJBTEDSA-N trans-cyclooctene Chemical compound C1CCC\C=C\CC1 URYYVOIYTNXXBN-OWOJBTEDSA-N 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 150000003852 triazoles Chemical class 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 description 1
- XSSYCIGJYCVRRK-RQJHMYQMSA-N (-)-carbovir Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1C[C@H](CO)C=C1 XSSYCIGJYCVRRK-RQJHMYQMSA-N 0.000 description 1
- PEDCQBHIVMGVHV-UXXIZXEISA-N (2H5)Propane-1,2,3-triol Chemical compound [2H]C([2H])(O)C([2H])(O)C([2H])([2H])O PEDCQBHIVMGVHV-UXXIZXEISA-N 0.000 description 1
- GZCGUPFRVQAUEE-BFPMOEKFSA-N (2R,3S,4R,5R)-2,3,4,5,6-pentahydroxy(313C)hexanal Chemical compound O=C[C@H](O)[13C@@H](O)[C@H](O)[C@H](O)CO GZCGUPFRVQAUEE-BFPMOEKFSA-N 0.000 description 1
- CZWGGYQSEGOVSD-WJDZFWBGSA-N (2R,3S,4R,5R)-2-(hydroxymethyl)-5-(6-imino-5-methylpurin-9-yl)oxolane-3,4-diol Chemical class N=C1N=CN=C2C1(C)N=CN2[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O CZWGGYQSEGOVSD-WJDZFWBGSA-N 0.000 description 1
- YWYBWWBXDKZRTG-HUKYDQBMSA-N (2r,3r,4s,5s)-2-(6-aminopurin-9-yl)-4-(azidomethyl)-5-(hydroxymethyl)oxolan-3-ol Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](CN=[N+]=[N-])[C@H]1O YWYBWWBXDKZRTG-HUKYDQBMSA-N 0.000 description 1
- ILDPUOKUEKVHIL-QYYRPYCUSA-N (2r,3r,4s,5s)-4-amino-2-(6-aminopurin-9-yl)-5-(hydroxymethyl)oxolan-3-ol Chemical compound O[C@@H]1[C@H](N)[C@@H](CO)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ILDPUOKUEKVHIL-QYYRPYCUSA-N 0.000 description 1
- GZCGUPFRVQAUEE-CERMPYPXSA-N (2r,3s,4s,5r)-2,3,4,5,6-pentahydroxyhexanal Chemical compound OC[C@@H](O)[C@H](O)[C@H](O)[C@@H](O)[13CH]=O GZCGUPFRVQAUEE-CERMPYPXSA-N 0.000 description 1
- QCDAWXDDXYQEJJ-QYYRPYCUSA-N (2r,3s,4s,5r)-2-(6-aminopurin-9-yl)-4-fluoro-5-(hydroxymethyl)oxolan-3-ol Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](F)[C@H]1O QCDAWXDDXYQEJJ-QYYRPYCUSA-N 0.000 description 1
- MHJJUOJOAJLYBS-ZBRNBAAYSA-N (2s)-2-aminopropanoic acid;(2s)-pyrrolidine-2-carboxylic acid Chemical compound C[C@H](N)C(O)=O.OC(=O)[C@@H]1CCCN1 MHJJUOJOAJLYBS-ZBRNBAAYSA-N 0.000 description 1
- COLNVLDHVKWLRT-YTRLMEAHSA-N (2s)-2-azanyl-3-phenylpropanoic acid Chemical compound OC(=O)[C@@H]([15NH2])CC1=CC=CC=C1 COLNVLDHVKWLRT-YTRLMEAHSA-N 0.000 description 1
- COLNVLDHVKWLRT-CMLFETTRSA-N (2s)-2-azanyl-3-phenylpropanoic acid Chemical compound O[13C](=O)[13C@@H]([15NH2])[13CH2][13C]1=[13CH][13CH]=[13CH][13CH]=[13CH]1 COLNVLDHVKWLRT-CMLFETTRSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-GZPBOPPUSA-N (2s)-2-azanylpropanoic acid Chemical compound C[C@H]([15NH2])C(O)=O QNAYBMKLOCPYGJ-GZPBOPPUSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-UVYXLFMMSA-N (2s)-2-azanylpropanoic acid Chemical compound [13CH3][13C@H]([15NH2])[13C](O)=O QNAYBMKLOCPYGJ-UVYXLFMMSA-N 0.000 description 1
- ONIBWKKTOPOVIA-JGTYJTGKSA-N (2s)-proline Chemical compound OC(=O)[C@@H]1CCC[15NH]1 ONIBWKKTOPOVIA-JGTYJTGKSA-N 0.000 description 1
- PYMYPHUHKUWMLA-PVQXRQKHSA-N (2s,3r,4r)-2,3,4,5-tetrahydroxypentanal Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[13CH]=O PYMYPHUHKUWMLA-PVQXRQKHSA-N 0.000 description 1
- FBPFZTCFMRRESA-CCCNNFAYSA-N (2s,3s,4r,5r)-hexane-1,2,3,4,5,6-hexol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)[13CH2]O FBPFZTCFMRRESA-CCCNNFAYSA-N 0.000 description 1
- HSINOMROUCMIEA-FGVHQWLLSA-N (2s,4r)-4-[(3r,5s,6r,7r,8s,9s,10s,13r,14s,17r)-6-ethyl-3,7-dihydroxy-10,13-dimethyl-2,3,4,5,6,7,8,9,11,12,14,15,16,17-tetradecahydro-1h-cyclopenta[a]phenanthren-17-yl]-2-methylpentanoic acid Chemical compound C([C@@]12C)C[C@@H](O)C[C@H]1[C@@H](CC)[C@@H](O)[C@@H]1[C@@H]2CC[C@]2(C)[C@@H]([C@H](C)C[C@H](C)C(O)=O)CC[C@H]21 HSINOMROUCMIEA-FGVHQWLLSA-N 0.000 description 1
- BJHIKXHVCXFQLS-WEOUYLKASA-N (3S,4R,5R)-1,3,4,5,6-pentahydroxy(113C)hexan-2-one Chemical compound O[13CH2]C(=O)[C@@H](O)[C@H](O)[C@H](O)CO BJHIKXHVCXFQLS-WEOUYLKASA-N 0.000 description 1
- WQZGKKKJIJFFOK-RUIMULFXSA-N (3r,4s,5r,6r)-2-deuterio-6-(hydroxymethyl)oxane-2,3,4,5-tetrol Chemical compound [2H]C1(O)O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O WQZGKKKJIJFFOK-RUIMULFXSA-N 0.000 description 1
- AUTOLBMXDDTRRT-JGVFFNPUSA-N (4R,5S)-dethiobiotin Chemical compound C[C@@H]1NC(=O)N[C@@H]1CCCCCC(O)=O AUTOLBMXDDTRRT-JGVFFNPUSA-N 0.000 description 1
- BHQCQFFYRZLCQQ-HFINQHRVSA-N (4r)-4-[(3r,5s,7r,8r,9s,10s,12s,13r,14s,17r)-3,7,12-trihydroxy-10,13-dimethyl-2,3,4,5,6,7,8,9,11,12,14,15,16,17-tetradecahydro-1h-cyclopenta[a]phenanthren-17-yl]pentanoic acid Chemical compound C([C@H]1C[C@H]2O)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CC[13C](O)=O)C)[C@@]2(C)[C@@H](O)C1 BHQCQFFYRZLCQQ-HFINQHRVSA-N 0.000 description 1
- PHIQHXFUZVPYII-ZCFIWIBFSA-N (R)-carnitine Chemical compound C[N+](C)(C)C[C@H](O)CC([O-])=O PHIQHXFUZVPYII-ZCFIWIBFSA-N 0.000 description 1
- ZXMGHDIOOHOAAE-UHFFFAOYSA-N 1,1,1-trifluoro-n-(trifluoromethylsulfonyl)methanesulfonamide Chemical compound FC(F)(F)S(=O)(=O)NS(=O)(=O)C(F)(F)F ZXMGHDIOOHOAAE-UHFFFAOYSA-N 0.000 description 1
- MPCAJMNYNOGXPB-UHFFFAOYSA-N 1,5-Anhydro-mannit Natural products OCC1OCC(O)C(O)C1O MPCAJMNYNOGXPB-UHFFFAOYSA-N 0.000 description 1
- DRTQHJPVMGBUCF-DMNIUWJGSA-N 1-[(2R,3R,4S,5R)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl](1,3-15N2)pyrimidine-2,4-dione Chemical compound OC[C@H]1O[C@H]([C@H](O)[C@@H]1O)[15n]1ccc(=O)[15nH]c1=O DRTQHJPVMGBUCF-DMNIUWJGSA-N 0.000 description 1
- WOUCZTLSFFROTQ-XVFCMESISA-N 1-[(2r,3r,4s,5s)-4-amino-3-hydroxy-5-(hydroxymethyl)oxolan-2-yl]pyrimidine-2,4-dione Chemical compound O[C@@H]1[C@H](N)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 WOUCZTLSFFROTQ-XVFCMESISA-N 0.000 description 1
- KLUJHYLVSBGISP-SHYZEUOFSA-N 1-[(2r,3r,5s)-3-azido-5-(hydroxymethyl)oxolan-2-yl]pyrimidine-2,4-dione Chemical compound O1[C@H](CO)C[C@@H](N=[N+]=[N-])[C@@H]1N1C(=O)NC(=O)C=C1 KLUJHYLVSBGISP-SHYZEUOFSA-N 0.000 description 1
- QOXJRLADYHZRGC-SHYZEUOFSA-N 1-[(2r,3r,5s)-3-hydroxy-5-(hydroxymethyl)oxolan-2-yl]pyrimidine-2,4-dione Chemical compound O1[C@H](CO)C[C@@H](O)[C@@H]1N1C(=O)NC(=O)C=C1 QOXJRLADYHZRGC-SHYZEUOFSA-N 0.000 description 1
- FVBOTRDLABQYMI-XVFCMESISA-N 1-[(2r,3s,4s,5r)-4-fluoro-3-hydroxy-5-(hydroxymethyl)oxolan-2-yl]pyrimidine-2,4-dione Chemical compound O[C@@H]1[C@H](F)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 FVBOTRDLABQYMI-XVFCMESISA-N 0.000 description 1
- ZSNNBSPEFVIUDS-SHYZEUOFSA-N 1-[(2r,4s,5s)-4-azido-5-(hydroxymethyl)oxolan-2-yl]pyrimidine-2,4-dione Chemical compound C1[C@H](N=[N+]=[N-])[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 ZSNNBSPEFVIUDS-SHYZEUOFSA-N 0.000 description 1
- GFYLSDSUCHVORB-IOSLPCCCSA-N 1-methyladenosine Chemical class C1=NC=2C(=N)N(C)C=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O GFYLSDSUCHVORB-IOSLPCCCSA-N 0.000 description 1
- KJUGUADJHNHALS-UHFFFAOYSA-N 1H-tetrazole Substances C=1N=NNN=1 KJUGUADJHNHALS-UHFFFAOYSA-N 0.000 description 1
- 108010041801 2',3'-Cyclic Nucleotide 3'-Phosphodiesterase Proteins 0.000 description 1
- 102100040458 2',3'-cyclic-nucleotide 3'-phosphodiesterase Human genes 0.000 description 1
- WVXRAFOPTSTNLL-NKWVEPMBSA-N 2',3'-dideoxyadenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1CC[C@@H](CO)O1 WVXRAFOPTSTNLL-NKWVEPMBSA-N 0.000 description 1
- CKTSBUTUHBMZGZ-SHYZEUOFSA-N 2'‐deoxycytidine Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 CKTSBUTUHBMZGZ-SHYZEUOFSA-N 0.000 description 1
- VKOBVWXKNCXXDE-BKDZISOFSA-N 2,2,3,3,4,4,5,5,6,6,7,7,8,8,9,9,10,10,11,11,12,12,13,13,14,14,15,15,16,16,17,17,18,18,19,19,20,20,20-nonatriacontadeuterioicosanoic acid Chemical compound C(C(C(C(C(C(C(C(C(C(C(C(C(C(C(C(C(C(C(C([2H])([2H])[2H])([2H])[2H])([2H])[2H])([2H])[2H])([2H])[2H])([2H])[2H])([2H])[2H])([2H])[2H])([2H])[2H])([2H])[2H])([2H])[2H])([2H])[2H])([2H])[2H])([2H])[2H])([2H])[2H])([2H])[2H])([2H])[2H])([2H])[2H])([2H])[2H])(=O)O VKOBVWXKNCXXDE-BKDZISOFSA-N 0.000 description 1
- XBDQKXXYIPTUBI-CBTSVUPCSA-N 2,2-dideuteriopropanoic acid Chemical compound [2H]C([2H])(C)C(O)=O XBDQKXXYIPTUBI-CBTSVUPCSA-N 0.000 description 1
- RGLYKWWBQGJZGM-ASHPLPESSA-N 2,3,6-trideuterio-4-[(E)-1,1,1,2,2-pentadeuterio-4-(4-hydroxyphenyl)hex-3-en-3-yl]phenol Chemical compound C(C(\C(\C1=C(C(=C(O)C(=C1)[2H])[2H])[2H])=C(/C1=CC=C(O)C=C1)\CC)([2H])[2H])([2H])([2H])[2H] RGLYKWWBQGJZGM-ASHPLPESSA-N 0.000 description 1
- BTOTXLJHDSNXMW-POYBYMJQSA-N 2,3-dideoxyuridine Chemical compound O1[C@H](CO)CC[C@@H]1N1C(=O)NC(=O)C=C1 BTOTXLJHDSNXMW-POYBYMJQSA-N 0.000 description 1
- 150000003923 2,5-pyrrolediones Chemical class 0.000 description 1
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 1
- FMYBFLOWKQRBST-UHFFFAOYSA-N 2-[bis(carboxymethyl)amino]acetic acid;nickel Chemical compound [Ni].OC(=O)CN(CC(O)=O)CC(O)=O FMYBFLOWKQRBST-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- QFESSFZUUIXGID-QYYRPYCUSA-N 2-amino-9-[(2R,3R,4S,5S)-4-(azidomethyl)-3-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-1H-purin-6-one Chemical compound NC1=NC2=C(N=CN2[C@@H]2O[C@H](CO)[C@@H](CN=[N+]=[N-])[C@H]2O)C(=O)N1 QFESSFZUUIXGID-QYYRPYCUSA-N 0.000 description 1
- KUZIQNHQBXJRAQ-OBXARNEKSA-N 2-amino-9-[(2R,3R,5S)-3-azido-5-(hydroxymethyl)oxolan-2-yl]-1H-purin-6-one Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)C[C@H]1N=[N+]=[N-] KUZIQNHQBXJRAQ-OBXARNEKSA-N 0.000 description 1
- WQZYJWINGYJUHN-DXTOWSMRSA-N 2-amino-9-[(2r,3r,4s,5s)-4-amino-3-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-3h-purin-6-one Chemical compound O[C@@H]1[C@H](N)[C@@H](CO)O[C@H]1N1C(NC(N)=NC2=O)=C2N=C1 WQZYJWINGYJUHN-DXTOWSMRSA-N 0.000 description 1
- VDOWHLFGBWKXJC-DXTOWSMRSA-N 2-amino-9-[(2r,3s,4s,5r)-4-fluoro-3-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-3h-purin-6-one Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](CO)[C@@H](F)[C@H]1O VDOWHLFGBWKXJC-DXTOWSMRSA-N 0.000 description 1
- HETOJIJPBJGZFJ-KVQBGUIXSA-N 2-amino-9-[(2r,4s,5s)-4-azido-5-(hydroxymethyl)oxolan-2-yl]-3h-purin-6-one Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@H]1C[C@H](N=[N+]=[N-])[C@@H](CO)O1 HETOJIJPBJGZFJ-KVQBGUIXSA-N 0.000 description 1
- OCLZPNCLRLDXJC-NTSWFWBYSA-N 2-amino-9-[(2r,5s)-5-(hydroxymethyl)oxolan-2-yl]-3h-purin-6-one Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@H]1CC[C@@H](CO)O1 OCLZPNCLRLDXJC-NTSWFWBYSA-N 0.000 description 1
- XSSYCIGJYCVRRK-TTYOAZMQSA-N 2-amino-9-[4-[dideuterio(hydroxy)methyl]cyclopent-2-en-1-yl]-3h-purin-6-one Chemical compound C1=CC(C([2H])(O)[2H])CC1N1C(NC(N)=NC2=O)=C2N=[13CH]1 XSSYCIGJYCVRRK-TTYOAZMQSA-N 0.000 description 1
- 125000001731 2-cyanoethyl group Chemical group [H]C([H])(*)C([H])([H])C#N 0.000 description 1
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical compound OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 description 1
- IQZWKGWOBPJWMX-IOSLPCCCSA-N 2-methyladenosine Chemical class C12=NC(C)=NC(N)=C2N=CN1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O IQZWKGWOBPJWMX-IOSLPCCCSA-N 0.000 description 1
- 125000001494 2-propynyl group Chemical group [H]C#CC([H])([H])* 0.000 description 1
- OROIAVZITJBGSM-OBXARNEKSA-N 3'-deoxyguanosine Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](CO)C[C@H]1O OROIAVZITJBGSM-OBXARNEKSA-N 0.000 description 1
- 108010037497 3'-nucleotidase Proteins 0.000 description 1
- ILDPUOKUEKVHIL-UHFFFAOYSA-N 3-(1,2-epoxypropyl)-5,6-dihydro-5-hydroxy-6-methylpyran-2-one Natural products OC1C(N)C(CO)OC1N1C2=NC=NC(N)=C2N=C1 ILDPUOKUEKVHIL-UHFFFAOYSA-N 0.000 description 1
- PYUKHTBSBVGAKT-XVFCMESISA-N 4-amino-1-[(2r,3r,4s,5s)-4-amino-3-hydroxy-5-(hydroxymethyl)oxolan-2-yl]pyrimidin-2-one Chemical compound O[C@@H]1[C@H](N)[C@@H](CO)O[C@H]1N1C(=O)N=C(N)C=C1 PYUKHTBSBVGAKT-XVFCMESISA-N 0.000 description 1
- GRDBEWZWKVQNKS-SHYZEUOFSA-N 4-amino-1-[(2r,3r,5s)-3-azido-5-(hydroxymethyl)oxolan-2-yl]pyrimidin-2-one Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](N=[N+]=[N-])C[C@@H](CO)O1 GRDBEWZWKVQNKS-SHYZEUOFSA-N 0.000 description 1
- ZHHOTKZTEUZTHX-SHYZEUOFSA-N 4-amino-1-[(2r,3r,5s)-3-hydroxy-5-(hydroxymethyl)oxolan-2-yl]pyrimidin-2-one Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)C[C@@H](CO)O1 ZHHOTKZTEUZTHX-SHYZEUOFSA-N 0.000 description 1
- PKOBNLOZXOHYOP-XVFCMESISA-N 4-amino-1-[(2r,3s,4s,5r)-4-fluoro-3-hydroxy-5-(hydroxymethyl)oxolan-2-yl]pyrimidin-2-one Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@H](F)[C@@H](CO)O1 PKOBNLOZXOHYOP-XVFCMESISA-N 0.000 description 1
- YIEFKLOVIROQIL-SHYZEUOFSA-N 4-amino-1-[(2r,4s,5s)-4-azido-5-(hydroxymethyl)oxolan-2-yl]pyrimidin-2-one Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](CO)[C@@H](N=[N+]=[N-])C1 YIEFKLOVIROQIL-SHYZEUOFSA-N 0.000 description 1
- JTEGQNOMFQHVDC-PIAXYVKQSA-N 4-amino-1-[(2r,5s)-2-(hydroxymethyl)-1,3-oxathiolan-5-yl]pyrimidin-2-one Chemical compound O=[13C]1[15N]=C(N)C=C[15N]1[C@H]1O[C@@H](CO)SC1 JTEGQNOMFQHVDC-PIAXYVKQSA-N 0.000 description 1
- STWTUEAWRAIWJG-UHFFFAOYSA-N 5-(1H-pyrazol-4-yl)-2-[6-(2,2,6,6-tetramethylpiperidin-4-yl)oxypyridazin-3-yl]phenol Chemical compound C1C(C)(C)NC(C)(C)CC1OC1=CC=C(C=2C(=CC(=CC=2)C2=CNN=C2)O)N=N1 STWTUEAWRAIWJG-UHFFFAOYSA-N 0.000 description 1
- ZAYHVCMSTBRABG-UHFFFAOYSA-N 5-Methylcytidine Natural products O=C1N=C(N)C(C)=CN1C1C(O)C(O)C(CO)O1 ZAYHVCMSTBRABG-UHFFFAOYSA-N 0.000 description 1
- MMUBPEFMCTVKTR-IBNKKVAHSA-N 5-[(2s,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)-2-methyloxolan-2-yl]-1h-pyrimidine-2,4-dione Chemical compound C=1NC(=O)NC(=O)C=1[C@]1(C)O[C@H](CO)[C@@H](O)[C@H]1O MMUBPEFMCTVKTR-IBNKKVAHSA-N 0.000 description 1
- JKNCSZDPWAVQAI-ZKWXMUAHSA-N 5-[(2s,3s,4r)-3,4-diaminothiolan-2-yl]pentanoic acid Chemical compound N[C@H]1CS[C@@H](CCCCC(O)=O)[C@H]1N JKNCSZDPWAVQAI-ZKWXMUAHSA-N 0.000 description 1
- DEQPBRIACBATHE-FXQIFTODSA-N 5-[(3as,4s,6ar)-2-oxo-1,3,3a,4,6,6a-hexahydrothieno[3,4-d]imidazol-4-yl]-2-iminopentanoic acid Chemical compound N1C(=O)N[C@@H]2[C@H](CCCC(=N)C(=O)O)SC[C@@H]21 DEQPBRIACBATHE-FXQIFTODSA-N 0.000 description 1
- QXDXBKZJFLRLCM-UAKXSSHOSA-N 5-hydroxyuridine Chemical class O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(O)=C1 QXDXBKZJFLRLCM-UAKXSSHOSA-N 0.000 description 1
- ASKZRYGFUPSJPN-UHFFFAOYSA-N 7-(4,7-diazaspiro[2.5]octan-7-yl)-2-(2,8-dimethylimidazo[1,2-b]pyridazin-6-yl)pyrido[1,2-a]pyrimidin-4-one Chemical compound CC1=CN2N=C(C=C(C)C2=N1)C1=CC(=O)N2C=C(C=CC2=N1)N1CCNC2(CC2)C1 ASKZRYGFUPSJPN-UHFFFAOYSA-N 0.000 description 1
- OGHAROSJZRTIOK-KQYNXXCUSA-O 7-methylguanosine Chemical class C1=2N=C(N)NC(=O)C=2[N+](C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OGHAROSJZRTIOK-KQYNXXCUSA-O 0.000 description 1
- WGUUHZZLLQSZJX-QYYRPYCUSA-N 9-[(2R,3R,4S,5S)-4-amino-3-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-1H-purin-6-one Chemical compound O[C@@H]1[C@H](N)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 WGUUHZZLLQSZJX-QYYRPYCUSA-N 0.000 description 1
- RPZDLTVHZJHPAW-BAJZRUMYSA-N 9-[(2r,3r,5s)-3-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-3h-purin-6-one Chemical compound O1[C@H](CO)C[C@@H](O)[C@@H]1N1C(NC=NC2=O)=C2N=C1 RPZDLTVHZJHPAW-BAJZRUMYSA-N 0.000 description 1
- POYFYFKHABGRAR-QYYRPYCUSA-N 9-[(2r,3s,4s,5r)-4-fluoro-3-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-3h-purin-6-one Chemical compound O[C@@H]1[C@H](F)[C@@H](CO)O[C@H]1N1C(NC=NC2=O)=C2N=C1 POYFYFKHABGRAR-QYYRPYCUSA-N 0.000 description 1
- SEUFIMAGNSSKRT-RRKCRQDMSA-N 9-[(2r,4s,5s)-4-azido-5-(hydroxymethyl)oxolan-2-yl]-3h-purin-6-one Chemical compound C1[C@H](N=[N+]=[N-])[C@@H](CO)O[C@H]1N1C(NC=NC2=O)=C2N=C1 SEUFIMAGNSSKRT-RRKCRQDMSA-N 0.000 description 1
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical group CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 1
- 241000589291 Acinetobacter Species 0.000 description 1
- 108700016155 Acyl transferases Proteins 0.000 description 1
- 102000057234 Acyl transferases Human genes 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical class NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 241000607534 Aeromonas Species 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- USFZMSVCRYTOJT-UHFFFAOYSA-N Ammonium acetate Chemical compound N.CC(O)=O USFZMSVCRYTOJT-UHFFFAOYSA-N 0.000 description 1
- 239000005695 Ammonium acetate Substances 0.000 description 1
- 229920000856 Amylose Polymers 0.000 description 1
- 241000193755 Bacillus cereus Species 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical class [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 101000909256 Caldicellulosiruptor bescii (strain ATCC BAA-1888 / DSM 6725 / Z-1320) DNA polymerase I Proteins 0.000 description 1
- 108010041952 Calmodulin Proteins 0.000 description 1
- 102000000584 Calmodulin Human genes 0.000 description 1
- 102000007132 Carboxyl and Carbamoyl Transferases Human genes 0.000 description 1
- 108010072957 Carboxyl and Carbamoyl Transferases Proteins 0.000 description 1
- 241001531266 Carnation Italian ringspot virus Species 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 241000270607 Chelonia mydas Species 0.000 description 1
- 241000588923 Citrobacter Species 0.000 description 1
- RGJOEKWQDUBAIZ-IBOSZNHHSA-N CoASH Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS)O[C@H]1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-IBOSZNHHSA-N 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 108020004394 Complementary RNA Proteins 0.000 description 1
- KQLDDLUWUFBQHP-UHFFFAOYSA-N Cordycepin Natural products C1=NC=2C(N)=NC=NC=2N1C1OCC(CO)C1O KQLDDLUWUFBQHP-UHFFFAOYSA-N 0.000 description 1
- 102100031673 Corneodesmosin Human genes 0.000 description 1
- VMQMZMRVKUZKQL-UHFFFAOYSA-N Cu+ Chemical compound [Cu+] VMQMZMRVKUZKQL-UHFFFAOYSA-N 0.000 description 1
- 101710111837 Cyclic phosphodiesterase Proteins 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- 108010092681 DNA Primase Proteins 0.000 description 1
- 102000016559 DNA Primase Human genes 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 241000252212 Danio rerio Species 0.000 description 1
- CKTSBUTUHBMZGZ-UHFFFAOYSA-N Deoxycytidine Natural products O=C1N=C(N)C=CN1C1OC(CO)C(O)C1 CKTSBUTUHBMZGZ-UHFFFAOYSA-N 0.000 description 1
- 108091027757 Deoxyribozyme Proteins 0.000 description 1
- 241000605716 Desulfovibrio Species 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- BXZVVICBKDXVGW-NKWVEPMBSA-N Didanosine Chemical compound O1[C@H](CO)CC[C@@H]1N1C(NC=NC2=O)=C2N=C1 BXZVVICBKDXVGW-NKWVEPMBSA-N 0.000 description 1
- SHIBSTMRCDJXLN-UHFFFAOYSA-N Digoxigenin Natural products C1CC(C2C(C3(C)CCC(O)CC3CC2)CC2O)(O)C2(C)C1C1=CC(=O)OC1 SHIBSTMRCDJXLN-UHFFFAOYSA-N 0.000 description 1
- 241000305071 Enterobacterales Species 0.000 description 1
- 101100510232 Enterobacteria phage T4 pseT gene Proteins 0.000 description 1
- 239000004593 Epoxy Substances 0.000 description 1
- 108010002700 Exoribonucleases Proteins 0.000 description 1
- 102000004678 Exoribonucleases Human genes 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- 108090000331 Firefly luciferases Proteins 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- SXRSQZLOMIGNAQ-UHFFFAOYSA-N Glutaraldehyde Chemical compound O=CCCCC=O SXRSQZLOMIGNAQ-UHFFFAOYSA-N 0.000 description 1
- 108010024636 Glutathione Proteins 0.000 description 1
- BCCRXDTUTZHDEU-VKHMYHEASA-N Gly-Ser Chemical compound NCC(=O)N[C@@H](CO)C(O)=O BCCRXDTUTZHDEU-VKHMYHEASA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 102000005744 Glycoside Hydrolases Human genes 0.000 description 1
- 108010031186 Glycoside Hydrolases Proteins 0.000 description 1
- 102000051366 Glycosyltransferases Human genes 0.000 description 1
- 108700023372 Glycosyltransferases Proteins 0.000 description 1
- 239000007995 HEPES buffer Substances 0.000 description 1
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical compound OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 description 1
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 1
- 101000749809 Homo sapiens 2',3'-cyclic-nucleotide 3'-phosphodiesterase Proteins 0.000 description 1
- 101000796277 Homo sapiens C-type natriuretic peptide Proteins 0.000 description 1
- 101000586086 Homo sapiens Origin recognition complex subunit 4 Proteins 0.000 description 1
- 101001109588 Homo sapiens Polynucleotide 5'-hydroxyl-kinase NOL9 Proteins 0.000 description 1
- 101000692933 Homo sapiens Ribonuclease 4 Proteins 0.000 description 1
- 101000667595 Homo sapiens Ribonuclease pancreatic Proteins 0.000 description 1
- 108090000144 Human Proteins Proteins 0.000 description 1
- 102000003839 Human Proteins Human genes 0.000 description 1
- AVXURJPOCDRRFD-UHFFFAOYSA-N Hydroxylamine Chemical compound ON AVXURJPOCDRRFD-UHFFFAOYSA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- RAXXELZNTBOGNW-UHFFFAOYSA-O Imidazolium Chemical compound C1=C[NH+]=CN1 RAXXELZNTBOGNW-UHFFFAOYSA-O 0.000 description 1
- 241000222712 Kinetoplastida Species 0.000 description 1
- 241000588748 Klebsiella Species 0.000 description 1
- QNAYBMKLOCPYGJ-IALWIIEESA-N L-alanine-2,3,3,3-d4 Chemical compound [2H]C([2H])([2H])[C@]([2H])(N)C(O)=O QNAYBMKLOCPYGJ-IALWIIEESA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BFEYZEMLSA-N L-proline-d7 Chemical compound [2H]C1([2H])N[C@]([2H])(C(O)=O)C([2H])([2H])C1([2H])[2H] ONIBWKKTOPOVIA-BFEYZEMLSA-N 0.000 description 1
- GDBQQVLCIARPGH-UHFFFAOYSA-N Leupeptin Natural products CC(C)CC(NC(C)=O)C(=O)NC(CC(C)C)C(=O)NC(C=O)CCCN=C(N)N GDBQQVLCIARPGH-UHFFFAOYSA-N 0.000 description 1
- 108090000856 Lyases Proteins 0.000 description 1
- 102000004317 Lyases Human genes 0.000 description 1
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 239000007987 MES buffer Substances 0.000 description 1
- 239000007993 MOPS buffer Substances 0.000 description 1
- 241000282560 Macaca mulatta Species 0.000 description 1
- FYYHWMGAXLPEAU-UHFFFAOYSA-N Magnesium Chemical class [Mg] FYYHWMGAXLPEAU-UHFFFAOYSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- AFVFQIVMOAPDHO-UHFFFAOYSA-N Methanesulfonic acid Chemical class CS(O)(=O)=O AFVFQIVMOAPDHO-UHFFFAOYSA-N 0.000 description 1
- 241001302042 Methanothermobacter thermautotrophicus Species 0.000 description 1
- 241000005783 Monographella albescens Species 0.000 description 1
- YNAVUWVOSKDBBP-UHFFFAOYSA-N Morpholine Natural products C1COCCN1 YNAVUWVOSKDBBP-UHFFFAOYSA-N 0.000 description 1
- 108010047290 Multifunctional Enzymes Proteins 0.000 description 1
- 102000006833 Multifunctional Enzymes Human genes 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- OKIZCWYLBDKLSU-UHFFFAOYSA-M N,N,N-Trimethylmethanaminium chloride Chemical compound [Cl-].C[N+](C)(C)C OKIZCWYLBDKLSU-UHFFFAOYSA-M 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- NQTADLQHYWFPDB-UHFFFAOYSA-N N-Hydroxysuccinimide Chemical class ON1C(=O)CCC1=O NQTADLQHYWFPDB-UHFFFAOYSA-N 0.000 description 1
- BAQMYDQNMFBZNA-UHFFFAOYSA-N N-biotinyl-L-lysine Natural products N1C(=O)NC2C(CCCCC(=O)NCCCCC(N)C(O)=O)SCC21 BAQMYDQNMFBZNA-UHFFFAOYSA-N 0.000 description 1
- OBDQZNFPEZXMBB-SQEXRHODSA-N NC1=NC(=O)N(C=C1)[C@@H]1O[C@H](CO)[C@@H](CN=[N+]=[N-])[C@H]1O Chemical compound NC1=NC(=O)N(C=C1)[C@@H]1O[C@H](CO)[C@@H](CN=[N+]=[N-])[C@H]1O OBDQZNFPEZXMBB-SQEXRHODSA-N 0.000 description 1
- 238000005481 NMR spectroscopy Methods 0.000 description 1
- 241000221961 Neurospora crassa Species 0.000 description 1
- 102000007999 Nuclear Proteins Human genes 0.000 description 1
- 108010089610 Nuclear Proteins Proteins 0.000 description 1
- 108020003217 Nuclear RNA Proteins 0.000 description 1
- 102000043141 Nuclear RNA Human genes 0.000 description 1
- MSWZFWKMSRAUBD-ARXMZJIQSA-N OC1[C@H]([15NH2])[C@@H](O)[C@H](O)[C@H](O1)CO Chemical compound OC1[C@H]([15NH2])[C@@H](O)[C@H](O)[C@H](O1)CO MSWZFWKMSRAUBD-ARXMZJIQSA-N 0.000 description 1
- RTPZRVHZYXEEOB-SQEXRHODSA-N OC[C@H]1O[C@H]([C@H](O)[C@@H]1CN=[N+]=[N-])N1C=CC(=O)NC1=O Chemical compound OC[C@H]1O[C@H]([C@H](O)[C@@H]1CN=[N+]=[N-])N1C=CC(=O)NC1=O RTPZRVHZYXEEOB-SQEXRHODSA-N 0.000 description 1
- 102100030030 Origin recognition complex subunit 4 Human genes 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 241000282577 Pan troglodytes Species 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- 108090000279 Peptidyltransferases Proteins 0.000 description 1
- 241000009328 Perro Species 0.000 description 1
- CXOFVDLJLONNDW-UHFFFAOYSA-N Phenytoin Chemical compound N1C(=O)NC(=O)C1(C=1C=CC=CC=1)C1=CC=CC=C1 CXOFVDLJLONNDW-UHFFFAOYSA-N 0.000 description 1
- 102220492040 Phospholipid scramblase 1_D80A_mutation Human genes 0.000 description 1
- 102000045595 Phosphoprotein Phosphatases Human genes 0.000 description 1
- 108700019535 Phosphoprotein Phosphatases Proteins 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 101150079960 Pnkp gene Proteins 0.000 description 1
- 102100022739 Polynucleotide 5'-hydroxyl-kinase NOL9 Human genes 0.000 description 1
- 239000004793 Polystyrene Substances 0.000 description 1
- 239000004372 Polyvinyl alcohol Substances 0.000 description 1
- ZLMJMSJWJFRBEC-UHFFFAOYSA-N Potassium Chemical compound [K] ZLMJMSJWJFRBEC-UHFFFAOYSA-N 0.000 description 1
- 108700040121 Protein Methyltransferases Proteins 0.000 description 1
- 102000055027 Protein Methyltransferases Human genes 0.000 description 1
- 229940096437 Protein S Drugs 0.000 description 1
- PTJWIQPHWPFNBW-UHFFFAOYSA-N Pseudouridine C Natural products OC1C(O)C(CO)OC1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-UHFFFAOYSA-N 0.000 description 1
- 241000205160 Pyrococcus Species 0.000 description 1
- 101000902592 Pyrococcus furiosus (strain ATCC 43587 / DSM 3638 / JCM 8422 / Vc1) DNA polymerase Proteins 0.000 description 1
- RWRDLPDLKQPQOW-UHFFFAOYSA-O Pyrrolidinium ion Chemical compound C1CC[NH2+]C1 RWRDLPDLKQPQOW-UHFFFAOYSA-O 0.000 description 1
- 102000009572 RNA Polymerase II Human genes 0.000 description 1
- 108010009460 RNA Polymerase II Proteins 0.000 description 1
- 101710188536 RNA ligase 1 Proteins 0.000 description 1
- 101710188535 RNA ligase 2 Proteins 0.000 description 1
- 239000013616 RNA primer Substances 0.000 description 1
- 101710093506 RNA-editing ligase 1, mitochondrial Proteins 0.000 description 1
- 101710204104 RNA-editing ligase 2, mitochondrial Proteins 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- 101150030456 RNASE4 gene Proteins 0.000 description 1
- 108091028733 RNTP Proteins 0.000 description 1
- 102000042498 RNase T2 family Human genes 0.000 description 1
- 108091078656 RNase T2 family Proteins 0.000 description 1
- 241000270942 Rana pipiens Species 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 241000193448 Ruminiclostridium thermocellum Species 0.000 description 1
- 241000607142 Salmonella Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 241000607720 Serratia Species 0.000 description 1
- 241000607768 Shigella Species 0.000 description 1
- 108020004459 Small interfering RNA Proteins 0.000 description 1
- 241000191967 Staphylococcus aureus Species 0.000 description 1
- 241000122971 Stenotrophomonas Species 0.000 description 1
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical group [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 1
- UZMAPBJVXOGOFT-UHFFFAOYSA-N Syringetin Natural products COC1=C(O)C(OC)=CC(C2=C(C(=O)C3=C(O)C=C(O)C=C3O2)O)=C1 UZMAPBJVXOGOFT-UHFFFAOYSA-N 0.000 description 1
- 101710137500 T7 RNA polymerase Proteins 0.000 description 1
- 108010076818 TEV protease Proteins 0.000 description 1
- 241000255588 Tephritidae Species 0.000 description 1
- 241000589499 Thermus thermophilus Species 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 1
- 239000007997 Tricine buffer Substances 0.000 description 1
- 101000956368 Trittame loki CRISP/Allergen/PR-1 Proteins 0.000 description 1
- 208000034953 Twin anemia-polycythemia sequence Diseases 0.000 description 1
- 241000607598 Vibrio Species 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 108010031318 Vitronectin Proteins 0.000 description 1
- 241000607734 Yersinia <bacteria> Species 0.000 description 1
- WREGKURFCTUGRC-POYBYMJQSA-N Zalcitabine Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](CO)CC1 WREGKURFCTUGRC-POYBYMJQSA-N 0.000 description 1
- XDRZJDXXQHFAAE-RRKCRQDMSA-N [(2s,3s,5r)-5-(6-aminopurin-9-yl)-3-azidooxolan-2-yl]methanol Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](N=[N+]=[N-])[C@@H](CO)O1 XDRZJDXXQHFAAE-RRKCRQDMSA-N 0.000 description 1
- JVPFDOXMGPHRJG-BAJZRUMYSA-N [(2s,4r,5r)-5-(6-aminopurin-9-yl)-4-azidooxolan-2-yl]methanol Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)C[C@H]1N=[N+]=[N-] JVPFDOXMGPHRJG-BAJZRUMYSA-N 0.000 description 1
- OLXZPDWKRNYJJZ-UQXAQLLXSA-N [15NH2]C1=[15N]C=[15N]C2=C1[15N]=C[15N]2[C@@H](C1)O[C@H](CO)[C@H]1O Chemical compound [15NH2]C1=[15N]C=[15N]C2=C1[15N]=C[15N]2[C@@H](C1)O[C@H](CO)[C@H]1O OLXZPDWKRNYJJZ-UQXAQLLXSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 150000003838 adenosines Chemical class 0.000 description 1
- 239000000443 aerosol Substances 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 238000003450 affinity purification method Methods 0.000 description 1
- 230000004931 aggregating effect Effects 0.000 description 1
- 239000012773 agricultural material Substances 0.000 description 1
- 239000003570 air Substances 0.000 description 1
- 150000001299 aldehydes Chemical class 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- 125000005262 alkoxyamine group Chemical group 0.000 description 1
- 150000001348 alkyl chlorides Chemical group 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- HDTRYLNUVZCQOY-LIZSDCNHSA-N alpha,alpha-trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 description 1
- 150000001408 amides Chemical class 0.000 description 1
- 125000002344 aminooxy group Chemical group [H]N([H])O[*] 0.000 description 1
- 229940043376 ammonium acetate Drugs 0.000 description 1
- 235000019257 ammonium acetate Nutrition 0.000 description 1
- 208000007502 anemia Diseases 0.000 description 1
- 239000003945 anionic surfactant Substances 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 239000012062 aqueous buffer Substances 0.000 description 1
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 125000000613 asparagine group Chemical group N[C@@H](CC(N)=O)C(=O)* 0.000 description 1
- TVWOWDDBXAFQDG-DQRAZIAOSA-N azorubine Chemical compound C1=CC=C2C(\N=N/C3=C(C4=CC=CC=C4C(=C3)S(O)(=O)=O)O)=CC=C(S(O)(=O)=O)C2=C1 TVWOWDDBXAFQDG-DQRAZIAOSA-N 0.000 description 1
- 235000012733 azorubine Nutrition 0.000 description 1
- 230000004888 barrier function Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- PXXJHWLDUBFPOL-UHFFFAOYSA-N benzamidine Chemical compound NC(=N)C1=CC=CC=C1 PXXJHWLDUBFPOL-UHFFFAOYSA-N 0.000 description 1
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 1
- WGDUUQDYDIIBKT-UHFFFAOYSA-N beta-Pseudouridine Natural products OC1OC(CN2C=CC(=O)NC2=O)C(O)C1O WGDUUQDYDIIBKT-UHFFFAOYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 1
- 229960003237 betaine Drugs 0.000 description 1
- 230000001588 bifunctional effect Effects 0.000 description 1
- 239000003613 bile acid Substances 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- BAQMYDQNMFBZNA-MNXVOIDGSA-N biocytin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)NCCCC[C@H](N)C(O)=O)SC[C@@H]21 BAQMYDQNMFBZNA-MNXVOIDGSA-N 0.000 description 1
- KCSKCIQYNAOBNQ-YBSFLMRUSA-N biotin sulfoxide Chemical compound N1C(=O)N[C@H]2CS(=O)[C@@H](CCCCC(=O)O)[C@H]21 KCSKCIQYNAOBNQ-YBSFLMRUSA-N 0.000 description 1
- UCCKRVYTJPMHRO-UHFFFAOYSA-N bis(trifluoromethylsulfonyl)azanide;1-butyl-2,3-dimethylimidazol-3-ium Chemical compound CCCC[N+]=1C=CN(C)C=1C.FC(F)(F)S(=O)(=O)[N-]S(=O)(=O)C(F)(F)F UCCKRVYTJPMHRO-UHFFFAOYSA-N 0.000 description 1
- 125000005621 boronate group Chemical class 0.000 description 1
- 229950001657 branaplam Drugs 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 239000011575 calcium Chemical class 0.000 description 1
- 238000002619 cancer immunotherapy Methods 0.000 description 1
- 150000001718 carbodiimides Chemical class 0.000 description 1
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 description 1
- PFKFTWBEEFSNDU-UHFFFAOYSA-N carbonyldiimidazole Chemical class C1=CN=CN1C(=O)N1C=CN=C1 PFKFTWBEEFSNDU-UHFFFAOYSA-N 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 239000003054 catalyst Substances 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 241000902900 cellular organisms Species 0.000 description 1
- 230000004637 cellular stress Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000013522 chelant Substances 0.000 description 1
- SIOLHBFVZMHKPF-MOXYJHBNSA-N chembl436727 Chemical compound C([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN)[C@@H](C)O)C1=CNC=N1 SIOLHBFVZMHKPF-MOXYJHBNSA-N 0.000 description 1
- 239000013043 chemical agent Substances 0.000 description 1
- 238000001311 chemical methods and process Methods 0.000 description 1
- 239000007806 chemical reaction intermediate Substances 0.000 description 1
- 239000003638 chemical reducing agent Substances 0.000 description 1
- 235000012000 cholesterol Nutrition 0.000 description 1
- 229960001231 choline Drugs 0.000 description 1
- OEYIOHPDSNJKLS-UHFFFAOYSA-N choline Chemical compound C[N+](C)(C)CCO OEYIOHPDSNJKLS-UHFFFAOYSA-N 0.000 description 1
- 238000013375 chromatographic separation Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- WDDPHFBMKLOVOX-AYQXTPAHSA-N clofarabine Chemical compound C1=NC=2C(N)=NC(Cl)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@@H]1F WDDPHFBMKLOVOX-AYQXTPAHSA-N 0.000 description 1
- 229960000928 clofarabine Drugs 0.000 description 1
- ZIHHMGTYZOSFRC-UWWAPWIJSA-M cobamamide Chemical compound C1(/[C@](C)(CCC(=O)NC[C@H](C)OP(O)(=O)OC2[C@H]([C@H](O[C@@H]2CO)N2C3=CC(C)=C(C)C=C3N=C2)O)[C@@H](CC(N)=O)[C@]2(N1[Co+]C[C@@H]1[C@H]([C@@H](O)[C@@H](O1)N1C3=NC=NC(N)=C3N=C1)O)[H])=C(C)\C([C@H](C/1(C)C)CCC(N)=O)=N\C\1=C/C([C@H]([C@@]\1(CC(N)=O)C)CCC(N)=O)=N/C/1=C(C)\C1=N[C@]2(C)[C@@](C)(CC(N)=O)[C@@H]1CCC(N)=O ZIHHMGTYZOSFRC-UWWAPWIJSA-M 0.000 description 1
- 239000011789 cobamamide Substances 0.000 description 1
- 235000006279 cobamamide Nutrition 0.000 description 1
- RGJOEKWQDUBAIZ-UHFFFAOYSA-N coenzime A Natural products OC1C(OP(O)(O)=O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-UHFFFAOYSA-N 0.000 description 1
- 239000005516 coenzyme A Substances 0.000 description 1
- 229940093530 coenzyme a Drugs 0.000 description 1
- 238000001360 collision-induced dissociation Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- OFEZSBMBBKLLBJ-BAJZRUMYSA-N cordycepin Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)C[C@H]1O OFEZSBMBBKLLBJ-BAJZRUMYSA-N 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000037029 cross reaction Effects 0.000 description 1
- 239000000287 crude extract Substances 0.000 description 1
- 238000006352 cycloaddition reaction Methods 0.000 description 1
- 108010031180 cypridina luciferase Proteins 0.000 description 1
- 231100000135 cytotoxicity Toxicity 0.000 description 1
- 230000003013 cytotoxicity Effects 0.000 description 1
- 230000009615 deamination Effects 0.000 description 1
- 238000006481 deamination reaction Methods 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000003936 denaturing gel electrophoresis Methods 0.000 description 1
- KXGVEGMKQFWNSR-FCSCGBJGSA-N deoxycholic acid-2,2,4,4-d4 Chemical compound C([C@@H]12)[C@H](O)[C@]3(C)[C@@H]([C@H](C)CCC(O)=O)CC[C@H]3[C@@H]1CC[C@H]1[C@]2(C)CC([2H])([2H])[C@@H](O)C1([2H])[2H] KXGVEGMKQFWNSR-FCSCGBJGSA-N 0.000 description 1
- KDTSHFARGAKYJN-UHFFFAOYSA-N dephosphocoenzyme A Natural products OC1C(O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 KDTSHFARGAKYJN-UHFFFAOYSA-N 0.000 description 1
- 238000010511 deprotection reaction Methods 0.000 description 1
- 239000012954 diazonium Substances 0.000 description 1
- 150000001989 diazonium salts Chemical class 0.000 description 1
- WCRDXYSYPCEIAK-UHFFFAOYSA-N dibutylstannane Chemical compound CCCC[SnH2]CCCC WCRDXYSYPCEIAK-UHFFFAOYSA-N 0.000 description 1
- 238000009792 diffusion process Methods 0.000 description 1
- QONQRTHLHBTMGP-UHFFFAOYSA-N digitoxigenin Natural products CC12CCC(C3(CCC(O)CC3CC3)C)C3C11OC1CC2C1=CC(=O)OC1 QONQRTHLHBTMGP-UHFFFAOYSA-N 0.000 description 1
- SHIBSTMRCDJXLN-KCZCNTNESA-N digoxigenin Chemical compound C1([C@@H]2[C@@]3([C@@](CC2)(O)[C@H]2[C@@H]([C@@]4(C)CC[C@H](O)C[C@H]4CC2)C[C@H]3O)C)=CC(=O)OC1 SHIBSTMRCDJXLN-KCZCNTNESA-N 0.000 description 1
- KCFYHBSOLOXZIF-UHFFFAOYSA-N dihydrochrysin Natural products COC1=C(O)C(OC)=CC(C2OC3=CC(O)=CC(O)=C3C(=O)C2)=C1 KCFYHBSOLOXZIF-UHFFFAOYSA-N 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 125000000118 dimethyl group Chemical group [H]C([H])([H])* 0.000 description 1
- NAGJZTKCGNOGPW-UHFFFAOYSA-K dioxido-sulfanylidene-sulfido-$l^{5}-phosphane Chemical compound [O-]P([O-])([S-])=S NAGJZTKCGNOGPW-UHFFFAOYSA-K 0.000 description 1
- 229940042399 direct acting antivirals protease inhibitors Drugs 0.000 description 1
- 235000019800 disodium phosphate Nutrition 0.000 description 1
- 150000002019 disulfides Chemical class 0.000 description 1
- 229960003638 dopamine Drugs 0.000 description 1
- 239000012039 electrophile Substances 0.000 description 1
- 230000002616 endonucleolytic effect Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000006862 enzymatic digestion Effects 0.000 description 1
- 150000002118 epoxides Chemical class 0.000 description 1
- 125000003700 epoxy group Chemical group 0.000 description 1
- 238000011067 equilibration Methods 0.000 description 1
- 229960003276 erythromycin Drugs 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- RTZKZFJDLAIYFH-UHFFFAOYSA-N ether Substances CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 210000001723 extracellular space Anatomy 0.000 description 1
- FVTCRASFADXXNN-SCRDCRAPSA-N flavin mononucleotide Chemical compound OP(=O)(O)OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O FVTCRASFADXXNN-SCRDCRAPSA-N 0.000 description 1
- FVTCRASFADXXNN-UHFFFAOYSA-N flavin mononucleotide Natural products OP(=O)(O)OCC(O)C(O)C(O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O FVTCRASFADXXNN-UHFFFAOYSA-N 0.000 description 1
- 239000011768 flavin mononucleotide Substances 0.000 description 1
- 229940013640 flavin mononucleotide Drugs 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 230000005021 gait Effects 0.000 description 1
- IRSCQMHQWWYFCW-UHFFFAOYSA-N ganciclovir Chemical compound O=C1NC(N)=NC2=C1N=CN2COC(CO)CO IRSCQMHQWWYFCW-UHFFFAOYSA-N 0.000 description 1
- 229960002963 ganciclovir Drugs 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 229960003180 glutathione Drugs 0.000 description 1
- 150000004820 halides Chemical class 0.000 description 1
- 125000005179 haloacetyl group Chemical group 0.000 description 1
- 230000035876 healing Effects 0.000 description 1
- 229960002897 heparin Drugs 0.000 description 1
- 229920000669 heparin Polymers 0.000 description 1
- IPCSVZSSVZVIGE-XPOOIHDOSA-N hexadecanoic acid Chemical compound CCCCCCCCCCCCCC[13CH2]C(O)=O IPCSVZSSVZVIGE-XPOOIHDOSA-N 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 238000007625 higher-energy collisional dissociation Methods 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 102000047974 human CNP Human genes 0.000 description 1
- 229940042795 hydrazides for tuberculosis treatment Drugs 0.000 description 1
- 150000002429 hydrazines Chemical class 0.000 description 1
- 230000005660 hydrophilic surface Effects 0.000 description 1
- 150000002443 hydroxylamines Chemical class 0.000 description 1
- 230000015784 hyperosmotic salinity response Effects 0.000 description 1
- 230000008102 immune modulation Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 239000002608 ionic liquid Substances 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- 150000002527 isonitriles Chemical class 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 150000002576 ketones Chemical class 0.000 description 1
- ONIBWKKTOPOVIA-XAFSXMPTSA-N l-proline-13c5,15n Chemical compound O[13C](=O)[13C@@H]1[13CH2][13CH2][13CH2][15NH]1 ONIBWKKTOPOVIA-XAFSXMPTSA-N 0.000 description 1
- 101150066555 lacZ gene Proteins 0.000 description 1
- 229960001627 lamivudine Drugs 0.000 description 1
- JTEGQNOMFQHVDC-NKWVEPMBSA-N lamivudine Chemical compound O=C1N=C(N)C=CN1[C@H]1O[C@@H](CO)SC1 JTEGQNOMFQHVDC-NKWVEPMBSA-N 0.000 description 1
- 239000010410 layer Substances 0.000 description 1
- 230000003902 lesion Effects 0.000 description 1
- GDBQQVLCIARPGH-ULQDDVLXSA-N leupeptin Chemical compound CC(C)C[C@H](NC(C)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C=O)CCCN=C(N)N GDBQQVLCIARPGH-ULQDDVLXSA-N 0.000 description 1
- 108010052968 leupeptin Proteins 0.000 description 1
- 235000019136 lipoic acid Nutrition 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 239000012139 lysis buffer Substances 0.000 description 1
- 108010059585 mRNA decapping enzymes Proteins 0.000 description 1
- 230000017156 mRNA modification Effects 0.000 description 1
- 229910052749 magnesium Inorganic materials 0.000 description 1
- 239000011777 magnesium Substances 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 229960002160 maltose Drugs 0.000 description 1
- 125000003071 maltose group Chemical group 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- WPBNNNQJVZRUHP-UHFFFAOYSA-L manganese(2+);methyl n-[[2-(methoxycarbonylcarbamothioylamino)phenyl]carbamothioyl]carbamate;n-[2-(sulfidocarbothioylamino)ethyl]carbamodithioate Chemical class [Mn+2].[S-]C(=S)NCCNC([S-])=S.COC(=O)NC(=S)NC1=CC=CC=C1NC(=S)NC(=O)OC WPBNNNQJVZRUHP-UHFFFAOYSA-L 0.000 description 1
- 239000012567 medical material Substances 0.000 description 1
- 229910021645 metal ion Inorganic materials 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 108091070501 miRNA Proteins 0.000 description 1
- 239000003094 microcapsule Substances 0.000 description 1
- 150000004712 monophosphates Chemical class 0.000 description 1
- 150000002772 monosaccharides Chemical class 0.000 description 1
- 108010087904 neutravidin Proteins 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 1
- 108091027963 non-coding RNA Proteins 0.000 description 1
- 102000042567 non-coding RNA Human genes 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 229920001542 oligosaccharide Polymers 0.000 description 1
- 150000002482 oligosaccharides Chemical class 0.000 description 1
- 101150093139 ompT gene Proteins 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- CPJSUEIXXCENMM-UHFFFAOYSA-N p-ethoxyacetanilide Natural products CCOC1=CC=C(NC(C)=O)C=C1 CPJSUEIXXCENMM-UHFFFAOYSA-N 0.000 description 1
- 239000012188 paraffin wax Substances 0.000 description 1
- 230000005298 paramagnetic effect Effects 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 229950000964 pepstatin Drugs 0.000 description 1
- 108010091212 pepstatin Proteins 0.000 description 1
- FAXGPCHRFPCXOO-LXTPJMTPSA-N pepstatin A Chemical compound OC(=O)C[C@H](O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)C[C@H](O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)NC(=O)CC(C)C FAXGPCHRFPCXOO-LXTPJMTPSA-N 0.000 description 1
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 1
- KHIWWQKSHDUIBK-UHFFFAOYSA-N periodic acid Chemical compound OI(=O)(=O)=O KHIWWQKSHDUIBK-UHFFFAOYSA-N 0.000 description 1
- 230000002572 peristaltic effect Effects 0.000 description 1
- 229960003893 phenacetin Drugs 0.000 description 1
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 1
- NMHMNPHRMNGLLB-UHFFFAOYSA-N phloretic acid Chemical compound OC(=O)CCC1=CC=C(O)C=C1 NMHMNPHRMNGLLB-UHFFFAOYSA-N 0.000 description 1
- 150000003003 phosphines Chemical class 0.000 description 1
- XUYJLQHKOGNDPB-UHFFFAOYSA-N phosphonoacetic acid Chemical compound OC(=O)CP(O)(O)=O XUYJLQHKOGNDPB-UHFFFAOYSA-N 0.000 description 1
- 229910000073 phosphorus hydride Inorganic materials 0.000 description 1
- 230000000865 phosphorylative effect Effects 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 239000003075 phytoestrogen Substances 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 229920002704 polyhistidine Polymers 0.000 description 1
- 229920002223 polystyrene Polymers 0.000 description 1
- 229920002451 polyvinyl alcohol Polymers 0.000 description 1
- 235000019422 polyvinyl alcohol Nutrition 0.000 description 1
- 239000011591 potassium Substances 0.000 description 1
- 229910052700 potassium Inorganic materials 0.000 description 1
- 239000004323 potassium nitrate Substances 0.000 description 1
- 235000010333 potassium nitrate Nutrition 0.000 description 1
- 238000011533 pre-incubation Methods 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 230000002335 preservative effect Effects 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 230000000069 prophylactic effect Effects 0.000 description 1
- 229940021993 prophylactic vaccine Drugs 0.000 description 1
- 125000006239 protecting group Chemical group 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- PTJWIQPHWPFNBW-GBNDHIKLSA-N pseudouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-GBNDHIKLSA-N 0.000 description 1
- 108010049718 pseudouridine synthases Proteins 0.000 description 1
- WHMDPDGBKYUEMW-UHFFFAOYSA-N pyridine-2-thiol Chemical compound SC1=CC=CC=N1 WHMDPDGBKYUEMW-UHFFFAOYSA-N 0.000 description 1
- JUJWROOIHBZHMG-UHFFFAOYSA-O pyridinium Chemical compound C1=CC=[NH+]C=C1 JUJWROOIHBZHMG-UHFFFAOYSA-O 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 239000000376 reactant Substances 0.000 description 1
- 239000011535 reaction buffer Substances 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 238000009256 replacement therapy Methods 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 239000011369 resultant mixture Substances 0.000 description 1
- 235000019231 riboflavin-5'-phosphate Nutrition 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- DWRXFEITVBNRMK-JXOAFFINSA-N ribothymidine Chemical class O=C1NC(=O)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 DWRXFEITVBNRMK-JXOAFFINSA-N 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 229940121322 risdiplam Drugs 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 239000002094 self assembled monolayer Substances 0.000 description 1
- 239000013545 self-assembled monolayer Substances 0.000 description 1
- 150000007659 semicarbazones Chemical class 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 108091069025 single-strand RNA Proteins 0.000 description 1
- 238000001542 size-exclusion chromatography Methods 0.000 description 1
- 210000003491 skin Anatomy 0.000 description 1
- 239000004055 small Interfering RNA Substances 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 235000017557 sodium bicarbonate Nutrition 0.000 description 1
- 229910000030 sodium bicarbonate Inorganic materials 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 150000003431 steroids Chemical class 0.000 description 1
- 239000011593 sulfur Substances 0.000 description 1
- 125000004434 sulfur atom Chemical group 0.000 description 1
- YBBRCQOCSYXUOC-UHFFFAOYSA-N sulfuryl dichloride Chemical class ClS(Cl)(=O)=O YBBRCQOCSYXUOC-UHFFFAOYSA-N 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 239000012134 supernatant fraction Substances 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 229920001059 synthetic polymer Polymers 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 125000001981 tert-butyldimethylsilyl group Chemical group [H]C([H])([H])[Si]([H])(C([H])([H])[H])[*]C(C([H])([H])[H])(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 150000004905 tetrazines Chemical class 0.000 description 1
- 150000003536 tetrazoles Chemical class 0.000 description 1
- 229940021747 therapeutic vaccine Drugs 0.000 description 1
- 150000007970 thio esters Chemical class 0.000 description 1
- 229960002663 thioctic acid Drugs 0.000 description 1
- 125000003396 thiol group Chemical group [H]S* 0.000 description 1
- 125000005490 tosylate group Chemical group 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 108700012359 toxins Proteins 0.000 description 1
- PPDADIYYMSXQJK-UHFFFAOYSA-N trichlorosilicon Chemical group Cl[Si](Cl)Cl PPDADIYYMSXQJK-UHFFFAOYSA-N 0.000 description 1
- 125000002264 triphosphate group Chemical class [H]OP(=O)(O[H])OP(=O)(O[H])OP(=O)(O[H])O* 0.000 description 1
- 210000004881 tumor cell Anatomy 0.000 description 1
- 230000004906 unfolded protein response Effects 0.000 description 1
- LEHOTFFKMJEONL-IOOOXAEESA-N uric acid-1,3-15n2 Chemical compound [15NH]1C(=O)[15NH]C(=O)C2=C1NC(=O)N2 LEHOTFFKMJEONL-IOOOXAEESA-N 0.000 description 1
- 229940054967 vanquish Drugs 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 239000011534 wash buffer Substances 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
- 239000002888 zwitterionic surfactant Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
Definitions
- RNA samples may be digested with one or more endoribonuclease(s) of selected specificity.
- RNA structure may interfere with the activity of an endoribonuclease.
- RNA-based therapeutics and vaccines e g., RNA-based therapeutics and vaccines.
- present disclosure relates to methods and compositions for analyzing polyribonucleotides including natural and/or synthetic RNAs.
- a composition may comprise, for example, an endoribonuclease having an amino acid sequence that (i) corresponds to an amino acid sequence of a first species (e.g., a vertebrate species (for example, Homo sapiens, Sus scrofa), a bacterial species (for example, Escherichia coll), a fungus species (for example, Aspergillus oryzae), a plant species (for example, Momordica charantia, Cucumis sativus), and an archaea species (for example, Pyrococcus furiosus ⁇ or (ii) is a non-naturally occurring sequence; and/or an RNA end repair enzyme having an amino acid sequence that (i) corresponds to an amino acid sequence of a species other than the first species (e.g., a bacterial species or a bacteriophage species) or (ii) is a non-naturally occurring sequence.
- a first species e.g., a vertebrate species (for example, Hom
- an endoribonuclease may have an amino acid sequence that corresponds to an amino acid sequence of a vertebrate (e.g., mammalian) species.
- An endoribonuclease may have specificity selected from (1) cleavage after a specific nucleotide followed by a purine, (2) cleavage after a specific nucleotide followed by a pyrimidine, (3) cleavage after a purine followed by a specific nucleotide, and (4) cleavage after a pyrimidine followed by a specific nucleotide, according to some embodiments.
- An endoribonuclease may have an average cleavage rate of once every 6-12 nucleotides.
- Example endoribonucleases include hRNase 4, RNase Tl, RNase U2, RNase A, Colicin E5, MCI, Cusativin, Csxl, MazF, ChpB, MqsR, and YafO.
- An end repair enzyme may comprise phosphodiesterase and phosphomonoesterase activities.
- An end repair enzyme may comprise a polynucleotide kinase-phosphatase.
- Example end repair enzymes include T4 polynucleotide kinase-phosphatases and Cth polynucleotide kinase-phosphatases.
- a composition may further comprise one or more of a denaturing agent, a buffering agent, and an RNA substrate.
- a composition may comprise one or more oligoribonucleotides, which may be, for example, substrates and/or products of an endoribonuclease and/or an end repair enzyme.
- methods may comprise (a) contacting an RNA substrate and an endoribonuclease to produce oligoribonucleotides comprising one or more unrepaired ends that are 2',3'-cyclic- phosphorylated, 3'-phosphorylated and/or 2'-phosphorylated; (b) contacting an RNA end repair enzyme and the oligoribonucleotides comprising the unrepaired ends to produce oligoribonucleotides comprising one or more repaired ends that are 2’, 3’ -hydroxylated, and (c) optionally, characterizing the oligoribonucleotides comprising one or more repaired ends that are 2’,3’-hydroxylated.
- Endoribonucleases used in methods of the disclosure may have specificity selected from (1) cleavage after a specific nucleotide followed by a purine, (2) cleavage after a specific nucleotide followed by a pyrimidine, (3) cleavage after a purine followed by a specific nucleotide, and (4) cleavage after a pyrimidine followed by a specific nucleotide and/or an average cleavage rate of the RNA substrate of once every 6-12 nucleotides (e g., once every 8 nucleotides).
- Example endoribonucleases used in methods of the disclosure may include hRNase 4, RNase Tl, RNase U2, RNase A, Colicin E5, MCI, Cusativin, Csxl, MazF, ChpB, MqsR, and YafO.
- End repair enzymes used in methods of the disclosure may comprise phosphodiesterase and phosphomonoesterase activities.
- An end repair enzyme may comprise a polynucleotide kinase-phosphatase.
- Example end repair enzymes include T4 polynucleotide kinase-phosphatases and Cth polynucleotide kinase-phosphatases.
- a methods may be performed as a coupled reaction.
- an RNA substrate may be a denatured RNA substrate.
- contacting an RNA substrate and an endoribonuclease may further comprise denaturing the RNA substrate to form a denatured RNA substrate and contacting the denatured RNA substrate and the endoribonuclease.
- Denaturing an RNA substrate may include, for example, contacting the RNA substrate with a denaturing agent (e.g., urea, formamide, dimethylformamide, guanidinium thiocyanate, sodium salicylate, dimethyl sulfoxide, propylene glycol, poly(ethylene glycol), and cetyltrimethylammonium bromide) at a salt concentration of up to 50 mM or incubating the RNA substrate at a temperature of 65°C or higher at a salt concentration of up to 50 mM.
- a denaturing agent e.g., urea, formamide, dimethylformamide, guanidinium thiocyanate, sodium salicylate, dimethyl sulfoxide, propylene glycol, poly(ethylene glycol), and cetyltrimethylammonium bromide
- contacting an RNA substrate and an endoribonuclease may further comprise denaturing the RNA substrate to form a denatured RNA substrate, diluting the denatured RNA substrate for form a diluted denatured RNA substrate, and contacting the diluted denatured RNA substrate and the endoribonuclease.
- (a) contacting and/or (b) contacting may further comprise contacting a buffering agent.
- contacting an RNA end repair enzyme and the oligoribonucleotides may further comprise separating the oligoribonucleotides comprising one or more unrepaired ends from the endoribonuclease to form separated oligoribonucleotides comprising one or more unrepaired ends.
- the (c) characterizing may comprise characterizing the oligoribonucleotides comprising one or more repaired ends by one or more of gel electrophoresis, capillary electrophoresis, liquid chromatography, and mass spectrometry.
- the (c) characterizing may comprise separating the oligoribonucleotides from one or more of the RNA substrate, the endoribonuclease, the RNA end repair enzyme to form separated oligoribonucleotides and characterizing the separated oligoribonucleotides.
- characterizing may include fractionating the oligoribonucleotides comprising one or more repaired ends that are 2 ’,3 ’-hydroxylated by liquid chromatography to form fractionated oligoribonucleotides and ionizing the fractionated oligoribonucleotides for mass spectrometry.
- an RNA substrate (e.g., an RNA substrate included in a method of the disclosure) may comprise in vitro transcribed RNA, chemically synthesized RNA, viral RNA, prokaryotic RNA, eukaryotic RNA, archaeal RNA, or combinations thereof.
- RNA substrate may comprise tissue culture RNA, biopsy RNA, feces RNA, urine RNA, lymph RNA, blood RNA, mucous RNA, sputum RNA, skin RNA, saliva RNA, wound RNA, sweat RNA, semen RNA, shoot RNA, root RNA, seed RNA, sewage RNA, sludge RNA, soil RNA, or any combination thereof.
- RNA substrates that may be analyzed by methods of the disclosure may comprise any RNA substrate including, for example, messenger RNA (mRNA), ribosomal RNA (rRNA), transfer RNA (tRNA), small RNA (sRNA), microRNA (miRNA), long noncoding RNA (IncRNA), circular RNA (circRNA), aptamer RNA, antisense RNA, silencing RNA (siRNA), guide RNA (gRNA), or any combination thereof.
- mRNA messenger RNA
- rRNA ribosomal RNA
- tRNA transfer RNA
- sRNA small RNA
- miRNA microRNA
- IncRNA long noncoding RNA
- circRNA circular RNA
- aptamer RNA antisense RNA
- silencing RNA silencing RNA
- gRNA guide RNA
- kits for analysis of polyribonucleotides including natural and/or synthetic RNA may include (a) an endoribonuclease having an amino acid sequence that (i) corresponds to an amino acid sequence of a first species (e.g., a vertebrate species (for example, Homo sapiens, Sus scrofa), a bacterial species (for example, Escherichia co ), a fungus species (for example, Aspergillus oryzae), a plant species (for example, Momordica charantia, Cucumis sativus), and an archaea species (for example, Pyrococcus furiosusy) or (ii) is a non-naturally occurring sequence; (b) an RNA end repair enzyme having an amino acid sequence that (i) corresponds to an amino acid sequence of a species other than the first species (e.g., a bacterial species or a bacteriophage species) or (i
- An endoribonuclease included in a kit may have specificity selected from (1) cleavage after a specific nucleotide followed by a purine, (2) cleavage after a specific nucleotide followed by a pyrimidine, (3) cleavage after a purine followed by a specific nucleotide, and (4) cleavage after a pyrimidine followed by a specific nucleotide, according to some embodiments.
- An endoribonuclease included in a kit may have an average cleavage rate of once every 6-12 nucleotides.
- Example endoribonucleases that may be included in a kit include hRNase 4, RNase Tl, RNase U2, RNase A, Colicin E5, MCI, Cusativin, Csxl, MazF, ChpB, MqsR, and YafO.
- An end repair enzyme included in a kit may comprise phosphodiesterase and phosphomonoesterase activities.
- An end repair enzyme included in a kit may comprise a polynucleotide kinase-phosphatase.
- Example end repair enzymes that may be included in a kit include T4 polynucleotide kinase-phosphatases and Cth polynucleotide kinasephosphatases.
- kits may further include a divalent metal, wherein the divalent metal is optionally selected from magnesium(II), manganese(II), cobalt(II), and nickel(II).
- a kit may further include one or more additional enzymes, wherein the one or more additional enzymes are optionally selected from RNA polymerases and RNA ligases.
- a method may include (a) contacting an RNA substrate and one or more DNA probes, each DNA probe shorter than the RNA substrate and each comprising an affinity domain, wherein at least a portion of the RNA substrate and at least a portion of the DNA probe(s) are complementary, to form a DNA-RNA hybrid duplex comprising a double-stranded portion and at least one single-stranded overhang; (b) contacting the DNA-RNA hybrid duplex with an enzyme composition, the enzyme composition comprising a single-strand-specific nucleotide-specific endoribonuclease (e g., hRNase 4, RNase Tl, RNase U2, RNase A, Colicin E5, MCI, Cusativin, Csxl, MazF, ChpB, MqsR, and YafO) and, optionally, an RNA end -repair enzyme, to form a cleave
- an enzyme composition comprising a single-strand-specific nucleotide-specific endoribonucleas
- a DNA-RNA hybrid duplex may comprise the double-stranded portion and two single- stranded overhangs.
- a DNA-RNA hybrid duplex may comprise the double- stranded portion and a 5’ single-stranded RNA overhang and a 3’ single-stranded RNA overhang.
- End repair enzymes used in methods of the disclosure may comprise phosphodiesterase and phosphomonoesterase activities.
- An end repair enzyme may comprise a polynucleotide kinase-phosphatase.
- Example end repair enzymes include T4 polynucleotide kinase-phosphatases and Cth polynucleotide kinasephosphatases.
- Methods may include, for example, (a) contacting an RNA substrate, an enzyme, and an isotopically labeled nucleoside triphosphate to form a labeled RNA substrate, wherein the enzyme is optionally selected from an RNA polymerase and an RNA ligase; (b) contacting the labeled RNA substrate and an endoribonuclease to produce oligoribonucleotides comprising one or more unrepaired ends that are 2',3'-cyclic-phosphorylated, 3'-phosphorylated and/or 2'-phosphorylated; and (c) contacting an RNA end repair enzyme and the oligoribonucleotides comprising the unrepaired ends to produce oligoribonucleotides comprising one or more repaired ends that are 2’, 3 ’-hydroxylated.
- a method may comprise contacting an RNA substrate, an enzyme, and a nucleoside triphosphate comprising a chemically reactive group to form a chemically reactive RNA substrate, wherein the enzyme is optionally selected from an RNA polymerase and an RNA ligase; (b) contacting the chemically reactive RNA substrate and a molecule reactive with the chemically reactive RNA substrate to form a labeled RNA substrate, wherein the molecule comprises one or more stable isotopics; (c) contacting the labeled RNA substrate and an endoribonuclease to produce oligoribonucleotides comprising one or more unrepaired ends that are 2',3'-cyclic- phosphorylated, 3 '-phosphorylated and/or 2'-phosphorylated; and (d) contacting an RNA end repair enzyme and the oligoribonucleotides comprising the unrepaired ends to produce oligoribonucleotides comprising one or more repaired ends that are 2’, 3
- a method may comprise, in some embodiments, contacting an RNA substrate and one or more RNA substrate binding molecules (e.g., a “DNA probe, an RNA probe, a synthetic nucleic acid probe, an RNA binding protein, an antibody, an RNA ligand) to form RNA substrate-RNA binding molecule complexes, each complex comprising a bound portion and at least one singlestranded portion, wherein each bound portion comprises at least a portion of the RNA substrate and an RNA binding molecule.
- RNA substrate binding molecules e.g., a “DNA probe, an RNA probe, a synthetic nucleic acid probe, an RNA binding protein, an antibody, an RNA ligand
- a method may comprise contacting an RNA substrate with two species of DNA probe, a first species complementary to a more 5’ portion of the RNA substrate and a second species complementary to a more 3’ portion of the RNA substrate to form RNA substrate-RNA binding molecule complexes, each complex comprising (e g., in a 5 ’ to 3 ’ direction) a first bound portion a single-stranded portion and a second bound portion.
- a first bound portion may comprise the first DNA probe and the more 5’ portion of the RNA substrate.
- a second bound portion may comprise the second DNA probe and the more 3’ portion of the RNA substrate.
- a single-stranded portion may comprise a portion of the RNA substrate linking the more 5’ portion and the more 3’ portion.
- a method may further comprise contacting the RNA substrate-RNA binding molecule complexes with an enzyme composition (e.g., an enzyme composition comprising a single-strand-specific nucleotide-specific endoribonuclease and, optionally, an RNA endrepair enzyme) to form by cleavage of the RNA substrate at one or more sites within the singlestranded portion by the single-strand-specific nucleotide-specific endoribonuclease cleaved bound portions and one or more fragments of the single-stranded portion.
- a method may comprise, in some embodiments, separating the cleaved bound portions from the one or more fragments of the at least one single-stranded portion.
- a method may comprise analyzing one or more properties of the cleaved bound portions and/or analyzing one or more properties of the fragments.
- Analyzing one or more properties of the cleaved bound portions may include, in some embodiments, characterizing at least the RNA substrate fragment of the cleaved bound portions (e.g., the more 5’ portion and/or the more 3’ portion of the RNA substrate) by one or more of gel electrophoresis, capillary electrophoresis, liquid chromatography, and mass spectrometry, wherein the characterizing optionally comprises at least one of assessing the molecular mass of the RNA substrate, assessing the sequence of the RNA substrate (or a portion thereof), and assessing the modification status (e.g., modified bases appearing in the RNA substrate including 1 -methylpseudouridine and 5-methoxycytidine; 5’ ends having a pp, ppp, CapO, Capl, or Cap2; 3’ ends having a polyA tail
- Analyzing one or more properties of the fragments of the at least one single-stranded portion may include, in some embodiments, characterizing the fragments by one or more of gel electrophoresis, capillary electrophoresis, liquid chromatography, and mass spectrometry, wherein the characterizing optionally comprises at least one of assessing the molecular mass of the fragments, assessing the sequence of the fragments (or a portion thereof), and assessing the modification status (e.g., modified bases appearing in the RNA substrate including 1 -methylpseudouridine and 5 -methoxy cytidine; 5’ ends having a pp, ppp, CapO, Cap 1 , or Cap2; 3 ’ ends having a polyA tail or inverted thymidine) of the fragments.
- the characterizing optionally comprises at least one of assessing the molecular mass of the fragments, assessing the sequence of the fragments (or a portion thereof), and assessing the modification status (e.g., modified bases appearing
- RNA end repair enzyme may comprise phosphodiesterase and phosphomonoesterase activities (e.g., a polynucleotide kinase-phosphatase).
- RNA end repair enzyme include a T4 polynucleotide kinase-phosphatase or a Cth polynucleotide kinase-phosphatase.
- an RNA substrate may comprise in vitro transcribed RNA, chemically synthesized RNA, viral RNA, prokaryotic RNA, eukaryotic RNA, archaeal RNA, or any combination thereof.
- An RNA substrate may comprise tissue culture RNA, biopsy RNA, feces RNA, urine RNA, lymph RNA, blood RNA, mucous RNA, sputum RNA, skin RNA, saliva RNA, wound RNA, sweat RNA, semen RNA, shoot RNA, root RNA, seed RNA, sewage RNA, sludge RNA, soil RNA, or any combination thereof.
- RNA substrate may comprise messenger RNA (mRNA), ribosomal RNA (rRNA), transfer RNA (tRNA), small RNA (sRNA), microRNA (miRNA), long non-coding RNA (IncRNA), circular RNA (circRNA), aptamer RNA, antisense RNA, silencing RNA (siRNA), guide RNA (gRNA), or any combination thereof.
- mRNA messenger RNA
- rRNA ribosomal RNA
- tRNA transfer RNA
- sRNA small RNA
- miRNA microRNA
- IncRNA long non-coding RNA
- circRNA circular RNA
- aptamer RNA antisense RNA
- silencing RNA silencing RNA
- gRNA guide RNA
- FIGURE 1 shows a schematic of an example cassette used to produce recombinant hRNase 4 enzyme.
- pPS periplasmic signal peptide
- 6HIS hexahistidine tag
- MBP maltose binding protein
- TEV TEV protease cleavage site.
- FIGURE 2 shows an example cleavage efficiency heatmap for pooled synthetic oligonucleotides (SEQ ID NOS: 1-13) using hRNase 4 at the indicated dilutions of enzyme. Darker boxes indicate more efficient cleavage by hRNase 4.
- FIGURE 3 shows an example bar chart of the mean total intensity of 5-prime cleavage products formed by digestion with hRNase 4.
- 5-Prime cleavage products are classified according to their terminal 3 ’-nucleotide residue. Error bars represent the standard deviation from two replicate digests.
- hRNase 4 primarily produced 5-prime cleavage products comprising a 3’-uridine nucleotide.
- FIGURE 4 shows an example bar chart of the mean total intensity of 3 -prime cleavage products formed by digestion with hRNase 4.
- 3-Prime cleavage products are classified by their initial 5 ’-nucleotide residue. Error bars represent the standard deviation from two replicate digests.
- hRNase 4 primarily produced 3-prime cleavage products comprising a 5 ’-adenine or 5 ’-guanine.
- FIGURE 5 shows an example comparison of predicted (theoretical) coverage of mRNA transcripts cleaved with hRNase 4 and various endoribonucleases. Cleavage products with lengths greater than 4 and less than 40 nucleotides were considered for sequence coverage calculations.
- FIGURE 5 A shows the theoretical sequence coverage of hRNase 4 and various endoribonucleases for 1000 random human mRNA transcripts (RefSeq).
- FIGURE 5B shows the theoretical sequence coverage of hRNase 4 and various endoribonucleases for E. coll coding sequences (CDS).
- FIGURE 5C shows the theoretical sequence coverage of hRNase 4 and various endoribonucleases for the BNT162b2 COVID- 19 mRNA vaccine sequence.
- FIGURE 6 shows an example comparison of predicted (theoretical) coverage of mRNA transcripts cleaved with endoribonucleases having hRNase 4-like cleavage specificities, namely those directed to a single nucleotide followed by either a purine or a pyrimidine (N(Y/R)) or to a purine or a pyrimidine followed by a single nucleotide ((Y/R)N) and with endoribonucleases having a single dinucleotide sequence (NN) or a single nucleotide (N) specificity.
- Cleavage products with lengths greater than 4 and less than 40 nucleotides were considered for sequence coverage calculations.
- FIGURE 6A shows the theoretical sequence coverage of the endoribonucleases for 1000 randomly selected human mRNA transcripts (RefSeq).
- FIGURE 6B shows the sequence coverages of individual hRNase 4-like (‘N(Y/R) & (Y/R)N’) cleavage specificities upon digestion of 1000 randomly selected human mRNA transcripts.
- FIGURE 6C shows the theoretical sequence coverage of the endoribonucleases for E. coli coding sequences (CDS).
- FIGURE 6D shows the theoretical sequence coverage of the endoribonucleases for the BNT162b2 COVID-19 mRNA vaccine sequence.
- Endoribonucleases having hRNase 4-like cleavage specificities are predicted to produce superior mRNA coverage relative to endoribonucleases having a single dinucleotide sequence (NN) or single nucleotide (N) specificity.
- FIGURE 7 shows example overlaid UV chromatograms of the digestion of a synthetic oligoribonucleotide (SEQ ID NOS: 14-17) with hRNase 4 in the presence or absence of T4 PNK (lighter and darker traces, respectively). Cleavage products detected in this assay are represented by the sequences 1 to 4.
- FIGURE 8 shows example workflows used for digestion of an RNA.
- FIGURE 8A shows an example generic workflow.
- FIGURE 8B shows an example workflow in which the subject RNA is digested with RNase T1.
- FIGURE 8C shows an example workflow in which the subject RNA is digested with a composition comprising hRNase 4 and T4 PNK.
- FIGURE 9 shows an example theoretical sequence coverage map obtained from digestion of FLuc IVT mRNA with either hRNase 4 or RNaseTl .
- hRNase 4 is predicted to generate a much larger number of unique cleavage products than RNase Tl.
- RNase Tl is predicted to generate a high percentage of isomeric cleavage products (i.e., products with the same nucleotide composition but with distinct sequences of nucleotides).
- FIGURE 10 shows the scoring distribution (violin plots) of an example search of the deconvoluted masses of cleavage products, resulting from digestion of FLuc IVT mRNA against a human transcriptome database spiked with FLuc mRNA.
- FIGURE 10A shows the scoring distribution for digestion with hRNase 4/T4 PNK.
- FIGURE 10B shows the scoring distribution for digestion with RNase Tl. A substantially higher background was observed using RNase Tl as a result of the high percentage of isomeric cleavage products.
- FIGURE 11 shows the number of oligonucleotides identified in each pool of triplicate sequencing experiments of FLuc mRNA digests with the overlapping portion of each showing the oligonucleotides common to two or all three replicates. Most oligonucleotides were reproducibly identified in each pool with hRNase 4/T4 PNK (96) and RNase Tl (85). Each replicate had similar total number of spectral counts.
- FIGURE 12 shows the distribution of cleavage product lengths identified in each replicate from digestion of FLuc IVT mRNA with either hRNase 4/T4 PNK or RNaseTl . Increased median and maximum lengths were observed in the hRNase 4/T4 PNK condition in comparison with that of RNaseTl .
- FIGURE 13 shows the experimental coverage of FLuc mRNA observed in digests either with hRNase 4/PNK or with RNase Tl . Improved sequence coverage of FLuc mRNA was observed in each hRNase 4/T4 PNK experiment (average 69.8%) relative to that of RNaseTl (average 52.8%).
- FIGURE 14 shows the experimental coverage of FLuc mRNA observed in digests with RNaseTl alone, hRNase 4 alone, or RNaseTl and hRNase 4 in combination. Increased coverage may be obtained by combining oligonucleotide identifications across treatment with different endoribonucleases.
- FIGURE 15 shows the number of oligonucleotides identified in each pool of triplicate sequencing experiments of FLuc mRNA digests represented as a circle.
- the portions of circles that overlap represent oligonucleotides common to both (portions where two circles overlap) or all three (portions where all three circles overlap) replicates.
- Data is shown for MC1/T4 PNK-based digests and demonstrates that T4 PNK may be successfully coincubated with diverse endoribonucleases to produce reproducible cleavage product identifications.
- MCI belongs to the T2 RNase family and was isolated from seeds of the bitter gourd Momordica charantia).
- FIGURE 16 shows the experimental coverage of FLuc mRNA observed in three digests with a composition of MC1/T4 PNK. Digestion with MC1/T4 PNK results in reproducible RNA sequence coverage.
- FIGURE 17 shows the scoring distribution of an example search of the deconvoluted masses of Epo mRNA digests against a Epo mRNA-spiked in human transcriptome database.
- Epo mRNA cleavage products were generated by digestion EpoU, EpomoU, or EpomlY mRNA either with hRNase 4/T4 PNK (replicate 1, column 1; replicate 2; column 2) or with RNase T1 (replicate replicate 1, column 3; replicate 2, column 4).
- the mean signal -to-noise ratio (S/N) of the score of each U-modified (lower row), m 1 Y-modified (top row), or mo 5 U- modified (middle row) EPO mRNA sequence relative to all other transcripts is reported at the top of each pair of graphs.
- Cleavage products produced by RNase T1 are generally shorter in length as exemplified in FIGURE 11. Accordingly, there is a higher probability of mapping those oligonucleotides to unrelated transcript sequences thereby increasing the analysis background.
- FIGURE 18A shows the sequence coverage of fully modified EpoU (right column), EpomoU (middle column) or EpomlY (left column) mRNAs upon analysis of cleavage products originated from digestion with either hRNase 4/T4 PNK (replicate 1, row 1; replicate 2, row 2) or with RNaseTl (replicate 1, row 3; replicate 2, row 4).
- FIGURE 18B shows the fractional coverage of fully modified EpoU, EpomoU or EpomlY mRNAs upon analysis of cleavage products originated from digestion with either hRNase 4/T4 PNK or with RNaseTl .
- the hRNase 4/T4 PNK condition substantially increases coverage for canonical or base modified Epo mRNAs relative to the RNase T1 condition.
- FIGURE 19 shows an example in which capped versus uncapped Epo mRNA (including cap modifications, such as cap methylation) (SEQ ID NOS: 18-22) are differentiated by cleavage with hRNase 4/T4 PNK. Error bars represent standard deviation from two replicate digests.
- FIGURE 20 shows example UV chromatograms of RNA cleavage products.
- the upper trace shows that no higher-retention cleavage products detected were detected in hRNase 4/T4 PNK treatment of Epo mRNA that lacked a poly-A tail whereas the lower trace shows that higher-retention cleavage products were detected in hRNase 4/T4 PNK treatment of Epo mRNA comprising a poly-A tail.
- polyA tails may be detected by cleavage with hRNase 4/T4 PNK.
- FIGURE 21 A shows the scoring distribution of an example search of the deconvoluted masses of uridine-depleted CLuc mRNA digests against a CLuc mRNA-spiked in human transcriptome database.
- Cleavage products derived from uridine-depleted CLuc mRNAs were generated by digestion with hRNase 4/T4 PNK (2 replicates per substrate).
- the mean signal-to-noise ratio (S/N) of the score of each uridine-depleted cLuc mRNA sequence relative to all other transcripts is reported at the top of each pair of graphs.
- FIGURE 2 IB shows a schematic representation of the relative location of depletion regions (broken lines) in each of three uridine-depleted CLuc mRNAs used in this experiment (CLuc Ul, CLuc U2, and CLuc U3).
- FIGURE 22 shows a sequence coverage map of each uridine-depleted CLuc mRNA upon analysis of cleavage products originated from digestion with hRNase 4/T4 PNK, accounting for shared sequences between each sample.
- the detected coverages regions are represented for each of CLuc Ul (replicate 1 and 2, left columns), CLuc U2 (replicate 1 and 2, middle columns), CLuc U3 (replicate 1 and 2, right columns).
- analysis of hRNase 4 digests enabled correct annotation of each mRNA comprising distinct uridine-depleted segments.
- FIGURE 23 shows the number of true positive and false positive oligonucleotide identifications of each uridine depleted CLuc mRNA upon digestion with hRNase 4/T4 PNK (2 replicates per sequence) in accordance with an example embodiment.
- FIGURE 24A, FIGURE 24B, and FIGURE 24C each show a schematic for isotopically labeling RNA oligonucleotides for quantification analysis.
- a non-isotopically labeled nucleotide comprising a chemically reactive group (3’-azido-3’deoxy-nucleotide) may be incorporated at the 3’ end of an RNA oligonucleotide by incubation of the non-isotopically labeled nucleotide (e.g., a 3 ’-azido-3’ deoxy-nucleoside triphosphate) and an RNA polymerase.
- the non-isotopically labeled nucleotide e.g., a 3 ’-azido-3’ deoxy-nucleoside triphosphate
- FIGURE 24B shows isotopically labeled molecules, wherein the chemically reactive group is DBCO and the “light” and “heavy” isotopically labeled molecules are derived from the amino acid alanine.
- An example of a tandem mass tag dipeptide conjugate is also shown. This tandem mass tag comprises a reporter and a balancing amino acid (for simplicity heavy isotopes are omitted from illustration; the site of HCD fragmentation is represented by a dashed line).
- FIGURE 24C shows an example of a chemoselective reaction involving a 3'-terminal 3'- azido-modified RNA oligonucleotide and a DBCO conjugate.
- FIGURE 24D, FIGURE 24E and FIGURE 24F each illustrate an example HCD fragmentation pattern of an RNA nucleoside that has been chemoselectively labeled with a reporter group.
- the reporter group is attached to the RNA nucleotide 3’ end by the reaction of a 3 ’-azido-3 ’-deoxy adenosine with a DBCO peptide conjugate.
- the DBCO peptide conjugate may comprise one or more isotopically labeled atoms (e.g., 2H, 13C and 15N) and may be used for quantitative analysis of multiple oligonucleotides in a single experiment (e.g., for quantification of capped versus uncapped 5’ end oligonucleotides in a capping assay).
- FIGURE 24D illustrates an example HCD fragmentation mass spectrum of an alanine derived peptide-deoxyadenosine conjugate.
- FIGURE 24E illustrates an example HCD fragmentation mass spectrum of an alanine-phenylalanine derived dipeptidedeoxyadenosine conjugate.
- FIGURE 24F illustrates an example HCD fragmentation mass spectrum of an alanine-proline derived dipeptide-deoxyadenosine conjugate.
- the data demonstrate the identification of characteristic phenylalanine or proline amino acid reporter anions in the HCD spectra from each dipeptide conjugate, respectively.
- These reporter anions may further comprise isotopically labeled atoms and be used for quantification of the corresponding oligonucleotide conjugates.
- FIGURE 25A and FIGURE 25B each show an example schematic for isotopically labeling RNA oligonucleotides by incorporating a chemically reactive group.
- FIGURE 25A shows labeling the 5’ end of an oligonucleotide by first incubating the oligonucleotide with ATPyS and T4 PNK to form a 5’ terminal thiophosphate oligonucleotide, and then reacting it with an iodoacetyl tandem mass tag (iodoTMT) reagent set comprising an isobaric mixture of isotopes as shown.
- iodoTMT iodoacetyl tandem mass tag
- FIGURE 25B shows labeling the 3’ end of an oligonucleotide by first incubating the oligonucleotide with sodium periodate to form a 3’ terminal dialdehyde oligonucleotide, and then reacting it with an aminoxy tandem mass tag (aminoxyTMT) reagent set comprising an isobaric mixture of isotopes analogous to the one shown in FIGURE 25A.
- aminoTMT aminoxy tandem mass tag
- FIGURE 26 shows the relative intensities of the oligonucleotides detected by LC- MS/MS analysis following hRNase 4 cleavage in the presence and absence of human placental RNase inhibitor. Data shown demonstrate that hRNase 4-mediated cleavage of an RNA oligonucleotide is inhibited by human placental RNase inhibitor. Sequences shown include SEQ ID NO: 23 and subsequences thereof (positions 1-6, positions 1-9, positions 2- 15, positions 7-15, positions 10-15).
- FIGURE 27 shows an example of a workflow used for targeted site-specific cleavage and isolation of a 5’-capped RNA oligonucleotide for downstream capping analysis.
- the subject EPO mRNA is first annealed with a DNA probe (e.g., a biotinylated DNA probe) and then digested with a composition comprising hRNase 4 and T4 PNK.
- the DNA-RNA duplex formed after digestion is purified (e.g., by affinity capture using streptavidin beads) and the RNA oligonucleotide is released (e.g., by elution using DNase I).
- FIGURE 28 A shows a schematic representation of DNA-targeted hRNase 4 sitespecific cleavage of an IVT Epo mRNA substrate.
- the arrow shows the closest ‘UR’ cleavage site near the DNA-RNA duplex region.
- the resulting RNA oligonucleotide product is shown in grey.
- FIGURE 28B shows a total ion chromatogram from LC-MS/MS characterization of the isolated RNA oligonucleotide (top panel) after its elution from beadbound DNA-RNA duplex by treatment with DNase I. No oligonucleotide was isolated in absence of a DNA probe (middle panel) or in absence of hRNase 4 (lower panel).
- FIGURE 28C shows a deconvoluted mass spectrum depicting the intact masses observed within the single chromatographic peak of FIGURE 28B (35mer Target) in the sample treated with hRNase 4/T4 PNK in the presence of the biotinylated DNA-probe.
- Mass spectrometry analysis confirm the isolation of the desired 35mer RNA oligonucleotide comprising the mass of a m7GpppAm cap structure.
- FIGURE 29A illustrates a heatmap depicting oligonucleotide products from a DNA probe-directed, RNA cleavage protection assay using example nucleotide specific and dinucleotide specific single-stranded ribonucleases.
- Protected Products refers to those oligonucleotide cleavage products spanning the DNA hybridized region. Numbers designating the start and end position of each identified cleaved oligonucleotide within the 40mer RNA sequence are shown in the y-axis. NoP: no probe; NoE: no enzyme; lOf, 50f and 250f are fold dilutions of each ribonuclease. Data shown demonstrate that the cleavage product heterogenicity is dependent on the identity and concentration of the ribonuclease utilized in the protection assay. FIGURE 29B illustrates sequences of the most abundant protected products for each enzyme.
- the 20mer DNA probe is represented as a bar aligned above the substrate 40-mer (SEQ ID NO: 31) with the respective protected products appearing below. Fragments of the substrate 40-mer are shown with numeric ranges to the right indicating the corresponding positions of SEQ ID NO:31.
- FIGURE 30A illustrates a series of example ribonuclease protection assays with a 40mer RNA and complementary DNA probes ranging from 20 to 30 nucleotides in length.
- FIGURE 30B illustrates a heatmap of oligonucleotide products identified in ribonuclease protection assays using hRNase 4, MCI or RNase T1 with various DNA probes (NE: no enzyme). Tile shade relates to the mean signal area of each identified oligonucleotide in each experiment.
- FIGURE 30C illustrates predominant cleavage site positions within the 40mer substrate (SEQ ID NO: 31) for hRNase 4, MCI, and RNase Tl. hRNase 4 displayed less cleavage product heterogenicity regardless of the DNA probe chosen.
- FIGURE 31 A illustrates a ribonuclease protection assay of FLuc mRNA (SEQ ID NO: 26) using hRNase 4 or RNase H.
- the 25mer probe is represented as a bar (light shade represents deoxyribonucleotides; dark shade represents ribonucleotides). Black circles indicate position of biotin group.
- X represents a capped or uncapped 5’ modification. RNase cut sites are marked.
- FIGURE 3 IB illustrates products from an example FLuc mRNA 5'-end sequence cleavage with either hRNase 4 (top) or RNase H (bottom) following a biotin enrichment step.
- FIGURE 31C illustrates deconvoluted mass spectra of the main cleavage product peak of hRNase 4 digest (the 28mer) from Figure 3 IB.
- FIGURE 3 ID illustrates deconvoluted mass spectra of the main cleavage product peak of RNaseH digest (the 24mer) from Figure 3 IB.
- Both cleavage products of Figure 31C (28mer) and Figure 3 ID (24mer) comprise a mixture of the individual sequences with different 5’ ends, which may include 5’-pp (diphosphate or 2p), 5’-ppp (triphosphate or 3p), 5’-Gppp, Cap 0 (5’-m 7 GpppG) and Cap 1 (5’-m 7 GpppGm).
- FIGURE 3 IE illustrates an example heatmap of the intensity of the Cap 1 modified oligonucleotides detected in two independent experiments using hRNase 4 or RNase H.
- RNase H produces two abundant Cap 1 oligonucleotide product sequences (23mer and 24mer), whereas hRNase 4 produces predominantly one (28mer).
- FIGURE 3 IF illustrates distribution of the different 5’ ends in all identified sequences for each of hRNase 4 or RNase H digests averaged from two replicates. Data shown demonstrate that the relative quantification of capped products and their intermediates in the hRNase 4 condition is comparable to that of RNase H enzyme, indicating that hRNase 4 can be effectively used for analysis of mRNA capping efficiency.
- the presence of a 3’ end repair enzyme, such as T4 PNK, in combination with hRNase 4 may produce molecules with consistent, dephosphorylated 3’termini and, thereby, reduce ambiguity in the attribution of mRNA 5’ modification.
- FIGURE 32A compares hRNase 4 and RNase H for the analysis of the enzymatic capping efficiency of unmodified (U) and fully modified (mlY) mRNA. Data shown illustrate the distribution of 5’ ends in a population of capped mRNAs revealed by analyses with hRNase 4 or RNase T1. Comparable distributions of capped products and intermediates were obtained in both the hRNase 4 analysis and the RNase T1 analysis.
- FIGURE 32B illustrates an example heatmap of the intensity of Cap-1 modified oligonucleotides of various lengths detected for each mRNA variant.
- RNase H displayed a higher propensity to spuriously cleave one or more nucleotides upstream or downstream from the target site resulting in a mixture of cleaved products differing from each other by one or more nucleotides in length, even after extensive probe optimization.
- FIGURE 33 illustrates an example ribonuclease protection assay applied to the analysis of 3’ end of EPO mRNA (SEQ ID NO:27) using hRNase 4.
- a DNA probe was designed to direct RNA cleavage a few nucleotides upstream of the mRNA poly(A) tail (cleavage sites for hRNase 4 are designated with R4).
- the deconvoluted mass spectra of the product of the 3’ end cleavage shows a distribution of peaks between 40,000 and 45,000 u that differ from each other by an adenosine (A) nucleotide and indicative of the presence of a poly (A) tail.
- FIGURE 34 illustrates an example workflow 3400 used for integrative analysis of 5’ cap, poly(A) tail, and an mRNA internal sequence (also referred as to mRNA body sequence) using hRNase 4.
- subject RNA is annealed 3410 with DNA probes targeted to the RNA 5’ and 3’ ends, wherein each DNA probe independently of each other may comprise an affinity group.
- Hybridized RNA is digested 3420 with hRNase 4, optionally in a composition with T4 PNK.
- Cleaved DNA-RNA duplexes are purified 3430 (e g., by affinity capture) and the cleaved single-stranded RNA oligonucleotides are collected 3440 (supernatant).
- the supernatant fraction containing the cleaved single-stranded RNA oligonucleotides may be used for analysis 3450 of the mRNA internal sequence.
- DNA-RNA duplexes may be eluted 3460 and used for 5’ cap and poly(A) tail analysis 3470 (e.g., directly or after releasing the RNA strands by DNase I treatment).
- SEQ ID NOS: 1-13 which are also illustrated in FIGURE 2 and Table 1, are example oligoribonucleotides for assessing cleavage capabilities of an RNase.
- SEQ ID NOS: 14-17 which are also illustrated in FIGURE 7, are example oligoribonucleotides for assessing cleavage capabilities of an RNase.
- SEQ ID NOS: 18-22 which are also illustrated in FIGURE 19, are hRNase 4/T4 PNK cleavage products of Epo mRNA (SEQ ID NO:27).
- SEQ ID NO: 23 is an example RNase substrate for assessing cleavage capabilities of an RNase.
- SEQ ID NO: 24 is an example biotinylated DNA probe sequence for hybridization with an RNase substrate.
- SEQ ID NO: 25 is a portion of an EPO mRNA (SEQ ID NO: 27) example RNase substrate for assessing cleavage capabilities of an RNase.
- SEQ ID NO: 26 which is also illustrated in Table 1 and, in part, in FIGURE 31 A, is a FLuc mRNA example RNase substrate for assessing cleavage capabilities of an RNase.
- SEQ ID NO: 27 which is also illustrated in Table 1 and, in part, in FIGURE 33, is an EPO mRNA example RNase substrate for assessing cleavage capabilities of an RNase.
- SEQ ID NO: 28 which is also illustrated in Table 1, is a ClucUl mRNA example RNase substrate for assessing cleavage capabilities of an RNase.
- SEQ ID NO: 29, which is also illustrated in Table 1, is a ClucU2 mRNA example RNase substrate for assessing cleavage capabilities of an RNase.
- SEQ ID NO: 30 which is also illustrated in Table 1, is a ClucU3 mRNA example RNase substrate for assessing cleavage capabilities of an RNase.
- SEQ ID NO: 31 which is also illustrated in FIGURES 29 and 30C, is an example oligoribonucleotide for assessing cleavage capabilities of an RNase.
- cleavage products may include a first fragment (e.g., consisting of positions 1-17, 1-19, 1-20, 1-21, 1-22, 1-23, 1-25, 1-26, 1-27, 1-28, 1-29, 1-30, 1-31, 1-33, 2-22, 3-22, or 4-22 of SEQ ID NO:31), one or more additional oligoribonucleotides (e.g., 4- 23 nucleotides in length), and optionally one or more mono-di- and tri-ribonucleotides.
- a first fragment e.g., consisting of positions 1-17, 1-19, 1-20, 1-21, 1-22, 1-23, 1-25, 1-26, 1-27, 1-28, 1-29, 1-30, 1-31, 1-33, 2-22, 3-22, or 4-22 of SEQ ID NO:31
- additional oligoribonucleotides e.g., 4- 23 nucleotides in length
- optionally one or more mono-di- and tri-ribonucleotides
- SEQ ID NO: 32 is an example DNA probe sequence for hybridization with an RNase substrate.
- SEQ ID NOS: 33-42 which are also illustrated in Table 5, are example DNA probe sequences for hybridization with an RNase substrate.
- SEQ ID NOS: 43-45 are example RNase 4 sequences.
- SEQ ID NO: 46 is an example polynucleotide kinase sequence.
- SEQ ID NOS: 47-50 which are also illustrated in Table 6, are example mRNA 5'- UTR coding sequences.
- SEQ ID NOS: 52-58 which are also illustrated in Table 7, are example probe sequences used to assess probe-directed RNA cleavage of mRNAs comprising distinct 5'- UTRs.
- SEQ ID NO: 59 is an example biotinylated DNA probe sequence for hybridization with an RNase substrate.
- Mass spectrometry is a technique that allows direct and comprehensive characterization of nucleic acids and their chemical modifications without any prior knowledge or assumptions (Yoluc et al., 2021). Mass spectrometry analysis of RNA may be conducted with non-hydrolyzed RNA species (top-down analysis), partially hydrolyzed RNA species (bottom-up analysis) or fully hydrolyzed RNA species (nucleoside analysis).
- RNA is partially hydrolyzed by enzymatic digestion to oligonucleotides using site-specific ribonucleases (RNases), such as RNase T1 (guanosine-specific), RNase A (pyrimidine-specific) and RNase U2 (purine-specific).
- RNases site-specific ribonucleases
- RNase T1 guanosine-specific
- RNase A pyrimidine-specific
- RNase U2 purine-specific
- the resulting RNase preparations are of low quality and sometimes may be contaminated with other undesired RNase activities, making it difficult to precisely define the RNase specific activity.
- the RNase itself does not exhibit clear-cut specificity and thus produces secondary cleavage of RNA (i.e., cleavage at sites that are different from the main cleavage motif), which often increases with the RNase concentration.
- RNA cleavage products comprising diverse phosphorylation states at the 5’ (5-prime) and 3’ (3-prime) ends may be obtained (e.g., 5’- phosphate, 5-hydroxy, 3 ’ -phosphate, 3 ’-hydroxy, 2’ -phosphate, 2’ -hydroxy, and/or 2', 3'- cyclic-phosphate), thereby complicating comprehensive RNA analysis.
- the presence of RNA digestion products of the same or different sequence with multiple possible phosphorylation status at their ends, including non-phosphorylated ends, increases the spectral complexity and the likelihood of peak overlaps, and reduces the overall abundance of each ion.
- Ribonuclease mapping may be used to determine RNA sequence and modification status by mass spectrometry.
- the protocol for RNA digestion comprises an additional step of treating the digested RNA, often (but optionally) after purification of the digested RNA, with a phosphodiesterase (e.g., a cyclic phosphodiesterase) to reduce the sample complexity.
- Phosphodiesterases PDEs are enzymes that are characterized by their ability to cleave a phosphodiester bond.
- Cyclic PDEs (2',3 '- cyclic nucleotide phosphodiesterase, also referred as to CNPase or CNP) cleave a phosphodiester bond in 2', 3 '-cyclic nucleotide to form a nucleoside 2'-phosphate. Cyclic PDEs, such as human CNP, do not hydrolyze phosphate monoesters (i.e., they do not exhibit phosphomonoesterase activity).
- RNA processing steps prior to downstream mass spectrometry analysis, but also increase the depth of RNA analysis are desired, for example, (i) to improve the accuracy of the resulting sequence data, (ii) to enable accurate sequencing of smaller amounts of input RNA, and/or (iii) to better differentiate sequencing errors from true sequence variations.
- Such methods may benefit from endoribonucleases showing one or more of the following properties: (a) robust and reproducible specific activity; (b) easy to express and purify as soluble protein; (c) long shelflife stability; (d) tolerates the presence of salts and/or denaturing agents; (e) conditionally inhibited and/or deactivated (e.g., to limit activity when desired); (f) at least moderately thermostable; (g) cleavage frequencies of, on average, every 6-12 nucleotides; (h) nominal or no spurious cleavage activity; and (i) capable of cleaving RNA modifications.
- Endonucleases having some or all of these properties may reduce or minimize the extent of the formation of isomeric digestion products and/or increase the sequence coverage of long, complex RNAs.
- RNA characterization which may include, for example, chromatographic and/or spectroscopic characterization.
- methods and compositions may include RNA analysis or characterization (e.g., sequencing) by LC-MS/MS.
- Methods and compositions, according to some embodiments, may include and/or use human endoribonuclease 4.
- compositions and methods may include one or more endoribonucleases and one or more RNA end repair enzymes that work (e.g., work concurrently) to recognize, cleave, and heal specific RNA sequences and may produce from a RNA substrate oligoribonucleotides having fully hydroxylated ends (i.e., RNA oligonucleotides comprising 5’-OH, 3'-OH, and 2'-OH termini).
- the present disclosure relates to methods and compositions for analyzing RNA substrates using, for example, tandem liquid chromatography-mass spectrometry (e.g., LC-MS/MS).
- Methods may include, for example, preparing oligoribonucleotides from RNA substrates and analyzing the oligoribonucleotides.
- Compositions and kits to produce oligoribonucleotides may comprise one or more components according to some embodiments.
- compositions and kits to produce oligoribonucleotides may comprise one or more enzymes or catalysts active on RNA substrates including, for example, an endoribonuclease (e.g., human endoribonuclease 4) and an RNA end repair enzyme (e.g., bacteriophage T4 polynucleotide kinase (T4 PNK)).
- RNA end repair enzyme e.g., bacteriophage T4 polynucleotide kinase (T4 PNK)
- Compositions and kits to produce oligoribonucleotides may comprise one or more buffering agents and/or one or more RNA denaturing agents.
- Compositions and methods in some embodiments, have application to analysis of RNA-based cancer immunotherapies, protein-replacement therapies, and prophylactic and therapeutic vaccines.
- each component, feature, and method step disclosed herein is optional and the disclosure contemplates embodiments in which each optional element may be expressly excluded.
- this statement is intended to serve as antecedent basis for use of such exclusive terminology as “solely,” “only” and the like in connection with the recitation of claim elements or use of a “negative” limitation.
- Sources of commonly understood terms and symbols may include: standard treatises and texts such as Kornberg and Baker, DNA Replication, Second Edition (W.H. Freeman, New York, 1992); Lehninger, Biochemistry, Second Edition (Worth Publishers, New York, 1975); Strachan and Read, Human Molecular Genetics, Second Edition (Wiley-Liss, New York, 1999); Eckstein, editor, Oligonucleotides and Analogs: A Practical Approach (Oxford University Press, New York, 1991); Gait, editor, Oligonucleotide Synthesis: A Practical Approach (IRL Press, Oxford, 1984); Singleton, et al., Dictionary of Microbiology and Molecular biology, 2d ed., John Wiley and Sons, New York (1994), and Hale & Markham, the Harper Collins Dictionary of Biology, Harper Perennial, N.Y. (1991) and the like.
- a protein refers to one or more proteins, i.e., a single protein and multiple proteins.
- an “affinity capture domain” refers to a domain capable of binding a corresponding affinity domain.
- Example materials having such properties include avidin, streptavidin, neutravidin, maltose-binding protein, GST, antibodies (e.g., anti- HA, anti-Myc, anti-FLAG), S-protein, calmodulin, lectins, nickel, cobalt, zinc, and polyhistidine.
- Further examples include groups that form an irreversible bond with a protein tag, including benzylguanine or benzylchoropyrimidine (SNAP -tag); benzylcytosine (CLIP -tag); haloalkane (HaloTag); CoA analogues (MCP-tag and ACP-tag); trimpethoprim or methotrexate (TMP-tag); FlAsH or ReAsH (Tetracysteine tag); a substrate of biotin ligase; a substrate of phosphopantetheline transferase; and a substrate of lipoic acid ligase.
- An affinity capture method may be used for selectively enriching samples by means of affinity purification methods, wherein the affinity binding partner is immobilized in a column, bead, microtiter plate, membrane or other solid support.
- an “affinity domain” refers to a domain capable of binding a corresponding affinity capture domain with high affinity (e.g., at least I0' 8 M) and specificity.
- Example materials having such properties include biotin, DBT, desthiobiotin, oxybiotin, iminobiotin, diaminobiotin, biotin sulfoxide, biocytin, digoxigenin, glutathione, heparin, maltose, coenzyme A, protein A, Brilliant Blue FCF, azorubine, phytoestrogen, nickel, cobalt, zinc, poly-histidine, HA-tag, c-myc tag, FLAG-tag, S-tag, CBP-tag, dihydrofolate reductase, a hapten to an antibody, a mono- or oligosaccharide ligand to a lectin, hormones, cytokines, toxins, dyes, and vitamins.
- Such molecules may be fuse
- buffer or “buffering agent” refers to an agent that, when in solution or in contact with a solution, contributes to or causes such solution to resist changes in pH upon addition of acid(s) or alkali(s) to the solution.
- suitable non-naturally occurring buffering agents include, for example, any of Tris, HEPES, TAPS, MOPS, tri cine, and MES.
- Coupled reaction refers to a reaction in which two or more reaction steps occur in a single reaction mixture and in a single reaction location (e.g., a tube, a container, a vessel, a well, a capillary, a flow cell, a surface or other space) or separate locations that are in fluid communication with one another (e.g., where enzymes are deposited in separate locations on a surface and the locations are immersed in a common fluid comprising, for example, one or more substrates, buffers, reaction intermediates, and/or reaction products).
- a single reaction location e.g., a tube, a container, a vessel, a well, a capillary, a flow cell, a surface or other space
- separate locations that are in fluid communication with one another (e.g., where enzymes are deposited in separate locations on a surface and the locations are immersed in a common fluid comprising, for example, one or more substrates, buffers, reaction intermediates, and/or reaction products).
- a reaction location may be defined by one or more walls (e g., of a tube), a liquid (e.g., a liquid immiscible with a reaction fluid), a fluid (e.g., a gas including, for example gaseous nitrogen or air), a vacuum, or combinations thereof. Sequential reaction steps in a coupled reaction may begin and/or continue without changes to reaction conditions (e.g., without addition or removal of reagents, changes in temperature, pH, volume, or washing) beyond those that arise or follow from the reactions themselves.
- RNA denaturing agent refers to an agent that, in contact with RNA, disrupts intramolecular hydrogen bonding in the RNA by melting existing hydrogen bonds, if present, and/or interfering with formation of new hydrogen bonds.
- An RNA denaturing agent may lack any ribonuclease activity.
- RNA denaturing agents include formamide, dimethylformamide (DMF), guanidinium thiocyanate, sodium salicylate, dimethyl sulfoxide (DMSO), propylene glycol, poly(ethylene glycol (PEG), cetyltrimethylammonium bromide (CTAB), and urea.
- DNA probe refers to a oligodeoxyribonucleotide having a length of 10-20 nucleotides, 10-30 nucleotides, 10-40 nucleotides, 10-50 nucleotides, or 10-200 nucleotides.
- a DNA probe may comprise a sequence complementary to an RNA substrate or complementary to any portion along the length of an RNA substrate.
- a DNA probe sequence may be selected to bind to an RNA substrate or bind to a specific portion of an RNA substrate.
- a DNA probe may have a sequence complementary to an RNA sequence at or near (e.g., within 1-5, 1-10, 1-15, or 1-20 nucleotides of) the 5’ end of the RNA, at or near (e.g., within 1-5, 1-10, 1-15, or 1-20 nucleotides of) the 3’ end of the RNA, or positioned between the 5’ and 3’ ends.
- a DNA probe may comprise a sequence complementary to an RNA sequence comprising and/or adjacent (e.g., within 3-15, 4-14, 5-13, or 6-12 nucleotides of and on the 5’ or 3’ side) to one or more endoribonuclease cut sites.
- a DNA probe When hybridized to a complementary RNA sequence, a DNA probe may limit or block access to one or more endoribonuclease cut sites (e.g., within the duplex) without limiting or blocking access to one or more other endoribonuclease cut sites (e.g., outside the duplex).
- a DNA probe sequence may be selected or configured to produce one or more endoribonuclease digestion products having one or more desired properties (e.g., length, 5’ fragment, 3’ fragment).
- a DNA probe and an RNA substrate may form a duplex flush with the 5’ end of the RNA, offset from the 5’ end by 1-5, 1-10, 1-15, or 1-20 nucleotides, flush with the 3’ end of the RNA, or offset from the 3’ end by 1-5, 1-10, 1-15, or 1-20 nucleotides.
- a DNA probe may comprise solely deoxyribonucleosides or may comprise mostly deoxyribonucleosides with one or more ribonucleosides (e.g., a chimeric probe for use with RNase H).
- a DNA probe may comprise solely phosphate linkages or may include one or more alternate linkages (e.g., phosphorothioate)
- a DNA probe may comprise solely canonical nucleotides or may comprise one or more modified nucleotides.
- a DNA probe may comprise one or more affinity tags (e.g., biotin).
- fusion refers to two or more polypeptides, subunits, or proteins covalently joined to one another (e.g., by a peptide bond).
- a protein fusion may refer to a non-naturally occurring polypeptide comprising a protein of interest covalently joined to a second polypeptide.
- a second polypeptide may confer upon the fusion one or more desirable properties over the protein of interest alone.
- a second polypeptide may provide an additional binding property (e g., an affinity and/or purification tag), a selection and/or detection tag (e.g., a reporter protein)
- a second polypeptide include a reporter protein, a purification tag (e.g., maltose binding protein, a histidine tag), and expression tag, a polynucleotide binding protein, an enzyme, a conjugation tag (e.g., a SNAP® tag), and a peptide linker.
- the protein of interest may be nearer to the N-terminal end or nearer to the C-terminal end than the second polypeptide to which it is joined.
- a fusion may comprise a non-naturally occurring combined polypeptide chain comprising two proteins or two protein domains joined directly to each other by a peptide bond or joined through a peptide linker.
- An example fusion may include an MBP and an hRNA4.
- human endoribonuclease 4 refers to a human protein encoded by the RNASE4 gene, having endoribonuclease activity, and cutting RNA at UR. It is an example of a strand-specific, sequence specific endoribonuclease and an example of an RNase 4.
- Human endoribonuclease 4 may also be referred to as “hRNase 4”, “Homo sapiens RNase4”, “hRNase IV”, “hRNase4”, or “Hs RNase4”.
- hRNase 4 is one of the eight members of the human RNase A superfamily of endoribonucleases (Lu et al., Immune Modulation by Human Secreted RNases at the Extracellular Space, Front Immunol. 2018, 9:1012). hRNase 4 retains a high interspecies homology within mammals, and shares conserved structural features with non-mammalian vertebrate RNases.
- Example hRNase 4 amino acid sequences include SEQ ID NOS: 43-45.
- hRNase 4 shows strong selectivity for RNA recognition with preference for uridine at the main binding site.
- hRNase 4 preference for uridine over cytidine in comparison to other family members can be correlated with some structural features at the binding pocket, such as the presence of an asparagine residue at position 80. Substitution of Asp80 to alanine reduces preference for uridine over cytidine.
- immobilized refers to covalent attachment to a solid support with or without a linker.
- solid supports include beads (e.g, magnetic, agarose, polystyrene, polyacrylamide, chitin).
- Beads may include one or more surface modifications (e.g., O 6 -benzyleguanine, polyethylene glycol) that facilitate covalent attachment and/or activity of an enzyme of interest.
- a support may comprise a ligand and an enzyme may have a receptor for such ligand or an enzyme may comprise a ligand and a support may comprise a receptor for such ligand.
- Receptor-ligand binding may be covalent or non-covalent.
- Non-covalent attachment may be useful in some embodiments, for example, where the level of dissociation of the binding partner is deemed tolerable.
- a linker may be disposed, for example, between a support and an enzyme or between a support and a DNA probe.
- a linker disposed between a support and an enzyme may have a first covalent bond to the support and a second covalent bond to the enzyme.
- An immobilized enzyme comprising a ligand-receptor attachment may have a linker disposed between the support and the ligand-receptor attachment, a linker disposed between the enzyme and the ligand-receptor attachment, or both.
- An immobilized enzyme comprising a linker may also comprise an optional covalent bond directly between the enzyme and the support.
- a linker may be of any desired length and have any desired range of motion.
- a peptide linker may comprise one or more repeats (e.g., 1-10 repeats) of glycine-serine.
- non-naturally occurring refers to a polynucleotide, polypeptide, carbohydrate, lipid, or composition that does not exist in nature.
- a polynucleotide, polypeptide, carbohydrate, lipid, or composition may differ from naturally occurring polynucleotides polypeptides, carbohydrates, lipids, or compositions in one or more respects.
- a polymer e.g., a polynucleotide, polypeptide, or carbohydrate
- the component building blocks e.g., nucleotide sequence, amino acid sequence, or sugar molecules.
- a polymer may differ from a naturally occurring polymer with respect to the molecule(s) to which it is linked.
- a “non- naturally occurring” protein may differ from naturally occurring proteins in its secondary, tertiary, or quaternary structure, by having a chemical bond (e.g., a covalent bond including a peptide bond, a phosphate bond, a disulfide bond, an ester bond, and ether bond, and others) to a polypeptide (e.g., a fusion protein), a lipid, a carbohydrate, or any other molecule.
- a chemical bond e.g., a covalent bond including a peptide bond, a phosphate bond, a disulfide bond, an ester bond, and ether bond, and others
- a “non-naturally occurring” polynucleotide or nucleic acid may contain one or more other modifications (e.g., an added label or other moiety) to the 5’ - end, the 3’ end, and/or between the 5’- and 3’-ends (e.g., methylation) of the nucleic acid.
- a “non-naturally occurring” composition may differ from naturally occurring compositions in one or more of the following respects: (a) having components that are not combined in nature, (b) having components in concentrations not found in nature, (c) lacking one or more components otherwise found in naturally occurring compositions (e.g., a cell-free composition, a chromosome-free composition, a histone-free composition, a polymerase-free composition, a cell membrane-free composition, a lyophilized composition), (d) having a form not found in nature, e.g., dried, freeze dried, crystalline, aqueous, and (e) having one or more additional components beyond those found in nature (e.g., buffering agents, a detergent, a dye, a solvent or a preservative).
- a cell-free composition e.g., a chromosome-free composition, a histone-free composition, a polymerase-free composition, a cell membrane-free composition, a lyophilized composition
- nucleotide refers to a molecule comprising a base, a sugar and one or more phosphate groups.
- a base also referred to as a “nitrogenous base” or a “nucleobase” may be a purine or pyrimidine.
- a sugar may be a five-carbon ribose (as in ribonucleotides) or a 2-deoxyribose (as in deoxyribonucleotides), which is bound via a glycosidic linkage to the base.
- Nucleotides may have one, two or three phosphate groups (mono-, di- or triphosphates).
- Phosphate groups may form a chemical bond at the 5-carbon position of the sugar, although they may also bond at the 2 or 3-carbon positions of the sugar group. Cyclic nucleotides form when a phosphate group is bound to two hydroxyl groups on the sugar.
- a “nucleoside” comprises a nucleobase and sugar. A nucleotide may also be called a nucleoside mono-, di- or triphosphate.
- oligoribonucleotide refers to a polymer of ribonucleotides that are less than 500 nucleotides long, less than 200 nucleotides long or less than 100 nucleotides long.
- oligoribonucleotides may be 4-80 nucleotides long, 4-60 nucleotides long, or 4-40 nucleotides long.
- An oligoribonucleotide may be an RNA substrate.
- Ribonuclease refers to a nuclease that catalyzes the cleavage of RNA into smaller components. Ribonucleases include endoribonucleases and exoribonucleases. Ribonucleases may cleave single- stranded RNA, double-stranded RNA, or single-stranded RNA and double-stranded RNA. Examples of ribonucleases may include hRNase 4, RNase Tl, RNase U2, RNase A, Colicin E5, MCI, Cusativin, Csxl, MazF, ChpB, MqsR, and YafO.
- methods and compositions may exclude one or more of the foregoing example ribonucleases.
- An endoribonuclease may have mononucleotide specificity, dinucleotide specificity, trinucleotide specificity, or higher nucleotide specificity.
- an endoribonuclease with a single dinucleotide specificity might be expected to cleave RNA substrates (having a random distribution of all 4 bases within their sequences) on average once every 16 nucleotides.
- An endoribonuclease having specificity for one or more dinucleotide or trinucleotide combinations may cleave an RNA substrate more frequently, for example, on average once every 6 to 12 nucleotides, for example on average once every 8 nucleotides (e.g., calculated with reference to an RNA substrate having a random distribution of all 4 bases within its sequence).
- Examples of endoribonucleases with specificity for one or more dinucleotide combinations are those whose specificity comprise a main nucleotide anchoring site (referred as Bl site) and a secondary nucleotide binding site (referred as B2 site).
- a selected endoribonuclease is capable of cleaving a 3 ',5 ' phosphodiester bond between Bl and B2 sites with selectivity for one of: uridine at the main anchoring site Bl and pyrimidines at the secondary site B2; or cytidine at the main anchoring site Bl and pyrimidines at the secondary site B2; or adenosine at the main anchoring site Bl and pyrimidines at the secondary site B2; or guanosine at the main anchoring site Bl and pyrimidines at the secondary site B2; or uridine at the main anchoring site Bl and purines at the secondary site B2; or cytidine at the main anchoring site Bl and purines at the secondary site B2; or adenosine at the main anchoring site B 1 and purines at the secondary site B2; or guanosine at the main anchoring site Bl and purines at the secondary site B2.
- Example endoribonucleases include those whose specificity comprise a secondary nucleotide binding at the Bl site and a main nucleotide anchoring binding at the B2 site. Such examples include endoribonucleases that are capable of cleaving a 3 ',5' phosphodiester bond between Bl and B2 sites with selectivity for one of: purines at the secondary site Bl and uridine at the main anchoring site B2; or purines at the secondary site Bl and cytidine at the main anchoring site B2; or purines at the secondary site Bl and adenosine at the main anchoring site B2; or purines at the secondary site Bl and guanosine at the main anchoring site B2; or pyrimidines at the secondary site Bl and uridine at the main anchoring site B2; or pyrimidines at the secondary site Bl and cytidine at the main anchoring site B2; or pyrimidines at the secondary site Bl and aden
- Hs RNase4 Homo sapiens
- Hs RNases 2, 3, 6, and 7 Hs RNases 2, 3, 6, and 7 (cleave either uridine or cytidine at Bl position with strong preference for adenosine at B2 position)
- Rana pipiens (Rp) RNase, Chelonia mydas (Cm) RNasel, and Gallus gallus (Gg) RNasel cleave either uridine or cytidine at Bl position with preference for guanosine at B2 position.
- the endoribonuclease may present a mild preference for a given nucleotide at Bl or B2 positions, and this preference may be tuned in such a way (for example, by dilution of enzyme concentration, by buffer change, by pH change, or by temperature change) that the endonuclease may effectively cleave at frequencies that are on average once every 6 to 12 nucleotides.
- Those examples include endoribonucleases such as Hs RNase5 (cleaves either uridine or cytidine at Bl position with mild preference for adenosine over guanosine at B2 position).
- An endoribonuclease may have any desired form, for example, a fluid form (e.g., with or without glycerol), a lyophilized form, a dried form, and/or an immobilized form.
- ribonuclease inhibitor refers to a material that reduce (e.g., partially or completely) the RNA cleavage activity of a ribonuclease.
- endoribonuclease inhibitors include human placental RNase inhibitor, murine RNase inhibitor, ribonucleoside-vanadyl complex, guanidine thiocyanate, IRE1 RNase inhibitor, diethyl pyrocarbonate (DEPC), egtazic acid (EGTA), ethylenediaminetetraacetic acid (EDTA), and any combination thereof.
- a ribonuclease inhibitor may have any desired form, for example, a fluid form (e.g., with or without glycerol), a lyophilized form, a dried form, and/or an immobilized form.
- a ribonuclease inhibitor may bind to a ribonuclease (e.g., a susceptible ribonuclease) with high affinity, for example, an affinity similar to the affinity of avidin and biotin.
- RNA end repair refers to a process of converting RNA phosphorylated ends (e.g., cyclic and/or linear phosphorylated ends) into RNA hydroxylated ends (e.g., 5’-OH, 2'-OH and/or 3'-OH ends). RNA end repair, in the context of the present disclosure, excludes ligation of 5’ and 3’ ends to one another.
- RNA end repair enzyme refers to an enzyme that performs RNA end repair and comprises both phosphodiesterase (PDE) and phosphomonoesterase (PME) activities.
- An RNA end repair enzyme may maintain or manipulate RNA structure in response to RNA breakage events.
- RNA end repair enzymes are present in diverse taxa in all phylogenetic domains of life and repair RNA breaks inflicted by sequence-specific or structure-specific endoribonucleases during physiological RNA processing (e.g., tRNA splicing; kinetoplast mRNA editing) and under conditions of cellular stress (e.g., virus infection; unfolded protein response).
- a repair enzyme may resolve 2', 3'- cyclic-phosphorylated oligoribonucleotide ends, 3 '-phosphorylated oligoribonucleotide ends and/or 2'-phosphorylated oligoribonucleotide ends.
- a repair enzyme may have any desired form, for example, a fluid form (e.g., with or without glycerol), a lyophilized form, a dried form, and/or an immobilized form.
- RNA end repair enzymes include polynucleotide kinases including polynucleotide kinase-phosphatase (Pnkp) enzymes with 5'-hydroxyl kinase, 3 '-phosphatase and/or 2',3'-cyclic phosphodiesterase activities that function in nucleic acid repair.
- Pnkp polynucleotide kinase-phosphatase
- Enterobacteria phage such as phages RB55 and RB59
- Desulfovibrio sp Shigella phage, Escherichia phage, Yersinia phage, Bacillus cereus, Salmonella phage, Citrobacter phage, Serratia phage, Vibrio phage, Aeromonas phage, Acinetobacter phage, Klebsiella phage, Stenotrophomonas phage, and Staphylococcus aureus (AAA family ATPase).
- the PNKP gene (named pseT) is conserved in many species, including H.
- Example PNKs in this family of enzymes include bacteriophage T4 polynucleotide kinase (T4 PNK; also referred as to T4 polynucleotide kinase-phosphatase or T4 Pnkp) (Das and Shuman, 2013) and Clostridium thermocellum (Cth) polynucleotide kinase-phosphatase.
- T4 PNK also referred as to T4 polynucleotide kinase-phosphatase or T4 Pnkp
- Cth Clostridium thermocellum
- An example RNA end repair enzyme amino acid sequence is SEQ ID NO:46.
- T4 PNK heals 2',3'-cyclic-phosphorylated oligoribonucleotide ends, 3'-phosphorylated oligoribonucleotide ends, and 2'-phosphorylated oligoribonucleotide ends, in each case, resulting in products that comprise a 2 ’,3 ’-hydroxylated (2'-OH, 3'-OH) end.
- T4 PNK heals broken tRNA ends through (i) hydrolysis of a 2’,3’-cyclic phosphate to a 3’-hydroxy end and (ii) phosphorylation of a 5 ’-hydroxy (via its polynucleotide kinase activity) to form a 5’- phosphate end (these tRNA healed ends are eventually sealed by another enzyme, RNA ligase 1, Rnll).
- Phosphorylation of the 5’-OH may be NTP-dependent (e.g., ATP- dependent).
- phosphorylation of the 5’-OH may not occur without an NTP, which produces healed ends comprising 5 ’-OH, 2'-OH and/or 3 '-OH.
- T4 PNK is capable of phosphorylating the 5' end of double- and single-stranded RNA or DNA.
- Cth PNK is a multifunctional enzyme that belongs to a family of RNA end-healing enzymes found in diverse bacteria.
- Cth PNK has three catalytic modules: (i) an N-terminal polynucleotide 5'-kinase; (ii) a central 2', 3 '-phosphatase; and (iii) a C-terminal ligase (Das and Shuman, 2013).
- Cth PNK converts an RNA 2'-phosphate, 3'- phosphate, or a 2', 3 '-cyclic phosphate end to an RNA product comprising a 2'-OH, 3'-OH end by means of its phosphodiesterase and phosphomonoesterase activities.
- Cth PNK may use either Mn(II) or Ni(II) as a metal cofactor.
- RNA substrate refers to any composition including one or more ribonucleotide (RNA) species of one or more lengths from one or more sources.
- An RNA substrate may be obtained from one or more sources, including viruses, prokaryotic cells, eukaryotic cells, or archaea cells.
- An RNA substrate may arise from or include any biological material (e.g., solid, fluid, aerosol) including organs, tissues, tissue cultures, biopsies, blood, lymph, mucous, sputum, skin, saliva, lesions, swabs, sweat, semen, urine, feces, and secretions.
- Biological materials may be fresh or processed (e.g., embedded with a paraffin or other support).
- RNA substrate may arise from or include an environmental sample (e.g., air, water, soil, and/or biota or other substrate), food materials, agricultural materials, medical materials, and/or waste products.
- An RNA substrate may arise from or include RNA from in-vitro transcription (e.g., by the use of RNA polymerases) and/or from chemical synthesis (e.g., by the use of phosphoramidite chemistry or related processes).
- RNA substrate may comprise solely ribonucleosides or may comprise mostly ribonucleosides with one or more deoxyribonucleosides.
- An RNA substrate may comprise solely phosphate linkages or may include one or more alternate linkages (e g., phosphorothioate).
- An RNA substrate may comprise solely canonical nucleotides or may comprise one or more modified nucleotides.
- an RNA substrate may comprise one or more adenosines, cytidines, guanosines, uridines, 1 -methyladenosines, 2- methyladenosines, N 5 -methyladenosines, 5-methylcytidines, 5-hydromethylcytidines, wyosines, 1 -methyl guanosines, 7-methylguanosines, pseudouridines, 1-methy -pseudouridines, 5 -methyluridines, and/or 5-hydroxyuridines.
- An RNA substrate may have any desired length.
- an RNA substrate may have over 50 nucleotides, over 100 nucleotides, or over 200 nucleotides.
- An RNA substrate may have 50-500 nucleotides, 100-1000 nucleotides, or 200-2000 nucleotides.
- An RNA substrate may have 1000-5000 nucleotides, 5000-9000 nucleotides, or 9000-22000 nucleotides.
- An RNA substrate may be linear, folded, or circular.
- An RNA substrate may comprise one or more endoribonuclease cut sites.
- An RNA substrate may comprise, according to some embodiments, a plurality of RNA species, including one or more of in vitro transcribed RNA, artificially synthesized RNA by chemical methods, or RNA obtained from native sources.
- An RNA substrate may include RNA pol I transcripts, RNA pol II transcripts, RNA pol III transcripts, nascent RNA, primase, prokaryotic RNA polymerase, or any combination thereof.
- an RNA substrate may comprise a plurality of RNA species including one or more of single-stranded or double-stranded RNAs.
- RNA substrate may arise from or include messenger RNA (mRNA), ribosomal RNA (rRNA), transfer RNAs (tRNAs), small RNA (sRNA), microRNA (miRNA), long non-coding RNA (IncRNA), circular RNA (circRNA), or any combination thereof.
- mRNA messenger RNA
- rRNA ribosomal RNA
- tRNAs transfer RNAs
- sRNA small RNA
- miRNA microRNA
- IncRNA long non-coding RNA
- circRNA circular RNA
- An RNA substrate may include mature and/or nascent RNA species.
- An RNA substrate may comprise RNAs that are capped or uncapped (eukaryotic mRNAs, except for nascent transcripts and mature uncapped RNA, exhibit a 5’-Gppp cap; archaeal and bacterial mRNAs are typically uncapped and exhibit a terminal 5’ triphosphate).
- the RNA may be naturally or artificially capped (for example with a 5’-m7Gppp cap).
- compositions for analyzing and characterizing RNA may include, according to some embodiments, one or more RNA substrates, one or more endoribonucleases (naturally-occurring or non- naturally occurring variants; having specificity for one or more dinucleotide or trinucleotide combinations, for example, cleaving an RNA substrate on average once every 6 to 12 nucleotides), one or more RNA end repair enzymes (naturally-occurring or non-naturally occurring variants), or combinations thereof.
- endoribonucleases naturally-occurring or non- naturally occurring variants; having specificity for one or more dinucleotide or trinucleotide combinations, for example, cleaving an RNA substrate on average once every 6 to 12 nucleotides
- RNA end repair enzymes naturally-occurring or non-naturally occurring variants
- compositions may comprise one or more of an RNA substrate, an endoribonuclease, and an RNA end repair enzyme, wherein the RNA end repair enzyme is capable of healing RNA ends.
- a composition may comprise an RNA substrate and an endoribonuclease, an RNA substrate, an endoribonuclease, and an RNA end repair enzyme, or an endoribonuclease and an RNA end repair enzyme.
- Compositions may include, according to some embodiments, one or more buffering agents.
- Compositions with a buffer may have, for example, a pH of 5-9, 6-8, 6.7-7.4, 6.8- 7.3, 6.8-8.0, 7.0-8.2, 7.0, 7.5, or 8.0.
- a composition may include a metal ion, examples of which include magnesium(II), manganese(II), cobalt(II), or nickel(II).
- compositions may include one or more RNA denaturing agents including, for example, 0.5 M - 4 M urea (e.g., 1 M urea).
- a composition may comprise, for example, less than 0.5 M urea, less than 0.75 M urea, less than 1.0 M urea, less than 2.0 M urea, less than 3.0 M urea, less than 4.0 M urea, less than 5.0 M urea, less than 6.0 M urea, less than 7.0 M urea, less than 8.0 M urea, 8.0 M urea, more than 7.0 M urea, more than 8.0 M urea.
- a composition may comprise, for example, less than 10% formamide, less than 15% formamide, less than 20% formamide, less than 25% formamide, less than 30% formamide, less than 35% formamide, less than 40% formamide, less than 45% formamide, less than 50% formamide, less than 55% formamide, more than 50% formamide, or more than 55% formamide.
- a composition may include an endoribonuclease (e g., an endoribonuclease having specificity for one or more dinucleotide or trinucleotide combinations, for example, cleaving an RNA substrate on average once every 6 to 12 nucleotides) and one or more RNA denaturing agents.
- a composition may comprise an RNA substrate in any amount and/or at any concentration.
- a composition may comprise less than 1 ng, less than 1 pg, less than 2 pg, less than 3 pg, less than 4 pg, less than 5 pg, less than 6 pg, less than
- a fluid composition may comprise, for example, less than 1 ng/pL, less than 1 pg/pL, less than 2 pg/pL, less than 3 pg/pL, less than 4 pg/pL, less than 5 pg/pL, less than 6 pg/pL, less than 7 pg/pL, less than
- RNA substrate may comprise a subject RNA and one or more additional materials (e.g., impurities and/or supports).
- an RNA substrate comprising a synthetic RNA may also comprise impurities resulting from the process of in vitro synthesizing the RNA, either via an enzymatic process or a chemical process or a combination of both processes.
- An RNA substrate comprising a native RNA may also comprise impurities from or associated with the isolation or enrichment method including, for example, partially degraded or fragmented RNA species, undesired RNA species (e.g., contaminant ribosomal RNA in a mRNA preparation), DNA, and/or proteins.
- RNA substrate may comprise RNA and a solid support (e.g., magnetic or non-magnetic polymeric beads), for example, where the RNA is attached to the solid support through its 5’ end, through its 3’ end or through an internal nucleotide, in each case, with or without an optional linker (e.g., a linear or branched linker).
- An optional linker may serve as steric spacer and does not necessarily have to be of defined length. Examples of suitable linkers may be selected from any of the hetero-bifunctional cross-linking molecules described by Hermanson, Bioconjugate Techniques, 2nd Ed; Academic Press: London, Bioconjugate Reagents, pp 276-335 (2008), incorporated by reference.
- An optional linker may be a flexible linker connecting the solid support to one or a plurality of same or different RNAs.
- An endoribonuclease may be expressed in E. coll, such as the periplasm of E. coll, or Pichia pastoris and purified utilizing an affinity tag.
- an endoribonuclease may have a discrete substrate specificity.
- an endoribonuclease may have the capacity to cleave an RNA 3', 5 ' phosphodiester bond with specific activity towards a nucleotide or a combination of one or more nucleotide sequences comprising 2-7 nucleotides each; or towards a structural element such as a stem, an internal loop, a multibranch loop, or a pseudoknot.
- the substrate specificity of an endoribonuclease may include recognition and cleavage of one or more modified nucleotides (e.g., pseudouridine, 1 -methylpseudouridine, 5-methoxyuridine, 5-methylcytidine, 6- methyladenosine, and inosine).
- modified nucleotides e.g., pseudouridine, 1 -methylpseudouridine, 5-methoxyuridine, 5-methylcytidine, 6- methyladenosine, and inosine.
- RNA end repair enzymes in some embodiments, may have both phosphodiesterase and phosphomonoesterase activities.
- a composition comprising an RNA end repair enzyme and an endoribonuclease, optionally in a denaturing buffering solution, may be used to prepare oligoribonucleotide mixtures from an RNA substrate.
- a composition of T4 PNK and an endoribonuclease, optionally in a denaturing buffering solution is used to prepare oligoribonucleotide mixtures from an RNA substrate.
- pre-heating the RNA substrate and/or including an RNA denaturing agent in the reaction mixture may reduce the impact of RNA structure (e.g., Watson-Crick base pairing and/or other intra- and/or intermolecular hydrogen bonding) on the production of endoribonuclease digestion products.
- RNA structure e.g., Watson-Crick base pairing and/or other intra- and/or intermolecular hydrogen bonding
- Compositions may include one or more endoribonucleases that are capable of cleaving 3 ',5' phosphodiester bonds with specific activity towards a nucleotide or a sequence of one or more nucleotides, towards one or more nucleotide modifications, or towards a structural element such as a stem, an internal loop, a multibranch loop, a pseudoknot, a duplex segment, a triplex segment, or a quadruplex segment.
- endoribonucleases that are capable of cleaving 3 ',5' phosphodiester bonds with specific activity towards a nucleotide or a sequence of one or more nucleotides, towards one or more nucleotide modifications, or towards a structural element such as a stem, an internal loop, a multibranch loop, a pseudoknot, a duplex segment, a triplex segment, or a quadruplex segment.
- Examples include endoribonucleases of the RNase A superfamily that cleave a 3 ',5' phosphodiester bond with specificity for pyrimidines at the main anchoring site (often called Bl site) and preference for purines at the secondary site (often called B2 site).
- Illustrative examples are hRNase5 (cuts both uridine and cytidine at B 1 position and shows only a mild preference for adenosine over guanosine at B2 position), hRNase 4 (shows a significant preference for cutting uridine at Bl position and a minor preference for adenosine over guanosine at B2 position), and hRNases 2, 3, 6, and 7 (cut both uridine and cytidine at B 1 position and do not have any detectable activity for guanosine at B2 position).
- mutant endoribonucleases such as porcine RNase4 D80A, wherein the substitution of Asp80 by alanine decreased the preference for cutting uridine at Bl position and increased the preference for cutting cytidine at B 1 position.
- endoribonucleases with specificity for pyrimidines at the main anchoring site Bl are some enzymes of the RNase T2 family, such as RNase MCI, which has been isolated from seeds of Momordica charantia (specificity for uridine at Bl position), and RNase Cusativin, which has been isolated from Cucumis sativus (specificity for uridine at Bl position).
- endoribonucleases examples include endoribonucleases that are capable of cleaving a 3 ',5' phosphodiester bond with specificity for purines at the main anchoring site Bl.
- endoribonucleases of the RNase T1 superfamily specificity for guanosine at Bl position
- RNase U2 purine-specific at Bl position
- Csxl specificity for adenosine at Bl position
- endoribonucleases also include endoribonucleases that are part of toxinantitoxin systems in bacteria or archaea. Endoribonucleases that are part of toxin-antitoxin systems may have a wider recognition cleavage site. Examples of endoribonucleases that are part of toxin-antitoxin systems include E. coli MazF (preferentially cuts before ACA trinucleotide motif), ChpB (preferentially cuts after uridine in UAC trinucleotide motiftrinucleotide), MqsR (preferentially cuts after guanosine in GC dinucleotide motif), and YafO (preferentially cuts after uridine).
- E. coli MazF preferentially cuts before ACA trinucleotide motif
- ChpB preferentially cuts after uridine in UAC trinucleotide motiftrinucleotide
- MqsR preferential
- endoribonucleases include thermostable endoribonucleases, for example, endoribonucleases that are active at temperatures above 50°C, above 55°C, above 60°C, above 65°C, above 70°C, above 80°C, or above 90°C.
- Endoribonucleases that are capable of cleaving a subject RNA at such high temperatures may support cleavage of an RNA in absence of a denaturing reagent and/or eliminate the prior step of heating the RNA sample in a low salt solution (e g , up to 50 mM salt) to reduce RNA structure biases during the digestion reaction.
- a low salt solution may comprise, for example, sodium chloride, magnesium sulfate, potassium nitrate, and /or sodium bicarbonate.
- an endoribonuclease utilized for digestion of an mRNA may originate from a vertebrate species (for example, Homo sapiens, Sus scr fa), a bacterial species (for example, Escherichia coli), a fungus species (for example, Aspergillus oryzae), a plant species (for example, Momordica charantia, Cucumis sativus). and an archaea species (for example, Pyrococcus furiosus).
- a recombinant endoribonuclease is expressed in the periplasm of E. coli or Pichia pastoris and purified utilizing an affinity tag.
- An enzyme comprising phosphodiesterase and phosphomonoesterase activities may be included in a composition with oligonucleotides having one or more ends that are 2', 3'- cyclic-phosphorylated, 3'-phosphorylated and/or 2'-phosphorylated. Contacting such an enzyme with an oligoribonucleotide may dephosphorylate one or more ends.
- a composition may comprise 0.1 to 10 pL (e.g., 1 to 3 pL) of an endoribonuclease, wherein 1 pL of a given endoribonuclease is capable of cleaving RNA with catalytic activity comparable to that of 1 pL of commercially available RNase T1 (1000 U/pL; ThermoFisher Scientific #EN0541).
- a composition may comprise 50 to 500 U/pL (e.g., 100 to 200 U/pL) of an RNA end repair enzyme.
- the ratio of RNA substrate to endoribonuclease may be from 0.1 to 10 pg of RNA substrate per 1 pL of the endoribonuclease, preferably 1 to 10 pg of RNA substrate per 1 pL of the endoribonuclease. It may be desirable to decrease the ratio of RNA substrate to endoribonuclease where the RNA substrate comprises modified nucleotides. For example, the ratio of RNA substrate to endoribonuclease may be decreased as a function of the proportion of modified nucleotides present.
- a ratio of RNA substrate to hRNase 4 of 10 pg (of substrate)/l pL (of enzyme), of 5 pg/1 pL, of 2 pg/ 1 pL, 1 pg/1 pL, or of 0.1 pg/1 pL may be used to digest a fully-modified (e.g., all uridines replaced with 1- methylpseudouridine) mRNA comprising about 800 nucleotides (e.g., EPO mRNA).
- the ratio of RNA end repair enzymes to endoribonuclease may be from 0.1:1 to 0.2:1 to 0.5: 1 to 1:1 to 1 :2 to 1:5 to 1 :10. .
- RNA substrate comprises longer RNA substrates (for example, greater than 1000 nucleotides, greater than 2000 nucleotides, greater than 3000 nucleotides, greater than 5000 nucleotides).
- the ratio of RNA end repair enzymes to endoribonuclease may be increased as a function of the length of the RNA substrate present.
- a ratio of T4 PNK to hRNase 4 of 40 U/l pL, of 80 U/l pL, of 160 U/l pL, 320 U/l pL, or of 500 U/l pL may be used to digest a mRNA comprising about 800 nucleotides (e g , EPO mRNA).
- an enzyme e g., an endoribonuclease and/or an end repair enzyme
- a ribonuclease inhibitor e g., a DNA probe
- an enzyme may be immobilized to a solid support, including covalent bonding to the support surface and non-covalent interaction (binding by adsorption, e. g. cationic, anionic, lipophilic, or hydrophilic surfaces) of the enzyme with the surface.
- Covalent immobilization may include reaction of an active functional group on the enzyme with an activated functional group on the solid support.
- reactive functional groups include amines, hydroxyl amines, hydrazines, hydrazides, thiols, phosphines, isothiocyanates, isocyanates, N-hydroxysuccinimide (NHS) esters, carbodiimides, thioesters, haloacetyl derivatives, sulfonyl chlorides, nitro- and dinitrophenyl esters, tosylates, mesylates, tritiates, maleimides, disulfides, carboxyl groups, hydroxyl groups, carbonyldiimidazoles, epoxides, aldehydes, acyl-aldehydes, ketones, azides, alkynes, alkenes, nitrones, tetrazines, isonitriles, tetrazoles, and boronates.
- Examples of such reactions include the reaction between an amine and an activated carboxy group forming an amide, between a thiol and a maleimide forming a thioether bond, between an azide and an alkyne derivative undergoing a 1,3 -dipolar cycloaddition reaction, between an amine and an epoxy group, between an amine and another amine functional group reacting with an added bifunctional linker reagent of the type of activated bis-dicarboxylic acid derivative giving rise to two amide bonds, or other combinations known in the art.
- UV-mediated cross-linking or chemi cal -mediated crosslinking can be used for covalent attachment of enzymes to solid supports.
- chemi cal -mediated crosslinking e.g., using formaldehyde or glutaraldehyde
- Disclosed methods may be used/adapted to prepare an immobilized ribonuclease inhibitor and/or an immobilized DNA probe.
- a functional group may be inherently present in the material used for the solid support synthesis or a functional group may be provided by treating or coating the support with a suitable material.
- a functional group may also be introduced by contacting the solid support surface with an appropriate chemical agent. Activation in this context includes a modification of a functional group on the solid support surface to enable coupling of a binding agent to the surface.
- Solid support in this context includes any solid (flexible or rigid) material onto which it is desired to capture and immobilize the enzyme.
- Solid support may be biological, non-biological, organic, inorganic or a combination thereof, and may be in the form of particles, strands, precipitates, gels, sheets, tubings, spheres, containers, capillaries, cartridges, pads, slices, films, plates, slides, and have any convenient shape, including flat, disc, sphere, circle, etc.
- the surface of the solid support may be composed of a variety of materials, for example, polymers, plastics, resins, polysaccharides, silica or silica-based materials, carbon, metals, inorganic glasses, membranes, among others, provided that the surface may support functional groups.
- Examples of a convenient solid support include glass surfaces such as glass slides, microtiter plates, and suitable sensor elements, for example, functionalized polymers (e.g. in the form of beads), chemically modified oxidic surfaces, (e.g. silicon dioxide, tantalum pentoxide or titanium dioxide), or also chemically modified metal surfaces, e.g. noble metal surfaces such as gold or silver, copper or aluminium surfaces, magnetic surfaces, e.g.
- functionalized polymers e.g. in the form of beads
- chemically modified oxidic surfaces e.g. silicon dioxide, tantalum pentoxide or titanium dioxide
- metal surfaces e.g. noble metal surfaces such as gold or silver, copper or aluminium surfaces
- magnetic surfaces e.g.
- quantum dots e.g., III-V (GaN, GaP, GaAs, InP, or InAs) or II- VI (ZnO, ZnS, CdS, CdSe, or CdTe) semiconductors, or Ln- doped fluoride nanocrystals, rare earth-doped oxidic nanomaterials.
- III-V GaN, GaP, GaAs, InP, or InAs
- II- VI ZnO, ZnS, CdS, CdSe, or CdTe
- Ln- doped fluoride nanocrystals rare earth-doped oxidic nanomaterials.
- a solid support surface may be provided with a layer of a polymer, for example, a polymer comprising functional groups to be activated.
- a polymer may be selected from any suitable class of compounds, for example, polyethylene glycols, polyethylene imides, polysaccharides, polypeptides, or polynucleotides, just to name a few. Attachment of the polymers to the support surface may be achieved by a variety of methods which are readily apparent to a person skilled in the art. For example, polymers bearing trichlorosilyl or trisalkoxy groups may be reacted with hydroxyl groups on the substrate surface to form siloxane bonds. Attachment to a gold or silver surface may take place via thiol groups on the polymer.
- the polymer may be attached via an intermediate species, such as a self-assembled monolayer of alkanethiols.
- an intermediate species such as a self-assembled monolayer of alkanethiols.
- the type of polymers selected, and the method selected for attaching the polymers to the surface will thus depend on the polymer having suitable reactivity for being attached to the substrate surface, and on the properties of the polymers regarding non-specific adsorption to, especially, DNA and RNA.
- the functional groups may be present on the polymer or may be added to the polymer by the addition of single or multiple functional groups.
- a spacer arm can be used to provide flexibility to the binding enzyme allowing it to interact with its environment in a way which minimizes steric hindrance with the solid support.
- the solid support surface may comprise additional coating molecules, for example, polyethylene glycols, polyethylene imides, polysaccharides, polypeptides, or polynucleotides, that do not carry a reactive functional group.
- additional coating molecules that do not carry a reactive functional group may increase the specific activity and/or stability of the immobilized enzyme, for example, by providing a local hydrophilic environment that favors the enzyme folding.
- activated functional groups on a solid support may be present on the predefined regions only, or alternatively on the entire surface, are reacted selectively with the functional groups present in the enzyme molecules.
- Suitable reaction conditions including time, temperature, pH, solvent(s), and additives, will depend on inter alia the particular species and may be selected in accordance with conditions for similar reactions.
- Functional group may be inherent to the enzyme amino acid sequence.
- Enzymes may be synthesized to incorporate a desired functional group either through a chemical reaction or through genetic engineering. Amino acids can be modified either chemically or enzymatically with any type of functional group in order to provide the desired reactivity.
- Endoribonucleases and/or RNA repair enzymes may be included in a fusion protein and immobilized on a solid support by means of such fusion protein.
- a fusion protein construct of an endoribonuclease and/or a RNA repair enzyme may generated with, for example, a maltose-binding protein (MBP), a chitin or chitin-binding domain (CBD), a poly-histidine 6xHis or poly-His-tag), a HA-tag, a c-myc tag, a FLAG-tag, a SNAP -tag (U.S. Patent Nos.
- a solid support surface may be coated with an affinity group that is capable of specifically binding to the corresponding protein fusion partner, for example, a maltose moiety for MBP fusions, a benzylguanine (BG) moiety for SNAP -tag fusions, a benzylcytosine (BC) moiety for CLIP-tag fusions, a chloroalkane moiety for Halotag fusions, a peptide sequence (Lys-Glu-Thr-Ala-Ala-Ala-Lys-Phe-Glu-Arg-Gln-His- Met-Asp-Ser) for S-tag fusions, a nickel-nitrilotriacetic acid (Ni-NTA) chelate for His-tag fusions, and so on.
- an affinity group that is capable of specifically binding to the corresponding protein fusion partner
- immobilization is achieved using an affinity binding pair, such as in streptavidin-functionalized on the beads and biotinylated enzymes.
- an affinity binding pair such as in streptavidin-functionalized on the beads and biotinylated enzymes.
- the protein fusion of the endoribonuclease and/or the RNA repair enzyme e.g., with MBP may be used to enhance their solubility and facilitate their proper folding.
- Endoribonucleases and/or RNA repair enzymes may be immobilized on a solid support by means of physical adsorption, for example, where binding is mainly by hydrogen bonds, multiple salt linkages, and/or Van der Waal's forces.
- magnetic or paramagnetic solid supports e.g., silica beads
- negatively charged molecules e.g., carboxyl -containing molecules
- positively charged e.g., amino-containing molecules
- a crowding agent e.g., polyethylene glycol, such as 10-50% PEG
- salt e.g., NaCl, such as 0.1-4 M NaCl
- immobilization may be based on the entrapment of the enzyme within the lattice of a polymer matrix (e.g., synthetic polymers such as polyarylamide and polyvinylalcohol) or of a membrane (e.g., polymeric microcapsule).
- a polymer matrix e.g., synthetic polymers such as polyarylamide and polyvinylalcohol
- membrane e.g., polymeric microcapsule
- One or a plurality of endoribonucleases and/or RNA repair enzymes may be immobilized on the same or different solid supports. They may be immobilized randomly on a given solid surface; or they may be immobilized at a specific arrangement, for example, on specific compartments of a given solid surface, so that the enzymes are arranged in series or in parallel to each other, or a combination of both arrangements. Such arrangement may serve different purposes, such as the sequential treatment of an RNA sample with an endoribonuclease followed by an RNA repair enzyme.
- RNA sample is spatially confined to separate compartments (in the same of different reaction vessels) so that there is no crossreaction of the sample with different enzymes.
- One or a plurality of endoribonucleases and/or the RNA repair enzymes may be immobilized on cartridges and these cartridges may be integrated in LC-MS/MS systems, wherein the cartridges may be individually selected by column selectors according to the requirements of a given experiment and allowing subsequential incubations with any of these enzymes.
- Use of immobilized endoribonucleases and/or RNA repair enzymes may enable automation of one or more RNA sample processing steps (e.g., digestion) prior to downstream analysis (e.g., LC-MS/MS).
- Use of immobilized endoribonucleases and/or RNA repair enzymes may reduce the amount of sample and/or time required for processing the RNA prior to downstream analysis.
- Use of immobilized endoribonucleases and/or RNA repair enzymes may enable miniaturization and/or high-throughput analysis of RNA samples.
- immobilized enzymes may provide the ability to multiplex reactions, streamline reaction processes and workflows, reduce level of degradation byproducts (e.g., unwanted RNA hydrolysis, oxidation, deamination, etc.), reduce manual steps and the risk of manual (human) errors, and importantly, in some cases increase hydrolytic and/or thermal stability of the enzymes (relative to their non-immobilized forms).
- the endoribonuclease(s) and/or the RNA repair enzyme(s) may be irreversibly adsorbed or covalently linked to the solid surface using any one of the methods described in this invention.
- the endoribonuclease(s) and/or the RNA repair enzyme(s) may be stably and efficiently immobilized on a microchip or any column reactor or fluid channel network, in such a way that buffers and reagents are flowed through (e.g., manually or using a peristaltic pump) the reaction vessel.
- Methods may include, for example, preparing oligoribonucleotides from RNA substrates (e.g., total RNA, genomic RNA, messenger RNA, transfer RNA, ribosomal RNA, coding RNA, non-coding RNA, micro RNA, small interfering RNA, nuclear RNA, nucleolar RNA). Methods may include, in some embodiments, contacting an RNA substrate with an RNA denaturing agent to form a denatured RNA substrate.
- RNA substrates e.g., total RNA, genomic RNA, messenger RNA, transfer RNA, ribosomal RNA, coding RNA, non-coding RNA, micro RNA, small interfering RNA, nuclear RNA, nucleolar RNA.
- a method may include heating an RNA sample (e.g., at 90°C for 10 min) in a low salt solution (e.g., containing 0-50 mMNaCl) or in a denaturing solution (e.g., containing 3 M urea) to form the denatured RNA substrate.
- a denaturing agent if used, may be separated from the denatured RNA substrate (e.g., by dialysis, affinity or size-exclusion chromatography or other methods).
- Compositions including RNA substrates and RNA denaturing agents may be diluted (e g., more than 10-fold, more than 100-fold, more than 500-fold, more than 1000-fold).
- compositions including RNA substrates and RNA denaturing agents may be diluted to reduce the impact of included RNA denaturing agent(s) on enzymes in one or more subsequent steps. Dilution may reduce the RNA denaturing agent to a concentration that permits an enzyme in a subsequent (e.g., an endoribonuclease and/or an RNA end repair enzyme) step to have at least 1% of its activity in the absence of such RNA denaturing agent(s) (e.g., under otherwise the same conditions of temperature, pH, enzyme concentration, substrate concentration, kind and concentration of buffer, and/or other components).
- a subsequent e.g., an endoribonuclease and/or an RNA end repair enzyme
- Digestion of RNA with some endoribonucleases may produce a mixture of cleavage products comprising 2',3'-cyclic-phosphate (sometimes also referred as to 2’,3’- phosphodiester) and 3 '-phosphate (sometimes also referred as to 3 '-linear phosphate or 3’- phosphomonoester) termini, whereas some other endoribonucleases may produce a mixture of cleavage products comprising 2',3'-cyclic-phosphate and 2'-phosphate (sometimes also referred as to 2'-linear phosphate or 2’ -phosphomonoester) termini.
- 2',3'-cyclic-phosphate sometimes also referred as to 2’,3’- phosphodiester
- 3 '-phosphate sometimes also referred as to 3 '-linear phosphate or 3’- phosphomonoester
- the extent of formation of 2',3'-cyclic-phosphate and 2’- or 3'-linear phosphate may depend on the enzyme concentration, the digestion buffer and/or incubation time.
- a mixture of cleavage products may also comprise 2',3'-hydroxylated species
- Enzyme-independent hydrolytic opening of 2',3'-cyclic-phosphate may generate a mixture comprising 2',3'-cyclic-phosphate, 3'- phosphate, 2'-phosphate, and/or 2',3'-hydroxy termini in any combination.
- Enzymeindependent hydrolytic cleavage of RNA may further produce a mixture of 5’-phosphate and 5 ’-hydroxy termini. The potential presence of any of these products, in any combination, can convolute analysis by mass spectrometry techniques.
- Methods may include contacting an RNA substrate (or a denatured RNA substrate) with a composition comprising an endoribonuclease and/or an optional RNA end repair enzyme under conditions (e.g., temperature, pH, enzyme and substrate concentrations, and buffers or other components) permitting the RNA substrate (or the denatured RNA substrate) to be cleaved and oligoribonucleotides to be formed.
- the optional RNA end repair enzyme may be omitted.
- Methods may further comprise analyzing oligoribonucleotides (e.g., oligoribonucleotides formed by digestion of an RNA substrate with an endoribonuclease) by LC-MS/MS.
- oligoribonucleotides may be analyzed by capillary electrophoresis-mass spectrometry (CE-MS).
- CE-MS capillary electrophoresis-mass spectrometry
- oligoribonucleotides may be analyzed by gel electrophoresis.
- LC- MS/MS and/or CE-MS are used to determine the masses and/or fragmentation profiles of species in compositions of oligoribonucleotides (e g., oligoribonucleotides formed by digestion of an RNA substrate with an endoribonuclease).
- oligoribonucleotides e g., oligoribonucleotides formed by digestion of an RNA substrate with an endoribonuclease.
- Methods may include contacting an RNA substrate with an RNA substrate binding molecule to form a complex, the complex comprising a binding molecule-RNA substrate interface and single-stranded RNA substrate portion.
- an RNA substrate binding molecule may include a DNA probe (e.g., at least partially complementary to the RNA substrate), an RNA probe (e.g., at least partially complementary to the RNA substrate), a synthetic nucleic acid probe (e.g., a locked nucleic acid that is at least partially complementary to the RNA substrate), an RNA binding protein, an antibody, an RNA ligand (e.g., adenosylcobalamin, lysine, glycine, flavin mononucleotide, fluorescent dyes, and drugs including, for example, branaplam and risdiplam), divalent ions (e.g., salts of magnesium, calcium, zinc, manganese, etc.), ribosomes, and lipid-based membranes.
- divalent ions
- a binding molecule-RNA substrate interface may comprise one or more endoribonuclease cut sites for which access by the corresponding endoribonuclease is limited.
- a single-stranded RNA substrate portion may comprise one or more endoribonuclease cut sites that are accessible to the corresponding endoribonuclease.
- a method may comprise, in some embodiments, contacting a complex with an endoribonuclease to form cleavage products.
- Cleavage products may include (two or more) fragments of the single- stranded RNA substrate portion and a cleaved binding molecule-RNA substrate interface, the RNA component of which remains uncut by the endoribonuclease and wherein the site(s) of cleavage of the cleaved binding molecule-RNA substrate interface are adjacent to the interface (e.g., not within the interface).
- methods may include hybridizing an RNA substrate to at least one DNA probe to form an RNA/DNA duplex comprising a doublestranded portion and at least one single- stranded portion.
- a double- stranded portion of a duplex may comprise one or more endoribonuclease cut sites.
- a single-stranded portion of a duplex may comprise one or more endoribonuclease cut sites.
- a method may include contacting a duplex and an endoribonuclease to form cleavage products, the cleavage products comprising two or more fragments of the single-stranded portion and a cleaved double-stranded portion, the RNA component of which remains uncut by the endoribonuclease.
- a method may include assessing the integrity, identity, presence and/or purity of a target RNA in a sample (e.g., through “fingerprinting”, “signature profiling”, and/or “ID testing”) and/or confirming the identity of an RNA produced by synthesis or isolated from native sources.
- ID testing may be performed by HPLC retention time analysis, intact mass analysis, failure sequence analysis, MS/MS sequencing, MS-fragmentation pattern analysis, NMR, melting temperature analysis, or any combination thereof.
- Methods may include, according to some embodiments, de novo sequencing a subject RNA (e.g., RNA in an RNA substrate) using mass spectrometry including sequencing oligoribonucleotides and assembling resulting sequences to form an assembled sequence corresponding to the subject RNA.
- these oligoribonucleotide mixtures are used for determining the identity and location of a modified nucleotide in an RNA substrate (“modification mapping”).
- oligoribonucleotides from RNA substrates may be used for characterizing impurities in an RNA sample.
- Impurities may include, for example, truncated RNA species, protracted RNA species, for example, obtained from read-through synthesis, degraded RNA species, RNA species containing nucleotide misincorporations, deletions, or additions, RNA species containing impurities derived from phosphoramidite-based synthesis, such as RNA containing residual protective groups (e.g., DMT, CEP, TBDMS, Bz, iBu, and others), RNA containing depurinated bases, and RNA containing by-products of RNA synthesis and deprotection, such as cyanoethyl adducts; carried-over reagents (e g., plasmids), exogenous nucleic acid contaminants, and any combination of the foregoing.
- RNA containing residual protective groups e.g., DMT, CEP
- Methods may further comprise analyzing activity and/or specificity of an enzyme (e.g., a ligase, a polymerase, a transferase, a methyltransferase, a carbamoyltransferase, a glycosyltransferase, an acyltransferase, an aminotransferase, a peptidyltransferase, a pseudouridine synthase, a transglycosylase, a transaminase, a glycosidase, a capping enzyme, a decapping enzyme, a kinase, a phosphatase, a nuclease (endo or exo), a lyase, an oxidoreductase, and/or a deaminase) by analyzing RNA products of such enzyme.
- an enzyme e.g., a ligase, a polymerase, a transferase
- methods include digestion of an RNA substrate using a composition of at least one endoribonuclease and at least one an RNA end repair enzyme, wherein the RNA end repair enzyme comprises both phosphodiesterase (PDE) and phosphomonoesterase (PME) activities, in a buffering solution optionally containing an RNA denaturing agent.
- PDE phosphodiesterase
- PME phosphomonoesterase
- An example of an RNA end repair enzyme comprising both phosphodiesterase and phosphomonoesterase activities is the bacteriophage T4 polynucleotide kinase (T4 PNK; also referred as to T4 polynucleotide kinase-phosphatase or T4 Pnkp) (Das and Shuman, 2013).
- T4 PNK heals each of 2',3'-cyclic-phosphorylated, 3'- phosphorylated and 2'-phosphorylated oligoribonucleotide ends resulting in products that comprise a 2’,3’-hydroxylated (2'-OH, 3'-OH) end.
- co-incubation of T4 PNK with an endoribonuclease resolves 2',3'-cyclic-phosphorylated, 3'-phosphorylated and/or 2'- phosphorylated oligoribonucleotides that may be produced upon endoribonuclease cleavage.
- T4 PNK By converting 2',3'-cyclic-phosphorylated, 3 '-phosphorylated and/or 2'-phosphorylated oligoribonucleotide ends into 2’, 3 ’-hydroxylated ends, T4 PNK reduces spectral complexity and enhances the mass signal of endoribonuclease digestion products.
- Methods may comprise contacting a polyribonucleotide substrate with an endoribonuclease to form a cleaved polyribonucleotide product and contacting the cleaved product with an RNA end repair enzyme to form a polyribonucleotide cleavage product with healed ends (e.g., 5’ ends comprising a 5’-OH and/or 3’ ends comprising a 3’-OH and/or 2’- OH).
- healed ends e.g., 5’ ends comprising a 5’-OH and/or 3’ ends comprising a 3’-OH and/or 2’- OH.
- Contacting in some embodiments, may be performed in sequential steps (e.g., contact with endoribonuclease followed by repair enzyme, often with an intervening cleanup step) or concurrently as a coupled reaction (e g., in a single compartment, tube, container, vessel or other space).
- reactions may be concurrent if they overlap in time with one another, even if their start times and/or completions times are not synchronized.
- a method may comprise simultaneously adding an endoribonuclease and an RNA end repair enzyme to a composition comprising an RNA substrate (and, optionally, a buffering agent and/or an RNA denaturing agent).
- RNA product with healed ends may be subjected to characterization by tandem liquid chromatography-mass spectrometry (LC-MS) or by tandem capillary electrophoresis-mass spectrometry (CE-MS).
- LC-MS liquid chromatography-mass spectrometry
- CE-MS tandem capillary electrophoresis-mass spectrometry
- An enzyme comprising a phosphodiesterase and phosphomonoesterase may be contacted with (e.g., added to) a composition comprising oligoribonucleotides having one or more 2',3'-cyclic-phosphorylated, 3 '-phosphorylated and/or 2'-phosphorylated ends to produce one or more dephosphorylated ends.
- Oligoribonucleotides having one or more 2', 3'- cyclic-phosphorylated, 3'-phosphorylated and/or 2'-phosphorylated ends may be provided as such or, in some embodiments, an RNA substrate may be contacted with an endoribonuclease in the same composition or space to form the oligonucleotides.
- a method may comprise contacting an RNA substrate with an endoribonuclease to form oligoribonucleotides having one or more 2',3'-cyclic- phosphorylated, 3'-phosphorylated and/or 2' -phosphorylated ends and, following exhaustion of RNA substrate, contacting the oligoribonucleotides with an enzyme comprising a phosphodiesterase and a phosphomonoesterase to form one or more dephosphorylated ends.
- a method may further comprise purifying the oligoribonucleotides prior to contact with the enzyme comprising a phosphodiesterase and a phosphomonoesterase.
- a method may include, according to some embodiments, incubating (e.g., heating) an RNA substrate (e.g., prior to or upon contacting with an endoribonuclease) to form a denatured or melted RNA substrate.
- Incubating an RNA substrate may comprise maintaining the RNA substrate at a temperature of 65°C or higher, for example, 65°C - 75°C, 70°C - 80°C, 75°C - 85°C, 80°C - 90°C, 85°C - 95°C, 90°C - 100°C, more than 95°C, or more than 100°C.
- Heating may comprise maintaining the RNA substrate at a selected temperature for up to a minute, 1-10 minutes, at least a minute, at least 5 minutes, at least 6 minutes, at least 7 minutes, at least 8 minutes, at least 9 minutes, at least 10 minutes, or up to 20 minutes.
- a method may comprise contacting an RNA substrate (e.g., a melted RNA substrate) with an endoribonuclease at a temperature of less than 30°C, 25°C - 35°C, 30°C - 40°C, 35°C - 45°C, 37°C, 40°C - 50°C, 45°C - 55°C, 50°C - 60°C, more than 50°C, or more than 55°C.
- an RNA substrate e.g., a melted RNA substrate
- an endoribonuclease at a temperature of less than 30°C, 25°C - 35°C, 30°C - 40°C, 35°C - 45°C, 37°C, 40°C - 50°C, 45°C - 55°C, 50°C - 60°C, more than 50°C, or more than 55°C.
- a method may include, according to some embodiments, contacting an RNA substrate (e g., a melted RNA substrate) with an endoribonuclease for 30- 120 minutes, up to 30 minutes, at least 30 minutes, up to 45 minutes, up to 60 minutes, up to 75 minutes, up to 90 minutes, up to 105 minutes, up to 120 minutes, or at least 105 minutes.
- an RNA substrate e g., a melted RNA substrate
- endoribonuclease for 30- 120 minutes, up to 30 minutes, at least 30 minutes, up to 45 minutes, up to 60 minutes, up to 75 minutes, up to 90 minutes, up to 105 minutes, up to 120 minutes, or at least 105 minutes.
- a method may comprise fingerprinting 2',3'-hydroxylated oligonucleotides (e.g., arising from a subject RNA following contact with an endoribonuclease and an RNA end repair enzyme) by LC-MS analysis.
- Fingerprinting by LC-MS analysis may comprise, for example, deconvoluting the charge state distribution of raw mass spectra and comparing the observed masses to masses from a theoretical digestion of a subject RNA. The resulting mass “fingerprint” may be utilized to assess the identity of a subject RNA.
- fingerprinting by LC-MS comprises comparing deconvoluted mass spectrum to a database of RNA transcripts using a computer and assessing the “identity” of the characterized transcript by a mathematical metric.
- a method may comprise sequencing 2',3'-hydroxylated oligonucleotides (e.g., arising from a subject RNA following contact with an endoribonuclease and an RNA end repair enzyme) by LC-MS/MS analysis.
- a sequencing method may comprise acquiring mass spectra (e.g., MS and/or MS/MS spectra) from an oligoribonucleotide comprising healed ends and comparing the acquired mass spectra with theoretical mass spectra from a theoretical digestion of a subject RNA with an endoribonuclease of selected specificity.
- Methods may comprise preparing oligoribonucleotides from RNA substrates using one or more endoribonucleases in absence of an RNA end repair enzyme.
- An endoribonuclease e.g., an endoribonuclease for methods including analysis of RNA by LC-MS/MS
- Endoribonucleases with average cleavage frequencies of every 6-12 nucleotides may provide better mapping coverage relative to endoribonucleases with average cleavage frequencies of every 4 nucleotides or less and/or relative to relative to endoribonucleases with average cleavage frequencies of once every 16 nucleotides or more.
- Example 3 and FIGURE 5 and FIGURE 6 show the theoretical mapping coverage of 1000 randomly selected human transcripts comparing oligonucleotides generated by endoribonucleases with cleavage frequencies of 1 out 2 nucleotides (RNase A), 1 out 4 nucleotides (RNase Tl, MCI -2015) and Cusativin-2021), 3 out of 16 nucleotides (Cusativin-2017 and MC1-2021), 1 out 8 nucleotides (hRNase 4), and 1 out 16 nucleotides (Colicin E5).
- RNase A oligonucleotides generated by endoribonucleases with cleavage frequencies of 1 out 2 nucleotides (RNase A), 1 out 4 nucleotides (RNase Tl, MCI -2015) and Cusativin-2021), 3 out of 16 nucleotides (Cusativin-2017 and MC1-2021), 1 out 8 nucleotides (hRNase 4), and 1 out 16 nu
- Endoribonucleases with specificity that result in RNA cleavage on average once every 8 nucleotides are capable of producing a higher theoretical sequence coverage of the human transcriptome, whether based on the total content of their cleavage products or on the content of their cleavage products comprising unique sequences (see FIGURE 6).
- a method may include separation or removal of a ribonuclease from reactants and/or products.
- a method may comprise contacting an RNA substrate with an endoribonuclease (e.g., RNase 4) to form one or more reaction products comprising at least one RNA substrate cleavage product and the endoribonuclease.
- an endoribonuclease e.g., RNase 4
- a method may further include separating the at least one RNA substrate cleavage product from the endoribonuclease.
- An endoribonuclease (and, optionally, an end repair enzyme, if included) may be immobilized on a magnetic bead and separating may comprise magnetically gathering the immobilized endoribonuclease (e.g., into a pellet), thereby allowing the at least one RNA substrate cleavage product to be removed.
- an endoribonuclease may be susceptible to a ribonuclease inhibitor and separating may comprise contacting the reaction products with an immobilized ribonuclease inhibitor to form immobilized complexes comprising the immobilized ribonuclease inhibitor, thereby allowing the at least one RNA substrate cleavage product to be removed.
- an endoribonuclease or a ribonuclease inhibitor may be immobilized on a surface (e.g., a column) or in a filter and reaction materials (e.g., reactions and/or reaction products) may be passed over the surface or through the filter. Additional information about separating an immobilized material from reaction products may be found in U.S. Patent Application 18/182,122 filed March 10, 2023, incorporated herein by reference.
- a ribonuclease inhibitor provided in an immobilized form may be utilized to capture and remove an endoribonuclease (e.g., a soluble endoribonuclease) from a reaction mixture or vessel. Removal of a soluble endoribonuclease from a reaction mixture or vessel may be used to stop the digestion reaction of the RNA substrate at desired time points. In some embodiments, removal of soluble endoribonuclease at designed time points may be used as a strategy to produce incomplete or partial cleavage of the RNA substrate thus resulting in oligonucleotides cleavage products with one or more uncut cleavage sites.
- an endoribonuclease e.g., a soluble endoribonuclease
- Such partially uncut oligonucleotides are longer in size than those that have been cut all at possible cleavage sites, and thus may increase mapping coverage at certain RNA substrate regions.
- removal of a soluble endoribonuclease from a reaction mixture or vessel may be used to prevent contamination of downstream analytical instrumentation (e.g., chromatographic columns) with active endoribonucleases.
- the removal of a soluble endoribonuclease from a reaction mixture or vessel may be used in automation protocols to facilitate and streamline methods of analysis.
- methods may include identifying and/or quantifying one or more components in an RNA sample or RNA substrate. For example, it may be desirable to accurately identify and/or quantify certain features of an RNA substrate, such as the presence of a 5 ’-cap structure, a 3 ’-poly(A) tail, or an RNA modification within an RNA.
- an RNA obtained by chemical synthesis or by enzymatic in vitro transcription (IVT) e.g., intended for use as a vaccine or another therapeutic application
- the presence and/or quantity of certain features e.g., cap structures, polyA tails
- oligoribonucleotide products generated by cleavage of RNA substrates may be quantified by mass spectrometric methods.
- oligoribonucleotides may be quantified using calibration curves constructed with authentic standards. Oligoribonucleotides may be labeled, according to some embodiments, at their 5’ or 3’ end with stable isotopes for relative or absolute quantification.
- Differential incorporation of stable isotopes may be used for multiplex quantitative analysis of oligonucleotides in which multiple oligonucleotide features (e.g., presence of a cap structure, one or more RNA modifications, presence of a polyA tail) may be analyzed simultaneously by means of isobaric tags.
- methods may include isotope labeling for quantitative analysis of oligonucleotides.
- Isotope labeling may be performed in one step, wherein an isotopically labeled nucleotide is incorporated at the 3’ end of an oligonucleotide by the action of an RNA polymerase.
- the isotopically labeled nucleotide is blocked at its 3’ position to prevent further extension of oligonucleotides beyond a single labeled nucleotide.
- nucleotides examples include 3’- deoxynucleotides and 2’,3’-dideoxynucleotides, including but not limited to 3'- deoxyadenosine (Cordycepin), 3'-deoxyinosine, 3 '-deoxy guanosine, 3'-deoxyuridine, 3'- deoxycytidine, 2’,3’-dideoxyadenosine, 2’, 3 ’-dideoxyinosine, 2’,3’-dideoxyguanosine, 2’,3’- dideoxyuridine, 2’, 3 ’-dideoxy cytidine, 3’-azido-3'-deoxyadenosine, 3’-azido-3'- deoxyinosine, 3 ’-azi do-3 '-deoxyguanosine, 3 ’-azido-3 '-deoxyuridine, 3’-azido-3'- deoxycytidine,
- nucleotide analogues including Carbovir, Ganciclovir, Lamivudine, and Clofarabine, among others.
- the stable isotope may be selected from one or more of Deuterium (d), Carbon-13 (13C), and Nitrogen- 15 (15N), in any combination (for instance, Cordycepin- 13 C5, Carbovir- 13 C,d2; Ganciclovir-d5; Lamivudine- 15N2,13C; and Clofarabine-13C,15N3).
- An isotopically labeled nucleotide may be incorporated at the 3’ end of an oligonucleotide by reaction of the corresponding nucleoside triphosphate with a polymerase.
- polymerases that may catalyze template independent addition of a desired nucleotide monophosphate (NMP) from the nucleoside triphosphate (NTP) to the 3’ end of RNA are (including recombinant and mutants therof): E. coli Poly (A) Polymerase, Yeast Poly (A) Polymerase, Poly(U) Polymerase, and DNA Polymerase 0 (Pol0).
- NMP nucleotide monophosphate
- NTP nucleoside triphosphate
- isotope labeling for quantitative analysis of oligonucleotides may be performed by incorporation of an isotopically labeled nucleotide at the 5’ or 3’ end of an oligonucleotide by the action of an RNA ligase.
- the 3’ end labeling may be performed in one step using a T4 RNA ligase and a pre-adenylated nucleotide.
- Examples of a preadenylated nucleotide include A(5’)pp(5’)Cp, wherein the cytidine comprises a 3’-phosphate and one or more isotope labels; A(5’)pp(5’)Gp, wherein the guanosine comprises a 3’- phosphate and one or more isotope labels; A(5’)pp(5’)Up, wherein the uridine comprises a 3’-phosphate and one or more isotope labels; A(5’)pp(5’)Ip, wherein the inosine comprises a 3’-phosphate and one or more isotope labels; and A(5’)pp(5’)Ap, wherein the 3’ terminal adenosine comprises a 3 ’-phosphate and one or more isotope labels.
- the 3’ end labeling may comprise (i) adenylating an isotopically labeled pCp, pGp, pip, pUp, or pAp using a Methanobacterium thermoautotrophicum (Mth) RNA ligase in the presence of ATP, (ii) inactivating the Mth RNA ligase, and (iii) ligating the adenylated isotopically labeled nucleotide using a T4 RNA ligase.
- Mth Methanobacterium thermoautotrophicum
- methods may include converting oligonucleotides to their 5’- phosphorylated form (for instance, by using T4 PNK in the presence of ATP) prior to ligation.
- 5’ end labeling may be performed by ligation of a 5’ adapter (e.g., 5-50 nucleotides in length) to an oligonucleotide, the adapter comprising one or more isotopically labeled nucleotides (e.g., adenosine, guanosine, uridine or cytidine labeled with one or more of Deuterium, Carbon- 13, and Nitrogen- 15).
- a 5’ adapter e.g., 5-50 nucleotides in length
- the adapter comprising one or more isotopically labeled nucleotides (e.g., adenosine, guanosine, uridine or cytidine labeled with one or more of Deuterium, Carbon- 13, and Nitrogen
- Ligation of a 5’ adapter to a target oligonucleotide may be performed by an RNA ligase such as T4 RNA ligase 2 and may be carried out in the presence of additives (e.g., PEG) and/or splint adapters (e.g., 5-50 nucleotides in length whose sequence is randomized or partially annealing to the 5’ adapter).
- additives e.g., PEG
- splint adapters e.g., 5-50 nucleotides in length whose sequence is randomized or partially annealing to the 5’ adapter.
- 3’ end labeling may be performed by ligation of a 3’ adapter (e.g., 5-50 nucleotides in length) to an oligonucleotide, the adapter comprising one or more isotopically labeled nucleotides (e.g., adenosine, guanosine, uridine or cytidine labeled with one or more of Deuterium, Carbon-13, and Nitrogen- 15) using, for example, a T4 RNA ligase or variant thereof.
- a 3’ adapter e.g., 5-50 nucleotides in length
- the adapter comprising one or more isotopically labeled nucleotides (e.g., adenosine, guanosine, uridine or cytidine labeled with one or more of Deuterium, Carbon-13, and Nitrogen- 15) using, for example, a T4 RNA ligase or variant thereof.
- a method for labeling an oligonucleotide for quantitative analysis may comprise incorporating a non-isotopically labeled nucleotide at the 5’ or 3’ end of the oligonucleotide, wherein the non-isotopically labeled nucleotide comprises a chemically reactive group that is capable of reacting with an isotopically labeled molecule (also referred as to a mass label).
- a non-isotopically labeled nucleotide may comprise, in some embodiments, a 3 ’-deoxynucleotide or a 2’, 3 ’-dideoxynucleotide, in each case, having a chemically reactive group at the 2’ or 3’ position.
- a chemically reactive group may be or comprise any of a carbonyl; a carboxyl; an active ester, e.g., a succinimidyl ester; a maleimide; an amine; a thiol; an alkyne, an azide; an alkyl halide; an isocyanate; an isothiocyanate; an iodoacetamide; a 2-thiopyridine; a 3-arylproprionitrile; a diazonium salt; an alkoxyamine; a hydrazine; a hydrazide; a phosphine, an alkene; a semicarbazone; an epoxy; a phosphonate; and a tetrazine.
- an active ester e.g., a succinimidyl ester
- a maleimide e.g., a succinimidyl ester
- a maleimide e.g., a succinimidy
- An isotopically labeled molecule may be selected from an amino acid (e.g., L-alanine-15N; L-alanine-13C3,15N; L-alanine- d4,15N; L-phenylalanine-15N; L-phenylalanine-13C9,15N; L-phenylalanine-d8,15N; L- proline-15N; L-proline-13C5,15N; L-proline-d7, 15N); an a-keto acid (e.g., a-ketobutyric acid-13C4; a-ketoisocaproic acid-13C; a-ketoisovaleric acid-13C5); a nucleotide (e.g., adenosine- 15N5; 2’-deoxyadenosine-15N5; uridine-15N2; thymidine-15N2; thymidine- 13C10,15N2); a bile acid (e.g.,
- the isotopically labeled molecule may comprise a chemically reactive group that is capable of chemoselectively reacting with the non-isotopically labeled nucleotide, once the latter is incorporated into the target oligonucleotide (for instance, a L- azidohomoalanine-13C4,15N2 reacts with a 3 ’-alkyne-3 '-deoxy adenosine by means of a Cu(I)-catalyzed azide-alkyne cycloaddition).
- chemoselective reactions include a reaction between an amine reactive group and an electrophile (e.g., an alkyl halide or an N- hydroxysuccinimide ester (NHS ester)); a reaction between a thiol reactive group and an iodoacetamide or a maleimide; a reaction between an azide and an alkyne (azide-alkyne cycloaddition or “Click Chemistry”).
- An azide-alkyne cycloaddition may be catalyzed by Cu(I) or strain-promoted to yield a 1,4-substituted triazole.
- TCO trans-cyclooctene
- Tz tetrazine
- an appropriate chemically reactive group is installed in the isotopically labeled molecule prior to its reaction with an oligonucleotide comprising non- isotopically labeled nucleotide.
- a chemically reactive group e.g., dibenzocyclooctyne (DBCO)
- DBCO dibenzocyclooctyne
- DBCO dibenzocyclooctyne
- DBCO dibenzocyclooctyne
- non-isotopically labeled nucleotides may be selected from one of 3’-azido-3'-deoxyadenosine, 3’-azido-3'- deoxyinosine, 3 ’-azi do-3 '-deoxyguanosine, 3 ’-azido-3 '-deoxyuridine, 3’-azido-3'- deoxycytidine, 3’-azido-2’,3’-dideoxy-adenosine, 3’-azido-2’,3’-dideoxyinosine, 3’-azido- 2’,3’-dideoxyguanosine, 3’-azido-2’,3’-dideoxyuridine, 3’-azido-2’,3’-dideoxycytidine, 2’- azido-2’, 3 ’-dideoxy-adenosine, 2’ -azido-2’, 3 ’-dideoxycy
- non-isotopically labeled nucleotides further include 3’-alkyne-3'- deoxyadenosine, 3 ’-alkyne-3 '-deoxyinosine, 3’-alkyne-3'-deoxyguanosine, 3’-alkyne-3'- deoxyuridine, 3 ’-alkyne-3 '-deoxy cytidine, 3’-alkyne-2’,3’-dideoxy-adenosine, 3’-alkyne- 2’,3’-dideoxyinosine, 3 ’-alkyne-2’,3’ -dideoxy guanosine, 3 ’-alkyne-2’,3’ -dideoxyuridine, 3’- alkyne-2’,3 ’ -dideoxy cytidine, 2’ -alkyne-2’,3 ’-dideoxy-adenosine, 2’ -alkyne-2’,3
- a method for labeling an oligonucleotide for quantitative analysis may comprise incorporating a chemically reactive group at the 5’ or 3’ end of the oligonucleotide to form an oligonucleotide having a reactive end and contacting (e.g., reacting) the oligonucleotide having a reactive end with an isotopically labeled molecule (FIGURE 25).
- a chemically reactive group may be installed at the 5’ end of an oligonucleotide by incubating the oligonucleotide with ATPyS and T4 PNK.
- Methods may comprise, for example, contacting an RNA substrate with an endoribonuclease and a PNK to produce oligoribonucleotides and incorporating a chemically reactive group in the oligoribonucleotides to form oligoribonucleotides having a 5’ or 3’ chemically reactive group, wherein the contacting and the incorporating may be performed as coupled reactions, for example, coupled reactions further including a phosphorylation reagent (e.g., ATPyS) in the reaction location.
- a phosphorylation reagent e.g., ATPyS
- RNA oligonucleotides may be desirable to purify prior to incubation with a PNK and the phosphorylation reagent (e.g., ATPyS) so that the chemically reactive group is installed at the 5’ end of the oligonucleotide in a separate step.
- phosphothiolated oligonucleotides may react with iodoacetamide- or maleimide-functionalized molecules (e.g., nucleotides) comprising isotope labels.
- a chemically reactive group may be installed at the 3’ end of an oligoribonucleotide, for example, by reaction with sodium (or potassium) periodate to generate a dialdehyde reactive group at the 3’ end nucleotide 2’, 3 ’-diol position to produce a dialdehyde oligonucleotide.
- a dialdehyde oligonucleotide may react with hydrazine-, hydroxylamine-, or amine-functionalized molecules comprising isotope labels, including tandem mass tags.
- a method may comprise installing a chemically reactive group to the 5’ end of a capped oligoribonucleotide wherein the cap structure comprises a 2’,3’-diol group. Converting a 5’ cap comprising a 2’,3’-diol to a dialdehyde may be concurrent (e.g., a coupled reaction) with the converting a 3’ end nucleotide 2’, 3 ’-diol to a dialdehyde within the same oligonucleotide.
- the 3’ end labeling may be blocked by incubating the oligonucleotide with a polymerase and a blocking 3’-deoxynucleotide (e.g., Cordycepin) or 2’,3’-dideoxynucleotide prior the generation of the reactive dialdehyde to produce selectively labeled oligoribonucleotides having a 5’ end cap comprising a 2’, 3 ’-diol.
- the methods may further comprise subsequent labeling by reaction with an appropriate molecular scaffold (e.g., amino acids, keto acids, fatty acids, diamines, amino alcohols, carbohydrates) comprising one or more combinations of heavy and light isotope atoms.
- an appropriate molecular scaffold e.g., amino acids, keto acids, fatty acids, diamines, amino alcohols, carbohydrates
- methods may include contacting an endoribonuclease with an RNase inhibitor in an amount sufficient to at least partially inhibit the activity of the endoribonuclease.
- RNase inhibitors may be useful to a number of biotechnological applications
- methods may include an RNase inhibitor to terminate (e.g., precisely terminate) an endoribonuclease reaction at a desired point (e.g., a desired time point, upon consumption of a desired amount of substrate, upon formation of a desired product, upon formation of products having desired size(s)).
- Methods may include an RNase inhibitor to achieve controlled partial digestion of an RNA substrate (for instance, to generate RNA oligonucleotides that are on average longer in length due to incomplete cleavage of every possible cutting site that is specific for a given endoribonuclease).
- Methods may include, for example, an RNase inhibitor to avoid or prevent overdigestion of an RNA substrate (i.e., cutting substrate at nonspecific or low-preferred sites) during additional sample processing steps in a multistep preparation workflow (such as, isotope labeling of the digested RNA).
- methods may include an RNase inhibitor to avoid or prevent over-digestion of an RNA substrate immediately prior to sample analysis (for instance, during idle instrument times, such as column equilibration or instrument failure).
- Methods may include an RNase inhibitor to avoid or prevent overdigestion of an RNA substrate upon storage of a digested sample in the presence of the endoribonuclease.
- Methods may include an RNase inhibitor, for example, to study enzymatic activity (such as in kinetic studies in enzymology).
- Methods may include an RNase inhibitor, for example, to reduce cytotoxicity of an RNase during protein expression and/or purification.
- RNA substrates Targeted site-specific cleavage of RNA substrates for analysis of RNA features.
- Targeted cleavage of an RNA substrate may allow, according to some embodiments, one or more RNA oligonucleotide products of interest to be isolated. Isolation of one or more RNA oligonucleotide products may be coupled with analysis of certain RNA features, such as a 5’ cap structure or nucleobase modification (e.g., 6-methyladenosine and 5-methycytidine). Methods for assaying the identity and efficiency of cap incorporation in kilobase-long synthetic mRNA transcripts may be used in connection with quality control and/or characterization of mRNA therapeutics and vaccines.
- RNA features such as a 5’ cap structure or nucleobase modification (e.g., 6-methyladenosine and 5-methycytidine).
- Cleavage of a pre-defined oligonucleotide segment (e.g., 5-30) from the 5’ end of the mRNA substrate using a custom designed DNAzyme or ribozyme, or cleavage of a DNA-RNA hybrid duplex with RNase H (Beverly et al., Anal. Bioanal. Chem. 2016, 408:5021-30) may include analysis by denaturing gel electrophoresis or LC-MS.
- RNase H is a particular type of endonuclease that hydrolyzes phosphodiester bonds of RNA, when hybridized to DNA. RNase H is known to remove RNA primers from the Okazaki fragments of the replicating DNA. In vitro, RNase H cleaves one or more nucleotides away from the 5’ and/or 3’ of the target site (DNA-RNA hybrid duplex), giving rise to multiple cleavage products differing from each other by one or more nucleotides in length (with low or no particular nucleotide specificity). Formation of multiple cleavage products of a few nucleotides difference in length complicates the analysis by mobility -based or mass spectrometry -based methods.
- RHase H methods may be limited by demands on DNA probe design.
- applications of RNase H methods may be limited by the design of the DNA probe needed to form the duplex DNA-RNA substrate for RNase H binding and activity while also restricting its cutting region and avoiding spurious (or other unwanted) cleavage of the RNA substrate (e.g., through careful design of single-stranded probes comprising DNA-RNA or DNA-2’-O-methyl-RNA chimeras, wherein 4-6 DNA nucleotides are placed at 3’ end of the probe).
- the present disclosure provides methods and compositions that, according to some embodiments, are free of such limitations.
- nucleotide-specific endoribonuclease such as a mono-, di-, or trinucleotide-specific endoribonuclease
- a nucleotide-specific endoribonuclease such as a mono-, di-, or trinucleotide-specific endoribonuclease
- a method may comprise contacting a DNA probe (e.g., 5 to 50 nucleotides long) and an RNA comprising sequence (e.g., a sequence having a 5’ cap or a sequence having one or more nucleobase modifications) at least partially complementary to the DNA probe to form a DNA-RNA hybrid and contacting the DNA-RNA hybrid and a nucleotide-specific endoribonuclease.
- a DNA probe e.g., 5 to 50 nucleotides long
- an RNA comprising sequence e.g., a sequence having a 5’ cap or a sequence having one or more nucleobase modifications
- RNA-RNA hybrid duplex comprising a double-stranded portion and a single-stranded portion
- DNA-RNA hybrid duplex comprising a double-stranded portion and a single-stranded portion
- contacting the DNA-RNA hybrid duplex with an enzyme composition comprising a single-strand-specific nucleotidespecific endoribonuclease and, optionally, an RNA end-repair enzyme, to form a cleaved DNA-RNA hybrid duplex and one or more single-stranded RNA fragments of the RNA substrate by cleavage of the RNA substrate at one or more sites within the single-stranded portion by the single-strand-specific nucleotide-specific endoribonuclease;
- an enzyme composition comprising a single-strand-specific nucleotidespecific endoribonuclease and, optionally, an RNA end-repair enzyme
- a DNA probe may comprise a sequence complementary to a sequence of an RNA substrate.
- a DNA probe may be shorter than an RNA substrate such that the duplex formed upon hybridization comprises RNA overhangs at the 5’ and/or 3’ ends.
- a DNA/RNA duplex in some embodiments, may comprise a DNA probe and an RNA substrate longer than the DNA probe, wherein the RNA substrate has single-stranded overhangs at both the 5’ and 3’ ends.
- the portion of the RNA substrate hybridized to the DNA probe may be protected from endoribonucleases that cleave only single stranded RNA while the single-stranded overhangs at one or both of the 5’ and 3’ ends would be subject to cleavage.
- hybridization of a DNA probe to a complementary sequence of an RNA substrate may be directed or guided by an accessory protein, for example, a prokaryotic argonaute (e.g., a bacterial argonaute, such as Thermus thermophilus argonaute), whose endonucleolytic activity has been inactivated but retained its ability to search for their guide-defined substrate).
- a prokaryotic argonaute e.g., a bacterial argonaute, such as Thermus thermophilus argonaute
- Including an accessory protein may hasten hybridization (e.g., more rapid seeking of RNA substrate at a rate near the limit of diffusion).
- Including an accessory protein may improve (e.g., overcome limitations on) substrate accessibility; and/or facilitate hybridization by reducing the entropic barrier to duplex formation.
- an accessory protein that selectively binds duplex substrates over singlestranded substrates may be used to stabilize or conceal the duplex segment.
- one or more chemical additives may be included in methods or compositions of the disclosure to increase the stability and/or specificity of the DNA-RNA duplex, including salts (e.g, NaCl, MgCh), crowding agents (e.g., polyethylene glycol (PEG), Ficoll, Dextran, etc.), duplex strengtheners (e g., betaine, proline, trehalose, proline, tetramethylammonium chloride, etc ), and ionic liquids (e.g., imidazolium, pyridinium, pyrrolidinium, and phosphonium cations; halides, tetrafluoroborate (BF4-), hexafluorophosphate (PF6-), and bis[(trifluor
- a high-salt washing buffer may be used to wash away unbound RNA (e.g., the one or more single-stranded RNA fragments of the RNA substrate cleaved from the RNA substrate) while retaining solid support-bound DNA-RNA duplexes.
- unbound RNA e.g., the one or more single-stranded RNA fragments of the RNA substrate cleaved from the RNA substrate
- a low-salt buffer or water
- treatment with a DNase e.g., DNase I
- the RNA oligonucleotide strand may be retained on the solid support for downstream applications.
- a solid support may include any solid (flexible or rigid) material onto which a DNA-RNA hybrid duplex may be captured.
- a solid support may include a matrix formed from an affinity capture domain or coated with the affinity capture domain.
- a solid support may be, for example, a bead including a magnetic bead, a column, a porous matrix, or a flat surface formed from for example, plastic or paper.
- a solid support may be biological, non-biological, organic, inorganic or a combination thereof.
- a solid support may have any desired form including, for example, particles, strands, precipitates, gels, sheets, tubings, spheres, containers, capillaries, cartridges, pads, slices, films, plates, slides, and/or have any desired shape, including, for example, a plane, a disc, a sphere, a ring, a torus, a cube, a cylinder, a cone, a vesica, a rod, and an ellipsoid.
- the surface of a solid support may comprise one or more materials including, for example, polymers, plastics, resins, polysaccharides, silica or silica-based materials, carbon, metals, inorganic glasses, and membranes.
- Example solid supports include glass surfaces (e.g., glass slides, microtiter plates) and suitable sensor elements. Sensor elements may include, for example, functionalized polymers (e.g., in the form of beads).
- Example solid supports also include chemically modified oxidic surfaces (e.g.
- silicon dioxide, tantalum pentoxide or titanium dioxide chemically modified metal surfaces (e.g., noble metal surfaces such as gold or silver, copper or aluminium surfaces), magnetic surfaces (e.g., Fe, Mn, Ni, Co, and their oxides), quantum dots (e.g., III-V (GaN, GaP, GaAs, InP, or InAs) or II- VI (ZnO, ZnS, CdS, CdSe, or CdTe) semiconductors), Ln-doped fluoride nanocrystals, and rare earth-doped oxidic nanomaterials.
- noble metal surfaces e.g., noble metal surfaces such as gold or silver, copper or aluminium surfaces
- magnetic surfaces e.g., Fe, Mn, Ni, Co, and their oxides
- quantum dots e.g., III-V (GaN, GaP, GaAs, InP, or InAs) or II- VI (ZnO, ZnS, CdS, CdSe
- a DNA probe may hybridize with and protect a portion of a single-strand RNA from cleavage by a single-strand specific endoribonuclease.
- Synthetic nucleic acids may hybridize a single-stranded RNA substrate and/or protect the singlestranded RNA substrate from endoribonuclease cleavage (e.g., like a DNA probe that hybridizes to such single-stranded RNA).
- Synthetic nucleic acids may include, for example, a peptide nucleic acid (PNA), a lock nucleic acid (LNA), an unlock nucleic acid (UNA), a bridge nucleic acid (BNA), a triazole nucleic acid, a morpholine nucleic acid, an amide- linked nucleic acid, a 1,5 anhydrohexitol nucleic acid (HNA), a cyclohexenyl nucleic acid (CeNA), an arabinose nucleic acid (ANA), a 2'-fluoro-arabinose nucleic acid (FANA), a ot-L- threofuranosyl nucleic acid (TNA), a 4’-thioribose nucleic acid (4’S-RNA), a 2'-fluoro-4’- thioarabinose nucleic acid (4’S-FANA), a 4’-selenoribose nucleic acid (4’Se-RNA),
- RNA probes comprising complete or partial 2'-OH nucleotides substitution with 2'-O-alkyl-nucleotides (e g., 2'-O-methyl-nucleotides), 2'-O-methoxyethyl- nucleotides (MOE), 2' -fluoro-nucleotides, 2'-O-allyl-nucleotides, 2'-0-alkylamine- nucleotides (e.g., 2'-O-ethylamine-nucleotides), 2'-O-cyanoethyl-nucleotides, 2'-O- acetalester-nucleotides, and 2'-azido-nucleotides.
- 2'-O-alkyl-nucleotides e g., 2'-O-methyl-nucleotides
- MOE methoxyethyl- nucleotides
- nucleic acids that may be used include DNA or RNA probes comprising partial or complete backbone modifications such phosphororothioate (replacement of one non-bridging oxygen atom of the phosphate group with a sulfur atom), phosphorodi thioate (both non-bridging oxygen atoms of the phosphate group are replaced with sulfur), alkyphosphonate (a non-bridging oxygen atom of the phosphate group has been replaced with alkyl group, e g. methyl), arylphosphonate (a non-bridging oxygen atom of the phosphate group has been replaced with aryl group, e.g.
- oligoribonucleotide products generated by cleavage of DNA- RNA hybrid duplexes may be quantified by mobility -based methods, such as gel- or capillary electrophoresis, or by mass spectrometric methods.
- oligoribonucleotide products are quantified using calibration curves constructed employed authentic standards.
- such oligoribonucleotides are labeled at their 5’ or 3’ end with stable isotopes for relative or absolute quantification as disclosed herein. Differential incorporation of stable isotopes may be used for multiplex quantitative analysis of oligonucleotide, enabling simultaneous analysis of multiple RNA features by means of isobaric tags.
- kits including an endoribonuclease and/or an RNA end repair enzyme may include an endoribonuclease having an amino acid sequence that (i) corresponds to an amino acid sequence of a first species (e.g., a vertebrate species (for example, Homo sapiens, Sus scrofa), a bacterial species (for example, Escherichia coll), a fungus species (for example, Aspergillus oryzae), a plant species (for example, Momordica charantia, Cucumis sativus), and an archaea species (for example, Pyrococcus furiosus)) or (ii) is a non-naturally occurring sequence.
- a first species e.g., a vertebrate species (for example, Homo sapiens, Sus scrofa), a bacterial species (for example, Escherichia coll), a fungus species (for example, Aspergillus oryzae), a plant species (for example, Mom
- a kit may include, for example, an RNA end repair enzyme having an amino acid sequence that (i) corresponds to an amino acid sequence of a species other than the first species (e.g., a bacterial species or a bacteriophage species) or (ii) is a non-naturally occurring sequence.
- a kit may include one or more additional enzymes (e.g., an RNA polymerase, an RNA ligase), a denaturing agent (e.g., urea, formamide, dimethylformamide, guanidinium thiocyanate, sodium salicylate, dimethyl sulfoxide, propylene glycol, poly(ethylene glycol), and cetyltrimethylammonium bromide), a buffering agent, and any combination thereof.
- additional enzymes e.g., an RNA polymerase, an RNA ligase
- a denaturing agent e.g., urea, formamide, dimethylformamide, guanidinium thiocyanate,
- An enzyme may be included in a storage buffer (e.g., comprising glycerol and a buffering agent).
- a kit may include a reaction buffer which may be in concentrated form, and the buffer may contain additives (e g. glycerol), salt (e.g. KC1), reducing agent, EDTA or detergents, among others.
- a kit may include an endoribonuclease having specificity for one or more dinucleotide combinations (e.g., cleavage after a specific nucleotide followed by a purine, cleavage after a specific nucleotide followed by a pyrimidine, cleavage after a purine followed by a specific nucleotide, and cleavage after a pyrimidine followed by a specific nucleotide).
- an endoribonuclease may have an average cleavage rate of once every 6-12 nucleotides.
- kits examples include hRNase 4, RNase Tl, RNase U2, RNase A, Colicin E5, MCI, Cusativin, Csxl, MazF, ChpB, MqsR, and YafO.
- a kit may comprise an RNA end repair enzyme, for example, comprising phosphodiesterase and phosphomonoesterase activities.
- a kit may include, according to some embodiments, a divalent metal, for example, a divalent metal selected from magnesium(II), manganese(II), cobalt(II), and nickel(II).
- a kit may comprise one or more rNTPs including, for example, one, two, three of all four of rATP, rUTP, rGTP and rCTP.
- a kit may further comprise one or more modified nucleotides.
- a kit may include an RNase inhibitor.
- a kit may include an affinity-labeled DNA probe.
- One or more components of a kit may be included in one container for a single step or coupled reaction, or one or more components may be contained in one container (e.g., a box, case), but separated (e.g., in one or more tubes) from other components for sequential use or parallel use.
- the contents of a kit may be formulated for use in a desired method or process.
- An enzyme for example, an enzyme included in a kit, may have any desired form (e.g., fluid, freeze-dried, and lyophilized forms).
- An enzyme composition and/or kit may comprise non-ionic, ionic e.g. anionic or zwitterionic surfactants and crowding agents.
- a kit may include instructions for using the components of the kit to practice a desired method (e.g., methods for analyzing an RNA substrate). Instructions may be recorded on a suitable recording medium. For example, instructions may be printed on a substrate, such as paper or plastic and/or displayed electronically. Instructions may be present in the kits as a package insert, in the labeling of the container of the kit or components thereof (i.e., associated with the packaging or sub-packaging). Instructions may be present as an electronic storage data file residing on a suitable computer readable storage medium (e.g. a CD-ROM, a flash drive). Instructions may be provided remotely using, for example, cloud or internet resources with a link or other access instructions provided in or with a kit.
- a suitable computer readable storage medium e.g. a CD-ROM, a flash drive
- Recombinant wild-type hRNase 4 enzyme was periplasmically expressed as a MBP fusion protein containing an N-terminal signal peptide (61.7 kDa) (see FIGURE 1) and stored in an ammonium acetate buffer [100 mM NFLOAC, pH 5.5, 0.5 mM DTT, 50% glycerol]. Expression of hRNase 4 was induced with 10 pM IPTG, and the protein was expressed from a periplasmic hRNase 4-containing plasmid in T7 Express lysY Competent E.
- elution buffer 20 mM Tris/Cl pH 7.5, 250 mM NaCl, 1 mM DTT, 10 mM maltose
- the protein was loaded onto a GraviTrap His column, equilibrated with GTH column buffer (20 mM Na2HPO4 pH 7.5, 0.5 M NaCl, 1 mM DTT, 20% glycerol).
- hRNase 4 was eluted in two 3 ml fractions with GTH elution buffer (20 mM Na 2 HPO 4 pH 7.5, 0.5 M NaCl, 1 mM DTT, 0.5 M imidazole, 20% glycerol).
- the enzyme-containing fraction was dialyzed into the hRNase 4 storage buffer (200 mM NH 4 OAC, pH 5.5 + 1 mM DTT), and after dialysis supplemented with an equal volume of 100% glycerol.
- EXAMPLE 2 Characterization of human hRNase 4 activity and cleavage specificity The activity and specificity of hRNase 4 cleavage was assessed utilizing a LC- MS/MS-based multiplexed cleavage assay. A defined pool of 13 synthetic oligonucleotides comprising all possible dinucleotide combinations (at least once) flanked by poly-adenosine sequences of varying lengths (Table 1) was prepared.
- RNA oligonucleotides were characterized by LC-MS/MS analysis. Liquid chromatographic separation of RNA oligonucleotides was performed on a Thermo Scientific Vanquish Horizon UHPLC equipped with a DNAPac RP Column (2.1 x 50 mm, 4 mm) at 70°C using a 25-minute gradient of solvent A (1% hexafluoroisopropanol (HFIP), 0.1% N,N- diisopropylethylamine (DIEA), 1 pM EDTA) and increasing solvent B (5 - 35%) (80% Methanol, 0.075% HFIP, 0.0375% DIEA, 1 pM EDTA) at a 0.3 mL/min flow rate.
- solvent A 1% hexafluoroisopropanol (HFIP), 0.1% N,N- diisopropylethylamine (DIEA), 1 pM EDTA
- solvent B 5 - 35%) (80% Methanol
- MS/MS data were collected on a Thermo Scientific Q Exactive Plus Orbitrap Mass Spectrometer. Intact mass analysis was performed (scan range: 480 - 2500 m/z) at a resolution of 70,000.
- Raw intact MS data was deconvoluted utilizing ProMass (Novatia LLC) and Avalon peak detection and integration algorithm (Thermo Fisher Scientific). To determine the relative abundance of each input oligonucleotide and cleavage product following incubation with hRNase 4, deconvoluted mass data was compared with the theoretical masses of each input oligonucleotide and cleavage product using a 10-ppm mass difference cutoff.
- FIGURE 2 A heatmap of the relative abundance of each input oligonucleotide within the oligonucleotide pool after incubation with hRNase 4 is shown in FIGURE 2.
- Oligonucleotides comprising a uridine followed by a purine (abbreviated as “R”) were cleaved by incubation with hRNase 4.
- the oligonucleotide comprising the ‘UC’ dinucleotide was not cleaved by hRNase 4 at any of the tested concentrations.
- none of the oligonucleotides comprising ‘CG’, ‘CC’, ‘CA’ motifs were cleaved by hRNase 4.
- the identities and quantities of each cleavage product were analyzed.
- the 5' cleavage products were analyzed with respect to the composition of their 3'-terminal nucleotide residue.
- 5’ cleavage products were grouped according to the composition of their 3'- terminal nucleotide residue, regardless of their phosphorylation status. As shown in FIGURE 3, digestion with hRNase 4 resulted in an accumulation of 5' cleavage products comprising a uridine at the 3'-terminus.
- Digestion with hRNase 4 also produced, albeit to a much lesser extent and dependent upon the enzyme concentration, some very low levels of 5' cleavage products comprising a cytidine at the 3'-terminus.
- the 3' cleavage products were analyzed with respect to the composition of their 5'-terminal nucleotide residue.
- 3’ cleavage products were grouped according to the composition of their 5'-terminal nucleotide residue, regardless of their phosphorylation status.
- hRNase 4 may preferentially cleave RNA between ‘UR’ dinucleotides (i.e., on the 3' side of uridine and on the 5' side of adenosine or guanosine) (Shapiro et al., 1986; Zhou and Strydom, 1993; Teryzan et al., 1999).
- EXAMPLE 3 Prediction of hRNase 4 cleavage products in mRNA transcripts
- FIGURE 5A-C The calculated sequence coverage for each mRNA transcript based on the predicted cleavage products formed by digestion with a given endoribonuclease is shown on FIGURE 5. Only cleavage products between 4 and 40 nucleotides in length were utilized for the calculation of RNA sequence coverage, as they are the most useful for MS/MS sequencing purposes. Exact duplicate cleavage products were also excluded, as they are not uniquely mappable to a given RNA sequence. As shown in FIGURE 5A-C (left panels), hRNase 4 produced the highest median total predicted sequence coverage among the tested endoribonuclease specificities across transcripts from species as diverse as human and E. coli.
- hRNase 4 also resulted in the highest median theoretical sequence coverage across all transcripts considering only cleavage products with a unique mass (i.e., excluding cleavage products with isomeric sequences) as shown in FIGURE 5A-C (right panels).
- cleavage frequency as a results effective variable and/or optimizing cleavage frequency range for mass spectrometry -based RNA sequencing have been confounded, in part, because distance between consecutive endoribonuclease cleavage sites may vary in different RNA sequences and/or may result in oligonucleotides that are too short for sequencing purposes. Furthermore, the cleavage efficiency at any given endoribonuclease cleavage site may be affected by local RNA secondary structures and presence of RNA modifications.
- Discrepancies may exist between a predicted cleavage frequency and the corresponding actual or observed mean oligonucleotide product length (e.g., hRNase 4 has a predicted cleavage frequency of 1 out 8 nucleotides, but was observed experimentally to produce oligonucleotide products having median length of 12 nt; RNase T1 has a predicted cleavage frequency of 1 out 4 nucleotides, but was observed experimentally to produce oligonucleotide products having median length of 8 nt).
- the data shown in FIGURE 5A-C may be used as a guide to select endoribonucleases that may improve sequencing and fingerprinting of mRNAs using LC-MS/MS techniques.
- cleavage frequencies within the range of once every 6-12 nucleotide residues provide the highest RNA sequence coverage as follows.
- cleavage motifs were as follows: cleavage after a given single nucleotide (‘N’); cleavage after a given single nucleotide followed a purine (‘NR’); cleavage after a given single nucleotide followed a pyrimidine (‘NY’); cleavage after a purine followed by a single nucleotide (‘RN’); cleavage after a pyrimidine followed by a single nucleotide (‘YN’); and cleavage between a single dinucleotide sequence (‘NN’).
- cleavage motifs were represented as follows: cleavage after a given single nucleotide followed a purine (‘NR)’ and cleavage after a given single nucleotide followed a pyrimidine (‘NY)’ were represented as ‘N(Y/R)’; cleavage after a purine followed by a single nucleotide (‘RN’) and cleavage after a pyrimidine followed by a single nucleotide (‘YN’) were represented as ‘(Y/R)N’.
- the expected cleavage frequency for endoribonucleases with ‘N’ specificity is 1 out of 4 nucleotide residues; the expected cleavage frequency for endoribonucleases with ‘N(Y/R)’ specificity is 1 out of 8 nucleotide residues; the expected cleavage frequency for endoribonucleases with ‘NN’ specificity is 1 out of 16 nucleotide residues.
- Examples of endoribonucleases with the ‘N(Y/R)’ specificity are those whose specificity comprise one of: a uridine followed by a pyrimidine; a cytidine followed by a pyrimidine; an adenosine followed by a pyrimidine; a guanosine followed by a pyrimidine; a uridine followed by a purine; a cytidine followed by a purine; an adenosine followed by a purine; or a guanosine followed by a purine.
- endoribonucleases with ‘(Y/R)N’ specificity also result in cleavage frequencies that are on average 1 out of 8 nucleotide residues.
- Examples of endoribonucleases with the ‘(Y/R)N’ specificity are those whose specificity comprise one of: a pyrimidine followed by a uridine; a pyrimidine followed by a cytidine; a pyrimidine followed by an adenosine; a pyrimidine followed by a guanosine; a purine followed by a uridine; a purine followed by a cytidine; a purine followed by an adenosine; or a purine followed by a guanosine.
- Nucleotide combinations that result in cutting frequencies within the range of once every 6-12 nucleotide residues may include cleavage sites comprising two or more nucleotides. Examples of desirable cleavage specificities may include:
- cleavage specificities ( £ N(Y/R) & (Y/R)N’) that result in similar cleavage frequencies (1 out of 8) as to that of hRNase 4 produced consistently the highest theoretical sequence coverage (>75%) across a plurality of transcripts.
- sequence coverage may vary, for example, where cleavage efficiency is a function of reaction conditions (e.g., buffer composition, pH, salt concentration, temperature, incubation time, etc.); enzyme specificity (e.g., some endonucleases show minor cleavage activities to other nucleotide combinations); enzyme quality (e.g., presence of contaminating nucleases or absence of essential/nonessential cofactors); and/or properties of the substrate RNA (e.g., the presence of secondary structure and/or RNA modifications).
- reaction conditions e.g., buffer composition, pH, salt concentration, temperature, incubation time, etc.
- enzyme specificity e.g., some endonucleases show minor cleavage activities to other nucleotide combinations
- enzyme quality e.g., presence of contaminating nucleases or absence of essential/nonessential cofactors
- properties of the substrate RNA e.g., the presence of secondary structure and/or RNA modifications.
- Endoribonucleases with cleavage specificity similar to the ‘UR’ of hRNase 4, such as ‘N(Y/R)’ or ‘(Y/R)N’, may be suitable for applications such as mass spectrometry -based sequencing and fingerprinting of mRNA and other RNA substrates.
- EXAMPLE 4 Digestion of an RNA oligonucleotide with T4 PNK and hRNase 4
- Digestion of RNA with certain endonucleases may produce a mixture of cleavage products comprising 2',3'-cyclic-phosphate and 3'-phosphate at the 3’ terminus. In many cases, this process depends on the enzyme concentration, the digestion buffer, and/or incubation time, in any combination. In some cases, the product mixture may also comprise 2',3'-hydroxylated species. In other cases, enzyme-independent hydrolytic opening of 2', 3 cyclic-phosphate may generate a mixture comprising 2',3'-cyclic-phosphate, 3'-phosphate, 2'- phosphate, 2',3'-hydroxy, 5’-phosphate, and/or 5’-hydroxy termini, in any combination.
- This example describes the digestion of a synthetic RNA oligonucleotide substrate by co-incubation with a mixture of T4 PNK (Phage T4 polynucleotide kinase) and hRNase 4.
- T4 PNK Phage T4 polynucleotide kinase
- RNA oligonucleotide substrate (Oligonucleotide #1: AAAAAAAAAAAUGAAAAAAAAAA)(SEQ ID NO:5) was incubated with a combination of 0.2 pL of T4 PNK and 1 pL of human hRNase 4 in 1 x NEBuffer 1 (10 mM Bis-Tris-Propane-HCl, 10 mM MgCE, 1 mM DTT, pH 7) for 30 minutes at 37°C.
- FIGURE 7 shows the overlaid UV chromatograms from hRNase 4 treatment of the Oligonucleotide #1 in the presence and absence of T4 PNK.
- a mixture of 5' cleavage products comprising 3 '-phosphorylated and 2',3'-cyclic-phosphorylated ends was observed upon hRNase 4 digestion in the absence of T4 PNK (see FIGURE 7, cleavage products #3 and #4).
- cleavage products #3 and #4 did not a single 5' cleavage product comprising a 2', 3 '-hydroxylated end was observed upon hRNase 4 digestion and addition of T4 PNK (see FIGURE 7, cleavage product #2).
- EXAMPLE 5 Use of hRNase 4/T4 PNK for sequencing and mass fingerprinting of an mRNA
- This example describes sequencing and fingerprinting of Firefly Luciferase messenger RNA (FLuc mRNA) by means of digestion with a combination of hRNase 4 and T4 PNK. For comparison purposes, digestion of FLuc mRNA was also performed with RNaseTl alone.
- a FLuc mRNA transcript was produced by in vitro transcription (IVT) utilizing the HiScribeTM T7 High Yield RNA Synthesis Kit (NEB, Catalog # E2040S).
- IVT in vitro transcription
- a linearized DNA template encoding the FLuc mRNA sequence (1 pg) under the control of T7 promoter was mixed with 10 mM rATP, 10 mM rGTP, 10 mM rCTP, 10 mM rUTP, and 2 pL of T7 RNA Polymerase in a 20 pL reaction volume. The resultant mixture was incubated at 37°C for 2 h.
- the reaction mixture was diluted to 100 pL in 1 x DNase 1 buffer (10 mM Tris-HCl, 2.5 mM MgC12, 0.5 mM CaC12, pH 7.6) and incubated with 2 pL of DNase 1 (NEB, Catalog # M0303S) for 15 minutes at 37°C.
- DNase 1 NEB, Catalog # M0303S
- the in vitro transcribed FLuc mRNA was purified utilizing an NEB Monarch RNA Cleanup Kit (500 pg) (NEB, Catalog # T2050L).
- the concentration of purified FLuc IVT mRNA was quantified utilizing a NanoDrop spectrophotometer (Thermo Fisher Scientific).
- Digestion using hRNase 4/T4 PNK was performed as illustrated in the example workflow in FIGURE 8 (left panel) and example composition of Table 2.
- 10 pg of purified FLuc IVT mRNA was mixed with 3 M Urea in 1 x NEBuffer 1 (10 mM Bis-Tris- Propane-HCl, 10 mM MgC12, 1 mM DTT, pH 7). The mixture was heated to 90°C for 10 minutes and cooled to room temperature. The mixture was diluted 3 -fold in a 1 x NEBuffer 1. Then 0.4 pL of T4 PNK (160 units) and 2 pL of purified recombinant human RNase4 was added to the reaction mixture (See Table 2).
- RNA digestion with hRNase 4/T4 PNK was performed for 2 h at 37°C with shaking at 300 rpm.
- a parallel digestion of FLuc IVT mRNA was performed using 1 pL of RNase T1 (FIGURE 8, right panel).
- the resultant digestion products of either workflow were filtered using a Millipore Ultrafree MC- GV spin column (0.22 um) at 13,400 rpm for 5 minutes.
- Table 2 Representative composition of an hRNase 4/T4 PNK digestion mixture Each sample was characterized by LC-MS/MS analysis as described in Example 2 with slight variations in the UHPLC gradient time and MS/MS parameters. A 25 -minute UHPLC gradient was applied, and MS/MS data was collected with a Thermo Scientific Q Exactive Plus Orbitrap Mass Spectrometer in Top-5 ddMS2 acquisition mode at a resolution of 35,000 with a normalized collision energy of 20% in negative ionization mode. Theoretical prediction of the cleavage products generated by digestion of FLuc IVT mRNA with either hRNase 4 or RNase T1 is shown in FIGURE 9.
- Complete digestion of FLuc IVT mRNA with hRNase 4 is predicted to produce a substantially higher sequence mapping (higher sequence coverage percentage) in comparison with RNaseTl.
- hRNase 4 is predicted to produce a high percentage of cleavage products with unique sequences
- RNase T1 is predicted to generate a high percentage of isomeric cleavage products.
- EXAMPLE 5A hRNase 4/PNK-based mass fingerprinting
- the accurate determination of mass-to-charge (m/z) ratio of oligonucleotide cleavage products serves as a unique identifier of a particular RNA and allows identification of the unknown RNAs in a sample by matching the resulting oligonucleotide masses with the theoretical oligonucleotide masses of RNAs in a database (such as NCBI RefSeq).
- Oligonucleotide mass fingerprinting was performed by deconvoluting raw intact MS data with ProMass software (Novatia LLC) and Avalon peak detection and integration algorithm (Thermo Fisher Scientific). Deconvoluted oligonucleotide masses detected in either the hRNase 4/T4 PNK condition or the RNaseTl condition were compared to a database of human transcripts (RefSeq) in which the FLuc IVT mRNA sequence was spiked in. The product of the proportion of total spectral intensity explained by theoretical masses and the proportion of theoretical oligonucleotides identified in the spectra from each transcript was calculated, hereafter referred to as the score of each transcript.
- EXAMPLE 5B hRNase 4/T4 PNK-based sequencing
- NASH Nucleic Acid Search Engine
- FIGURE 14 shows the FLuc mRNA sequence coverage after aggregating replicates of RNase T1 alone, hRNase 4/T4 PNK alone, or RNaseTl and hRNase 4/T4 PNK combined. Aggregation of triplicate FLuc IVT mRNA digests resulted in 75% sequence coverage with hRNase 4/T4 PNK condition and 55.8% with RNaseTl condition. Aggregation of FLuc IVT mRNA digests from combined hRNase 4/T4 PNK and RNaseTl experiments resulted in an improvement in sequence coverage to 89.3%. Parallel digestion using hRNase 4/T4 PNK and RNase T1 may be beneficial due to the complementary cleavage specificities presented by these endoribonucleases.
- hRNase 4/T4 PNK resulted in a distribution of longer cleavage products with a higher overall coverage of the FLuc mRNA sequence in comparison to RNase Tl.
- these data indicate that hRNase 4/T4 PNK offers a complementary alternative to conventional enzymatic tools such as RNase TL
- composition embodying an RNA end-repair enzymes such as T4 PNK may be effectively extended to other endoribonucleases.
- T4 PNK RNA end-repair enzymes
- MCI uridine-specific endoribonuclease that produces a mixture of 2',3'-cyclic-phosphate and 3 '-phosphate termini (Addepalli et al., 2015), for sequencing of FLuc IVT mRNA.
- FLuc mRNA was prepared as described in Example 5. Digestion with MC1/T4 PNK was performed as follows: 5 pg of purified FLuc IVT mRNA was mixed with 3 M Urea in 1 x NEBuffer 1 (10 mM Bis-Tris-Propane-HCl, 10 mM MgCh, 1 mM DTT, pH 7). The mixture was heated to 90°C for 10 minutes and cooled to room temperature. The mixture was diluted 3-fold in a 1 x NEBuffer 1. Then 0.2 L of T4 PNK (20000 units) and 1 pL of ribonuclease MCI were added to the reaction mixture. RNA digestion was performed for 1 h at 37°C with shaking at 300 rpm. Digestions were performed in triplicate.
- EXAMPLE 7 hRNase 4/T4-PNK-based sequencing and fingerprinting of a human erythropoietin (Epo) mRNA
- Epo mRNA was in vitro transcribed either using canonical UTP, ATP, GTP, and CTP (herein referred as to U Epo mRNA or EpoU), or using mo5UTP replacing UTP to result in a Epo mRNA with full substitution of uridine with 5- methoxyuridine (herein referred as to mo5U Epo mRNA or EpomoU), or using ml YTP replacing UTP to result in a Epo mRNA with full substitution of uridine with 1-methyl- pseudouridine (herein referred as to ml Y Epo mRNA or EpomlY).
- Epo mRNA has been utilized to demonstrate the therapeutic potential of IVT mRNAs in the treatment of anemia (Kariko et al., 2012, Thess et al., 2015).
- Epo mRNA fully modified with mo5U or mlY was utilized as a model system to assess the use of hRNase 4 in combination with T4 PNK to characterize putative therapeutic mRNAs.
- FIGURE 17 shows the score (as defined in Example 5) of each transcript relative to RNA length.
- the synthetic Epo mRNA sequence could be uniquely identified relative to all other human transcripts in each of hRNase 4/T4 PNK or RNase T1 conditions.
- RNase T1 data exhibited a substantially higher identification background relative to that of hRNase 4/T4 PNK.
- FIGURE 18 shows the overall sequence coverage obtained in each digestion experiment. Digestion with hRNase 4/T4 PNK resulted in consistently higher sequence coverage (73-87%) relative to digestion with RNase T1 (50-61%) across all Epo mRNA substrates tested.
- hRNase 4 may be useful for applications in the analysis of mRNA-based medicines (e.g., mRNA vaccines and therapeutics).
- EXAMPLE 8 hRNase 4/T4-PNK-based characterization of an Epo mRNA comprising a 5- prime m 7 GpppAm cap and 3-prime poly-adenosine (poly-A) tail]
- This example describes the use of hRNase 4/T4 PNK for characterizing Epo mRNAs comprising a 5' terminal m7GpppAm cap and a 3' terminal 120-nt poly-adenosine (Poly-A).
- the presence of 5' cap and 3' poly-A tail structures may confer or imprive the stability and/or translation of an IVT mRNA upon introduction into mammalian cells and organisms.
- FIGURE 19 shows a summary of the intensities of 5' cleavage products detected with and without a 5' m7GpppAm cap.
- oligonucleotides comprising deconvoluted masses equivalent to a triphosphorylated guanosine and two methyl groups were detected only in the capped Epo mRNA digests.
- EXAMPLE 9 hRNase 4/T4-PNK-based characterization of uridine-depleted variants of CLuc mRNA
- This example describes the LC-MS/MS analysis of three highly similar uridine- depleted variants of CLuc (Cypridina luciferase) mRNA using a composition of hRNase 4/T4 PNK. Depletion of the number of uridines in an RNA template has been utilized as a strategy to reduce the immunogenicity of IVT mRNAs without the need for introduction of chemical modifications (Vaidyanathan, et al., 2018).
- each mass fingerprint in the context of a human transcriptome database supplemented with each uridine-depleted CLuc mRNA sequence was assessed (see FIGURE 21).
- the correct uridine-depleted CLuc mRNA substrate was uniquely identified in each digestion experiment by LC-MS fingerprinting analysis.
- FIGURE 22 shows the sequence coverage of each uridine-depleted mRNA substrate in each hRNase 4/T4 PNK digest.
- the correct uridine-depleted CLuc mRNA sequence exhibited a substantially higher sequence coverage relative to the others.
- the vast majority of sequenced cleavage products detected could be confidently attributed to the correct uridine-depleted CLuc mRNA in each experiment (see FIGURE 23).
- hRNase 4/T4 PNK may be utilized to discriminate between highly similar nucleotide-depleted substrates by both sequencing and fingerprinting techniques.
- RNA oligonucleotides are prepared by digestion of an RNA-of-interest using hRNase 4/T4 PNK or a related composition as described above. This labeling method may be useful for and/or combined with multiplexing and relative quantification of one or more RNA oligonucleotides.
- a non -template directed RNA polymerase is utilized to add a single 3'-azido-3'- deoxy-nucleotidetriphosphate (NTP) to the 3'-end of one or more RNA oligonucleotides (see FIGURE 24A).
- NTP 3'-azido-3'- deoxy-nucleotidetriphosphate
- Examples of a non-template directed RNA polymerase include, E. coli poly(A) polymerase, yeast poly(A) polymerase, DNA polymerase 0 or any non-template directed RNA polymerase which accepts 3'-azido-3'-deoxy-NTPs as a substrate.
- DBCO Dibenzocyclooctyne
- Mass label conjugates may be utilized to produce “heavy” and “light” variants of differential isotopic composition for the comparison between two experimental conditions, for example for the analysis of 5’-capped and uncapped mRNAs.
- One sample e.g., RNA oligonucleotides generated by digestion of an uncapped mRNA
- a “light” version of a mass tag as described above.
- the other sample e.g., RNA oligonucleotides generated by digestion of a mRNA whose capping percentage is unknown
- RNA oligonucleotides generated by digestion of a mRNA whose capping percentage is unknown is labeled with a version of the same tag that comprises a “heavy” isotope.
- oligonucleotides from each sample co-elute as pairs of peaks and may be distinguished by the mass difference between the “heavy” and “light” isotope content.
- Identification and quantitation of the relevant 5’ end oligonucleotides are performed by a combination of intact MS and MS/MS fragment analyses. This approach may be extended to quantify other features of RNA substrates, such as RNA modification analysis.
- the mass tag concept is not restricted to isotopically labeled amino acids and could be extended as to other molecule classes (i.e., the group R in FIGURE 24B could comprise a keto acid, a lipid, a carbohydrate, etc.).
- DBCO dipeptide conjugates may be utilized to produce isobaric dipeptide (or polypeptide) mass tags.
- each tag has the same molecular mass, but the positions of the “heavy” and “light” isotopes are distributed within the peptide. This is achieved, for instance, by placing combinations of 13C and 15N heavy isotopes at different positions for each tag, so that the total number of isotopes is constant for all tags, thus creating distinct reporter and balancing regions.
- reporter and balancing peptide fragments may be distinguished, and the identity and quantity of each mass tag determined.
- Samples labeled with distinct peptide conjugate mass labels may be multiplexed and compared by LC-MS/MS analysis as described in Example 2. This approach permits multiplexed analysis of several RNA features in the same experiment (for instance, the simultaneous analysis of multiple RNA modifications in a given mRNA).
- methods and workflows may include fragmentation of peptide- RNA nucleoside conjugates utilizing high energy collision dissociation (HCD).
- HCD high energy collision dissociation
- Labeling RNA oligonucleotides with isobaric tags with distinct stable isotopic distributions is an example of an approach that may be utilized to enable multiplexing and relative quantification of RNA oligonucleotides between different experimental samples.
- Various molecular scaffolds may be utilized as isobaric tags, such as amino acids, keto acids, fatty acids, diamines, amino alcohols, carbohydrates, and dipeptides among others.
- Dipeptides may be cleavable by fragmentation modalities such as HCD, at the amide bond between the N-terminal and C-terminal amino acids to produce an amino acid reporter anion.
- the amino acid reporter anion fragment may comprise a defined set of heavy and light isotopes in its composition so that differential isotopic composition between the isobaric tag and the reporter fragment distinguish and relatively quantitate labeled oligonucleotide from distinct samples and/or experimental conditions (e g., capped and uncapped mRNAs),
- a 3'-azido-3'-deoxyadenosine nucleoside was independently conjugated to three amino acid/dipeptide labels by copper-free click chemistry reaction with DBCO-alanine, DBCO- alanine-phenylalanine or DBCO-alanine-Proline.
- FIGURES 24D-24F show the fragmentation pattern by HCD (with a normalized HCD of 30%) of the three 3 '-azido-3 '-deoxyadenosine derived peptide-nucleoside conjugates — including 3 '-azido-3 '-deoxyadenosine conjugated to DBCO-alanine (FIGURE 24D), to DBCO-alanine-phenylalanine (FIGURE 24E) and to DBCO-alanine-Proline (FIGUE 24F).
- the single amino acid conjugate formed by the reaction of 3 '-azido-3 '-deoxy adenosine with DBCO-alanine was used as a model for the fragmentation studies.
- the main anions detected upon fragmentation of the nucleosi de-alanine conjugate were derived from the adenosine nucleobase (denoted as A-base for simplicity), triazole-DBCO (denoted as DBCO for simplicity), and triazole-DBCO-alanine (denoted as DBCO-Ala for simplicity) fragments.
- the fragmentation of the dipeptide conjugates formed by the reaction of 3 '-azido-3 '-deoxyadenosine with DBCO-alanine-phenylalanine or with DBCO-alanine-Proline dipeptides is shown in FIGURE 24E and FIGUE 24F.
- fragmentation between the N- terminal and C-terminal amino acids for each of those dipeptide conjugates produced the intended amino acid reporter anion: phenylalanine (Phe) and proline (Pro) derived anions, respectively.
- Phe phenylalanine
- Pro proline
- a target RNA oligonucleotide (SEQ ID NO: 23; AAGAGAGAUAGAGAA) containing a single hRNase 4 cleavage site was added to the mixture and the reaction was incubated for 15 minutes at 37°C. Robust cleavage of the target oligonucleotide by hRNase 4 was observed in the absence of human placental RNase inhibitor (FIGURE 26). However, cleavage of the target oligonucleotide was inhibited following preincubation of hRNase 4 with human placental RNase inhibitor. hRNase 4 activity was inhibited by human placental RNase inhibitor in the presence of 1 M urea, which helps unfold substrate RNA secondary structures.
- EXAMPLE 12 Targeted substrate protection and hRNase 4 cleavage for mRNA capping analysis
- FIGURE 27 shows an example workflow illustrating this method. Briefly, a capped RNA substrate and a 5 ’-biotinylated DNA probe which is complementary to at least a portion of the capped RNA substrate (e.g., a segment of interest) are annealed to form an RNAZDNA duplex. The duplex and an enzyme composition (e.g., comprising hRNase 4 and optionally an RNA end repair enzyme) are combined to form a cleaved DNA-RNA hybrid duplex and one or more single-stranded RNA fragments of the RNA substrate.
- an enzyme composition e.g., comprising hRNase 4 and optionally an RNA end repair enzyme
- the cleaved DNA-RNA hybrid duplex may then be affinity purified (e.g., using streptavidin magnetic beads).
- the remaining portion of the RNA substrate included in the purified DNA-RNA hybrid duplex may be eluted, for example, by contacting the purified DNA-RNA hybrid duplex with a DNase I.
- a 30-nt DNA probe sequence (SEQ ID NO: 24; /Biotin/GAGCTTCTGCAAAAAGAACAAGCAAGCCCT) was hybridized to the 5 ’-terminal sequence of a 5' m7GpppAm capped EPO mRNA (as illustrated in FIGURE 28A) utilizing a touchdown hybridization approach (heating to 95 °C for 2 minutes, followed by slowly cooling to 22°C at 0.1°C/s) in lx NEBuffer 1 supplemented with 3 M urea. The hybridized mRNA solution was diluted to 1 M urea in NEBuffer 1 and a composition of hRNase 4/T4 PNK was added. The mixture was incubated at 37°C for 1.5 hours.
- RNA Digestion was stopped by addition of human placental RNase inhibitor.
- the resulting duplex comprising the 5'-biotinylated DNA probe and the corresponding hybridized RNA oligonucleotide was purified utilizing streptavidin magnetic beads.
- the hybridized RNA was eluted by incubation with DNase I at 37°C.
- the isolated RNA oligonucleotide was characterized by LC-MS/MS. Comparative experiments were performed in the absence of either the DNA probe or hRNase 4/T4 PNK.
- FIGURE 28B shows a single prominent chromatographic peak (FIGURE 28B, top panel), whose identity was confirmed by mass spectrometry analysis (FIGURE 28C). Notably, no corresponding chromatographic peak was detected in purifications performed in the absence of the DNA probe or in the absence of hRNase 4/T4 PNK (FIGURE 28B, middle and lower panels).
- FIGURE 28C shows the deconvoluted mass of the RNA 35mer oligonucleotide corresponding to the 5 ’-terminal segment of EPO mRNA (SEQ ID NO: 25;
- the isolated RNA oligonucleotide product comprises the sequence that is “protected” by the DNA probe plus any subsequent ribonucleotides at the 3’ end preceding an hRNase 4 ‘UR’ cutting site (indicated by the arrow in FIGURE 28A).
- a DNA/RNA duplex may comprise a DNA probe and an RNA substrate longer than the DNA probe, wherein the RNA substrate has single-stranded overhangs at both the 5’ and 3’ ends.
- Protecting an internal oligoribonucleotide segment of a given RNA substrate by hybridization with a DNA probe that leaves 5’ and 3’ overhangs may limit cleavage by hRNase 4 to the 5’ and 3’ UR sites that are nearest to the DNA probe-RNA substrate duplex.
- results shown in FIGURE 28B and FIGURE 28C demonstrate that protection of a portion of an RNA substrate from the action of a single-stranded nucleotide-specific endoribonuclease (e g., hRNase 4) by hybridization with a complementary affinity tagged DNA probe (e.g., shorter than the RNA substrate) can be used to selectively isolate and analyze features of the protected portion, such as a cap structure and/or any modifications that are present.
- the methods illustrated in this example may include contacting an RNA substrate with multiple biotinylated-DNA probes targeting different portions of the RNA substrate, permitting simultaneous analysis of such portions.
- Disclosed methods may be applied to RNA modification analysis, such as RNA identification, locating an RNA within a sequence, assessing RNA stoichiometry, detecting RNA presence, permanence, and/or dynamics (i.e., installation and removal), and detecting co-existence of RNA modifications.
- RNA modification analysis such as RNA identification, locating an RNA within a sequence, assessing RNA stoichiometry, detecting RNA presence, permanence, and/or dynamics (i.e., installation and removal), and detecting co-existence of RNA modifications.
- EXAMPLE 13 DNA probe-directed RNA cleavage with site-specific ribonucleases
- Targeted cleavage of an RNA substrate with site-specific ribonucleases may be directed by hybridization of the RNA substrate with complementary DNA probe(s).
- an RNA substrate may be annealed to one or more DNA probes, each complementary to one or more sequences of interest within the RNA substrate sequence, forming one or more DNA/RNA duplex segments. Segments comprising DNA/RNA duplexes may be cleaved upstream and/or downstream of the double-stranded region, for example, using a ribonuclease capable of cleaving single-stranded RNA, optionally in the presence of a repair enzyme.
- the RNA cleavage may occur at nucleotide positions within the internal edges of the double-stranded region (presumably due to local conformation fluctuations, also referred as to breathing or fraying, that may form transient single-stranded regions; or by the action the ribonuclease itself). Resultant products of ribonuclease digestion, whether cleaved DNA/RNA duplex segments or cleaved single-stranded segments, or both, may be assessed by LC-MS/MS analysis.
- Optional steps of isolation of the cleaved DNA/RNA duplex segments may be employed prior to LC-MS/MS analysis, such as capturing the cleaved DNA/RNA duplex segments by means of affinity enrichment followed by selective elution of the corresponding RNA strand.
- FIGURE 29 shows example results of an assay to assess the cleavage of an RNA substrate (SEQ ID NO: 31;
- GGGACUCUAACUAUGUCAAUCGCCGUGAUGUAAUUAUCGC hybridized to a DNA probe (SEQ ID NO: 32; ATTGACATAGTTAGAGTCCC).
- the first 20 nucleotides of a 40mer RNA sequence were hybridized with a complementary 20mer DNA probe to form at least partially duplex DNA/RNA polynucleotides, which were then cleaved using one of several ribonucleases.
- Hybridization of the RNA substrate and DNA probe was conducted by heating to 80°C for 2 minutes, followed by slowly cooling at 0.1°C/s to 22°C to form a DNA/RNA duplex solution.
- the hybridized DNA/RNA duplex solution was diluted in a reaction buffer appropriate for each ribonuclease.
- TABLE 4 shows the ribonucleases and reaction buffers used in this example.
- the ribonucleases included three sitespecific RNases (hRNase 4, MCI and RNase Tl) and two ribonucleases with poor specificity (RNase A and RNase Ir).
- a composition of 1 pL of a 5-fold dilution series of each RNase was added to the reaction mixture.
- T4 PNK (1 :75 dilution) (400,000U/mL) was added to the hRNase 4, MCI and RNase Tl reaction mixtures. Each mixture was heated for 30 minutes at 37°C. The resultant mixture was characterized by LC-MS/MS. Comparative experiments were performed in the absence of either the DNA probe or ribonuclease.
- RNA cleavage products were classified as “protected products” if they were associated with limited cleavage of the hybridized DNA/RNA duplex, including products with 3'- overhangs, 3 '-overhangs in combination with 5'-reaccessed ends (less than 4nt internal to the hybrid duplex), blunt ends and 3 '-reaccessed ends (less than 4nt internal to the hybrid duplex) with respect to the hybridized DNA/RNA duplex.
- Products classified as “internal products” refer to those RNA cleavage products resulting from one or more cleavage events within the DNA/R A duplex (greater than 4nt internally from either end of the hybrid duplex).
- Products classified as “external products” refer to those cleavage products resulting from cleavage events only in the unhybridized (single stranded) regions of the RNA substrate.
- the tested RNases produced different levels of protected products following DNA/RNA hybridization and cleavage.
- the plurality of protected products in digests with sitespecific RNases exhibited well-defined 3 '-overhangs terminating at the respective recognition site immediately following the hybridized DNA/RNA duplex ( Figure 29).
- the less-specific RNases yielded a mixture of products with variable 3 '-recessed ends and 3 '-overhangs relative to the DNA hybridized region.
- the sequences of the most abundant protected products and their positions are shown in the right panel (light gray).
- RNA cleavage with site-specific RNases can be directed to predictable and well-defined sites by DNA probe hybridization.
- Data shown demonstrate that the 5' and 3' heterogenicity in the resulting cleavage products is a function of ribonuclease utilized in the protection assay. At higher ribonuclease concentrations, the product heterogenicity may increase at different levels for different ribonucleases.
- EXAMPLE 14 Varying DNA probe lengths in DNA probe-directed RNA cleavage with site specific RNases
- FIGURE 30A shows an example experiment to examine how cleavage of a DNA probe- hybridized RNA substrate changes with varying DNA probes.
- the 40mer RNA of Example 13 was hybridized with one of a sequential series of DNA probes ranging from 22 to 30 nucleotides in length.
- the sequences of the DNA probes were designed to be complementary to the 5' end of the RNA substrate sequence (TABLE 5.).
- FIGURE 30B shows the cleavage pattern of each site-specific ribonuclease at and around the varying DNA/RNA duplex regions. All site-specific RNases produced cleavage products primarily with 3 '-overhangs or blunt ends terminating at the first respective recognition site immediately downstream of the DNA probe hybridized region (FIGURE 3 OB, bottom panel). With the increase in probe length, a noticeably transition to the formation of longer protected products was observed, which were a result of cleavage at subsequent recognition sites downstream of the DNA probe hybridized region. Upon DNA probe hybridization overlapping a given recognition site a mixture of cleavage products from cleavage at successive downstream and upstream recognition sites was observed.
- EXAMPLE 15 Comparing the use of hRNase 4 and RNase H in ribonuclease protection assays
- RNase H may be used for analysis of synthetic mRNA 5' cap incorporation and cleaves RNA substrate at adjacent phosphodiester bonds 5' and 3' to the RNA hybridized to the 5' deoxynucleotide of an DNA-RNA chimera probe.
- RNase H may also cleave one or more nucleotides away from 5' and 3 ' of the target site, giving rise to multiple cleavage products that differ by one or more nucleotides, thereby complicating product analysis by electrophoresis or LC-MS/MS.
- extensive optimization of the DNA-RNA chimera probe is usually required to achieve uniform cleavage of an RNA substrate at predetermined sites.
- This example shows a comparison between the specificities of hRNase 4 and RNase H to cleave an RNA substrate in vitro at a pre-defined site at or near a double-stranded segment generated by hybridizing a complementary probe to the 5’ end of the target RNA substrate.
- a synthetic FLuc mRNA transcript (Seq ID NO:26) were first capped with a Faustovirus Capping Enzyme (FCE; NEB Cat # M2081S) and then methylated at the 2'-0 position of the first nucleotide adjacent to the cap structure with a mRNA Cap 2'- O-Methyltransferase (NEB Cat # M0366S), according to manufacturer’s instructions, to produce a capped FLuc mRNA that comprise a 5'-terminal m 7 GpppGm (Cap 1) structure and a series of intermediate products, including a 5'-terminal diphosphate (pp or 2p), a 5'-terminal triphosphate (ppp or 3p), a 5 '-terminal guanosine triphosphate (Gppp), and 5 '-terminal m 7 GpppG (Cap 0).
- FCE Faustovirus Capping Enzyme
- NEB Cat # M0366S mRNA Cap 2'-
- This 5 '-end m 7 GpppGm modified FLuc mRNA was hybridized either with a 25-nt biotinylated DNA probe (SEQ ID NO: 51) for cleavage conditions with hRNase 4, or with a 25-nt desthiobiotinylated DNA-RNA chimeric probe (DNA/RNA probe; SEQ ID NO: 55), wherein the first 6 positions at the 5’ end are deoxyribonucleotides and the remaining 19 positions are ribonucleotides; deoxyribonucleotides are denoted by a preceding ‘d’) for cleavage conditions with RNase H.
- Hybridization was performed utilizing a touchdown approach (by heating to 80°C followed by a ramp-down at 0.1°C/s to 22°C) in absence of a denaturant (e.g., such as urea).
- the DNA/FLuc mRNA hybrid was cleaved utilizing a composition of 1 pL of hRNase 4 (10-fold dilution) and 0.4 pL of T4 PNK (400,000U/mL) in NEB buffer rl .1 at 37°C for 1 hour. Each digestion was stopped by addition of 1 pL of human placental RNase inhibitor.
- the cleaved DNA/RNA duplex segment was affinity purified utilizing streptavidin magnetic beads and eluted by heating to 80°C in water.
- DNA-RNA chimera/FLuc mRNA hybrid was cleaved utilizing 1 pL of Thermostable RNase H (NEB Cat # M0523S), then affinity purified utilizing streptavidin magnetic beads and eluted by heating to 80°C in water as above.
- Digestion of the FLuc mRNA substrate hybridized to the DNA probe is expected to produce a 28mer RNA cleavage product terminating at the first hRNase 4 recognition site following (i.e., at a more 3’ position on the mRNA substrate than) the DNAZRNA hybridized region of the FLuc mRNA.
- Digestion of the same FLuc mRNA substrate/DNA probe duplex with RNase H is expected to yield a 24mer RNA cleavage product (FIGURE 31 A).
- Two main chromatographic peaks were observed in the hRNase 4/T4 PNK reaction after cleavage and DNA/RNA duplex enrichment (FIGURE 3 IB, top panel).
- probe-directed hRNase 4 cleavage resulted in a substantially more specific formation of the expected cleavage product, indicating that protecting a sequence segment of interest with a complementary probe and contacting the RNA substrate with a site-specific ribonuclease that preferentially cuts single-stranded RNA is a superior strategy to generate predetermined oligonucleotides for LC-MS/MS analysis.
- FIGURE 31C shows the deconvoluted mass spectrum of the 28mer cleavage product peak of Figure 3 IB (hRNase 4 condition).
- FIGURE 3 ID shows the deconvoluted mass spectrum of the 24mer cleavage product peak of Figure 3 IB (RNase H condition).
- Both mass spectra are consistent with the formation of a series of intermediary modifications of the RNA 5' end resulting from incomplete enzymatic capping and/or methylation of the FLuc mRNA (described above). These include 5'-pp (diphosphate or 2p or pp), 5 '-ppp (triphosphate or 3p or ppp), 5'-Gppp, and Cap 1 (5'-m 7 GpppGm).
- FIGURE 3 IF A relative quantitation of the products with different 5'-end modifications, and by extension, a relative measure of FLuc mRNA capping efficiency is shown in FIGURE 3 IF.
- the aggregate results of hRNase 4 and RNaseH largely agree with each other.
- the Capl product represented the majority of species identified (66.7 ⁇ 1.2% and 64.2 ⁇ 1.4%, respectively).
- the relative abundance of intermediary products also exhibited good concordance between the hRNase 4 and RNaseH conditions, with the di phosphorylated product (24.0 i- 2.2% and 21.9 ⁇ 0.4%, respectively) exhibiting the highest relative abundance among all intermediary products.
- ribonucleases such as hRNase 4, that feature a high degree of specificity for cleaving a single-stranded RNA substrate at defined sites (e.g., specificity for cleaving an RNA at one or more dinucleotide, trinucleotide or tetranucleotide combinations) may be useful to characterize the extent of mRNA 5' end capping.
- RNase 4 results are comparable to RNase H results but with differences.
- RNase H requires a double-stranded RNA target such that the DNA probe must be chimeric requiring both a DNA and an RNA portion and even with such a probe, RNaseH cleavage products vary in size by -2 to +2 nucleotides around the recognition site, complicating fragment analysis.
- Disclosed methods may be used to analyze aspects of the protected RNA segment, including modifications present in the segment, such as a cap structure.
- disclosed methods may include contacting an RNA substrate with (a) a probe targeting an internal segment of the RNA substrate of interest, (b) a probe targeting a 3' end segment of the RNA substrate of interest or (c) multiple probes (e.g., multiple biotinylated-DNA probes) targeting different portions of the RNA substrate, permitting simultaneous analysis of such portions.
- Disclosed methods may be applied to RNA modification analysis, such as RNA identification, locating an RNA within a sequence, assessing RNA stoichiometry, detecting RNA presence, permanence, and/or dynamics (i.e., installation and removal), and detecting co-existence of RNA modifications.
- RNA modification analysis such as RNA identification, locating an RNA within a sequence, assessing RNA stoichiometry, detecting RNA presence, permanence, and/or dynamics (i.e., installation and removal), and detecting co-existence of RNA modifications.
- RNA 5' end cap analysis methods including methods of analyzing cap structures present in mRNAs (e.g., 7-methyl guanosine triphosphate cap), small nuclear RNAs (e g., 2,2,7-trimethylguanosine triphosphate cap or y-monomethyl phosphate cap) and mitochondrial RNA (e.g., NAD cap), may include contacting a sample and a 5'— >3' exoribonuclease that is capable of hydrolyzing 5 '-monophosphate RNA in the 5' to 3' direction and that does not hydrolyze 5 '-capped RNA.
- mRNAs e.g., 7-methyl guanosine triphosphate cap
- small nuclear RNAs e g., 2,2,7-trimethylguanosine triphosphate cap or y-monomethyl phosphate cap
- mitochondrial RNA e.g., NAD cap
- RNA-phosphate-dependent exonucleases examples include XRN-1 (NEB, Cat # M0338S) and Terminator (LGC, Biosearch Technologies, Cat # TER51020).
- XRN-1 NEB, Cat # M0338S
- LGC Biosearch Technologies, Cat # TER51020
- Treatment of an RNA sample with XRN-1 or Terminator, prior or after contacting the RNA substrate (or fragments thereof) with a site-specific endoribonuclease such as hRNase 4 may reduce the complexity of the sample and facilitate data analysis of RNA 5 '-capped ends.
- RNA segments originated from abortive transcription initiation events may be analyzed for detecting RNA segments originated from abortive transcription initiation events; or RNA segments originated from premature transcription termination events (e.g., resulting in truncated RNAs); or RNA segments originated from cis-primed transcription extension and/or self-primed transcription extension that result in a transcript pool comprising longer than encoded RNA products, often forming regions of double-stranded RNA that may trigger innate immune response and affect the action of RNA vaccines and therapeutics.
- Disclosed methods may be used in absence of a protection probe so that it permits that RNA segments comprising double-stranded regions or other structural regions (e.g., hairpins, stem loops, pseudoknots, etc.) that may form within an RNA substrate of interest (intramolecular structures) or may form among multiple RNAs or DNA/RNA hybrids (intermolecular structures), including triple or quadruple helices, to be either directly analyzed by LC-MS/MS or undergo a process of purification to isolate the double-stranded and/or other structural regions prior to LC-MS/MS analysis.
- RNA segments comprising double-stranded regions or other structural regions (e.g., hairpins, stem loops, pseudoknots, etc.) that may form within an RNA substrate of interest (intramolecular structures) or may form among multiple RNAs or DNA/RNA hybrids (intermolecular structures), including triple or quadruple helices, to be either directly analyzed by LC-MS/MS or
- the structured region(s) will (in analogy to a region protected by an exogenous probe) direct the ribonuclease to cleave the RNA only at accessible sites (e g., ribonuclease specific sites located at unstructured or poorly structured regions), thus enabling analyses of such structured region(s).
- Disclosed methods may be used to determine RNA structured regions implicated in certain biological functions, such as translation modulators, splicing regulatory elements, microRNA processing sites, riboswitches, IRES, and others.
- a cap analysis method may be performed in the absence of a protection probe, for example, where site-specific ribonuclease access to the subject RNA segment is limited by a protein (e.g., an RNA binding protein or an antibody), by an RNA ligand (e.g., cellular metabolites such as adenosylcobalamin, lysine, glycine, flavin mononucleotide, etc.
- a protein e.g., an RNA binding protein or an antibody
- an RNA ligand e.g., cellular metabolites such as adenosylcobalamin, lysine, glycine, flavin mononucleotide, etc.
- RNA cleavage by a site-specific ribonuclease in the surrounding region(s) of the bound element may be used to determine the identity of the sequence to which this element is bound, the relative occupancy, and/or binding dissociation properties.
- chemical crosslinking may be performed prior to contacting the RNA with a site-specific ribonuclease (e.g., hRNase 4).
- the RNA may be crosslinked intramolecularly and/or intramolecularly (e.g., to complementary probe, to an RNA binding protein, to a ribosome, to an aptamer, to another DNA or RNA strand).
- the RNA may be cleaved by a site-specific ribonuclease in the surrounding region(s) of the crosslinked region for isolation and analyses.
- EXAMPLE 16 The effect of varying the RNA 5 '-end sequence in probe-directed RNA cleavage with hRNase 4 or RNase H
- disclosed methods may be applied to analysis of mRNA capping in transcripts with distinct 5'-UTR sequences and with sequences comprising full replacement of uridine sites (U) with 1 -methyl pseudouridine sites (m l P or m'Y) as illustrated in this example.
- Synthetic mRNA transcripts were constructed by replacing the FLuc mRNA 5'-UTR coding sequence with the coding sequence of the 5'-UTR of interest. TABLE 6 lists the mRNA 5'-UTR sequences used in this example.
- Transcripts were produced by in vitro transcription (IVT) utilizing the HiScribeTM T7 High Yield RNA Synthesis Kit (NEB, Catalog # E2040S) utilizing either canonical UTP or m 1 TTP replacing UTP to result in full substitution of uridine with 1 -methyl -pseudouridine as described in Examples 5 and 7.
- Each mRNA was capped with FCE and methylated with a 2 ’-O-m ethyltransferase to produce a 5' terminal m 7 GpppGm (Cap 1) containing product and a series of intermediary 5' end capped and uncapped products as described in Example 15.
- Each mRNA was hybridized to a corresponding biotinylated DNA probe (TABLE 7) utilizing the touchdown hybridization approach as described in Example 15.
- Each hybridized DNA/RNA duplex was digested with either hRNase 4/T4 PNK or RNase H, affinity purified and characterized by LC-MS/MS as described in Example 15. TABLE 7.
- FIGURE 32A shows the extent of capping observed for each mRNA substrate. A relative quantitation of the cleavage products with different 5' end modifications was determined for each of the hRNase 4 and RNaseH conditions as described in Example 15. A range of capping efficiencies (30-90%) was detected across U or m' -modified mRNAs with distinct 5'-UTRs.
- each mRNA 5' end product (comprising a 5'-pp, or a 5'-ppp, or a 5'-Gppp, or a Cap 0, or a Cap 1) was consistent in both the hRNase 4 and RNaseH conditions.
- FIGURE 32B shows the length distribution of m 7 GpppGm (Cap 1) capped cleavage products observed in each of the hRNase 4 and RNaseH conditions.
- Cap 1 products of identical length were detected for both U and rn'T'-modified variants of a particular mRNA 5'-UTR.
- the hRNase 4 condition yielded predominantly Cap 1 products of a discrete length, resulting from cleavage of each mRNA substrate at an hRNase 4 recognition site downstream of the DNA probe hybridized region in each target mRNA.
- a higher heterogeneity regarding the length distribution of the Cap 1 products for each individual mRNA sub state was observed in the RNase H condition.
- cleavage with RNase H resulted in primarily one product
- mRNA substrates for instance, mRNA comprising HBB and pRNA21 5'-UTRs
- cleavage with RNase H resulted in mixtures of products varying by one or more nucleotides in length.
- ribonucleases such as hRNase 4 are useful to assess mRNA 5' end capping across mRNAs with a diversity of 5' end sequences.
- the ability of hRNase 4 to produce specific cleavage at defined sites surrounding a protected portion of an RNA substrate may yield a more defined set of RNA cleavage products, which may advantageously simplify data analysis and facilitate the assessment of aspects of interest, such as the presence of a cap, a tail and/or modifications (e.g., endogenous modifications, synthetically incorporated modifications, or RNA modifications resulting from damage caused by irradiation, exposure to hazardous chemicals, temperature or pH fluctuation, among others).
- EXAMPLE 17 DNA probe-directed selective purification of RNA poly(A) tails with hRNase 4
- disclosed workflows may be applied to selectively cleaving and purifying an mRNA 3' end poly(A) tail utilizing a site-specific ribonuclease.
- care may be taken to select a site-specific ribonuclease (e.g., hRNase 4) that is not adenosine specific (i.e., does not cleave a 3 ',5 '-phosphodiester bond with specificity for adenosine at the main anchoring site B 1).
- a workflow may include, for example, contacting an mRNA 3' end poly(A) tail with a DNA probe to form a duplex product in which the DNA probe is annealed to at least a portion of the poly(A) tail and one or more additional nucleotides immediately upstream of poly(A) tail sequence.
- FIGURE 33 shows a representative example of a deconvoluted LC-MS/MS spectrum of oligonucleotide cleavage products comprising regions of the mRNA poly(A) tail that were isolated from the capped and polyadenylated synthetic EPO mRNA of EXAMPLE 8.
- the in vitro synthesize EPO mRNA whose coding sequence encoded a 120-nt poly(A) tail, was annealed to a biotinylated DNA probe an (SEQ ID NO: 59 /5BiosG/TTTTT/iBiodT/TTTTT/iBiodT/TTTTTTTTTVN), contacted with hRNase 4 and T4 PNK, and then purified with the use of magnetic streptavidin beads. After elution from the beads, a 126-nt product cleavage product comprising a distribution of poly(A) tail-related oligonucleotides sequences differing in mass from each other by a single adenosine residue (FIGURE 33) was identified.
- EXAMPLE 18 Integrated analysis of an mRNA sequence, 5 ’-cap and Poly(A) tailing using hRNase 4/T4 PNK
- FIGURE 34 shows an example of an analytical workflow for integrated mRNA analysis enabled by the use of a site-specific ribonuclease such as hRNase 4, optionally in combination with a repair enzyme such as T4 PNK.
- a site-specific ribonuclease such as hRNase 4
- a repair enzyme such as T4 PNK
- an mRNA comprising a 5' end cap, an internal RNA sequence, and a 3' end poly(A) tail, contacts a 5’ targeting DNA probe complementary to the 5’ end of the mRNA and a 3’ DNA targeting probe complementary to the 5’ end of the mRNA to form annealed polynucleotide products comprising in a 5’ to 3’ direction relative to the mRNA, a double- stranded first DNA probe/5’ mRNA segment, an internal single-stranded sequence, and a double-stranded second DNA probe/3’ mRNA segment.
- Annealed polynucleotides may be contacted with hRNase 4, optionally in combination with T4 PNK, to selectively generate cleavage products (oligonucleotides). Cleavage products may be isolated and analyzed by LC-MS/MS. For example, a capped and poly-adenylated mRNA may be annealed to two complementary DNA probes, each optionally comprising one or more affinity tags.
- One DNA probe may be complementary to the 5' end sequence of the mRNA and the other DNA probe may be an oligo-dT probe complementary to the poly(A) tail, forming DNA/mRNA hybrids at the mRNA 5' end and 3' end poly(A) tail, respectively, so that these regions are protected from cleavage by the ribonuclease.
- a digestion reaction may comprise contacting annealed polynucleotides with a composition including a site-specific ribonuclease (e.g., hRNase 4) and a repair enzyme (e g , T4 PNK) to form products site-specifically cleaved at accessible regions of the mRNA substrate (e.g., the internal single-stranded sequence) to form cleavage products comprising single-stranded fragments of the mRNA, the doublestranded first DNA probe/5’ mRNA segment, and the double-stranded second DNA probe/3’ mRNA segment.
- a site-specific ribonuclease e.g., hRNase 4
- a repair enzyme e.g , T4 PNK
- single-stranded fragments and cleaved DNA/mRNA duplex segments may be separated from each other (e.g., by affinity purification) wherein one fraction comprises single-stranded fragments and another comprises duplex segments.
- One or more fractions may be subject to LC-MS/MS analysis.
- single-stranded fragments e.g., in a supernatant fraction of an affinity purification
- LC-MS/MS e.g., in an eluted fraction of an affinity purification
- LC-MS/MS to characterize the 5' cap and 3' poly(A) tail.
- the combined analysis of cleavage products, resulting from the internal mRNA sequence fraction and from the cap and poly(A) tail fraction, can be used for an integrated characterization of an mRNA substrate of interest (e.g., characterization of the RNA sequence and any modifications) from a single experimental preparation within the same workflow.
- a mRNA comprising a 5' end cap, an internal RNA sequence, and a 3' end poly(A) tail may be annealed to one or more DNA probes, each complementary to at least a portion of the mRNA (e.g., each independently designed to be complementary to 5' and/or 3' end regions of the mRNA substrate).
- a workflow may additionally comprise one or more DNA probes targeting selected regions of the internal mRNA sequence.
- a DNA probe targeting the mRNA 3' end may comprise an oligo-dT DNA probe.
- a DNA probe targeting an mRNA 3' end may comprise one or more additional nucleotides complementary to the mRNA sequence immediately upstream of the oligo-dT DNA probe binding site.
- Each of the DNA probes targeting the 5’ or 3' end regions of the mRNA may independently of each other comprise an affinity group (e.g., a biotin) so that the DNA-RNA duplex can be isolated by affinity purification at any stage of the workflow.
- the affinity group may be attached to the 5' end of the DNA probe, to the 3' end of the DNA probe, or internally to the 5' end of the DNA probe (e.g., the affinity group may be covalently linked to the base an internal nucleotide, for instance to the 5-position of thymine).
- multiple affinity groups may be used to increase purification efficiency and/or allow purification using multiple affinity matrices.
- the DNA does not comprise an affinity group and the isolation of the DNA- RNA duplex may be performed by size exclusion chromatography (SEC), gel filtration chromatography, anion-exchange chromatography (AEX), hydrophilic interaction liquid chromatography (HILIC), reversed-phase liquid chromatography (RP-LC), ion-paring reversed-phase liquid chromatography (IP -RP-LC), solid-phase reversible immobilization (e.g., SPRI paramagnetic beads), or any combination thereof (also referred as multimodal or mixed-mode chromatography).
- a DNA-RNA duplex may be analyzed directly without isolation or purification.
- an mRNA comprising a 5' end cap, an internal RNA sequence, and a 3' end poly(A) tail, and annealed to one or more targeting DNA probes may be contacted with hRNase 4, optionally in combination with T4 PNK, to selectively generate cleavage products (oligonucleotides) by cutting in regions of the mRNA sequence that comprise accessible ribonuclease cleavage sites (e.g., sites not protected by the targeting DNA probe).
- Oligonucleotide cleavage products comprising a mixture of DNA-RNA duplex (DNA-RNA hybrids) region(s) and single-stranded RNA regions may be either directly analyzed by LC- MS/MS or undergo a process of purification to isolate the cleaved DNA-RNA duplex region(s) prior to LC-MS/MS analysis.
- Cleaved DNA-RNA duplex region(s) may be isolated by affinity purification; by purification using of one or more of the chromatographic or immobilization modes described above; or both.
- the remaining supernatant enriched in single-stranded internal mRNA regions may be either directly analyzed by LC-MS/MS or undergo purification using of one or more of the chromatographic or immobilization modes described above, and then be analyzed by LC- MS/MS.
- the isolated DNA-RNA duplex region(s) may be eluted as appropriated according to the method of purification chosen and then analyzed by LC-MS/MS.
Abstract
The present disclosure relates, according to some embodiments, to compositions and analysis of RNA (e.g., dephosphorylated oligoribonucleotides) including, for example, natural and/or synthetic RNAs. A composition may comprise, for example, an endoribonuclease having an amino acid sequence that (i) corresponds to an amino acid sequence of a first species (e.g., Homo sapiens, Escherichia coli, Aspergillus oryzae, Momordica charanlia. Pyrococcus furiosus, Cucumis sativus, and Sus scrojd) or (ii) is a non-naturally occurring sequence; and/or an RNA end repair enzyme having an amino acid sequence that (i) corresponds to an amino acid sequence of a species other than the first species (e.g., a bacterial species or a bacteriophage species) or (ii) is a non-naturally occurring sequence.
Description
COMPOSITIONS AND ANALYSIS OF DEPHOSPHORYLATED OLIGORIBONUCLEOTIDE S
CROSS-REFERENCE TO RELATED APPLICATIONS
This application is a continuation-in-part of U.S. Patent Application 18/182,122 filed March 10, 2023. This application also claims priority to U.S. Provisional Application No. 63/329,262 filed April 8, 2022. The contents of all of the above are hereby incorporated in their entirety by reference.
SEQUENCE LISTING STATEMENT
This disclosure includes a Sequence Listing submitted electronically in .xml format under the file name “NEB-450. xml” created on March 30, 2023, and having a size of 64.8 KB. This Sequence Listing is incorporated herein in its entirety by this reference.
BACKGROUND
Increasing the breadth of analytical approaches to assess the purity, quantity, sequence, and identity of synthetic RNAs (e g., RNA produced by in vitro transcription (IVT)), including synthetic RNAs for use in therapeutics and/or vaccines, is an important area of technical development. Liquid chromatography-tandem mass spectrometry (LC-MS/MS) may be used to directly sequence and to verify the position and identity of RNA modifications (e.g., 5' cap structures, nucleobase and ribose modifications) within native and synthetic RNAs. To characterize full-length RNA substrates by LC-MS/MS, RNA samples may be digested with one or more endoribonuclease(s) of selected specificity. Coupling endoribonuclease digestion to LC-MS/MS analysis presents several key challenges. First, RNA structure may interfere with the activity of an endoribonuclease. Second, incubation of RNA with one or more endoribonucleases often produces a mixture that may contain 2',3'-cyclic-phosphorylated, 3'- phosphorylated, and 2',3'-hydroxylated oligoribonucleotide products, convoluting the analysis of the resultant oligonucleotide mixture and reducing the intensity of the signal for individual oligonucleotides in LC-MS/MS experiments. Third, the limited availability to endoribonucleases with discrete recognition and cleavage specificities that have been fully characterized, are robust, and are presented with enough purity to generate reliable and reproducible digestion products for downstream mass spectrometry analysis.
SUMMARY
Accordingly, needs have arisen for improved analytical methods to assess the purity, quantity, sequence, and identity of synthetic RNAs (e g., RNA-based therapeutics and vaccines). The present disclosure relates to methods and compositions for analyzing polyribonucleotides including natural and/or synthetic RNAs. A composition may comprise, for example, an endoribonuclease having an amino acid sequence that (i) corresponds to an amino acid sequence of a first species (e.g., a vertebrate species (for example, Homo sapiens, Sus scrofa), a bacterial species (for example, Escherichia coll), a fungus species (for example, Aspergillus oryzae), a plant species (for example, Momordica charantia, Cucumis sativus), and an archaea species (for example, Pyrococcus furiosus} or (ii) is a non-naturally occurring sequence; and/or an RNA end repair enzyme having an amino acid sequence that (i) corresponds to an amino acid sequence of a species other than the first species (e.g., a bacterial species or a bacteriophage species) or (ii) is a non-naturally occurring sequence. In some embodiments, an endoribonuclease may have an amino acid sequence that corresponds to an amino acid sequence of a vertebrate (e.g., mammalian) species. An endoribonuclease may have specificity selected from (1) cleavage after a specific nucleotide followed by a purine, (2) cleavage after a specific nucleotide followed by a pyrimidine, (3) cleavage after a purine followed by a specific nucleotide, and (4) cleavage after a pyrimidine followed by a specific nucleotide, according to some embodiments. An endoribonuclease may have an average cleavage rate of once every 6-12 nucleotides. Example endoribonucleases include hRNase 4, RNase Tl, RNase U2, RNase A, Colicin E5, MCI, Cusativin, Csxl, MazF, ChpB, MqsR, and YafO. An end repair enzyme may comprise phosphodiesterase and phosphomonoesterase activities. An end repair enzyme may comprise a polynucleotide kinase-phosphatase. Example end repair enzymes include T4 polynucleotide kinase-phosphatases and Cth polynucleotide kinase-phosphatases. In some embodiments, a composition may further comprise one or more of a denaturing agent, a buffering agent, and an RNA substrate. Optionally, a composition may comprise one or more oligoribonucleotides, which may be, for example, substrates and/or products of an endoribonuclease and/or an end repair enzyme.
The present disclosure relates to methods for analyzing polyribonucleotides. For example, methods may comprise (a) contacting an RNA substrate and an endoribonuclease to produce oligoribonucleotides comprising one or more unrepaired ends that are 2',3'-cyclic- phosphorylated, 3'-phosphorylated and/or 2'-phosphorylated; (b) contacting an RNA end repair enzyme and the oligoribonucleotides comprising the unrepaired ends to produce oligoribonucleotides comprising one or more repaired ends that are 2’, 3’ -hydroxylated, and (c)
optionally, characterizing the oligoribonucleotides comprising one or more repaired ends that are 2’,3’-hydroxylated. Endoribonucleases used in methods of the disclosure may have specificity selected from (1) cleavage after a specific nucleotide followed by a purine, (2) cleavage after a specific nucleotide followed by a pyrimidine, (3) cleavage after a purine followed by a specific nucleotide, and (4) cleavage after a pyrimidine followed by a specific nucleotide and/or an average cleavage rate of the RNA substrate of once every 6-12 nucleotides (e g., once every 8 nucleotides). Example endoribonucleases used in methods of the disclosure may include hRNase 4, RNase Tl, RNase U2, RNase A, Colicin E5, MCI, Cusativin, Csxl, MazF, ChpB, MqsR, and YafO. End repair enzymes used in methods of the disclosure may comprise phosphodiesterase and phosphomonoesterase activities. An end repair enzyme may comprise a polynucleotide kinase-phosphatase. Example end repair enzymes include T4 polynucleotide kinase-phosphatases and Cth polynucleotide kinase-phosphatases. According to some embodiments, a methods may be performed as a coupled reaction. For example, (a) contacting and the (b) contacting occur in a single location or occur in separate locations that are in fluid communication with one another. In some embodiments, an RNA substrate may be a denatured RNA substrate. For example, contacting an RNA substrate and an endoribonuclease may further comprise denaturing the RNA substrate to form a denatured RNA substrate and contacting the denatured RNA substrate and the endoribonuclease. Denaturing an RNA substrate may include, for example, contacting the RNA substrate with a denaturing agent (e.g., urea, formamide, dimethylformamide, guanidinium thiocyanate, sodium salicylate, dimethyl sulfoxide, propylene glycol, poly(ethylene glycol), and cetyltrimethylammonium bromide) at a salt concentration of up to 50 mM or incubating the RNA substrate at a temperature of 65°C or higher at a salt concentration of up to 50 mM. In some embodiments, contacting an RNA substrate and an endoribonuclease may further comprise denaturing the RNA substrate to form a denatured RNA substrate, diluting the denatured RNA substrate for form a diluted denatured RNA substrate, and contacting the diluted denatured RNA substrate and the endoribonuclease. According to some embodiments, (a) contacting and/or (b) contacting may further comprise contacting a buffering agent. In some embodiments, contacting an RNA end repair enzyme and the oligoribonucleotides may further comprise separating the oligoribonucleotides comprising one or more unrepaired ends from the endoribonuclease to form separated oligoribonucleotides comprising one or more unrepaired ends. In some embodiments, the (c) characterizing may comprise characterizing the oligoribonucleotides comprising one or more repaired ends by one or more of gel
electrophoresis, capillary electrophoresis, liquid chromatography, and mass spectrometry. In some embodiments, the (c) characterizing may comprise separating the oligoribonucleotides from one or more of the RNA substrate, the endoribonuclease, the RNA end repair enzyme to form separated oligoribonucleotides and characterizing the separated oligoribonucleotides. For example, characterizing may include fractionating the oligoribonucleotides comprising one or more repaired ends that are 2 ’,3 ’-hydroxylated by liquid chromatography to form fractionated oligoribonucleotides and ionizing the fractionated oligoribonucleotides for mass spectrometry.
According to some embodiments, an RNA substrate (e.g., an RNA substrate included in a method of the disclosure) may comprise in vitro transcribed RNA, chemically synthesized RNA, viral RNA, prokaryotic RNA, eukaryotic RNA, archaeal RNA, or combinations thereof. An RNA substrate (e.g., an RNA substrate included in a method of the disclosure), in some embodiments, may comprise tissue culture RNA, biopsy RNA, feces RNA, urine RNA, lymph RNA, blood RNA, mucous RNA, sputum RNA, skin RNA, saliva RNA, wound RNA, sweat RNA, semen RNA, shoot RNA, root RNA, seed RNA, sewage RNA, sludge RNA, soil RNA, or any combination thereof. RNA substrates that may be analyzed by methods of the disclosure may comprise any RNA substrate including, for example, messenger RNA (mRNA), ribosomal RNA (rRNA), transfer RNA (tRNA), small RNA (sRNA), microRNA (miRNA), long noncoding RNA (IncRNA), circular RNA (circRNA), aptamer RNA, antisense RNA, silencing RNA (siRNA), guide RNA (gRNA), or any combination thereof.
In some embodiments, the present disclosure relates to kits for analysis of polyribonucleotides including natural and/or synthetic RNA. For example, a kit may include (a) an endoribonuclease having an amino acid sequence that (i) corresponds to an amino acid sequence of a first species (e.g., a vertebrate species (for example, Homo sapiens, Sus scrofa), a bacterial species (for example, Escherichia co ), a fungus species (for example, Aspergillus oryzae), a plant species (for example, Momordica charantia, Cucumis sativus), and an archaea species (for example, Pyrococcus furiosusy) or (ii) is a non-naturally occurring sequence; (b) an RNA end repair enzyme having an amino acid sequence that (i) corresponds to an amino acid sequence of a species other than the first species (e.g., a bacterial species or a bacteriophage species) or (ii) is a non-naturally occurring sequence; (c) optionally, a denaturing agent (e.g., urea, formamide, dimethylformamide, guanidinium thiocyanate, sodium salicylate, dimethyl sulfoxide, propylene glycol, poly(ethylene glycol), and cetyltrimethylammonium bromide); (d) optionally, a buffering agent; and (e) optionally, an affinity-labeled DNA probe.
In some embodiments, an endoribonuclease included in a kit may have an amino acid sequence that corresponds to an amino acid sequence of a vertebrate (e.g., mammalian) species.
An endoribonuclease included in a kit may have specificity selected from (1) cleavage after a specific nucleotide followed by a purine, (2) cleavage after a specific nucleotide followed by a pyrimidine, (3) cleavage after a purine followed by a specific nucleotide, and (4) cleavage after a pyrimidine followed by a specific nucleotide, according to some embodiments. An endoribonuclease included in a kit may have an average cleavage rate of once every 6-12 nucleotides. Example endoribonucleases that may be included in a kit include hRNase 4, RNase Tl, RNase U2, RNase A, Colicin E5, MCI, Cusativin, Csxl, MazF, ChpB, MqsR, and YafO. An end repair enzyme included in a kit may comprise phosphodiesterase and phosphomonoesterase activities. An end repair enzyme included in a kit may comprise a polynucleotide kinase-phosphatase. Example end repair enzymes that may be included in a kit include T4 polynucleotide kinase-phosphatases and Cth polynucleotide kinasephosphatases. A kit, in some embodiments, may further include a divalent metal, wherein the divalent metal is optionally selected from magnesium(II), manganese(II), cobalt(II), and nickel(II). In some embodiments, a kit may further include one or more additional enzymes, wherein the one or more additional enzymes are optionally selected from RNA polymerases and RNA ligases.
The present disclosure relates, according to some embodiments, to methods of targeting specific portions of an RNA substrate for analysis. For example, a method may include (a) contacting an RNA substrate and one or more DNA probes, each DNA probe shorter than the RNA substrate and each comprising an affinity domain, wherein at least a portion of the RNA substrate and at least a portion of the DNA probe(s) are complementary, to form a DNA-RNA hybrid duplex comprising a double-stranded portion and at least one single-stranded overhang; (b) contacting the DNA-RNA hybrid duplex with an enzyme composition, the enzyme composition comprising a single-strand-specific nucleotide-specific endoribonuclease (e g., hRNase 4, RNase Tl, RNase U2, RNase A, Colicin E5, MCI, Cusativin, Csxl, MazF, ChpB, MqsR, and YafO) and, optionally, an RNA end -repair enzyme, to form a cleaved DNA-RNA hybrid duplex and a released RNA fragment of the RNA substrate by cleavage of the RNA substrate at a site within the single-stranded overhang by the single-strand-specific nucleotidespecific endoribonuclease; (c) contacting the cleaved DNA-RNA hybrid duplex and a solid support comprising an affinity capture domain to form an affinity capture complex comprising the affinity domain bound to the affinity capture domain; (d) optionally, washing the affinity
capture complex to remove unbound materials, if any; and (e) optionally, dissociating the cleaved DNA-RNA hybrid duplex to release the remaining portion of the RNA substrate from the one or more DNA probes. In some embodiments, a DNA-RNA hybrid duplex may comprise the double-stranded portion and two single- stranded overhangs. For example, a DNA-RNA hybrid duplex may comprise the double- stranded portion and a 5’ single-stranded RNA overhang and a 3’ single-stranded RNA overhang. End repair enzymes used in methods of the disclosure may comprise phosphodiesterase and phosphomonoesterase activities. An end repair enzyme may comprise a polynucleotide kinase-phosphatase. Example end repair enzymes include T4 polynucleotide kinase-phosphatases and Cth polynucleotide kinasephosphatases.
The present disclosure relates, in some embodiments to methods for quantitatively analyzing an RNA. Methods may include, for example, (a) contacting an RNA substrate, an enzyme, and an isotopically labeled nucleoside triphosphate to form a labeled RNA substrate, wherein the enzyme is optionally selected from an RNA polymerase and an RNA ligase; (b) contacting the labeled RNA substrate and an endoribonuclease to produce oligoribonucleotides comprising one or more unrepaired ends that are 2',3'-cyclic-phosphorylated, 3'-phosphorylated and/or 2'-phosphorylated; and (c) contacting an RNA end repair enzyme and the oligoribonucleotides comprising the unrepaired ends to produce oligoribonucleotides comprising one or more repaired ends that are 2’, 3 ’-hydroxylated. In some embodiments, a method may comprise contacting an RNA substrate, an enzyme, and a nucleoside triphosphate comprising a chemically reactive group to form a chemically reactive RNA substrate, wherein the enzyme is optionally selected from an RNA polymerase and an RNA ligase; (b) contacting the chemically reactive RNA substrate and a molecule reactive with the chemically reactive RNA substrate to form a labeled RNA substrate, wherein the molecule comprises one or more stable isotopics; (c) contacting the labeled RNA substrate and an endoribonuclease to produce oligoribonucleotides comprising one or more unrepaired ends that are 2',3'-cyclic- phosphorylated, 3 '-phosphorylated and/or 2'-phosphorylated; and (d) contacting an RNA end repair enzyme and the oligoribonucleotides comprising the unrepaired ends to produce oligoribonucleotides comprising one or more repaired ends that are 2’, 3 ’-hydroxylated.
The present disclosure further provides methods for analyzing an RNA substrate. A method may comprise, in some embodiments, contacting an RNA substrate and one or more RNA substrate binding molecules (e.g., a “DNA probe, an RNA probe, a synthetic nucleic acid probe, an RNA binding protein, an antibody, an RNA ligand) to form RNA substrate-RNA
binding molecule complexes, each complex comprising a bound portion and at least one singlestranded portion, wherein each bound portion comprises at least a portion of the RNA substrate and an RNA binding molecule. For example, a method may comprise contacting an RNA substrate with two species of DNA probe, a first species complementary to a more 5’ portion of the RNA substrate and a second species complementary to a more 3’ portion of the RNA substrate to form RNA substrate-RNA binding molecule complexes, each complex comprising (e g., in a 5 ’ to 3 ’ direction) a first bound portion a single-stranded portion and a second bound portion. A first bound portion may comprise the first DNA probe and the more 5’ portion of the RNA substrate. A second bound portion may comprise the second DNA probe and the more 3’ portion of the RNA substrate. A single-stranded portion may comprise a portion of the RNA substrate linking the more 5’ portion and the more 3’ portion. According to some embodiments, a method may further comprise contacting the RNA substrate-RNA binding molecule complexes with an enzyme composition (e.g., an enzyme composition comprising a single-strand-specific nucleotide-specific endoribonuclease and, optionally, an RNA endrepair enzyme) to form by cleavage of the RNA substrate at one or more sites within the singlestranded portion by the single-strand-specific nucleotide-specific endoribonuclease cleaved bound portions and one or more fragments of the single-stranded portion. A method may comprise, in some embodiments, separating the cleaved bound portions from the one or more fragments of the at least one single-stranded portion.
According to some embodiments, a method may comprise analyzing one or more properties of the cleaved bound portions and/or analyzing one or more properties of the fragments. Analyzing one or more properties of the cleaved bound portions may include, in some embodiments, characterizing at least the RNA substrate fragment of the cleaved bound portions (e.g., the more 5’ portion and/or the more 3’ portion of the RNA substrate) by one or more of gel electrophoresis, capillary electrophoresis, liquid chromatography, and mass spectrometry, wherein the characterizing optionally comprises at least one of assessing the molecular mass of the RNA substrate, assessing the sequence of the RNA substrate (or a portion thereof), and assessing the modification status (e.g., modified bases appearing in the RNA substrate including 1 -methylpseudouridine and 5-methoxycytidine; 5’ ends having a pp, ppp, CapO, Capl, or Cap2; 3’ ends having a polyA tail or inverted thymidine) of the RNA substrate fragment of the cleaved bound portions. Analyzing one or more properties of the fragments of the at least one single-stranded portion may include, in some embodiments, characterizing the fragments by one or more of gel electrophoresis, capillary electrophoresis, liquid
chromatography, and mass spectrometry, wherein the characterizing optionally comprises at least one of assessing the molecular mass of the fragments, assessing the sequence of the fragments (or a portion thereof), and assessing the modification status (e.g., modified bases appearing in the RNA substrate including 1 -methylpseudouridine and 5 -methoxy cytidine; 5’ ends having a pp, ppp, CapO, Cap 1 , or Cap2; 3 ’ ends having a polyA tail or inverted thymidine) of the fragments.
Examples of a single-strand-specific nucleotide-specific endoribonuclease include hRNase 4, RNase Tl, RNase U2, RNase A, Colicin E5, MCI, Cusativin, Csxl, MazF, ChpB, MqsR, and YafO. An RNA end repair enzyme may comprise phosphodiesterase and phosphomonoesterase activities (e.g., a polynucleotide kinase-phosphatase). Examples of an RNA end repair enzyme include a T4 polynucleotide kinase-phosphatase or a Cth polynucleotide kinase-phosphatase. In some embodiments, an RNA substrate may comprise in vitro transcribed RNA, chemically synthesized RNA, viral RNA, prokaryotic RNA, eukaryotic RNA, archaeal RNA, or any combination thereof. An RNA substrate may comprise tissue culture RNA, biopsy RNA, feces RNA, urine RNA, lymph RNA, blood RNA, mucous RNA, sputum RNA, skin RNA, saliva RNA, wound RNA, sweat RNA, semen RNA, shoot RNA, root RNA, seed RNA, sewage RNA, sludge RNA, soil RNA, or any combination thereof. An RNA substrate may comprise messenger RNA (mRNA), ribosomal RNA (rRNA), transfer RNA (tRNA), small RNA (sRNA), microRNA (miRNA), long non-coding RNA (IncRNA), circular RNA (circRNA), aptamer RNA, antisense RNA, silencing RNA (siRNA), guide RNA (gRNA), or any combination thereof.
BRIEF DESCRIPTION OF THE FIGURES
FIGURE 1 shows a schematic of an example cassette used to produce recombinant hRNase 4 enzyme. pPS: periplasmic signal peptide; 6HIS: hexahistidine tag; MBP: maltose binding protein; TEV: TEV protease cleavage site.
FIGURE 2 shows an example cleavage efficiency heatmap for pooled synthetic oligonucleotides (SEQ ID NOS: 1-13) using hRNase 4 at the indicated dilutions of enzyme. Darker boxes indicate more efficient cleavage by hRNase 4.
FIGURE 3 shows an example bar chart of the mean total intensity of 5-prime cleavage products formed by digestion with hRNase 4. 5-Prime cleavage products are classified according to their terminal 3 ’-nucleotide residue. Error bars represent the standard deviation from two replicate digests. hRNase 4 primarily produced 5-prime cleavage products comprising a 3’-uridine nucleotide.
FIGURE 4 shows an example bar chart of the mean total intensity of 3 -prime cleavage products formed by digestion with hRNase 4. 3-Prime cleavage products are classified by their initial 5 ’-nucleotide residue. Error bars represent the standard deviation from two replicate digests. hRNase 4 primarily produced 3-prime cleavage products comprising a 5 ’-adenine or 5 ’-guanine.
FIGURE 5 shows an example comparison of predicted (theoretical) coverage of mRNA transcripts cleaved with hRNase 4 and various endoribonucleases. Cleavage products with lengths greater than 4 and less than 40 nucleotides were considered for sequence coverage calculations. FIGURE 5 A shows the theoretical sequence coverage of hRNase 4 and various endoribonucleases for 1000 random human mRNA transcripts (RefSeq). FIGURE 5B shows the theoretical sequence coverage of hRNase 4 and various endoribonucleases for E. coll coding sequences (CDS). FIGURE 5C shows the theoretical sequence coverage of hRNase 4 and various endoribonucleases for the BNT162b2 COVID- 19 mRNA vaccine sequence.
FIGURE 6 shows an example comparison of predicted (theoretical) coverage of mRNA transcripts cleaved with endoribonucleases having hRNase 4-like cleavage specificities, namely those directed to a single nucleotide followed by either a purine or a pyrimidine (N(Y/R)) or to a purine or a pyrimidine followed by a single nucleotide ((Y/R)N) and with endoribonucleases having a single dinucleotide sequence (NN) or a single nucleotide (N) specificity. Cleavage products with lengths greater than 4 and less than 40 nucleotides were considered for sequence coverage calculations. FIGURE 6A shows the theoretical sequence coverage of the endoribonucleases for 1000 randomly selected human mRNA transcripts (RefSeq). FIGURE 6B shows the sequence coverages of individual hRNase 4-like (‘N(Y/R) & (Y/R)N’) cleavage specificities upon digestion of 1000 randomly selected human mRNA transcripts. FIGURE 6C shows the theoretical sequence coverage of the endoribonucleases for E. coli coding sequences (CDS). FIGURE 6D shows the theoretical sequence coverage of the endoribonucleases for the BNT162b2 COVID-19 mRNA vaccine sequence. Endoribonucleases having hRNase 4-like cleavage specificities are predicted to produce superior mRNA coverage relative to endoribonucleases having a single dinucleotide sequence (NN) or single nucleotide (N) specificity.
FIGURE 7 shows example overlaid UV chromatograms of the digestion of a synthetic oligoribonucleotide (SEQ ID NOS: 14-17) with hRNase 4 in the presence or absence of T4 PNK (lighter and darker traces, respectively). Cleavage products detected in this assay are
represented by the sequences 1 to 4. Co-incubation of hRNase 4 with T4 PNK resulted in the conversion of a mixture of 5-prime cleavage products comprising 2',3'-cyclic phosphorylated (peak marked as #3) and 3 '-phosphorylated (peak marked as #4) termini into a single 2', 3'- hydroxylated product (peak marked as #2), thereby simplifying the sequence identity analysis of 5-prime cleavage products. The 3-prime cleavage product (peak marked as #1) is the same in either treatment.
FIGURE 8 shows example workflows used for digestion of an RNA. FIGURE 8A shows an example generic workflow. FIGURE 8B shows an example workflow in which the subject RNA is digested with RNase T1. FIGURE 8C shows an example workflow in which the subject RNA is digested with a composition comprising hRNase 4 and T4 PNK.
FIGURE 9 shows an example theoretical sequence coverage map obtained from digestion of FLuc IVT mRNA with either hRNase 4 or RNaseTl . hRNase 4 is predicted to generate a much larger number of unique cleavage products than RNase Tl. RNase Tl is predicted to generate a high percentage of isomeric cleavage products (i.e., products with the same nucleotide composition but with distinct sequences of nucleotides).
FIGURE 10 shows the scoring distribution (violin plots) of an example search of the deconvoluted masses of cleavage products, resulting from digestion of FLuc IVT mRNA against a human transcriptome database spiked with FLuc mRNA. FIGURE 10A shows the scoring distribution for digestion with hRNase 4/T4 PNK. FIGURE 10B shows the scoring distribution for digestion with RNase Tl. A substantially higher background was observed using RNase Tl as a result of the high percentage of isomeric cleavage products.
FIGURE 11 shows the number of oligonucleotides identified in each pool of triplicate sequencing experiments of FLuc mRNA digests with the overlapping portion of each showing the oligonucleotides common to two or all three replicates. Most oligonucleotides were reproducibly identified in each pool with hRNase 4/T4 PNK (96) and RNase Tl (85). Each replicate had similar total number of spectral counts.
FIGURE 12 shows the distribution of cleavage product lengths identified in each replicate from digestion of FLuc IVT mRNA with either hRNase 4/T4 PNK or RNaseTl . Increased median and maximum lengths were observed in the hRNase 4/T4 PNK condition in comparison with that of RNaseTl .
FIGURE 13 shows the experimental coverage of FLuc mRNA observed in digests either with hRNase 4/PNK or with RNase Tl . Improved sequence coverage of FLuc mRNA
was observed in each hRNase 4/T4 PNK experiment (average 69.8%) relative to that of RNaseTl (average 52.8%).
FIGURE 14 shows the experimental coverage of FLuc mRNA observed in digests with RNaseTl alone, hRNase 4 alone, or RNaseTl and hRNase 4 in combination. Increased coverage may be obtained by combining oligonucleotide identifications across treatment with different endoribonucleases.
FIGURE 15 shows the number of oligonucleotides identified in each pool of triplicate sequencing experiments of FLuc mRNA digests represented as a circle. The portions of circles that overlap represent oligonucleotides common to both (portions where two circles overlap) or all three (portions where all three circles overlap) replicates. Data is shown for MC1/T4 PNK-based digests and demonstrates that T4 PNK may be successfully coincubated with diverse endoribonucleases to produce reproducible cleavage product identifications. MCI belongs to the T2 RNase family and was isolated from seeds of the bitter gourd Momordica charantia).
FIGURE 16 shows the experimental coverage of FLuc mRNA observed in three digests with a composition of MC1/T4 PNK. Digestion with MC1/T4 PNK results in reproducible RNA sequence coverage.
FIGURE 17 shows the scoring distribution of an example search of the deconvoluted masses of Epo mRNA digests against a Epo mRNA-spiked in human transcriptome database. Epo mRNA cleavage products were generated by digestion EpoU, EpomoU, or EpomlY mRNA either with hRNase 4/T4 PNK (replicate 1, column 1; replicate 2; column 2) or with RNase T1 (replicate replicate 1, column 3; replicate 2, column 4). The mean signal -to-noise ratio (S/N) of the score of each U-modified (lower row), m 1 Y-modified (top row), or mo5U- modified (middle row) EPO mRNA sequence relative to all other transcripts is reported at the top of each pair of graphs. Cleavage products produced by RNase T1 are generally shorter in length as exemplified in FIGURE 11. Accordingly, there is a higher probability of mapping those oligonucleotides to unrelated transcript sequences thereby increasing the analysis background.
FIGURE 18A shows the sequence coverage of fully modified EpoU (right column), EpomoU (middle column) or EpomlY (left column) mRNAs upon analysis of cleavage products originated from digestion with either hRNase 4/T4 PNK (replicate 1, row 1; replicate 2, row 2) or with RNaseTl (replicate 1, row 3; replicate 2, row 4). FIGURE 18B shows the fractional coverage of fully modified EpoU, EpomoU or EpomlY mRNAs upon
analysis of cleavage products originated from digestion with either hRNase 4/T4 PNK or with RNaseTl . The hRNase 4/T4 PNK condition substantially increases coverage for canonical or base modified Epo mRNAs relative to the RNase T1 condition.
FIGURE 19 shows an example in which capped versus uncapped Epo mRNA (including cap modifications, such as cap methylation) (SEQ ID NOS: 18-22) are differentiated by cleavage with hRNase 4/T4 PNK. Error bars represent standard deviation from two replicate digests.
FIGURE 20 shows example UV chromatograms of RNA cleavage products. The upper trace shows that no higher-retention cleavage products detected were detected in hRNase 4/T4 PNK treatment of Epo mRNA that lacked a poly-A tail whereas the lower trace shows that higher-retention cleavage products were detected in hRNase 4/T4 PNK treatment of Epo mRNA comprising a poly-A tail. These data show that polyA tails may be detected by cleavage with hRNase 4/T4 PNK.
FIGURE 21 A shows the scoring distribution of an example search of the deconvoluted masses of uridine-depleted CLuc mRNA digests against a CLuc mRNA-spiked in human transcriptome database. Cleavage products derived from uridine-depleted CLuc mRNAs were generated by digestion with hRNase 4/T4 PNK (2 replicates per substrate). The mean signal-to-noise ratio (S/N) of the score of each uridine-depleted cLuc mRNA sequence relative to all other transcripts is reported at the top of each pair of graphs. FIGURE 2 IB shows a schematic representation of the relative location of depletion regions (broken lines) in each of three uridine-depleted CLuc mRNAs used in this experiment (CLuc Ul, CLuc U2, and CLuc U3).
FIGURE 22 shows a sequence coverage map of each uridine-depleted CLuc mRNA upon analysis of cleavage products originated from digestion with hRNase 4/T4 PNK, accounting for shared sequences between each sample. The detected coverages regions are represented for each of CLuc Ul (replicate 1 and 2, left columns), CLuc U2 (replicate 1 and 2, middle columns), CLuc U3 (replicate 1 and 2, right columns). Despite the similarity among these sequences, analysis of hRNase 4 digests enabled correct annotation of each mRNA comprising distinct uridine-depleted segments.
FIGURE 23 shows the number of true positive and false positive oligonucleotide identifications of each uridine depleted CLuc mRNA upon digestion with hRNase 4/T4 PNK (2 replicates per sequence) in accordance with an example embodiment.
FIGURE 24A, FIGURE 24B, and FIGURE 24C each show a schematic for isotopically labeling RNA oligonucleotides for quantification analysis. As illustrated in FIGURE 24A, a non-isotopically labeled nucleotide comprising a chemically reactive group (3’-azido-3’deoxy-nucleotide) may be incorporated at the 3’ end of an RNA oligonucleotide by incubation of the non-isotopically labeled nucleotide (e.g., a 3 ’-azido-3’ deoxy-nucleoside triphosphate) and an RNA polymerase. While isotopically labeled molecules may be functionalized with a number of chemically reactive groups, the example scheme of FIGURE 24B shows isotopically labeled molecules, wherein the chemically reactive group is DBCO and the “light” and “heavy” isotopically labeled molecules are derived from the amino acid alanine. An example of a tandem mass tag dipeptide conjugate is also shown. This tandem mass tag comprises a reporter and a balancing amino acid (for simplicity heavy isotopes are omitted from illustration; the site of HCD fragmentation is represented by a dashed line). FIGURE 24C shows an example of a chemoselective reaction involving a 3'-terminal 3'- azido-modified RNA oligonucleotide and a DBCO conjugate.
FIGURE 24D, FIGURE 24E and FIGURE 24F each illustrate an example HCD fragmentation pattern of an RNA nucleoside that has been chemoselectively labeled with a reporter group. In this example, the reporter group is attached to the RNA nucleotide 3’ end by the reaction of a 3 ’-azido-3 ’-deoxy adenosine with a DBCO peptide conjugate. The DBCO peptide conjugate may comprise one or more isotopically labeled atoms (e.g., 2H, 13C and 15N) and may be used for quantitative analysis of multiple oligonucleotides in a single experiment (e.g., for quantification of capped versus uncapped 5’ end oligonucleotides in a capping assay). FIGURE 24D illustrates an example HCD fragmentation mass spectrum of an alanine derived peptide-deoxyadenosine conjugate. FIGURE 24E illustrates an example HCD fragmentation mass spectrum of an alanine-phenylalanine derived dipeptidedeoxyadenosine conjugate. FIGURE 24F illustrates an example HCD fragmentation mass spectrum of an alanine-proline derived dipeptide-deoxyadenosine conjugate. The data demonstrate the identification of characteristic phenylalanine or proline amino acid reporter anions in the HCD spectra from each dipeptide conjugate, respectively. These reporter anions may further comprise isotopically labeled atoms and be used for quantification of the corresponding oligonucleotide conjugates.
FIGURE 25A and FIGURE 25B each show an example schematic for isotopically labeling RNA oligonucleotides by incorporating a chemically reactive group. FIGURE 25A shows labeling the 5’ end of an oligonucleotide by first incubating the oligonucleotide with
ATPyS and T4 PNK to form a 5’ terminal thiophosphate oligonucleotide, and then reacting it with an iodoacetyl tandem mass tag (iodoTMT) reagent set comprising an isobaric mixture of isotopes as shown. FIGURE 25B shows labeling the 3’ end of an oligonucleotide by first incubating the oligonucleotide with sodium periodate to form a 3’ terminal dialdehyde oligonucleotide, and then reacting it with an aminoxy tandem mass tag (aminoxyTMT) reagent set comprising an isobaric mixture of isotopes analogous to the one shown in FIGURE 25A.
FIGURE 26 shows the relative intensities of the oligonucleotides detected by LC- MS/MS analysis following hRNase 4 cleavage in the presence and absence of human placental RNase inhibitor. Data shown demonstrate that hRNase 4-mediated cleavage of an RNA oligonucleotide is inhibited by human placental RNase inhibitor. Sequences shown include SEQ ID NO: 23 and subsequences thereof (positions 1-6, positions 1-9, positions 2- 15, positions 7-15, positions 10-15).
FIGURE 27 shows an example of a workflow used for targeted site-specific cleavage and isolation of a 5’-capped RNA oligonucleotide for downstream capping analysis. In this example workflow the subject EPO mRNA is first annealed with a DNA probe (e.g., a biotinylated DNA probe) and then digested with a composition comprising hRNase 4 and T4 PNK. The DNA-RNA duplex formed after digestion is purified (e.g., by affinity capture using streptavidin beads) and the RNA oligonucleotide is released (e.g., by elution using DNase I).
FIGURE 28 A shows a schematic representation of DNA-targeted hRNase 4 sitespecific cleavage of an IVT Epo mRNA substrate. The arrow shows the closest ‘UR’ cleavage site near the DNA-RNA duplex region. The resulting RNA oligonucleotide product is shown in grey. FIGURE 28B shows a total ion chromatogram from LC-MS/MS characterization of the isolated RNA oligonucleotide (top panel) after its elution from beadbound DNA-RNA duplex by treatment with DNase I. No oligonucleotide was isolated in absence of a DNA probe (middle panel) or in absence of hRNase 4 (lower panel). FIGURE 28C shows a deconvoluted mass spectrum depicting the intact masses observed within the single chromatographic peak of FIGURE 28B (35mer Target) in the sample treated with hRNase 4/T4 PNK in the presence of the biotinylated DNA-probe. Mass spectrometry analysis confirm the isolation of the desired 35mer RNA oligonucleotide comprising the mass of a m7GpppAm cap structure.
FIGURE 29A illustrates a heatmap depicting oligonucleotide products from a DNA probe-directed, RNA cleavage protection assay using example nucleotide specific and dinucleotide specific single-stranded ribonucleases. ‘Protected Products’ refers to those oligonucleotide cleavage products spanning the DNA hybridized region. Numbers designating the start and end position of each identified cleaved oligonucleotide within the 40mer RNA sequence are shown in the y-axis. NoP: no probe; NoE: no enzyme; lOf, 50f and 250f are fold dilutions of each ribonuclease. Data shown demonstrate that the cleavage product heterogenicity is dependent on the identity and concentration of the ribonuclease utilized in the protection assay. FIGURE 29B illustrates sequences of the most abundant protected products for each enzyme. Specifically, the 20mer DNA probe is represented as a bar aligned above the substrate 40-mer (SEQ ID NO: 31) with the respective protected products appearing below. Fragments of the substrate 40-mer are shown with numeric ranges to the right indicating the corresponding positions of SEQ ID NO:31.
FIGURE 30A illustrates a series of example ribonuclease protection assays with a 40mer RNA and complementary DNA probes ranging from 20 to 30 nucleotides in length. FIGURE 30B illustrates a heatmap of oligonucleotide products identified in ribonuclease protection assays using hRNase 4, MCI or RNase T1 with various DNA probes (NE: no enzyme). Tile shade relates to the mean signal area of each identified oligonucleotide in each experiment. FIGURE 30C illustrates predominant cleavage site positions within the 40mer substrate (SEQ ID NO: 31) for hRNase 4, MCI, and RNase Tl. hRNase 4 displayed less cleavage product heterogenicity regardless of the DNA probe chosen.
FIGURE 31 A illustrates a ribonuclease protection assay of FLuc mRNA (SEQ ID NO: 26) using hRNase 4 or RNase H. The 25mer probe is represented as a bar (light shade represents deoxyribonucleotides; dark shade represents ribonucleotides). Black circles indicate position of biotin group. X represents a capped or uncapped 5’ modification. RNase cut sites are marked. FIGURE 3 IB illustrates products from an example FLuc mRNA 5'-end sequence cleavage with either hRNase 4 (top) or RNase H (bottom) following a biotin enrichment step. RNase H produces a significant amount of an additional cleavage sequence that is one nucleotide shorter (‘Product -Inf) than that of the main cleavage sequence (24mer). FIGURE 31C illustrates deconvoluted mass spectra of the main cleavage product peak of hRNase 4 digest (the 28mer) from Figure 3 IB. FIGURE 3 ID illustrates deconvoluted mass spectra of the main cleavage product peak of RNaseH digest (the 24mer) from Figure 3 IB. Both cleavage products of Figure 31C (28mer) and Figure 3 ID (24mer) comprise a
mixture of the individual sequences with different 5’ ends, which may include 5’-pp (diphosphate or 2p), 5’-ppp (triphosphate or 3p), 5’-Gppp, Cap 0 (5’-m7GpppG) and Cap 1 (5’-m7GpppGm). FIGURE 3 IE illustrates an example heatmap of the intensity of the Cap 1 modified oligonucleotides detected in two independent experiments using hRNase 4 or RNase H. RNase H produces two abundant Cap 1 oligonucleotide product sequences (23mer and 24mer), whereas hRNase 4 produces predominantly one (28mer). FIGURE 3 IF illustrates distribution of the different 5’ ends in all identified sequences for each of hRNase 4 or RNase H digests averaged from two replicates. Data shown demonstrate that the relative quantification of capped products and their intermediates in the hRNase 4 condition is comparable to that of RNase H enzyme, indicating that hRNase 4 can be effectively used for analysis of mRNA capping efficiency. The presence of a 3’ end repair enzyme, such as T4 PNK, in combination with hRNase 4 may produce molecules with consistent, dephosphorylated 3’termini and, thereby, reduce ambiguity in the attribution of mRNA 5’ modification.
FIGURE 32A compares hRNase 4 and RNase H for the analysis of the enzymatic capping efficiency of unmodified (U) and fully modified (mlY) mRNA. Data shown illustrate the distribution of 5’ ends in a population of capped mRNAs revealed by analyses with hRNase 4 or RNase T1. Comparable distributions of capped products and intermediates were obtained in both the hRNase 4 analysis and the RNase T1 analysis. FIGURE 32B illustrates an example heatmap of the intensity of Cap-1 modified oligonucleotides of various lengths detected for each mRNA variant. In contrast to hRNase 4, RNase H displayed a higher propensity to spuriously cleave one or more nucleotides upstream or downstream from the target site resulting in a mixture of cleaved products differing from each other by one or more nucleotides in length, even after extensive probe optimization.
FIGURE 33 illustrates an example ribonuclease protection assay applied to the analysis of 3’ end of EPO mRNA (SEQ ID NO:27) using hRNase 4. A DNA probe was designed to direct RNA cleavage a few nucleotides upstream of the mRNA poly(A) tail (cleavage sites for hRNase 4 are designated with R4). The deconvoluted mass spectra of the product of the 3’ end cleavage shows a distribution of peaks between 40,000 and 45,000 u that differ from each other by an adenosine (A) nucleotide and indicative of the presence of a poly (A) tail.
FIGURE 34 illustrates an example workflow 3400 used for integrative analysis of 5’ cap, poly(A) tail, and an mRNA internal sequence (also referred as to mRNA body sequence)
using hRNase 4. In this example workflow, subject RNA is annealed 3410 with DNA probes targeted to the RNA 5’ and 3’ ends, wherein each DNA probe independently of each other may comprise an affinity group. Hybridized RNA is digested 3420 with hRNase 4, optionally in a composition with T4 PNK. Cleaved DNA-RNA duplexes are purified 3430 (e g., by affinity capture) and the cleaved single-stranded RNA oligonucleotides are collected 3440 (supernatant). The supernatant fraction containing the cleaved single-stranded RNA oligonucleotides may be used for analysis 3450 of the mRNA internal sequence. DNA-RNA duplexes may be eluted 3460 and used for 5’ cap and poly(A) tail analysis 3470 (e.g., directly or after releasing the RNA strands by DNase I treatment).
BRIEF DESCRIPTION OF THE SEQUENCES
SEQ ID NOS: 1-13, which are also illustrated in FIGURE 2 and Table 1, are example oligoribonucleotides for assessing cleavage capabilities of an RNase.
SEQ ID NOS: 14-17, which are also illustrated in FIGURE 7, are example oligoribonucleotides for assessing cleavage capabilities of an RNase.
SEQ ID NOS: 18-22, which are also illustrated in FIGURE 19, are hRNase 4/T4 PNK cleavage products of Epo mRNA (SEQ ID NO:27).
SEQ ID NO: 23 is an example RNase substrate for assessing cleavage capabilities of an RNase.
SEQ ID NO: 24 is an example biotinylated DNA probe sequence for hybridization with an RNase substrate.
SEQ ID NO: 25 is a portion of an EPO mRNA (SEQ ID NO: 27) example RNase substrate for assessing cleavage capabilities of an RNase.
SEQ ID NO: 26, which is also illustrated in Table 1 and, in part, in FIGURE 31 A, is a FLuc mRNA example RNase substrate for assessing cleavage capabilities of an RNase.
SEQ ID NO: 27, which is also illustrated in Table 1 and, in part, in FIGURE 33, is an EPO mRNA example RNase substrate for assessing cleavage capabilities of an RNase.
SEQ ID NO: 28, which is also illustrated in Table 1, is a ClucUl mRNA example RNase substrate for assessing cleavage capabilities of an RNase.
SEQ ID NO: 29, which is also illustrated in Table 1, is a ClucU2 mRNA example RNase substrate for assessing cleavage capabilities of an RNase.
SEQ ID NO: 30, which is also illustrated in Table 1, is a ClucU3 mRNA example RNase substrate for assessing cleavage capabilities of an RNase.
SEQ ID NO: 31, which is also illustrated in FIGURES 29 and 30C, is an example oligoribonucleotide for assessing cleavage capabilities of an RNase. When hybridized with an oligodeoxyribonucleotide, cleavage products may include a first fragment (e.g., consisting of positions 1-17, 1-19, 1-20, 1-21, 1-22, 1-23, 1-25, 1-26, 1-27, 1-28, 1-29, 1-30, 1-31, 1-33, 2-22, 3-22, or 4-22 of SEQ ID NO:31), one or more additional oligoribonucleotides (e.g., 4- 23 nucleotides in length), and optionally one or more mono-di- and tri-ribonucleotides.
SEQ ID NO: 32 is an example DNA probe sequence for hybridization with an RNase substrate.
SEQ ID NOS: 33-42, which are also illustrated in Table 5, are example DNA probe sequences for hybridization with an RNase substrate.
SEQ ID NOS: 43-45 are example RNase 4 sequences.
SEQ ID NO: 46 is an example polynucleotide kinase sequence.
SEQ ID NOS: 47-50, which are also illustrated in Table 6, are example mRNA 5'- UTR coding sequences.
SEQ ID NOS: 52-58, which are also illustrated in Table 7, are example probe sequences used to assess probe-directed RNA cleavage of mRNAs comprising distinct 5'- UTRs.
SEQ ID NO: 59 is an example biotinylated DNA probe sequence for hybridization with an RNase substrate.
DETAILED DESCRIPTION
Precise analytical approaches may be desirable and/or necessary to directly confirm the nucleotide sequence, and the identity and position of nucleotide modifications in a subject RNA. Mass spectrometry (MS) is a technique that allows direct and comprehensive characterization of nucleic acids and their chemical modifications without any prior knowledge or assumptions (Yoluc et al., 2021). Mass spectrometry analysis of RNA may be conducted with non-hydrolyzed RNA species (top-down analysis), partially hydrolyzed RNA species (bottom-up analysis) or fully hydrolyzed RNA species (nucleoside analysis). Typically, prior to bottom-up MS analysis, RNA is partially hydrolyzed by enzymatic digestion to oligonucleotides using site-specific ribonucleases (RNases), such as RNase T1 (guanosine-specific), RNase A (pyrimidine-specific) and RNase U2 (purine-specific). The efficacy and reproducibility of RNA analysis approaches may be highly dependent on the quality and purity of RNases. However, to date just a few RNases have been fully characterized and validated for RNA analysis. Because RNases are often toxic to the
expression host, the production of RNases in high yields and high purities is challenging. In many cases, the resulting RNase preparations are of low quality and sometimes may be contaminated with other undesired RNase activities, making it difficult to precisely define the RNase specific activity. In other cases, the RNase itself does not exhibit clear-cut specificity and thus produces secondary cleavage of RNA (i.e., cleavage at sites that are different from the main cleavage motif), which often increases with the RNase concentration. Furthermore, depending on the nature and concentration of the RNase(s), digestion buffer(s), and incubation time(s), a complex mixture of RNA cleavage products comprising diverse phosphorylation states at the 5’ (5-prime) and 3’ (3-prime) ends may be obtained (e.g., 5’- phosphate, 5-hydroxy, 3 ’ -phosphate, 3 ’-hydroxy, 2’ -phosphate, 2’ -hydroxy, and/or 2', 3'- cyclic-phosphate), thereby complicating comprehensive RNA analysis. The presence of RNA digestion products of the same or different sequence with multiple possible phosphorylation status at their ends, including non-phosphorylated ends, increases the spectral complexity and the likelihood of peak overlaps, and reduces the overall abundance of each ion.
Ribonuclease mapping may be used to determine RNA sequence and modification status by mass spectrometry. In some instances, as part of ribonuclease mapping, the protocol for RNA digestion comprises an additional step of treating the digested RNA, often (but optionally) after purification of the digested RNA, with a phosphodiesterase (e.g., a cyclic phosphodiesterase) to reduce the sample complexity. Phosphodiesterases (PDEs) are enzymes that are characterized by their ability to cleave a phosphodiester bond. Cyclic PDEs (2',3 '- cyclic nucleotide phosphodiesterase, also referred as to CNPase or CNP) cleave a phosphodiester bond in 2', 3 '-cyclic nucleotide to form a nucleoside 2'-phosphate. Cyclic PDEs, such as human CNP, do not hydrolyze phosphate monoesters (i.e., they do not exhibit phosphomonoesterase activity).
Development of methods that not only simplify and shorten RNA processing steps prior to downstream mass spectrometry analysis, but also increase the depth of RNA analysis are desired, for example, (i) to improve the accuracy of the resulting sequence data, (ii) to enable accurate sequencing of smaller amounts of input RNA, and/or (iii) to better differentiate sequencing errors from true sequence variations. Such methods may benefit from endoribonucleases showing one or more of the following properties: (a) robust and reproducible specific activity; (b) easy to express and purify as soluble protein; (c) long shelflife stability; (d) tolerates the presence of salts and/or denaturing agents; (e) conditionally inhibited and/or deactivated (e.g., to limit activity when desired); (f) at least moderately
thermostable; (g) cleavage frequencies of, on average, every 6-12 nucleotides; (h) nominal or no spurious cleavage activity; and (i) capable of cleaving RNA modifications.
Endonucleases having some or all of these properties may reduce or minimize the extent of the formation of isomeric digestion products and/or increase the sequence coverage of long, complex RNAs.
The present disclosure relates, in some embodiments, to methods and compositions for RNA characterization, which may include, for example, chromatographic and/or spectroscopic characterization. For example, methods and compositions may include RNA analysis or characterization (e.g., sequencing) by LC-MS/MS. Methods and compositions, according to some embodiments, may include and/or use human endoribonuclease 4. In some embodiments, compositions and methods may include one or more endoribonucleases and one or more RNA end repair enzymes that work (e.g., work concurrently) to recognize, cleave, and heal specific RNA sequences and may produce from a RNA substrate oligoribonucleotides having fully hydroxylated ends (i.e., RNA oligonucleotides comprising 5’-OH, 3'-OH, and 2'-OH termini).
In some embodiments, the present disclosure relates to methods and compositions for analyzing RNA substrates using, for example, tandem liquid chromatography-mass spectrometry (e.g., LC-MS/MS). Methods may include, for example, preparing oligoribonucleotides from RNA substrates and analyzing the oligoribonucleotides. Compositions and kits to produce oligoribonucleotides may comprise one or more components according to some embodiments. For example, compositions and kits to produce oligoribonucleotides may comprise one or more enzymes or catalysts active on RNA substrates including, for example, an endoribonuclease (e.g., human endoribonuclease 4) and an RNA end repair enzyme (e.g., bacteriophage T4 polynucleotide kinase (T4 PNK)). Compositions and kits to produce oligoribonucleotides may comprise one or more buffering agents and/or one or more RNA denaturing agents. Compositions and methods, in some embodiments, have application to analysis of RNA-based cancer immunotherapies, protein-replacement therapies, and prophylactic and therapeutic vaccines.
General Considerations
Aspects of the present disclosure can be further understood in light of the embodiments, section headings, figures, descriptions and examples, none of which should be construed as limiting the entire scope of the present disclosure in any way. Accordingly, the innovations set forth herein should be construed in view of the full breadth and spirit of the disclosure.
Each of the individual embodiments described and illustrated herein has discrete components and features which can be readily separated from or combined with the features of any of the other several embodiments without departing from the scope or spirit of the present teachings. Any recited method can be carried out in the order of events recited or in any other order which is logically possible. Unless otherwise expressly stated to be required herein, each component, feature, and method step disclosed herein is optional and the disclosure contemplates embodiments in which each optional element may be expressly excluded. As such, this statement is intended to serve as antecedent basis for use of such exclusive terminology as “solely,” “only” and the like in connection with the recitation of claim elements or use of a “negative” limitation.
Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. Still, certain terms are defined herein with respect to embodiments of the disclosure and for the sake of clarity and ease of reference.
Sources of commonly understood terms and symbols may include: standard treatises and texts such as Kornberg and Baker, DNA Replication, Second Edition (W.H. Freeman, New York, 1992); Lehninger, Biochemistry, Second Edition (Worth Publishers, New York, 1975); Strachan and Read, Human Molecular Genetics, Second Edition (Wiley-Liss, New York, 1999); Eckstein, editor, Oligonucleotides and Analogs: A Practical Approach (Oxford University Press, New York, 1991); Gait, editor, Oligonucleotide Synthesis: A Practical Approach (IRL Press, Oxford, 1984); Singleton, et al., Dictionary of Microbiology and Molecular biology, 2d ed., John Wiley and Sons, New York (1994), and Hale & Markham, the Harper Collins Dictionary of Biology, Harper Perennial, N.Y. (1991) and the like.
In the context of the present disclosure, the singular forms “a” and “an” include plural referents unless the context clearly dictates otherwise. For example, the term “a protein” refers to one or more proteins, i.e., a single protein and multiple proteins.
Numeric ranges are inclusive of the numbers defining the range. All numbers should be understood to encompass the midpoint of the integer above and below the integer i.e., the number 2 encompasses 1.5-2.5. The number 2.5 encompasses 2.45-2.55 etc. When sample numerical values are provided, each alone may represent an intermediate value in a range of values and together may represent the extremes of a range unless specified. Concentration percentages are disclosed as (w/v) unless expressly stated otherwise.
Definitions
In the context of the present disclosure, an “affinity capture domain” refers to a domain capable of binding a corresponding affinity domain. Example materials having such properties include avidin, streptavidin, neutravidin, maltose-binding protein, GST, antibodies (e.g., anti- HA, anti-Myc, anti-FLAG), S-protein, calmodulin, lectins, nickel, cobalt, zinc, and polyhistidine. Further examples include groups that form an irreversible bond with a protein tag, including benzylguanine or benzylchoropyrimidine (SNAP -tag); benzylcytosine (CLIP -tag); haloalkane (HaloTag); CoA analogues (MCP-tag and ACP-tag); trimpethoprim or methotrexate (TMP-tag); FlAsH or ReAsH (Tetracysteine tag); a substrate of biotin ligase; a substrate of phosphopantetheline transferase; and a substrate of lipoic acid ligase. An affinity capture method may be used for selectively enriching samples by means of affinity purification methods, wherein the affinity binding partner is immobilized in a column, bead, microtiter plate, membrane or other solid support.
In the context of the present disclosure, an “affinity domain” refers to a domain capable of binding a corresponding affinity capture domain with high affinity (e.g., at least I0'8M) and specificity. Example materials having such properties include biotin, DBT, desthiobiotin, oxybiotin, iminobiotin, diaminobiotin, biotin sulfoxide, biocytin, digoxigenin, glutathione, heparin, maltose, coenzyme A, protein A, Brilliant Blue FCF, azorubine, phytoestrogen, nickel, cobalt, zinc, poly-histidine, HA-tag, c-myc tag, FLAG-tag, S-tag, CBP-tag, dihydrofolate reductase, a hapten to an antibody, a mono- or oligosaccharide ligand to a lectin, hormones, cytokines, toxins, dyes, and vitamins. Such molecules may be fused with a molecule to be marked as desired. For example, an affinity domain may be fused to the 5’ end, the 3’ end, or anywhere along the length of a polyribonucleotide.
In the context of the present disclosure, “buffer” or “buffering agent” refers to an agent that, when in solution or in contact with a solution, contributes to or causes such solution to resist changes in pH upon addition of acid(s) or alkali(s) to the solution. Examples of suitable non-naturally occurring buffering agents that may be used include, for example, any of Tris, HEPES, TAPS, MOPS, tri cine, and MES.
In the context of the present disclosure, “coupled reaction” refers to a reaction in which two or more reaction steps occur in a single reaction mixture and in a single reaction location (e.g., a tube, a container, a vessel, a well, a capillary, a flow cell, a surface or other space) or separate locations that are in fluid communication with one another (e.g., where enzymes are deposited in separate locations on a surface and the locations are immersed in a common fluid comprising, for example, one or more substrates, buffers, reaction intermediates, and/or
reaction products). A reaction location may be defined by one or more walls (e g., of a tube), a liquid (e.g., a liquid immiscible with a reaction fluid), a fluid (e.g., a gas including, for example gaseous nitrogen or air), a vacuum, or combinations thereof. Sequential reaction steps in a coupled reaction may begin and/or continue without changes to reaction conditions (e.g., without addition or removal of reagents, changes in temperature, pH, volume, or washing) beyond those that arise or follow from the reactions themselves.
In the context of the present disclosure, “denaturing agent” or “RNA denaturing agent” refers to an agent that, in contact with RNA, disrupts intramolecular hydrogen bonding in the RNA by melting existing hydrogen bonds, if present, and/or interfering with formation of new hydrogen bonds. An RNA denaturing agent may lack any ribonuclease activity. Examples of RNA denaturing agents include formamide, dimethylformamide (DMF), guanidinium thiocyanate, sodium salicylate, dimethyl sulfoxide (DMSO), propylene glycol, poly(ethylene glycol (PEG), cetyltrimethylammonium bromide (CTAB), and urea.
In the context of the present disclosure, “DNA probe” refers to a oligodeoxyribonucleotide having a length of 10-20 nucleotides, 10-30 nucleotides, 10-40 nucleotides, 10-50 nucleotides, or 10-200 nucleotides. A DNA probe may comprise a sequence complementary to an RNA substrate or complementary to any portion along the length of an RNA substrate. A DNA probe sequence may be selected to bind to an RNA substrate or bind to a specific portion of an RNA substrate. For example, a DNA probe may have a sequence complementary to an RNA sequence at or near (e.g., within 1-5, 1-10, 1-15, or 1-20 nucleotides of) the 5’ end of the RNA, at or near (e.g., within 1-5, 1-10, 1-15, or 1-20 nucleotides of) the 3’ end of the RNA, or positioned between the 5’ and 3’ ends. A DNA probe may comprise a sequence complementary to an RNA sequence comprising and/or adjacent (e.g., within 3-15, 4-14, 5-13, or 6-12 nucleotides of and on the 5’ or 3’ side) to one or more endoribonuclease cut sites. When hybridized to a complementary RNA sequence, a DNA probe may limit or block access to one or more endoribonuclease cut sites (e.g., within the duplex) without limiting or blocking access to one or more other endoribonuclease cut sites (e.g., outside the duplex). A DNA probe sequence may be selected or configured to produce one or more endoribonuclease digestion products having one or more desired properties (e.g., length, 5’ fragment, 3’ fragment). A DNA probe and an RNA substrate may form a duplex flush with the 5’ end of the RNA, offset from the 5’ end by 1-5, 1-10, 1-15, or 1-20 nucleotides, flush with the 3’ end of the RNA, or offset from the 3’ end by 1-5, 1-10, 1-15, or 1-20 nucleotides.
A DNA probe may comprise solely deoxyribonucleosides or may comprise mostly deoxyribonucleosides with one or more ribonucleosides (e.g., a chimeric probe for use with RNase H). A DNA probe may comprise solely phosphate linkages or may include one or more alternate linkages (e.g., phosphorothioate) A DNA probe may comprise solely canonical nucleotides or may comprise one or more modified nucleotides. For example, a DNA probe may comprise one or more affinity tags (e.g., biotin).
In the context of the present disclosure, “fusion” refers to two or more polypeptides, subunits, or proteins covalently joined to one another (e.g., by a peptide bond). For example, a protein fusion may refer to a non-naturally occurring polypeptide comprising a protein of interest covalently joined to a second polypeptide. A second polypeptide may confer upon the fusion one or more desirable properties over the protein of interest alone. For example, a second polypeptide may provide an additional binding property (e g., an affinity and/or purification tag), a selection and/or detection tag (e.g., a reporter protein) Examples of a second polypeptide include a reporter protein, a purification tag (e.g., maltose binding protein, a histidine tag), and expression tag, a polynucleotide binding protein, an enzyme, a conjugation tag (e.g., a SNAP® tag), and a peptide linker. Unless otherwise disclosed, the protein of interest may be nearer to the N-terminal end or nearer to the C-terminal end than the second polypeptide to which it is joined. A fusion may comprise a non-naturally occurring combined polypeptide chain comprising two proteins or two protein domains joined directly to each other by a peptide bond or joined through a peptide linker. An example fusion may include an MBP and an hRNA4.
In the context of the present disclosure, “human endoribonuclease 4” refers to a human protein encoded by the RNASE4 gene, having endoribonuclease activity, and cutting RNA at UR. It is an example of a strand-specific, sequence specific endoribonuclease and an example of an RNase 4. Human endoribonuclease 4 may also be referred to as “hRNase 4”, “Homo sapiens RNase4”, “hRNase IV”, “hRNase4”, or “Hs RNase4”. hRNase 4 is one of the eight members of the human RNase A superfamily of endoribonucleases (Lu et al., Immune Modulation by Human Secreted RNases at the Extracellular Space, Front Immunol. 2018, 9:1012). hRNase 4 retains a high interspecies homology within mammals, and shares conserved structural features with non-mammalian vertebrate RNases. Example hRNase 4 amino acid sequences include SEQ ID NOS: 43-45. hRNase 4 shows strong selectivity for RNA recognition with preference for uridine at the main binding site. hRNase 4 preference for uridine over cytidine in comparison to other family members can be correlated with some
structural features at the binding pocket, such as the presence of an asparagine residue at position 80. Substitution of Asp80 to alanine reduces preference for uridine over cytidine.
In the context of the present disclosure, “immobilized” refers to covalent attachment to a solid support with or without a linker. Examples of solid supports include beads (e.g, magnetic, agarose, polystyrene, polyacrylamide, chitin). Beads may include one or more surface modifications (e.g., O6-benzyleguanine, polyethylene glycol) that facilitate covalent attachment and/or activity of an enzyme of interest. For example, a support may comprise a ligand and an enzyme may have a receptor for such ligand or an enzyme may comprise a ligand and a support may comprise a receptor for such ligand. Receptor-ligand binding may be covalent or non-covalent. Non-covalent attachment (e.g, avidimbiotin, chitimCBP) may be useful in some embodiments, for example, where the level of dissociation of the binding partner is deemed tolerable. A linker may be disposed, for example, between a support and an enzyme or between a support and a DNA probe. For example, a linker disposed between a support and an enzyme may have a first covalent bond to the support and a second covalent bond to the enzyme. An immobilized enzyme comprising a ligand-receptor attachment may have a linker disposed between the support and the ligand-receptor attachment, a linker disposed between the enzyme and the ligand-receptor attachment, or both. An immobilized enzyme comprising a linker may also comprise an optional covalent bond directly between the enzyme and the support. A linker may be of any desired length and have any desired range of motion. A peptide linker may comprise one or more repeats (e.g., 1-10 repeats) of glycine-serine.
In the context of the present disclosure, “non-naturally occurring” refers to a polynucleotide, polypeptide, carbohydrate, lipid, or composition that does not exist in nature. Such a polynucleotide, polypeptide, carbohydrate, lipid, or composition may differ from naturally occurring polynucleotides polypeptides, carbohydrates, lipids, or compositions in one or more respects. For example, a polymer (e.g., a polynucleotide, polypeptide, or carbohydrate) may differ in the kind and arrangement of the component building blocks (e.g., nucleotide sequence, amino acid sequence, or sugar molecules). A polymer may differ from a naturally occurring polymer with respect to the molecule(s) to which it is linked. For example, a “non- naturally occurring” protein may differ from naturally occurring proteins in its secondary, tertiary, or quaternary structure, by having a chemical bond (e.g., a covalent bond including a peptide bond, a phosphate bond, a disulfide bond, an ester bond, and ether bond, and others) to a polypeptide (e.g., a fusion protein), a lipid, a carbohydrate, or any other molecule. Similarly, a “non-naturally occurring” polynucleotide or nucleic acid may contain one or more other
modifications (e.g., an added label or other moiety) to the 5’ - end, the 3’ end, and/or between the 5’- and 3’-ends (e.g., methylation) of the nucleic acid. A “non-naturally occurring” composition may differ from naturally occurring compositions in one or more of the following respects: (a) having components that are not combined in nature, (b) having components in concentrations not found in nature, (c) lacking one or more components otherwise found in naturally occurring compositions (e.g., a cell-free composition, a chromosome-free composition, a histone-free composition, a polymerase-free composition, a cell membrane-free composition, a lyophilized composition), (d) having a form not found in nature, e.g., dried, freeze dried, crystalline, aqueous, and (e) having one or more additional components beyond those found in nature (e.g., buffering agents, a detergent, a dye, a solvent or a preservative).
In the context of the present disclosure, “nucleotide ” refers to a molecule comprising a base, a sugar and one or more phosphate groups. A base (also referred to as a “nitrogenous base” or a “nucleobase”) may be a purine or pyrimidine. A sugar may be a five-carbon ribose (as in ribonucleotides) or a 2-deoxyribose (as in deoxyribonucleotides), which is bound via a glycosidic linkage to the base. Nucleotides may have one, two or three phosphate groups (mono-, di- or triphosphates). Phosphate groups may form a chemical bond at the 5-carbon position of the sugar, although they may also bond at the 2 or 3-carbon positions of the sugar group. Cyclic nucleotides form when a phosphate group is bound to two hydroxyl groups on the sugar. A “nucleoside” comprises a nucleobase and sugar. A nucleotide may also be called a nucleoside mono-, di- or triphosphate.
In the context of the present disclosure, “oligoribonucleotide” refers to a polymer of ribonucleotides that are less than 500 nucleotides long, less than 200 nucleotides long or less than 100 nucleotides long. For example, oligoribonucleotides may be 4-80 nucleotides long, 4-60 nucleotides long, or 4-40 nucleotides long. An oligoribonucleotide may be an RNA substrate.
In the context of the present disclosure, “ribonuclease” or “RNase” refers to a nuclease that catalyzes the cleavage of RNA into smaller components. Ribonucleases include endoribonucleases and exoribonucleases. Ribonucleases may cleave single- stranded RNA, double-stranded RNA, or single-stranded RNA and double-stranded RNA. Examples of ribonucleases may include hRNase 4, RNase Tl, RNase U2, RNase A, Colicin E5, MCI, Cusativin, Csxl, MazF, ChpB, MqsR, and YafO. Different specificities, cleavage frequencies, activities, salt tolerance, temperature sensitivities, and other factors may favor using one or
more endoribonucleases. Thus, according to some embodiments, methods and compositions may exclude one or more of the foregoing example ribonucleases.
An endoribonuclease may have mononucleotide specificity, dinucleotide specificity, trinucleotide specificity, or higher nucleotide specificity. In this context, an endoribonuclease with a single dinucleotide specificity might be expected to cleave RNA substrates (having a random distribution of all 4 bases within their sequences) on average once every 16 nucleotides. An endoribonuclease having specificity for one or more dinucleotide or trinucleotide combinations may cleave an RNA substrate more frequently, for example, on average once every 6 to 12 nucleotides, for example on average once every 8 nucleotides (e.g., calculated with reference to an RNA substrate having a random distribution of all 4 bases within its sequence). Examples of endoribonucleases with specificity for one or more dinucleotide combinations are those whose specificity comprise a main nucleotide anchoring site (referred as Bl site) and a secondary nucleotide binding site (referred as B2 site). In some embodiments, a selected endoribonuclease is capable of cleaving a 3 ',5 ' phosphodiester bond between Bl and B2 sites with selectivity for one of: uridine at the main anchoring site Bl and pyrimidines at the secondary site B2; or cytidine at the main anchoring site Bl and pyrimidines at the secondary site B2; or adenosine at the main anchoring site Bl and pyrimidines at the secondary site B2; or guanosine at the main anchoring site Bl and pyrimidines at the secondary site B2; or uridine at the main anchoring site Bl and purines at the secondary site B2; or cytidine at the main anchoring site Bl and purines at the secondary site B2; or adenosine at the main anchoring site B 1 and purines at the secondary site B2; or guanosine at the main anchoring site Bl and purines at the secondary site B2. Example endoribonucleases include those whose specificity comprise a secondary nucleotide binding at the Bl site and a main nucleotide anchoring binding at the B2 site. Such examples include endoribonucleases that are capable of cleaving a 3 ',5' phosphodiester bond between Bl and B2 sites with selectivity for one of: purines at the secondary site Bl and uridine at the main anchoring site B2; or purines at the secondary site Bl and cytidine at the main anchoring site B2; or purines at the secondary site Bl and adenosine at the main anchoring site B2; or purines at the secondary site Bl and guanosine at the main anchoring site B2; or pyrimidines at the secondary site Bl and uridine at the main anchoring site B2; or pyrimidines at the secondary site Bl and cytidine at the main anchoring site B2; or pyrimidines at the secondary site Bl and adenosine at the main anchoring site B2; or pyrimidines at the secondary site Bl and guanosine at the main anchoring site B2. Representative examples of such
endoribonucleases are Homo sapiens (Hs) RNase4 (preferentially cleaves uridine at Bl position and either adenosine or guanosine at B2 position); Hs RNases 2, 3, 6, and 7 (cleave either uridine or cytidine at Bl position with strong preference for adenosine at B2 position); and Rana pipiens (Rp) RNase, Chelonia mydas (Cm) RNasel, and Gallus gallus (Gg) RNasel (cleave either uridine or cytidine at Bl position with preference for guanosine at B2 position). In some examples, the endoribonuclease may present a mild preference for a given nucleotide at Bl or B2 positions, and this preference may be tuned in such a way (for example, by dilution of enzyme concentration, by buffer change, by pH change, or by temperature change) that the endonuclease may effectively cleave at frequencies that are on average once every 6 to 12 nucleotides. Those examples include endoribonucleases such as Hs RNase5 (cleaves either uridine or cytidine at Bl position with mild preference for adenosine over guanosine at B2 position). An endoribonuclease may have any desired form, for example, a fluid form (e.g., with or without glycerol), a lyophilized form, a dried form, and/or an immobilized form.
In the context of the present disclosure, “ribonuclease inhibitor” or “RNase inhibitor” refers to a material that reduce (e.g., partially or completely) the RNA cleavage activity of a ribonuclease. Examples of endoribonuclease inhibitors include human placental RNase inhibitor, murine RNase inhibitor, ribonucleoside-vanadyl complex, guanidine thiocyanate, IRE1 RNase inhibitor, diethyl pyrocarbonate (DEPC), egtazic acid (EGTA), ethylenediaminetetraacetic acid (EDTA), and any combination thereof. A ribonuclease inhibitor may have any desired form, for example, a fluid form (e.g., with or without glycerol), a lyophilized form, a dried form, and/or an immobilized form. A ribonuclease inhibitor may bind to a ribonuclease (e.g., a susceptible ribonuclease) with high affinity, for example, an affinity similar to the affinity of avidin and biotin.
In the context of the present disclosure, “RNA end repair” refers to a process of converting RNA phosphorylated ends (e.g., cyclic and/or linear phosphorylated ends) into RNA hydroxylated ends (e.g., 5’-OH, 2'-OH and/or 3'-OH ends). RNA end repair, in the context of the present disclosure, excludes ligation of 5’ and 3’ ends to one another.
In the context of the present disclosure, “RNA end repair enzyme” refers to an enzyme that performs RNA end repair and comprises both phosphodiesterase (PDE) and phosphomonoesterase (PME) activities. An RNA end repair enzyme may maintain or manipulate RNA structure in response to RNA breakage events. RNA end repair enzymes are present in diverse taxa in all phylogenetic domains of life and repair RNA breaks inflicted by
sequence-specific or structure-specific endoribonucleases during physiological RNA processing (e.g., tRNA splicing; kinetoplast mRNA editing) and under conditions of cellular stress (e.g., virus infection; unfolded protein response). A repair enzyme may resolve 2', 3'- cyclic-phosphorylated oligoribonucleotide ends, 3 '-phosphorylated oligoribonucleotide ends and/or 2'-phosphorylated oligoribonucleotide ends. A repair enzyme may have any desired form, for example, a fluid form (e.g., with or without glycerol), a lyophilized form, a dried form, and/or an immobilized form.
Examples of RNA end repair enzymes include polynucleotide kinases including polynucleotide kinase-phosphatase (Pnkp) enzymes with 5'-hydroxyl kinase, 3 '-phosphatase and/or 2',3'-cyclic phosphodiesterase activities that function in nucleic acid repair. Similar proteins are found in many species, including Enterobacteria phage (such as phages RB55 and RB59), Desulfovibrio sp, Shigella phage, Escherichia phage, Yersinia phage, Bacillus cereus, Salmonella phage, Citrobacter phage, Serratia phage, Vibrio phage, Aeromonas phage, Acinetobacter phage, Klebsiella phage, Stenotrophomonas phage, and Staphylococcus aureus (AAA family ATPase). The PNKP gene (named pseT) is conserved in many species, including H. sapiens, chimpanzee, Rhesus monkey, dog, cow, mouse, rat, zebrafish, fruit fly, C. elegans, S. pombe, M. oryzae, N. crassa, and frog. Example PNKs in this family of enzymes include bacteriophage T4 polynucleotide kinase (T4 PNK; also referred as to T4 polynucleotide kinase-phosphatase or T4 Pnkp) (Das and Shuman, 2013) and Clostridium thermocellum (Cth) polynucleotide kinase-phosphatase. An example RNA end repair enzyme amino acid sequence is SEQ ID NO:46.
T4 PNK heals 2',3'-cyclic-phosphorylated oligoribonucleotide ends, 3'-phosphorylated oligoribonucleotide ends, and 2'-phosphorylated oligoribonucleotide ends, in each case, resulting in products that comprise a 2 ’,3 ’-hydroxylated (2'-OH, 3'-OH) end. In vivo, T4 PNK heals broken tRNA ends through (i) hydrolysis of a 2’,3’-cyclic phosphate to a 3’-hydroxy end and (ii) phosphorylation of a 5 ’-hydroxy (via its polynucleotide kinase activity) to form a 5’- phosphate end (these tRNA healed ends are eventually sealed by another enzyme, RNA ligase 1, Rnll). Phosphorylation of the 5’-OH (kinase activity) may be NTP-dependent (e.g., ATP- dependent). For example, phosphorylation of the 5’-OH may not occur without an NTP, which produces healed ends comprising 5 ’-OH, 2'-OH and/or 3 '-OH. In vitro, T4 PNK is capable of phosphorylating the 5' end of double- and single-stranded RNA or DNA.
Cth PNK is a multifunctional enzyme that belongs to a family of RNA end-healing enzymes found in diverse bacteria. Cth PNK has three catalytic modules: (i) an N-terminal
polynucleotide 5'-kinase; (ii) a central 2', 3 '-phosphatase; and (iii) a C-terminal ligase (Das and Shuman, 2013). As with T4 PNK, Cth PNK converts an RNA 2'-phosphate, 3'- phosphate, or a 2', 3 '-cyclic phosphate end to an RNA product comprising a 2'-OH, 3'-OH end by means of its phosphodiesterase and phosphomonoesterase activities. Cth PNK may use either Mn(II) or Ni(II) as a metal cofactor.
In the context of the present disclosure, “RNA substrate” refers to any composition including one or more ribonucleotide (RNA) species of one or more lengths from one or more sources. An RNA substrate may be obtained from one or more sources, including viruses, prokaryotic cells, eukaryotic cells, or archaea cells. An RNA substrate may arise from or include any biological material (e.g., solid, fluid, aerosol) including organs, tissues, tissue cultures, biopsies, blood, lymph, mucous, sputum, skin, saliva, lesions, swabs, sweat, semen, urine, feces, and secretions. Biological materials may be fresh or processed (e.g., embedded with a paraffin or other support). An RNA substrate may arise from or include an environmental sample (e.g., air, water, soil, and/or biota or other substrate), food materials, agricultural materials, medical materials, and/or waste products. An RNA substrate may arise from or include RNA from in-vitro transcription (e.g., by the use of RNA polymerases) and/or from chemical synthesis (e.g., by the use of phosphoramidite chemistry or related processes).
An RNA substrate may comprise solely ribonucleosides or may comprise mostly ribonucleosides with one or more deoxyribonucleosides. An RNA substrate may comprise solely phosphate linkages or may include one or more alternate linkages (e g., phosphorothioate). An RNA substrate may comprise solely canonical nucleotides or may comprise one or more modified nucleotides. For example, an RNA substrate may comprise one or more adenosines, cytidines, guanosines, uridines, 1 -methyladenosines, 2- methyladenosines, N5 -methyladenosines, 5-methylcytidines, 5-hydromethylcytidines, wyosines, 1 -methyl guanosines, 7-methylguanosines, pseudouridines, 1-methy -pseudouridines, 5 -methyluridines, and/or 5-hydroxyuridines. An RNA substrate may have any desired length. For example, an RNA substrate may have over 50 nucleotides, over 100 nucleotides, or over 200 nucleotides. An RNA substrate may have 50-500 nucleotides, 100-1000 nucleotides, or 200-2000 nucleotides. An RNA substrate may have 1000-5000 nucleotides, 5000-9000 nucleotides, or 9000-22000 nucleotides. An RNA substrate may be linear, folded, or circular. An RNA substrate may comprise one or more endoribonuclease cut sites.
An RNA substrate may comprise, according to some embodiments, a plurality of RNA species, including one or more of in vitro transcribed RNA, artificially synthesized RNA by
chemical methods, or RNA obtained from native sources. An RNA substrate may include RNA pol I transcripts, RNA pol II transcripts, RNA pol III transcripts, nascent RNA, primase, prokaryotic RNA polymerase, or any combination thereof. In some embodiments, an RNA substrate may comprise a plurality of RNA species including one or more of single-stranded or double-stranded RNAs. An RNA substrate may arise from or include messenger RNA (mRNA), ribosomal RNA (rRNA), transfer RNAs (tRNAs), small RNA (sRNA), microRNA (miRNA), long non-coding RNA (IncRNA), circular RNA (circRNA), or any combination thereof.
An RNA substrate may include mature and/or nascent RNA species. An RNA substrate may comprise RNAs that are capped or uncapped (eukaryotic mRNAs, except for nascent transcripts and mature uncapped RNA, exhibit a 5’-Gppp cap; archaeal and bacterial mRNAs are typically uncapped and exhibit a terminal 5’ triphosphate). The RNA may be naturally or artificially capped (for example with a 5’-m7Gppp cap).
All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference. Reagents referenced in this disclosure may be made using available materials and techniques, obtained from the indicated source, and/or obtained from New England Biolabs, Inc. (Ipswich, MA).
Compositions
The present disclosure provides, in some embodiments, compositions for analyzing and characterizing RNA. Compositions may include, according to some embodiments, one or more RNA substrates, one or more endoribonucleases (naturally-occurring or non- naturally occurring variants; having specificity for one or more dinucleotide or trinucleotide combinations, for example, cleaving an RNA substrate on average once every 6 to 12 nucleotides), one or more RNA end repair enzymes (naturally-occurring or non-naturally occurring variants), or combinations thereof. In some embodiments, compositions may comprise one or more of an RNA substrate, an endoribonuclease, and an RNA end repair enzyme, wherein the RNA end repair enzyme is capable of healing RNA ends. For example, a composition may comprise an RNA substrate and an endoribonuclease, an RNA substrate, an endoribonuclease, and an RNA end repair enzyme, or an endoribonuclease and an RNA end repair enzyme.
Compositions may include, according to some embodiments, one or more buffering agents. Compositions with a buffer may have, for example, a pH of 5-9, 6-8, 6.7-7.4, 6.8- 7.3, 6.8-8.0, 7.0-8.2, 7.0, 7.5, or 8.0. In some embodiments, a composition may include a metal ion, examples of which include magnesium(II), manganese(II), cobalt(II), or nickel(II).
In some embodiments, compositions may include one or more RNA denaturing agents including, for example, 0.5 M - 4 M urea (e.g., 1 M urea). A composition may comprise, for example, less than 0.5 M urea, less than 0.75 M urea, less than 1.0 M urea, less than 2.0 M urea, less than 3.0 M urea, less than 4.0 M urea, less than 5.0 M urea, less than 6.0 M urea, less than 7.0 M urea, less than 8.0 M urea, 8.0 M urea, more than 7.0 M urea, more than 8.0 M urea. A composition may comprise, for example, less than 10% formamide, less than 15% formamide, less than 20% formamide, less than 25% formamide, less than 30% formamide, less than 35% formamide, less than 40% formamide, less than 45% formamide, less than 50% formamide, less than 55% formamide, more than 50% formamide, or more than 55% formamide. For example, a composition may include an endoribonuclease (e g., an endoribonuclease having specificity for one or more dinucleotide or trinucleotide combinations, for example, cleaving an RNA substrate on average once every 6 to 12 nucleotides) and one or more RNA denaturing agents.
In some embodiments, a composition may comprise an RNA substrate in any amount and/or at any concentration. For example, a composition may comprise less than 1 ng, less than 1 pg, less than 2 pg, less than 3 pg, less than 4 pg, less than 5 pg, less than 6 pg, less than
7 pg, less than 8 pg, less than 9 pg, less than 10 pg, less than 11 pg, less than 12 pg, less than 13 pg, less than 14 pg, less than 15 pg, less than 16 pg, less than 17 pg, less than 18 pg, less than 19 pg, less than 20 pg, 20 pg, more than 19 pg, or more than 20 pg. A fluid composition may comprise, for example, less than 1 ng/pL, less than 1 pg/pL, less than 2 pg/pL, less than 3 pg/pL, less than 4 pg/pL, less than 5 pg/pL, less than 6 pg/pL, less than 7 pg/pL, less than
8 pg/pL, less than 9 pg/pL, less than 10 pg/pL, less than 11 pg/pL, less than 12 pg/pL, less than 13 pg/pL, less than 14 pg/pL, less than 15 pg/pL, less than 16 pg/pL, less than 17 pg/pL, less than 18 pg/pL, less than 19 pg/pL, less than 20 pg/pL, 20 pg/pL, more than 19 pg/pL, or more than 20 pg/pL.
An RNA substrate, in some embodiments, may comprise a subject RNA and one or more additional materials (e.g., impurities and/or supports). For example, an RNA substrate comprising a synthetic RNA may also comprise impurities resulting from the process of in vitro synthesizing the RNA, either via an enzymatic process or a chemical process or a
combination of both processes. An RNA substrate comprising a native RNA may also comprise impurities from or associated with the isolation or enrichment method including, for example, partially degraded or fragmented RNA species, undesired RNA species (e.g., contaminant ribosomal RNA in a mRNA preparation), DNA, and/or proteins. An RNA substrate may comprise RNA and a solid support (e.g., magnetic or non-magnetic polymeric beads), for example, where the RNA is attached to the solid support through its 5’ end, through its 3’ end or through an internal nucleotide, in each case, with or without an optional linker (e.g., a linear or branched linker). An optional linker may serve as steric spacer and does not necessarily have to be of defined length. Examples of suitable linkers may be selected from any of the hetero-bifunctional cross-linking molecules described by Hermanson, Bioconjugate Techniques, 2nd Ed; Academic Press: London, Bioconjugate Reagents, pp 276-335 (2008), incorporated by reference. An optional linker may be a flexible linker connecting the solid support to one or a plurality of same or different RNAs.
An endoribonuclease may be expressed in E. coll, such as the periplasm of E. coll, or Pichia pastoris and purified utilizing an affinity tag. In some embodiments, an endoribonuclease may have a discrete substrate specificity. For example, an endoribonuclease may have the capacity to cleave an RNA 3', 5 ' phosphodiester bond with specific activity towards a nucleotide or a combination of one or more nucleotide sequences comprising 2-7 nucleotides each; or towards a structural element such as a stem, an internal loop, a multibranch loop, or a pseudoknot. In some respects, the substrate specificity of an endoribonuclease may include recognition and cleavage of one or more modified nucleotides (e.g., pseudouridine, 1 -methylpseudouridine, 5-methoxyuridine, 5-methylcytidine, 6- methyladenosine, and inosine). RNA end repair enzymes, in some embodiments, may have both phosphodiesterase and phosphomonoesterase activities.
In some embodiments, a composition comprising an RNA end repair enzyme and an endoribonuclease, optionally in a denaturing buffering solution, may be used to prepare oligoribonucleotide mixtures from an RNA substrate. In some embodiments, a composition of T4 PNK and an endoribonuclease, optionally in a denaturing buffering solution, is used to prepare oligoribonucleotide mixtures from an RNA substrate. Optionally, pre-heating the RNA substrate and/or including an RNA denaturing agent in the reaction mixture may reduce the impact of RNA structure (e.g., Watson-Crick base pairing and/or other intra- and/or intermolecular hydrogen bonding) on the production of endoribonuclease digestion products.
Compositions, according to some embodiments, may include one or more endoribonucleases that are capable of cleaving 3 ',5' phosphodiester bonds with specific activity towards a nucleotide or a sequence of one or more nucleotides, towards one or more nucleotide modifications, or towards a structural element such as a stem, an internal loop, a multibranch loop, a pseudoknot, a duplex segment, a triplex segment, or a quadruplex segment. Examples include endoribonucleases of the RNase A superfamily that cleave a 3 ',5' phosphodiester bond with specificity for pyrimidines at the main anchoring site (often called Bl site) and preference for purines at the secondary site (often called B2 site). Illustrative examples are hRNase5 (cuts both uridine and cytidine at B 1 position and shows only a mild preference for adenosine over guanosine at B2 position), hRNase 4 (shows a significant preference for cutting uridine at Bl position and a minor preference for adenosine over guanosine at B2 position), and hRNases 2, 3, 6, and 7 (cut both uridine and cytidine at B 1 position and do not have any detectable activity for guanosine at B2 position). Other examples include mutant endoribonucleases, such as porcine RNase4 D80A, wherein the substitution of Asp80 by alanine decreased the preference for cutting uridine at Bl position and increased the preference for cutting cytidine at B 1 position. Further examples of endoribonucleases with specificity for pyrimidines at the main anchoring site Bl are some enzymes of the RNase T2 family, such as RNase MCI, which has been isolated from seeds of Momordica charantia (specificity for uridine at Bl position), and RNase Cusativin, which has been isolated from Cucumis sativus (specificity for uridine at Bl position).
Examples of endoribonucleases include endoribonucleases that are capable of cleaving a 3 ',5' phosphodiester bond with specificity for purines at the main anchoring site Bl. Examples are endoribonucleases of the RNase T1 superfamily (specificity for guanosine at Bl position), RNase U2 (purine-specific at Bl position), and Csxl (specificity for adenosine at Bl position).
Examples of endoribonucleases also include endoribonucleases that are part of toxinantitoxin systems in bacteria or archaea. Endoribonucleases that are part of toxin-antitoxin systems may have a wider recognition cleavage site. Examples of endoribonucleases that are part of toxin-antitoxin systems include E. coli MazF (preferentially cuts before ACA trinucleotide motif), ChpB (preferentially cuts after uridine in UAC trinucleotide motiftrinucleotide), MqsR (preferentially cuts after guanosine in GC dinucleotide motif), and YafO (preferentially cuts after uridine). In some embodiments, endoribonucleases include thermostable endoribonucleases, for example, endoribonucleases that are active at
temperatures above 50°C, above 55°C, above 60°C, above 65°C, above 70°C, above 80°C, or above 90°C. Endoribonucleases that are capable of cleaving a subject RNA at such high temperatures may support cleavage of an RNA in absence of a denaturing reagent and/or eliminate the prior step of heating the RNA sample in a low salt solution (e g , up to 50 mM salt) to reduce RNA structure biases during the digestion reaction. A low salt solution may comprise, for example, sodium chloride, magnesium sulfate, potassium nitrate, and /or sodium bicarbonate.
In some embodiments an endoribonuclease utilized for digestion of an mRNA may originate from a vertebrate species (for example, Homo sapiens, Sus scr fa), a bacterial species (for example, Escherichia coli), a fungus species (for example, Aspergillus oryzae), a plant species (for example, Momordica charantia, Cucumis sativus). and an archaea species (for example, Pyrococcus furiosus). In some embodiments a recombinant endoribonuclease is expressed in the periplasm of E. coli or Pichia pastoris and purified utilizing an affinity tag.
An enzyme comprising phosphodiesterase and phosphomonoesterase activities may be included in a composition with oligonucleotides having one or more ends that are 2', 3'- cyclic-phosphorylated, 3'-phosphorylated and/or 2'-phosphorylated. Contacting such an enzyme with an oligoribonucleotide may dephosphorylate one or more ends.
In some embodiments, a composition may comprise 0.1 to 10 pL (e.g., 1 to 3 pL) of an endoribonuclease, wherein 1 pL of a given endoribonuclease is capable of cleaving RNA with catalytic activity comparable to that of 1 pL of commercially available RNase T1 (1000 U/pL; ThermoFisher Scientific #EN0541). In some embodiments, a composition may comprise 50 to 500 U/pL (e.g., 100 to 200 U/pL) of an RNA end repair enzyme. In some embodiments, the ratio of RNA substrate to endoribonuclease may be from 0.1 to 10 pg of RNA substrate per 1 pL of the endoribonuclease, preferably 1 to 10 pg of RNA substrate per 1 pL of the endoribonuclease. It may be desirable to decrease the ratio of RNA substrate to endoribonuclease where the RNA substrate comprises modified nucleotides. For example, the ratio of RNA substrate to endoribonuclease may be decreased as a function of the proportion of modified nucleotides present. For example, a ratio of RNA substrate to hRNase 4 of 10 pg (of substrate)/l pL (of enzyme), of 5 pg/1 pL, of 2 pg/ 1 pL, 1 pg/1 pL, or of 0.1 pg/1 pL may be used to digest a fully-modified (e.g., all uridines replaced with 1- methylpseudouridine) mRNA comprising about 800 nucleotides (e.g., EPO mRNA). In some embodiments, the ratio of RNA end repair enzymes to endoribonuclease may be from 0.1:1 to 0.2:1 to 0.5: 1 to 1:1 to 1 :2 to 1:5 to 1 :10. . It may be desirable to increase the ratio of RNA
end repair enzymes to endoribonuclease where the RNA substrate comprises longer RNA substrates (for example, greater than 1000 nucleotides, greater than 2000 nucleotides, greater than 3000 nucleotides, greater than 5000 nucleotides). For example, the ratio of RNA end repair enzymes to endoribonuclease may be increased as a function of the length of the RNA substrate present. For example, a ratio of T4 PNK to hRNase 4 of 40 U/l pL, of 80 U/l pL, of 160 U/l pL, 320 U/l pL, or of 500 U/l pL may be used to digest a mRNA comprising about 800 nucleotides (e g , EPO mRNA).
According to some embodiments, an enzyme (e g., an endoribonuclease and/or an end repair enzyme), a ribonuclease inhibitor, and/or a DNA probe may be immobilized. For example, an enzyme may be immobilized to a solid support, including covalent bonding to the support surface and non-covalent interaction (binding by adsorption, e. g. cationic, anionic, lipophilic, or hydrophilic surfaces) of the enzyme with the surface. Covalent immobilization may include reaction of an active functional group on the enzyme with an activated functional group on the solid support. Examples of reactive functional groups include amines, hydroxyl amines, hydrazines, hydrazides, thiols, phosphines, isothiocyanates, isocyanates, N-hydroxysuccinimide (NHS) esters, carbodiimides, thioesters, haloacetyl derivatives, sulfonyl chlorides, nitro- and dinitrophenyl esters, tosylates, mesylates, tritiates, maleimides, disulfides, carboxyl groups, hydroxyl groups, carbonyldiimidazoles, epoxides, aldehydes, acyl-aldehydes, ketones, azides, alkynes, alkenes, nitrones, tetrazines, isonitriles, tetrazoles, and boronates. Examples of such reactions include the reaction between an amine and an activated carboxy group forming an amide, between a thiol and a maleimide forming a thioether bond, between an azide and an alkyne derivative undergoing a 1,3 -dipolar cycloaddition reaction, between an amine and an epoxy group, between an amine and another amine functional group reacting with an added bifunctional linker reagent of the type of activated bis-dicarboxylic acid derivative giving rise to two amide bonds, or other combinations known in the art. Other reactions, such as UV-mediated cross-linking or chemi cal -mediated crosslinking (e.g., using formaldehyde or glutaraldehyde) can be used for covalent attachment of enzymes to solid supports. Disclosed methods may be used/adapted to prepare an immobilized ribonuclease inhibitor and/or an immobilized DNA probe.
A functional group may be inherently present in the material used for the solid support synthesis or a functional group may be provided by treating or coating the support with a suitable material. A functional group may also be introduced by contacting the solid support surface with an appropriate chemical agent. Activation in this context includes a
modification of a functional group on the solid support surface to enable coupling of a binding agent to the surface. Solid support in this context includes any solid (flexible or rigid) material onto which it is desired to capture and immobilize the enzyme. Solid support may be biological, non-biological, organic, inorganic or a combination thereof, and may be in the form of particles, strands, precipitates, gels, sheets, tubings, spheres, containers, capillaries, cartridges, pads, slices, films, plates, slides, and have any convenient shape, including flat, disc, sphere, circle, etc. The surface of the solid support may be composed of a variety of materials, for example, polymers, plastics, resins, polysaccharides, silica or silica-based materials, carbon, metals, inorganic glasses, membranes, among others, provided that the surface may support functional groups. Examples of a convenient solid support include glass surfaces such as glass slides, microtiter plates, and suitable sensor elements, for example, functionalized polymers (e.g. in the form of beads), chemically modified oxidic surfaces, (e.g. silicon dioxide, tantalum pentoxide or titanium dioxide), or also chemically modified metal surfaces, e.g. noble metal surfaces such as gold or silver, copper or aluminium surfaces, magnetic surfaces, e.g. Fe, Mn, Ni, Co, and their oxides, quantum dots, e.g., III-V (GaN, GaP, GaAs, InP, or InAs) or II- VI (ZnO, ZnS, CdS, CdSe, or CdTe) semiconductors, or Ln- doped fluoride nanocrystals, rare earth-doped oxidic nanomaterials.
A solid support surface may be provided with a layer of a polymer, for example, a polymer comprising functional groups to be activated. A polymer may be selected from any suitable class of compounds, for example, polyethylene glycols, polyethylene imides, polysaccharides, polypeptides, or polynucleotides, just to name a few. Attachment of the polymers to the support surface may be achieved by a variety of methods which are readily apparent to a person skilled in the art. For example, polymers bearing trichlorosilyl or trisalkoxy groups may be reacted with hydroxyl groups on the substrate surface to form siloxane bonds. Attachment to a gold or silver surface may take place via thiol groups on the polymer. Alternatively, the polymer may be attached via an intermediate species, such as a self-assembled monolayer of alkanethiols. The type of polymers selected, and the method selected for attaching the polymers to the surface, will thus depend on the polymer having suitable reactivity for being attached to the substrate surface, and on the properties of the polymers regarding non-specific adsorption to, especially, DNA and RNA. The functional groups may be present on the polymer or may be added to the polymer by the addition of single or multiple functional groups. Optionally, a spacer arm can be used to provide flexibility to the binding enzyme allowing it to interact with its environment in a way which
minimizes steric hindrance with the solid support. In some instances, the solid support surface may comprise additional coating molecules, for example, polyethylene glycols, polyethylene imides, polysaccharides, polypeptides, or polynucleotides, that do not carry a reactive functional group. Additional coating molecules that do not carry a reactive functional group may increase the specific activity and/or stability of the immobilized enzyme, for example, by providing a local hydrophilic environment that favors the enzyme folding.
To immobilize an endoribonuclease and/or a RNA repair enzyme on a solid support, activated functional groups on a solid support may be present on the predefined regions only, or alternatively on the entire surface, are reacted selectively with the functional groups present in the enzyme molecules. Suitable reaction conditions, including time, temperature, pH, solvent(s), and additives, will depend on inter alia the particular species and may be selected in accordance with conditions for similar reactions. Functional group may be inherent to the enzyme amino acid sequence. Enzymes may be synthesized to incorporate a desired functional group either through a chemical reaction or through genetic engineering. Amino acids can be modified either chemically or enzymatically with any type of functional group in order to provide the desired reactivity.
Endoribonucleases and/or RNA repair enzymes may be included in a fusion protein and immobilized on a solid support by means of such fusion protein. For example, a fusion protein construct of an endoribonuclease and/or a RNA repair enzyme may generated with, for example, a maltose-binding protein (MBP), a chitin or chitin-binding domain (CBD), a poly-histidine 6xHis or poly-His-tag), a HA-tag, a c-myc tag, a FLAG-tag, a SNAP -tag (U.S. Patent Nos. 7,939,284; 8,367,361; 7,799,524; 7,888,090; and 8,163,479), a CLIP-tag (U.S. Patent No. 8,227,602), a Halotag (Los, et al. Methods Mol Biol. 2007, 356:195-208), an ACP-tag (U.S. Patent No. 7,666,612), a S-tag, a glutathione-S-transferase (GST), and others known to those skilled in the art. A solid support surface may be coated with an affinity group that is capable of specifically binding to the corresponding protein fusion partner, for example, a maltose moiety for MBP fusions, a benzylguanine (BG) moiety for SNAP -tag fusions, a benzylcytosine (BC) moiety for CLIP-tag fusions, a chloroalkane moiety for Halotag fusions, a peptide sequence (Lys-Glu-Thr-Ala-Ala-Ala-Lys-Phe-Glu-Arg-Gln-His- Met-Asp-Ser) for S-tag fusions, a nickel-nitrilotriacetic acid (Ni-NTA) chelate for His-tag fusions, and so on. In some embodiments, immobilization is achieved using an affinity binding pair, such as in streptavidin-functionalized on the beads and biotinylated enzymes. In
some other cases, the protein fusion of the endoribonuclease and/or the RNA repair enzyme (e.g., with MBP) may be used to enhance their solubility and facilitate their proper folding.
Endoribonucleases and/or RNA repair enzymes may be immobilized on a solid support by means of physical adsorption, for example, where binding is mainly by hydrogen bonds, multiple salt linkages, and/or Van der Waal's forces. In some embodiments, magnetic or paramagnetic solid supports (e.g., silica beads) are coated with negatively charged molecules (e.g., carboxyl -containing molecules) or positively charged (e g., amino-containing molecules) which reversibly bind the enzymes of interest. In some instances, a crowding agent (e.g., polyethylene glycol, such as 10-50% PEG) and/or salt (e.g., NaCl, such as 0.1-4 M NaCl) may be used. In some embodiments, immobilization may be based on the entrapment of the enzyme within the lattice of a polymer matrix (e.g., synthetic polymers such as polyarylamide and polyvinylalcohol) or of a membrane (e.g., polymeric microcapsule).
One or a plurality of endoribonucleases and/or RNA repair enzymes may be immobilized on the same or different solid supports. They may be immobilized randomly on a given solid surface; or they may be immobilized at a specific arrangement, for example, on specific compartments of a given solid surface, so that the enzymes are arranged in series or in parallel to each other, or a combination of both arrangements. Such arrangement may serve different purposes, such as the sequential treatment of an RNA sample with an endoribonuclease followed by an RNA repair enzyme. Or a parallel treatment of an RNA sample with two or more endoribonucleases, wherein the sample is spatially confined to separate compartments (in the same of different reaction vessels) so that there is no crossreaction of the sample with different enzymes. One or a plurality of endoribonucleases and/or the RNA repair enzymes may be immobilized on cartridges and these cartridges may be integrated in LC-MS/MS systems, wherein the cartridges may be individually selected by column selectors according to the requirements of a given experiment and allowing subsequential incubations with any of these enzymes.
Use of immobilized endoribonucleases and/or RNA repair enzymes may enable automation of one or more RNA sample processing steps (e.g., digestion) prior to downstream analysis (e.g., LC-MS/MS). Use of immobilized endoribonucleases and/or RNA repair enzymes may reduce the amount of sample and/or time required for processing the RNA prior to downstream analysis. Use of immobilized endoribonucleases and/or RNA repair enzymes may enable miniaturization and/or high-throughput analysis of RNA samples.
Use of immobilized enzymes may provide the ability to multiplex reactions, streamline reaction processes and workflows, reduce level of degradation byproducts (e.g., unwanted RNA hydrolysis, oxidation, deamination, etc.), reduce manual steps and the risk of manual (human) errors, and importantly, in some cases increase hydrolytic and/or thermal stability of the enzymes (relative to their non-immobilized forms). The endoribonuclease(s) and/or the RNA repair enzyme(s) may be irreversibly adsorbed or covalently linked to the solid surface using any one of the methods described in this invention. The endoribonuclease(s) and/or the RNA repair enzyme(s) may be stably and efficiently immobilized on a microchip or any column reactor or fluid channel network, in such a way that buffers and reagents are flowed through (e.g., manually or using a peristaltic pump) the reaction vessel.
Methods
The present disclosure provides, in some embodiments, methods for analyzing and characterizing RNA. Methods may include, for example, preparing oligoribonucleotides from RNA substrates (e.g., total RNA, genomic RNA, messenger RNA, transfer RNA, ribosomal RNA, coding RNA, non-coding RNA, micro RNA, small interfering RNA, nuclear RNA, nucleolar RNA). Methods may include, in some embodiments, contacting an RNA substrate with an RNA denaturing agent to form a denatured RNA substrate. For example, a method may include heating an RNA sample (e.g., at 90°C for 10 min) in a low salt solution (e.g., containing 0-50 mMNaCl) or in a denaturing solution (e.g., containing 3 M urea) to form the denatured RNA substrate. In some embodiments, a denaturing agent, if used, may be separated from the denatured RNA substrate (e.g., by dialysis, affinity or size-exclusion chromatography or other methods). Compositions including RNA substrates and RNA denaturing agents, according to some embodiments, may be diluted (e g., more than 10-fold, more than 100-fold, more than 500-fold, more than 1000-fold). For example, compositions including RNA substrates and RNA denaturing agents may be diluted to reduce the impact of included RNA denaturing agent(s) on enzymes in one or more subsequent steps. Dilution may reduce the RNA denaturing agent to a concentration that permits an enzyme in a subsequent (e.g., an endoribonuclease and/or an RNA end repair enzyme) step to have at least 1% of its activity in the absence of such RNA denaturing agent(s) (e.g., under otherwise the same conditions of temperature, pH, enzyme concentration, substrate concentration, kind and concentration of buffer, and/or other components).
Digestion of RNA with some endoribonucleases may produce a mixture of cleavage products comprising 2',3'-cyclic-phosphate (sometimes also referred as to 2’,3’-
phosphodiester) and 3 '-phosphate (sometimes also referred as to 3 '-linear phosphate or 3’- phosphomonoester) termini, whereas some other endoribonucleases may produce a mixture of cleavage products comprising 2',3'-cyclic-phosphate and 2'-phosphate (sometimes also referred as to 2'-linear phosphate or 2’ -phosphomonoester) termini. The extent of formation of 2',3'-cyclic-phosphate and 2’- or 3'-linear phosphate may depend on the enzyme concentration, the digestion buffer and/or incubation time. A mixture of cleavage products may also comprise 2',3'-hydroxylated species Enzyme-independent hydrolytic opening of 2',3'-cyclic-phosphate may generate a mixture comprising 2',3'-cyclic-phosphate, 3'- phosphate, 2'-phosphate, and/or 2',3'-hydroxy termini in any combination. Enzymeindependent hydrolytic cleavage of RNA may further produce a mixture of 5’-phosphate and 5 ’-hydroxy termini. The potential presence of any of these products, in any combination, can convolute analysis by mass spectrometry techniques.
Methods may include contacting an RNA substrate (or a denatured RNA substrate) with a composition comprising an endoribonuclease and/or an optional RNA end repair enzyme under conditions (e.g., temperature, pH, enzyme and substrate concentrations, and buffers or other components) permitting the RNA substrate (or the denatured RNA substrate) to be cleaved and oligoribonucleotides to be formed. In some embodiments, the optional RNA end repair enzyme may be omitted. In such embodiments, it may be desirable to use an endoribonuclease with specificity for cleavage of RNA substrates that results in cleavage on average once every 6 to 12 nucleotides, for example on average once every 8 nucleotides.
Methods, according to some embodiments, may further comprise analyzing oligoribonucleotides (e.g., oligoribonucleotides formed by digestion of an RNA substrate with an endoribonuclease) by LC-MS/MS. For example, oligoribonucleotides may be analyzed by capillary electrophoresis-mass spectrometry (CE-MS). In some embodiments, oligoribonucleotides may be analyzed by gel electrophoresis. In some embodiments, LC- MS/MS and/or CE-MS are used to determine the masses and/or fragmentation profiles of species in compositions of oligoribonucleotides (e g., oligoribonucleotides formed by digestion of an RNA substrate with an endoribonuclease).
Methods, according to some embodiments, may include contacting an RNA substrate with an RNA substrate binding molecule to form a complex, the complex comprising a binding molecule-RNA substrate interface and single-stranded RNA substrate portion. Examples of an RNA substrate binding molecule may include a DNA probe (e.g., at least partially complementary to the RNA substrate), an RNA probe (e.g., at least partially
complementary to the RNA substrate), a synthetic nucleic acid probe (e.g., a locked nucleic acid that is at least partially complementary to the RNA substrate), an RNA binding protein, an antibody, an RNA ligand (e.g., adenosylcobalamin, lysine, glycine, flavin mononucleotide, fluorescent dyes, and drugs including, for example, branaplam and risdiplam), divalent ions (e.g., salts of magnesium, calcium, zinc, manganese, etc.), ribosomes, and lipid-based membranes. A binding molecule-RNA substrate interface may comprise one or more endoribonuclease cut sites for which access by the corresponding endoribonuclease is limited. A single-stranded RNA substrate portion may comprise one or more endoribonuclease cut sites that are accessible to the corresponding endoribonuclease. A method may comprise, in some embodiments, contacting a complex with an endoribonuclease to form cleavage products. Cleavage products may include (two or more) fragments of the single- stranded RNA substrate portion and a cleaved binding molecule-RNA substrate interface, the RNA component of which remains uncut by the endoribonuclease and wherein the site(s) of cleavage of the cleaved binding molecule-RNA substrate interface are adjacent to the interface (e.g., not within the interface). For example, methods may include hybridizing an RNA substrate to at least one DNA probe to form an RNA/DNA duplex comprising a doublestranded portion and at least one single- stranded portion. A double- stranded portion of a duplex may comprise one or more endoribonuclease cut sites. A single-stranded portion of a duplex may comprise one or more endoribonuclease cut sites. A method may include contacting a duplex and an endoribonuclease to form cleavage products, the cleavage products comprising two or more fragments of the single-stranded portion and a cleaved double-stranded portion, the RNA component of which remains uncut by the endoribonuclease.
In some embodiments, a method may include assessing the integrity, identity, presence and/or purity of a target RNA in a sample (e.g., through “fingerprinting”, “signature profiling”, and/or “ID testing”) and/or confirming the identity of an RNA produced by synthesis or isolated from native sources. According to some embodiments, ID testing may be performed by HPLC retention time analysis, intact mass analysis, failure sequence analysis, MS/MS sequencing, MS-fragmentation pattern analysis, NMR, melting temperature analysis, or any combination thereof. Methods may include, according to some embodiments, de novo sequencing a subject RNA (e.g., RNA in an RNA substrate) using mass spectrometry including sequencing oligoribonucleotides and assembling resulting sequences to form an assembled sequence corresponding to the subject RNA. In some
embodiments, these oligoribonucleotide mixtures are used for determining the identity and location of a modified nucleotide in an RNA substrate (“modification mapping”).
In some embodiments, oligoribonucleotides from RNA substrates may be used for characterizing impurities in an RNA sample. Impurities may include, for example, truncated RNA species, protracted RNA species, for example, obtained from read-through synthesis, degraded RNA species, RNA species containing nucleotide misincorporations, deletions, or additions, RNA species containing impurities derived from phosphoramidite-based synthesis, such as RNA containing residual protective groups (e.g., DMT, CEP, TBDMS, Bz, iBu, and others), RNA containing depurinated bases, and RNA containing by-products of RNA synthesis and deprotection, such as cyanoethyl adducts; carried-over reagents (e g., plasmids), exogenous nucleic acid contaminants, and any combination of the foregoing.
Methods, according to some embodiments, may further comprise analyzing activity and/or specificity of an enzyme (e.g., a ligase, a polymerase, a transferase, a methyltransferase, a carbamoyltransferase, a glycosyltransferase, an acyltransferase, an aminotransferase, a peptidyltransferase, a pseudouridine synthase, a transglycosylase, a transaminase, a glycosidase, a capping enzyme, a decapping enzyme, a kinase, a phosphatase, a nuclease (endo or exo), a lyase, an oxidoreductase, and/or a deaminase) by analyzing RNA products of such enzyme.
In some embodiments, methods include digestion of an RNA substrate using a composition of at least one endoribonuclease and at least one an RNA end repair enzyme, wherein the RNA end repair enzyme comprises both phosphodiesterase (PDE) and phosphomonoesterase (PME) activities, in a buffering solution optionally containing an RNA denaturing agent. An example of an RNA end repair enzyme comprising both phosphodiesterase and phosphomonoesterase activities is the bacteriophage T4 polynucleotide kinase (T4 PNK; also referred as to T4 polynucleotide kinase-phosphatase or T4 Pnkp) (Das and Shuman, 2013). T4 PNK heals each of 2',3'-cyclic-phosphorylated, 3'- phosphorylated and 2'-phosphorylated oligoribonucleotide ends resulting in products that comprise a 2’,3’-hydroxylated (2'-OH, 3'-OH) end. Hence, co-incubation of T4 PNK with an endoribonuclease resolves 2',3'-cyclic-phosphorylated, 3'-phosphorylated and/or 2'- phosphorylated oligoribonucleotides that may be produced upon endoribonuclease cleavage. By converting 2',3'-cyclic-phosphorylated, 3 '-phosphorylated and/or 2'-phosphorylated oligoribonucleotide ends into 2’, 3 ’-hydroxylated ends, T4 PNK reduces spectral complexity and enhances the mass signal of endoribonuclease digestion products.
Methods may comprise contacting a polyribonucleotide substrate with an endoribonuclease to form a cleaved polyribonucleotide product and contacting the cleaved product with an RNA end repair enzyme to form a polyribonucleotide cleavage product with healed ends (e.g., 5’ ends comprising a 5’-OH and/or 3’ ends comprising a 3’-OH and/or 2’- OH). Contacting, in some embodiments, may be performed in sequential steps (e.g., contact with endoribonuclease followed by repair enzyme, often with an intervening cleanup step) or concurrently as a coupled reaction (e g., in a single compartment, tube, container, vessel or other space). In this context, reactions may be concurrent if they overlap in time with one another, even if their start times and/or completions times are not synchronized. For example, a method may comprise simultaneously adding an endoribonuclease and an RNA end repair enzyme to a composition comprising an RNA substrate (and, optionally, a buffering agent and/or an RNA denaturing agent). Even if the starting composition is free of 2',3'-cyclic-phosphorylated, 3 '-phosphorylated and/or 2'-phosphorylated ends until the endoribonuclease begins cleaving the RNA substrate, the cleavage and repair processes would be concurrent as long as repair begins before RNA substrate is exhausted by the cleavage reaction. An RNA product with healed ends may be subjected to characterization by tandem liquid chromatography-mass spectrometry (LC-MS) or by tandem capillary electrophoresis-mass spectrometry (CE-MS).
An enzyme comprising a phosphodiesterase and phosphomonoesterase may be contacted with (e.g., added to) a composition comprising oligoribonucleotides having one or more 2',3'-cyclic-phosphorylated, 3 '-phosphorylated and/or 2'-phosphorylated ends to produce one or more dephosphorylated ends. Oligoribonucleotides having one or more 2', 3'- cyclic-phosphorylated, 3'-phosphorylated and/or 2'-phosphorylated ends may be provided as such or, in some embodiments, an RNA substrate may be contacted with an endoribonuclease in the same composition or space to form the oligonucleotides.
In some embodiments, a method may comprise contacting an RNA substrate with an endoribonuclease to form oligoribonucleotides having one or more 2',3'-cyclic- phosphorylated, 3'-phosphorylated and/or 2' -phosphorylated ends and, following exhaustion of RNA substrate, contacting the oligoribonucleotides with an enzyme comprising a phosphodiesterase and a phosphomonoesterase to form one or more dephosphorylated ends. In some embodiments, a method may further comprise purifying the oligoribonucleotides prior to contact with the enzyme comprising a phosphodiesterase and a phosphomonoesterase.
A method may include, according to some embodiments, incubating (e.g., heating) an RNA substrate (e.g., prior to or upon contacting with an endoribonuclease) to form a denatured or melted RNA substrate. Incubating an RNA substrate may comprise maintaining the RNA substrate at a temperature of 65°C or higher, for example, 65°C - 75°C, 70°C - 80°C, 75°C - 85°C, 80°C - 90°C, 85°C - 95°C, 90°C - 100°C, more than 95°C, or more than 100°C. Heating may comprise maintaining the RNA substrate at a selected temperature for up to a minute, 1-10 minutes, at least a minute, at least 5 minutes, at least 6 minutes, at least 7 minutes, at least 8 minutes, at least 9 minutes, at least 10 minutes, or up to 20 minutes.
In some embodiments, a method may comprise contacting an RNA substrate (e.g., a melted RNA substrate) with an endoribonuclease at a temperature of less than 30°C, 25°C - 35°C, 30°C - 40°C, 35°C - 45°C, 37°C, 40°C - 50°C, 45°C - 55°C, 50°C - 60°C, more than 50°C, or more than 55°C. A method may include, according to some embodiments, contacting an RNA substrate (e g., a melted RNA substrate) with an endoribonuclease for 30- 120 minutes, up to 30 minutes, at least 30 minutes, up to 45 minutes, up to 60 minutes, up to 75 minutes, up to 90 minutes, up to 105 minutes, up to 120 minutes, or at least 105 minutes.
In some embodiments, a method may comprise fingerprinting 2',3'-hydroxylated oligonucleotides (e.g., arising from a subject RNA following contact with an endoribonuclease and an RNA end repair enzyme) by LC-MS analysis. Fingerprinting by LC-MS analysis may comprise, for example, deconvoluting the charge state distribution of raw mass spectra and comparing the observed masses to masses from a theoretical digestion of a subject RNA. The resulting mass “fingerprint” may be utilized to assess the identity of a subject RNA. In some embodiments, fingerprinting by LC-MS comprises comparing deconvoluted mass spectrum to a database of RNA transcripts using a computer and assessing the “identity” of the characterized transcript by a mathematical metric.
In some embodiments, a method may comprise sequencing 2',3'-hydroxylated oligonucleotides (e.g., arising from a subject RNA following contact with an endoribonuclease and an RNA end repair enzyme) by LC-MS/MS analysis. For example, a sequencing method may comprise acquiring mass spectra (e.g., MS and/or MS/MS spectra) from an oligoribonucleotide comprising healed ends and comparing the acquired mass spectra with theoretical mass spectra from a theoretical digestion of a subject RNA with an endoribonuclease of selected specificity. Sequencing an RNA by LC-MS/MS is utilized to verify the sequence of a subject RNA and identify the position of mass altering RNA modifications.
Methods, according to some embodiments, may comprise preparing oligoribonucleotides from RNA substrates using one or more endoribonucleases in absence of an RNA end repair enzyme. An endoribonuclease (e.g., an endoribonuclease for methods including analysis of RNA by LC-MS/MS) may cleave RNA substrates on average once every 4 to 64 nucleotides, once every 6-12 nucleotides, or once every 8 nucleotides. Endoribonucleases with average cleavage frequencies of every 6-12 nucleotides may provide better mapping coverage relative to endoribonucleases with average cleavage frequencies of every 4 nucleotides or less and/or relative to relative to endoribonucleases with average cleavage frequencies of once every 16 nucleotides or more. Example 3 and FIGURE 5 and FIGURE 6 show the theoretical mapping coverage of 1000 randomly selected human transcripts comparing oligonucleotides generated by endoribonucleases with cleavage frequencies of 1 out 2 nucleotides (RNase A), 1 out 4 nucleotides (RNase Tl, MCI -2015) and Cusativin-2021), 3 out of 16 nucleotides (Cusativin-2017 and MC1-2021), 1 out 8 nucleotides (hRNase 4), and 1 out 16 nucleotides (Colicin E5). Endoribonucleases with specificity that result in RNA cleavage on average once every 8 nucleotides (e.g., cleavage after a specific nucleotide followed by a purine; cleavage after a specific nucleotide followed by a pyrimidine; cleavage after a purine followed by a specific nucleotide; cleavage after a pyrimidine followed by a specific nucleotide; among others) are capable of producing a higher theoretical sequence coverage of the human transcriptome, whether based on the total content of their cleavage products or on the content of their cleavage products comprising unique sequences (see FIGURE 6).
A method, according to some embodiments, may include separation or removal of a ribonuclease from reactants and/or products. For example, a method may comprise contacting an RNA substrate with an endoribonuclease (e.g., RNase 4) to form one or more reaction products comprising at least one RNA substrate cleavage product and the endoribonuclease. A method may further include separating the at least one RNA substrate cleavage product from the endoribonuclease. An endoribonuclease (and, optionally, an end repair enzyme, if included) may be immobilized on a magnetic bead and separating may comprise magnetically gathering the immobilized endoribonuclease (e.g., into a pellet), thereby allowing the at least one RNA substrate cleavage product to be removed. In some embodiments, an endoribonuclease may be susceptible to a ribonuclease inhibitor and separating may comprise contacting the reaction products with an immobilized ribonuclease inhibitor to form immobilized complexes comprising the immobilized ribonuclease inhibitor,
thereby allowing the at least one RNA substrate cleavage product to be removed. Optionally, an endoribonuclease or a ribonuclease inhibitor may be immobilized on a surface (e.g., a column) or in a filter and reaction materials (e.g., reactions and/or reaction products) may be passed over the surface or through the filter. Additional information about separating an immobilized material from reaction products may be found in U.S. Patent Application 18/182,122 filed March 10, 2023, incorporated herein by reference.
According to some embodiments, a ribonuclease inhibitor provided in an immobilized form may be utilized to capture and remove an endoribonuclease (e.g., a soluble endoribonuclease) from a reaction mixture or vessel. Removal of a soluble endoribonuclease from a reaction mixture or vessel may be used to stop the digestion reaction of the RNA substrate at desired time points. In some embodiments, removal of soluble endoribonuclease at designed time points may be used as a strategy to produce incomplete or partial cleavage of the RNA substrate thus resulting in oligonucleotides cleavage products with one or more uncut cleavage sites. Such partially uncut oligonucleotides are longer in size than those that have been cut all at possible cleavage sites, and thus may increase mapping coverage at certain RNA substrate regions. In some embodiments, removal of a soluble endoribonuclease from a reaction mixture or vessel may be used to prevent contamination of downstream analytical instrumentation (e.g., chromatographic columns) with active endoribonucleases. In some embodiments, the removal of a soluble endoribonuclease from a reaction mixture or vessel may be used in automation protocols to facilitate and streamline methods of analysis.
Quantification Methods
In some embodiments, methods may include identifying and/or quantifying one or more components in an RNA sample or RNA substrate. For example, it may be desirable to accurately identify and/or quantify certain features of an RNA substrate, such as the presence of a 5 ’-cap structure, a 3 ’-poly(A) tail, or an RNA modification within an RNA. In the context of an RNA obtained by chemical synthesis or by enzymatic in vitro transcription (IVT) (e.g., intended for use as a vaccine or another therapeutic application), the presence and/or quantity of certain features (e.g., cap structures, polyA tails) may be desirable for the functional stability and/or high translational efficiency of the RNA.
In some embodiments, oligoribonucleotide products generated by cleavage of RNA substrates, using either an endoribonuclease or a composition comprising an endoribonuclease and an RNA end-repair enzyme, may be quantified by mass spectrometric methods. In some embodiments, oligoribonucleotides may be quantified using calibration
curves constructed with authentic standards. Oligoribonucleotides may be labeled, according to some embodiments, at their 5’ or 3’ end with stable isotopes for relative or absolute quantification. Differential incorporation of stable isotopes (also referred as to Tandem Mass Tag) may be used for multiplex quantitative analysis of oligonucleotides in which multiple oligonucleotide features (e.g., presence of a cap structure, one or more RNA modifications, presence of a polyA tail) may be analyzed simultaneously by means of isobaric tags.
In some embodiments, methods may include isotope labeling for quantitative analysis of oligonucleotides. Isotope labeling, according to some embodiments, may be performed in one step, wherein an isotopically labeled nucleotide is incorporated at the 3’ end of an oligonucleotide by the action of an RNA polymerase. In some embodiments, the isotopically labeled nucleotide is blocked at its 3’ position to prevent further extension of oligonucleotides beyond a single labeled nucleotide. Examples of such nucleotides are 3’- deoxynucleotides and 2’,3’-dideoxynucleotides, including but not limited to 3'- deoxyadenosine (Cordycepin), 3'-deoxyinosine, 3 '-deoxy guanosine, 3'-deoxyuridine, 3'- deoxycytidine, 2’,3’-dideoxyadenosine, 2’, 3 ’-dideoxyinosine, 2’,3’-dideoxyguanosine, 2’,3’- dideoxyuridine, 2’, 3 ’-dideoxy cytidine, 3’-azido-3'-deoxyadenosine, 3’-azido-3'- deoxyinosine, 3 ’-azi do-3 '-deoxyguanosine, 3 ’-azido-3 '-deoxyuridine, 3’-azido-3'- deoxycytidine, 3’-azidomethyl-3'-deoxyadenosine, 3’-azidomethyl-3'-deoxyinosine, 3’- azidomethyl-3 '-deoxyguanosine, 3 ’ -azidomethyl-3'-deoxyuridine, 3 ’ -azidomethyl -3'- deoxycytidine, 3’-fluoro-3'-deoxyadenosine, 3 ’-fluoro-3 '-deoxyinosine, 3’-fluoro-3'- deoxyguanosine, 3’-fluoro-3'-deoxyuridine, 3 ’-fluoro-3'-deoxy cytidine, 3’-amino-3'- deoxyadenosine, 3’-amino-3'-deoxyinosine, 3 ’-amino-3 '-deoxy guanosine, 3’-amino-3'- deoxyuridine, 3 ’-amino-3'-deoxy cytidine, 3 ’-O-methyl-3 '-deoxyadenosine, 3’-O-methyl-3'- deoxyinosine, 3 ’-O-methyl-3 '-deoxy uanosine, 3’-O-methyl-3'-deoxyuridine, 3’-O-methyl- 3 '-deoxy cytidine. Examples also include nucleotide analogues, including Carbovir, Ganciclovir, Lamivudine, and Clofarabine, among others. The stable isotope may be selected from one or more of Deuterium (d), Carbon-13 (13C), and Nitrogen- 15 (15N), in any combination (for instance, Cordycepin- 13 C5, Carbovir- 13 C,d2; Ganciclovir-d5; Lamivudine- 15N2,13C; and Clofarabine-13C,15N3). An isotopically labeled nucleotide may be incorporated at the 3’ end of an oligonucleotide by reaction of the corresponding nucleoside triphosphate with a polymerase. Examples of polymerases that may catalyze template independent addition of a desired nucleotide monophosphate (NMP) from the nucleoside triphosphate (NTP) to the 3’ end of RNA are (including recombinant and mutants therof): E.
coli Poly (A) Polymerase, Yeast Poly (A) Polymerase, Poly(U) Polymerase, and DNA Polymerase 0 (Pol0).
In some embodiments, isotope labeling for quantitative analysis of oligonucleotides may be performed by incorporation of an isotopically labeled nucleotide at the 5’ or 3’ end of an oligonucleotide by the action of an RNA ligase. The 3’ end labeling may be performed in one step using a T4 RNA ligase and a pre-adenylated nucleotide. Examples of a preadenylated nucleotide include A(5’)pp(5’)Cp, wherein the cytidine comprises a 3’-phosphate and one or more isotope labels; A(5’)pp(5’)Gp, wherein the guanosine comprises a 3’- phosphate and one or more isotope labels; A(5’)pp(5’)Up, wherein the uridine comprises a 3’-phosphate and one or more isotope labels; A(5’)pp(5’)Ip, wherein the inosine comprises a 3’-phosphate and one or more isotope labels; and A(5’)pp(5’)Ap, wherein the 3’ terminal adenosine comprises a 3 ’-phosphate and one or more isotope labels. The 3’ end labeling may comprise (i) adenylating an isotopically labeled pCp, pGp, pip, pUp, or pAp using a Methanobacterium thermoautotrophicum (Mth) RNA ligase in the presence of ATP, (ii) inactivating the Mth RNA ligase, and (iii) ligating the adenylated isotopically labeled nucleotide using a T4 RNA ligase.
In some embodiments, methods may include converting oligonucleotides to their 5’- phosphorylated form (for instance, by using T4 PNK in the presence of ATP) prior to ligation. 5’ end labeling may be performed by ligation of a 5’ adapter (e.g., 5-50 nucleotides in length) to an oligonucleotide, the adapter comprising one or more isotopically labeled nucleotides (e.g., adenosine, guanosine, uridine or cytidine labeled with one or more of Deuterium, Carbon- 13, and Nitrogen- 15). Ligation of a 5’ adapter to a target oligonucleotide may be performed by an RNA ligase such as T4 RNA ligase 2 and may be carried out in the presence of additives (e.g., PEG) and/or splint adapters (e.g., 5-50 nucleotides in length whose sequence is randomized or partially annealing to the 5’ adapter). 3’ end labeling may be performed by ligation of a 3’ adapter (e.g., 5-50 nucleotides in length) to an oligonucleotide, the adapter comprising one or more isotopically labeled nucleotides (e.g., adenosine, guanosine, uridine or cytidine labeled with one or more of Deuterium, Carbon-13, and Nitrogen- 15) using, for example, a T4 RNA ligase or variant thereof.
In some embodiments, a method for labeling an oligonucleotide for quantitative analysis may comprise incorporating a non-isotopically labeled nucleotide at the 5’ or 3’ end of the oligonucleotide, wherein the non-isotopically labeled nucleotide comprises a chemically reactive group that is capable of reacting with an isotopically labeled molecule
(also referred as to a mass label). A non-isotopically labeled nucleotide may comprise, in some embodiments, a 3 ’-deoxynucleotide or a 2’, 3 ’-dideoxynucleotide, in each case, having a chemically reactive group at the 2’ or 3’ position. In some embodiments, a chemically reactive group may be or comprise any of a carbonyl; a carboxyl; an active ester, e.g., a succinimidyl ester; a maleimide; an amine; a thiol; an alkyne, an azide; an alkyl halide; an isocyanate; an isothiocyanate; an iodoacetamide; a 2-thiopyridine; a 3-arylproprionitrile; a diazonium salt; an alkoxyamine; a hydrazine; a hydrazide; a phosphine, an alkene; a semicarbazone; an epoxy; a phosphonate; and a tetrazine. An isotopically labeled molecule may be selected from an amino acid (e.g., L-alanine-15N; L-alanine-13C3,15N; L-alanine- d4,15N; L-phenylalanine-15N; L-phenylalanine-13C9,15N; L-phenylalanine-d8,15N; L- proline-15N; L-proline-13C5,15N; L-proline-d7, 15N); an a-keto acid (e.g., a-ketobutyric acid-13C4; a-ketoisocaproic acid-13C; a-ketoisovaleric acid-13C5); a nucleotide (e.g., adenosine- 15N5; 2’-deoxyadenosine-15N5; uridine-15N2; thymidine-15N2; thymidine- 13C10,15N2); a bile acid (e.g., chenodeoxycholic acid-13C; cholic acid-13C; deoxycholic acid-d4; etc); a carbohydrate (e.g., N-acetylglucosamine-15N; D-arabinose-13C; D-fructose- 13C; D-galactose-13C; D-galactose-d; D-glucosamine-15N; D-glucose- 13 C); a drug (e.g., Phenacetin ethoxy-13C; Erythromycin N-methyl-13C; 5,5-Diphenylhydantoin diphenyl-dlO; Dopamine HCl-d3); a fatty acid (e.g., arachidic acid-d39; buytric acid-d7, L-carnitine HCl- methyl-d3; decanoic acid-dl9; linoleic acid-13C, palmitic acid-13C), a steroid (cholesterol -3- octanoate-13C; diethylstilbestrol-d8); or any derivative or combination thereof (e.g., putrescine- 13 C4; L-azidohomoalanine-13C4,15N2; omithine-d2; pipecolic acid-13C6,15N; 3-bromo-L-tryrosine-13C6; uric acid-15N2; choline chloride- 13 C2; D-mannitol-13C; glycerol-d5; propionic acid-d2; L-alanine-15N-L-phenylalanine-13C9,15N; L-alanine- 13C3,15N-L-phenylalanine-d8,15N; L-alanine-d4,15N-L-proline-13C5,15N), including the use of differentially isotopically labeled isobaric tags (e.g., Tandem Mass Tags TMT, iodoTMT, and aminoxyTMT). The isotopically labeled molecule may comprise a chemically reactive group that is capable of chemoselectively reacting with the non-isotopically labeled nucleotide, once the latter is incorporated into the target oligonucleotide (for instance, a L- azidohomoalanine-13C4,15N2 reacts with a 3 ’-alkyne-3 '-deoxy adenosine by means of a Cu(I)-catalyzed azide-alkyne cycloaddition). Examples of chemoselective reactions include a reaction between an amine reactive group and an electrophile (e.g., an alkyl halide or an N- hydroxysuccinimide ester (NHS ester)); a reaction between a thiol reactive group and an iodoacetamide or a maleimide; a reaction between an azide and an alkyne (azide-alkyne
cycloaddition or “Click Chemistry”). An azide-alkyne cycloaddition may be catalyzed by Cu(I) or strain-promoted to yield a 1,4-substituted triazole. Another type of useful cycloaddition is the reaction between a trans-cyclooctene (TCO) and a tetrazine (Tz) to form a dihydropyridazine bond. Examples and uses of chemoselective reactions in biological systems are reviewed in a variety of publications, such as in Sletten, E. M. and Bertozzi C. R. “Bioorthogonal Chemistry: Fishing for Selectivity in a Sea of Functionality” Angewandte Chemie International Edition English 2009, 48(38): 6974-98.
In some embodiments, an appropriate chemically reactive group is installed in the isotopically labeled molecule prior to its reaction with an oligonucleotide comprising non- isotopically labeled nucleotide. For example, a chemically reactive group (e.g., dibenzocyclooctyne (DBCO)) may be installed on a L-alanine-d4 or on a dipeptide L-alanine- d4-L-phenylalanine-13C9,15N, and then reacted with an oligonucleotide comprising a 3’- azido-3 '-deoxy adenosine through a strain-promoted 1,3-dipolar cycloaddition to form a 1,4- substituted triazole linkage (FIGURE 24). In some embodiments, non-isotopically labeled nucleotides may be selected from one of 3’-azido-3'-deoxyadenosine, 3’-azido-3'- deoxyinosine, 3 ’-azi do-3 '-deoxyguanosine, 3 ’-azido-3 '-deoxyuridine, 3’-azido-3'- deoxycytidine, 3’-azido-2’,3’-dideoxy-adenosine, 3’-azido-2’,3’-dideoxyinosine, 3’-azido- 2’,3’-dideoxyguanosine, 3’-azido-2’,3’-dideoxyuridine, 3’-azido-2’,3’-dideoxycytidine, 2’- azido-2’, 3 ’-dideoxy-adenosine, 2’ -azido-2’, 3 ’-dideoxyinosine, 2’-azido-2’,3’- dideoxyguanosine, 2’-azido-2’,3’-dideoxyuridine, and 2’ -azido-2’, 3 ’-dideoxy cytidine. Examples of non-isotopically labeled nucleotides further include 3’-alkyne-3'- deoxyadenosine, 3 ’-alkyne-3 '-deoxyinosine, 3’-alkyne-3'-deoxyguanosine, 3’-alkyne-3'- deoxyuridine, 3 ’-alkyne-3 '-deoxy cytidine, 3’-alkyne-2’,3’-dideoxy-adenosine, 3’-alkyne- 2’,3’-dideoxyinosine, 3 ’-alkyne-2’,3’ -dideoxy guanosine, 3 ’-alkyne-2’,3’ -dideoxyuridine, 3’- alkyne-2’,3 ’ -dideoxy cytidine, 2’ -alkyne-2’,3 ’-dideoxy-adenosine, 2’ -alkyne-2’,3 ’ - dideoxyinosine, 2’-alkyne-2’,3’-dideoxyguanosine, 2’ -alkyne-2’,3 ’-dideoxyuridine, 2’- alkyne-2’,3 ’ -dideoxy cytidine, 3’-propargyl-3'-deoxyadenosine, 3 ’ -propargyl-3 '-deoxyinosine, 3 ’-propargyl -3'-deoxyguanosine, 3 ’-propargyl-3 '-deoxyuridine, 3 ’-propargyl -3'- deoxycytidine, 3’-propargyl-2’,3’-dideoxy-adenosine, 3 ’-propargyl-2’, 3 ’-dideoxyinosine, d’propargyl -2’, 3 ’-dideoxy guanosine, 3 ’-propargyl -2’, 3 ’-dideoxyuridine, 3’-propargyl-2’,3’- dideoxycytidine, 2’ -propargyl -2’, 3 ’-dideoxy-adenosine, 2’-propargyl-2’,3’ -dideoxyinosine, 2’-propargyl-2’,3’-dideoxyguanosine, 2 ’-propargyl -2 ’,3 ’-dideoxyuridine, and 2’-propargyl- 2’, 3 ’-di deoxy cytidine.
In some embodiments, a method for labeling an oligonucleotide for quantitative analysis may comprise incorporating a chemically reactive group at the 5’ or 3’ end of the oligonucleotide to form an oligonucleotide having a reactive end and contacting (e.g., reacting) the oligonucleotide having a reactive end with an isotopically labeled molecule (FIGURE 25). For example, a chemically reactive group may be installed at the 5’ end of an oligonucleotide by incubating the oligonucleotide with ATPyS and T4 PNK. Methods may comprise, for example, contacting an RNA substrate with an endoribonuclease and a PNK to produce oligoribonucleotides and incorporating a chemically reactive group in the oligoribonucleotides to form oligoribonucleotides having a 5’ or 3’ chemically reactive group, wherein the contacting and the incorporating may be performed as coupled reactions, for example, coupled reactions further including a phosphorylation reagent (e.g., ATPyS) in the reaction location. In some embodiments, it may be desirable to purify the RNA oligonucleotides prior to incubation with a PNK and the phosphorylation reagent (e.g., ATPyS) so that the chemically reactive group is installed at the 5’ end of the oligonucleotide in a separate step. In some embodiments, phosphothiolated oligonucleotides may react with iodoacetamide- or maleimide-functionalized molecules (e.g., nucleotides) comprising isotope labels.
A chemically reactive group may be installed at the 3’ end of an oligoribonucleotide, for example, by reaction with sodium (or potassium) periodate to generate a dialdehyde reactive group at the 3’ end nucleotide 2’, 3 ’-diol position to produce a dialdehyde oligonucleotide. A dialdehyde oligonucleotide may react with hydrazine-, hydroxylamine-, or amine-functionalized molecules comprising isotope labels, including tandem mass tags. Similarly, a method may comprise installing a chemically reactive group to the 5’ end of a capped oligoribonucleotide wherein the cap structure comprises a 2’,3’-diol group. Converting a 5’ cap comprising a 2’,3’-diol to a dialdehyde may be concurrent (e.g., a coupled reaction) with the converting a 3’ end nucleotide 2’, 3 ’-diol to a dialdehyde within the same oligonucleotide. Alternatively, the 3’ end labeling may be blocked by incubating the oligonucleotide with a polymerase and a blocking 3’-deoxynucleotide (e.g., Cordycepin) or 2’,3’-dideoxynucleotide prior the generation of the reactive dialdehyde to produce selectively labeled oligoribonucleotides having a 5’ end cap comprising a 2’, 3 ’-diol. The methods may further comprise subsequent labeling by reaction with an appropriate molecular scaffold (e.g., amino acids, keto acids, fatty acids, diamines, amino alcohols, carbohydrates) comprising one or more combinations of heavy and light isotope atoms.
According to some embodiments, methods may include contacting an endoribonuclease with an RNase inhibitor in an amount sufficient to at least partially inhibit the activity of the endoribonuclease. RNase inhibitors may be useful to a number of biotechnological applications For example, methods may include an RNase inhibitor to terminate (e.g., precisely terminate) an endoribonuclease reaction at a desired point (e.g., a desired time point, upon consumption of a desired amount of substrate, upon formation of a desired product, upon formation of products having desired size(s)). Methods may include an RNase inhibitor to achieve controlled partial digestion of an RNA substrate (for instance, to generate RNA oligonucleotides that are on average longer in length due to incomplete cleavage of every possible cutting site that is specific for a given endoribonuclease). Methods may include, for example, an RNase inhibitor to avoid or prevent overdigestion of an RNA substrate (i.e., cutting substrate at nonspecific or low-preferred sites) during additional sample processing steps in a multistep preparation workflow (such as, isotope labeling of the digested RNA). In some embodiments, methods may include an RNase inhibitor to avoid or prevent over-digestion of an RNA substrate immediately prior to sample analysis (for instance, during idle instrument times, such as column equilibration or instrument failure). Methods may include an RNase inhibitor to avoid or prevent overdigestion of an RNA substrate upon storage of a digested sample in the presence of the endoribonuclease. Methods may include an RNase inhibitor, for example, to study enzymatic activity (such as in kinetic studies in enzymology). Methods may include an RNase inhibitor, for example, to reduce cytotoxicity of an RNase during protein expression and/or purification.
Targeted site-specific cleavage of RNA substrates for analysis of RNA features.
Targeted cleavage of an RNA substrate may allow, according to some embodiments, one or more RNA oligonucleotide products of interest to be isolated. Isolation of one or more RNA oligonucleotide products may be coupled with analysis of certain RNA features, such as a 5’ cap structure or nucleobase modification (e.g., 6-methyladenosine and 5-methycytidine). Methods for assaying the identity and efficiency of cap incorporation in kilobase-long synthetic mRNA transcripts may be used in connection with quality control and/or characterization of mRNA therapeutics and vaccines. Cleavage of a pre-defined oligonucleotide segment (e.g., 5-30) from the 5’ end of the mRNA substrate using a custom designed DNAzyme or ribozyme, or cleavage of a DNA-RNA hybrid duplex with RNase H
(Beverly et al., Anal. Bioanal. Chem. 2016, 408:5021-30) may include analysis by denaturing gel electrophoresis or LC-MS.
RNase H (RNase Hl) is a particular type of endonuclease that hydrolyzes phosphodiester bonds of RNA, when hybridized to DNA. RNase H is known to remove RNA primers from the Okazaki fragments of the replicating DNA. In vitro, RNase H cleaves one or more nucleotides away from the 5’ and/or 3’ of the target site (DNA-RNA hybrid duplex), giving rise to multiple cleavage products differing from each other by one or more nucleotides in length (with low or no particular nucleotide specificity). Formation of multiple cleavage products of a few nucleotides difference in length complicates the analysis by mobility -based or mass spectrometry -based methods. Application of RHase H methods may be limited by demands on DNA probe design. For example, applications of RNase H methods may be limited by the design of the DNA probe needed to form the duplex DNA-RNA substrate for RNase H binding and activity while also restricting its cutting region and avoiding spurious (or other unwanted) cleavage of the RNA substrate (e.g., through careful design of single-stranded probes comprising DNA-RNA or DNA-2’-O-methyl-RNA chimeras, wherein 4-6 DNA nucleotides are placed at 3’ end of the probe). The present disclosure provides methods and compositions that, according to some embodiments, are free of such limitations. For example, methods and compositions including a nucleotide-specific endoribonuclease (such as a mono-, di-, or trinucleotide-specific endoribonuclease) that selectively hydrolyzes the phosphodiester bonds of single-stranded RNA obviate the need for carefully designed chimeric probes required for RNase H methods.
In some embodiments, a method may comprise contacting a DNA probe (e.g., 5 to 50 nucleotides long) and an RNA comprising sequence (e.g., a sequence having a 5’ cap or a sequence having one or more nucleobase modifications) at least partially complementary to the DNA probe to form a DNA-RNA hybrid and contacting the DNA-RNA hybrid and a nucleotide-specific endoribonuclease. A method may comprise, according to some embodiments,
(a) contacting an RNA substrate and one or more DNA probes, each optionally comprising an affinity domain (e.g., biotin), for example, wherein at least a portion of the RNA substrate and at least a portion of the DNA probe(s) are complementary, to form a DNA-RNA hybrid duplex comprising a double-stranded portion and a single-stranded portion;
(b) contacting the DNA-RNA hybrid duplex with an enzyme composition, the enzyme composition comprising a single-strand-specific nucleotidespecific endoribonuclease and, optionally, an RNA end-repair enzyme, to form a cleaved DNA-RNA hybrid duplex and one or more single-stranded RNA fragments of the RNA substrate by cleavage of the RNA substrate at one or more sites within the single-stranded portion by the single-strand- specific nucleotide-specific endoribonuclease;
(c) optionally, contacting the cleaved DNA-RNA hybrid duplex and a solid support comprising an affinity capture domain capable of binding the affinity domain (e.g., streptavidin) to form an affinity capture complex comprising the affinity domain bound to the affinity capture domain;
(d) optionally, washing the affinity capture complexes to remove unbound materials, if any; and
(e) optionally, dissociating the cleaved DNA-RNA hybrid duplex to release the remaining portion of the RNA substrate from the one or more DNA probes.
A DNA probe may comprise a sequence complementary to a sequence of an RNA substrate. In some embodiments, a DNA probe may be shorter than an RNA substrate such that the duplex formed upon hybridization comprises RNA overhangs at the 5’ and/or 3’ ends. A DNA/RNA duplex, in some embodiments, may comprise a DNA probe and an RNA substrate longer than the DNA probe, wherein the RNA substrate has single-stranded overhangs at both the 5’ and 3’ ends. According to some embodiments, the portion of the RNA substrate hybridized to the DNA probe may be protected from endoribonucleases that cleave only single stranded RNA while the single-stranded overhangs at one or both of the 5’ and 3’ ends would be subject to cleavage.
In some embodiments, hybridization of a DNA probe to a complementary sequence of an RNA substrate may be directed or guided by an accessory protein, for example, a prokaryotic argonaute (e.g., a bacterial argonaute, such as Thermus thermophilus argonaute), whose endonucleolytic activity has been inactivated but retained its ability to search for their guide-defined substrate). Including an accessory protein may hasten hybridization (e.g., more rapid seeking of RNA substrate at a rate near the limit of diffusion). Including an accessory protein may improve (e.g., overcome limitations on) substrate accessibility; and/or facilitate hybridization by reducing the entropic barrier to duplex formation. In some
embodiments, an accessory protein that selectively binds duplex substrates over singlestranded substrates (e.g., Carnation Italian Ringspot Virus pl9 protein) may be used to stabilize or conceal the duplex segment. In some embodiments, one or more chemical additives may be included in methods or compositions of the disclosure to increase the stability and/or specificity of the DNA-RNA duplex, including salts (e.g, NaCl, MgCh), crowding agents (e.g., polyethylene glycol (PEG), Ficoll, Dextran, etc.), duplex strengtheners (e g., betaine, proline, trehalose, proline, tetramethylammonium chloride, etc ), and ionic liquids (e.g., imidazolium, pyridinium, pyrrolidinium, and phosphonium cations; halides, tetrafluoroborate (BF4-), hexafluorophosphate (PF6-), and bis[(trifluoromethyl) sulfonyl]imide (NTf2~) anions).
A high-salt washing buffer may be used to wash away unbound RNA (e.g., the one or more single-stranded RNA fragments of the RNA substrate cleaved from the RNA substrate) while retaining solid support-bound DNA-RNA duplexes. To release (or elute) the RNA oligonucleotide strand (remaining portion of the RNA substrate) from captured DNA-RNA duplexes, a low-salt buffer (or water) or treatment with a DNase (e.g., DNase I) may be used. Alternatively, the RNA oligonucleotide strand (as part of the DNA-RNA duplexes) may be retained on the solid support for downstream applications.
According to some embodiments, a solid support may include any solid (flexible or rigid) material onto which a DNA-RNA hybrid duplex may be captured. For example, a solid support may include a matrix formed from an affinity capture domain or coated with the affinity capture domain. A solid support may be, for example, a bead including a magnetic bead, a column, a porous matrix, or a flat surface formed from for example, plastic or paper. In some embodiments, a solid support may be biological, non-biological, organic, inorganic or a combination thereof. A solid support, according to some embodiments, may have any desired form including, for example, particles, strands, precipitates, gels, sheets, tubings, spheres, containers, capillaries, cartridges, pads, slices, films, plates, slides, and/or have any desired shape, including, for example, a plane, a disc, a sphere, a ring, a torus, a cube, a cylinder, a cone, a vesica, a rod, and an ellipsoid. The surface of a solid support may comprise one or more materials including, for example, polymers, plastics, resins, polysaccharides, silica or silica-based materials, carbon, metals, inorganic glasses, and membranes. The surface of a solid support may comprise one or more functional groups. Example solid supports include glass surfaces (e.g., glass slides, microtiter plates) and suitable sensor elements. Sensor elements may include, for example, functionalized
polymers (e.g., in the form of beads). Example solid supports also include chemically modified oxidic surfaces (e.g. silicon dioxide, tantalum pentoxide or titanium dioxide), chemically modified metal surfaces (e.g., noble metal surfaces such as gold or silver, copper or aluminium surfaces), magnetic surfaces (e.g., Fe, Mn, Ni, Co, and their oxides), quantum dots (e.g., III-V (GaN, GaP, GaAs, InP, or InAs) or II- VI (ZnO, ZnS, CdS, CdSe, or CdTe) semiconductors), Ln-doped fluoride nanocrystals, and rare earth-doped oxidic nanomaterials.
In some embodiments, a DNA probe may hybridize with and protect a portion of a single-strand RNA from cleavage by a single-strand specific endoribonuclease. Synthetic nucleic acids may hybridize a single-stranded RNA substrate and/or protect the singlestranded RNA substrate from endoribonuclease cleavage (e.g., like a DNA probe that hybridizes to such single-stranded RNA). Synthetic nucleic acids may include, for example, a peptide nucleic acid (PNA), a lock nucleic acid (LNA), an unlock nucleic acid (UNA), a bridge nucleic acid (BNA), a triazole nucleic acid, a morpholine nucleic acid, an amide- linked nucleic acid, a 1,5 anhydrohexitol nucleic acid (HNA), a cyclohexenyl nucleic acid (CeNA), an arabinose nucleic acid (ANA), a 2'-fluoro-arabinose nucleic acid (FANA), a ot-L- threofuranosyl nucleic acid (TNA), a 4’-thioribose nucleic acid (4’S-RNA), a 2'-fluoro-4’- thioarabinose nucleic acid (4’S-FANA), a 4’-selenoribose nucleic acid (4’Se-RNA), an oxepane nucleic acid (ONA), or a combination thereof. Other synthetic nucleic acids that may be used include RNA probes comprising complete or partial 2'-OH nucleotides substitution with 2'-O-alkyl-nucleotides (e g., 2'-O-methyl-nucleotides), 2'-O-methoxyethyl- nucleotides (MOE), 2' -fluoro-nucleotides, 2'-O-allyl-nucleotides, 2'-0-alkylamine- nucleotides (e.g., 2'-O-ethylamine-nucleotides), 2'-O-cyanoethyl-nucleotides, 2'-O- acetalester-nucleotides, and 2'-azido-nucleotides. Further synthetic nucleic acids that may be used include DNA or RNA probes comprising partial or complete backbone modifications such phosphororothioate (replacement of one non-bridging oxygen atom of the phosphate group with a sulfur atom), phosphorodi thioate (both non-bridging oxygen atoms of the phosphate group are replaced with sulfur), alkyphosphonate (a non-bridging oxygen atom of the phosphate group has been replaced with alkyl group, e g. methyl), arylphosphonate (a non-bridging oxygen atom of the phosphate group has been replaced with aryl group, e.g. phenyl), N-phosphoramidate (an oxygen atom is replaced with an amino group either at the 3’- or 5’-oxygen), boranophosphate (one non-bridging oxygen atom of the phosphate group is replaced with BH3), phosphonoacetate (PACE, one non-bridging oxygen atom of the phosphate group is replaced with an acetate group), and 2 ’,5 ’-phosphodiester linkages.
In some embodiments, oligoribonucleotide products generated by cleavage of DNA- RNA hybrid duplexes may be quantified by mobility -based methods, such as gel- or capillary electrophoresis, or by mass spectrometric methods. In some embodiments, oligoribonucleotide products are quantified using calibration curves constructed employed authentic standards. In other embodiments, such oligoribonucleotides are labeled at their 5’ or 3’ end with stable isotopes for relative or absolute quantification as disclosed herein. Differential incorporation of stable isotopes may be used for multiplex quantitative analysis of oligonucleotide, enabling simultaneous analysis of multiple RNA features by means of isobaric tags.
Kits
The present disclosure further relates to kits including an endoribonuclease and/or an RNA end repair enzyme. For example, a kit may include an endoribonuclease having an amino acid sequence that (i) corresponds to an amino acid sequence of a first species (e.g., a vertebrate species (for example, Homo sapiens, Sus scrofa), a bacterial species (for example, Escherichia coll), a fungus species (for example, Aspergillus oryzae), a plant species (for example, Momordica charantia, Cucumis sativus), and an archaea species (for example, Pyrococcus furiosus)) or (ii) is a non-naturally occurring sequence. A kit may include, for example, an RNA end repair enzyme having an amino acid sequence that (i) corresponds to an amino acid sequence of a species other than the first species (e.g., a bacterial species or a bacteriophage species) or (ii) is a non-naturally occurring sequence. A kit may include one or more additional enzymes (e.g., an RNA polymerase, an RNA ligase), a denaturing agent (e.g., urea, formamide, dimethylformamide, guanidinium thiocyanate, sodium salicylate, dimethyl sulfoxide, propylene glycol, poly(ethylene glycol), and cetyltrimethylammonium bromide), a buffering agent, and any combination thereof. An enzyme may be included in a storage buffer (e.g., comprising glycerol and a buffering agent). In some embodiments, a kit may include a reaction buffer which may be in concentrated form, and the buffer may contain additives (e g. glycerol), salt (e.g. KC1), reducing agent, EDTA or detergents, among others. A kit may include an endoribonuclease having specificity for one or more dinucleotide combinations (e.g., cleavage after a specific nucleotide followed by a purine, cleavage after a specific nucleotide followed by a pyrimidine, cleavage after a purine followed by a specific nucleotide, and cleavage after a pyrimidine followed by a specific nucleotide). For example, an endoribonuclease may have an average cleavage rate of once every 6-12 nucleotides. Examples of endoribonucleases for a kit may include hRNase 4, RNase Tl, RNase U2,
RNase A, Colicin E5, MCI, Cusativin, Csxl, MazF, ChpB, MqsR, and YafO. A kit may comprise an RNA end repair enzyme, for example, comprising phosphodiesterase and phosphomonoesterase activities. A kit may include, according to some embodiments, a divalent metal, for example, a divalent metal selected from magnesium(II), manganese(II), cobalt(II), and nickel(II). A kit may comprise one or more rNTPs including, for example, one, two, three of all four of rATP, rUTP, rGTP and rCTP. A kit may further comprise one or more modified nucleotides. In some embodiments, a kit may include an RNase inhibitor. In some embodiments, a kit may include an affinity-labeled DNA probe. One or more components of a kit may be included in one container for a single step or coupled reaction, or one or more components may be contained in one container (e.g., a box, case), but separated (e.g., in one or more tubes) from other components for sequential use or parallel use. The contents of a kit may be formulated for use in a desired method or process.
An enzyme, for example, an enzyme included in a kit, may have any desired form (e.g., fluid, freeze-dried, and lyophilized forms). An enzyme composition and/or kit may comprise non-ionic, ionic e.g. anionic or zwitterionic surfactants and crowding agents.
A kit may include instructions for using the components of the kit to practice a desired method (e.g., methods for analyzing an RNA substrate). Instructions may be recorded on a suitable recording medium. For example, instructions may be printed on a substrate, such as paper or plastic and/or displayed electronically. Instructions may be present in the kits as a package insert, in the labeling of the container of the kit or components thereof (i.e., associated with the packaging or sub-packaging). Instructions may be present as an electronic storage data file residing on a suitable computer readable storage medium (e.g. a CD-ROM, a flash drive). Instructions may be provided remotely using, for example, cloud or internet resources with a link or other access instructions provided in or with a kit.
EXAMPLES
Some specific example embodiments may be illustrated by one or more of the examples provided herein.
EXAMPLE 1: Expression and purification of human RNase4 (hRNase 4)
Recombinant wild-type hRNase 4 enzyme was periplasmically expressed as a MBP fusion protein containing an N-terminal signal peptide (61.7 kDa) (see FIGURE 1) and stored in an ammonium acetate buffer [100 mM NFLOAC, pH 5.5, 0.5 mM DTT, 50% glycerol]. Expression of hRNase 4 was induced with 10 pM IPTG, and the protein was expressed from
a periplasmic hRNase 4-containing plasmid in T7 Express lysY Competent E. coli [MiniF lysY (CamR) / fhuA2 lacZ::T7 genel [Ion] ompT gal sulAl l R(mcr-73::miniTnlO— TetS)2 [dem] R(zgb-210::Tnl0--TetS) endAl A(mcrC-mrr)114::IS10] (NEB C3010) for 16 h at 16°C. The cells were lysed by sonication in lysis buffer (20 mM Tris/Cl pF! 7.5, 200 mM NaCl, 1 mM DTT) and protease inhibitors (1 mM PMSF, 0.5 nM leupeptin, 2.75 mM benzamidine, 2 nM pepstatin), followed by the removal of cell debris by centrifugation at 21,000 xg for 1 hour. The enzyme was purified from the crude extract using 10 mh BioRad Econo-Pac disposable chromatography columns packed with 1.5 mL Amylose Resin (NEB E8021). The flow rate during loading, washing, and elution was regulated to ~ 0.8 ml/min using a Discofix® 1-way stopcock. After elution in elution buffer (EB1; 20 mM Tris/Cl pH 7.5, 250 mM NaCl, 1 mM DTT, 10 mM maltose), the protein was loaded onto a GraviTrap His column, equilibrated with GTH column buffer (20 mM Na2HPO4 pH 7.5, 0.5 M NaCl, 1 mM DTT, 20% glycerol). hRNase 4 was eluted in two 3 ml fractions with GTH elution buffer (20 mM Na2HPO4 pH 7.5, 0.5 M NaCl, 1 mM DTT, 0.5 M imidazole, 20% glycerol). The enzyme-containing fraction was dialyzed into the hRNase 4 storage buffer (200 mM NH4OAC, pH 5.5 + 1 mM DTT), and after dialysis supplemented with an equal volume of 100% glycerol.
EXAMPLE 2: Characterization of human hRNase 4 activity and cleavage specificity The activity and specificity of hRNase 4 cleavage was assessed utilizing a LC- MS/MS-based multiplexed cleavage assay. A defined pool of 13 synthetic oligonucleotides comprising all possible dinucleotide combinations (at least once) flanked by poly-adenosine sequences of varying lengths (Table 1) was prepared. To assess hRNase 4 activity and specificity, 5 pL of this oligonucleotide pool (comprising 25 pmol of each individual oligonucleotide) were digested with 2 pL of a 1:10, 1:20 or 1 :40 dilution of hRNase 4 in 1 x NEBuffer 1 (10 mM Bis-Tris-Propane-HCl, 10 mM MgC12, 1 mM DTT, pH 7). The mixture was incubated at 37°C for 1 h with shaking at 300 rpm. The resultant digestion products were filtered using a Millipore Ultrafree MC-GV spin column (0.22 um) at 13,400 rpm for 5 minutes.
Table 1. Synthetic oligonucleotides utilized to assess hRNase 4 cleavage activity and specificity. All possible dinucleotide motifs are shown.
Each sample was characterized by LC-MS/MS analysis. Liquid chromatographic separation of RNA oligonucleotides was performed on a Thermo Scientific Vanquish Horizon UHPLC equipped with a DNAPac RP Column (2.1 x 50 mm, 4 mm) at 70°C using a 25-minute gradient of solvent A (1% hexafluoroisopropanol (HFIP), 0.1% N,N- diisopropylethylamine (DIEA), 1 pM EDTA) and increasing solvent B (5 - 35%) (80% Methanol, 0.075% HFIP, 0.0375% DIEA, 1 pM EDTA) at a 0.3 mL/min flow rate. MS/MS data were collected on a Thermo Scientific Q Exactive Plus Orbitrap Mass Spectrometer. Intact mass analysis was performed (scan range: 480 - 2500 m/z) at a resolution of 70,000. Raw intact MS data was deconvoluted utilizing ProMass (Novatia LLC) and Avalon peak detection and integration algorithm (Thermo Fisher Scientific). To determine the relative abundance of each input oligonucleotide and cleavage product following incubation with
hRNase 4, deconvoluted mass data was compared with the theoretical masses of each input oligonucleotide and cleavage product using a 10-ppm mass difference cutoff.
A heatmap of the relative abundance of each input oligonucleotide within the oligonucleotide pool after incubation with hRNase 4 is shown in FIGURE 2. Oligonucleotides comprising a uridine followed by a purine (abbreviated as “R”) were cleaved by incubation with hRNase 4. The oligonucleotide comprising the ‘UC’ dinucleotide was not cleaved by hRNase 4 at any of the tested concentrations. Also, none of the oligonucleotides comprising ‘CG’, ‘CC’, ‘CA’ motifs were cleaved by hRNase 4. Cleavage analyses of oligonucleotides comprising the ‘UU’ dinucleotide motif and oligonucleotides comprising the “CU” dinucleotide motif were not conclusive because each of the oligonucleotides also included an ‘UA’ motif (which is cleaved by hRNase 4).
To better define the hRNase 4 cleavage specificity, the identities and quantities of each cleavage product were analyzed. First, the 5' cleavage products were analyzed with respect to the composition of their 3'-terminal nucleotide residue. For the purpose of this experiment, 5’ cleavage products were grouped according to the composition of their 3'- terminal nucleotide residue, regardless of their phosphorylation status. As shown in FIGURE 3, digestion with hRNase 4 resulted in an accumulation of 5' cleavage products comprising a uridine at the 3'-terminus. Digestion with hRNase 4 also produced, albeit to a much lesser extent and dependent upon the enzyme concentration, some very low levels of 5' cleavage products comprising a cytidine at the 3'-terminus. Next, the 3' cleavage products were analyzed with respect to the composition of their 5'-terminal nucleotide residue. For the purpose of this experiment, 3’ cleavage products were grouped according to the composition of their 5'-terminal nucleotide residue, regardless of their phosphorylation status. As shown in FIGURE 4, digestion with hRNase 4 resulted in an accumulation of 3' cleavage products comprising a 5'-adenosine or a 5'-guanosine (note that the higher total intensity of 5’- adenosine products at lower hRNase 4 dilutions is due to the fact that there were fourfold more ‘UA’ sites than ‘UG’ sites in the oligonucleotide pool; however, at higher hRNase 4 dilutions it is possible to observe a slight enrichment of 5’-adenosine products). Taken together, these data show that hRNase 4 cleaves after a uridine site followed by a guanine or adenine nucleotide. These data are consistent with reports suggesting that hRNase 4 may preferentially cleave RNA between ‘UR’ dinucleotides (i.e., on the 3' side of uridine and on the 5' side of adenosine or guanosine) (Shapiro et al., 1986; Zhou and Strydom, 1993; Teryzan et al., 1999).
EXAMPLE 3: Prediction of hRNase 4 cleavage products in mRNA transcripts
The utility of the ‘UR’ cleavage specificity of hRNase 4 for mRNA characterization by LC-MS/MS was assessed by computational comparison with the reported specificities of other endoribonucleases that have been previously used for analysis RNA by LC-MS/MS. A complete theoretical digestion of 1000 randomly selected human mRNA transcripts (less than 5000 bases in length) (RefSeq, https://www.ncbi.nlm.nih.gov/refseq/) (see FIGURE 5A), of E. coli coding sequences (greater than 300 bases in length) (RefSeq) (see FIGURE 5B) and of the BNT162b2, COVID-19 mRNA vaccine sequence (Vogel et al., 2021) (see FIGURE 5C) was performed using the following endoribonuclease cleavage specificities (in the cases where discrepancies in the specificity of a given endoribonuclease have been reported, both reported specificities were used for calculation): Colicin E5 cleaves between ‘GU’; Cusativin-2017 cleaves between ‘CA’, ‘CG’, and ‘CU’, but not between ‘CC’ (Addepalli et al., 2017); Cusativin-2021 cleaves between ‘CG’, ‘CU’, ‘AU’, and ‘UU’ (Griinberg et al., 2021); MC1-2015 cleaves at the 5' end of ‘U’ (Addepalli et al., 2015); MC1-2021 cleaves between ‘AU’, ‘CU’, and ‘UU’, but not between ‘GU’ (Griinberg et al., 2021); RNase A cleaves at the 3' end of ‘C’ and ‘U’; and RNase T1 cleaves at the 3' end of ‘G’.
The calculated sequence coverage for each mRNA transcript based on the predicted cleavage products formed by digestion with a given endoribonuclease is shown on FIGURE 5. Only cleavage products between 4 and 40 nucleotides in length were utilized for the calculation of RNA sequence coverage, as they are the most useful for MS/MS sequencing purposes. Exact duplicate cleavage products were also excluded, as they are not uniquely mappable to a given RNA sequence. As shown in FIGURE 5A-C (left panels), hRNase 4 produced the highest median total predicted sequence coverage among the tested endoribonuclease specificities across transcripts from species as diverse as human and E. coli. hRNase 4 also resulted in the highest median theoretical sequence coverage across all transcripts considering only cleavage products with a unique mass (i.e., excluding cleavage products with isomeric sequences) as shown in FIGURE 5A-C (right panels).
Assuming an approximate equal and random distribution of ‘G’, ‘C’, ‘A’, and ‘U’ nucleotides within each of 1000 randomly selected human mRNA transcripts (RefSeq), one would expect that on average Colicin E5 would cleave once every 16 nucleotide residues, Cusativin (Griinberg et al., 2021) would cleave once every 4 nucleotide residues, MCI (Griinberg et al., 2021) would cleave three times every 16 nucleotide residues, RNase A
would cleave once every 2 nucleotide residues, RNase T1 would cleave once every 4 nucleotide residues, and hRNase 4 would cleave once every 8 nucleotide residues.
Identifying cleavage frequency as a results effective variable and/or optimizing cleavage frequency range for mass spectrometry -based RNA sequencing have been confounded, in part, because distance between consecutive endoribonuclease cleavage sites may vary in different RNA sequences and/or may result in oligonucleotides that are too short for sequencing purposes. Furthermore, the cleavage efficiency at any given endoribonuclease cleavage site may be affected by local RNA secondary structures and presence of RNA modifications. As shown in FIGURE 12 for the experimental cleavage of FLuc IVT mRNA (1766 nt in length), most of the sequenced oligonucleotide products (25th-75th percentile) from hRNase 4 digestion were in the range of 9-18 nt (median length of 12 nt) with the longest products ranging from 41 to 45 nt in length. Comparatively, most of the sequenced oligonucleotide products from RNase T1 digestion were in the range of 7-12 nt (median length of 8 nt) with the longest product being 24 nt in length.
Discrepancies may exist between a predicted cleavage frequency and the corresponding actual or observed mean oligonucleotide product length (e.g., hRNase 4 has a predicted cleavage frequency of 1 out 8 nucleotides, but was observed experimentally to produce oligonucleotide products having median length of 12 nt; RNase T1 has a predicted cleavage frequency of 1 out 4 nucleotides, but was observed experimentally to produce oligonucleotide products having median length of 8 nt). The data shown in FIGURE 5A-C may be used as a guide to select endoribonucleases that may improve sequencing and fingerprinting of mRNAs using LC-MS/MS techniques. For example, one may infer from FIGURE 5A-C that cleavage frequencies within the range of once every 6-12 nucleotide residues provide the highest RNA sequence coverage as follows. Theoretical sequence coverage of endoribonucleases with cutting frequency lower than 6: Cusativin-2017 (3 out 16 nt or every ~5.3 nt), 68% mean coverage; MCI -2021 (3 out 16 nt or every ~5.3 nt), 69% mean coverage; MC1-2015 (1 out 4 nt), 59% mean coverage; Cusativin-2021 (1 out 4 nt), 63% mean coverage; RNase T1 (1 out 4 nt), 57% mean coverage; RNase A (1 out 2 nt), 18% mean coverage. Theoretical sequence coverage of endoribonucleases with cutting frequency higher than 12: Colicin E5 (1 out 16 nt), 56% mean coverage. Theoretical sequence coverage of hRNase 4 (1 out 8 nt cutting frequency), 81% mean coverage.
The impact of theoretical cleavage frequencies on the sequence coverage of human coding sequences (see FIGURE 6A), E. coli coding sequences (see FIGURE 6B), and
BNT162b2 vaccine sequence (see FIGURE 6C) was further assessed for three general classes of cleavage motifs. The classes of cleavage motifs were as follows: cleavage after a given single nucleotide (‘N’); cleavage after a given single nucleotide followed a purine (‘NR’); cleavage after a given single nucleotide followed a pyrimidine (‘NY’); cleavage after a purine followed by a single nucleotide (‘RN’); cleavage after a pyrimidine followed by a single nucleotide (‘YN’); and cleavage between a single dinucleotide sequence (‘NN’). For simplicity, the classes of cleavage motifs were represented as follows: cleavage after a given single nucleotide followed a purine (‘NR)’ and cleavage after a given single nucleotide followed a pyrimidine (‘NY)’ were represented as ‘N(Y/R)’; cleavage after a purine followed by a single nucleotide (‘RN’) and cleavage after a pyrimidine followed by a single nucleotide (‘YN’) were represented as ‘(Y/R)N’.
On average, the expected cleavage frequency for endoribonucleases with ‘N’ specificity is 1 out of 4 nucleotide residues; the expected cleavage frequency for endoribonucleases with ‘N(Y/R)’ specificity is 1 out of 8 nucleotide residues; the expected cleavage frequency for endoribonucleases with ‘NN’ specificity is 1 out of 16 nucleotide residues. Examples of endoribonucleases with the ‘N(Y/R)’ specificity are those whose specificity comprise one of: a uridine followed by a pyrimidine; a cytidine followed by a pyrimidine; an adenosine followed by a pyrimidine; a guanosine followed by a pyrimidine; a uridine followed by a purine; a cytidine followed by a purine; an adenosine followed by a purine; or a guanosine followed by a purine. Similarly, endoribonucleases with ‘(Y/R)N’ specificity also result in cleavage frequencies that are on average 1 out of 8 nucleotide residues. Examples of endoribonucleases with the ‘(Y/R)N’ specificity are those whose specificity comprise one of: a pyrimidine followed by a uridine; a pyrimidine followed by a cytidine; a pyrimidine followed by an adenosine; a pyrimidine followed by a guanosine; a purine followed by a uridine; a purine followed by a cytidine; a purine followed by an adenosine; or a purine followed by a guanosine. Nucleotide combinations that result in cutting frequencies within the range of once every 6-12 nucleotide residues may include cleavage sites comprising two or more nucleotides. Examples of desirable cleavage specificities may include:
• those having cutting frequencies of 6 out of 64 or every -10.7 nucleotides (e g., URH (wherein H = A or C or U), ARH, CRH, GRH, UYH, AYH, CYH, GYH, RUH, RAH, RCH, RGH, YUH, YAH, YCH, YGH, RHU, RHA, RHC, RHG, YHU, YHA, YHC. YHG, HRU, HRA, HRC, HRG, HYU, HYA, HYC,
HYG, URD (wherein D = A or G or U), URB (wherein B = G or C or U), and URV (wherein V = A or C or G));
• those having cutting frequencies of 8 out of 64 or every 8 nucleotides (e.g., RRR, YYY, RYR, YRR, RYY, YYR, RRK (wherein K = G or U), RRM (wherein M = A or C), RRS (wherein S = G or C), RRW (wherein W = A or U));
• those having cutting frequencies of 2 out of 16 or every 8 nucleotides (e.g., UK, KU, UM, MU, US, SU, UW, WU); and
• those having cutting frequencies of 36 out of 256 or every ~7.1 nucleotides (e.g., RRHH, RHRH, HRHR, HHRR, YYHH, WWHH, KKHH, KMBD, RSHD); among other cleavage specificities.
As shown in FIGURE 6A-C, cleavage specificities (£N(Y/R) & (Y/R)N’) that result in similar cleavage frequencies (1 out of 8) as to that of hRNase 4 produced consistently the highest theoretical sequence coverage (>75%) across a plurality of transcripts. In practice, the actual sequence coverage may vary, for example, where cleavage efficiency is a function of reaction conditions (e.g., buffer composition, pH, salt concentration, temperature, incubation time, etc.); enzyme specificity (e.g., some endonucleases show minor cleavage activities to other nucleotide combinations); enzyme quality (e.g., presence of contaminating nucleases or absence of essential/nonessential cofactors); and/or properties of the substrate RNA (e.g., the presence of secondary structure and/or RNA modifications). Endoribonucleases with cleavage specificity similar to the ‘UR’ of hRNase 4, such as ‘N(Y/R)’ or ‘(Y/R)N’, may be suitable for applications such as mass spectrometry -based sequencing and fingerprinting of mRNA and other RNA substrates.
EXAMPLE 4: Digestion of an RNA oligonucleotide with T4 PNK and hRNase 4
Digestion of RNA with certain endonucleases may produce a mixture of cleavage products comprising 2',3'-cyclic-phosphate and 3'-phosphate at the 3’ terminus. In many cases, this process depends on the enzyme concentration, the digestion buffer, and/or incubation time, in any combination. In some cases, the product mixture may also comprise 2',3'-hydroxylated species. In other cases, enzyme-independent hydrolytic opening of 2', 3 cyclic-phosphate may generate a mixture comprising 2',3'-cyclic-phosphate, 3'-phosphate, 2'- phosphate, 2',3'-hydroxy, 5’-phosphate, and/or 5’-hydroxy termini, in any combination. The occurrence of any of these mixtures convolutes analysis by mass spectrometry techniques. Therefore, it is highly desirable to resolve these mixtures prior to mass spectrometry analysis.
This example describes the digestion of a synthetic RNA oligonucleotide substrate by co-incubation with a mixture of T4 PNK (Phage T4 polynucleotide kinase) and hRNase 4. Briefly, 12.5 pmol of an RNA oligonucleotide substrate (Oligonucleotide #1: AAAAAAAAAAAAAUGAAAAAAAAAA)(SEQ ID NO:5) was incubated with a combination of 0.2 pL of T4 PNK and 1 pL of human hRNase 4 in 1 x NEBuffer 1 (10 mM Bis-Tris-Propane-HCl, 10 mM MgCE, 1 mM DTT, pH 7) for 30 minutes at 37°C. A 9-minute gradient of solvent A (1% hexafluoroisopropanol (HFIP), 0.1% N,N-diisopropylethylamine (DIEA), 1 pM EDTA) and increasing solvent B (5 - 35%) (80% Methanol, 0.075% HFIP, 0.0375% DIEA, 1 pM EDTA) at a 0.3 mL/min flow rate was utilized for UHPLC analysis. A corresponding control hRNase 4 digestion in the absence of T4 PNK was utilized for comparison. The identity of all RNA cleavage products was confirmed by MS/MS analysis on a Thermo Scientific Q Exactive Plus Orbitrap Mass Spectrometer as described in Example 2.
FIGURE 7 shows the overlaid UV chromatograms from hRNase 4 treatment of the Oligonucleotide #1 in the presence and absence of T4 PNK. A mixture of 5' cleavage products comprising 3 '-phosphorylated and 2',3'-cyclic-phosphorylated ends was observed upon hRNase 4 digestion in the absence of T4 PNK (see FIGURE 7, cleavage products #3 and #4). Whereas a single 5' cleavage product comprising a 2', 3 '-hydroxylated end was observed upon hRNase 4 digestion and addition of T4 PNK (see FIGURE 7, cleavage product #2). A 3’ cleavage product comprising a 5 ’-hydroxylated end was observed in both conditions in similar quantities (see FIGURE 7, cleavage product #1). Taken together, these data demonstrate that the presence of an end-repair enzyme such as T4 PNK simplifies the analysis of hRNase 4 digests by deconvoluting mixtures of cleavage products comprising different phosphorylation statuses.
EXAMPLE 5: Use of hRNase 4/T4 PNK for sequencing and mass fingerprinting of an mRNA
This example describes sequencing and fingerprinting of Firefly Luciferase messenger RNA (FLuc mRNA) by means of digestion with a combination of hRNase 4 and T4 PNK. For comparison purposes, digestion of FLuc mRNA was also performed with RNaseTl alone.
A FLuc mRNA transcript was produced by in vitro transcription (IVT) utilizing the HiScribe™ T7 High Yield RNA Synthesis Kit (NEB, Catalog # E2040S). A linearized DNA template encoding the FLuc mRNA sequence (1 pg) under the control of T7 promoter was
mixed with 10 mM rATP, 10 mM rGTP, 10 mM rCTP, 10 mM rUTP, and 2 pL of T7 RNA Polymerase in a 20 pL reaction volume. The resultant mixture was incubated at 37°C for 2 h. The reaction mixture was diluted to 100 pL in 1 x DNase 1 buffer (10 mM Tris-HCl, 2.5 mM MgC12, 0.5 mM CaC12, pH 7.6) and incubated with 2 pL of DNase 1 (NEB, Catalog # M0303S) for 15 minutes at 37°C. Subsequently, the in vitro transcribed FLuc mRNA (FLuc IVT mRNA) was purified utilizing an NEB Monarch RNA Cleanup Kit (500 pg) (NEB, Catalog # T2050L). The concentration of purified FLuc IVT mRNA was quantified utilizing a NanoDrop spectrophotometer (Thermo Fisher Scientific).
Digestion using hRNase 4/T4 PNK was performed as illustrated in the example workflow in FIGURE 8 (left panel) and example composition of Table 2. First, 10 pg of purified FLuc IVT mRNA was mixed with 3 M Urea in 1 x NEBuffer 1 (10 mM Bis-Tris- Propane-HCl, 10 mM MgC12, 1 mM DTT, pH 7). The mixture was heated to 90°C for 10 minutes and cooled to room temperature. The mixture was diluted 3 -fold in a 1 x NEBuffer 1. Then 0.4 pL of T4 PNK (160 units) and 2 pL of purified recombinant human RNase4 was added to the reaction mixture (See Table 2). RNA digestion with hRNase 4/T4 PNK was performed for 2 h at 37°C with shaking at 300 rpm. For comparison, a parallel digestion of FLuc IVT mRNA was performed using 1 pL of RNase T1 (FIGURE 8, right panel). The resultant digestion products of either workflow were filtered using a Millipore Ultrafree MC- GV spin column (0.22 um) at 13,400 rpm for 5 minutes.
Table 2. Representative composition of an hRNase 4/T4 PNK digestion mixture
Each sample was characterized by LC-MS/MS analysis as described in Example 2 with slight variations in the UHPLC gradient time and MS/MS parameters. A 25 -minute UHPLC gradient was applied, and MS/MS data was collected with a Thermo Scientific Q Exactive Plus Orbitrap Mass Spectrometer in Top-5 ddMS2 acquisition mode at a resolution of 35,000 with a normalized collision energy of 20% in negative ionization mode. Theoretical prediction of the cleavage products generated by digestion of FLuc IVT mRNA with either hRNase 4 or RNase T1 is shown in FIGURE 9. Complete digestion of FLuc IVT mRNA with hRNase 4 is predicted to produce a substantially higher sequence mapping (higher sequence coverage percentage) in comparison with RNaseTl. Notably, hRNase 4 is predicted to produce a high percentage of cleavage products with unique sequences, while RNase T1 is predicted to generate a high percentage of isomeric cleavage products.
EXAMPLE 5A: hRNase 4/PNK-based mass fingerprinting
The accurate determination of mass-to-charge (m/z) ratio of oligonucleotide cleavage products serves as a unique identifier of a particular RNA and allows identification of the unknown RNAs in a sample by matching the resulting oligonucleotide masses with the theoretical oligonucleotide masses of RNAs in a database (such as NCBI RefSeq).
Oligonucleotide mass fingerprinting was performed by deconvoluting raw intact MS data with ProMass software (Novatia LLC) and Avalon peak detection and integration algorithm (Thermo Fisher Scientific). Deconvoluted oligonucleotide masses detected in either the hRNase 4/T4 PNK condition or the RNaseTl condition were compared to a database of human transcripts (RefSeq) in which the FLuc IVT mRNA sequence was spiked in. The product of the proportion of total spectral intensity explained by theoretical masses and the proportion of theoretical oligonucleotides identified in the spectra from each transcript was calculated, hereafter referred to as the score of each transcript.
As shown in FIGURE 10 (upper panel), the value-based scoring analysis of the cleavage products generated by hRNase 4/T4 PNK permitted unambiguous identification of FLuc mRNA among all transcripts in the human transcriptome. These results indicate that mass fingerprint data produced upon digestion with hRNase 4/T4 PNK is sufficiently unique for identification of a particular transcript in the context of a human transcriptome database and have a lower identification background in comparison to mass fingerprints generated from digestion with RNaseTl (FIGURE 10, lower panel).
EXAMPLE 5B: hRNase 4/T4 PNK-based sequencing
For hRNase 4/T4 PNK-based sequencing, the identities of sequenced cleavage products were inferred utilizing the Nucleic Acid Search Engine (NASE) (Wein et al., 2020) in OpenMS (version: 2.6.0). The search was conducted utilizing a theoretical digestion of the sequence of FLuc IVT mRNA with the cleavage specificity of either hRNase 4 or RNaseTl and one missed cleavage at a 5% False Discovery Rate.
Digestion of FLuc IVT mRNA with either hRNase 4/T4 PNK or RNaseTl resulted in reproducible sequencing profiles (results from 3 independent experiments are shown in FIGURE 11, replicates 1-3). A total of 96 cleavage products were reproducibly observed in all hRNase 4/T4 PNK experiments and a total of 85 cleavage products were reproducibly observed in all RNase Tl experiments.
Specific differences in sequenced cleavage product length and total sequence coverage were observed in hRNase 4/T4 PNK digests in comparison to RNaseTl digests. The median length and upper maximum length of cleavage products from hRNase 4/T4 PNK treatment were longer than those from RNase T1 treatment (see FIGURE 12). In addition, total sequence coverages between 67.5% to 72.1% were obtained in hRNase 4/T4 PNK experiments, whereas coverages between 50.2% to 54.1% were obtained in RNase T1 experiments (see FIGURE 13). FIGURE 14 shows the FLuc mRNA sequence coverage after aggregating replicates of RNase T1 alone, hRNase 4/T4 PNK alone, or RNaseTl and hRNase 4/T4 PNK combined. Aggregation of triplicate FLuc IVT mRNA digests resulted in 75% sequence coverage with hRNase 4/T4 PNK condition and 55.8% with RNaseTl condition. Aggregation of FLuc IVT mRNA digests from combined hRNase 4/T4 PNK and RNaseTl experiments resulted in an improvement in sequence coverage to 89.3%. Parallel digestion using hRNase 4/T4 PNK and RNase T1 may be beneficial due to the complementary cleavage specificities presented by these endoribonucleases.
Taken together, hRNase 4/T4 PNK resulted in a distribution of longer cleavage products with a higher overall coverage of the FLuc mRNA sequence in comparison to RNase Tl. In addition, these data indicate that hRNase 4/T4 PNK offers a complementary alternative to conventional enzymatic tools such as RNase TL
EXAMPLE 6: MC1/T4 PNK-based sequencing an mRNA
This example shows that the composition embodying an RNA end-repair enzymes such as T4 PNK may be effectively extended to other endoribonucleases. The data presented here demonstrates the combination of T4 PNK with MCI, which is a uridine-specific
endoribonuclease that produces a mixture of 2',3'-cyclic-phosphate and 3 '-phosphate termini (Addepalli et al., 2015), for sequencing of FLuc IVT mRNA.
FLuc mRNA was prepared as described in Example 5. Digestion with MC1/T4 PNK was performed as follows: 5 pg of purified FLuc IVT mRNA was mixed with 3 M Urea in 1 x NEBuffer 1 (10 mM Bis-Tris-Propane-HCl, 10 mM MgCh, 1 mM DTT, pH 7). The mixture was heated to 90°C for 10 minutes and cooled to room temperature. The mixture was diluted 3-fold in a 1 x NEBuffer 1. Then 0.2 L of T4 PNK (20000 units) and 1 pL of ribonuclease MCI were added to the reaction mixture. RNA digestion was performed for 1 h at 37°C with shaking at 300 rpm. Digestions were performed in triplicate.
Cleavage products from each digestion replicate were subjected to analysis by LC- MS/MS and the resultant data processed for sequencing analysis as described in Example 5, utilizing a theoretical digestion of FLuc IVT mRNA with the reported uridine specificity of MCI (Gmnberg et al., 2021). In total, 84 unique cleavage products were reproducibly sequenced across replicates (see FIGURE 15) and an overall sequence coverage between 52- 55% was obtained across FLuc IVT mRNA digestions with MC1/T4 PNK (see FIGURE 16).
Taken together, the combination of MCI and T4 PNK yielded a reproducible sequencing profile of FLuc mRNA, demonstrating that other endoribonucleases (beyond hRNase 4) may be combined with T4 PNK.
EXAMPLE 7: hRNase 4/T4-PNK-based sequencing and fingerprinting of a human erythropoietin (Epo) mRNA
This example describes sequencing and fingerprinting of in vitro synthesized human erythropoietin (Epo) mRNA. Epo mRNA was in vitro transcribed either using canonical UTP, ATP, GTP, and CTP (herein referred as to U Epo mRNA or EpoU), or using mo5UTP replacing UTP to result in a Epo mRNA with full substitution of uridine with 5- methoxyuridine (herein referred as to mo5U Epo mRNA or EpomoU), or using ml YTP replacing UTP to result in a Epo mRNA with full substitution of uridine with 1-methyl- pseudouridine (herein referred as to ml Y Epo mRNA or EpomlY). The incorporation of specific modified uridine nucleotides, including mlY and mo5U has been shown to reduce immunogenicity and enhance translation/ stability of exogenously delivered IVT mRNAs (Kariko et al., 2008; Anderson et al., 2010; Parr et al., 2020; Li et al., 2011; Svitkin et al., 2017). Furthermore, Epo mRNA has been utilized to demonstrate the therapeutic potential of IVT mRNAs in the treatment of anemia (Kariko et al., 2012, Thess et al., 2015). Hence, Epo
mRNA fully modified with mo5U or mlY was utilized as a model system to assess the use of hRNase 4 in combination with T4 PNK to characterize putative therapeutic mRNAs.
5 pg of purified mo5U/ mlY/U Epo mRNAs were prepared and digested with hRNase 4/T4 PNK or RNaseTl essentially as described in Example 5 LC-MS/MS data was processed for mass fingerprinting and sequencing analysis as described in Example 5.
First, the specificity of each mass fingerprint in the context of a human transcriptome database supplemented with the synthetic Epo mRNA sequence was assessed. FIGURE 17 shows the score (as defined in Example 5) of each transcript relative to RNA length. The synthetic Epo mRNA sequence could be uniquely identified relative to all other human transcripts in each of hRNase 4/T4 PNK or RNase T1 conditions. However, RNase T1 data exhibited a substantially higher identification background relative to that of hRNase 4/T4 PNK.
Second, the MS/MS-based sequencing data was examined from digestion of each of mo5U/ mlY/U Epo mRNAs with either hRNase 4/T4 PNK or RNaseTl. FIGURE 18 shows the overall sequence coverage obtained in each digestion experiment. Digestion with hRNase 4/T4 PNK resulted in consistently higher sequence coverage (73-87%) relative to digestion with RNase T1 (50-61%) across all Epo mRNA substrates tested.
Taken together, these data support using a composition comprising hRNase 4 and T4 PNK for sequencing and fingerprinting therapeutic RNAs, including mRNAs comprising nucleotide modifications such as mo5U or mlY. With its ability to cleave uridine-based RNA modifications (e.g., mo5U, Y, methylpseudouridine (mlY) and 5 -methoxyuridine (mo5U)), hRNase 4 may be useful for applications in the analysis of mRNA-based medicines (e.g., mRNA vaccines and therapeutics).
EXAMPLE 8: hRNase 4/T4-PNK-based characterization of an Epo mRNA comprising a 5- prime m7GpppAm cap and 3-prime poly-adenosine (poly-A) tail]
This example describes the use of hRNase 4/T4 PNK for characterizing Epo mRNAs comprising a 5' terminal m7GpppAm cap and a 3' terminal 120-nt poly-adenosine (Poly-A). The presence of 5' cap and 3' poly-A tail structures may confer or imprive the stability and/or translation of an IVT mRNA upon introduction into mammalian cells and organisms.
The synthesis of a 5' capped m7GpppAm Epo mRNA was performed utilizing Clean Cap AG® technology (Henderson et al., 2021). A 3' 120-nt poly-A tail was introduced by encoding the tail sequence in the DNA template that was utilized for in vitro transcription. 10 pg of purified U Epo mRNA comprising a 5' cap and 3' poly-A tail were prepared and
digested with human hRNase 4/T4 PNK as described in Example 5. The purified U Epo mRNA from Example 7 (without both 5' cap and 3' poly-A tail) was used as a control.
First, the presumed oligonucleotides originating from the 5' end of both capped and uncapped Epo mRNAs were investigated. To this end, deconvoluted intact mass data were searched for the masses of all possible 5' cleavage products with three missed cleavages and variable addition of a monophosphate, diphosphate, triphosphate, “monomethyl” guanosine triphosphate or “dimethyl” guanosine triphosphate within a mass difference cutoff of 10 ppm. FIGURE 19 shows a summary of the intensities of 5' cleavage products detected with and without a 5' m7GpppAm cap. Consistent with the presence of 5' cap m7GpppAm structure, oligonucleotides comprising deconvoluted masses equivalent to a triphosphorylated guanosine and two methyl groups were detected only in the capped Epo mRNA digests.
Next, the presumed presence of a poly-A tails was examined in each Epo mRNA digest. To this end, cleavage products from each experiment were characterized by UHPLC analysis similar to the approach described in Example 2, which was modified with an increased 18-45% gradient of non-aqueous buffer. Notably, only the 3' poly-adenylated Epo mRNA samples exhibited a distinct chromatographic peak with higher retention, which was associated to the cleaved 3' poly-A tail sequence (see FIGURE 20).
Taken together, these data indicate that a composition of hRNase 4/T4 PNK is useful for characterization of the 5' cap and 3' poly-A tail structures in mRNAs.
EXAMPLE 9: hRNase 4/T4-PNK-based characterization of uridine-depleted variants of CLuc mRNA
This example describes the LC-MS/MS analysis of three highly similar uridine- depleted variants of CLuc (Cypridina luciferase) mRNA using a composition of hRNase 4/T4 PNK. Depletion of the number of uridines in an RNA template has been utilized as a strategy to reduce the immunogenicity of IVT mRNAs without the need for introduction of chemical modifications (Vaidyanathan, et al., 2018).
5 pg of each purified uridine-depleted variant of CLuc mRNA was prepared and digested with hRNase 4/T4 PNK and characterized by fingerprinting and sequencing as described in Example 5. A schematic representation of the depletion region in each of three uridine-depleted CLuc mRNAs (CLuc Ul, CLuc U2, and CLuc U3) is shown FIGURE 21.
First, the specificity of each mass fingerprint in the context of a human transcriptome database supplemented with each uridine-depleted CLuc mRNA sequence was assessed (see
FIGURE 21). The correct uridine-depleted CLuc mRNA substrate was uniquely identified in each digestion experiment by LC-MS fingerprinting analysis.
Next, the sequenced cleavage products detected in each hRNase 4/T4 PNK digest were assessed. FIGURE 22 shows the sequence coverage of each uridine-depleted mRNA substrate in each hRNase 4/T4 PNK digest. In each experiment, the correct uridine-depleted CLuc mRNA sequence exhibited a substantially higher sequence coverage relative to the others. In addition, the vast majority of sequenced cleavage products detected could be confidently attributed to the correct uridine-depleted CLuc mRNA in each experiment (see FIGURE 23).
Taken together, hRNase 4/T4 PNK may be utilized to discriminate between highly similar nucleotide-depleted substrates by both sequencing and fingerprinting techniques.
EXAMPLE 10: Isotopically labeling RNA oligonucleotides for quantification analysis
An example method to add one or more stable isotope mass labels to the 3'-end of dephosphorylated RNA oligonucleotides is described. RNA oligonucleotides are prepared by digestion of an RNA-of-interest using hRNase 4/T4 PNK or a related composition as described above. This labeling method may be useful for and/or combined with multiplexing and relative quantification of one or more RNA oligonucleotides.
First, a non -template directed RNA polymerase is utilized to add a single 3'-azido-3'- deoxy-nucleotidetriphosphate (NTP) to the 3'-end of one or more RNA oligonucleotides (see FIGURE 24A). Examples of a non-template directed RNA polymerase include, E. coli poly(A) polymerase, yeast poly(A) polymerase, DNA polymerase 0 or any non-template directed RNA polymerase which accepts 3'-azido-3'-deoxy-NTPs as a substrate. Next, a Dibenzocyclooctyne (DBCO)-derived amino acid or peptide conjugate mass label is added to the 3'-terminal 3'- azido-3 '-deoxy -nucleotide residue utilizing copper-free (also referred as to strain-promoted) click chemistry (Baskin et al., 2007) as shown in FIGURE 24B and FIGURE 24C. Examples of DBCO conjugate mass labels involving an amino acid and a dipeptide are shown in FIGURE 24B.
Mass label conjugates may be utilized to produce “heavy” and “light” variants of differential isotopic composition for the comparison between two experimental conditions, for example for the analysis of 5’-capped and uncapped mRNAs. One sample (e.g., RNA oligonucleotides generated by digestion of an uncapped mRNA) is labeled with a “light” version of a mass tag as described above. The other sample (e.g., RNA oligonucleotides generated by digestion of a mRNA whose capping percentage is unknown) is labeled with a
version of the same tag that comprises a “heavy” isotope. After labeling, the samples are combined and analyzed within the same experiment. Identical oligonucleotides from each sample co-elute as pairs of peaks and may be distinguished by the mass difference between the “heavy” and “light” isotope content. By establishing the ratio of signal intensities of oligonucleotides from the uncapped mRNA sample relative to the corresponding oligonucleotides from the sample whose capping percentage is unknown, it is possible to accurately determine the levels of capping in the unknown sample. Identification and quantitation of the relevant 5’ end oligonucleotides are performed by a combination of intact MS and MS/MS fragment analyses. This approach may be extended to quantify other features of RNA substrates, such as RNA modification analysis. The mass tag concept is not restricted to isotopically labeled amino acids and could be extended as to other molecule classes (i.e., the group R in FIGURE 24B could comprise a keto acid, a lipid, a carbohydrate, etc.).
DBCO dipeptide conjugates may be utilized to produce isobaric dipeptide (or polypeptide) mass tags. In this approach, each tag has the same molecular mass, but the positions of the “heavy” and “light” isotopes are distributed within the peptide. This is achieved, for instance, by placing combinations of 13C and 15N heavy isotopes at different positions for each tag, so that the total number of isotopes is constant for all tags, thus creating distinct reporter and balancing regions. Upon fragmentation, such as with a HCD (Higher- energy C-trap Dissociation; a collision-induced dissociation technique specific to the orbitrap mass spectrometer), reporter and balancing peptide fragments may be distinguished, and the identity and quantity of each mass tag determined. Samples labeled with distinct peptide conjugate mass labels may be multiplexed and compared by LC-MS/MS analysis as described in Example 2. This approach permits multiplexed analysis of several RNA features in the same experiment (for instance, the simultaneous analysis of multiple RNA modifications in a given mRNA).
In some embodiments, methods and workflows may include fragmentation of peptide- RNA nucleoside conjugates utilizing high energy collision dissociation (HCD). Labeling RNA oligonucleotides with isobaric tags with distinct stable isotopic distributions is an example of an approach that may be utilized to enable multiplexing and relative quantification of RNA oligonucleotides between different experimental samples. Various molecular scaffolds may be utilized as isobaric tags, such as amino acids, keto acids, fatty acids, diamines, amino alcohols, carbohydrates, and dipeptides among others. Dipeptides may be cleavable by fragmentation modalities such as HCD, at the amide bond between the N-terminal and C-terminal amino acids
to produce an amino acid reporter anion. The amino acid reporter anion fragment may comprise a defined set of heavy and light isotopes in its composition so that differential isotopic composition between the isobaric tag and the reporter fragment distinguish and relatively quantitate labeled oligonucleotide from distinct samples and/or experimental conditions (e g., capped and uncapped mRNAs),
A 3'-azido-3'-deoxyadenosine nucleoside was independently conjugated to three amino acid/dipeptide labels by copper-free click chemistry reaction with DBCO-alanine, DBCO- alanine-phenylalanine or DBCO-alanine-Proline. FIGURES 24D-24F show the fragmentation pattern by HCD (with a normalized HCD of 30%) of the three 3 '-azido-3 '-deoxyadenosine derived peptide-nucleoside conjugates — including 3 '-azido-3 '-deoxyadenosine conjugated to DBCO-alanine (FIGURE 24D), to DBCO-alanine-phenylalanine (FIGURE 24E) and to DBCO-alanine-Proline (FIGUE 24F). The single amino acid conjugate formed by the reaction of 3 '-azido-3 '-deoxy adenosine with DBCO-alanine was used as a model for the fragmentation studies. The main anions detected upon fragmentation of the nucleosi de-alanine conjugate were derived from the adenosine nucleobase (denoted as A-base for simplicity), triazole-DBCO (denoted as DBCO for simplicity), and triazole-DBCO-alanine (denoted as DBCO-Ala for simplicity) fragments. The fragmentation of the dipeptide conjugates formed by the reaction of 3 '-azido-3 '-deoxyadenosine with DBCO-alanine-phenylalanine or with DBCO-alanine-Proline dipeptides is shown in FIGURE 24E and FIGUE 24F. Notably, fragmentation between the N- terminal and C-terminal amino acids for each of those dipeptide conjugates produced the intended amino acid reporter anion: phenylalanine (Phe) and proline (Pro) derived anions, respectively. These data suggest that this is viable strategy to generate discrete reporter anions from nucleoside-peptides conjugates. By incorporating a defined set of heavy stable isotopes in a dipeptide such as those presented in this example, isobaric tags can be thus generated, and the corresponding reporter anions applied to oligonucleotide quantification.
EXAMPLE 11 : Inhibition of hRNase 4 activity with human placental RNase Inhibitor
Shapiro et al., 1986 described the inhibition of human tumor cell secreted RNases by human placental RNase inhibitor. Herein the ability of human placental RNase inhibitor to inhibit the endoribonuclease activity of hRNase 4 was investigated. Briefly, 1 pL of a 1:10 dilution of hRNase 4 preincubated for 15 minutes at room temperature with 1 pL human placental RNase inhibitor (NEB, Cat # M0307S) in 1 x NEBuffer 1. Next, a target RNA oligonucleotide (SEQ ID NO: 23; AAGAGAGAUAGAGAA) containing a single hRNase 4 cleavage site was added to the mixture and the reaction was incubated for 15 minutes at 37°C.
Robust cleavage of the target oligonucleotide by hRNase 4 was observed in the absence of human placental RNase inhibitor (FIGURE 26). However, cleavage of the target oligonucleotide was inhibited following preincubation of hRNase 4 with human placental RNase inhibitor. hRNase 4 activity was inhibited by human placental RNase inhibitor in the presence of 1 M urea, which helps unfold substrate RNA secondary structures.
EXAMPLE 12: Targeted substrate protection and hRNase 4 cleavage for mRNA capping analysis
This example describes an example of methods for analysis of mRNA capping. FIGURE 27 shows an example workflow illustrating this method. Briefly, a capped RNA substrate and a 5 ’-biotinylated DNA probe which is complementary to at least a portion of the capped RNA substrate (e.g., a segment of interest) are annealed to form an RNAZDNA duplex. The duplex and an enzyme composition (e.g., comprising hRNase 4 and optionally an RNA end repair enzyme) are combined to form a cleaved DNA-RNA hybrid duplex and one or more single-stranded RNA fragments of the RNA substrate. The cleaved DNA-RNA hybrid duplex may then be affinity purified (e.g., using streptavidin magnetic beads). The remaining portion of the RNA substrate included in the purified DNA-RNA hybrid duplex may be eluted, for example, by contacting the purified DNA-RNA hybrid duplex with a DNase I.
In this example, a 30-nt DNA probe sequence (SEQ ID NO: 24; /Biotin/GAGCTTCTGCAAAAAGAACAAGCAAGCCCT) was hybridized to the 5 ’-terminal sequence of a 5' m7GpppAm capped EPO mRNA (as illustrated in FIGURE 28A) utilizing a touchdown hybridization approach (heating to 95 °C for 2 minutes, followed by slowly cooling to 22°C at 0.1°C/s) in lx NEBuffer 1 supplemented with 3 M urea. The hybridized mRNA solution was diluted to 1 M urea in NEBuffer 1 and a composition of hRNase 4/T4 PNK was added. The mixture was incubated at 37°C for 1.5 hours. Digestion was stopped by addition of human placental RNase inhibitor. Next, the resulting duplex comprising the 5'-biotinylated DNA probe and the corresponding hybridized RNA oligonucleotide was purified utilizing streptavidin magnetic beads. The hybridized RNA was eluted by incubation with DNase I at 37°C. The isolated RNA oligonucleotide was characterized by LC-MS/MS. Comparative experiments were performed in the absence of either the DNA probe or hRNase 4/T4 PNK.
Analysis of the isolated RNA oligonucleotide by LC-MS indicated a single prominent chromatographic peak (FIGURE 28B, top panel), whose identity was confirmed by mass spectrometry analysis (FIGURE 28C). Notably, no corresponding chromatographic peak was detected in purifications performed in the absence of the DNA probe or in the absence of
hRNase 4/T4 PNK (FIGURE 28B, middle and lower panels). FIGURE 28C shows the deconvoluted mass of the RNA 35mer oligonucleotide corresponding to the 5 ’-terminal segment of EPO mRNA (SEQ ID NO: 25;
AGGGCUUGCUUGUUCUUUUUGCAGAAGCUCAGAAU) comprising 2 methyl groups and a guanine-triphosphate moiety, consistently with presence of a 5' m7GpppAm cap structure. This data suggests that hRNase 4 digestion is prevented in the region of a DNA-RNA hybrid duplex. Thus, the isolated RNA oligonucleotide product comprises the sequence that is “protected” by the DNA probe plus any subsequent ribonucleotides at the 3’ end preceding an hRNase 4 ‘UR’ cutting site (indicated by the arrow in FIGURE 28A). A DNA/RNA duplex, in some embodiments, may comprise a DNA probe and an RNA substrate longer than the DNA probe, wherein the RNA substrate has single-stranded overhangs at both the 5’ and 3’ ends. Protecting an internal oligoribonucleotide segment of a given RNA substrate by hybridization with a DNA probe that leaves 5’ and 3’ overhangs may limit cleavage by hRNase 4 to the 5’ and 3’ UR sites that are nearest to the DNA probe-RNA substrate duplex.
Results shown in FIGURE 28B and FIGURE 28C demonstrate that protection of a portion of an RNA substrate from the action of a single-stranded nucleotide-specific endoribonuclease (e g., hRNase 4) by hybridization with a complementary affinity tagged DNA probe (e.g., shorter than the RNA substrate) can be used to selectively isolate and analyze features of the protected portion, such as a cap structure and/or any modifications that are present. In some embodiments, the methods illustrated in this example may include contacting an RNA substrate with multiple biotinylated-DNA probes targeting different portions of the RNA substrate, permitting simultaneous analysis of such portions. Disclosed methods may be applied to RNA modification analysis, such as RNA identification, locating an RNA within a sequence, assessing RNA stoichiometry, detecting RNA presence, permanence, and/or dynamics (i.e., installation and removal), and detecting co-existence of RNA modifications.
Table 3. Sequences of IVT mRNAs used in this study. mRNA Input Sequence
FLuc 10 pg GGGUCUAGAAAUAAUUUUGUUUAACUUUAAGAAGGAGAUAUAAC
(SEQ ID CAUGAAAAUCGAAGAAGGUAAAGGUCACCAUCACCAUCACCACG
NO: 26) GAUCCAUGGAAGACGCCAAAAACAUAAAGAAAGGCCCGGCGCCA
UUCUAUCCUCUAGAGGAUGGAACCGCUGGAGAGCAACUGCAUAA GGCUAUGAAGAGAUACGCCCUGGUUCCUGGAACAAUUGCUUUUA CAGAUGCACAUAUCGAGGUGAACAUCACGUACGCGGAAUACUUC GAAAUGUCCGUUCGGUUGGCAGAAGCUAUGAAACGAUAUGGGCU GAAUACAAAUCACAGAAUCGUCGUAUGCAGUGAAAACUCUCUUC
AAUUCUUUAUGCCGGUGUUGGGCGCGUUAUUUAUCGGAGUUGCA
GUUGCGCCCGCGAACGACAUUUAUAAUGAACGUGAAUUGCUCAA
CAGUAUGAACAUUUCGCAGCCUACCGUAGUGUUUGUUUCCAAAA
AGGGGUUGCAAAAAAUUUUGAACGUGCAAAAAAAAUUACCAAUA
AUCCAGAAAAUUAUUAUCAUGGAUUCUAAAACGGAUUACCAGGG
AUUUCAGUCGAUGUACACGUUCGUCACAUCUCAUCUACCUCCCGG
UUUUAAUGAAUACGAUUUUGUACCAGAGUCCUUUGAUCGUGACA
AAACAAUUGCACUGAUAAUGAAUUCCUCUGGAUCUACUGGGUUA
CCUAAGGGUGUGGCCCUUCCGCAUAGAACUGCCUGCGUCAGAUUC
UCGCAUGCCAGAGAUCCUAUUUUUGGCAAUCAAAUCAUUCCGGA
UACUGCGAUUUUAAGUGUUGUUCCAUUCCAUCACGGUUUUGGAA
UGUUUACUACACUCGGAUAUUUGAUAUGUGGAUUUCGAGUCGUC
UUAAUGUAUAGAUUUGAAGAAGAGCUGUUUUUACGAUCCCUUCA
GGAUUACAAAAUUCAAAGUGCGUUGCUAGUACCAACCCUAUUUU
CAUUCUUCGCCAAAAGCACUCUGAUUGACAAAUACGAUUUAUCU
AAUUUACACGAAAUUGCUUCUGGGGGCGCACCUCUUUCGAAAGA
AGUCGGGGAAGCGGUUGCAAAACGCUUCCAUCUUCCAGGGAUAC
GACAAGGAUAUGGGCUCACUGAGACUACAUCAGCUAUUCUGAUU
ACACCCGAGGGGGAUGAUAAACCGGGCGCGGUCGGUAAAGUUGU
UCCAUUUUUUGAAGCGAAGGUUGUGGAUCUGGAUACCGGGAAAA
CGCUGGGCGUUAAUCAGAGAGGCGAAUUAUGUGUCAGAGGACCU
AUGAUUAUGUCCGGUUAUGUAAACAAUCCGGAAGCGACCAACGC
CUUGAUUGACAAGGAUGGAUGGCUACAUUCUGGAGACAUAGCUU
ACUGGGACGAAGACGAACACUUCUUCAUAGUUGACCGCUUGAAG
UCUUUAAUUAAAUACAAAGGAUAUCAGGUGGCCCCCGCUGAAUU
GGAAUCGAUAUUGUUACAACACCCCAACAUCUUCGACGCGGGCG
UGGCAGGUCUUCCCGACGAUGACGCCGGUGAACUUCCCGCCGCCG
UUGUUGUUUUGGAGCACGGAAAGACGAUGACGGAAAAAGAGAUC
GUGGAUUACGUCGCCAGUCAAGUAACAACCGCGAAAAAGUUGCG
CGGAGGAGUUGUGUUUGUGGACGAAGUACCGAAAGGUCUUACCG
GAAAACUCGACGCAAGAAAAAUCAGAGAGAUCCUCAUAAAGGCC
AAGAAGGGCGGAAAGUCCAAACUCGAGUAAGGUUAACCUGCAGG
AGG
EPO 3 ng GGGGCUUGCUUGUUCUUUUUGCAGAAGCUCAGAAUAAACGCUCA (SEQ ID ACUUUGGCACCAUGGGAGUGCACGAGUGUCCCGCGUGGUUGUGG NO: 27) UUGCUGCUGUCGCUCUUGAGCCUCCCACUGGGACUGCCUGUGCUG
GGGGCACCACCCAGAUUGAUCUGCGACUCACGGGUACUUGAGAG
GUACCUUCUUGAAGCCAAAGAAGCCGAAAACAUCACAACCGGAU
GCGCCGAGCACUGCUCCCUCAAUGAGAACAUUACUGUACCGGAUA
CAAAGGUCAAUUUCUAUGCAUGGAAGAGAAUGGAAGUAGGACAG
CAGGCCGUCGAAGUGUGGCAGGGGCUCGCGCUUUUGUCGGAGGC
GGUGUUGCGGGGUCAGGCCCUCCUCGUCAACUCAUCACAGCCGUG
GGAGCCCCUCCAACUUCAUGUCGAUAAAGCGGUGUCGGGGCUCCG
CAGCUUGACGACGUUGCUUCGGGCUCUGGGCGCACAAAAGGAGG
CUAUUUCGCCGCCUGACGCGGCCUCCGCGGCACCCCUCCGAACGA
UCACCGCGGACACGUUUAGGAAGCUUUUUAGAGUGUACAGCAAU
UUCCUCCGCGGAAAGCUGAAAUUGUAUACUGGUGAAGCGUGUAG
GACAGGGGAUCGCUAGGACUGACUAGGAUCUGGUUACCACUAAA
CCAGCCUCAAGAACACCCGAAUGGAGUCUCUAAGCUACAUAAUAC
CAACUUACACUUUACAAAAUGUUGUCCCCCAAAAUGUAGCCAUU
CGUAUCUGCUCCUAAUAAAAAGAAAGUUUCUUCACAUUCUAGCU
AGC
ClucUl GGGAGACCCAAGCUUGGUACCGAGCUCGGAUCCGCCACCAUGAAG (SEQ ID ACCCUGAUCCUGGCCGUGGCCCUGGUGUACUGCGCCACCGUGCAC NO: 28) UGCCAGGACUGCCCAUACGAACCAGACCCCCCGAACACCGUGCCA
ACCAGCUGCGAGGCCAAGGAAGGCGAGUGCAUCGACAGCAGCUG
CGGCACCUGCACCAGAGACAUCCUGAGCGACGGCCUGUGCGAGAA
CAAGCCGGGAAAGACAUGCUGCCGGAUGUGCCAGUACGUGAUCG
AGUGCAGAGUGGAGGCCGCAGGAUGGUUCCGGACCUUCUACGGC
AAGAGAUUCCAGUUCCAAGAGCCCGGCACAUACGUGCUGGGCCA
GGGAACCAAGGGCGGCGACUGGAAAGUGAGCAUCACCCUGGAGA
ACCUCGACGGCACCAAAGGCGCCGUGCUGACAAAGACAAGACUGG
AAGUCGCCGGCGACAUCAUCGACAUCGCGCAGGCCACCGAGAACC
CCAUCACCGUGAACGGAGGCGCCGACCCCAUAAUCGCCAACCCCU
ACACAAUCGGCGAAGUGACAAUCGCCGUCGUGGAAAUGCCAGGC
UUCAACAUCACCGUCAUUGAGUUCUUCAAACUGAUCGUGAUCGA
CAUCCUCGGAGGAAGAUCUGUAAGAAUCGCCCCAGACACAGCAA
ACAAAGGAAUGAUCUCUGGCCUCUGUGGAGAUCUUAAAAUGAUG
GAAGAUACAGACUUCACUUCAGAUCCAGAACAACUCGCUAUUCA
GCCUAAGAUCAACCAGGAGUUUGACGGUUGUCCACUCUAUGGAA
AUCCUGAUGACGUUGCAUACUGCAAAGGUCUUCUGGAGCCGUAC
AAGGACAGCUGCCGCAACCCCAUCAACUUCUACUACUACACCAUC
UCCUGCGCCUUCGCCCGCUGUAUGGGUGGAGACGAGCGAGCCUCA
CACGUGCUGCUUGACUACAGGGAGACGUGCGCUGCUCCCGAAACU
AGAGGAACCUGCGUUUUGUCUGGACAUACUUUCUACGAUACAUU
UGACAAAGCAAGAUACCAAUUCCAGGGUCCCUGCAAGGAGAUUC
UUAUGGCCGCCGACUGUUUCUGGAACACUUGGGAUGUGAAGGUU
UCACACAGGAAUGUUGACUCUUACACUGAAGUAGAGAAAGUACG
AAUCAGGAAACAAUCGACUGUAGUAGAACUCAUUGUUGAUGGAA
AACAGAUUCUGGUUGGAGGAGAAGCCGUGUCCGUCCCGUACAGC
UCUCAGAACACUUCCAUCUACUGGCAAGAUGGUGACAUACUGAC
UACAGCCAUCCUACCUGAAGCUCUGGUGGUCAAGUUCAACUUCA
AGCAACUGCUCGUCGUACAUAUUAGAGAUCCAUUCGAUGGUAAG
ACUUGCGGUAUUUGCGGUAACUACAACCAGGAUUUCAGUGAUGA
UUCUUUUGAUGCUGAAGGAGCCUGUGAUCUGACCCCCAACCCACC
GGGAUGCACCGAAGAACAGAAACCUGAAGCUGAACGACUCUGCA
AUAGUCUCUUCGCCGGUCAAAGUGAUCUUGAUCAGAAAUGUAAC
GUGUGCCACAAGCCUGACCGUGUCGAACGAUGCAUGUACGAGUA
UUGCCUGAGGGGACAACAGGGUUUCUGUGACCACGCAUGGGAGU
UCAAGAAAGAAUGCUACAUAAAGCAUGGAGACACCCUAGAAGUA
CCAGAUGAAUGCAAAUAGGC
ClucU2 5 pig GGGAGACCCAAGCUUGGUACCGAGCUCGGAUCCGCCACCAUGAAG (SEQ ID ACCUUAAUUCUUGCCGUUGCAUUAGUCUACUGCGCCACUGUUCA NO: 29) UUGCCAGGACUGUCCUUACGAACCUGAUCCACCAAACACAGUUCC
AACUUCCUGUGAAGCUAAAGAAGGAGAAUGUAUUGAUAGCAGCU
GUGGCACCUGCACGAGAGACAUACUAUCAGAUGGACUGUGUGAA
AAUAAACCAGGAAAAACAUGUUGCCGAAUGUGUCAGUAUGUAAU
UGAAUGCAGAGUAGAGGCCGCAGGAUGGUUUAGAACAUUCUAUG
GAAAGAGAUUCCAGUUCCAGGAACCUGGUACAUACGUGUUGGGU
CAAGGAACCAAGGGCGGCGACUGGAAGGUGUCCAUCACCCUGGA
GAACCUGGAUGGAACCAAGGGGGCUGUGCUGACCAAGACAAGAC
UGGAAGUGGCUGGAGACAUCAUUGACAUCGCUCAAGCUACUGAG
AAUCCCAUCACUGUAAACGGUGGAGCUGACCCUAUCAUCGCCAAC
CCGUACACCAUCGGCGAGGUCACCAUCGCUGUUGUUGAGAUGCCA
GGCUUCAACAUCACAGUGAUCGAAUUCUUCAAGCUGAUCGUGAU
CGACAUACUGGGCGGACGGAGCGUGCGCAUCGCCCCAGACACCGC
GAACAAGGGCAUGAUCAGCGGCCUGUGCGGAGACCUGAAGAUGA
UGGAGGACACCGACUUCACCAGCGACCCCGAGCAGCUGGCCAUCC
AGCCAAAAAUCAACCAGGAAUUCGACGGCUGCCCCCUGUACGGAA
ACCCCGACGACGUGGCCUACUGCAAAGGCCUGCUCGAGCCGUACA
AGGACAGCUGCAGAAACCCCAUCAACUUCUACUACUACACCAUCA
GCUGCGCCUUCGCCAGGUGCAUGGGCGGCGACGAAAGAGCCAGCC
ACGUCCUGCUGGACUACAGAGAAACCUGCGCCGCCCCGGAGACAC
GGGGCACCUGCGUGCUGAGCGGCCACACCUUCUACGACACAUUCG
ACAAGGCACGGUACCAGUUCCAGGGCCCAUGCAAGGAGAUCCUG
AUGGCCGCCGACUGCUUCUGGAACACCUGGGACGUGAAGGUGAG
CCACAGAAACGUCGACAGCUACACAGAGGUGGAGAAGGUGAGAA
UCAGAAAACAGAGCACAGUGGUGGAACUGAUCGUGGACGGCAAG
CAAAUUCUGGUUGGAGGAGAAGCCGUGUCCGUCCCGUACAGCUC
UCAGAACACUUCCAUCUACUGGCAAGAUGGUGACAUACUGACUA
CAGCCAUCCUACCUGAAGCUCUGGUGGUCAAGUUCAACUUCAAGC
AACUGCUCGUCGUACAUAUUAGAGAUCCAUUCGAUGGUAAGACU
UGCGGUAUUUGCGGUAACUACAACCAGGAUUUCAGUGAUGAUUC
UUUUGAUGCUGAAGGAGCCUGUGAUCUGACCCCCAACCCACCGGG
AUGCACCGAAGAACAGAAACCUGAAGCUGAACGACUCUGCAAUA
GUCUCUUCGCCGGUCAAAGUGAUCUUGAUCAGAAAUGUAACGUG
UGCCACAAGCCUGACCGUGUCGAACGAUGCAUGUACGAGUAUUG
CCUGAGGGGACAACAGGGUUUCUGUGACCACGCAUGGGAGUUCA
AGAAAGAAUGCUACAUAAAGCAUGGAGACACCCUAGAAGUACCA
GAUGAAUGCAAAUAGGC
ClucU3 5 pig GGGAGACCCAAGCUUGGUACCGAGCUCGGAUCCGCCACCAUGAAG (SEQ ID ACCUUAAUUCUUGCCGUUGCAUUAGUCUACUGCGCCACUGUUCA NO: 30) UUGCCAGGACUGUCCUUACGAACCUGAUCCACCAAACACAGUUCC
AACUUCCUGUGAAGCUAAAGAAGGAGAAUGUAUUGAUAGCAGCU
GUGGCACCUGCACGAGAGACAUACUAUCAGAUGGACUGUGUGAA
AAUAAACCAGGAAAAACAUGUUGCCGAAUGUGUCAGUAUGUAAU
UGAAUGCAGAGUAGAGGCCGCAGGAUGGUUUAGAACAUUCUAUG
GAAAGAGAUUCCAGUUCCAGGAACCUGGUACAUACGUGUUGGGU
CAAGGAACCAAGGGCGGCGACUGGAAGGUGUCCAUCACCCUGGA
GAACCUGGAUGGAACCAAGGGGGCUGUGCUGACCAAGACAAGAC
UGGAAGUGGCUGGAGACAUCAUUGACAUCGCUCAAGCUACUGAG
AAUCCCAUCACUGUAAACGGUGGAGCUGACCCUAUCAUCGCCAAC
CCGUACACCAUCGGCGAGGUCACCAUCGCUGUUGUUGAGAUGCCA
GGCUUCAACAUCACCGUCAUUGAGUUCUUCAAACUGAUCGUGAU
CGACAUCCUCGGAGGAAGAUCUGUAAGAAUCGCCCCAGACACAGC
AAACAAAGGAAUGAUCUCUGGCCUCUGUGGAGAUCUUAAAAUGA
UGGAAGAUACAGACUUCACUUCAGAUCCAGAACAACUCGCUAUU
CAGCCUAAGAUCAACCAGGAGUUUGACGGUUGUCCACUCUAUGG
AAAUCCUGAUGACGUUGCAUACUGCAAAGGUCUUCUGGAGCCGU
ACAAGGACAGCUGCCGCAACCCCAUCAACUUCUACUACUACACCA
UCUCCUGCGCCUUCGCCCGCUGUAUGGGUGGAGACGAGCGAGCCU
CACACGUGCUGCUUGACUACAGGGAGACGUGCGCUGCUCCCGAAA
CUAGAGGAACCUGCGUUUUGUCUGGACAUACUUUCUACGAUACA
UUUGACAAAGCAAGAUACCAAUUCCAGGGUCCCUGCAAGGAGAU
UCUUAUGGCCGCCGACUGUUUCUGGAACACUUGGGAUGUGAAGG
UUUCACACAGGAAUGUUGACUCUUACACUGAAGUAGAGAAAGUA
CGAAUCAGGAAACAAUCGACUGUAGUAGAACUCAUUGUUGAUGG
AAAACAGAUCCUGGUGGGCGGCGAAGCCGUGAGCGUGCCAUACA
GCAGCCAAAACACCAGCAUCUACUGGCAGGACGGCGACAUCCUGA
CAACCGCCAUCCUGCCCGAGGCACUGGUGGUGAAGUUCAACUUCA
AACAGCUGCUGGUGGUCCACAUCAGAGACCCCUUCGACGGCAAGA
CAUGCGGAAUCUGCGGCAACUACAACCAGGACUUCAGCGACGACA
GCUUCGACGCCGAGGGCGCCUGCGACCUGACCCCCAACCCGCCCG
GCUGCACCGAGGAACAGAAGCCAGAGGCCGAAAGACUGUGCAAC
AGCCUCUUCGCCGGACAGAGCGACCUGGACCAGAAGUGCAACGUG
UGCCACAAACCGGACAGAGUGGAACGGUGCAUGUACGAAUACUG
CCUGCGGGGCCAGCAGGGAUUCUGCGACCACGCCUGGGAGUUCAA
GAAGGAGUGCUACAUCAAGCACGGCGACACCCUGGAGGUGCCAG
ACGAGUGCAAGUAGGC
EXAMPLE 13: DNA probe-directed RNA cleavage with site-specific ribonucleases
Targeted cleavage of an RNA substrate with site-specific ribonucleases may be directed by hybridization of the RNA substrate with complementary DNA probe(s). For example, an RNA substrate may be annealed to one or more DNA probes, each complementary to one or more sequences of interest within the RNA substrate sequence, forming one or more DNA/RNA duplex segments. Segments comprising DNA/RNA duplexes may be cleaved upstream and/or downstream of the double-stranded region, for example, using a ribonuclease capable of cleaving single-stranded RNA, optionally in the presence of a repair enzyme. In some instances, the RNA cleavage may occur at nucleotide positions within the internal edges
of the double-stranded region (presumably due to local conformation fluctuations, also referred as to breathing or fraying, that may form transient single-stranded regions; or by the action the ribonuclease itself). Resultant products of ribonuclease digestion, whether cleaved DNA/RNA duplex segments or cleaved single-stranded segments, or both, may be assessed by LC-MS/MS analysis. Optional steps of isolation of the cleaved DNA/RNA duplex segments may be employed prior to LC-MS/MS analysis, such as capturing the cleaved DNA/RNA duplex segments by means of affinity enrichment followed by selective elution of the corresponding RNA strand.
FIGURE 29 shows example results of an assay to assess the cleavage of an RNA substrate (SEQ ID NO: 31;
GGGACUCUAACUAUGUCAAUCGCCGUGAUGUAAUUAUCGC) hybridized to a DNA probe (SEQ ID NO: 32; ATTGACATAGTTAGAGTCCC). In this example, the first 20 nucleotides of a 40mer RNA sequence were hybridized with a complementary 20mer DNA probe to form at least partially duplex DNA/RNA polynucleotides, which were then cleaved using one of several ribonucleases. Hybridization of the RNA substrate and DNA probe was conducted by heating to 80°C for 2 minutes, followed by slowly cooling at 0.1°C/s to 22°C to form a DNA/RNA duplex solution. Next, the hybridized DNA/RNA duplex solution was diluted in a reaction buffer appropriate for each ribonuclease. TABLE 4 shows the ribonucleases and reaction buffers used in this example. The ribonucleases included three sitespecific RNases (hRNase 4, MCI and RNase Tl) and two ribonucleases with poor specificity (RNase A and RNase Ir). A composition of 1 pL of a 5-fold dilution series of each RNase was added to the reaction mixture. T4 PNK (1 :75 dilution) (400,000U/mL) was added to the hRNase 4, MCI and RNase Tl reaction mixtures. Each mixture was heated for 30 minutes at 37°C. The resultant mixture was characterized by LC-MS/MS. Comparative experiments were performed in the absence of either the DNA probe or ribonuclease.
RNA cleavage products were classified as “protected products” if they were associated with limited cleavage of the hybridized DNA/RNA duplex, including products with 3'- overhangs, 3 '-overhangs in combination with 5'-reccessed ends (less than 4nt internal to the hybrid duplex), blunt ends and 3 '-reccessed ends (less than 4nt internal to the hybrid duplex) with respect to the hybridized DNA/RNA duplex. Products classified as “internal products” refer to those RNA cleavage products resulting from one or more cleavage events within the DNA/R A duplex (greater than 4nt internally from either end of the hybrid duplex). Products classified as “external products” refer to those cleavage products resulting from cleavage events only in the unhybridized (single stranded) regions of the RNA substrate.
The tested RNases produced different levels of protected products following DNA/RNA hybridization and cleavage. The plurality of protected products in digests with sitespecific RNases (hRNase 4, MCI and RNase Tl) exhibited well-defined 3 '-overhangs terminating at the respective recognition site immediately following the hybridized DNA/RNA duplex (Figure 29). In contrast, the less-specific RNases (RNase A and RNase If) yielded a mixture of products with variable 3 '-recessed ends and 3 '-overhangs relative to the DNA hybridized region. The sequences of the most abundant protected products and their positions are shown in the right panel (light gray).
As such, RNA cleavage with site-specific RNases can be directed to predictable and well-defined sites by DNA probe hybridization. Data shown demonstrate that the 5' and 3' heterogenicity in the resulting cleavage products is a function of ribonuclease utilized in the protection assay. At higher ribonuclease concentrations, the product heterogenicity may increase at different levels for different ribonucleases.
EXAMPLE 14: Varying DNA probe lengths in DNA probe-directed RNA cleavage with site specific RNases
FIGURE 30A shows an example experiment to examine how cleavage of a DNA probe- hybridized RNA substrate changes with varying DNA probes. In this example, the 40mer RNA of Example 13 was hybridized with one of a sequential series of DNA probes ranging from 22 to 30 nucleotides in length. The sequences of the DNA probes were designed to be complementary to the 5' end of the RNA substrate sequence (TABLE 5.). Hybridization and
RNase digestion with a composition of 1 pL of hRNase 4 (1:75 dilution), RNase T1 (1 :100 dilution) or MCI (1:25 dilution), each in combination with T4 PNK (1 :75 dilution) (400,000U/mL), were performed as described in Example 13.
TABLE 5. DNA probe sequences used to assess DNA probe-directed RNA cleavage with site specific RNases
FIGURE 30B shows the cleavage pattern of each site-specific ribonuclease at and around the varying DNA/RNA duplex regions. All site-specific RNases produced cleavage products primarily with 3 '-overhangs or blunt ends terminating at the first respective recognition site immediately downstream of the DNA probe hybridized region (FIGURE 3 OB, bottom panel). With the increase in probe length, a noticeably transition to the formation of longer protected products was observed, which were a result of cleavage at subsequent recognition sites downstream of the DNA probe hybridized region. Upon DNA probe hybridization overlapping a given recognition site a mixture of cleavage products from cleavage at successive downstream and upstream recognition sites was observed. Taken together, these data indicate that site-specific RNases can be utilized to generate predictable and well-defined cleavage products by DNA hybridization. Well-defined cleavage products may be correlated with the protection of regions of interest within the RNA substrate
through hybridization with a DNA probe. Notably, hRNase 4 appears to show less cleavage product heterogenicity for a wider range of DNA probes.
EXAMPLE 15: Comparing the use of hRNase 4 and RNase H in ribonuclease protection assays
RNase H may be used for analysis of synthetic mRNA 5' cap incorporation and cleaves RNA substrate at adjacent phosphodiester bonds 5' and 3' to the RNA hybridized to the 5' deoxynucleotide of an DNA-RNA chimera probe. However, RNase H may also cleave one or more nucleotides away from 5' and 3 ' of the target site, giving rise to multiple cleavage products that differ by one or more nucleotides, thereby complicating product analysis by electrophoresis or LC-MS/MS. As such, extensive optimization of the DNA-RNA chimera probe is usually required to achieve uniform cleavage of an RNA substrate at predetermined sites. This example shows a comparison between the specificities of hRNase 4 and RNase H to cleave an RNA substrate in vitro at a pre-defined site at or near a double-stranded segment generated by hybridizing a complementary probe to the 5’ end of the target RNA substrate.
In this example, 10 pg of a synthetic FLuc mRNA transcript (Seq ID NO:26) were first capped with a Faustovirus Capping Enzyme (FCE; NEB Cat # M2081S) and then methylated at the 2'-0 position of the first nucleotide adjacent to the cap structure with a mRNA Cap 2'- O-Methyltransferase (NEB Cat # M0366S), according to manufacturer’s instructions, to produce a capped FLuc mRNA that comprise a 5'-terminal m7GpppGm (Cap 1) structure and a series of intermediate products, including a 5'-terminal diphosphate (pp or 2p), a 5'-terminal triphosphate (ppp or 3p), a 5 '-terminal guanosine triphosphate (Gppp), and 5 '-terminal m7GpppG (Cap 0).
This 5 '-end m7GpppGm modified FLuc mRNA was hybridized either with a 25-nt biotinylated DNA probe (SEQ ID NO: 51) for cleavage conditions with hRNase 4, or with a 25-nt desthiobiotinylated DNA-RNA chimeric probe (DNA/RNA probe; SEQ ID NO: 55), wherein the first 6 positions at the 5’ end are deoxyribonucleotides and the remaining 19 positions are ribonucleotides; deoxyribonucleotides are denoted by a preceding ‘d’) for cleavage conditions with RNase H. Hybridization was performed utilizing a touchdown approach (by heating to 80°C followed by a ramp-down at 0.1°C/s to 22°C) in absence of a denaturant (e.g., such as urea). The DNA/FLuc mRNA hybrid was cleaved utilizing a composition of 1 pL of hRNase 4 (10-fold dilution) and 0.4 pL of T4 PNK (400,000U/mL) in NEB buffer rl .1 at 37°C for 1 hour. Each digestion was stopped by addition of 1 pL of human placental RNase inhibitor. The cleaved DNA/RNA duplex segment was affinity purified
utilizing streptavidin magnetic beads and eluted by heating to 80°C in water. For comparison, the DNA-RNA chimera/FLuc mRNA hybrid was cleaved utilizing 1 pL of Thermostable RNase H (NEB Cat # M0523S), then affinity purified utilizing streptavidin magnetic beads and eluted by heating to 80°C in water as above.
Digestion of the FLuc mRNA substrate hybridized to the DNA probe is expected to produce a 28mer RNA cleavage product terminating at the first hRNase 4 recognition site following (i.e., at a more 3’ position on the mRNA substrate than) the DNAZRNA hybridized region of the FLuc mRNA. Digestion of the same FLuc mRNA substrate/DNA probe duplex with RNase H is expected to yield a 24mer RNA cleavage product (FIGURE 31 A). Two main chromatographic peaks were observed in the hRNase 4/T4 PNK reaction after cleavage and DNA/RNA duplex enrichment (FIGURE 3 IB, top panel). These peaks correspond to the 28mer RNA cleavage product and the biotinylated DNA probe. In contrast, the RNase H reaction yielded three main chromatographic peaks, corresponding to the 23mer (‘Product -Inf) and 24mer RNA products and the biotinylated DNA probe (FIGURE 3 IB, bottom panel). These data accord with observations that probe-directed RNase H cleavage of an RNA substrate may not be uniform and may result in multiple cleavage products around a pre-defined site. On the other hand, probe-directed hRNase 4 cleavage resulted in a substantially more specific formation of the expected cleavage product, indicating that protecting a sequence segment of interest with a complementary probe and contacting the RNA substrate with a site-specific ribonuclease that preferentially cuts single-stranded RNA is a superior strategy to generate predetermined oligonucleotides for LC-MS/MS analysis.
FIGURE 31C shows the deconvoluted mass spectrum of the 28mer cleavage product peak of Figure 3 IB (hRNase 4 condition). FIGURE 3 ID shows the deconvoluted mass spectrum of the 24mer cleavage product peak of Figure 3 IB (RNase H condition). Both mass spectra are consistent with the formation of a series of intermediary modifications of the RNA 5' end resulting from incomplete enzymatic capping and/or methylation of the FLuc mRNA (described above). These include 5'-pp (diphosphate or 2p or pp), 5 '-ppp (triphosphate or 3p or ppp), 5'-Gppp, and Cap 1 (5'-m7GpppGm).
A search of all annotated 5 '-end products with a Cap 1 structure in the hRNase 4 condition revealed that the vast majority came from the 28mer product (87.5 ± 1 .4%), whereas in the RNase H condition, the 5 '-end products with a Cap 1 structure are distributed between of the 23 m er (45.7 ± 2.3%) and 24mer (51 5 ± 2.5%) product sequences (FIGURE. 3 IE). Hence, a framework for increasing the precision of RNA cleavage (e.g., by optimizing the identity and
sequence of the protection probe for each RNA substrate of interest) while may be required in applications involving RNase H, may not be at all necessary in applications involving hRNase 4.
A relative quantitation of the products with different 5'-end modifications, and by extension, a relative measure of FLuc mRNA capping efficiency is shown in FIGURE 3 IF. The aggregate results of hRNase 4 and RNaseH largely agree with each other. In both the hRNase 4 and RNaseH reactions, the Capl product represented the majority of species identified (66.7 ± 1.2% and 64.2 ± 1.4%, respectively). The relative abundance of intermediary products also exhibited good concordance between the hRNase 4 and RNaseH conditions, with the di phosphorylated product (24.0 i- 2.2% and 21.9 ± 0.4%, respectively) exhibiting the highest relative abundance among all intermediary products.
Taken together, these data demonstrate that ribonucleases, such as hRNase 4, that feature a high degree of specificity for cleaving a single-stranded RNA substrate at defined sites (e.g., specificity for cleaving an RNA at one or more dinucleotide, trinucleotide or tetranucleotide combinations) may be useful to characterize the extent of mRNA 5' end capping. RNase 4 results are comparable to RNase H results but with differences. For example, RNase H requires a double-stranded RNA target such that the DNA probe must be chimeric requiring both a DNA and an RNA portion and even with such a probe, RNaseH cleavage products vary in size by -2 to +2 nucleotides around the recognition site, complicating fragment analysis.
Disclosed methods, in some embodiments, may be used to analyze aspects of the protected RNA segment, including modifications present in the segment, such as a cap structure. For example, disclosed methods may include contacting an RNA substrate with (a) a probe targeting an internal segment of the RNA substrate of interest, (b) a probe targeting a 3' end segment of the RNA substrate of interest or (c) multiple probes (e.g., multiple biotinylated-DNA probes) targeting different portions of the RNA substrate, permitting simultaneous analysis of such portions. Disclosed methods may be applied to RNA modification analysis, such as RNA identification, locating an RNA within a sequence, assessing RNA stoichiometry, detecting RNA presence, permanence, and/or dynamics (i.e., installation and removal), and detecting co-existence of RNA modifications.
In some embodiments, RNA 5' end cap analysis methods, including methods of analyzing cap structures present in mRNAs (e.g., 7-methyl guanosine triphosphate cap), small nuclear RNAs (e g., 2,2,7-trimethylguanosine triphosphate cap or y-monomethyl phosphate
cap) and mitochondrial RNA (e.g., NAD cap), may include contacting a sample and a 5'— >3' exoribonuclease that is capable of hydrolyzing 5 '-monophosphate RNA in the 5' to 3' direction and that does not hydrolyze 5 '-capped RNA. Examples of such 5'-phosphate-dependent exonucleases include XRN-1 (NEB, Cat # M0338S) and Terminator (LGC, Biosearch Technologies, Cat # TER51020). Treatment of an RNA sample with XRN-1 or Terminator, prior or after contacting the RNA substrate (or fragments thereof) with a site-specific endoribonuclease such as hRNase 4, may reduce the complexity of the sample and facilitate data analysis of RNA 5 '-capped ends.
By selecting appropriate probe sequences, disclosed methods may be applied for detecting RNA segments originated from abortive transcription initiation events; or RNA segments originated from premature transcription termination events (e.g., resulting in truncated RNAs); or RNA segments originated from cis-primed transcription extension and/or self-primed transcription extension that result in a transcript pool comprising longer than encoded RNA products, often forming regions of double-stranded RNA that may trigger innate immune response and affect the action of RNA vaccines and therapeutics.
Disclosed methods may be used in absence of a protection probe so that it permits that RNA segments comprising double-stranded regions or other structural regions (e.g., hairpins, stem loops, pseudoknots, etc.) that may form within an RNA substrate of interest (intramolecular structures) or may form among multiple RNAs or DNA/RNA hybrids (intermolecular structures), including triple or quadruple helices, to be either directly analyzed by LC-MS/MS or undergo a process of purification to isolate the double-stranded and/or other structural regions prior to LC-MS/MS analysis. In these embodiments, the structured region(s) will (in analogy to a region protected by an exogenous probe) direct the ribonuclease to cleave the RNA only at accessible sites (e g., ribonuclease specific sites located at unstructured or poorly structured regions), thus enabling analyses of such structured region(s). Disclosed methods may be used to determine RNA structured regions implicated in certain biological functions, such as translation modulators, splicing regulatory elements, microRNA processing sites, riboswitches, IRES, and others.
In some embodiments, a cap analysis method may be performed in the absence of a protection probe, for example, where site-specific ribonuclease access to the subject RNA segment is limited by a protein (e.g., an RNA binding protein or an antibody), by an RNA ligand (e.g., cellular metabolites such as adenosylcobalamin, lysine, glycine, flavin mononucleotide, etc. as well as synthetic small molecule binders such as fluorescent dyes and
drugs like branaplam and risdiplam), by a divalent ion (e.g., a salt of magnesium, calcium, zinc, manganese, etc.) or by a multicomponent biological structure (e.g., a ribosome, a lipid-based membrane, etc.). For example, RNA cleavage by a site-specific ribonuclease in the surrounding region(s) of the bound element (for instance an RNA binding protein, an RNA ligand, or a ribosome) may be used to determine the identity of the sequence to which this element is bound, the relative occupancy, and/or binding dissociation properties.
In some embodiments, chemical crosslinking may be performed prior to contacting the RNA with a site-specific ribonuclease (e.g., hRNase 4). The RNA may be crosslinked intramolecularly and/or intramolecularly (e.g., to complementary probe, to an RNA binding protein, to a ribosome, to an aptamer, to another DNA or RNA strand). The RNA may be cleaved by a site-specific ribonuclease in the surrounding region(s) of the crosslinked region for isolation and analyses.
EXAMPLE 16: The effect of varying the RNA 5 '-end sequence in probe-directed RNA cleavage with hRNase 4 or RNase H
In some embodiments, disclosed methods may be applied to analysis of mRNA capping in transcripts with distinct 5'-UTR sequences and with sequences comprising full replacement of uridine sites (U) with 1 -methyl pseudouridine sites (ml P or m'Y) as illustrated in this example. Synthetic mRNA transcripts were constructed by replacing the FLuc mRNA 5'-UTR coding sequence with the coding sequence of the 5'-UTR of interest. TABLE 6 lists the mRNA 5'-UTR sequences used in this example. Transcripts were produced by in vitro transcription (IVT) utilizing the HiScribe™ T7 High Yield RNA Synthesis Kit (NEB, Catalog # E2040S) utilizing either canonical UTP or m1 TTP replacing UTP to result in full substitution of uridine with 1 -methyl -pseudouridine as described in Examples 5 and 7. Each mRNA was capped with FCE and methylated with a 2 ’-O-m ethyltransferase to produce a 5' terminal m7GpppGm (Cap 1) containing product and a series of intermediary 5' end capped and uncapped products as described in Example 15.
Each mRNA was hybridized to a corresponding biotinylated DNA probe (TABLE 7) utilizing the touchdown hybridization approach as described in Example 15. Each hybridized DNA/RNA duplex was digested with either hRNase 4/T4 PNK or RNase H, affinity purified and characterized by LC-MS/MS as described in Example 15. TABLE 7. Probe sequences used to assess probe-directed RNA cleavage of mRNAs comprising distinct 5'-UTRs
Probes used for hRNase 4 protection cleavage are denoted by ‘R4’. Probes used for RNase H protection cleavage are marked with ‘RH’. Deoxyribonucleotides in the DNA-RNA chimera probes are preceded by ‘d’. FIGURE 32A shows the extent of capping observed for each mRNA substrate. A relative quantitation of the cleavage products with different 5' end modifications was determined for each of the hRNase 4 and RNaseH conditions as described in Example 15. A range of capping efficiencies (30-90%) was detected across U or m' -modified mRNAs with distinct 5'-UTRs. However, the mean relative abundance of each mRNA 5' end product
(comprising a 5'-pp, or a 5'-ppp, or a 5'-Gppp, or a Cap 0, or a Cap 1) was consistent in both the hRNase 4 and RNaseH conditions.
FIGURE 32B shows the length distribution of m7GpppGm (Cap 1) capped cleavage products observed in each of the hRNase 4 and RNaseH conditions. Cap 1 products of identical length were detected for both U and rn'T'-modified variants of a particular mRNA 5'-UTR. Notably, the hRNase 4 condition yielded predominantly Cap 1 products of a discrete length, resulting from cleavage of each mRNA substrate at an hRNase 4 recognition site downstream of the DNA probe hybridized region in each target mRNA. In contrast, a higher heterogeneity regarding the length distribution of the Cap 1 products for each individual mRNA sub state was observed in the RNase H condition. While for certain mRNA substrates (for instance, mRNA comprising HBB and pRNA21 5'-UTRs) cleavage with RNase H resulted in primarily one product, for other mRNAs (for instance, mRNA comprising Comirnaty and FLuc 5'-UTRs) cleavage with RNase H resulted in mixtures of products varying by one or more nucleotides in length.
Collectively, these data demonstrate that ribonucleases such as hRNase 4 are useful to assess mRNA 5' end capping across mRNAs with a diversity of 5' end sequences. The ability of hRNase 4 to produce specific cleavage at defined sites surrounding a protected portion of an RNA substrate may yield a more defined set of RNA cleavage products, which may advantageously simplify data analysis and facilitate the assessment of aspects of interest, such as the presence of a cap, a tail and/or modifications (e.g., endogenous modifications, synthetically incorporated modifications, or RNA modifications resulting from damage caused by irradiation, exposure to hazardous chemicals, temperature or pH fluctuation, among others).
EXAMPLE 17: DNA probe-directed selective purification of RNA poly(A) tails with hRNase 4
According to some embodiments, disclosed workflows may be applied to selectively cleaving and purifying an mRNA 3' end poly(A) tail utilizing a site-specific ribonuclease. For such embodiments, care may be taken to select a site-specific ribonuclease (e.g., hRNase 4) that is not adenosine specific (i.e., does not cleave a 3 ',5 '-phosphodiester bond with specificity for adenosine at the main anchoring site B 1). A workflow may include, for example, contacting an mRNA 3' end poly(A) tail with a DNA probe to form a duplex product in which the DNA probe is annealed to at least a portion of the poly(A) tail and one or more additional nucleotides immediately upstream of poly(A) tail sequence.
FIGURE 33 shows a representative example of a deconvoluted LC-MS/MS spectrum of oligonucleotide cleavage products comprising regions of the mRNA poly(A) tail that were isolated from the capped and polyadenylated synthetic EPO mRNA of EXAMPLE 8. The in vitro synthesize EPO mRNA, whose coding sequence encoded a 120-nt poly(A) tail, was annealed to a biotinylated DNA probe an (SEQ ID NO: 59 /5BiosG/TTTTT/iBiodT/TTTTT/iBiodT/TTTTTTTTTTTVN), contacted with hRNase 4 and T4 PNK, and then purified with the use of magnetic streptavidin beads. After elution from the beads, a 126-nt product cleavage product comprising a distribution of poly(A) tail-related oligonucleotides sequences differing in mass from each other by a single adenosine residue (FIGURE 33) was identified.
Collectively, these data demonstrate the use of a ribonuclease such as hRNase 4 for isolation of mRNA poly(A) tail sequence regions for analysis by LC-MS/MS.
EXAMPLE 18: Integrated analysis of an mRNA sequence, 5 ’-cap and Poly(A) tailing using hRNase 4/T4 PNK
Examples workflows are provided to characterize in a single experimental preparation the three primary parts of a eukaryotic mRNA: a 5' end cap, an internal (or body) sequence, and a 3' end poly(A) tail. Available methods for analysis of mRNA by LC-MS/MS frequently require independent analytical workflows to characterize each of these modules. FIGURE 34 shows an example of an analytical workflow for integrated mRNA analysis enabled by the use of a site-specific ribonuclease such as hRNase 4, optionally in combination with a repair enzyme such as T4 PNK.
In FIGURE 34 workflows, an mRNA comprising a 5' end cap, an internal RNA sequence, and a 3' end poly(A) tail, contacts a 5’ targeting DNA probe complementary to the 5’ end of the mRNA and a 3’ DNA targeting probe complementary to the 5’ end of the mRNA to form annealed polynucleotide products comprising in a 5’ to 3’ direction relative to the mRNA, a double- stranded first DNA probe/5’ mRNA segment, an internal single-stranded sequence, and a double-stranded second DNA probe/3’ mRNA segment. Annealed polynucleotides may be contacted with hRNase 4, optionally in combination with T4 PNK, to selectively generate cleavage products (oligonucleotides). Cleavage products may be isolated and analyzed by LC-MS/MS. For example, a capped and poly-adenylated mRNA may be annealed to two complementary DNA probes, each optionally comprising one or more affinity tags. One DNA probe may be complementary to the 5' end sequence of the mRNA and the other DNA probe may be an oligo-dT probe complementary to the poly(A) tail, forming
DNA/mRNA hybrids at the mRNA 5' end and 3' end poly(A) tail, respectively, so that these regions are protected from cleavage by the ribonuclease. A digestion reaction may comprise contacting annealed polynucleotides with a composition including a site-specific ribonuclease (e.g., hRNase 4) and a repair enzyme (e g , T4 PNK) to form products site-specifically cleaved at accessible regions of the mRNA substrate (e.g., the internal single-stranded sequence) to form cleavage products comprising single-stranded fragments of the mRNA, the doublestranded first DNA probe/5’ mRNA segment, and the double-stranded second DNA probe/3’ mRNA segment. Next, single-stranded fragments and cleaved DNA/mRNA duplex segments may be separated from each other (e.g., by affinity purification) wherein one fraction comprises single-stranded fragments and another comprises duplex segments. One or more fractions may be subject to LC-MS/MS analysis. For example, single-stranded fragments (e.g., in a supernatant fraction of an affinity purification) may be subjected to LC-MS/MS and to characterize the internal mRNA sequence and/or cleaved DNA/mRNA hybrid duplexes (e g., in an eluted fraction of an affinity purification) may be subjected to LC-MS/MS to characterize the 5' cap and 3' poly(A) tail. The combined analysis of cleavage products, resulting from the internal mRNA sequence fraction and from the cap and poly(A) tail fraction, can be used for an integrated characterization of an mRNA substrate of interest (e.g., characterization of the RNA sequence and any modifications) from a single experimental preparation within the same workflow.
In some embodiments, a mRNA comprising a 5' end cap, an internal RNA sequence, and a 3' end poly(A) tail may be annealed to one or more DNA probes, each complementary to at least a portion of the mRNA (e.g., each independently designed to be complementary to 5' and/or 3' end regions of the mRNA substrate). In some embodiments, a workflow may additionally comprise one or more DNA probes targeting selected regions of the internal mRNA sequence. In some embodiments, a DNA probe targeting the mRNA 3' end may comprise an oligo-dT DNA probe. In some embodiments, a DNA probe targeting an mRNA 3' end may comprise one or more additional nucleotides complementary to the mRNA sequence immediately upstream of the oligo-dT DNA probe binding site. Each of the DNA probes targeting the 5’ or 3' end regions of the mRNA may independently of each other comprise an affinity group (e.g., a biotin) so that the DNA-RNA duplex can be isolated by affinity purification at any stage of the workflow. The affinity group may be attached to the 5' end of the DNA probe, to the 3' end of the DNA probe, or internally to the 5' end of the DNA probe (e.g., the affinity group may be covalently linked to the base an internal nucleotide, for instance
to the 5-position of thymine). In some embodiments, multiple affinity groups (e.g., multiples of the same affinity group or affinity groups of different chemical composition) may be used to increase purification efficiency and/or allow purification using multiple affinity matrices. In other embodiments, the DNA does not comprise an affinity group and the isolation of the DNA- RNA duplex may be performed by size exclusion chromatography (SEC), gel filtration chromatography, anion-exchange chromatography (AEX), hydrophilic interaction liquid chromatography (HILIC), reversed-phase liquid chromatography (RP-LC), ion-paring reversed-phase liquid chromatography (IP -RP-LC), solid-phase reversible immobilization (e.g., SPRI paramagnetic beads), or any combination thereof (also referred as multimodal or mixed-mode chromatography). In some embodiments, a DNA-RNA duplex may be analyzed directly without isolation or purification.
In some embodiments, an mRNA comprising a 5' end cap, an internal RNA sequence, and a 3' end poly(A) tail, and annealed to one or more targeting DNA probes, may be contacted with hRNase 4, optionally in combination with T4 PNK, to selectively generate cleavage products (oligonucleotides) by cutting in regions of the mRNA sequence that comprise accessible ribonuclease cleavage sites (e.g., sites not protected by the targeting DNA probe). Oligonucleotide cleavage products comprising a mixture of DNA-RNA duplex (DNA-RNA hybrids) region(s) and single-stranded RNA regions may be either directly analyzed by LC- MS/MS or undergo a process of purification to isolate the cleaved DNA-RNA duplex region(s) prior to LC-MS/MS analysis. Cleaved DNA-RNA duplex region(s) may be isolated by affinity purification; by purification using of one or more of the chromatographic or immobilization modes described above; or both. After isolation of the cleaved DNA-RNA duplex regions(s), the remaining supernatant enriched in single-stranded internal mRNA regions may be either directly analyzed by LC-MS/MS or undergo purification using of one or more of the chromatographic or immobilization modes described above, and then be analyzed by LC- MS/MS. The isolated DNA-RNA duplex region(s) may be eluted as appropriated according to the method of purification chosen and then analyzed by LC-MS/MS.
Claims
1. A composition compri sing :
(a) an endoribonuclease having an amino acid sequence that (i) corresponds to an amino acid sequence of a first species or (ii) is a non-naturally occurring sequence; and
(b) an RNA end repair enzyme having an amino acid sequence that (i) corresponds to an amino acid sequence of a species other than the first species or (ii) is a non- naturally occurring sequence.
2. A composition according to Claim 1, wherein the first species is selected from Homo sapiens, Escherichia coli, Aspergillus oryzae, Momordica charantia, Pyrococcus furiosus, Cucumis sativus, and Sus scrofa and the species other than the first species is a bacterial species or a bacteriophage species.
3. A composition according to Claim 1, wherein the first species is a vertebrate species.
4. A composition according to Claim 1, wherein the endoribonuclease specificity is selected from (1) cleavage after a specific nucleotide followed by a purine, (2) cleavage after a specific nucleotide followed by a pyrimidine, (3) cleavage after a purine followed by a specific nucleotide, and (4) cleavage after a pyrimidine followed by a specific nucleotide.
5. A composition according to Claim 1, wherein the endoribonuclease has an average cleavage rate of once every 6-12 nucleotides.
6. A composition according to Claim 1, wherein the endoribonuclease is hRNase 4.
7. A composition according to Claim 1, wherein the endoribonuclease is selected from RNase 4, RNase Tl, RNase U2, RNase A, Colicin E5, MCI, Cusativin, Csxl, MazF, ChpB, MqsR, and YafO.
8. A composition according to Claim 1, wherein the RNA end repair enzyme comprises phosphodiesterase and phosphomonoesterase activities.
9. A composition according to Claim 1, wherein the RNA end repair enzyme is a polynucleotide kinase-phosphatase.
10. A composition according to Claim 1, wherein the RNA end repair enzyme is a T4 polynucleotide kinase-phosphatase or a Cth polynucleotide kinase-phosphatase.
11. A composition according to Claim 1 further comprising one or more of a denaturing agent, a buffering agent, and an RNA substrate.
12. A composition according to Claim 1 further comprising one or more oligoribonucleotides.
13. A method compri sing :
(a) contacting an RNA substrate and an endoribonuclease to produce oligoribonucleotides comprising one or more unrepaired ends that are 2', 3'- cyclic-phosphorylated, 3'-phosphorylated and/or 2'-phosphorylated; and
(b) contacting an RNA end repair enzyme and the oligoribonucleotides comprising the unrepaired ends to produce oligoribonucleotides comprising one or more repaired ends that are 2’, 3 ’-hydroxylated.
14. A method according to Claim 13, wherein the endoribonuclease specificity is selected from (1) cleavage after a specific nucleotide followed by a purine, (2) cleavage after a specific nucleotide followed by a pyrimidine, (3) cleavage after a purine followed by a specific nucleotide, and (4) cleavage after a pyrimidine followed by a specific nucleotide.
15. A method according to Claim 13, wherein the endoribonuclease has an average cleavage rate of the RNA substrate of once every 6-12 nucleotides.
16. A method according to Claim 13, wherein the endoribonuclease has an average cleavage rate of the RNA substrate of once every 8 nucleotides.
17. A method according to Claim 13, wherein the endoribonuclease is hRNase 4.
18. A method according to Claim 13, wherein the endoribonuclease is selected from RNase 4, RNase Tl, RNase U2, RNase A, Colicin E5, MCI, Cusativin, Csxl, MazF, ChpB, MqsR, and YafO.
19. A method according to Claim 13, wherein the RNA end repair enzyme comprises phosphodiesterase and phosphomonoesterase activities.
20. A method according to Claim 13, wherein the RNA end repair enzyme is a polynucleotide kinase-phosphatase.
21. A method according to Claim 13, wherein the RNA end repair enzyme is a T4 polynucleotide kinase-phosphatase or a Cth polynucleotide kinase-phosphatase.
22. A method according to Claim 13, wherein the method is a coupled reaction method.
23. A method according to Claim 13, wherein the (a) contacting and the (b) contacting occur in a single location or occur in separate locations that are in fluid communication with one another.
24. A method according to Claim 13, wherein the RNA substrate is a denatured RNA substrate.
25. A method according to Claim 25, wherein the (a) contacting further comprises denaturing the RNA substrate to form a denatured RNA substrate and contacting the denatured RNA substrate and the endoribonuclease.
26. A method according to Claim 13, wherein denaturing the RNA substrate further comprises contacting the RNA substrate with a denaturing agent at a salt concentration of up to 50 mM or incubating the RNA substrate at a temperature of 65°C or higher at a salt concentration of up to 50 mM.
27. A method according to Claim 13, wherein the denaturing agent is selected from urea, formamide, dimethylformamide, guanidinium thiocyanate, sodium salicylate, dimethyl sulfoxide, propylene glycol, poly(ethylene glycol), and cetyltrimethylammonium bromide.
28. A method according to Claim 13, wherein the (a) contacting further comprises denaturing the RNA substrate to form a denatured RNA substrate, diluting the denatured RNA substrate for form a diluted denatured RNA substrate, and contacting the diluted denatured RNA substrate and the endoribonuclease.
29. A method according to Claim 13, wherein the (a) contacting further comprises contacting the RNA substrate and the endoribonuclease and a buffering agent.
30. A method according to Claim 13, wherein the (b) contacting further comprises contacting the RNA end repair enzyme and oligonucleotides and a buffering agent.
31. A method according to Claim 13, wherein the (b) contacting further comprises separating the oligoribonucleotides comprising one or more unrepaired ends from the endoribonuclease to form separated oligoribonucleotides comprising one or more unrepaired ends.
32. A method according to Claim 13 further comprising (c) characterizing the oligoribonucleotides comprising one or more repaired ends that are 2’, 3 ’-hydroxylated.
33. A method according to Claim 32, wherein the (c) characterizing comprises characterizing the oligoribonucleotides comprising one or more repaired ends by one or more of gel electrophoresis, capillary electrophoresis, liquid chromatography, and mass spectrometry.
34. A method according to Claim 32, wherein the (c) characterizing comprises separating the oligoribonucleotides from one or more of the RNA substrate, the endoribonuclease, the RNA end repair enzyme to form separated oligoribonucleotides and characterizing the separated oligoribonucleotides.
35. A method according to Claim 32, wherein the (c) characterizing comprises fractionating the oligoribonucleotides comprising one or more repaired ends that are 2’,3’- hydroxylated by liquid chromatography to form fractionated oligoribonucleotides and ionizing the fractionated oligoribonucleotides for mass spectrometry.
36. A method according to Claim 13, wherein the RNA substrate comprises in vitro transcribed RNA, chemically synthesized RNA, viral RNA, prokaryotic RNA, eukaryotic RNA, archaeal RNA, or any combination thereof.
37. A method according to Claim 13, wherein the RNA substrate comprises tissue culture RNA, biopsy RNA, feces RNA, urine RNA, lymph RNA, blood RNA, mucous RNA, sputum RNA, skin RNA, saliva RNA, wound RNA, sweat RNA, semen RNA, shoot RNA, root RNA, seed RNA, sewage RNA, sludge RNA, soil RNA, or any combination thereof.
38. A method according to Claim 13, wherein the RNA substrate comprises messenger RNA (mRNA), ribosomal RNA (rRNA), transfer RNA (tRNA), small RNA (sRNA), microRNA (miRNA), long non-coding RNA (IncRNA), circular RNA (circRNA), aptamer
RNA, antisense RNA, silencing RNA (siRNA), guide RNA (gRNA), or any combination thereof.
39. A kit comprising:
(a) an endoribonuclease having an amino acid sequence that (i) corresponds to an amino acid sequence of a first species or (ii) is a non-naturally occurring sequence;
(b) an RNA end repair enzyme having an amino acid sequence that (i) corresponds to an amino acid sequence of a species other than the first species or (ii) is a non- naturally occurring sequence;
(c) optionally, a denaturing agent;
(d) optionally, a buffering agent; and.
(e) optionally, an affinity -labeled DNA probe.
40. A kit according to Claim 39, wherein the first species is selected from Homo sapiens, Escherichia coH, Aspergillus oryzae, Momordica charantia, Pyrococcus furiosus, Cucumis sativus, and Sus scrofa and the species other than the first species is a bacterial species or a bacteriophage species.
41. A kit according to Claim 39, wherein the first species is a vertebrate species.
42. A kit according to Claim 39, wherein the endoribonuclease specificity is selected from (1) cleavage after a specific nucleotide followed by a purine, (2) cleavage after a specific nucleotide followed by a pyrimidine, (3) cleavage after a purine followed by a specific nucleotide, and (4) cleavage after a pyrimidine followed by a specific nucleotide.
43. A kit according to Claim 39, wherein the endoribonuclease has an average cleavage rate of once every 6-12 nucleotides.
44. A kit according to Claim 39, wherein the endoribonuclease is hRNase 4.
45. A kit according to Claim 39, wherein the endoribonuclease is selected from RNase 4, RNase Tl, RNase U2, RNase A, Colicin E5, MCI, Cusativin, Csxl, MazF, ChpB, MqsR, and YafO.
46. A kit according to Claim 39, wherein the RNA end repair enzyme comprises phosphodiesterase and phosphomonoesterase activities.
47. A kit according to Claim 39 further comprising a divalent metal, wherein the divalent metal is optionally selected from magnesium(II), manganese(II), cobalt(II), and nickel(II).
48. A kit according to Claim 39, wherein the kit comprises the denaturing agent and the denaturing agent is selected from urea, formamide, dimethylformamide, guanidinium thiocyanate, sodium salicylate, dimethyl sulfoxide, propylene glycol, poly(ethylene glycol), and cetyltrimethylammonium bromide.
49. A kit according to Claim 39 further comprising one or more additional enzymes, wherein the one or more additional enzymes are optionally selected from RNA polymerases and RNA ligases.
50. A method compri sing :
(a) contacting an RNA substrate and one or more DNA probes, each DNA probe shorter than the RNA substrate and each comprising an affinity domain, wherein at least a portion of the RNA substrate and at least a portion of the DNA probe(s) are complementary, to form a DNA-RNA hybrid duplex comprising a double- stranded portion and at least one single-stranded overhang;
(b) contacting the DNA-RNA hybrid duplex with an enzyme composition, the enzyme composition comprising a single-strand-specific nucleotide-specific endoribonuclease and, optionally, an RNA end-repair enzyme, to form a cleaved DNA-RNA hybrid duplex and one or more single-stranded RNA fragments of the RNA substrate by cleavage of the RNA substrate at one or more sites within the single-stranded overhang by the single-strand-specific nucl eoti de- specifi c endorib onucl ease ;
(c) contacting the cleaved DNA-RNA hybrid duplex and a solid support comprising an affinity capture domain to form an affinity capture complex comprising the affinity domain bound to the affinity capture domain;
(d) optionally, washing the affinity capture complex to remove unbound materials, if any; and
(e) optionally, dissociating the cleaved DNA-RNA hybrid duplex to release the remaining portion of the RNA substrate from the one or more DNA probes.
51. A method according to Claim 50, wherein the DNA-RNA hybrid duplex comprises the double-stranded portion and two single-stranded overhangs.
52. A method according to Claim 50, wherein the DNA-RNA hybrid duplex comprises the double-stranded portion and a 5’ single-stranded overhang and a 3’ single-stranded overhang.
53. A method according to Claim 50, wherein the endoribonuclease is hRNase 4.
54. A method according to Claim 50, wherein the endoribonuclease is selected from RNase 4, RNase Tl, RNase U2, RNase A, Colicin E5, MCI, Cusativin, Csxl, MazF, ChpB, MqsR, and YafO.
55. A method according to Claim 50, wherein the RNA end repair enzyme comprises phosphodiesterase and phosphomonoesterase activities.
56. A method according to Claim 50, wherein the RNA end repair enzyme is a polynucleotide kinase-phosphatase.
57. A method according to Claim 50, wherein the RNA end repair enzyme is a T4 polynucleotide kinase-phosphatase or a Cth polynucleotide kinase-phosphatase.
58. A method comprising:
(a) contacting an RNA substrate, an enzyme, and an isotopically labeled nucleoside triphosphate to form a labeled RNA substrate, wherein the enzyme is optionally selected from an RNA polymerase and an RNA ligase;
(b) contacting the labeled RNA substrate and an endoribonuclease to produce oligoribonucleotides comprising one or more unrepaired ends that are 2', 3'- cyclic-phosphorylated, 3'-phosphorylated and/or 2' -phosphorylated; and
(c) contacting an RNA end repair enzyme and the oligoribonucleotides comprising the unrepaired ends to produce oligoribonucleotides comprising one or more repaired ends that are 2 ’,3 ’-hydroxylated.
59. A method comprising:
(a) contacting an RNA substrate, an enzyme, and a nucleoside triphosphate comprising a chemically reactive group to form a chemically reactive RNA
- 102 -
RECTIFIED SHEET (RULE 91) ISA/EP
substrate, wherein the enzyme is optionally selected from an RNA polymerase and an RNA ligase;
(b) contacting the chemically reactive RNA substrate and a molecule reactive with the chemically reactive RNA substrate to form a labeled RNA substrate, wherein the molecule comprises one or more stable isotopics;
(c) contacting the labeled RNA substrate and an endoribonuclease to produce oligoribonucleotides comprising one or more unrepaired ends that are 2',3'- cyclic-phosphorylated, 3'-phosphorylated and/or 2'-phosphorylated; and
(d) contacting an RNA end repair enzyme and the oligoribonucleotides comprising the unrepaired ends to produce oligoribonucleotides comprising one or more repaired ends that are 2 ’,3 ’-hydroxylated.
60. A method comprising:
(a) contacting an RNA substrate and one or more RNA substrate binding molecules to form RNA substrate-RNA binding molecule complexes, each complex comprising a bound portion and at least one single-stranded portion, wherein each bound portion comprises at least a portion of the RNA substrate and an RNA binding molecule;
(b) contacting the RNA substrate-RNA binding molecule complexes with an enzyme composition, the enzyme composition comprising: a single-strand-specific nucleotide-specific endoribonuclease and, optionally, an RNA end-repair enzyme, to form by cleavage of the RNA substrate at one or more sites within the single-stranded portion by the single-strand-specific nucleotide-specific endoribonuclease: cleaved bound portions and one or more fragments of the single-stranded portion;
(c) optionally separating the cleaved bound portions from the one or more fragments of the at least one single-stranded portion;
(d) optionally analyzing one or more properties of the cleaved bound portions; and
(e) optionally analyzing one or more properties of the fragments.
61. A method according to Claim 60, wherein the single-strand-specific nucleotidespecific endoribonuclease is hRNase 4.
- 103 -
RECTIFIED SHEET (RULE 91) ISA/EP
62. A method according to Claim 60, wherein the endoribonuclease is selected from RNase 4, RNase Tl, RNase U2, RNase A, Colicin E5, MCI, Cusativin, Csxl, MazF, ChpB, MqsR, and YafO.
63. A method according to Claim 60, wherein the RNA end repair enzyme comprises phosphodiesterase and phosphomonoesterase activities.
64. A method according to Claim 60, wherein the RNA end repair enzyme is a polynucleotide kinase-phosphatase.
65. A method according to Claim 60, wherein the RNA end repair enzyme is a T4 polynucleotide kinase-phosphatase or a Cth polynucleotide kinase-phosphatase.
66. A method according to Claim 60, comprising the (d) optional analysis of the one or more properties of the cleaved bound portions.
67. A method according to Claim 67, wherein the (d) analysis comprises characterizing at least the RNA substrate of the cleaved bound portions by one or more of gel electrophoresis, capillary electrophoresis, liquid chromatography, and mass spectrometry.
68. A method according to Claim 61 comprising the (e) optional analysis of the one or more properties of the fragments.
69. A method according to Claim 66, wherein the (e) analysis comprises characterizing the fragments by one or more of gel electrophoresis, capillary electrophoresis, liquid chromatography, and mass spectrometry.
70. A method according to Claim 60, wherein the RNA substrate comprises in vitro transcribed RNA, chemically synthesized RNA, viral RNA, prokaryotic RNA, eukary otic RNA, archaeal RNA, or any combination thereof.
71. A method according to Claim 60, wherein the RNA substrate comprises tissue culture RNA, biopsy RNA, feces RNA, urine RNA, lymph RNA, blood RNA, mucous RNA, sputum RNA, skin RNA, saliva RNA, wound RNA, sweat RNA, semen RNA, shoot RNA, root RNA, seed RNA, sewage RNA, sludge RNA, soil RNA, or any combination thereof.
72. A method according to Claim 60, wherein the RNA substrate comprises messenger RNA (mRNA), ribosomal RNA (rRNA), transfer RNA (tRNA), small RNA (sRNA),
- 104 -
RECTIFIED SHEET (RULE 91) ISA/EP
microRNA (miRNA), long non-coding RNA (IncRNA), circular RNA (circRNA), aptamer RNA, antisense RNA, silencing RNA (siRNA), guide RNA (gRNA), or any combination thereof.
- 105 -
RECTIFIED SHEET (RULE 91) ISA/EP
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202263329262P | 2022-04-08 | 2022-04-08 | |
US63/329,262 | 2022-04-08 | ||
US18/182,122 | 2023-03-10 | ||
US18/182,122 US20230287376A1 (en) | 2022-03-11 | 2023-03-10 | Immobilized enzyme compositions and methods |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2023197015A1 true WO2023197015A1 (en) | 2023-10-12 |
Family
ID=86286343
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2023/065602 WO2023197015A1 (en) | 2022-04-08 | 2023-04-10 | Compositions and analysis of dephosphorylated oligoribonucleotides |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2023197015A1 (en) |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7666612B2 (en) | 2003-05-23 | 2010-02-23 | Epfl-Ecole Polytechnique Federale De Lausanne | Methods for protein labeling based on acyl carrier protein |
US7799524B2 (en) | 2002-10-03 | 2010-09-21 | Ecole Polytechnique Ferdeale de Lausanne | Substrates for O6-alkylguanina-DNA alkyltransferase |
US7888090B2 (en) | 2004-03-02 | 2011-02-15 | Ecole Polytechnique Federale De Lausanne | Mutants of O6-alkylguanine-DNA alkyltransferase |
US7939284B2 (en) | 2001-04-10 | 2011-05-10 | Ecole Polytechnique Federale De Lausanne | Methods using O6-alkylguanine-DNA alkyltransferases |
US8163479B2 (en) | 2004-03-02 | 2012-04-24 | Ecole Polytechnique Federale De Lausanne | Specific substrates for O6-alkylguanine-DNA alkyltransferase |
US8227602B2 (en) | 2006-07-25 | 2012-07-24 | Ecole Polytechnique Federale De Lausanne | Labelling of fusion proteins with synthetic probes |
WO2018089860A1 (en) * | 2016-11-11 | 2018-05-17 | 2D Genomics Inc. | Methods for processing nucleic acid samples |
US20210207109A1 (en) * | 2018-05-16 | 2021-07-08 | Bio-Rad Laboratories, Inc. | Methods for processing nucleic acid samples |
-
2023
- 2023-04-10 WO PCT/US2023/065602 patent/WO2023197015A1/en unknown
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7939284B2 (en) | 2001-04-10 | 2011-05-10 | Ecole Polytechnique Federale De Lausanne | Methods using O6-alkylguanine-DNA alkyltransferases |
US8367361B2 (en) | 2001-04-10 | 2013-02-05 | Ecole Polytechnique Federale De Lausanne | Methods using O6-alkylguanine-DNA alkytransferase |
US7799524B2 (en) | 2002-10-03 | 2010-09-21 | Ecole Polytechnique Ferdeale de Lausanne | Substrates for O6-alkylguanina-DNA alkyltransferase |
US7666612B2 (en) | 2003-05-23 | 2010-02-23 | Epfl-Ecole Polytechnique Federale De Lausanne | Methods for protein labeling based on acyl carrier protein |
US7888090B2 (en) | 2004-03-02 | 2011-02-15 | Ecole Polytechnique Federale De Lausanne | Mutants of O6-alkylguanine-DNA alkyltransferase |
US8163479B2 (en) | 2004-03-02 | 2012-04-24 | Ecole Polytechnique Federale De Lausanne | Specific substrates for O6-alkylguanine-DNA alkyltransferase |
US8227602B2 (en) | 2006-07-25 | 2012-07-24 | Ecole Polytechnique Federale De Lausanne | Labelling of fusion proteins with synthetic probes |
WO2018089860A1 (en) * | 2016-11-11 | 2018-05-17 | 2D Genomics Inc. | Methods for processing nucleic acid samples |
US20210207109A1 (en) * | 2018-05-16 | 2021-07-08 | Bio-Rad Laboratories, Inc. | Methods for processing nucleic acid samples |
Non-Patent Citations (17)
Title |
---|
"Oligonucleotide Synthesis: A Practical Approach", 1984, IRI, PRESS |
BEVERLY ET AL., ANAL. BIOANAL. CHEM, vol. 408, 2016, pages 5021 - 30 |
BROWN TREVOR S ET AL: "Method for assigning double-stranded RNA structures", BIOTECHNIQUES, vol. 38, 1 March 2005 (2005-03-01), pages 368 - 372, XP093057699 * |
GRAMELSPACHER MAX J ET AL: "Biochemical characterization of RNA-guided ribonuclease activities for CRISPR-Cas9 systems", METHODS, ACADEMIC PRESS, NL, vol. 172, 20 June 2019 (2019-06-20), pages 32 - 41, XP086083209, ISSN: 1046-2023, [retrieved on 20190620], DOI: 10.1016/J.YMETH.2019.06.018 * |
HAFNER MARKUS ET AL: "Transcriptome-wide Identification of RNA-Binding Protein and MicroRNA Target Sites by PAR-CLIP", CELL, vol. 141, no. 1, 1 April 2010 (2010-04-01), Amsterdam NL, pages 129 - 141, XP093057722, ISSN: 0092-8674, DOI: 10.1016/j.cell.2010.03.009 * |
HALEMARKHAM: "Oligonucleotides and Analogs: A Practical Approach", 1991, OXFORD UNIVERSITY PRESS |
HERMANSON: "Bioconjugate Techniques", 2008, ACADEMIC PRESS, pages: 276 - 335 |
KEPPETIPOLA N. ET AL: "Reprogramming the tRNA-splicing activity of a bacterial RNA repair enzyme", NUCLEIC ACIDS RESEARCH, vol. 35, no. 11, 7 May 2007 (2007-05-07), GB, pages 3624 - 3630, XP093057739, ISSN: 0305-1048, DOI: 10.1093/nar/gkm110 * |
KORNBERGBAKER: "DNA Replication", 1992, W.H. FREEMAN |
LEHNINGER: "Biochemistry", 1975, WORTH PUBLISHERS |
LOS ET AL., METHODS MOL BIOL, vol. 356, 2007, pages 195 - 208 |
LU ET AL.: "Immune Modulation by Human Secreted RNases at the Extracellular Space", FRONT IMMUNOL, vol. 9, 2018, pages 1012 |
SHIGEMATSU MEGUMI ET AL: "Generation of 2?,3?-Cyclic Phosphate-Containing RNAs as a Hidden Layer of the Transcriptome", FRONTIERS IN GENETICS, vol. 9, 27 November 2018 (2018-11-27), pages 1 - 13, XP055848805, DOI: 10.3389/fgene.2018.00562 * |
SINGLETON ET AL.: "Dictionary of Microbiology and Molecular biology", 1994, JOHN WILEY AND SONS |
SLETTEN, E. MBERTOZZI C. R: "Angewandte Chemie International Edition English", vol. 48, 2009, article "Bioorthogonal Chemistry: Fishing for Selectivity in a Sea of Functionality", pages: 6974 - 98 |
STRACHANREAD: "Human Molecular Genetics", 1999, WILEY-LISS |
WANG XI-WEN ET AL: "RNA structure probing uncovers RNA structure-dependent biological functions", NATURE CHEMICAL BIOLOGY, NATURE PUBLISHING GROUP US, NEW YORK, vol. 17, no. 7, 25 June 2021 (2021-06-25), pages 755 - 766, XP037491521, ISSN: 1552-4450, [retrieved on 20210625], DOI: 10.1038/S41589-021-00805-7 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20210230578A1 (en) | Removal of dna fragments in mrna production process | |
US9102936B2 (en) | Method of adaptor-dimer subtraction using a CRISPR CAS6 protein | |
EP2464755B1 (en) | Methods and kits for 3'-end-tagging of rna | |
JP5766610B2 (en) | Sequencing of nucleic acid molecules by mass spectrometry | |
EP3728585B1 (en) | Novel processes for the production of oligonucleotides | |
Muthmann et al. | Chemo‐enzymatic treatment of RNA to facilitate analyses | |
US20230022745A1 (en) | Methods for Labeling a Population of RNA Molecules | |
WO2015085142A1 (en) | Compositions and methods for capping rna | |
US20130143276A1 (en) | Compositions and Methods for Adenylating Oligonucleotides | |
CN103261416B (en) | Utilize the method for attachment of eucaryon tRNA ligase | |
JP2008253176A (en) | Linker for obtaining highly affinitive molecule | |
US20230287489A1 (en) | Compositions and Analysis of Dephosphorylated Oligoribonucleotides | |
WO2023197015A1 (en) | Compositions and analysis of dephosphorylated oligoribonucleotides | |
WO2023097295A1 (en) | Rna and dna analysis using engineered surfaces | |
CN115552029A (en) | Compositions and methods for rapid RNA-adenylation and RNA sequencing | |
JP5858415B2 (en) | Linker for preparing mRNA / cDNA-protein conjugate and purification method of nucleotide-protein conjugate using the same | |
CN112105748B (en) | Methods for sequencing and producing nucleic acid sequences | |
Strzelecka et al. | Functional and LC-MS/MS analysis of in vitro transcribed mRNAs carrying phosphorothioate or boranophosphate moieties reveal polyA tail modifications that prevent deadenylation without compromising protein expression | |
EP4294936A1 (en) | Compositions and methods for labeling modified nucleotides in nucleic acids | |
Ge | Development of Site-Specific and Quantitative N 6-Methyl Adenosine (m 6 A) Profiling Methods | |
Yeh | In vitro selection of deoxyribozymes for O-glycoside cleavage and for 3-nitrotyrosine modification | |
WO2024006978A2 (en) | Improved methods for in vitro transcription | |
CN117730149A (en) | Single-stranded RNA purification method | |
Milton et al. | Stability of a 2′‐O‐(Carbamoylmethyl) adenosine‐Containing Dinucleotide | |
WO2015041909A1 (en) | Methods of selecting binding-elements and uses thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23721254 Country of ref document: EP Kind code of ref document: A1 |