CN115125292A - Single-base resolution positioning analysis method for inosine modification in RNA assisted by endonuclease - Google Patents
Single-base resolution positioning analysis method for inosine modification in RNA assisted by endonuclease Download PDFInfo
- Publication number
- CN115125292A CN115125292A CN202210623985.8A CN202210623985A CN115125292A CN 115125292 A CN115125292 A CN 115125292A CN 202210623985 A CN202210623985 A CN 202210623985A CN 115125292 A CN115125292 A CN 115125292A
- Authority
- CN
- China
- Prior art keywords
- rna
- inosine
- assisted
- reaction
- ino
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 title claims abstract description 26
- 229930010555 Inosine Natural products 0.000 title claims abstract description 25
- 229960003786 inosine Drugs 0.000 title claims abstract description 25
- 230000004048 modification Effects 0.000 title claims abstract description 24
- 238000012986 modification Methods 0.000 title claims abstract description 24
- 108010042407 Endonucleases Proteins 0.000 title claims abstract description 16
- 102000004533 Endonucleases Human genes 0.000 title claims abstract 7
- 238000004458 analytical method Methods 0.000 title claims description 21
- 238000006243 chemical reaction Methods 0.000 claims abstract description 29
- 238000000034 method Methods 0.000 claims abstract description 25
- 101000880860 Homo sapiens Endonuclease V Proteins 0.000 claims abstract description 15
- 238000012163 sequencing technique Methods 0.000 claims abstract description 15
- 102100037696 Endonuclease V Human genes 0.000 claims abstract description 14
- 238000010839 reverse transcription Methods 0.000 claims description 16
- 239000002299 complementary DNA Substances 0.000 claims description 15
- 108090000623 proteins and genes Proteins 0.000 claims description 15
- 102000004169 proteins and genes Human genes 0.000 claims description 14
- 238000013507 mapping Methods 0.000 claims description 9
- 239000011535 reaction buffer Substances 0.000 claims description 7
- 230000003321 amplification Effects 0.000 claims description 6
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 6
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 claims description 4
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 claims description 4
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 claims description 4
- 238000012408 PCR amplification Methods 0.000 claims description 3
- 239000007853 buffer solution Substances 0.000 claims description 2
- 230000004807 localization Effects 0.000 claims description 2
- 125000000896 monocarboxylic acid group Chemical group 0.000 claims description 2
- 230000007022 RNA scission Effects 0.000 claims 2
- 238000010276 construction Methods 0.000 abstract description 6
- 230000008569 process Effects 0.000 abstract description 4
- 230000015556 catabolic process Effects 0.000 abstract description 3
- 238000006731 degradation reaction Methods 0.000 abstract description 3
- 230000000903 blocking effect Effects 0.000 abstract description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 63
- 238000003776 cleavage reaction Methods 0.000 description 15
- 102100031780 Endonuclease Human genes 0.000 description 11
- 108020004414 DNA Proteins 0.000 description 9
- 239000000872 buffer Substances 0.000 description 9
- 239000003161 ribonuclease inhibitor Substances 0.000 description 9
- 230000007017 scission Effects 0.000 description 7
- 102000004190 Enzymes Human genes 0.000 description 6
- 108090000790 Enzymes Proteins 0.000 description 6
- 108020004999 messenger RNA Proteins 0.000 description 6
- 108010007577 Exodeoxyribonuclease I Proteins 0.000 description 5
- 102100029075 Exonuclease 1 Human genes 0.000 description 5
- 101710163270 Nuclease Proteins 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 108020004566 Transfer RNA Proteins 0.000 description 4
- 230000009286 beneficial effect Effects 0.000 description 4
- 238000004925 denaturation Methods 0.000 description 4
- 230000036425 denaturation Effects 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- LEQAOMBKQFMDFZ-UHFFFAOYSA-N glyoxal Chemical compound O=CC=O LEQAOMBKQFMDFZ-UHFFFAOYSA-N 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 102100029791 Double-stranded RNA-specific adenosine deaminase Human genes 0.000 description 3
- 101000865408 Homo sapiens Double-stranded RNA-specific adenosine deaminase Proteins 0.000 description 3
- 108091034117 Oligonucleotide Proteins 0.000 description 3
- 238000003559 RNA-seq method Methods 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 108091070501 miRNA Proteins 0.000 description 3
- 239000013612 plasmid Substances 0.000 description 3
- 230000007026 protein scission Effects 0.000 description 3
- BTBUEUYNUDRHOZ-UHFFFAOYSA-N Borate Chemical compound [O-]B([O-])[O-] BTBUEUYNUDRHOZ-UHFFFAOYSA-N 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 241000282414 Homo sapiens Species 0.000 description 2
- 230000026279 RNA modification Effects 0.000 description 2
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 2
- 230000001594 aberrant effect Effects 0.000 description 2
- 238000007259 addition reaction Methods 0.000 description 2
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 230000008827 biological function Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 238000010828 elution Methods 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 229940015043 glyoxal Drugs 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 239000002773 nucleotide Substances 0.000 description 2
- 125000003729 nucleotide group Chemical group 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 229960005322 streptomycin Drugs 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 101710169336 5'-deoxyadenosine deaminase Proteins 0.000 description 1
- 102000009394 5-Hydroxytryptamine 2C receptors Human genes 0.000 description 1
- 108050000259 5-Hydroxytryptamine 2C receptors Proteins 0.000 description 1
- 206010069754 Acquired gene mutation Diseases 0.000 description 1
- NLHHRLWOUZZQLW-UHFFFAOYSA-N Acrylonitrile Chemical compound C=CC#N NLHHRLWOUZZQLW-UHFFFAOYSA-N 0.000 description 1
- 102000055025 Adenosine deaminases Human genes 0.000 description 1
- 108020005098 Anticodon Proteins 0.000 description 1
- 208000023275 Autoimmune disease Diseases 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 102100038191 Double-stranded RNA-specific editase 1 Human genes 0.000 description 1
- 102100024692 Double-stranded RNA-specific editase B2 Human genes 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 1
- 101000742223 Homo sapiens Double-stranded RNA-specific editase 1 Proteins 0.000 description 1
- 101000686486 Homo sapiens Double-stranded RNA-specific editase B2 Proteins 0.000 description 1
- 108020005198 Long Noncoding RNA Proteins 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 208000012902 Nervous system disease Diseases 0.000 description 1
- 208000025966 Neurological disease Diseases 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 108020002230 Pancreatic Ribonuclease Proteins 0.000 description 1
- 102000005891 Pancreatic ribonuclease Human genes 0.000 description 1
- 108091034057 RNA (poly(A)) Proteins 0.000 description 1
- 238000010357 RNA editing Methods 0.000 description 1
- 108020004459 Small interfering RNA Proteins 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- 150000001413 amino acids Chemical class 0.000 description 1
- 230000001365 aminolytic effect Effects 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 239000012472 biological sample Substances 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 230000009615 deamination Effects 0.000 description 1
- 238000006481 deamination reaction Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 230000004049 epigenetic modification Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000005755 formation reaction Methods 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 208000024714 major depressive disease Diseases 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 239000002679 microRNA Substances 0.000 description 1
- 230000004770 neurodegeneration Effects 0.000 description 1
- 208000015122 neurodegenerative disease Diseases 0.000 description 1
- 230000000926 neurological effect Effects 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 208000020016 psychiatric disease Diseases 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 239000000523 sample Substances 0.000 description 1
- 238000007480 sanger sequencing Methods 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 230000037439 somatic mutation Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Analytical Chemistry (AREA)
- Physics & Mathematics (AREA)
- Immunology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The invention discloses an endonuclease-assisted single-base resolution positioning method for inosine modification in RNA, belonging to the technical field of biology. According to the invention, 3 '-deoxy-A is used for blocking the end of RNA 3' OH, then hEndoV protein is cut at inosine to generate new RNA 3 'OH, the new 3' OH is subjected to polyuridylation and library construction, and sequencing is carried out to obtain inosine modified site information. The method has the advantages of direct principle, simple and convenient operation, no chemical reaction in the whole process, no degradation of RNA, no need of complex bioinformatics algorithm, and capability of directly obtaining single base resolution positioning information modified by inosine in RNA.
Description
Technical Field
The invention belongs to the technical field of biology, and particularly relates to an endonuclease-assisted single-base resolution positioning method for inosine modification in RNA.
Background
RNA is composed of a combination of four typical bases including A, U, G, C, and more than 150 epigenetic modifications with important functions are found on RNA except for four standard base nucleotides. Inosine (Ino) is one of the most important post-transcriptional modifications on mammalian RNA, widely found on mRNA, tRNA, lncRNA, miRNA, and siRNA. Ino is produced by the catalytic aminolytic deamination at adenosine C6 by Adenosine Deaminase (ADAR) family proteins acting on RNA, a transition also known as a-to-Ino RNA editing. 3 ADAR proteins have been identified in mammals, ADAR1, ADAR2 with enzymatic activity and ADAR3 without enzymatic activity.
A-to-Ino editing has important biological functions. Since Ino is recognized as G by cellular machinery as a result of pairing with C during reverse transcription, A-to-Ino editing can alter genetic information post-transcriptionally. A-to-Ino editing on mRNA can result in amino acid changes, splice sites and the creation or deletion of stop codons, resulting in post-translational protein changes that favor protein diversity. Ino on the tRNA is located at position 34 of the anticodon loop, where Ino allows recognition of multiple codons by wobble pairing, enhancing the decoding capacity of the tRNA. mirnas and sirnas are able to regulate target genes in a highly specific manner, and a-to-Ino editing regulates the maturation, structure and function of mirnas and sirnas.
Aberrant levels of a-to-Ino editing have been shown to be associated with a variety of human diseases, including a variety of cancers, neurological and neurodegenerative diseases, psychiatric disorders, and autoimmune diseases, among others. Aberrant editing levels at the A-to-Ino editing sites on some transcripts have been shown to be associated with the mechanisms underlying disease. For example, A-to-Ino editing of the pre-mRNA of the 5 hydroxytryptamine 2C receptor occurs at 5 conserved sites in exon 5 (designated A to E), with increased editing levels at the A site in the brain of a suicidal victim of depression and significantly increased editing levels at the E site in patients with major depression. However, the significance of most A-to-Ino editing sites is unknown, and the accurate identification of the position of A-to-Ino editing in the whole transcriptome is the basis for the deep understanding of the biological function and significance of each site.
At present, the single base positioning method edited by the transcriptome range A-to-Ino is mainly RNA-seq. Ino pairs with C during reverse transcription so that it is read as G during sequencing, and thus can recognize the A-to-Ino editing site based on the A-to-G conversion. Although the principle is simple, in the actual analysis process, due to the interference factors such as alignment error, single nucleotide polymorphism, somatic mutation and the like, the RNA-seq can generate a large number of false positive sites. Although bioinformatic approaches such as statistical modeling, filtering and/or alignment, and integration of other genomic information were developed to equal the reduction of false positives, the accuracy of RNA-seq identification of a-to-Ino editing sites remains poor. Identification of A-to-Ino edits within the transcriptome RangeOne of the chemical methods for the site is "ICE-seq", N produced by the reaction of Ino with acrylonitrile 1 Cyanoethylarinine hinders cDNA extension during reverse transcription, and "erasure" signals generated during sequencing are screened out during library construction due to too short a fragment that produces a truncated cDNA, and can be used to identify a-to-Ino editing sites, but this approach fails to identify 100% edited sites. Another method involving chemical reactions is Ino-specific sequencing, where nuclease T1 is characterized by cleavage of RNA at G or Ino, where G forms a stable adduct with glyoxal in the presence of borate to resist cleavage by nuclease T1, and Ino is cleaved by nuclease T1 because it cannot form a stable adduct. The method comprises the steps of firstly oxidizing and biotinylating the 3' end of RNA through a chemical reaction, then treating and protecting G on the RNA through glyoxal and borate, then fixing the RNA on a streptomycin functional magnetic bead through biotin-streptomycin affinity, cutting the RNA at Ino by nuclease T1, and then performing library construction and sequencing on the cut RNA to analyze the A-to-Ino editing site. The disadvantage of this method is that the procedure is complicated and if 100% protection of G is not achieved, G can also be cleaved by nuclease T1 resulting in false positive sites. Chemical methods have the common disadvantage that severe chemical reactions cause severe degradation of RNA, so mild enzymatic reaction methods should be developed for single base resolution mapping analysis a-to-Ino editing.
hEndoV (human endonuclease V), an endonuclease for Ino on RNA, specifically recognizes and cleaves Ino on RNA, and generates a 3 ' OH and 5 ' phosphate nick at its downstream 3 ' end second phosphodiester bond. Based on the characteristics, the invention provides a single base resolution positioning method of Ino in RNA assisted by endonuclease.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provide an endonuclease-assisted single-base resolution positioning analysis method for inosine modification in RNA, namely an Ino single-base resolution positioning analysis method which can be applied to the field of RNA modification sequencing, does not need chemical reagent treatment, does not need complex bioinformatics algorithm, and has high sensitivity, high selectivity and simple and convenient operation.
The purpose of the invention is realized by the following technical scheme:
an endonuclease-assisted single-base resolution localization analysis method for inosine modification in RNA comprises the following steps:
(1) fragmenting RNA of a cell or tissue sample;
(2) 3' OH end-blocking of fragmented RNA: adding 3 'deoxy-A to the 3' OH end of the fragmented RNA using polyA polymerase catalyzed 3 'deoxy-ATP to block the 3' OH end of the RNA, followed by purification of the RNA;
(3) cleaving the 3 'OH-end-blocked RNA using the hEndoV protein to generate a new RNA 3' OH at inosine;
(4) 3' polyuridylation of cleaved RNA: catalyzing UTP by using polyU polymerase to add U tail to the 3 'OH end of RNA, and carrying out 3' polyuridylation;
(5) taking 3 'polyuridylated RNA as a template, and carrying out reverse transcription reaction by adopting a reverse transcription primer with polyadenylic acid and a 3' joint to obtain cDNA;
(6) adding a 5 'linker to the cDNA by a DNA polymerase extension reaction using a primer having a 5' linker to obtain a DNA for amplification;
(7) performing PCR amplification by using the DNA as a template, and sequencing an amplified product;
(8) single base recognition with inosine modification: and the second base G from 3 'to 5' after the double-ended joint is removed in the sequencing result is an inosine modified base.
According to the method, Ino is specifically cut by the hEndoV protein, and single-base resolution positioning information of Ino is obtained through sequencing.
The aforementioned cleavage reaction of the hEndoV protein: NEBuffer 4(NEB, cat # B7004) was used as a reaction buffer, and the buffer system of the buffer reaction solution was Tris-CH 3 COOH,Mg 2+ Is 10mM, and a certain amount of the hEndoV protein (determined according to the activity of the actual protein) is used, and the reaction condition is preferably 37 ℃ for 30 min.
The aforementioned cleavage reaction of the hENDOV protein refers to the specific cleavage reaction of the hENDOV protein to Ino on RNA.
The invention has the following advantages and beneficial effects:
1) the invention has direct principle, simple method and no need of complex pretreatment.
2) The invention does not need chemical reaction to process RNA and can not cause violent degradation of RNA.
3) The hENDOV protein cleavage reaction can be completed in only 30 minutes, and the whole experiment time is greatly shortened.
4) The cleavage reaction involved in the invention has high specificity (only Ino is cleaved, and A is not cleaved) and high efficiency (complete cleavage), and is beneficial to positioning analysis.
5) The RNA library construction method does not need single-strand ligation reaction, has high library construction efficiency, and is beneficial to sequencing and site analysis.
6) The invention does not relate to a complex bioinformatics algorithm and is beneficial to the identification of the A-to-Ino editing sites.
Drawings
FIG. 1 is a schematic diagram of a single base resolution mapping analysis method for inosine modification in RNA assisted by endonuclease according to the present invention. The 3 'OH end of RNA was first blocked by 3' -deoxy-a, then a new 3 'OH was generated at the 3' second phosphodiester bond of Ino by hEndoV protein cleavage, the newly generated 3 'OH was subsequently 3' polyuridylated, and single base resolution mapping analysis of Ino was achieved via library construction and sequencing.
FIG. 2 is a PAGE gel of the hEndoV protein of the present invention specifically cleaving an artificially synthesized RNA containing an Ino modification. (A) Schematic representation of the Ino modification on the hEndoV cleavage RNA; (B) the cleavage effect of the hAndoV protein was evaluated using synthetic RNA strands containing Ino modification (Ino-RNA-1 and Ino-RNA-2, see Table 2 for sequence) as substrates, and (C) the cleavage specificity of the hAndoV protein was evaluated using RNA strands not containing Ino modification (A-RNA-1, see Table 2 for sequence) as substrates.
FIG. 3 is a diagram showing the sequencing effect of single-base resolution mapping analysis of inosine modification on a synthetic 45nt Ino-RNA-1 oligonucleotide chain in the present invention.
Detailed Description
In some embodiments, the method for single base resolution mapping analysis of inosine modification in endonuclease-assisted RNA of the present invention specifically comprises the following steps:
(1) fragmenting; taking 5 μ g of mRNA extracted from tissue or cells of a biological sample, using NEBNext RNA Mg ++ The fragmentation module (NEB, cat # E6150) fragments the mRNA to around 1000nt in length.
(2) 3' OH end blocking of the above RNA: to the above mRNA were added 5U of polyA polymerase (NEB, cat # M0276), 1 XPolyA reaction buffer (dispensed with the enzyme), 1mM 3' deoxy-ATP (Sigma-Aldrich, cat # C9137) and 20U of RNase inhibitor, reacted at 37 ℃ for 30min, and then RNA was purified using RNA Clean & concentrate Kits (ZYMO RESEARCH, cat # R1013) (10. mu.L eluted).
(3) The above RNA was subjected to hEndoV cleavage: to the above 10. mu.L of RNA were added a certain amount of hEndoV protein, 1 XNEBuffer 4 buffer (NEB, cat. No. B7004) and 20U of RNase inhibitor to a final volume of 20. mu.L, and cleavage reaction was carried out at 37 ℃ for 30 min.
The expression process of the hENDOV protein is as follows, wherein the full-length coding sequence (shown in table 1) of the hENDOV is inserted into a pET-28a (+) plasmid to obtain a hENDOV-pET-28a (+) recombinant plasmid, and the hENDOV-pET-28a (+) recombinant plasmid is transformed into escherichia coli BL21(DE 3); the conditions for inducing protein expression are that the IPTG concentration is 1mM, the induction temperature is 25 ℃, and the induction time is 10 h; the protein was purified using a Beyogold His-tag Purification Resin (reduction-resistant chelating type) (Biyuntian Biotechnology, cat # P2210). The procedure for assessing the activity of the hAndoV protein was as follows, and a total of 10. mu.L of a reaction system containing 1. mu.M of RNA (Ino-RNA-1, Ino-RNA-2 and A-RNA-1, see Table 2 for sequence), a certain amount of hAndoV, 20U of RNase inhibitor and 1 XNEBuffer 4 buffer (NEB, cat # B7004) was incubated at 37 ℃ for 30 minutes, and 1. mu.L of 10 Xloading buffer (Takara, cat # 9157) was added to the system to terminate the reaction, and the reaction results were confirmed by 20% PAGE (FIG. 2).
(4) 3' polyuridylation of the above RNA: to the above 20. mu.L reaction system were added 1U of polyU polymerase (NEB, cat. No. M0337), 1 XNEBuffer 2 buffer (dispensed with the enzyme), 100. mu.M UTP and 20U of RNase inhibitor to a final volume of 40. mu.L. This system was run at 37 ℃ for 8min and RNA was purified using RNA Clean & concentration Kits (ZYMO RESEARCH, cat. No. R1013) (20. mu.L elution).
(5) The RNA is subjected to reverse transcription to synthesize cDNA: a30. mu.L reverse transcription system consisted of 20. mu.L of the above RNA, 200UM-MuLV reverse transcriptase (Bio/industry, cat # B500517), 1 XT reaction buffer (distributed with the enzyme), 0.33. mu.M 3' adaptor-A 20 NC primers (sequence shown in Table 2), 0.5mM dNTP and 20U RNase inhibitor, and this system was reacted at 42 ℃ for 90 min.
(6) RNA removal and cDNA purification: to the reverse transcription reaction (30. mu.L) was added 1. mu.L exonuclease I (NEB, cat # M0293), 5. mu.L exonuclease I reaction buffer (dispensed with the enzyme), and 14. mu.L L H 2 O, reacting at 37 ℃ for 30min to remove excess reverse transcription primer; then, 10. mu.L NaOH (1M) and 10. mu.L EDTA (0.5M) were added to the exonuclease I reaction system (50. mu.L), and reacted at 65 ℃ for 15min to remove RNA in the system, using Oligo Clean&The cDNA was recovered from the Concentrator Kits (ZYMO RESEARCH, cat # D4060) (10. mu.L of elution).
(7) Extension of the cDNA above 5' linker was added: the reaction system (30. mu.L) consisted of 10. mu.L of the above cDNA, 5U of Bsu DNA polymerase (NEB, cat # M0330), 1 XNEBuffer 2 buffer (dispensed with the enzyme), 0.33. mu.M of 5' adaptor-N 6 Primers (sequences are shown in Table 2), 1mM dNTP, and the reaction is carried out at 37 ℃ for 1 h.
(8) DNA purification: mu.L of exonuclease I (NEB, cat # M0293), 5. mu.L of exonuclease I reaction buffer (dispensed with enzyme), and 14. mu. L H were added to the precursor system (30. mu.L) 2 O, reacting at 37 ℃ for 30min to remove excessive 5' adaptor-N 6 Primer, Oligo Clean is adopted&The DNA was recovered from the Concentrator Kits (ZYMO RESEARCH, cat. No. D4060) (15. mu.L of the eluate).
(9) The above DNA was subjected to PCR amplification: the PCR system (50. mu.L) consisted of 15. mu.L of the above DNA, 2 Xamplification buffer (holy next, cat. 10136ES01), 0.5. mu. M P5 and P7 primer (sequence see Table 2). Amplification cycle time program: firstly, denaturation is carried out for 1min at 98 ℃; ② denaturation at 98 ℃ for 10 sec; ③ annealing at 60 ℃ for 25 sec; extension at 72 ℃ for 50 sec; the fourth step is repeated for 30 times; extension at 72 ℃ for 10min and storage at 4 ℃.
(10) The resulting amplification product was sequenced.
TABLE 1 full length coding sequence of hEndoV
TABLE 2 oligonucleotide chains referred to in the embodiments
The following examples are intended to further illustrate the present invention and should not be construed as limiting the present invention, and any other changes, modifications, substitutions, combinations and simplifications which do not depart from the spirit and principle of the present invention should be construed as equivalents thereof, and shall be included within the scope of the present invention.
Unless otherwise specified, the technical means used in the examples include nucleic acid extraction, kit purification, reverse transcription, polymerase chain reaction, and the like, which are conventional means well known to those skilled in the art.
Example 1: standard synthetic RNA analysis
(1) 100ng of artificially synthesized Ino-RNA-1 strand (see the sequence in Table 3), 5U of polyA polymerase, 1mM of 3' deoxy-ATP, 1 XpolyA reaction buffer and 20U of RNase inhibitor were added to a 10. mu.L reaction system, and the RNA was purified by using a kit after reaction at 37 ℃ for 30 min.
(2) The RNA was subjected to the hEndoV protein cleavage reaction. To the above RNA were added an amount of the hENDOV protein, 1 XNEBuffer 4 and 20U of RNase inhibitor to make the final volume 20. mu.L, and the cleavage reaction was performed at 37 ℃ for 30 min.
(3) The RNA was subjected to polyuridine formation reaction. To the above 20. mu.L reaction system were added 1U of polyU polymerase, 100. mu.M UTP, 1 XNEBuffer 2 and 20U of RNase inhibitor to a final volume of 40. mu.L. The system was purified for RNA using a kit after 8min at 37 ℃.
(4) The RNA is subjected to reverse transcription. The reverse transcription system (30. mu.L) included 10. mu.L of the above RNA, 0.33. mu.M of 3' adaptor-A 20 GC primers (sequence shown in Table 3), 0.5mM dNTP, 200U reverse transcriptase, 1 × reverse transcription buffer, 20U RNase inhibitor, the system is reacted at 42 ℃ for 90 min. Since the standard synthetic RNA analysis system using Ino-RNA-1 is simple, the step of recovering cDNA here is simplified to add 1. mu.L of ribonuclease A to the reaction-terminated reverse transcription system, react at 37 ℃ for 30min to remove RNA, and recover cDNA by means of a kit.
(5) The cDNA was subjected to 5' linker addition reaction. The 5 'linker addition reaction system (30. mu.L) comprising 10. mu.L of the above cDNA, 5U Bsu DNA polymerase, 0.33. mu.M 5' adaptor primer (see sequence in Table 3), 1mM dNTP, 1 XNEBuffer 2 was performed at 37 ℃ for 1 hour.
(6) And (3) taking the reaction DNA to carry out polymerase chain amplification reaction. A25. mu.L PCR reaction was carried out using 2. mu.L of the above DNA, 12.5. mu.L of 2 Xamplification buffer, 10. mu. M P5, and 1.25. mu.L and 8. mu. L H of each of the P7 primers (see Table 3 for sequences) 2 And (C) O. Amplification cycle time program: firstly, denaturation is carried out for 1min at 98 ℃; ② denaturation at 98 ℃ for 10 sec; ③ annealing at 60 ℃ for 25 sec; extension at 72 ℃ for 50 sec; the fourth step is repeated for 30 times; extension at 72 ℃ for 10min and storage at 4 ℃. Samples were directly subjected to Sanger sequencing, the sequencing results are shown in FIG. 3, where the second base 3 'to 5' after removal of the linker is G (boxed label), which is the inosine modification in the original Ino-RNA-1.
TABLE 3 oligonucleotide chains referred to in example 1
Sequence listing
<110> Wuhan university
<120> endonuclease-assisted single-base resolution positioning analysis method for inosine modification in RNA
<160> 1
<170> SIPOSequenceListing 1.0
<210> 1
<211> 846
<212> DNA
<213> Homo sapiens
<400> 1
atggccctgg aggcggcggg agggccgccg gaggaaacgc tgtcactgtg gaaacgggag 60
caagctcggc tgaaggccca cgtcgtagac cgggacaccg aggcgtggca gcgagacccc 120
gccttctcgg gtctgcagag ggtcgggggc gttgacgtgt ccttcgtgaa aggggacagt 180
gtccgcgctt gtgcttccct ggtggtgctc agcttccctg agctcgaggt ggtgtatgag 240
gagagccgca tggtcagcct cacagccccc tacgtgtcgg gcttcctggc cttccgagag 300
gtgcccttct tgctggagct ggtgcagcag ctgcgggaga aggagccggg cctcatgccc 360
caggtccttc ttgtggatgg aaacggggta ctccaccacc gaggctttgg ggtggcctgc 420
caccttggcg tccttacaga cctgccgtgt gttggggtgg ccaagaaact tctgcaggtg 480
gatgggctgg agaacaacgc cctgcacaag gagaagatcc gactcctgca gactcgagga 540
gactcattcc ctctgctggg agactctggg actgtcctgg gaatggccct gaggagccac 600
gaccgcagca ccaggcccct ctacatctcc gtgggccaca ggatgagcct ggaggccgct 660
gtgcgcctga cttgctgctg ctgcaggttc cggatcccag agcccgtgcg ccaggctgac 720
atctgctccc gagagcacat ccgcaagtcg ctgggactcc ccgggccacc cacaccgagg 780
agcccgaagg cgcagaggcc agtggcatgc cccaaaggag actccggaga gtcctcagca 840
ctttgt 846
Claims (5)
1. An endonuclease-assisted single-base resolution localization analysis method for inosine modification in RNA is characterized in that: the method comprises the following steps:
(1) fragmenting RNA of a cell or tissue sample;
(2) 3' OH end-capping the fragmented RNA;
(3) cleaving the 3' OH-terminated blocked RNA using the hEndoV protein;
(4) 3' polyuridylation is carried out on the cut RNA;
(5) taking 3 'polyuridylated RNA as a template, and carrying out reverse transcription reaction by adopting a reverse transcription primer with polyadenyle and a 3' joint to obtain cDNA;
(6) adding a 5 'linker to the cDNA by a DNA polymerase extension reaction using a primer having a 5' linker to obtain a DNA for amplification;
(7) performing PCR amplification by using the DNA as a template, and sequencing an amplified product;
(8) single base recognition with inosine modification: and the second base G from 3 'to 5' after the double-ended joint is removed in the sequencing result is an inosine modified base.
2. The method for single base resolution mapping analysis of inosine modification in RNA assisted by endonuclease according to claim 1, wherein: the step (2) is as follows: polyA polymerase is used to catalyze 3 ' deoxy-ATP addition of 3 ' deoxy-a to the 3 ' OH end of fragmented RNA.
3. The method for single base resolution mapping analysis of inosine modification in RNA assisted by endonuclease according to claim 1, wherein: in the step (3), the reaction buffer for RNA cleavage using the hENDOV protein was Tris-CH 3 COOH is a buffer system in which Mg is present 2+ Is 10 mM.
4. The method for single base resolution mapping analysis of inosine modification in RNA supported by endonuclease according to claim 3, wherein: the reaction conditions for RNA cleavage were 37 ℃ for 30 min.
5. The method for single base resolution mapping analysis of inosine modification in RNA assisted by endonuclease according to claim 1, wherein: the step (4) is as follows: UTP was catalyzed by polyU polymerase to add a U-tail to the 3 'OH terminus of RNA for 3' polyuridylation.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210623985.8A CN115125292A (en) | 2022-06-02 | 2022-06-02 | Single-base resolution positioning analysis method for inosine modification in RNA assisted by endonuclease |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210623985.8A CN115125292A (en) | 2022-06-02 | 2022-06-02 | Single-base resolution positioning analysis method for inosine modification in RNA assisted by endonuclease |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115125292A true CN115125292A (en) | 2022-09-30 |
Family
ID=83377322
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210623985.8A Pending CN115125292A (en) | 2022-06-02 | 2022-06-02 | Single-base resolution positioning analysis method for inosine modification in RNA assisted by endonuclease |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115125292A (en) |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070020639A1 (en) * | 2005-07-20 | 2007-01-25 | Affymetrix, Inc. | Isothermal locus specific amplification |
CN102732629A (en) * | 2012-08-01 | 2012-10-17 | 复旦大学 | Method for concurrently determining gene expression level and polyadenylic acid tailing by using high-throughput sequencing |
CN105492629A (en) * | 2013-07-18 | 2016-04-13 | 哈佛学院院长及董事 | Specific nucleic acid amplification with compounded selectivity |
WO2018090373A1 (en) * | 2016-11-21 | 2018-05-24 | 深圳华大智造科技有限公司 | Method for repairing dna terminal end and adding a |
CN108841934A (en) * | 2018-08-06 | 2018-11-20 | 中国科学院动物研究所 | A kind of method and kit of the N7- methylguanosine modification of the full inside transcript profile range single base resolution ratio detection RNA |
US20190284624A1 (en) * | 2018-03-15 | 2019-09-19 | Massachusetts Institute Of Technology | Methods of quantifying rna and dna variants through sequencing employing phosphorothioates |
CN110358817A (en) * | 2019-06-25 | 2019-10-22 | 武汉大学 | 5- carboxyl cytimidine modifies single base resolution ratio method for positioning analyzing in the DNA of deaminase auxiliary |
CN111154837A (en) * | 2019-09-02 | 2020-05-15 | 浙江大学 | Method and kit for detecting RNA N6-methyladenine modification in single base resolution in full transcriptome range |
CN114134205A (en) * | 2021-11-25 | 2022-03-04 | 武汉大学 | Deaminase-mediated N in DNA4Single base resolution positioning analysis method of-methylcytosine |
-
2022
- 2022-06-02 CN CN202210623985.8A patent/CN115125292A/en active Pending
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070020639A1 (en) * | 2005-07-20 | 2007-01-25 | Affymetrix, Inc. | Isothermal locus specific amplification |
CN102732629A (en) * | 2012-08-01 | 2012-10-17 | 复旦大学 | Method for concurrently determining gene expression level and polyadenylic acid tailing by using high-throughput sequencing |
CN105492629A (en) * | 2013-07-18 | 2016-04-13 | 哈佛学院院长及董事 | Specific nucleic acid amplification with compounded selectivity |
WO2018090373A1 (en) * | 2016-11-21 | 2018-05-24 | 深圳华大智造科技有限公司 | Method for repairing dna terminal end and adding a |
US20190284624A1 (en) * | 2018-03-15 | 2019-09-19 | Massachusetts Institute Of Technology | Methods of quantifying rna and dna variants through sequencing employing phosphorothioates |
CN108841934A (en) * | 2018-08-06 | 2018-11-20 | 中国科学院动物研究所 | A kind of method and kit of the N7- methylguanosine modification of the full inside transcript profile range single base resolution ratio detection RNA |
CN110358817A (en) * | 2019-06-25 | 2019-10-22 | 武汉大学 | 5- carboxyl cytimidine modifies single base resolution ratio method for positioning analyzing in the DNA of deaminase auxiliary |
CN111154837A (en) * | 2019-09-02 | 2020-05-15 | 浙江大学 | Method and kit for detecting RNA N6-methyladenine modification in single base resolution in full transcriptome range |
CN114134205A (en) * | 2021-11-25 | 2022-03-04 | 武汉大学 | Deaminase-mediated N in DNA4Single base resolution positioning analysis method of-methylcytosine |
Non-Patent Citations (4)
Title |
---|
CHEN LQ等: "Mapping and editing of nucleic acid modifications",Chen LQ等,Comput Struct Biotechnol J,第18卷,第661-667页,公开日期:20200319", COMPUT STRUCT BIOTECHNOL J, vol. 18, 19 March 2020 (2020-03-19), pages 661 - 667 * |
KNUTSON SD等: "EndoVIPER-seq for Improved Detection of A-to-I Editing Sites in Cellular RNA", CURR PROTOC CHEM BIOL, vol. 12, no. 2, 30 June 2020 (2020-06-30), pages 1 - 30 * |
朱文煜等: "基于高通量测序的RNA修饰检测技术", 生命科学, vol. 33, no. 3, 31 March 2021 (2021-03-31), pages 292 - 302 * |
游雪娇;袁必锋;冯钰;: "RNA修饰分析方法研究进展", 分析测试学报, no. 10, 22 October 2018 (2018-10-22), pages 15 - 29 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DK2245187T3 (en) | Methods for accurate sequence data and modified due to localization | |
CN115181783A (en) | Bisulfite-free base resolution identification of cytosine modifications | |
JP2005535283A (en) | DNA amplification and sequencing using DNA molecules generated by random fragmentation | |
CN112352057A (en) | Compositions and methods for cancer or neoplasia assessment | |
CN114174530A (en) | Methods and compositions for analyzing nucleic acids | |
CN114901818A (en) | Methods of targeted nucleic acid library formation | |
EP3674419A1 (en) | Probe and method applying the same for enriching target region in high-throughput sequencing | |
EP4083231A1 (en) | Compositions and methods for nucleic acid analysis | |
WO2015145133A1 (en) | Nucleic acid preparation method | |
CN109825552B (en) | Primer and method for enriching target region | |
WO2016063034A1 (en) | Improved nucleic acid sample preparation using concatenation | |
JP2002542838A5 (en) | ||
CN114134205B (en) | Deaminase mediated N in DNA 4 Single base resolution localization analysis method of methyl cytosine | |
US20030108874A1 (en) | Compositions and systems for identifying and comparing expressed genes (mRNAs) in eukaryotic organisms | |
CN115125292A (en) | Single-base resolution positioning analysis method for inosine modification in RNA assisted by endonuclease | |
EP4172357B1 (en) | Methods and compositions for analyzing nucleic acid | |
CN113817723B (en) | Polynucleotide and standard substance, kit and application thereof | |
JP3845416B2 (en) | Gene tag acquisition method | |
CN110724729A (en) | Fluorescent probe | |
CN113981043B (en) | Method for preparing second generation sequencing joint | |
CN113774121B (en) | Low sample size m based on RNA (ribonucleic acid) connection tag 6 A high throughput sequencing method | |
CN116287166A (en) | Methylation sequencing joint and application thereof | |
Shagin et al. | MOLECULAR BIOLOGY APPLICATIONS OF THE RED KING CRAB DUPLEX-SPECIFIC NUCLEASE | |
Shagin et al. | REVIEW MOLECULAR BIOLOGY | |
WO2023158739A2 (en) | Methods and compositions for analyzing nucleic acid |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |