WO2022257354A1 - 检测核酸分子中n6-甲基腺嘌呤的方法及试剂盒 - Google Patents

检测核酸分子中n6-甲基腺嘌呤的方法及试剂盒 Download PDF

Info

Publication number
WO2022257354A1
WO2022257354A1 PCT/CN2021/130282 CN2021130282W WO2022257354A1 WO 2022257354 A1 WO2022257354 A1 WO 2022257354A1 CN 2021130282 W CN2021130282 W CN 2021130282W WO 2022257354 A1 WO2022257354 A1 WO 2022257354A1
Authority
WO
WIPO (PCT)
Prior art keywords
carbonyl compound
nucleic acid
buffer
product obtained
heat treatment
Prior art date
Application number
PCT/CN2021/130282
Other languages
English (en)
French (fr)
Inventor
伊成器
王晶
Original Assignee
北京大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京大学 filed Critical 北京大学
Priority to EP21944432.0A priority Critical patent/EP4353832A1/en
Publication of WO2022257354A1 publication Critical patent/WO2022257354A1/zh

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P20/00Technologies relating to chemical industry
    • Y02P20/50Improvements relating to the production of bulk chemicals
    • Y02P20/55Design of synthesis routes, e.g. reducing the use of auxiliary or protecting groups

Definitions

  • the present application relates to the field of molecular biology, especially the field of nucleic acid detection and analysis.
  • the present application provides a method for detecting N6-methyladenine in nucleic acid molecules.
  • the present application also provides a kit, which can be used to implement the method of the present application.
  • m 6 ACE-seq can also achieve single-base sequencing by using antibody cross-linking and exonuclease digestion.
  • the above methods all rely on m 6 A antibodies, have poor reproducibility, require a large amount of samples, and are difficult to quantitatively detect m 6 A modification.
  • Chem.Sci., 2021, 12, 606 disclosed a method of using sodium nitrite in acetic acid aqueous solution to realize the conversion of adenine (A) to hypoxanthine (I) in nucleic acid (DNA/RNA). Or it is recognized as guanine (G) during reverse transcription, but methyladenine is still read as A. By comparing the transformation of AG, it can be judged whether the nucleic acid contains m 6 A modification.
  • NOseq (Nucleic Acids Res.2021 Feb 26; 49(4):e23) also uses nitrous acid compounds to treat A to deaminate it into I and read it as G, while m 6 A reacts to NO-m 6 A and cannot Deamination is still recognized as A, thereby distinguishing m6A sites.
  • the conversion efficiency of A to I is between 10% and 50%, and there is still a large amount of A remaining at the site of conventional A, so there is a lot of background in the analysis of m 6 A, and it is impossible to analyze low-abundance methyl groups chemical site.
  • this method can only identify certain methylation sites in specific sequences, but cannot detect m 6 A modifications in the whole transcriptome.
  • G will be converted into xanthine (X) by deamination, and X will cause transcription termination, and at the same time, mismatches will occur and read as A to produce false positives. It will be deaminated and converted to U, which will bring difficulties to subsequent analysis.
  • the inventors of the present application have developed a new method for detecting N6-methyladenine in nucleic acid molecules based on intensive research.
  • the method of the present application utilizes nitrite and carbonyl compounds to catalyze the deamination of adenine into hypoxanthine, while N6-methyladenine cannot be deaminated due to its stable chemical properties. Therefore, the method of the present application realizes the detection of m 6 A modification level or m 6 A site by analyzing the signals of adenine or hypoxanthine before and after conversion.
  • the method has high sensitivity and accurate quantification, and can realize the detection of N6-methyladenine on the whole genome or whole transcriptome at the single base level.
  • the application provides a kind of method for detecting N6-methyladenine in nucleic acid molecule, it comprises the following steps:
  • the protected nucleic acid molecule is reacted with nitrite to convert the adenine in the nucleic acid molecule to be tested into hypoxanthine;
  • any base in the nucleic acid molecule except adenine can be protected by any method, wherein the protection can prevent or hinder the deamination reaction of the protected base.
  • the protection is reversible or irreversible.
  • the protection is irreversible, and the protected bases are capable of complementary base pairing.
  • the method of the present application utilizes nitrite to deaminate the adenine in the nucleic acid molecule to be tested and convert it into hypoxanthine, while N6-methyladenine cannot be deaminated because of its stable chemical properties.
  • the signal of xanthine realizes the detection of m 6 A modification level or m 6 A site.
  • the method can detect the m 6 A modification level by directly detecting the signal of adenine or hypoxanthine in the product of the nucleic acid to be tested after undergoing a catalytic deamination reaction.
  • the m 6 A modification level can be achieved by detecting the signal of adenine or hypoxanthine in the product of the deamination catalytic reaction in the test nucleic acid by methods such as mass spectrometry, enzyme fragmentation and/or chromatography detection.
  • the method can realize m 6 A modification level and m 6 A site detection.
  • the method can detect the sequence information of the amplified product or the reverse transcription product of the nucleic acid to be tested by sequencing or hybridization to achieve the m 6 A modification level and the m 6 A site detection.
  • the inventor speculates that in the method of the present invention, nitrite is used to convert adenine deamination into hypoxanthine in the nucleic acid molecule to be tested, and in the process of converting adenine (A) to hypoxanthine (I) , the first carbonyl compound (such as glyoxal) plays a catalytic role and can effectively improve the conversion efficiency of A to I in nucleic acid molecules.
  • the exemplary catalytic reaction mechanism is as shown in Figure 1.
  • the inventor has detected the imine intermediate formed by the combination of glyoxal and the amino group of adenine, and its secondary mass spectrum peak pattern is as follows: Figure 27 shows.
  • xanthine Since the deamination of guanine in the nitrosation reaction will be converted into xanthine (X), and xanthine will lead to the termination of nucleic acid amplification or reverse transcription process with a certain probability, or, xanthine will be combined with thymus in the process of nucleic acid amplification or reverse transcription
  • Complementary pairing of pyrimidines, which are read as adenines, interferes with the detection of m 6 A modification levels and m 6 A sites. Therefore, in certain embodiments, it is particularly advantageous to protect the amino group of guanine in the nucleic acid molecule to be tested.
  • step (1) of the method the amino group of guanine in the nucleic acid molecule to be tested is protected with a second carbonyl compound.
  • the second carbonyl compound (taking glyoxal as an example) to protect the amino group of guanine in the nucleic acid molecule to be tested, its reaction mechanism is as shown in Figure 2, the second carbonyl compound such as glyoxal can be with guanine N 1 , N The 2 -position reacts to form N 1 , N 2 -dihydroxyguanosine, thereby reducing the deamination efficiency of guanine.
  • the carbonyl compound is derived from:
  • the second carbonyl compound is the same or different than the first carbonyl compound. In certain embodiments, the second carbonyl compound is the same as the first carbonyl compound.
  • the step (2) does not include the step of adding a first carbonyl compound, and the carbonyl compound is derived from: the second carbonyl compound added in step (1); or, step (2) includes A step of adding a first carbonyl compound, and the first carbonyl compound is the same or different from the second carbonyl compound.
  • step (3) the guanine in the product is optionally deprotected.
  • the first carbonyl compound and the second carbonyl compound are each independently selected from compounds represented by formula I and any combination thereof,
  • R 1 is aldehyde group, C 1-3 alkyl (such as methyl, ethyl or n-propyl) or halogen substituted C 1-3 alkyl (such as trichloromethyl, trifluoromethyl),
  • R1 and R2 form a ring structure with the carbonyl to which they are attached, such as ninhydrin.
  • the first carbonyl compound and the second carbonyl compound are each independently selected from: glyoxal, 2,3-butanedione, ninhydrin, 2-bromomalonaldehyde, methylglyoxal, Chloral, phenylglyoxal, and any combination thereof.
  • the first carbonyl compound is selected from the group consisting of: glyoxal, 2,3-butanedione, ninhydrin, 2-bromomalondialdehyde, acetoaldehyde, chloral, phenylacetaldehyde Dialdehydes and any combination thereof.
  • the first carbonyl compound is selected from the group consisting of glyoxal, 2-bromomalondialdehyde, 2,3-butanedione, phenylglyoxal, chloral, ninhydrin and any combination thereof.
  • the second carbonyl compound is selected from the group consisting of glyoxal, ninhydrin, and any combination thereof.
  • step (1) of the method the nucleic acid molecule to be tested is contacted with the second carbonyl compound in the first solvent.
  • the first solvent is water.
  • the nucleic acid molecule to be tested and the second carbonyl compound are in the first solvent, and in dimethylsulfoxide (DMSO) or N,N-dimethylformamide ( DMF) in the presence of contact.
  • DMSO dimethylsulfoxide
  • DMF N,N-dimethylformamide
  • the nucleic acid molecule to be detected and the second carbonyl compound are in a first solvent, and in a mixture of dimethyl sulfoxide (DMSO) and boric acid or a salt thereof (such as potassium borate) Contact exists.
  • DMSO dimethyl sulfoxide
  • boric acid or a salt thereof such as potassium borate
  • the nucleic acid molecule to be detected and the second carbonyl compound are in the first solvent, and in N,N-dimethylformamide (DMF) and boric acid or its salt (such as in the presence of potassium borate).
  • DMF N,N-dimethylformamide
  • boric acid or its salt such as in the presence of potassium borate
  • the final concentration of dimethylsulfoxide (DMSO) is 20v/v%-90v/v%, for example about 30v/v%, about 40v/v%, About 50v/v%, about 60v/v%, about 70v/v%, about 80v/v%.
  • the final concentration of the boric acid or its salt is 5mM-150mM (such as 5mM-10mM, 10mM-50mM, 50mM-80mM, 80mM-100mM).
  • the final concentration of the second carbonyl compound is greater than or equal to 20mM, preferably 20mM-3M (such as 20mM-100mM, 100mM-500mM, 500mM-1000mM, 1M-1.5 M, 1.5 ⁇ 2M, 2M ⁇ 2.6M, such as about 0.8M, 1.3M, 1.8M).
  • step (1) of the method the nucleic acid molecule to be tested and the second carbonyl compound are heated at 16-60°C (for example, about 20°C, about 25°C, about 30°C, about 35°C, about 40°C, about 45°C, about 50°C, about 55°C) temperature conditions.
  • step (1) of the method the nucleic acid molecule to be tested and the second carbonyl compound are heated at 16-60°C (for example, about 20°C, about 25°C, about 30°C, about 35°C, about 40°C, about 45°C, about 50°C, about 55°C) for 15-60 minutes (for example, about 20 minutes, about 30 minutes, about 40 minutes, about 50 minutes).
  • 16-60°C for example, about 20°C, about 25°C, about 30°C, about 35°C, about 40°C, about 45°C, about 50°C, about 55°C
  • the method further includes: before step (1), a step of pretreating the nucleic acid molecules to be tested.
  • the pretreatment comprises purification, fragmentation, denaturation, or any combination thereof of the nucleic acid molecule to be tested.
  • the pretreatment includes denaturing the nucleic acid molecules to be tested with aqueous formamide or alkaline solution.
  • step (2) of the method the protected nucleic acid molecule is reacted with nitrite in a second solvent in the presence of a carbonyl compound.
  • the second solvent is water, 2-(N-morpholine)ethanesulfonic acid (MES) buffer, sodium acetate buffer, 3-(N-morpholine)propanesulfonic acid (MOPS ) buffer, piperazine-1,4-diethanesulfonic acid (PIPES) buffer, 4-hydroxyethylpiperazineethanesulfonic acid (HEPPS) buffer or trishydroxymethylaminomethane (TRIS) buffer or random combination.
  • the second solvent can also be an aqueous solution of p-toluenesulfonic acid or aqueous phosphoric acid.
  • the second solvent is water, aqueous p-toluenesulfonic acid, aqueous phosphoric acid, 2-(N-morpholine)ethanesulfonic acid (MES) buffer, sodium acetate buffer, 3-(N -morpholine) propanesulfonic acid (MOPS) buffer, piperazine-1,4-diethanesulfonic acid (PIPES) buffer, 4-hydroxyethylpiperazineethanesulfonic acid (HEPPS) buffer or trimethylol Aminomethane (TRIS) buffer or any combination thereof.
  • MES 2-(N-morpholine)ethanesulfonic acid
  • MOPS 3-(N -morpholine) propanesulfonic acid
  • PPES piperazine-1,4-diethanesulfonic acid
  • HPPS 4-hydroxyethylpiperazineethanesulfonic acid
  • TMS trimethylol Aminomethane
  • the second solvent is 2-(N-morpholine)ethanesulfonic acid (MES) buffer or sodium acetate buffer.
  • MES 2-(N-morpholine)ethanesulfonic acid
  • the second solvent is an aqueous solution of p-toluenesulfonic acid, 2-(N-morpholine)ethanesulfonic acid (MES) buffer, or sodium acetate buffer.
  • MES 2-(N-morpholine)ethanesulfonic acid
  • the second solvent is aqueous p-toluenesulfonic acid, aqueous phosphoric acid, 2-(N-morpholine)ethanesulfonic acid (MES) buffer, or sodium acetate buffer.
  • MES 2-(N-morpholine)ethanesulfonic acid
  • the second solvent is an aqueous p-toluenesulfonic acid solution with a pH of 4.5-6.5 (eg, 5-6).
  • the second solvent is an aqueous phosphoric acid solution with a pH of 4.5-6.5 (eg, 5-6).
  • the second solvent is 2-(N-morpholine)ethanesulfonic acid (MES) buffer, wherein the final concentration of 2-(N-morpholine)ethanesulfonic acid (MES) is 20mM ⁇ 750 mM (eg, about 40 mM, about 80 mM, about 100 mM, about 150 mM, about 200 mM, about 250 mM, about 500 mM).
  • MES 2-(N-morpholine)ethanesulfonic acid
  • the second solvent is a sodium acetate buffer, wherein the final concentration of the sodium acetate is 300mM-400mM.
  • step (2) in the presence of a carbonyl compound, the protected nucleic acid molecule is mixed with nitrite in a second solvent, and in the presence of boric acid or a salt thereof (such as potassium borate) reaction.
  • boric acid or a salt thereof such as potassium borate
  • the total molar concentration (final concentration) of carbonyl compounds in the second solvent is greater than or equal to 50 mM, such as greater than or equal to 250 mM, (such as 50-2000 mM, 50-200 mM, 200-500 mM, 500- 800mM, 800-1200mM, 1200-1400mM, 1400-1800mM, 1800-2000mM, eg 105mM).
  • the second solvent contains a final concentration of 50-2000mM (such as 50-200mM, 200-500mM, 500-800mM, 800-1200mM, 1200-1400mM, 1400-1800mM, 1800-2000mM, such as 105mM) and methylglyoxal at a final concentration of 50-1500mM (eg, 50-60mM, 60-100mM, 100-300mM, 300-600mM, 600-800mM, 800-1200mM, 1200-1500mM).
  • 50-2000mM such as 50-200mM, 200-500mM, 500-800mM, 800-1200mM, 1200-1400mM, 1400-1800mM, 1800-2000mM, such as 105mM
  • 50-1500mM eg, 50-60mM, 60-100mM, 100-300mM, 300-600mM, 600-800mM, 800-1200mM, 1200-1500mM.
  • the second solvent contains a final concentration of 50-2000mM (such as 50-200mM, 200-500mM, 500-800mM, 800-1200mM, 1200-1400mM, 1400-1800mM, 1800-2000mM, such as 105mM) and a final concentration of 50-2500mM (e.g. 50-100mM, 100-200mM, 200-500mM, 500-800mM, 800-1200mM, 1200-1500mM, 1500-1800mM, 1800-2200mM, 2200-2500mM ) of 2,3-butanedione.
  • 50-2000mM such as 50-200mM, 200-500mM, 500-800mM, 800-1200mM, 1200-1500mM, 1500-1800mM, 1800-2200mM, 2200-2500mM
  • 50-2500mM e.g. 50-100mM, 100-200mM, 200-500mM, 500-800mM, 800-1200mM, 1200
  • the second solvent contains a final concentration of 50-2000mM (such as 50-200mM, 200-500mM, 500-800mM, 800-1200mM, 1200-1400mM, 1400-1800mM, 1800-2000mM, such as 105mM) and ninhydrin at a final concentration of 20-1500mM (eg 20-100mM, 100-200mM, 200-500mM, 500-800mM, 800-1200mM, 1200-1500mM, about 60mM).
  • 50-2000mM such as 50-200mM, 200-500mM, 500-800mM, 800-1200mM, 1200-1500mM, about 60mM.
  • the second solvent contains a final concentration of 50-2000mM (such as 50-200mM, 200-500mM, 500-800mM, 800-1200mM, 1200-1400mM, 1400-1800mM, 1800-2000mM, such as 105mM) of glyoxal and 2-bromomalonaldehyde at a final concentration of 50-1500mM (e.g. 50-100mM, 100-200mM, 200-400mM, 400-500mM, 500-800mM, 800-1200mM, 1200-1500mM) .
  • 50-2000mM such as 50-200mM, 200-500mM, 500-800mM, 800-1200mM, 1200-1500mM
  • the second solvent contains a final concentration of 50-2000mM (such as 50-200mM, 200-500mM, 500-800mM, 800-1200mM, 1200-1400mM, 1400-1800mM, 1800-2000mM, such as 105mM) and chloral at a final concentration of 50-1500mM (eg, 50-100mM, 100-400mM, 400-500mM, 500-800mM, 800-1200mM, 1200-1500mM).
  • 50-2000mM such as 50-200mM, 200-500mM, 500-800mM, 800-1200mM, 1200-1500mM.
  • the second solvent contains a final concentration of 50-2000mM (such as 50-200mM, 200-500mM, 500-800mM, 800-1200mM, 1200-1400mM, 1400-1800mM, 1800-2000mM, such as 105 mM) and phenylglyoxal at a final concentration of 50-1500 mM (eg, 50-100 mM, 100-400 mM, 400-500 mM, 500-800 mM, 800-1200 mM, 1200-1500 mM).
  • 50-2000mM such as 50-200mM, 200-500mM, 500-800mM, 800-1200mM, 1200-1500 mM.
  • the second solvent contains a final concentration greater than or equal to 50 mM, such as greater than or equal to 250 mM (eg, 50-2000 mM, 50-200 mM, 200-500 mM, 500-800 mM, 800-1200 mM, 1200-1400 mM , 1400-1800mM, 1800-2000mM, eg 105mM) glyoxal.
  • 50 mM such as greater than or equal to 250 mM (eg, 50-2000 mM, 50-200 mM, 200-500 mM, 500-800 mM, 800-1200 mM, 1200-1400 mM , 1400-1800mM, 1800-2000mM, eg 105mM) glyoxal.
  • the final concentration of the boric acid or its salt is 5mM-150mM (such as 5mM-10mM, 10mM-50mM, 50mM-80mM, 80mM-100mM).
  • the nitrite is selected from sodium nitrite, potassium nitrite, and combinations of both.
  • the final concentration of the nitrite is 0.5M-1.5M, such as 0.5M-0.8M, 0.8-1.25M.
  • step (2) of the method in the presence of the first carbonyl compound, the protected nucleic acid molecule is mixed with nitrite in a second solvent, and in boric acid or a salt thereof (such as boric acid react in the presence of potassium).
  • N 1 , N 2 -dihydroxyguanosine formed after the amino group of guanine in the nucleic acid molecule is protected by a second carbonyl compound such as glyoxal may be deprotected in step (2).
  • adding boric acid or its salt (such as H 3 BO 3 /K 3 BO 3 ) in step (2) can further protect N 1 , N 2 -dihydroxyguanosine, making its structure more stable, making Protecting groups are not easily removed.
  • the boric acid or salt thereof (eg, potassium borate) is derived from:
  • step (1) (i) boric acid or a salt thereof (such as potassium borate) added in step (1);
  • the step (2) does not include the step of adding boric acid or a salt thereof (such as potassium borate); alternatively, step (2) includes the step of adding boric acid or a salt thereof (such as potassium borate), and,
  • the boric acid or its salt (such as potassium borate) added in step (2) is the same or different from the boric acid or its salt (such as potassium borate) added in step (1).
  • boric acid or its salt in the second solvent is not limited by the reaction steps, for example, boric acid or its salt can be added before step (1) (for example, after boric acid or its salt is incubated with the second carbonyl compound , carry out the nucleic acid protection process of step (1) again);
  • boric acid or its salt can be directly added in step (1);
  • boric acid or its salt can be directly added in step (2);
  • boric acid or a salt thereof is added to the protected nucleic acid.
  • step (2) of the method the protected nucleic acid molecule and nitrite are heated at 12-60°C (for example, 16-60°C, 12-24°C, 24-40°C, 40- 60°C, about 16°C, about 20°C, about 25°C, about 30°C, about 35°C, about 40°C, about 45°C, about 50°C, about 55°C).
  • 12-60°C for example, 16-60°C, 12-24°C, 24-40°C, 40- 60°C, about 16°C, about 20°C, about 25°C, about 30°C, about 35°C, about 40°C, about 45°C, about 50°C, about 55°C.
  • step (2) the protected nucleic acid molecule and nitrite are heated at 12-60°C (such as 16-60°C, 12-24°C, 24-40°C, 40-60°C, About 16°C, about 20°C, about 25°C, about 30°C, about 35°C, about 40°C, about 45°C, about 50°C, about 55°C) for 10min to 24h (for example, 30min to 24h, 10min ⁇ 20min, 20min ⁇ 1h, 1h ⁇ 5h, 5h ⁇ 10h, 10h ⁇ 24h, about 20min, about 30min, about 40min, about 50min, about 8h).
  • 12-60°C such as 16-60°C, 12-24°C, 24-40°C, 40-60°C, About 16°C, about 20°C, about 25°C, about 30°C, about 35°C, about 40°C, about 45°C, about 50°C, about 55°C
  • 10min to 24h for example, 30min to 24h, 10min ⁇ 20min, 20min ⁇ 1
  • step (2) the protected nucleic acid molecule is reacted with nitrite at a temperature of 12-24°C (eg, about 16°C) for 5h-10h (eg, about 8h).
  • step (2) the protected nucleic acid molecule is reacted with nitrite at a temperature of 40-60°C (eg, about 50°C) for 20min-1h (eg, about 30min).
  • the method further includes: before step (4), the steps of purifying, reverse transcribing and/or amplifying the product obtained in the previous step.
  • the method further comprises: an additional step of detecting the nucleic acid molecule to be tested.
  • the detection includes nucleotide composition analysis by sequencing or hybridization, mass spectrometry (eg triple quadrupole mass spectrometry), enzyme fragmentation and/or chromatography.
  • the detecting comprises nucleotide composition analysis by sequencing.
  • the method further includes comparing the detection result of step (4) with the detection result of the additional step (for example, comparing the product obtained in step (4) with the detection result of the additional step. Detect the sequence or nucleotide composition of the nucleic acid molecule, and determine the content and/or position information of N6-methyladenine in the nucleic acid molecule to be tested).
  • the nucleic acid molecule to be tested can be derived from any organism (eg eukaryotic cells, prokaryotic cells, viruses and viroids) or non-biological organisms (eg nucleic acid molecule library).
  • the nucleic acid molecule to be tested can exist in a single-stranded or double-stranded form.
  • the nucleic acid molecule to be detected is RNA, DNA or a DNA/RNA hybrid.
  • step (3) the product obtained in step (2) is deprotected under alkaline conditions; alternatively, the product obtained in step (2) is deprotected in a buffer containing phosphate .
  • step (3) the product obtained in step (2) is deprotected by heat treatment under alkaline conditions.
  • step (2) in a triethylamine acetate (TEAA) buffer containing formamide at a pH of 8-9 or in a phosphate buffer containing dimethyl sulfoxide (DMSO) at a pH of 7.1-8
  • TEAA triethylamine acetate
  • DMSO dimethyl sulfoxide
  • triethylamine acetate (TEAA) buffer containing 45-50 v/v% formamide at pH 8-9 or triethylamine acetate (TEAA) buffer containing 40-60 v/v% formamide at pH 7.1-8 The product obtained in step (2) was deprotected by heat treatment in a phosphate buffered saline solution of methyl sulfoxide (DMSO).
  • the heat treatment is:
  • step (2) a) Treating the product obtained in step (2) at a temperature of 80-95° C. for 5-10 minutes, or,
  • step (2) Treating the product obtained in step (2) at a temperature of 60-70° C. for 1.5-2.5 hours (for example, about 2 hours).
  • the product obtained in step (2) is subjected to the heat treatment described in a) in a triethylamine acetate (TEAA) buffer containing about 47.5 v/v % formamide at a pH of about 8.6 deprotection, or,
  • step (2) The product obtained in step (2) is deprotected by the heat treatment described in b) in a phosphate buffer solution containing about 50 v/v% dimethyl sulfoxide (DMSO) at a pH of about 7.4.
  • DMSO dimethyl sulfoxide
  • the nucleic acid molecule to be detected is RNA or a DNA/RNA hybrid.
  • step (3) the product obtained in step (2) is deprotected under alkaline conditions.
  • step (3) the product obtained in step (2) is deprotected once under alkaline conditions, and the product obtained from the first deprotection is deprotected twice in phosphate buffer.
  • step (3) the product obtained in step (2) is deprotected by heat treatment under alkaline conditions.
  • step (3) the product obtained in step (2) is deprotected once by heat treatment in triethylamine acetate (TEAA) buffer solution containing formamide at pH 8-9.
  • TEAA triethylamine acetate
  • step (3) the product obtained in step (2) is deprotected once by heat treatment in a triethylamine acetate (TEAA) buffer solution containing formamide at a pH of 8-9, In a phosphate buffer solution containing dimethyl sulfoxide (DMSO) at a pH of 7.1-8, the product obtained from the first deprotection is subjected to a second deprotection by heat treatment.
  • TEAA triethylamine acetate
  • DMSO dimethyl sulfoxide
  • step (3) in the triethylamine acetate (TEAA) buffer solution containing 45-50v/v% formamide at pH 8-9, the obtained in step (2) is treated by heat treatment The product was deprotected once.
  • TEAA triethylamine acetate
  • step (3) in the triethylamine acetate (TEAA) buffer solution containing 45-50v/v% formamide at pH 8-9, the obtained in step (2) is treated by heat treatment
  • the product is subjected to primary deprotection, and the product obtained from the primary deprotection is subjected to secondary deprotection by heat treatment in a phosphate buffer solution containing 40-60v/v% dimethyl sulfoxide (DMSO) at a pH of 7.1-8.
  • DMSO dimethyl sulfoxide
  • the heat treatment is:
  • step (2) a) Treating the product obtained in step (2) at a temperature of 80-95° C. for 5-10 minutes, or,
  • step (2) Treating the product obtained in step (2) at a temperature of 60-70° C. for 1.5-2.5 hours (for example, about 2 hours).
  • step (3) step (The product obtained in 2) was deprotected once.
  • step (3) step ( The product obtained in 2) is deprotected once, and the product obtained by the deprotection is subjected to the heat treatment described in b) in a phosphate buffer solution containing about 50v/v% dimethyl sulfoxide (DMSO) at a pH of about 7.4 Perform secondary deprotection.
  • DMSO dimethyl sulfoxide
  • the nucleic acid molecule to be detected is DNA or a DNA/RNA hybrid.
  • step (3) the product obtained in step (2) is deprotected under alkaline conditions.
  • step (3) the product obtained in step (2) is deprotected by heat treatment under alkaline conditions.
  • the product obtained in step (2) is deprotected by heat treatment in a triethylamine acetate (TEAA) buffer solution containing formamide at a pH of 8-9.
  • TEAA triethylamine acetate
  • the product obtained in step (2) is deprotected by heat treatment in a triethylamine acetate (TEAA) buffer solution containing 45-50 v/v% formamide at a pH of 8-9.
  • TEAA triethylamine acetate
  • the heat treatment is: treating the product obtained in step (2) at a temperature of 80-95° C. for 5-10 minutes.
  • the product obtained in step (2) is subjected to the heat treatment described in a) in a triethylamine acetate (TEAA) buffer containing about 47.5 v/v % formamide at a pH of about 8.6 off protection.
  • TEAA triethylamine acetate
  • the present application also provides a kit comprising a first carbonyl compound and a nitrite.
  • the first carbonyl compound is selected from compounds shown in formula I and any combination thereof,
  • R 1 is aldehyde group, C 1-3 alkyl (such as methyl, ethyl or n-propyl) or halogen substituted C 1-3 alkyl (such as trichloromethyl, trifluoromethyl),
  • R1 and R2 form a ring structure with the carbonyl to which they are attached, such as ninhydrin.
  • the first carbonyl compound is selected from the group consisting of: glyoxal, 2,3-butanedione, ninhydrin, 2-bromomalondialdehyde, acetoaldehyde, chloral, phenylacetaldehyde Dialdehydes and any combination thereof.
  • the first carbonyl compound is selected from the group consisting of: glyoxal, 2,3-butanedione, ninhydrin, 2-bromomalondialdehyde, acetoaldehyde, chloral, phenylacetaldehyde Dialdehydes and any combination thereof.
  • the first carbonyl compound is selected from the group consisting of glyoxal, 2-bromomalondialdehyde, 2,3-butanedione, phenylglyoxal, chloral, ninhydrin and any combination thereof.
  • the kit further comprises a second carbonyl compound.
  • the second carbonyl compound is selected from compounds shown in formula I and any combination thereof,
  • R 1 is aldehyde group, C 1-3 alkyl (such as methyl, ethyl or n-propyl) or halogen substituted C 1-3 alkyl (such as trichloromethyl, trifluoromethyl),
  • R1 and R2 form a ring structure with the carbonyl to which they are attached, such as ninhydrin.
  • the second carbonyl compound is selected from the group consisting of glyoxal, ninhydrin, and any combination thereof.
  • the nitrite is selected from sodium nitrite, potassium nitrite, and combinations of both.
  • the kit further comprises boric acid or a salt thereof (eg, potassium borate).
  • boric acid or a salt thereof eg, potassium borate
  • the kit further comprises dimethylsulfoxide (DMSO), N,N-dimethylformamide (DMF), or a combination thereof.
  • DMSO dimethylsulfoxide
  • DMF N,N-dimethylformamide
  • the kit further comprises a group selected from the group consisting of: 2-(N-morpholine)ethanesulfonic acid (MES), sodium acetate, 3-(N-morpholine)propanesulfonic acid (MOPS), piperazine At least one of oxazine-1,4-diethanesulfonic acid (PIPES), 4-hydroxyethylpiperazineethanesulfonic acid (HEPPS), and trishydroxymethylaminomethane (TRIS).
  • the kit further comprises p-toluenesulfonic acid, phosphoric acid, or a combination thereof.
  • the kit further comprises: p-toluenesulfonic acid, phosphoric acid, 2-(N-morpholine)ethanesulfonic acid (MES), sodium acetate, 3-(N-morpholine)propane At least one of sulfonic acid (MOPS), piperazine-1,4-diethanesulfonic acid (PIPES), 4-hydroxyethylpiperazineethanesulfonic acid (HEPPS), trishydroxymethylaminomethane (TRIS) .
  • the kit further comprises a material selected from the group consisting of a material for preparing a 2-(N-morpholine)ethanesulfonic acid (MES) buffer, a material for preparing a sodium acetate buffer, a material for preparing a 3- (N-Morpholine) propanesulfonic acid (MOPS) buffer substances, substances for the preparation of piperazine-1,4-diethanesulfonic acid (PIPES) buffer, substances for the preparation of 4-hydroxyethylpiperazineethanesulfonic acid (HEPPS ) buffer, substances for preparing tris (TRIS) buffer, and any combination thereof.
  • MES 2-(N-morpholine)ethanesulfonic acid
  • MOPS 3- propanesulfonic acid
  • PPES piperazine-1,4-diethanesulfonic acid
  • HPPS 4-hydroxyethylpiperazineethanesulfonic acid
  • TMS tris
  • the kit further comprises 2-(N-morpholine)ethanesulfonic acid (MES) buffer, sodium acetate buffer, 3-(N-morpholine)propanesulfonic acid (MOPS) buffer solution, piperazine-1,4-diethanesulfonic acid (PIPES) buffer, 4-hydroxyethylpiperazineethanesulfonic acid (HEPPS) buffer or trishydroxymethylaminomethane (TRIS) buffer or any combination thereof .
  • MES 2-(N-morpholine)ethanesulfonic acid
  • MOPS 3-(N-morpholine)propanesulfonic acid
  • PPES piperazine-1,4-diethanesulfonic acid
  • HPPS 4-hydroxyethylpiperazineethanesulfonic acid
  • TMS trishydroxymethylaminomethane
  • the kit further comprises a material selected from the group consisting of a material for formulating a phosphate buffer, a material for formulating a triethylamine acetate (TEAA) buffer, and combinations of both.
  • a material for formulating a phosphate buffer a material for formulating a triethylamine acetate (TEAA) buffer, and combinations of both.
  • TEAA triethylamine acetate
  • the kit further comprises phosphate buffer, triethylamine acetate (TEAA) buffer, or a combination thereof.
  • TEAA triethylamine acetate
  • the substances for preparing the buffer described in this application may exist in the form of dry powder or in the form of solution.
  • the materials for preparing the buffer are well known in the art, and can be selected according to experimental requirements according to conventional methods in the art.
  • the materials for formulating a 2-(N-morpholine)ethanesulfonic acid (MES) buffer include 2-morpholineethanesulfonic acid, and optionally sodium hydroxide.
  • the materials for preparing the sodium acetate buffer include sodium acetate, and optionally acetic acid.
  • the materials for preparing the 3-(N-morpholine)propanesulfonic acid (MOPS) buffer include 3-(N-morpholine)propanesulfonic acid, and optionally sodium hydroxide.
  • the substances for preparing the piperazine-1,4-diethanesulfonic acid (PIPES) buffer include piperazine-1,4-diethanesulfonic acid, and optionally sodium hydroxide.
  • the materials for preparing the 4-hydroxyethylpiperazineethanesulfonic acid (HEPPS) buffer include 4-hydroxyethylpiperazineethanesulfonic acid, and optionally sodium hydroxide.
  • the materials for preparing the Tris buffer (TRIS) include Tris, and optionally hydrochloric acid.
  • the triethylamine acetate (TEAA) buffer contains 45-50 v/v% formamide, and the pH is 8-9.
  • the phosphate buffer contains 40-60 v/v% dimethyl sulfoxide (DMSO), and the pH is 7.1-8.
  • DMSO dimethyl sulfoxide
  • the kit further comprises reagents for mass spectrometry detection and/or DNA sequencing.
  • the kit further comprises instructions.
  • the instructions recite the methods as described above.
  • the present application also provides the use of the kit as described above in the detection of N6-methyladenine in nucleic acid molecules.
  • adenine or “A”, as used herein, encompasses adenine itself, as well as nucleotides containing adenine (such as ribonucleotides, deoxyribose nucleotides, adenine ribonucleotide residues, adenine deoxyribonucleotide residues); the term “N6-methyladenine”, “6mA”, “m6A” or “m 6 A” covers N6- Methyladenine itself, and nucleotides containing N6-methyladenine (e.g., ribonucleotides, deoxyribonucleotides, ribonucleotide residues, deoxyribonucleotide residues); the term “secondary Xanthine” or “I”, covering hypoxanthine itself, as well as nucleotides containing hypoxanthine (such as ribonucleo
  • cytosine or “C” covers cytosine itself, as well as nucleotides containing cytosine (such as ribonucleotides, deoxyribonucleotides, ribonucleotide residues group, deoxyribonucleotide residue);
  • uracil or “U” covers uracil itself, as well as nucleotides containing uracil (such as ribonucleotides, deoxyribonucleotides, ribonucleosides acid residues, deoxyribonucleotide residues).
  • m6A As used herein, unless the context clearly dictates otherwise, the terms "m6A”, “6mA” and “m6A” as used herein all encompass N6 -methyl in DNA, RNA or DNA/RNA hybrids Adenine itself, and nucleotides containing N6-methyladenine (such as ribonucleotides, deoxyribonucleotides, ribonucleotide residues, deoxyribonucleotide residues), are interchangeable use.
  • the compounds used herein have meanings commonly understood by those skilled in the art, for example, the compounds used herein and their corresponding CAS numbers are as follows: 2-(N-morpholine) Ethylsulfonic acid (MES, CAS: 4432-31-9), sodium acetate (CAS: 127-09-3), 3-(N-morpholine) propanesulfonic acid (MOPS, CAS: 1132-61-2), Piperazine-1,4-diethanesulfonic acid (PIPES, CAS: 5625-37-6), 4-hydroxyethylpiperazineethanesulfonic acid (HEPPS, CAS: 7365-45-9) or trimethylolamino Methane (TRIS, CAS: 77-86-1), Glyoxal (CAS: 107-22-2), Methylglyoxal (CAS: 78-98-8), 2,3-Butanedione (CAS: 431- 03-8), ninhydrin (MES, CAS: 4432-31-9), sodium
  • MES 2-(N-morpholine)ethanesulfonic acid
  • MES 2-(N-morpholine)ethanesulfonic acid
  • sodium hydroxide solution as needed Adjust pH.
  • sodium acetate buffer used herein can be formulated from sodium acetate, optionally with acetic acid to adjust the pH as needed.
  • the 3-(N-morpholine)propanesulfonic acid (MOPS) buffer used herein can be formulated from 3-(N-morpholine)propanesulfonic acid, optionally with sodium hydroxide to adjust the pH as needed.
  • the piperazine-1,4-diethanesulfonic acid (PIPES) buffer used herein can be formulated from piperazine-1,4-diethanesulfonic acid, optionally with sodium hydroxide to adjust the pH as needed.
  • the 4-hydroxyethylpiperazineethanesulfonic acid (HEPPS) buffer used herein can be formulated from 4-hydroxyethylpiperazineethanesulfonic acid, optionally with sodium hydroxide to adjust the pH as needed.
  • tris (TRIS) buffers used herein can be formulated from tris, optionally with hydrochloric acid to adjust the pH as needed.
  • the conversion rate of A to I is high, so that the quantitative and/or qualitative detection of low-abundance m 6 A sites can be realized.
  • the conversion rate of false positive or non-specific conversion (such as conversion of C to U, conversion of G to X) is low, and the background signal is low when m 6 A is analyzed.
  • the method of the present application uses nitrite to catalyze the deamination of adenine into hypoxanthine, while N6-methyladenine cannot be deaminated due to its stable chemical properties. Therefore, the method of the present application realizes single-base resolution detection of the m 6 A site by analyzing the signals of adenine or hypoxanthine before and after conversion.
  • this application provides a highly active and specific m 6 A site detection technology, which can quantitatively analyze m 6 A sites in the whole genome or whole transcriptome at the single base level, solving the problem of traditional detection problems with the method.
  • Figure 1 schematically shows the reaction process of converting adenine into hypoxanthine under the catalysis of nitrite and carbonyl compounds.
  • Fig. 2 schematically shows the reaction process of guanine base and dicarbonyl compound to form guanosine adduct when guanine is protected by eg dicarbonyl compound.
  • Fig. 3 schematically shows the reaction process of guanosine adducts and boronic acid compounds to form new guanosine adducts.
  • Fig. 4 is the Sanger sequencing peak diagram of the amplified product of the DNA model sequence after being processed by reaction conditions A, B and C.
  • Fig. 5 is the Sanger sequencing peak diagram of the amplification products before and after the treatment of each reaction condition in Table 3.
  • FIG. 6 is a sanger sequencing peak diagram of the amplification products before and after treatment under various reaction conditions in Table 4.
  • FIG. 6 is a sanger sequencing peak diagram of the amplification products before and after treatment under various reaction conditions in Table 4.
  • Fig. 7 is the Sanger sequencing peak diagram of the amplification products before and after the treatment of each reaction condition in Table 5.
  • FIG. 8 is a sanger sequencing peak diagram of the amplification products before and after treatment under various reaction conditions in Table 6.
  • FIG. 9 is a sanger sequencing peak diagram of the amplification products before and after treatment under various reaction conditions in Table 7.
  • FIG. 9 is a sanger sequencing peak diagram of the amplification products before and after treatment under various reaction conditions in Table 7.
  • Fig. 10 is a sanger sequencing peak diagram of the amplification products before and after treatment under each reaction condition in Table 8.
  • Fig. 11 is a sanger sequencing peak diagram of the amplification products before and after treatment under each reaction condition in Table 9.
  • Fig. 12 is a sanger sequencing peak diagram of the amplification products before and after treatment under each reaction condition in Table 10.
  • Fig. 13 is a sanger sequencing peak diagram of the amplification products before and after treatment under each reaction condition in Table 11.
  • FIG. 14 is a sanger sequencing peak diagram of the amplification products before and after treatment under various reaction conditions in Table 12.
  • FIG. 15 is a sanger sequencing peak diagram of the amplification products before and after treatment under various reaction conditions in Table 13.
  • FIG. 16 is a sanger sequencing peak diagram of the amplification products before and after treatment under various reaction conditions in Table 14.
  • FIG. 16 is a sanger sequencing peak diagram of the amplification products before and after treatment under various reaction conditions in Table 14.
  • FIG. 17 is a sanger sequencing peak diagram of the amplification products before and after treatment under various reaction conditions in Table 15.
  • FIG. 17 is a sanger sequencing peak diagram of the amplification products before and after treatment under various reaction conditions in Table 15.
  • FIG. 18 is a sanger sequencing peak diagram of the amplification products before and after treatment under various reaction conditions in Table 16.
  • FIG. 19 is a sanger sequencing peak diagram of the amplification products before and after treatment under various reaction conditions in Table 17.
  • FIG. 19 is a sanger sequencing peak diagram of the amplification products before and after treatment under various reaction conditions in Table 17.
  • Fig. 20 is the signal conversion efficiency of A to G in the RNA sample realized by the method of the present application.
  • Figure 21 is the signal conversion efficiency of C to T in the RNA sample realized by the method of the present application.
  • Fig. 22 is the high-throughput sequencing signal on rRNA by the method of the present application.
  • (a) The signal of the classic m 6 A_1832 site on 18srRNA during sequencing.
  • the red rectangle is the m 6 A modified site, and the black framed rectangle lists the percentages of the reads covered by the site as the four bases of A, T, C, and G;
  • (b ) The signal of the classic m 6 A_4220 site on 28srRNA in sequencing. Convert the A in the reference transcriptome (version number GRCh38.p13; GCF_000001405.39) to G and use it as the comparison template.
  • the green color block is the G-to-A mutation signal generated by the sequencing signal relative to the comparison template, representing Most of this position is base A.
  • Figure 23 is the high-throughput sequencing signal of the present application on mRNA.
  • the m 6 A modification site is inside the red rectangle, and the percentages of the four bases of A, T, C, and G of the reads covered by the site are listed in the rectangle with the black border.
  • the T in the negative strand of the reference transcriptome (version number GRCh38.p13; GCF_000001405.39) was converted to C and used as an alignment template, and the m 6 A modification of the negative strand transcript mRNA appeared to be relative to the alignment template during sequencing. C to T mutation signal.
  • Figure 24 shows the distribution of m 6 A in mRNA detected by the method of the present application.
  • Fig. 25 is the m 6 A modification level detected in the model sequence and mRNA by the method of the present application.
  • Figure 26 shows the signal conversion efficiency of A to G in mRNA samples under different reaction conditions in the method of the present application.
  • "NS-16" corresponds to the exemplary conversion reaction condition C of Example 3
  • "NS-50” corresponds to the exemplary conversion reaction condition D of Example 3
  • “NS-TsOH” corresponds to the conversion reaction example of Example 3 Sexual condition E.
  • Fig. 27 shows the peak spectrum of the secondary mass spectrum of the imine intermediate produced during the conversion of adenine ribonucleotide to inosine ribonucleotide using glyoxal catalyzed by the method of the present application.
  • Table 1 Information on the sequences involved in this application is described in the table below.
  • N A, T, C or G
  • X A, T, C or G
  • Deprotection B buffer 50% dimethylsulfoxide (DMSO) in 137mM NaCl, 2.7mM KCl, 8mM Na 2 HPO 4 , 2mM KH 2 PO 4 , pH 7.4;
  • DMSO dimethylsulfoxide
  • MES buffer pH 5: Dissolve 106.6mg 2-(N-morpholine)ethanesulfonic acid (MES) in 900 ⁇ L DEPC water, adjust the pH to 5 with 1M NaOH, and add water to 1mL;
  • MES 2-(N-morpholine)ethanesulfonic acid
  • 500mM MES buffer pH 5.5: Dissolve 106.6mg MES in 900 ⁇ L DEPC water, adjust the pH to 5.5 with 1M NaOH, and add water to 1mL;
  • H 3 BO 3 buffer take 100mg H 3 BO 3 and add 1mL DEPC water, mix well and take the supernatant for later use (H 3 BO 3 saturated solution);
  • 2-bromomalondialdehyde solution 2.0M 2-bromomalonaldehyde DMSO/H 2 O (1:1) solution, take 301.9mg 2-bromomalonaldehyde, dissolve in 0.5mL dimethyl sulfoxide (DMSO) , mix well, then add 0.5mL DEPC water, mix well;
  • DMSO dimethyl sulfoxide
  • 1M NH 4 OAc (pH 5.3): Dissolve 77.0 mg of ammonium acetate in 800 ⁇ L of DEPC water, adjust the pH to 5.3 with acetic acid, and add water to 1 mL.
  • Chemical reagents such as NaNO 2 , 40% glyoxal aqueous solution (molar concentration 8.8M), phenylglyoxal, 2-bromomalondialdehyde, trichloroacetaldehyde, and anhydrous sodium acetate were purchased from Anaiji;
  • P-toluenesulfonic acid was purchased from Anaiji;
  • H 3 BO 3 was purchased from Aladdin;
  • DEPC water is purchased from living organisms
  • Solvents such as DMSO and absolute ethanol were purchased from Beijing Tongguang Fine Chemical Company;
  • the amplification enzyme used in PCR is KOD One TM PCR Master Mix, purchased from TOYOBO;
  • Nucleic acid precipitation aid (Glycogen) was purchased from ZOMANBIO;
  • Nuclease P1 (Nuclease P1 from Penicillium citrinum lyophilized powder) was purchased from SIGMA;
  • Alkaline phosphatase (Shrimp Alkaline Phosphatase) rSAP was purchased from SIGMA;
  • the DNA model sequence shown in SEQ ID NOs: 1 was synthesized using a DNA synthesizer (ABI 3400) with a synthesis specification of 1 ⁇ mol.
  • Nucleic acid monomers (A, T, C, G and 6mA) were purchased from Anhui Wuhu Huaren Technology Co., Ltd.;
  • Bacterial Genomic DNA Extraction Kit was purchased from Tiangen, Cat. No. DP302;
  • DNA uracil glycosidase was purchased from New England Biolabs (NEB) Cat. No. M0280.
  • Embodiment 1 the detection of 6mA in DNA model
  • the conversion reaction of A to I can be carried out with reference to exemplary reaction conditions A, B or C.
  • reaction solution 3 ⁇ L of 5M NaNO 2 , 1.6 ⁇ L of 500 mM MES (pH 6), 2 ⁇ L of H 3 BO 3 , 5.4 ⁇ L of DEPC water. 8 ⁇ L of the glyoxal-protected nucleic acid prepared in step 1.1 above was added thereto, with a total volume of 20 ⁇ L, and reacted at 16° C. for 24 h.
  • reaction solution 5M NaNO 2 3 ⁇ L, 500 mM MES (pH 6) 1.6 ⁇ L, H 3 BO 3 2 ⁇ L, DEPC water 3.4 ⁇ L, 2 ⁇ L 40% glyoxal aqueous solution.
  • reaction solution 3 ⁇ L of 5M NaNO 2 , 1.6 ⁇ L of 500 mM MES (pH 6), 3.4 ⁇ L of DEPC water, and 2 ⁇ L of 40% aqueous solution of glyoxal. 10 ⁇ L of the glyoxal-protected nucleic acid prepared in step 1.1 above was added thereto, and the total volume was 20 ⁇ L, and reacted at 16° C. for 8 h.
  • step 2.3 Take 50 ⁇ L of DEPC water to dissolve the DNA precipitate prepared in step 2.2 above, and place it on ice for later use.
  • the sequence of the amplified product obtained in step 3.1 is detected by Sanger sequencing, and the sequence of the amplified product and the original DNA model before transformation is compared. information, the conversion efficiency of A to I in the original model sequence and the position information of 6mA can be determined.
  • the primer sequence for sequencing is shown in SEQ ID NO: 4.
  • the above reaction solution was centrifuged at 4°C, 12000rr for 10min, and 25 ⁇ L was used for QQQ detection to measure the content of I in the sample DNA after the nitrosation reaction, so as to further determine the conversion efficiency of A to I in the DNA model sequence.
  • FIG. 4 After being treated with reaction conditions A, B or C, an exemplary peak diagram of the Sanger sequencing of the amplified product of the DNA model sequence is shown in FIG. 4 .
  • A is converted to I after deamination, and I is amplified to generate a signal of G in sequencing, and it is still recognized as A after the 6mA reaction.
  • the Sanger sequencing signal calculate the signal conversion probability of A to G in the amplified product relative to the original sequence (ie, the sequence before processing), and obtain the conversion rate of A to I.
  • the calculation method of the signal conversion probability of A to G is (taking the last A at the 3' end of the sequence as an example):
  • the information of the 6mA site in the original sequence can be determined by detecting the residue of the A signal in the original sequence.
  • G also produces non-specific deamination (to X (xanthine)), this efficiency is lower than the sequencing baseline, so non-specific conversion of G cannot be assessed in sanger sequencing.
  • reaction condition A, B or C can realize the conversion of high specificity and high activity of A to I in the DNA sequence, thereby realizing the detection of the single-base resolution level of the 6mA site in the DNA.
  • the nitrosation reaction catalytic reaction conditions corresponding to reaction conditions A, B or C and the estimated results of sequencing signal conversion of amplified products are shown in Table 2, where all concentrations are final concentrations.
  • the final concentration of glyoxal in the nitrosation reaction is obtained from the volume conversion of the glyoxal added in the nucleic acid protection process and the sum of the glyoxal added in the nitrosation reaction.
  • mode 1 refers to adding boric acid to the nitrosation reaction system
  • mode 2 refers to adding boric acid to the nucleic acid protection system after protecting the nucleic acid with glyoxal but before the nitrosation reaction.
  • mode 1 of boric acid incubation refers to adding boric acid to the nitrosation reaction system
  • mode 2 refers to adding boric acid to the nucleic acid protection system after glyoxal protects the nucleic acid and before the nitrosation reaction
  • mode 3 refers to adding boric acid to the nucleic acid protection system.
  • the Sanger sequencing method is to detect the conversion rate of A by detecting the signal conversion of A to G in the DNA amplification product after the nitrosation reaction, wherein the signal conversion of A to G is composed of the conversion of A to I and the conversion of A to I
  • the pairing of I and C occurs during DNA amplification. Therefore, the accuracy of complementary base pairing in the amplification process may have an impact on the determination of the conversion rate. Therefore, we further use mass spectrometry to directly measure the content of I in the nitrosation reaction product, by calculating the I/(A+ The conversion rate of A is calculated according to the ratio of I), and the conversion rate calculated by Sanger sequencing results is further verified.
  • mode 2 refers to adding boric acid to the nucleic acid protection system after the glyoxal protects the nucleic acid and before the nitrosation reaction.
  • This example further verifies the effect of the method of the present application on detecting 6mA at the genome level.
  • Genomic DNA of E. coli k12mg1655 strain (purchased from ATCC (American Type Culture Collection), catalog number: 700926D-5) was extracted using a bacterial genomic DNA extraction kit (purchased from Tiangen, catalog number: DP302), and non-contact ultrasonic A fragmentation instrument (Covaris M220) fragmented the extracted genomic DNA into fragments of about 550 bp, spin-dried the DNA samples, and placed them on ice for later use.
  • Step 2 Dissolve 1 ug of the fragmented DNA obtained in Step 1 in 2.0 ⁇ L of 0.1M NaOH aqueous solution at 95°C for 10 minutes, immediately place it on ice for 2 minutes, then add 0.8 ⁇ L of 500 mM MES solution to adjust the pH to 6, place it on ice, and use immediately In the next step of nucleic acid protection.
  • nucleic acid denaturation conditions choose either 2.1 formamide denaturation or 2.2 NaOH denaturation.
  • the above reaction solution was centrifuged at 4°C, 12000rr for 10 min, and 25 ⁇ L was used for QQQ detection to measure the content of I in the sample DNA after the nitrosation reaction, so as to further determine the conversion efficiency of A to I in the DNA sequence.
  • condition C can efficiently realize the conversion of A to I.
  • the A in the double-stranded DNA without denaturation pretreatment is also converted completely, showing that the method of the present application can realize the conversion of A to I in the DNA sample, and the conversion efficiency is very high.
  • RNA extraction kit (Trizol method) (purchased from Invitrogen, catalog number: 15596026) to extract total RNA from HEK293T cells (purchased from ATCC (American Culture Collection), catalog number CRL-3216).
  • RNA Fragmentation Mod ⁇ Le 94°C for 4 minutes
  • the conversion reaction of A to I can be carried out with reference to exemplary reaction conditions A, B, C, D or E.
  • RNA prepared in step 1 in 2.8 ⁇ L DEPC water, add 4 ⁇ L DMSO and 1.2 ⁇ L 40% glyoxal, mix well, 50 ° C, 1 h, to obtain the nucleic acid protected by glyoxal, and cool it on ice for later use.
  • RNA reaction solution 3 ⁇ L of 5M NaNO 2 , 1.6 ⁇ L of 500 mM MES (pH 6), 2 ⁇ L of H 3 BO 3 , and 5.4 ⁇ L of DEPC water.
  • RNA prepared in step 1 in 2.8 ⁇ L DEPC water, add 4 ⁇ L DMSO and 1.2 ⁇ L 40% glyoxal, mix well, 50 ° C, 1 h, to obtain the nucleic acid protected by glyoxal, and cool it on ice for later use.
  • RNA reaction solution 5M NaNO 2 3 ⁇ L, 500 mM MES (pH 6) 1.6 ⁇ L, H 3 BO 3 2 ⁇ L, DEPC water 3.4 ⁇ L, 2 ⁇ L 40% glyoxal aqueous solution.
  • RNA prepared in step 1 Dissolve the RNA prepared in step 1 in 2.8 ⁇ L DEPC water, add 4 ⁇ L DMSO and 1.2 ⁇ L 40% glyoxal, mix well, 50°C, 0.5h; then add H 3 BO 3 2 ⁇ L to the mixture, 50°C, and then After protection for 0.5 h, the glyoxal-protected nucleic acid was obtained and cooled on ice for later use.
  • RNA reaction solution 3 ⁇ L of 5M NaNO 2 , 1.6 ⁇ L of 500 mM MES (pH 6), 3.4 ⁇ L of DEPC water, and 2 ⁇ L of 40% glyoxal aqueous solution. 10 ⁇ L of the above-mentioned glyoxal-protected nucleic acid was added thereto, and the total volume was 20 ⁇ L, and reacted at 16° C. for 8 h.
  • RNA prepared in step 1 Dissolve the RNA prepared in step 1 in 2.8 ⁇ L DEPC water, add 4 ⁇ L DMSO and 1.2 ⁇ L 40% glyoxal, mix well, 50°C, 0.5h; then add H 3 BO 3 2 ⁇ L to the mixture, 50°C, and then After protection for 0.5 h, the glyoxal-protected nucleic acid was obtained and cooled on ice for later use.
  • RNA reaction solution 4 ⁇ L of 5M NaNO 2 , 1.6 ⁇ L of 500 mM MES (pH 6), 3.4 ⁇ L of DEPC water, and 2 ⁇ L of 40% glyoxal aqueous solution. 10 ⁇ L of the above-mentioned glyoxal-protected nucleic acid was added thereto, and the total volume was 20 ⁇ L, and reacted at 50° C. for 0.5 h.
  • RNA prepared in step 1 Dissolve the RNA prepared in step 1 in 2.8 ⁇ L DEPC water, add 4 ⁇ L DMSO and 1.2 ⁇ L 40% glyoxal, mix well, 50°C, 0.5h; then add H 3 BO 3 2 ⁇ L to the mixture, 50°C, and then After protection for 0.5 h, the glyoxal-protected nucleic acid was obtained and cooled on ice for later use.
  • RNA reaction solution 3 ⁇ L of 5M NaNO 2 p-toluenesulfonic acid aqueous solution (pH 6), 5 ⁇ L of DEPC water, and 2 ⁇ L of 40% glyoxal aqueous solution. 10 ⁇ L of the above-mentioned glyoxal-protected nucleic acid was added thereto, and the total volume was 20 ⁇ L, and reacted at 50° C. for 0.5 h.
  • Step 3.3 above is an optional step.
  • RNA Clean & Concentrator Kits to purify RNA, and Nanodrop to measure the concentration.
  • Untreated RNA and nitrosated RNA were used for library construction.
  • the RNA sample was treated with PNK (polynucleotide kinase, purchased from NEB, Cat. No.: M0201S) to dephosphorylate the 3' end of the RNA fragment, and the dephosphorylated RNA was combined with the 3' RNA linker (SEQ ID NO:5) Connect with T4RNA ligase2, truncated KQ (purchased from NEB, catalog number: M0373S), and digest the excess 3'RNA linker with 5'Deadenylase (purchased from NEB, catalog number: M0331S) and RecJf enzyme (purchased from NEB, catalog number: M0264S) Lose.
  • PNK polynucleotide kinase, purchased from NEB, Cat. No.: M0201S
  • Use SuperScript III (purchased from Invitrogen, product number: 18080093) to reverse-transcribe the RNA, use silane beads to purify the reverse-transcribed cDNA, and combine the 5' phosphorylated adapter (SEQ ID NO: 6) with the purified cDNA 3
  • the 'ends were ligated with T4RNA ligase 1, high concentration (purchased from NEB, catalog number: M0437M).
  • the cDNA linked to the 5'adaptor was further purified with silane beads.
  • the cDNA was amplified by PCR using cDNA amplification primers 1 and 2 (SEQ ID NOs: 7-8, wherein "XXXXXX" is the index sequence).
  • the PCR product was purified with 8% TBE gel, and the gel with a fragment size between 160-250 bp was excised and quenched for recovery to obtain a cDNA sequencing library. Finally, the resulting library was sequenced using Illumina Hiseq X10 PE150.
  • RNA prepared in step 3 Take 200ng of RNA prepared in step 3, add 1U of nuclease P1, 2 ⁇ L of NH 4 OAc, add water to 20 ⁇ L, and react at 42°C for 2 hours. Then add 1 ⁇ L FastAP Thermosensitive Alkaline Phosphatase, 2.5 ⁇ L 10x FastAP Buffer, rehydrate to 25 ⁇ L, and react at 37°C for 2 hours.
  • the above reaction solution was centrifuged at 12000r for 10 min at 4°C, and 25 ⁇ L was taken for QQQ detection.
  • trim_galore version 0.6.6 software to remove the sequencing adapters from the sequencing data (FASTQ file).
  • the specific command parameters are: trim_galore-q 20--stringency 1-length 35.
  • FASTX-Toolkit version 0.0.13 to remove the 10bp barcode in the obtained deduplication results. The filtered data will be used for subsequent analysis.
  • the read coverage of all sites in the reference genome is counted.
  • the median represents the signal conversion efficiency of A to G on the chromosome.
  • the signal conversion efficiency of A to G on the whole genome is the median of the conversion efficiency of all chromosomes.
  • the preliminary identification of m6A candidate sites was carried out.
  • the read coverage of all sites in the reference genome is counted. This command first converts the file from BAM format to mpileup format, and then counts the number of unconverted A in the reads covered by each site in the genome according to the mpileup format. Then, format the file obtained in the previous step into a specific fmat format file.
  • the genome coordinates, gene information, transcript information, covered read information, and untransformed information contained in the covered reads are marked for each candidate A site on the genome. The number information of A.
  • preliminary m6A candidate sites were obtained.
  • RNA with a complex secondary structure will cause the C on it to not be converted to T when treated with sodium bisulfite, resulting in false positive results.
  • we first filter the reads containing more than three unconverted A the number of such reads accounts for about 2% of the overall data, and the possible source For RNAs sensitive to NaNO 2 treatment.
  • candidate sites that occurred in NaNO2 treatment - sensitive regions were also filtered.
  • an m6A candidate site contains more than three unconverted A in more than 20% of all covered reads, this candidate site is considered to occur in a NaNO2 treatment - sensitive region, that is, the RNA transcribed in this region It is not easy to convert A to G when NaNO 2 is treated.
  • the above filtering can further ensure that the obtained m6A candidate sites are credible.
  • RNA transcribed in different regions will have different conversion efficiencies due to differences in sequence and RNA secondary structure.
  • the specific method is as follows: for each candidate m6A site, we use the transformation efficiency of different genes as the background, perform statistical tests (based on binomial distribution or Poisson distribution), and calculate the significance P value of the candidate site. After the candidate sites with a P value of less than 0.005 are selected, FDR correction is further performed. After correction, the candidate sites with a P value of less than 0.005 will be used as the m6A candidate sites for the final detection of this application.
  • the preparation method of the Spike in 2 pattern sequence containing 20% m 6 A modification ratio is as follows: the Spike in 2 pattern sequence (SEQ ID NO: 10) containing m 6 A modification and the Spike in 2' without m 6 A modification
  • the pattern sequence (SEQ ID NO:11) is mixed according to the molar ratio of 2:8;
  • the preparation method of the Spike in 3 pattern sequence containing 50% m 6 A modification ratio is as follows: the Spike in 3 pattern sequence containing m 6 A modification ( SEQ ID NO:12) mixed with Spike in 3' pattern sequence (SEQ ID NO:13) without m 6 A modification according to the molar ratio of 5:5; Spike in 4 pattern sequence containing 80% m 6 A modification ratio
  • the preparation method is: mix the Spike in 4 pattern sequence (SEQ ID NO:14) containing m 6 A modification and the Spike in 4' pattern sequence (SEQ ID NO:15) without m 6 A modification according to the ratio of 8:2 Molar ratio mixed.
  • the analysis of the sequencing data based on the application shows that the conversion efficiency of C to U in different types of RNA, such as 18s, 28s rRNA, mRNA, and the incorporated pattern sequence (sequencing results show a signal of C to T, and the analysis is also Statistically, the conversion ratio of C to T) is maintained at about 4%, and the results are shown in Figure 21.
  • This result confirms that with the detection method of the present application, the non-specific conversion rate of C is low and stable, so that the interference to m 6 A detection is low.
  • the method of the present application can specifically detect the m 6 A site on the RNA
  • the method of the present application can specifically detect the m 6 A site on the mRNA
  • the detection results are shown in FIG. 23 , and the results show that obvious m 6 A signals can be detected at the m 6 A sites of the above-mentioned known mRNAs.
  • the method of the present application can detect the distribution of m 6 A in mRNA
  • the method of the present application can accurately detect the modification level of m 6 A
  • the calculated m 6 A modification level of the pattern sequence Spike in 3 is slightly lower than the expected level, which may be due to the fact that Spike in 3 has another similar sequence on the converted triple base genome, which affects the sequencing reads Reposting process, which in turn leads to a low modification ratio of the m 6 A modification site in Spike in 3 in the later calculation.
  • Figure 25b shows that the present application can detect more than 100,000 m 6 A sites in the two repetitions of HEK293T cell mRNA, and most of the sites can be detected in the two repetitions (82%), indicating that The method of the present application has high sensitivity, stability and good reliability.
  • Figure 25c shows the comparison of the modification levels in the common m 6 A modification sites detected by the method of the present application in two repetitions of HEK293T cell mRNA, as shown in Figure 25c, in two technical repetitions, m 6 A
  • the m 6 A modification level corresponding to the site basically surrounds the diagonal line, with a high consistency and a correlation coefficient of 0.96, which further demonstrates that the method of the present application can detect the m 6 A modification level stably and accurately.
  • pattern sequences (Spike in 2, Spike in 4, Spike in 5, Spike in 6) containing 5%, 20%, 50%, 80% m 6 A modification and Spike in 1 without m 6 A modification
  • the pattern sequence (SEQ ID NO:9) and the Spike in 7 pattern sequence (SEQ ID NO:20) containing m 6 A modification were incorporated into the fragmented whole-cell RNA.
  • the preparation method of the Spike in 2 pattern sequence containing 5% m 6 A modification ratio is as follows: the Spike in 2 pattern sequence (SEQ ID NO: 10) containing m 6 A modification and the Spike in 2' without m 6 A modification
  • the pattern sequence (SEQ ID NO: 11) is mixed according to the molar ratio of 0.5:9.5
  • the preparation method of the Spike in 4 pattern sequence containing 20% m 6 A modification ratio is as follows: the Spike in 4 pattern sequence containing m 6 A modification ( SEQ ID NO:14) mixed with Spike in 4' pattern sequence (SEQ ID NO:15) without m 6 A modification at a molar ratio of 2:8; Spike in 5 pattern sequence containing 50% m 6 A modification ratio
  • the preparation method is: mix the Spike in 5 pattern sequence (SEQ ID NO:16) containing m 6 A modification and the Spike in 5' pattern sequence (SEQ ID NO:17) without m 6 A modification according to the ratio of 5:5 Molar ratio mixing; the preparation method
  • FIG. 26 a The processed results are shown in FIG. 26 a , and it can be seen from the figure that the signal conversion efficiency of A to G is about 98% under different conditions.
  • Figure 26b shows that the m 6 A modification level of the pattern sequence under different conditions is basically consistent with the expected level, indicating that the method of the present application can accurately detect the m 6 A modification level in RNA under different conditions.
  • reaction conditions corresponding to conditions A, B, C, D, E and 1-1 and the conversion rate (I/(I+A)) of A to I are shown in Table 21.
  • condition 1-1 and condition C lies in the concentration and pH value of the MES buffer in the nitrosation system (step 2.2) (as shown in Table 21), and the rest are the same.
  • condition D and condition C is that the concentration of sodium nitrite used, reaction temperature and time are different.
  • Condition D is 1M NaNO 2 , react at 50°C for 0.5h, and the rest are the same.
  • condition E and condition C lies in the type of buffer used in the reaction, the reaction temperature and the time.
  • the p-toluenesulfonic acid aqueous solution is used as the buffer and reacted at 50°C for 0.5h, and the rest are the same.
  • mode 1 refers to adding boric acid to the nitrosation reaction system
  • mode 2 refers to adding boric acid to the nucleic acid protection system after protecting the nucleic acid with glyoxal but before the nitrosation reaction.
  • boric acid incubation “Mode 1” refers to adding boric acid to the nitrosation reaction system; “Mode 2” refers to adding boric acid to the nucleic acid protection system after glyoxal protects the nucleic acid and before the nitrosation reaction; “Mode 3” refers to adding After the incubation of glyoxal and boronic acid, the nucleic acid is protected with the product of the incubation of glyoxal and boronic acid.
  • the above-mentioned acidic solutions all can promote the conversion of A to I (for example, the conversion rate is all above 94%).

Landscapes

  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Immunology (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Physics & Mathematics (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

本申请提供了一种检测核酸分子中N6-甲基腺嘌呤的方法和试剂盒。

Description

检测核酸分子中N6-甲基腺嘌呤的方法及试剂盒
本申请是以CN申请号为202110655085.7,申请日为2021年6月11日的申请为基础,并主张其优先权,该CN申请的公开内容在此作为整体引入本申请中。
技术领域
本申请涉及分子生物学领域,特别是核酸检测和分析领域。特别地,本申请提供了一种检测核酸分子中N6-甲基腺嘌呤的方法。此外,本申请还提供了一种试剂盒,所述试剂盒可用于实施本申请的方法。
背景技术
高通量测序技术的发展,对研究表观遗传学修饰的生物学功能具有重要意义。目前,对于N6-甲基腺嘌呤(N6-methyladenosine,m 6A)修饰,已经开发了多种测序方法。其中,MeRIP-seq(即m 6A-seq)是目前应用最广泛的m 6A全转录组水平的测序方法,该方法利用m 6A特异性的抗体富集含有m 6A的RNA片段,继而进行测序分析,分辨率在200nt左右。miCLIP结合抗体交联和免疫沉淀,通过突变或截断的信息,来精确定位m 6A残基,达到单碱基水平测序。后续发展的m 6ACE-seq,利用抗体交联和外切酶消化,也可以达到单碱基水平测序。以上方法均依赖m 6A的抗体,重复性差,需要的样品量较大,并且很难做到定量检测m 6A修饰。m 6A-REF-seq和MAZTER-seq两种方法,利用RNA内切酶,不依赖抗体,实现了可定量的m 6A测序,但这两种方法仅能检测部分m 6A修饰,无法实现全部m 6A修饰的分析。
近期,Chem.Sci.,2021,12,606公开了一种使用亚硝酸钠的醋酸水溶液实现核酸(DNA/RNA)中腺嘌呤(A)到次黄嘌呤(I)的转变的方法,I在扩增或者反转录过程中被识别为鸟嘌呤(G),而甲基腺嘌呤依然被读作A,通过比较A-G的转化,来判断核酸中是否含有m 6A修饰。但是,该方法总体反应效率不超过20%,无法对低丰度位点实现定量分析,且存在大量胞嘧啶(C)向尿嘧啶(U)的转化,并且,因为使用醋酸水溶液,还存在核酸降解的问题。
NOseq(Nucleic Acids Res.2021 Feb 26;49(4):e23)同样使用亚硝酸化合物处理A 使其脱氨变成I而被读作G,而m 6A反应为NO-m 6A而无法脱氨仍然被识别为A,从而区分m 6A位点。该方法中A to I的转化效率在10%-50%之间,对于常规A的位点仍有大量A的残留,因此在对m 6A分析中存在大量背景,无法分析低丰度甲基化位点。同时该方法只能对具体序列某些确定的甲基化位点进行鉴定,无法检测全转录组中的m 6A修饰。并且,该方法仍存在大量假阳性或非特异性转化,G会被脱氨转化为黄嘌呤(X),而X会导致转录终止,同时也会发生错配读作A产生假阳性,同时C也会脱氨转化为U,为后续分析带来困难。
综上所述,目前已有的m 6A测序方法,均无法实现全转录组水平上的m 6A定量分析,因此,亟需开发一种全基因组或全转录组水平上N6-甲基腺嘌呤的可定量、高灵敏度的检测方法。
发明内容
本申请的发明人基于深入的研究,开发了一种新的检测核酸分子中N6-甲基腺嘌呤的方法。本申请的方法利用亚硝酸盐和羰基化合物催化腺嘌呤脱氨转化为次黄嘌呤,而N6-甲基腺嘌呤因其化学性质稳定无法脱氨。因此,本申请的方法通过分析转化前后腺嘌呤或次黄嘌呤的信号,实现m 6A修饰水平或m 6A位点的检测。该方法灵敏度高、定量准确,且能在单碱基水平实现全基因组或全转录组上N6-甲基腺嘌呤的检测。
因此,在一方面,本申请提供了一种检测核酸分子中N6-甲基腺嘌呤的方法,其包括以下步骤:
(1)对待测核酸分子中鸟嘌呤的氨基进行保护,得到被保护的核酸分子;
(2)在羰基化合物的存在下,使被保护的核酸分子与亚硝酸盐反应,将待测核酸分子中的腺嘌呤转化为次黄嘌呤;
(3)任选地,对步骤(2)中所得产物进行脱保护;
(4)对前一步骤所得产物进行检测。
易于理解,所述方法步骤(1)中,可用任意方法对所述核酸分子中除腺嘌呤以外的任意碱基进行保护,其中,所述保护可阻止或阻碍被保护碱基的脱氨反应。在某些实施方案中,所述保护是可逆的或不可逆的。
在某些实施方案中,所述保护是不可逆的,并且,所述被保护的碱基能够进行碱 基互补配对。
本申请的方法利用亚硝酸盐使待测核酸分子中腺嘌呤脱氨转化为次黄嘌呤,而N6-甲基腺嘌呤因其化学性质稳定无法脱氨,从而可通过分析转化前后腺嘌呤或次黄嘌呤的信号,实现m 6A修饰水平或m 6A位点的检测。
因此,可以理解,在某些实施方案中,所述方法可通过直接检测待测核酸经脱氨催化反应后产物中腺嘌呤或次黄嘌呤的信号,来实现m 6A修饰水平的检测。在某些实施方案中,可通过例如质谱、酶片段化和/或色谱的方法检测待测核酸中经脱氨催化反应后产物中腺嘌呤或次黄嘌呤的信号,来实现m 6A修饰水平的检测。
由于,次黄嘌呤在逆转录或DNA复制过程中与胞嘧啶互补配对,从而被读作鸟嘌呤;N6-甲基腺嘌呤仍保持与胸腺嘧啶或尿嘧啶互补配对,仍被读作腺嘌呤。因此,在某些实施方案中,所述方法可通过检测待测核酸经脱氨催化反应后产物的扩增产物或逆转录产物的序列信息,来实现m 6A修饰水平和m 6A位点的检测。在某些实施方案中,所述方法可通过例如测序或杂交的方法检测待测核酸经脱氨催化反应后产物的扩增产物或逆转录产物的序列信息,来实现m 6A修饰水平和m 6A位点的检测。
不受理论限制,发明人推测本发明的方法中利用亚硝酸盐使待测核酸分子中腺嘌呤脱氨转化为次黄嘌呤,在腺嘌呤(A)向次黄嘌呤(I)转化的过程中,第一羰基化合物(例如乙二醛)起到催化作用,能够有效提高核酸分子中A to I的转化效率。以乙二醛为例,示例性的催化反应机理如图1所示,在反应过程中发明人检测到了乙二醛与腺嘌呤的氨基结合形成的亚胺中间体,其二级质谱峰图如图27所示。
由于鸟嘌呤在亚硝化反应中脱氨会转化为黄嘌呤(X),而黄嘌呤在一定概率会导致核酸扩增或逆转录过程终止,或者,黄嘌呤在核酸扩增或逆转录过程与胸腺嘧啶互补配对,从而被读作腺嘌呤,干扰m 6A修饰水平和m 6A位点的检测。因此,在某些实施方案中,对所述待测核酸分子中鸟嘌呤的氨基进行保护是特别有利的。
在某些实施方案中,所述方法步骤(1)中,使用第二羰基化合物对待测核酸分子中鸟嘌呤的氨基进行保护。
使用第二羰基化合物(以乙二醛为例)对待测核酸分子中鸟嘌呤的氨基进行保护,其反应机理如图2所示,第二羰基化合物例如乙二醛能与鸟嘌呤N 1,N 2位反应形成N 1,N 2-二羟基鸟苷,从而降低鸟嘌呤的脱氨效率。
在某些实施方案中,所述羰基化合物源自:
(i)步骤(1)中添加的第二羰基化合物;
(ii)步骤(2)中添加的第一羰基化合物;或者,
(iii)(i)和(ii)的组合。
在某些实施方案中,所述第二羰基化合物与第一羰基化合物相同或不相同。在某些实施方案中,所述第二羰基化合物与第一羰基化合物相同。
在某些实施方案中,所述步骤(2)不包含添加第一羰基化合物的步骤,且所述羰基化合物源自:步骤(1)中添加的第二羰基化合物;或者,步骤(2)包含添加第一羰基化合物的步骤,且,所述第一羰基化合物与所述第二羰基化合物相同或者不同。
在某些实施方案中,步骤(3)中,任选地对所述产物中的鸟嘌呤进行脱保护。
在某些实施方案中,所述第一羰基化合物与第二羰基化合物各自独立地选自式I所示化合物及其任意组合,
Figure PCTCN2021130282-appb-000001
其中R 1为醛基、C 1-3烷基(例如甲基、乙基或正丙基)或卤素取代的C 1-3烷基(例如三氯甲基、三氟甲基),R 2为H、C 1-3烷基(例如甲基、乙基或正丙基)、苯基或HC(=O)-CH(Br)-;或者,
R 1和R 2与它们所连接的羰基形成环状结构,例如茚三酮。
在某些实施方案中,所述第一羰基化合物与第二羰基化合物各自独立地选自:乙二醛、2,3-丁二酮、茚三酮、2-溴丙二醛、丙酮醛、三氯乙醛、苯基乙二醛及其任意组合。
在某些实施方案中,所述第一羰基化合物选自:乙二醛、2,3-丁二酮、茚三酮、2-溴丙二醛、丙酮醛、三氯乙醛、苯基乙二醛及其任意组合。
在某些实施方案中,所述第一羰基化合物为选自乙二醛、2-溴丙二醛、2,3-丁二酮、苯基乙二醛、三氯乙醛、茚三酮及其任意组合。
在某些实施方案中,所述第二羰基化合物选自:乙二醛、茚三酮及其任意组合。
在某些实施方案中,所述方法步骤(1)中,使待测核酸分子与第二羰基化合物在 第一溶剂中接触。
在某些实施方案中,所述第一溶剂为水。
在某些实施方案中,步骤(1)中,使待测核酸分子与第二羰基化合物在第一溶剂中,并且在二甲基亚砜(DMSO)或N,N-二甲基甲酰胺(DMF)的存在下接触。
在某些实施方案中,步骤(1)中,使待测核酸分子与第二羰基化合物在第一溶剂中,并且在二甲基亚砜(DMSO)和硼酸或其盐(例如硼酸钾)的存在下接触。
在某些实施方案中,步骤(1)中,使待测核酸分子与第二羰基化合物在第一溶剂中,并且在N,N-二甲基甲酰胺(DMF)和硼酸或其盐(例如硼酸钾)的存在下接触。
在某些实施方案中,步骤(1)中,所述二甲基亚砜(DMSO)的终浓度为20v/v%~90v/v%,例如约30v/v%、约40v/v%、约50v/v%、约60v/v%、约70v/v%、约80v/v%。
在某些实施方案中,步骤(1)中,所述硼酸或其盐(例如硼酸钾)的终浓度为5mM~150mM(例如5mM~10mM,10mM~50mM,50mM~80mM,80mM~100mM)。
在某些实施方案中,步骤(1)中,所述第二羰基化合物的终浓度为大于或等于20mM,优选为20mM~3M(例如20mM~100mM,100mM~500mM,500mM~1000mM,1M~1.5M,1.5~2M,2M~2.6M,例如约0.8M、1.3M、1.8M)。
在某些实施方案中,所述方法步骤(1)中,使待测核酸分子与第二羰基化合物在16-60℃(例如约20℃、约25℃、约30℃、约35℃,约40℃、约45℃、约50℃、约55℃)的温度条件下接触。
在某些实施方案中,所述方法步骤(1)中,使待测核酸分子与第二羰基化合物在16-60℃(例如约20℃、约25℃、约30℃、约35℃,约40℃、约45℃、约50℃、约55℃)的温度条件下接触15-60min(例如约20min、约30min、约40min、约50min)。
在某些实施方案中,所述方法还包括:在步骤(1)之前,对待测核酸分子进行预处理的步骤。
在某些实施方案中,所述预处理包括对待测核酸分子进行纯化、片段化、变性、或其任意组合。
在某些实施方案中,所述预处理包括使用甲酰胺水溶液或碱溶液对待测核酸分子进行变性。
在某些实施方案中,所述方法步骤(2)中,在羰基化合物的存在下,使被保护的 核酸分子与亚硝酸盐在第二溶剂中反应。
在某些实施方案中,所述第二溶剂为水、2-(N-吗啉)乙磺酸(MES)缓冲液、醋酸钠缓冲液、3-(N-吗啡啉)丙磺酸(MOPS)缓冲液、哌嗪-1,4-二乙磺酸(PIPES)缓冲液、4-羟乙基哌嗪乙磺酸(HEPPS)缓冲液或三羟甲基氨基甲烷(TRIS)缓冲液或其任意组合。所述第二溶剂还可以为对甲苯磺酸水溶液或磷酸水溶液。
在某些实施方案中,所述第二溶剂为水、对甲苯磺酸水溶液、磷酸水溶液、2-(N-吗啉)乙磺酸(MES)缓冲液、醋酸钠缓冲液、3-(N-吗啡啉)丙磺酸(MOPS)缓冲液、哌嗪-1,4-二乙磺酸(PIPES)缓冲液、4-羟乙基哌嗪乙磺酸(HEPPS)缓冲液或三羟甲基氨基甲烷(TRIS)缓冲液或其任意组合。
在某些实施方案中,所述第二溶剂为2-(N-吗啉)乙磺酸(MES)缓冲液或醋酸钠缓冲液。
在某些实施方案中,所述第二溶剂为对甲苯磺酸水溶液、2-(N-吗啉)乙磺酸(MES)缓冲液或醋酸钠缓冲液。
在某些实施方案中,所述第二溶剂为对甲苯磺酸水溶液、磷酸水溶液、2-(N-吗啉)乙磺酸(MES)缓冲液或醋酸钠缓冲液。
在某些实施方案中,所述第二溶剂为pH为4.5~6.5(例如5~6)的对甲苯磺酸水溶液。
在某些实施方案中,所述第二溶剂为pH为4.5~6.5(例如5~6)的磷酸水溶液。
在某些实施方案中,所述第二溶剂为2-(N-吗啉)乙磺酸(MES)缓冲液,其中2-(N-吗啉)乙磺酸(MES)的终浓度为20mM~750mM(例如约40mM、约80mM、约100mM、约150mM、约200mM、约250mM、约500mM)。
在某些实施方案中,所述第二溶剂为醋酸钠缓冲液,其中所述醋酸钠的终浓度为300mM~400mM。
在某些实施方案中,步骤(2)中,在羰基化合物的存在下,使被保护的核酸分子与亚硝酸盐在第二溶剂中,并且在硼酸或其盐(例如硼酸钾)的存在下反应。
在某些实施方案中,所述第二溶剂中羰基化合物总摩尔浓度(终浓度)为大于或等于50mM,例如大于或等于250mM,(例如50-2000mM,50-200mM,200-500mM,500-800mM,800-1200mM,1200-1400mM,1400-1800mM,1800-2000mM,例如105 mM)。
在某些实施方案中,所述第二溶剂含有终浓度为50-2000mM(例如50-200mM,200-500mM,500-800mM,800-1200mM,1200-1400mM,1400-1800mM,1800-2000mM,例如105mM)的乙二醛和终浓度为50-1500mM(例如50-60mM,60-100mM,100-300mM,300-600mM,600-800mM,800-1200mM,1200-1500mM)的丙酮醛。
在某些实施方案中,所述第二溶剂含有终浓度为50-2000mM(例如50-200mM,200-500mM,500-800mM,800-1200mM,1200-1400mM,1400-1800mM,1800-2000mM,例如105mM)的乙二醛和终浓度为50-2500mM(例如50-100mM,100-200mM,200-500mM,500-800mM,800-1200mM,1200-1500mM,1500-1800mM,1800-2200mM,2200-2500mM)的2,3-丁二酮。
在某些实施方案中,所述第二溶剂含有终浓度为50-2000mM(例如50-200mM,200-500mM,500-800mM,800-1200mM,1200-1400mM,1400-1800mM,1800-2000mM,例如105mM)的乙二醛和终浓度为20-1500mM(例如20-100mM,100-200mM,200-500mM,500-800mM,800-1200mM,1200-1500mM,约60mM)的茚三酮。
在某些实施方案中,所述第二溶剂含有终浓度为50-2000mM(例如50-200mM,200-500mM,500-800mM,800-1200mM,1200-1400mM,1400-1800mM,1800-2000mM,例如105mM)的乙二醛和终浓度为50-1500mM(例如50-100mM,100-200mM,200-400mM,400-500mM,500-800mM,800-1200mM,1200-1500mM)的2-溴丙二醛。
在某些实施方案中,所述第二溶剂含有终浓度为50-2000mM(例如50-200mM,200-500mM,500-800mM,800-1200mM,1200-1400mM,1400-1800mM,1800-2000mM,例如105mM)的乙二醛和终浓度为50-1500mM(例如50-100mM,100-400mM,400-500mM,500-800mM,800-1200mM,1200-1500mM)的三氯乙醛。
在某些实施方案中,所述第二溶剂含有终浓度为50-2000mM(例如50-200mM,200-500mM,500-800mM,800-1200mM,1200-1400mM,1400-1800mM,1800-2000mM,例如105mM)的乙二醛和终浓度为50-1500mM(例如50-100mM,100-400mM,400-500mM,500-800mM,800-1200mM,1200-1500mM)的苯基乙二醛。
在某些实施方案中,所述第二溶剂含有终浓度大于或等于50mM,例如大于或等 于250mM(例如50-2000mM,50-200mM,200-500mM,500-800mM,800-1200mM,1200-1400mM,1400-1800mM,1800-2000mM,例如105mM)的乙二醛。
在某些实施方案中,步骤(2)中,所述硼酸或其盐(例如硼酸钾)的终浓度为5mM~150mM(例如5mM~10mM,10mM~50mM,50mM~80mM,80mM~100mM)。
在某些实施方案中,所述亚硝酸盐选自亚硝酸钠、亚硝酸钾以及二者的组合。
在某些实施方案中,所述亚硝酸盐的终浓度为0.5M~1.5M,例如0.5M~0.8M,0.8~1.25M。
在某些实施方案中,所述方法步骤(2)中,在第一羰基化合物的存在下,使被保护的核酸分子与亚硝酸盐在第二溶剂中,并且在硼酸或其盐(例如硼酸钾)的存在下反应。
核酸分子中鸟嘌呤的氨基被第二羰基化合物例如乙二醛保护后形成的N 1,N 2-二羟基鸟苷在步骤(2)中可能会脱除保护基团。如图3所示,在步骤(2)中加入硼酸或其盐(例如H 3BO 3/K 3BO 3)可以进一步保护N 1,N 2-二羟基鸟苷,使其结构更加稳定,使保护基团不易被脱除。
在某些实施方案中,所述硼酸或其盐(例如硼酸钾)源自:
(i)步骤(1)中添加的硼酸或其盐(例如硼酸钾);
(ii)步骤(2)中添加的硼酸或其盐(例如硼酸钾);
(iii)步骤(1)之后,步骤(2)之间添加的硼酸或其盐(例如硼酸钾);或者,
(iv)(i)至(iii)的任意组合。
在某些实施方案中,所述步骤(2)不包含添加硼酸或其盐(例如硼酸钾)的步骤;或者,步骤(2)包含添加硼酸或其盐(例如硼酸钾)的步骤,且,步骤(2)添加的硼酸或其盐(例如硼酸钾)与步骤(1)添加的硼酸或其盐(例如硼酸钾)相同或者不同。
易于理解,第二溶剂中硼酸或其盐的添加方式不受反应步骤的限制,例如,可在步骤(1)之前添加硼酸或其盐(例如,将硼酸或其盐与第二羰基化合物孵育后,再进行步骤(1)的核酸保护过程);例如,可在步骤(1)中直接添加硼酸或其盐;例如,可在步骤(2)中直接添加硼酸或其盐;例如,可在步骤(1)之后,步骤(2)之前,在被保护的核酸中加入硼酸或其盐。
在某些实施方案中,所述方法步骤(2)中,使被保护的核酸分子与亚硝酸盐在12-60℃(例如16-60℃、12-24℃、24-40℃、40-60℃、约16℃、约20℃、约25℃、约30℃、约35℃,约40℃、约45℃、约50℃、约55℃)的温度条件下反应。
在某些实施方案中,步骤(2)中,使被保护的核酸分子与亚硝酸盐在12-60℃(例如16-60℃、12-24℃、24-40℃、40-60℃、约16℃、约20℃、约25℃、约30℃、约35℃,约40℃、约45℃、约50℃、约55℃)的温度条件下反应10min~24h(例如30min~24h、10min~20min、20min~1h、1h~5h、5h~10h、10h~24h、约20min、约30min、约40min、约50min、约8h)。
在某些实施方案中,步骤(2)中,使被保护的核酸分子与亚硝酸盐在12-24℃(例如约16℃)的温度条件下反应5h~10h(例如约8h)。
在某些实施方案中,步骤(2)中,使被保护的核酸分子与亚硝酸盐在40-60℃(例如约50℃)的温度条件下反应20min~1h(例如约30min)。
在某些实施方案中,所述方法还包括:在步骤(4)之前,对前一步骤中所得产物进行纯化、逆转录和/或扩增的步骤。
在某些实施方案中,所述方法还包括:对待测核酸分子进行检测的额外步骤。
在某些实施方案中,在步骤(4)中,所述检测包括通过测序或杂交、质谱(例如三重串联四极杆质谱)、酶片段化和/或色谱进行核苷酸组成分析。
在某些实施方案中,所述检测包括通过测序进行核苷酸组成分析。
在某些实施方案中,所述方法还包括对比步骤(4)的检测结果与所述额外步骤的检测结果(例如,对比步骤(4)中所述所得产物和所述额外步骤中所述待测核酸分子的序列或核苷酸组成,确定待测核酸分子中N6-甲基腺嘌呤的含量和/或位置信息)。
在某些实施方案中,所述待测核酸分子可来源于任何生物体(例如真核细胞,原核细胞,病毒和类病毒)或非生物体(例如核酸分子文库)。所述待测核酸分子可以以单链或双链形式存在。
在某些实施方案中,所述方法中,所述待测核酸分子为RNA、DNA或DNA/RNA杂合体。
在某些实施方案中,步骤(3)中,在碱性条件下对步骤(2)中所得产物进行脱保护;或者,在含有磷酸盐缓冲液中对步骤(2)中所得产物进行脱保护。
在某些实施方案中,步骤(3)中,在碱性条件下,通过热处理对步骤(2)中所得产物进行脱保护。
在某些实施方案中,在pH为8~9的含有甲酰胺的醋酸三乙胺(TEAA)缓冲液中或者在pH为7.1~8的含有二甲基亚砜(DMSO)的磷酸盐缓冲液中,通过热处理对步骤(2)中所得产物进行脱保护。
在某些实施方案中,在pH为8~9的含有45~50v/v%甲酰胺的醋酸三乙胺(TEAA)缓冲液中或者在pH为7.1~8的含有40~60v/v%二甲基亚砜(DMSO)的磷酸盐缓冲液中,通过热处理对步骤(2)中所得产物进行脱保护。
在某些实施方案中,所述热处理是:
a)将步骤(2)中所得产物在80~95℃的温度条件下处理5~10min,或者,
b)将步骤(2)中所得产物在60~70℃的温度条件下处理1.5~2.5h(例如约2h)。
在某些实施方案中,在pH为约8.6的含有约47.5v/v%甲酰胺的醋酸三乙胺(TEAA)缓冲液中,通过a)中所述热处理对步骤(2)中所得产物进行脱保护,或者,
在pH为约7.4的含有约50v/v%二甲基亚砜(DMSO)的磷酸盐缓冲液中,通过b)中所述热处理对步骤(2)中所得产物进行脱保护。
在某些实施方案中,所述方法中,所述待测核酸分子为RNA或DNA/RNA杂合体。
在某些实施方案中,步骤(3)中,在碱性条件下对步骤(2)中所得产物进行脱保护。
在某些实施方案中,步骤(3)中,在碱性条件下对步骤(2)中所得产物进行一次脱保护,在磷酸盐缓冲液中对一次脱保护所得产物进行二次脱保护。
在某些实施方案中,步骤(3)中,在碱性条件下,通过热处理对步骤(2)中所得产物进行脱保护。
在某些实施方案中,步骤(3)中,在pH为8~9的含有甲酰胺的醋酸三乙胺(TEAA)缓冲液中,通过热处理对步骤(2)中所得产物进行一次脱保护。
在某些实施方案中,步骤(3)中,在pH为8~9的含有甲酰胺的醋酸三乙胺(TEAA)缓冲液中,通过热处理对步骤(2)中所得产物进行一次脱保护,在pH为7.1~8的含有二甲基亚砜(DMSO)的磷酸盐缓冲液中,通过热处理对一次脱保护所得产物进行 二次脱保护。
在某些实施方案中,步骤(3)中,在pH为8~9的含有45~50v/v%甲酰胺的醋酸三乙胺(TEAA)缓冲液中,通过热处理对步骤(2)中所得产物进行一次脱保护。
在某些实施方案中,步骤(3)中,在pH为8~9的含有45~50v/v%甲酰胺的醋酸三乙胺(TEAA)缓冲液中,通过热处理对步骤(2)中所得产物进行一次脱保护,在pH为7.1~8的含有40~60v/v%二甲基亚砜(DMSO)的磷酸盐缓冲液中,通过热处理对一次脱保护所得产物进行二次脱保护。
在某些实施方案中,所述热处理是:
a)将步骤(2)中所得产物在80~95℃的温度条件下处理5~10min,或者,
b)将步骤(2)中所得产物在60~70℃的温度条件下处理1.5~2.5h(例如约2h)。
在某些实施方案中,步骤(3)中,在pH为约8.6的含有约47.5v/v%甲酰胺的醋酸三乙胺(TEAA)缓冲液中,通过a)中所述热处理对步骤(2)中所得产物进行一次脱保护。
在某些实施方案中,步骤(3)中,在pH为约8.6的含有约47.5v/v%甲酰胺的醋酸三乙胺(TEAA)缓冲液中,通过a)中所述热处理对步骤(2)中所得产物进行一次脱保护,在pH为约7.4的含有约50v/v%二甲基亚砜(DMSO)的磷酸盐缓冲液中,通过b)中所述热处理对一次脱保护所得产物进行二次脱保护。
在某些实施方案中,所述方法中,所述待测核酸分子为DNA或DNA/RNA杂合体。
在某些实施方案中,步骤(3)中,在碱性条件下对步骤(2)中所得产物进行脱保护。
在某些实施方案中,步骤(3)中,在碱性条件下,通过热处理对步骤(2)中所得产物进行脱保护。
在某些实施方案中,在pH为8~9的含有甲酰胺的醋酸三乙胺(TEAA)缓冲液中,通过热处理对步骤(2)中所得产物进行脱保护。
在某些实施方案中,在pH为8~9的含有45~50v/v%甲酰胺的醋酸三乙胺(TEAA)缓冲液中,通过热处理对步骤(2)中所得产物进行脱保护。
在某些实施方案中,所述热处理是:将步骤(2)中所得产物在80~95℃的温度条件下处理5~10min。
在某些实施方案中,在pH为约8.6的含有约47.5v/v%甲酰胺的醋酸三乙胺(TEAA)缓冲液中,通过a)中所述热处理对步骤(2)中所得产物进行脱保护。
在一方面,本申请还提供了试剂盒,其包含第一羰基化合物和亚硝酸盐。
在某些实施方案中,所述第一羰基化合物选自式I所示化合物及其任意组合,
Figure PCTCN2021130282-appb-000002
其中R 1为醛基、C 1-3烷基(例如甲基、乙基或正丙基)或卤素取代的C 1-3烷基(例如三氯甲基、三氟甲基),R 2为H、C 1-3烷基(例如甲基、乙基或正丙基)、苯基或HC(=O)-CH(Br)-;或者,
R 1和R 2与它们所连接的羰基形成环状结构,例如茚三酮。
在某些实施方案中,所述第一羰基化合物选自:乙二醛、2,3-丁二酮、茚三酮、2-溴丙二醛、丙酮醛、三氯乙醛、苯基乙二醛及其任意组合。
在某些实施方案中,所述第一羰基化合物选自:乙二醛、2,3-丁二酮、茚三酮、2-溴丙二醛、丙酮醛、三氯乙醛、苯基乙二醛及其任意组合。
在某些实施方案中,所述第一羰基化合物为选自乙二醛、2-溴丙二醛、2,3-丁二酮、苯基乙二醛、三氯乙醛、茚三酮及其任意组合。
在某些实施方案中,所述试剂盒还包含第二羰基化合物。
在某些实施方案中,所述第二羰基化合物选自式I所示化合物及其任意组合,
Figure PCTCN2021130282-appb-000003
其中R 1为醛基、C 1-3烷基(例如甲基、乙基或正丙基)或卤素取代的C 1-3烷基(例如三氯甲基、三氟甲基),R 2为H、C 1-3烷基(例如甲基、乙基或正丙基)、苯基或HC(=O)-CH(Br)-;或者,
R 1和R 2与它们所连接的羰基形成环状结构,例如茚三酮。
在某些实施方案中,所述第二羰基化合物选自:乙二醛、茚三酮及其任意组合。
在某些实施方案中,所述亚硝酸盐选自亚硝酸钠、亚硝酸钾以及二者的组合。
在某些实施方案中,所述试剂盒还包含硼酸或其盐(例如硼酸钾)。
在某些实施方案中,所述试剂盒还包含二甲基亚砜(DMSO)、N,N-二甲基甲酰胺(DMF)或其组合。
在某些实施方案中,所述试剂盒还包含选自:2-(N-吗啉)乙磺酸(MES)、醋酸钠、3-(N-吗啡啉)丙磺酸(MOPS)、哌嗪-1,4-二乙磺酸(PIPES)、4-羟乙基哌嗪乙磺酸(HEPPS)、三羟甲基氨基甲烷(TRIS)中的至少一种物质。所述试剂盒还包含对甲苯磺酸、磷酸或其组合。
在某些实施方案中,所述试剂盒还包含选自:对甲苯磺酸、磷酸、2-(N-吗啉)乙磺酸(MES)、醋酸钠、3-(N-吗啡啉)丙磺酸(MOPS)、哌嗪-1,4-二乙磺酸(PIPES)、4-羟乙基哌嗪乙磺酸(HEPPS)、三羟甲基氨基甲烷(TRIS)中的至少一种物质。
在某些实施方案中,所述试剂盒还包含选自下列的物质:配制2-(N-吗啉)乙磺酸(MES)缓冲液的物质、配制醋酸钠缓冲液的物质、配制3-(N-吗啡啉)丙磺酸(MOPS)缓冲液的物质、配制哌嗪-1,4-二乙磺酸(PIPES)缓冲液的物质、配制4-羟乙基哌嗪乙磺酸(HEPPS)缓冲液的物质、配制三羟甲基氨基甲烷(TRIS)缓冲液的物质及其任意组合。
在某些实施方案中,所述试剂盒还包含2-(N-吗啉)乙磺酸(MES)缓冲液、醋酸钠缓冲液、3-(N-吗啡啉)丙磺酸(MOPS)缓冲液、哌嗪-1,4-二乙磺酸(PIPES)缓冲液、4-羟乙基哌嗪乙磺酸(HEPPS)缓冲液或三羟甲基氨基甲烷(TRIS)缓冲液或其任意组合。
在某些实施方案中,所述试剂盒还包含选自下列的物质:配制磷酸盐缓冲液的物质、配制醋酸三乙胺(TEAA)缓冲液的物质及二者的组合。
在某些实施方案中,所述试剂盒还包含磷酸盐缓冲液、醋酸三乙胺(TEAA)缓冲液或其组合。
本申请中所述的配制缓冲液的物质可以以干粉形式存在也可以以溶液形式存在。所述配制缓冲液的物质是本领域所公知的,可以根据实验需要按照本领域常规方法进行选择。
例如,所述配制2-(N-吗啉)乙磺酸(MES)缓冲液的物质包括2-吗啉乙磺酸,以 及任选的氢氧化钠。所述配制醋酸钠缓冲液的物质包括醋酸钠,以及任选的醋酸。所述配制3-(N-吗啡啉)丙磺酸(MOPS)缓冲液的物质包括3-(N-吗啡啉)丙磺酸,以及任选的氢氧化钠。所述配制哌嗪-1,4-二乙磺酸(PIPES)缓冲液的物质包括哌嗪-1,4-二乙磺酸,以及任选的氢氧化钠。所述配制4-羟乙基哌嗪乙磺酸(HEPPS)缓冲液的物质包括4-羟乙基哌嗪乙磺酸,以及任选的氢氧化钠。所述配制三羟甲基氨基甲烷(TRIS)缓冲液的物质包括三羟甲基氨基甲烷,以及任选的盐酸。
在某些实施方案中,所述醋酸三乙胺(TEAA)缓冲液中含有45~50v/v%甲酰胺,pH为8~9。
在某些实施方案中,所述磷酸盐缓冲液中含有40~60v/v%二甲基亚砜(DMSO),pH为7.1~8。
在某些实施方案中,所述试剂盒还包含质谱检测和/或DNA测序所用试剂。
在某些实施方案中,所述试剂盒还包含说明书。
在某些实施方案中,所述说明书记载了如上文所述的方法。
在一方面,本申请还提供了如上文所述的试剂盒在检测核酸分子中N6-甲基腺嘌呤中的用途。
术语定义
在本申请中,除非另有说明,否则本文中使用的科学和技术名词具有本领域技术人员所通常理解的含义。同时,为了更好地理解本申请,下面提供相关术语的定义和解释。
当本文使用术语“例如”、“如”、“诸如”、“包括”、“包含”或其变体时,这些术语将不被认为是限制性术语,而将被解释为表示“但不限于”或“不限于”。
除非本文另外指明或根据上下文明显矛盾,否则术语“一个”和“一种”以及“该”和类似指称物在描述本申请的上下文中(尤其在以下权利要求的上下文中)应被解释成覆盖单数和复数。
如本文所使用的,除非上下文明确指出,否则,本文所使用的术语“腺嘌呤”或“A”,涵盖腺嘌呤本身,以及含有腺嘌呤的核苷酸(例如核糖核苷酸、脱氧核糖核苷酸、腺 嘌呤核糖核苷酸残基、腺嘌呤脱氧核糖核苷酸残基);术语“N6-甲基腺嘌呤”、“6mA”、“m6A”或“m 6A”,涵盖N6-甲基腺嘌呤本身,以及含有N6-甲基腺嘌呤的核苷酸(例如核糖核苷酸、脱氧核糖核苷酸、核糖核苷酸残基、脱氧核糖核苷酸残基);术语“次黄嘌呤”或“I”,涵盖次黄腺嘌呤本身,以及含有次黄嘌呤的核苷酸(例如核糖核苷酸、脱氧核糖核苷酸、核糖核苷酸残基、脱氧核糖核苷酸残基);术语“黄嘌呤”或“X”,涵盖黄嘌呤本身,以及含有黄嘌呤的核苷酸(例如核糖核苷酸、脱氧核糖核苷酸、核糖核苷酸残基、脱氧核糖核苷酸残基);术语“鸟嘌呤”或“G”,涵盖鸟嘌呤本身,以及含有鸟嘌呤的核苷酸(例如核糖核苷酸、脱氧核糖核苷酸、核糖核苷酸残基、脱氧核糖核苷酸残基);术语“胸腺嘧啶”或“T”,涵盖胸腺嘧啶本身,以及含有胸腺嘧啶的核苷酸(例如核糖核苷酸、脱氧核糖核苷酸、核糖核苷酸残基、脱氧核糖核苷酸残基);术语“胞嘧啶”或“C”,涵盖胞嘧啶本身,以及含有胞嘧啶的核苷酸(例如核糖核苷酸、脱氧核糖核苷酸、核糖核苷酸残基、脱氧核糖核苷酸残基);术语“尿嘧啶”或“U”,涵盖尿嘧啶本身,以及含有尿嘧啶的核苷酸(例如核糖核苷酸、脱氧核糖核苷酸、核糖核苷酸残基、脱氧核糖核苷酸残基)。
如本文所使用的,除非上下文明确指出,否则,本文所使用的术语“m6A”、“6mA”和“m 6A”三者均涵盖DNA、RNA或DNA/RNA杂合体中的N6-甲基腺嘌呤本身,以及含有N6-甲基腺嘌呤的核苷酸(例如核糖核苷酸、脱氧核糖核苷酸、核糖核苷酸残基、脱氧核糖核苷酸残基),三者可互换使用。
如本文所使用的,除非上下文明确指出,否则,本文所使用的化合物具有本领域技术人员通常理解的含义,例如,如本文所用的化合物及其相应CAS号如下:2-(N-吗啉)乙磺酸(MES,CAS:4432-31-9)、醋酸钠(CAS:127-09-3)、3-(N-吗啡啉)丙磺酸(MOPS,CAS:1132-61-2)、哌嗪-1,4-二乙磺酸(PIPES,CAS:5625-37-6)、4-羟乙基哌嗪乙磺酸(HEPPS,CAS:7365-45-9)或三羟甲基氨基甲烷(TRIS,CAS:77-86-1)、乙二醛(CAS:107-22-2)、丙酮醛(CAS:78-98-8)、2,3-丁二酮(CAS:431-03-8)、茚三酮(CAS:485-47-2)、2-溴丙二醛(CAS:2065-75-0)、三氯乙醛(CAS:75-87-6)、苯基乙二醛(CAS:1075-06-5)、甲酰胺(CAS:75-12-7)、对甲苯磺酸(CAS:104-15-4)、磷酸(CAS:7664-38-2)、氢氧化钠(CAS:1310-73-2)、盐酸(CAS:7647-01-0)、醋酸(CAS:64-19-7)。
如本文所使用的,本文中使用的各种缓冲液具有本领域技术人员所通常理解的含义,可以根据实验需要按照本领域常规方法进行配制。例如,本文所述2-(N-吗啉)乙磺酸(MES)缓冲液可由2-(N-吗啉)乙磺酸(MES)配制而成,任选地根据需要用氢氧化钠溶液调节pH值。例如,本文所用醋酸钠缓冲液可由醋酸钠配制而成,任选地根据需要用醋酸调节pH值。例如,本文所用3-(N-吗啡啉)丙磺酸(MOPS)缓冲液可由3-(N-吗啡啉)丙磺酸配制而成,任选地根据需要用氢氧化钠调节pH值。例如,本文所用哌嗪-1,4-二乙磺酸(PIPES)缓冲液可由哌嗪-1,4-二乙磺酸配制而成,任选地根据需要用氢氧化钠调节pH值。例如,本文所用4-羟乙基哌嗪乙磺酸(HEPPS)缓冲液可由4-羟乙基哌嗪乙磺酸配制而成,任选地根据需要用氢氧化钠调节pH值。例如,本文所用三羟甲基氨基甲烷(TRIS)缓冲液可由三羟甲基氨基甲烷配制而成,任选地根据需要用盐酸调节pH值。
有益效果
与现有技术相比,本申请的技术方案具有以下有益效果:
(1)采用本申请的方法,A to I的转化率高,从而可以实现对低丰度的m 6A位点进行定量和/或定性检测。
(2)采用本申请的方法,假阳性或非特异性转化的转化率(例如C toU的转化,G to X的转化)低,对m 6A进行分析时背景信号低。
(3)本申请的方法利用亚硝酸盐催化腺嘌呤脱氨转化为次黄嘌呤,而N6-甲基腺嘌呤因其化学性质稳定无法脱氨。因此,本申请的方法通过分析转化前后腺嘌呤或次黄嘌呤的信号,实现m 6A位点单碱基分辨率的检测。
因此,本申请提供了一种高活性高特异性的m 6A位点检测技术,可在单碱基水平对全基因组或者全转录组中的m 6A位点进行定量分析,解决了传统检测方法存在的问题。
下面将结合附图和实施例对本申请的实施方案进行详细描述,但是本领域技术人员将理解,下列附图和实施例仅用于说明本申请,而不是对本申请的范围的限定。根据附图和优选实施方案的下列详细描述,本申请的各种目的和有利方面对于本领域技 术人员来说将变得显然。
附图说明
图1示例性显示了腺嘌呤在亚硝酸盐和羰基化合物的催化下转化为次黄嘌呤的反应过程。
图2示例性显示了使用例如二羰基化合物对鸟嘌呤进行保护时,鸟嘌呤碱基与二羰基化合物形成鸟苷加合物的反应过程。
图3示例性显示了鸟苷加合物与硼酸类化合物形成新的鸟苷加合物的反应过程。
图4为经反应条件A,B和C处理后,DNA模型序列的扩增产物的Sanger测序峰图。
图5为表3中各反应条件处理前后的扩增产物的sanger测序峰图。
图6为表4中各反应条件处理前后的扩增产物的sanger测序峰图。
图7为表5中各反应条件处理前后的扩增产物的sanger测序峰图。
图8为表6中各反应条件处理前后的扩增产物的sanger测序峰图。
图9为表7中各反应条件处理前后的扩增产物的sanger测序峰图。
图10为表8中各反应条件处理前后的扩增产物的sanger测序峰图。
图11为表9中各反应条件处理前后的扩增产物的sanger测序峰图。
图12为表10中各反应条件处理前后的扩增产物的sanger测序峰图。
图13为表11中各反应条件处理前后的扩增产物的sanger测序峰图。
图14为表12中各反应条件处理前后的扩增产物的sanger测序峰图。
图15为表13中各反应条件处理前后的扩增产物的sanger测序峰图。
图16为表14中各反应条件处理前后的扩增产物的sanger测序峰图。
图17为表15中各反应条件处理前后的扩增产物的sanger测序峰图。
图18为表16中各反应条件处理前后的扩增产物的sanger测序峰图。
图19为表17中各反应条件处理前后的扩增产物的sanger测序峰图。
图20为本申请方法实现的RNA样品中A to G的信号转化效率。(a)测序数据中,rRNA和模式序列中的A to G的信号转化率;其中,m6A-1和m6A-2代表两个生物学重复。(b)测序数据中,mRNA中A to G的信号转化率。
图21为本申请方法实现的RNA样品中C to T的信号转化效率。(a)测序数据中,rRNA和模式序列中的C to T的信号转化率;其中,m6A-1和m6A-2代表两个生物学重复。(b)测序数据中,mRNA中C to T的信号转化率。
图22为本申请方法在rRNA上的高通量测序信号。(a)18srRNA上经典的m 6A_1832位点在测序中的信号。红色矩形内为该m 6A修饰位点,黑色边框的矩形内列出来该位点上覆盖的读段在该位点上分别为A,T,C,G四种碱基的百分比;(b)28srRNA上经典的m 6A_4220位点在测序中的信号。将参考转录组(版本号GRCh38.p13;GCF_000001405.39)中的A转化为G后作为比对模板,绿色色块即为测序信号相对于比对模板产生的G-to-A突变信号,代表该位置绝大部分为碱基A。
图23为本申请在mRNA上的高通量测序信号。(a)基因ACTB的mRNA上1216号m 6A位点(chr7:5527743,版本hg38)在测序中的信号。红色矩形内为该m 6A修饰位点,黑色边框的矩形内列出来该位点上覆盖的读段在该位点上分别为A,T,C,G四种碱基的百分比。将参考转录组(版本号GRCh38.p13;GCF_000001405.39)负链中的T转化为C后作为比对模板,负链转录mRNA的m 6A修饰在测序中表现为相对于比对模板产生的C to T的突变信号。(a)基因INCENP的mRNA上967号和1060号m 6A位点(chr11:62130330;chr11:62130423)在测序中的信号。INCENP为正链基因。将参考转录组(版本号GRCh38.p13;GCF_000001405.39)中的A转化为G后作为比对模板,示意图中的绿色色块即为测序信号相对于比对模板产生的G到A的突变信号,代表该位置绝大部分为碱基A。
图24为本申请方法检测到m 6A在mRNA中的分布。
图25为本申请方法在模式序列和mRNA中检测的m 6A修饰水平。(a)本申请在模式序列中,检测到的m 6A修饰水平与加入的模式序列的修饰水平的比较,其中,Rep-1和Rep-2代表两个生物学重复。(b)本申请在HEK293T细胞mRNA的两个重复中所检测到m 6A数目以及其重复度。(c)本申请在HEK293T细胞mRNA的两个重复中,检测的共有的m 6A修饰位点的修饰水平的比较。
图26为本申请方法中不同反应条件下mRNA样品中A to G的信号转化效率。(a)测序数据中,不同反应条件下mRNA的A to G的信号转化率;(b)本申请在不同反应条件下检测到的模式序列中m 6A修饰水平与模式序列修饰水平的比较。图中,“NS-16” 对应实施例3的转化反应示例性条件C,“NS-50”对应实施例3的转化反应示例性条件D,“NS-TsOH”对应实施例3的转化反应示例性条件E。
图27示出了本申请方法中使用乙二醛催化腺嘌呤核糖核苷酸转化为次黄嘌呤核糖核苷酸反应过程中产生的亚胺中间体的二级质谱峰图。
序列信息
表1:本申请涉及的序列的信息描述于下面的表中。
Figure PCTCN2021130282-appb-000004
Figure PCTCN2021130282-appb-000005
注:“N”=A,T,C or G;“X”=A,T,C or G;
具体实施方式
现参照下列意在举例说明本申请(而非限定本申请)的实施例来描述本申请。
本领域技术人员知晓,实施例以举例方式描述本申请,且不意欲限制本申请所要求保护的范围。实施例中的实验方法,如无特殊说明,均为常规方法。实施例中未注明具体条件者,按照常规条件或制造商建议的条件进行。所用试剂或仪器未注明生产厂商者,均为可以通过市购获得的常规产品。
溶液配制:
Deprotection A buffer:0.5M醋酸三乙胺(TEAA),pH 8.6,47.5%甲酰胺;
Deprotection B buffer:50%二甲基亚砜(DMSO)in 137mM NaCl,2.7mM KCl,8mM Na 2HPO 4,2mM KH 2PO 4,pH 7.4;
5M NaNO 2:345mg NaNO 2溶解于DEPC水中,并补水到1ml;
500mM MES buffer(pH 5):106.6mg 2-(N-吗啉)乙磺酸(MES)溶解于900μL  DEPC水,1M NaOH调pH至5,补水至1mL;
500mM MES buffer(pH 5.5):106.6mg MES溶解于900μL DEPC水,1M NaOH调pH至5.5,补水至1mL;
500mM MES buffer(pH 6):106.6mg MES溶解于800μL DEPC水,1M NaOH调pH至6,补水至1mL;
H 3BO 3buffer:取100mg H 3BO 3加入1mL DEPC水,混匀取上清液备用(H 3BO 3饱和溶液);
5M NaNO 2对甲基苯磺酸水溶液(pH 6):345mg NaNO 2溶解于DEPC水,使用对甲基苯磺酸调节pH值至6,并补水到1ml;
2-溴丙二醛溶液:2.0M 2-溴丙二醛DMSO/H 2O(1:1)溶液,取301.9mg 2-溴丙二醛,溶解于0.5mL二甲基亚砜(DMSO),混匀后再加入0.5mL DEPC水,混匀;
茚三酮溶液:0.15M茚三酮溶液,取26.7mg茚三酮,溶解于1mL DMSO,混匀;
75%乙醇:30mL无水乙醇加入10mL DEPC水,混匀;
1M NH 4OAc(pH 5.3):取77.0mg醋酸铵,溶解于800μL DEPC水中,醋酸调pH至5.3,补水至1mL。
试剂来源:
NaNO 2、40%乙二醛水溶液(摩尔浓度为8.8M)、苯基乙二醛、2-溴丙二醛、三氯乙醛、无水醋酸钠等化学试剂购买自安耐吉;
MES购买自Amresco;
对甲基苯磺酸购买自安耐吉;
H 3BO 3购买自Aladdin;
DEPC水购买自生工生物;
DMSO、无水乙醇等溶剂购买自北京通广精细化工公司;
PCR所用的扩增酶为KOD One TMPCR Master Mix,购买自TOYOBO;
核酸助沉剂(Glycogen)购买自ZOMANBIO;
Total RNA使用RNA提取试剂盒(Trizol法)从HEK293T细胞中提取,所述试剂盒购自Invitrogen,货号:15596026;
核酸酶P1(Nuclease P1 from Penicillium citrinum lyophilized powder)购买自SIGMA;
碱性磷酸酶(Shrimp Alkaline Phosphatase)rSAP购买自SIGMA;
SEQ ID NOs:1所示DNA model sequence使用DNA合成仪(ABI 3400)合成,合成规格为1umol。核酸单体(A、T、C、G及6mA)均购买自安徽芜湖华仁科技有限公司;
细菌基因组DNA提取试剂盒购买自天根,货号DP302;
DNA尿嘧啶糖苷酶购买自New England Biolabs(NEB)货号M0280。
经过测试,各种来源的试剂对实验结果没有显著影响,条件稳定性好。
实施例1:DNA模型中6mA的检测
一、实验方法
1.A(腺嘌呤)to I(次黄嘌呤)的转化反应
A to I的转化反应可参照示例性反应条件A、B或C进行。
转化反应示例性反应条件A
1.1核酸保护
取1ug 6mA DNA模型(序列如SEQ ID NO:1所示)溶于2.8μL DEPC水中,加入4μL DMSO以及1.2μL 40wt/wt%乙二醛水溶液(乙二醛的终浓度为约1.3M),混匀。加热至50℃,孵育1h,得到乙二醛保护的核酸,置于冰上冷却备用。
1.2亚硝化反应
配置反应液:5M NaNO 2 3μL,500mM MES(pH 6)1.6μL,H 3BO 3 2μL,DEPC水5.4μL。向其中加入上述步骤1.1制备的乙二醛保护的核酸8μL,总体积共20μL,16℃反应24h。
转化反应示例性反应条件B
1.1核酸保护
取1ug 6mA DNA模型(序列如SEQ ID NO:1)溶于2.8μL DEPC水中,加入4μL DMSO以及1.2μL 40%乙二醛,混匀。加热至50℃,1h,得到乙二醛保护的核酸,置于冰上冷却备用。
1.2亚硝化反应
配置反应液:5M NaNO 2 3μL,500mM MES(pH 6)1.6μL,H 3BO 3 2μL,DEPC水3.4μL,2μL 40%乙二醛水溶液。向其中加入上述步骤1.1制备的乙二醛保护的核酸8μL,总体积共20μL,16℃反应8h。
转化反应示例性反应条件C
1.1核酸保护
取1ug 6mA DNA模型(序列如SEQ ID NO:1)溶于2.8μL DEPC水中,加入4μL DMSO以及1.2μL 40%乙二醛,混匀。加热至50℃,0.5h;随后向混合液中加入H 3BO 32μL,50℃,再保护0.5h,得到乙二醛保护的核酸,置于冰上冷却备用。
1.2亚硝化反应
配置反应液:5M NaNO 2 3μL,500mM MES(pH 6)1.6μL,DEPC水3.4μL,2μL 40%乙二醛水溶液。向其中加入上述步骤1.1制备的乙二醛保护的核酸10μL,总体积共20μL,16℃反应8h。
2.转化产物纯化
2.1向上述步骤1转化反应所得样品中加入2μL 3M NaOAc(pH 5.2),2μL Glycogen混匀,55μL 75%乙醇(乙醇的终浓度为约70%),-80℃,2h。离心12000g 10min。加入1ml 70%乙醇再次离心,除去液体。
2.2在上述步骤2.1所得沉淀中加入Deprotection A buffer 50μL,95℃反应10min,置于冰上,加入5μL 3M NaOAc(pH 5.2),2μL Glycogen混匀,142μL无水乙醇,-80℃,2h。离心12000g 10min。加入1mL 70%乙醇再次离心,除去液体。
2.3取50μL DEPC水溶解上述步骤2.2制备的DNA沉淀,置于冰上备用。
3.Sanger测序
3.1向0.2ml PCR管中加入15μL KOD One TMPCR Master Mix,11μL ddH 2O,1μL上述步骤2制备的DNA样品作为模板,1.5μL扩增引物1(SEQ ID NO:2),1.5μL扩增引物2(SEQ ID NO:3)。扩增35个循环后备用。
3.2由于DNA序列中由A(腺嘌呤)转化而来的I(次黄嘌呤)在DNA扩增过程中与C(胞嘧啶)互补配对,从而被读作G(鸟嘌呤),而序列中的6mA不转化为I,仍保持与T(胸腺嘧啶)互补配对,被读作A,因此,用sanger法测序检测步骤3.1 所得扩增产物序列,通过比对扩增产物和转化前原DNA模型的序列信息,即可确定原模型序列中A to I的转化效率以及6mA的位置信息。测序用引物序列如SEQ ID NO:4所示。
4.三重串联四极杆质谱(QQQ)检测
取200ng上述步骤2制备的DNA样品,加入1U核酸酶P1,NH 4OAc(pH=5.3)2μL,补水至20μL,42℃反应2h。随后加入1μL FastAP Thermosensitive Alkaline Phosphatase,2.5μL 10x FastAP Buffer,补水至25μL,37℃反应2h。
将上述反应后溶液于4℃,12000r,离心10min,取25μL用于QQQ检测,测定经亚硝化反应后,样品DNA中I的含量,从而进一步确定DNA模型序列中A to I的转化效率。
二、实验结果
1.Sanger测序结果
1.1反应条件A、B或C下,A to I的转换效率检测结果
经反应条件A、B或C处理后,DNA模型序列的扩增产物的Sanger测序的示例性峰图如图4所示。如前所述,A脱氨后转化为I,I经扩增过程,在测序中产生G的信号,6mA反应后仍然被识别为A。通过Sanger测序信号,计算扩增产物中相对于原序列(即,处理前序列)A to G的信号转换概率,得到A to I的转化率。A to G的信号转换概率的计算方式为(以序列3’端最后一个A为例):
(i)使用软件4Peaks分别计算残存A的信号A1的峰高(绿色信号)和转化后G的信号G1(灰色信号)的峰高,然后计算未转化的dA的比例a=A1/(A1+G1);
(ii)计算A to I的效率b=(1-a)*100%;
(iii)统计模式序列中除6mA外所有A的转化效率,求平均值。
同时,可通过检测原序列中A信号的残留,确定原序列中6mA的位点信息。虽然G也会生产非特异性脱氨(变成X(黄嘌呤)),但是该效率低于测序基线,因此在sanger测序中无法评估G的非特异性转化。由图4的结果可知,经反应条件A、B或C处理后,扩增产物中,原序列中的A所在位置都产生G的信号,并且,原序列中6mA和C所在位置的信号保持不变。该结果表明,反应条件A、B或C可以实现DNA序列中A to I的高特异性、高活性的转化,从而实现DNA中6mA位点单碱基分 辨率水平的检测。
反应条件A、B或C对应的亚硝化反应催化反应条件和扩增产物的测序信号转换估算结果如表2所示,其中所有浓度均为终浓度。亚硝化反应中乙二醛终浓度由核酸保护过程添加的乙二醛经体积换算和亚硝化反应中外源添加的乙二醛加和得到。
表2
Figure PCTCN2021130282-appb-000006
注:硼酸孵育,“方式1”指在亚硝化反应体系中加入硼酸,“方式2”指在乙二醛保护核酸后,亚硝化反应之前,在核酸保护体系中加入硼酸。
1.2不同种类亚硝化试剂对A to I转化率的影响
评估不同种类亚硝化试剂对A的转化率的影响。简言之,将上述实验方法中反应条件A中步骤1.2亚硝化反应体系中的NaNO 2替换为KNO 2,具体催化反应条件和扩增产物的测序信号转换估算结果如表3所示,其余没有提及的步骤或反应条件保持与条件A中一致。
表3
Figure PCTCN2021130282-appb-000007
各反应条件下扩增产物的Sanger测序结果如图5所示,由表3和图5可知,NaNO 2和KNO 2均能有效催化A to I的转化。
1.3 NaNO 2浓度对A to I转化率的影响
本实验探究亚硝化反应中不同NaNO 2浓度对A to I转化率的影响。简言之,调整 上述实验方法步骤1.2亚硝化反应体系中的NaNO 2浓度,考察NaNO 2浓度对A to I转化率的影响。具体催化反应条件和扩增产物的测序信号转换估算结果如表4所示,其余没有提及的步骤或反应条件保持与条件B中一致。
表4
Figure PCTCN2021130282-appb-000008
各反应条件下扩增产物的Sanger测序结果如图6所示,由表4和图6可知,在表4所示的NaNO 2浓度范围内,A的转化率均接近完全(例如不低于93%)。
1.4 KNO 2浓度对A to I转化率的影响
本实验探究亚硝化反应中不同KNO 2浓度对A to I转化率的影响。简言之,调整上述实验方法步骤1.2亚硝化反应体系中的KNO 2浓度,考察KNO 2浓度对A to I转化率的影响。具体催化反应条件和扩增产物的测序信号转换估算结果如表5所示,其余没有提及的步骤或反应条件保持与条件A中一致。
表5
Figure PCTCN2021130282-appb-000009
各反应条件下扩增产物的Sanger测序结果如图7所示,由表5和图7可知,在表5所示的KNO 2浓度浓度范围内,A的转化率均接近完全(例如不低于93%)。
1.5亚硝化反应中乙二醛终浓度对A to I转化率的影响
本实验探究亚硝化反应中乙二醛终浓度对转化率的影响。简言之,调整上述实验方法步骤1.2亚硝化反应体系中乙二醛的终浓度,估算A to I的转化率。具体催化反应条件和扩增产物的测序信号转换估算结果如表6所示,其余没有提及的步骤或反应 条件保持与条件A中一致。
表6
Figure PCTCN2021130282-appb-000010
各反应条件下扩增产物的Sanger测序结果如图8所示,由表6和图8可知,提高亚硝化反应过程中乙二醛的外源添加量,能提高A to I的转化率,即,亚硝化反应中乙二醛终浓度越高,A to I的转化率也越高,并且反应所需时间更短。
1.6核酸保护时乙二醛浓度对A to I转化率的影响
本实验探究核酸保护时乙二醛浓度对A to I转化率的影响。简言之,调整上述实验方法1.1核酸保护步骤中乙二醛的浓度,考察乙二醛浓度对A to I转化率的影响。具体催化反应条件和扩增产物的测序信号转换估算结果如表7所示,其余没有提及的步骤或反应条件保持与条件B中一致。
表7
Figure PCTCN2021130282-appb-000011
各反应条件下扩增产物的Sanger测序结果如图9所示,由表7和图9可知,在反应条件1-4下,A的转化率均在87%及以上,并且,核酸保护过程乙二醛浓度越高,A的转化率也相应升高。
1.7亚硝化反应中乙二醛外源添加量对A to I转化率的影响
本实验探究亚硝化反应中乙二醛外源添加量对A to I转化率的影响。简言之,调整上述实验方法1.2亚硝化反应步骤中外源添加乙二醛的量,考察外源添加乙二醛的量对A to I转化率的影响。具体催化反应条件和扩增产物的测序信号转换估算结果如 表8所示,其余没有提及的步骤或反应条件保持与条件B中一致。
表8
Figure PCTCN2021130282-appb-000012
各反应条件下扩增产物的Sanger测序结果如图10所示,由表8和图10可知,在反应条件1-3下,A的转化率均在93%及以上,并且,亚硝化反应中乙二醛外源添加量越多,A的转化率也相应升高。此外,根据表7-8的转化率结果,进一步证实,亚硝化反应体系中乙二醛的终浓度越高,A to I的转化率越高。
1.8核酸保护过程中孵育条件对A to I转化率的影响
本实验探究核酸保护孵育条件对A to I转化率的影响。简言之,调整上述实验方法1.1核酸保护步骤中乙二醛与核酸孵育的温度和时间,考察温度和时间对A to I转化率的影响。具体核酸保护和催化反应条件,以及扩增产物的测序信号转换估算结果如表9所示,其余没有提及的步骤或反应条件保持与条件A中一致。
表9
Figure PCTCN2021130282-appb-000013
各反应条件下扩增产物的Sanger测序结果如图11所示,由表9和图11可知,核酸保护过程中乙二醛与核酸的孵育温度在16-50℃范围内,孵育时间在15-60min内均不会影响A的转化率。
1.9缓冲液和硼酸浓度对A to I转化率的影响
本实验探究缓冲液和硼酸浓度对A to I转化率的影响。简言之,调整上述实验方法1.2亚硝化反应步骤中缓冲液的种类和硼酸浓度,考察缓冲液的种类和硼酸浓度对A to I转化率的影响。具体核酸保护和催化反应条件,以及扩增产物的测序信号转换估算结果如表10(亚硝化反应时不外源添加乙二醛)和表11(亚硝化反应时外源添加乙二醛)所示,其余没有提及的步骤或反应条件保持与条件A中一致。
表10
Figure PCTCN2021130282-appb-000014
表11
Figure PCTCN2021130282-appb-000015
表10和表11各反应条件下扩增产物的Sanger测序结果分别如图12和图13所示。结果显示,亚硝化反应体系中硼酸浓度在5-100mM均不影响A to I的转化;此外,表10和表11的结果还表明,将MES替换为NaOAc,或者,直接使用水作为溶剂,A to I的转化效率几乎不受影响。
1.10 MES缓冲液浓度和pH值对A to I转化率的影响
本实验探究MES缓冲液浓度和pH值对亚硝化反应A to I转化率的影响。简言之, 调整上述实验方法1.2亚硝化反应步骤中MES缓冲液的浓度和pH值,考察MES缓冲液的浓度和pH值对A to I转化率的影响。具体核酸保护和催化反应条件,以及扩增产物的测序信号转换估算结果如表12(不同MES浓度)和表13(不同pH值)所示,其余没有提及的步骤或反应条件保持与条件A中一致。
表12
Figure PCTCN2021130282-appb-000016
表13
Figure PCTCN2021130282-appb-000017
表12和表13各反应条件下扩增产物的Sanger测序结果分别如图14和图15所示。结果显示,不同浓度(例如20-750mM)和不同pH值(例如4.0-5.5)的MES缓冲液均能有效促进A to I的转化。
1.11硼酸或其盐对A to I转化率的影响
本实验探究亚硝化反应中硼酸或其盐对A to I转化率的影响。简言之,将上述实验方法1.2亚硝化反应步骤中的硼酸替换为不同pH值的硼酸钾,估算A to I的转化率。具体核酸保护和催化反应条件,以及扩增产物的测序信号转换估算结果如表14所示,其余没有提及的步骤或反应条件保持与条件B中一致。
表14
Figure PCTCN2021130282-appb-000018
各反应条件下扩增产物的Sanger测序结果如图16所示,由表14和图16可知,将亚硝化反应中的硼酸替换为硼酸盐后,A to I的转化率不受影响(例如均在92%及以上)。
1.12硼酸孵育方式对A to I转化率的影响
本实验探究硼酸孵育方式对A to I转化率的影响。简言之,调整上述实验方法中硼酸孵育的方式,估算A to I的转化率。具体核酸保护和催化反应条件,以及扩增产物的测序信号转换估算结果如表15所示,其余没有提及的步骤或反应条件保持与条件B中一致。其中,硼酸孵育“方式1”指在亚硝化反应体系中加入硼酸;“方式2”指在乙二醛保护核酸后,亚硝化反应之前,在核酸保护体系中加入硼酸;“方式3”指将乙二醛与硼酸孵育后,再使用乙二醛和硼酸孵育产物对核酸进行保护。
表15
Figure PCTCN2021130282-appb-000019
各反应条件下扩增产物的Sanger测序结果如图17所示,由表15和图17可知,上述3种硼酸孵育方式下,A的转化率均在96%及以上,并且,通过方式2进行H 3BO 3孵育,A的转化率相对更高一点,但是没有显著差别。
1.13亚硝化反应温度对A to I转化率的影响
本实验探究亚硝化反应温度对A to I转化率的影响。简言之,调整上述实验方法1.2亚硝化反应步骤中的反应温度和时间,考察亚硝化反应的温度对A to I转化率的影响。具体核酸保护和催化反应条件,以及扩增产物的测序信号转换估算结果如表16所示,其余没有提及的步骤或反应条件保持与条件B中一致。
表16
Figure PCTCN2021130282-appb-000020
各反应条件下扩增产物的Sanger测序结果如图18所示,由表16和图18可知,亚硝化反应在16-50℃的温度范围内均可以发生,温度越高所需要的反应时间越短。
1.14羰基化合物的种类对A to I转化率的影响
本实验探究羰基化合物的种类对A to I转化率的影响。简言之,将上述实验方法1.1核酸保护和1.2亚硝化反应步骤中的使用的乙二醛替换为不同的种类和浓度的羰基化合物,考察羰基化合物种类和浓度对A to I转化率的影响。具体核酸保护和催化反应条件,以及扩增产物的测序信号转换估算结果如表17所示,其余没有提及的步骤或反应条件保持与条件B中一致。
表17
Figure PCTCN2021130282-appb-000021
Figure PCTCN2021130282-appb-000022
各反应条件下扩增产物的Sanger测序结果如图19所示,由表17和图19可知,使用高浓度乙二醛(例如1.3M乙二醛)进行核酸保护后,在亚硝化反应体系中添加不同羰基化合物(例如乙二醛、丙酮醛、2,3-丁二酮、苯基乙二醛、2-溴丙二醛、三氯乙醛、茚三酮等)均能促进A to I的转化,转化率均达88%及以上;此外,茚三酮作为羰基化合物不仅能催化硝化反应的发生,并且可以保护核酸,其效果与乙二醛类似。
2.三重串联四极杆质谱(QQQ)检测结果
如前所述,Sanger测序法是通过检测亚硝化反应后,DNA的扩增产物中A to G的信号转化来检测A的转化率,其中,A to G的信号转化由A to I的转化和DNA扩增过程中I与C的配对产生。因此,扩增过程中碱基互补配对的准确性可能会对转化率的测定结果产生影响,因此,我们进一步使用质谱直接测定亚硝化反应产物中I的含量,通过计算产物中I/(A+I)的比率计算A的转化率,对Sanger测序结果计算的转化率进行进一步验证。
不同NaNO 2浓度、不同buffer种类和浓度,以及不同H 3BO 3浓度和孵育方式处理后DNA样品的QQQ检测结果如表18和表19所示,其余没有提及的步骤或反应条件保持与条件B中一致。
表18
Figure PCTCN2021130282-appb-000023
表19
Figure PCTCN2021130282-appb-000024
注:硼酸孵育,“方式2”指在乙二醛保护核酸后,亚硝化反应之前,在核酸保护体系中加入硼酸。
表18和19的结果表明,上述条件下DNA样品中A to I的转化均趋于完全。因此,QQQ检测结果进一步验证了本申请的方法可实现A to I的高效转化,从而可以实现核酸样品中6mA修饰水平和6mA修饰位点的准确检测。
实施例2:基因组DNA中6mA的检测
本实施例进一步验证了本申请的方法在基因组水平检测6mA的效果。
一、实验方法
1.基因组DNA提取以及片段化
使用细菌基因组DNA提取试剂盒(购自天根,货号:DP302)提取E.coli k12mg1655菌株(购自ATCC(美国菌种保藏中心),货号:700926D-5)的基因组DNA,使用非接触式超声波破碎仪(Covaris M220)将提取的基因组DNA片段化为550bp左右的片段,旋干DNA样品,置于冰上备用。
1.核酸变性
2.1甲酰胺变性
取1ug步骤1获得的片段化DNA溶于2.8μL 85%去离子甲酰胺水溶液,95℃,10min后,立即置于冰上2min,立即用于下一步核酸保护。
2.2 0.1M NaOH变性
取1ug步骤1获得的片段化DNA溶于2.0μL 0.1M NaOH水溶液,95℃,10min后,立即置于冰上2min后加入0.8μL 500mM MES溶液,调节pH至6,置于冰上,立即用于下一步核酸保护。
注:核酸变性条件选择2.1甲酰胺变性或者2.2 NaOH变性其中一种方法即可。
2.A(腺嘌呤)to I(次黄嘌呤)转化反应
同实施例1反应条件C。
3.转化产物纯化
同实施例1步骤2。
5.三重串联四极杆质谱(QQQ)检测
5.1取500ng制备的DNA,加入1U核酸酶P1,NH 4OAc(pH=5.3)2μL,补水至20μL,42℃反应2h。随后加入1μL FastAP Thermosensitive Alkaline Phosphatase,2.5μL 10x FastAP Buffer,补水至25μL,37℃反应2h。
将上述反应后溶液于4℃,12000r,离心10min,取25μL用于QQQ检测,测定经亚硝化反应后,样品DNA中I的含量,从而进一步确定DNA序列中A to I的转化效率。
二、实验结果
经甲酰胺变性、碱变性或者不经变性预处理后,各基因组DNA反应样品的QQQ检测结果如表20所示。
表20
Figure PCTCN2021130282-appb-000025
由表20可知,条件C可以高效实现A to I的转化。并且,对于不经变性预处理的双链DNA中的A也转化完全,表明本申请的方法能够实现DNA样品中A to I的转换,并且转化效率很高。
实施例3:RNA中m 6A的检测
一、实验方法
1.RNA提取及纯化
1.1使用RNA提取试剂盒(Trizol法)(购自Invitrogen,货号:15596026)从 HEK293T细胞(购自ATCC(美国菌种保藏中心),货号CRL-3216)中提取总RNA。
1.2取1ug步骤1.1制备的总RNA,使用无核酸酶水或DEPC水稀释至18μL,加入NEB Next Magnesium RNA Fragmentation ModμLe 2μL,94℃4min,随即冷却至4℃。加入终止液2μL混匀,Glycogen 2μL混匀,乙醇50μL(终浓度70%EtOH),-20℃,1h。离心12000g 20min。加入1ml 70%乙醇再次离心,除去液体。
2.A(腺嘌呤)to I(次黄嘌呤)转化反应
A to I的转化反应可参照示例性反应条件A、B、C、D或E进行。
转化反应示例性条件A
2.1核酸保护
将步骤1制备的RNA溶于2.8μL DEPC水中,加入4μL DMSO以及1.2μL 40%乙二醛,混匀,50℃,1h,得到乙二醛保护的核酸,置于冰上冷却备用。
2.2亚硝化反应
配置RNA反应液:5M NaNO 2 3μL,500mM MES(pH 6)1.6μL,H 3BO 3 2μL,DEPC水5.4μL。向其中加入上述乙二醛保护的核酸8μL,总体积共20μL,16℃反应8h。
转化反应示例性条件B
2.1核酸保护
将步骤1制备的RNA溶于2.8μL DEPC水中,加入4μL DMSO以及1.2μL 40%乙二醛,混匀,50℃,1h,得到乙二醛保护的核酸,置于冰上冷却备用。
2.2亚硝化反应
配置RNA反应液:5M NaNO 2 3μL,500mM MES(pH 6)1.6μL,H 3BO 3 2μL,DEPC水3.4μL,2μL 40%乙二醛水溶液。向其中加入上述乙二醛保护的核酸8μL,总体积共20μL,16℃反应8h。
转化反应示例性条件C
2.1核酸保护
将步骤1制备的RNA溶于2.8μL DEPC水中,加入4μL DMSO以及1.2μL 40%乙二醛,混匀,50℃,0.5h;随后向混合液中加入H 3BO 3 2μL,50℃,再保护0.5h,得到乙二醛保护的核酸,置于冰上冷却备用。
2.2亚硝化反应
配置RNA反应液:5M NaNO 2 3μL,500mM MES(pH 6)1.6μL,DEPC水3.4μL,2μL 40%乙二醛水溶液。向其中加入上述乙二醛保护的核酸10μL,总体积共20μL,16℃反应8h。
转化反应示例性条件D
2.1核酸保护
将步骤1制备的RNA溶于2.8μL DEPC水中,加入4μL DMSO以及1.2μL 40%乙二醛,混匀,50℃,0.5h;随后向混合液中加入H 3BO 3 2μL,50℃,再保护0.5h,得到乙二醛保护的核酸,置于冰上冷却备用。
2.2亚硝化反应
配置RNA反应液:5M NaNO 2 4μL,500mM MES(pH 6)1.6μL,DEPC水3.4μL,2μL 40%乙二醛水溶液。向其中加入上述乙二醛保护的核酸10μL,总体积共20μL,50℃反应0.5h。
转化反应示例性条件E
2.1核酸保护
将步骤1制备的RNA溶于2.8μL DEPC水中,加入4μL DMSO以及1.2μL 40%乙二醛,混匀,50℃,0.5h;随后向混合液中加入H 3BO 3 2μL,50℃,再保护0.5h,得到乙二醛保护的核酸,置于冰上冷却备用。
2.2亚硝化反应
配置RNA反应液:5M NaNO 2对甲基苯磺酸水溶液(pH 6)3μL,DEPC水5μL,2μL 40%乙二醛水溶液。向其中加入上述乙二醛保护的核酸10μL,总体积共20μL,50℃反应0.5h。
3.转化产物纯化
3.1将步骤2制备的样品加入2μL 3M NaOAc(pH 5.2),2μL Glycogen混匀,55μL乙醇(终浓度70%EtOH),-20℃,1h。离心12000g 20min。加入1ml 70%乙醇再次离心,除去液体。
3.2在上述管中加入Deprotection A buffer 50μL,95℃反应10min,置于冰上, 加入5μL 3M NaOAc(pH 5.2),2μL Glycogen混匀,142μL乙醇,-20℃,1h。离心12000g 20min。加入1ml 70%乙醇再次离心,除去液体。
3.3在上述管中加入Deprotection B buffer 50μL,65℃反应120min,置于冰上,加入5μL 3M NaOAc(pH 5.2),2μL Glycogen混匀,142μL乙醇,-20℃,2h。离心12000g 10min。加入1ml 70%乙醇再次离心,除去液体。
注:上述步骤3.3是可选步骤。
3.4使用RNA Clean&Concentrator Kits纯化RNA,Nanodrop测定浓度。
4.cDNA建库
未处理的RNA和亚硝化反应后的RNA被用来进行文库构建。RNA样品用PNK(多聚核苷酸激酶,购自NEB,货号:M0201S)处理,使RNA片段3’端去磷酸化,将去磷酸化的RNA与3’RNA linker(SEQ ID NO:5)用T4RNA ligase2,truncated KQ(购自NEB,货号:M0373S)连接,并用5’Deadenylase(购自NEB,货号:M0331S)和RecJf酶(购自NEB,货号:M0264S)将多余的3’RNA linker消化掉。
使用SuperScript III(购自Invitrogen,货号:18080093)对RNA进行逆转录,用silane beads纯化逆转录后的cDNA,将带有5’磷酸化的adaptor(SEQ ID NO:6)与纯化的cDNA的3’端用T4RNA ligase 1,high concentration(购自NEB,货号:M0437M)连接。连接了5’adaptor的cDNA继续用silane beads纯化。使用cDNA扩增引物1和2(SEQ ID NOs:7-8,其中“XXXXXX”为index序列)对cDNA进行PCR扩增。PCR产物用8%的TBE胶纯化,将片段大小为160-250bp之间的胶切下来并淬胶回收,得到cDNA测序文库。最后使用Illumina Hiseq X10 PE150对所得文库进行测序。
5.三重串联四极杆质谱(QQQ)检测
取200ng步骤3制备的RNA,加入1U核酸酶P1,NH 4OAc 2μL,补水至20μL,42℃反应2h。随后加入1μL FastAP Thermosensitive Alkaline Phosphatase,2.5μL 10x FastAP Buffer,补水至25μL,37℃反应2h。
将上述反应后溶液于4℃,12000r,离心10min,取25μL用于QQQ检测。
二、实验结果
1.高通量测序及数据分析
1.1高通量数据分析
1)本申请原始数据的过滤
数据下机后,首先使用trim_galore(版本0.6.6)软件对测序数据(FASTQ文件)进行测序接头的去除,具体命令参数为:trim_galore-q 20--stringency 1–length 35。去除接头以后,考虑到该技术用的是eclip建库方法,每条reads的5’端有10bp random N barcode,可用于去除由建库过程中PCR引起的重复。我们利用该barcode对测序数据进行去重。最后利用FASTX-Toolkit(版本0.0.13)去除得到的去重结果中10bp的barcode。过滤完成后的数据将用于后续的分析。
2)本申请数据的回贴
我们对测序数据进行后续的数据分析。我们首先将测序数据回贴到参考基因组上。具体方法如下:在本申请测序结果中,绝大部分的碱基A会被转化为G,接近于三元碱基数据。因此,数据回贴的首要任务是构建基于三元碱基的数据回贴环境。首先将后期需要用到的参考基因组(版本号hg38)和参考转录组(版本号GRCh38.p13;GCF_000001405.39)中的A转化为G,并分别构建数据回贴时需要的参考基因组和转录组的三元碱基索引文件。其次,利用将测序数据中,未转化的A,转化为G,得到完全的三元碱基测序数据,并标记未转化的A在读段中的坐标。
构建完基于三元碱基的数据回贴环境后,利用数据比对工具STAR(版本号2.7.5.c),将读段回贴到参考基因组上。没有比对成功的读段将会被重新提取出来,然后使用bowtie(version 1.3.0)进行重新比对,比对到转录组。再将比对到转录组上的读段的相关转录组坐标转化为基因组的坐标。随后,根据标记的未转化的A的坐标信息,将比对得到的两个BAM格式文件中的读段序列相应的坐标信息中的G转换回A。最后将两次比对的结果进行合并,得到BAM格式的文件,用于下游分析。
3)A to G信号转化效率分析
在获得回贴好的BAM文件后,首先根据BAM文件中读段的比对信息,统计参考基因组中所有位点的读段覆盖情况。该过程中,我们首先将文件从BAM格式转换成mpileup格式,随后根据mpileup格式统计基因组中每条染色体上,所有腺嘌呤A的 位置,覆盖的读段中A to G的信号转化效率。其中位数代表该染色体上A to G的信号转化效率。最后整个基因组上的A to G的信号转化效率为所有染色体转化效率中位数。
4)m6A候选位点的初步鉴定
在获得回贴好的BAM文件后,进行m6A候选位点的初步鉴定。首先根据BAM文件中读段的比对信息,统计参考基因组中所有位点的读段覆盖情况。该命令首先将文件从BAM格式转换成mpileup格式,随后根据mpileup格式统计基因组中每个位点覆盖的读段中未转化A的个数信息。随后,将上步得到的文件格式化为特定的fmat格式文件。转化格式的过程中,同时结合基因的注释信息,标记基因组上每个候选的A位点所在的基因组坐标、基因信息、转录本信息、覆盖的读段信息以及覆盖读段中含有的未转化的A的个数信息。最后,综合以上统计的信息,获得初步的m6A候选位点。
5)对m6A候选位点进行假阳性过滤
在基于亚硫酸氢钠检测m5C的方法的开发中,人们发现亚硫酸氢钠处理对RNA的二级结构敏感。具有复杂二级结构的RNA会导致其上的C在亚硫酸氢钠处理的时候,不会转化为T,进而导致假阳性结果。为了避免同样的假阳性结果,我们在初步鉴定m6A候选位点的时候,首先过滤包含三个以上的未转化A的读段,该类读段数目在整体数据中占比2%左右,可能来源于对NaNO 2处理敏感的RNAs。此外,发生在NaNO 2处理敏感区域的候选位点也会被过滤。当一个m6A候选位点,所有覆盖的读段中,有超过20%的读段中包含三个以上未转化A,这个候选位点被认为发生在NaNO 2处理敏感区域,即该区域转录的RNA不容易在NaNO 2处理的时候,发生A to G的转化。以上过滤能够进一步确保得到的m6A候选位点是可信的。
6)对m6A候选位点进行统计检验
由于本申请中A to G的突变率并没有达到100%,且不同区域转录的RNA,因为序列、RNA的二级结构等不同,会有不同的转化效率。我们进一步基于“信噪比”对候选位点进行过滤。我们根据m6A候选位点所在的基因组背景进行统计检验。进一步选取相对于所在背景,未转化的A的比例显著性高于背景的m6A候选位点。具体方法为:对于每一个候选m6A位点,我们以不同基因的转化效率作为背景,进行统计检验(基于二项分布或泊松分布),并计算候选位点的显著性P值。将P值小于0.005的 候选位点选出后,进一步进行FDR矫正,矫正后,P值小于0.005的候选位点将作为本申请最终检测的m6A候选位点。
1.2测序结果
以下示出经条件C处理后各RNA样品的cDNA文库测序结果
1)本申请的方法可以实现很高的A to I的转化效率
为了证明本申请的转化效率,我们将含有20%、50%和80%m 6A修饰比例的模式序列(Spike in 2,Spike in 3和Spike in 4)以及不含m 6A修饰的Spike in 1模式序列(SEQ ID NO:9)掺入至打断后的全细胞RNA中。
含有20%m 6A修饰比例的Spike in 2模式序列的配制方法为:将含有m 6A修饰的Spike in 2模式序列(SEQ ID NO:10)与不含m 6A修饰的Spike in 2’模式序列(SEQ ID NO:11)按照2:8的摩尔比混合;含有50%m 6A修饰比例的Spike in 3模式序列的配制方法为:将含有m 6A修饰的Spike in 3模式序列(SEQ ID NO:12)与不含m 6A修饰的Spike in 3’模式序列(SEQ ID NO:13)按照5:5的摩尔比混合;含有80%m 6A修饰比例的Spike in 4模式序列的配制方法为:将含有m 6A修饰的Spike in 4模式序列(SEQ ID NO:14)与不含m 6A修饰的Spike in 4’模式序列(SEQ ID NO:15)按照8:2的摩尔比混合。
按实施例3的实验方法进行亚硝化反应和测序建库。最后分析A to G的信号转化效率。如前所述,A脱氨后转化为I,I经扩增过程,在测序中产生G的信号,因此A to G的信号转化即反映了A to I的转化效率。
本申请中,我们利用HEK293T细胞系,分别分析了不同类型的RNA,如18s、28s rRNA、mRNA以及掺入的模式序列中,A to G的信号转化效率。结果如图20所示,由图20可知,在四种不同的RNA上,本申请的方法都能实现97%左右的A to G的信号转化效率(在图中显示为突变率),即,A to I的高转化率。
2)本申请的方法可以实现低的C to U的转化效率
同时,基于本申请的测序数据分析表明,不同类型的RNA,如18s、28s rRNA、mRNA以及掺入的模式序列中C to U的转化效率(测序结果显示为C to T的信号,分析时也是统计C to T的转化比例)保持在4%左右,结果如图21所示。该结果证实利用本申请的检测方法,C的非特异性转化率较低,且保持稳定,从而对m 6A检测的干 扰较低。
3)本申请的方法能够特异性地检测RNA上的m 6A位点
在人类HEK293T细胞系中,我们检测了rRNA上的两个已知m 6A位点,分别为18s rRNA上的m 6A_1832位点和28s rRNA上的m 6A_4220位点。这两个位点,一直被认为存在高的m 6A修饰比例。检测结果如图22所示,从IGV展示的18s rRNA上高通量测序信号图谱(图22a)中可以看到,在m 6A_1832位点区域,绝大部分覆盖的读段都是A,而周围非m 6A修饰的碱基A上,覆盖的读段都是G。这说明本申请的方法具有非常好的检测特异性,能够特异性的检测m6A位点,且能够很好的对位点的甲基化修饰水平进行定量。同样,在28s上的m 6A_4220位点也能看到清晰的m 6A修饰信号(图22b),且在该位点覆盖的A的含量接近于其真实的m6A修饰水平。
4)本申请的方法可以特异性地检测mRNA上的m 6A位点
为了验证本申请的方法能否检测到mRNA上的m 6A位点,我们在HEK293T细胞系中分别检测了mRNA上经典的m 6A修饰位点(基因ACTB的mRNA上1216号m 6A位点,基因INCENP的mRNA上967号和1060号m 6A位点),基于已知的mRNA上经典的m 6A位点,分析本申请的方法的可行性。
检测结果如图23所示,结果表明,上述已知mRNA的m 6A位点处都能检测到明显的m 6A信号。
5)本申请的方法可以检测mRNA中m 6A的分布
对所检测的m 6A位点进行分析发现,m 6A大量分布于mRNA 3’UTR区域,这种在mRNA中的分布模式符合已有报道中对m6A分布模式的研究结果。且检测到的位点序列满足已知的m6A基序DRACH(如图24所示)。说明本申请的方法可以准确检测mRNA中m 6A。
6)本申请的方法可以准确检测m 6A的修饰水平
为了进一步证明,本申请的方法能够准确检测在全转录组中m 6A的修饰水平,我们在本申请的RNA样品中掺入具有不同的m 6A修饰比例的m 6A模式序列,通过本申请的方法检测m 6A的修饰水平,并将m 6A模式序列的检测结果与其真实的m 6A修饰比例进行比较。比较结果如图25a所示,由图可知,在本申请中观察到的模式序列Spike in 2和Spike in 4的m 6A修饰水平与期望的水平基本一致,都位于对角线附近,表明 本申请的方法可以准确检测RNA中m 6A的修饰水平。其中,模式序列Spike in 3的m 6A修饰水平计算值比期望水平稍微偏低,这可能由于Spike in 3在转化后的三元碱基基因组上,有另外的相似序列,影响了测序读段回帖过程,进而导致后期计算Spike in 3中m 6A修饰位点的修饰比例偏低。
图25b显示了本申请在HEK293T细胞mRNA的两个重复中一共可以检测到超过十万个m 6A位点,且绝大多数位点在两次重复中均可以检测到(82%),说明本申请的方法灵敏度和稳定性高,可靠性好。
图25c显示了本申请的方法在HEK293T细胞mRNA的两个重复中,检测的共有的m 6A修饰位点中修饰水平的比较,如图25c中所示,两个技术重复中,m 6A位点对应的m 6A修饰水平基本围绕对角线,一致性很高,相关系数0.96,这进一步说明本申请的方法能够稳定且准确的检测m 6A的修饰水平。
7)本申请的方法中不同反应条件均可以实现m 6A的修饰水平的准确检测
为了进一步证明,本申请中的方法的不同反应条件能够准确检测全转录组中m 6A的修饰水平,我们测试了不同反应条件,提高了反应温度或者改变缓冲液的成分,通过高通量测序比较其各自转化效率的差异。
同时将含有5%、20%、50%、80%m 6A修饰比例的模式序列(Spike in 2,Spike in 4,Spike in 5,Spike in 6)以及不含m 6A修饰的Spike in 1模式序列(SEQ ID NO:9)和含m 6A修饰的Spike in 7模式序列(SEQ ID NO:20)掺入至打断后的全细胞RNA中。
含有5%m 6A修饰比例的Spike in 2模式序列的配制方法为:将含有m 6A修饰的Spike in 2模式序列(SEQ ID NO:10)与不含m 6A修饰的Spike in 2’模式序列(SEQ ID NO:11)按照0.5:9.5的摩尔比混合;含有20%m 6A修饰比例的Spike in 4模式序列的配制方法为:将含有m 6A修饰的Spike in 4模式序列(SEQ ID NO:14)与不含m 6A修饰的Spike in 4’模式序列(SEQ ID NO:15)按照2:8的摩尔比混合;含有50%m 6A修饰比例的Spike in 5模式序列的配制方法为:将含有m 6A修饰的Spike in 5模式序列(SEQ ID NO:16)与不含m 6A修饰的Spike in 5’模式序列(SEQ ID NO:17)按照5:5的摩尔比混合;含有80%m 6A修饰比例的Spike in 6模式序列的配制方法为:将含有m 6A修饰的Spike in 6模式序列(SEQ ID NO:18)与不含m 6A修饰的Spike in 6’ 模式序列(SEQ ID NO:19)按照8:2的摩尔比混合。
按上述实施例3的实验方法进行亚硝化反应和测序建库。
处理后结果如图26a所示,由图可知,不同的条件都能实现98%左右的A to G的信号转化效率。同时图26b显示不同条件下模式序列m 6A修饰水平与期望的水平基本一致,表明本申请的方法在不同条件下均可以准确检测RNA中m 6A的修饰水平。
2.三重串联四极杆质谱QQQ检测结果
2.1条件A、B、C和1-1的检测结果
条件A、B、C、D、E和1-1对应的反应条件和A to I的转化率(I/(I+A))如表21所示。条件1-1与条件C的区别在于亚硝化体系(步骤2.2)中MES缓冲液的浓度及pH值不同(如表21所示),其余均相同。条件D与条件C的区别在于所使用亚硝酸钠浓度,反应温度与时间不同。条件D为1M NaNO 2,50℃反应0.5h,其余均相同。条件E与条件C的区别在于反应所用的缓冲液的种类、反应温度与时间不同,对甲基苯磺酸水溶液作为缓冲液,50℃反应0.5h,其余均相同。
表21
Figure PCTCN2021130282-appb-000026
注:硼酸孵育,“方式1”指在亚硝化反应体系中加入硼酸,“方式2”指在乙二醛保护核酸后,亚硝化反应之前,在核酸保护体系中加入硼酸。
由表21可知,条件B、H 4、1-1处理后,RNA样品中A to I的转化率都接近完全,条件A处理后,RNA样品中A to I的转化率达95%以上。此外,从表21还可以看出, 亚硝化反应体系中,乙二醛终浓度的提高会促进A的转化。
2.2不同种类亚硝化试剂对A to I转化率的影响
将上述实验方法条件A步骤2.2亚硝化反应体系中的NaNO 2替换为不同浓度的KNO 2,具体催化反应条件和各碱基转化率如表22所示,其余没有提及的步骤或反应条件保持与条件A中一致。G在亚硝化反应中发生非特异性转化,转化为X(黄嘌呤),表中X/G表示G to X的转化率。
表22
Figure PCTCN2021130282-appb-000027
由表22可知,NaNO 2和KNO 2一样,均能有效催化A to I的转化,并且G的转化率保持在低水平(例如2.1%及以下)。
2.3 NaNO 2浓度和缓冲液对A to I转化率的影响
调整上述实验方法步骤2.2亚硝化反应体系中的NaNO 2浓度、缓冲液浓度和pH值,考察NaNO 2浓度和缓冲液对A to I转化率的影响。具体催化反应条件和各碱基转化率如表23所示,其余没有提及的步骤或反应条件保持与条件B中一致。
表23
Figure PCTCN2021130282-appb-000028
由表23可知,在上述的条件下进行亚硝化反应,A的转化率均接近完全,且G的转化率均保持在较低水平。
2.4硼酸孵育方式对A to I转化率的影响
本实验探究硼酸孵育方式对A to I转化率的影响。简言之,调整上述实验方法中硼酸孵育的方式,考察硼酸孵育方式对A to I转化率的影响。具体核酸保护和催化反应条件,以及转化率结果如表24所示,其余没有提及的步骤或反应条件保持与条件B中一致。其中,硼酸孵育“方式1”指在亚硝化反应体系中加入硼酸;“方式2”指在乙二醛保护核酸后,亚硝化反应之前,在核酸保护体系中加入硼酸;“方式3”指将乙二醛与硼酸孵育后,再使用乙二醛和硼酸孵育产物对核酸进行保护。
表24
Figure PCTCN2021130282-appb-000029
由表24可知,上述3种硼酸孵育方式均可有效促进A to I的转化,并且,G的非特异性转化率均在5%以下。
2.5乙二醛对A to I转化率的影响
本实验探究核酸保护和亚硝化反应中乙二醛浓度对A to I转化率的影响。简言之,调整上述实验方法步骤2.1核酸保护反应体系中的乙二醛浓度和步骤2.2亚硝化反应体系中外源添加乙二醛的量,考察乙二醛对A to I转化率的影响。具体催化反应条件和转化率结果如表25所示,其余没有提及的步骤或反应条件保持与条件A中一致。
表25
Figure PCTCN2021130282-appb-000030
由表25可知,亚硝化反应中乙二醛浓度越高,A的转化率也越高;并且,相对于未经核酸保护的样品,经乙二醛保护的样品A的转化率明显升高(对比样品1和2),可能是由于核酸保护过程中,乙二醛和鸟嘌呤的结合促使核酸变性,从而使A的转化 更完全。
2.6亚硝化反应时间对A to I转化率的影响
本实验探究亚硝化反应时间对A to I转化率的影响。简言之,调整上述实验方法2.2亚硝化反应步骤中的反应时间,考察亚硝化反应时间对A to I转化率的影响。具体催化反应条件和转化率结果如表26所示,其余没有提及的步骤或反应条件保持与条件B中一致。
表26
Figure PCTCN2021130282-appb-000031
由表26可知,亚硝化反应体系中乙二醛浓度越高,则亚硝化反应越高所需要的反应时间越短,例如,在用1.3M乙二醛进行核酸保护,并且,亚硝化反应时外源添加0.88M乙二醛的条件下,16℃反应4h,A的转化率即接近完全。
2.7缓冲液对A to I转化率的影响
本实验探究亚硝化反应体系中缓冲液对A to I转化率的影响。简言之,使用不同的酸(例如H 2SO 4,H 3PO 3、TsOH)将上述实验方法2.2亚硝化反应步骤中的反应液(不含MES)调节到相应pH值,考察缓冲液对A to I转化率的影响。具体催化反应条件和转化率结果如表27所示,其余没有提及的步骤或反应条件保持与条件A中一致。
表27
Figure PCTCN2021130282-appb-000032
由表27可知,上述酸性溶液均能促进A to I的转化(例如,转化率均在94%以上)。
2.8 MES浓度和pH值对A to I转化率的影响
本实验探究亚硝化反应体系中MES浓度和pH值对A to I转化率的影响。简言之,调整上述实验方法2.2亚硝化反应步骤中的MES缓冲液的浓度和pH值,考察MES浓度和pH值对A to I转化率的影响。各样品组具体催化反应条件和转化率结果如表28所示,其余没有提及的步骤或反应条件保持与条件A中一致。
表28
Figure PCTCN2021130282-appb-000033
由表28可知,上述浓度和pH值的MES缓冲液体系中,A的转化率均达92%以上。
2.9亚硝化反应乙二醛终浓度对A to I转化率的影响
本实验探究亚硝化反应乙二醛终浓度对A to I转化率的影响。简言之,调整上述实验方法2.1核酸保护和2.2亚硝化反应步骤中外源添加乙二醛的量,从而改变亚硝化反应中乙二醛的终浓度,考察乙二醛终浓度对A to I转化率的影响。具体催化反应条件和转化率结果如表29所示,其余没有提及的步骤或反应条件保持与条件B中一致。
表29
Figure PCTCN2021130282-appb-000034
由表29可知,亚硝化反应体系中乙二醛浓度越高,A的转化率也相应升高。
2.10亚硝化反应温度对A to I转化率的影响
本实验探究亚硝化反应温度对A to I转化率的影响。简言之,调整上述实验方法2.2亚硝化反应步骤中的反应温度和时间,考察亚硝化反应温度对A to I转化率的影响。具体核酸保护和催化反应条件和转化率结果如表30所示,其余没有提及的步骤或反应条件保持与条件C中一致。
表30
Figure PCTCN2021130282-appb-000035
由表30可知,亚硝化反应在16-50℃的温度范围内均可以发生,温度越高所需要的反应时间越短,并且,反应温度的降低和反应时间的缩短有利于G to X转化率的降低。
2.11羰基化合物的种类对A to I转化率的影响
本实验探究羰基化合物的种类对A to I转化率的影响。简言之,将上述实验方法2.2亚硝化反应步骤中的使用的乙二醛替换为不同的种类和浓度的羰基化合物,考察羰基化合物的种类对A to I转化率的影响。具体核酸保护和催化反应条件和转化率结果如表31所示,其余没有提及的步骤或反应条件保持与条件C中一致。
表31
Figure PCTCN2021130282-appb-000036
由表31可知,使用高浓度乙二醛(例如1.3M乙二醛)进行核酸保护后,在亚硝化反应体系中添加不同羰基化合物(例如2-溴丙二醛、三氯乙醛等)均能促进A to I的转化。此外,在反应条件6中,检测到乙二醛催化亚硝化反应过程产生的亚胺中间体,其质谱峰图如图27所示。
2.12硼酸或其盐对A to I转化率的影响
本实验探究亚硝化反应中硼酸或其盐对A to I转化率的影响。简言之,将上述实验方法2.2亚硝化反应步骤中的硼酸替换为不同pH值的硼酸钾,考察硼酸或其盐对A to I转化率的影响。具体催化反应条件转化率结果如表32所示,其余没有提及的步骤或反应条件保持与条件C中一致。
表32
Figure PCTCN2021130282-appb-000037
由表32可知,将亚硝化反应中的硼酸替换为硼酸盐后,A to I的转化率不受影响(均接近完全)。
尽管本申请的具体实施方式已经得到详细的描述,但本领域技术人员将理解:根据 已经公布的所有教导,可以对细节进行各种修改和变动,并且这些改变均在本申请的保护范围之内。本申请的全部分为由所附权利要求及其任何等同物给出。

Claims (13)

  1. 一种检测核酸分子中N6-甲基腺嘌呤的方法,其包括以下步骤:
    (1)对待测核酸分子中鸟嘌呤的氨基进行保护,得到被保护的核酸分子;
    (2)在羰基化合物的存在下,使被保护的核酸分子与亚硝酸盐反应,将待测核酸分子中的腺嘌呤转化为次黄嘌呤;
    (3)任选地,对步骤(2)中所得产物进行脱保护;
    (4)对前一步骤所得产物进行检测。
  2. 权利要求1所述的方法,其中,步骤(1)中,使用第二羰基化合物对待测核酸分子中鸟嘌呤的氨基进行保护;
    优选地,所述羰基化合物源自:
    (i)步骤(1)中添加的第二羰基化合物;
    (ii)步骤(2)中添加的第一羰基化合物;或者,
    (iii)(i)和(ii)的组合;
    优选地,所述第二羰基化合物与第一羰基化合物相同或不相同;更优选地,所述第二羰基化合物与第一羰基化合物相同;
    优选地,所述步骤(2)不包含添加第一羰基化合物的步骤,且所述羰基化合物源自:步骤(1)中添加的第二羰基化合物;或者,步骤(2)包含添加第一羰基化合物的步骤,且,所述第一羰基化合物与所述第二羰基化合物相同或者不同;
    优选地,步骤(3)中,任选地对所述产物中的鸟嘌呤进行脱保护;
    优选地,所述第一羰基化合物与第二羰基化合物各自独立地选自式I所示化合物及其任意组合,
    Figure PCTCN2021130282-appb-100001
    其中R 1为醛基、C 1-3烷基(例如甲基、乙基或正丙基)或卤素取代的C 1-3烷基(例如三氯甲基、三氟甲基),R 2为H、C 1-3烷基(例如甲基、乙基或正丙基)、苯基或 HC(=O)-CH(Br)-;或者
    R 1和R 2与它们所连接的羰基形成环状结构,例如茚三酮;
    优选地,所述第一羰基化合物与第二羰基化合物各自独立地选自:乙二醛、2,3-丁二酮、茚三酮、2-溴丙二醛、丙酮醛、三氯乙醛、苯基乙二醛及其任意组合;
    优选地,所述第一羰基化合物选自:乙二醛、2,3-丁二酮、茚三酮、2-溴丙二醛、丙酮醛、三氯乙醛、苯基乙二醛及其任意组合;
    优选地,所述第一羰基化合物为选自乙二醛、2-溴丙二醛、2,3-丁二酮、苯基乙二醛、三氯乙醛、茚三酮及其任意组合;
    优选地,所述第二羰基化合物选自:乙二醛、茚三酮及其任意组合。
  3. 权利要求2所述的方法,其中,步骤(1)中,使待测核酸分子与第二羰基化合物在第一溶剂中接触;
    优选地,所述第一溶剂为水;
    优选地,步骤(1)中,使待测核酸分子与第二羰基化合物在第一溶剂中,并且在二甲基亚砜(DMSO)或N,N-二甲基甲酰胺(DMF)的存在下接触;
    优选地,步骤(1)中,使待测核酸分子与第二羰基化合物在第一溶剂中,并且在二甲基亚砜(DMSO)和硼酸或其盐(例如硼酸钾)的存在下接触;
    优选地,步骤(1)中,使待测核酸分子与第二羰基化合物在第一溶剂中,并且在N,N-二甲基甲酰胺(DMF)和硼酸或其盐(例如硼酸钾)的存在下接触;
    优选地,所述二甲基亚砜(DMSO)的终浓度为20v/v%~90v/v%,例如约30v/v%、约40v/v%、约50v/v%、约60v/v%、约70v/v%、约80v/v%;
    优选地,所述硼酸或其盐(例如硼酸钾)的终浓度为5mM~150mM(例如5mM~10mM,10mM~50mM,50mM~80mM,80mM~100mM);
    优选地,所述第二羰基化合物的终浓度为大于或等于20mM,优选为20mM~3M(例如20mM~100mM,100mM~500mM,500mM~1000mM,1M~1.5M,1.5~2M,2M~2.6M,例如约0.8M、1.3M、1.8M);
    优选地,步骤(1)中,使待测核酸分子与第二羰基化合物在16-60℃(例如约20℃、约25℃、约30℃、约35℃,约40℃、约45℃、约50℃、约55℃)的温度条件下接 触;
    优选地,步骤(1)中,使待测核酸分子与第二羰基化合物在16-60℃(例如约20℃、约25℃、约30℃、约35℃,约40℃、约45℃、约50℃、约55℃)的温度条件下接触15-60min(例如约20min、约30min、约40min、约50min)。
  4. 权利要求1-3任一项所述的方法,其中该方法还包括:在步骤(1)之前,对待测核酸分子进行预处理的步骤;
    优选地,所述预处理包括对待测核酸分子进行纯化、片段化、变性、或其任意组合;
    优选地,所述预处理包括使用甲酰胺水溶液或碱溶液对待测核酸分子进行变性。
  5. 权利要求1-4任一项所述的方法,其中,在步骤(2)中,在羰基化合物的存在下,使被保护的核酸分子与亚硝酸盐在第二溶剂中反应;
    优选地,所述第二溶剂为水、对甲苯磺酸水溶液、磷酸水溶液、2-(N-吗啉)乙磺酸(MES)缓冲液、醋酸钠缓冲液、3-(N-吗啡啉)丙磺酸(MOPS)缓冲液、哌嗪-1,4-二乙磺酸(PIPES)缓冲液、4-羟乙基哌嗪乙磺酸(HEPPS)缓冲液或三羟甲基氨基甲烷(TRIS)缓冲液或其任意组合;
    优选地,所述第二溶剂为对甲苯磺酸水溶液、磷酸水溶液、2-(N-吗啉)乙磺酸(MES)缓冲液或醋酸钠缓冲液;
    优选地,所述第二溶剂为pH为4.5~6.5(例如5~6)的对甲苯磺酸水溶液;
    优选地,所述第二溶剂为pH为4.5~6.5(例如5~6)的磷酸水溶液;
    优选地,所述第二溶剂为2-(N-吗啉)乙磺酸(MES)缓冲液,其中2-(N-吗啉)乙磺酸(MES)的终浓度为20mM~750mM(例如约40mM、约80mM、约100mM、约150mM、约200mM、约250mM、约500mM);
    优选地,所述第二溶剂为醋酸钠缓冲液,其中醋酸钠的终浓度为300mM~400mM;
    优选地,步骤(2)中,在羰基化合物的存在下,使被保护的核酸分子与亚硝酸盐在第二溶剂中,并且在硼酸或其盐(例如硼酸钾)的存在下反应;
    优选地,所述第二溶剂中羰基化合物总摩尔浓度(终浓度)为大于或等于50mM, 例如大于或等于250mM(例如50-2000mM,50-200mM,200-500mM,500-800mM,800-1200mM,1200-1400mM,1400-1800mM,1800-2000mM,例如105mM);
    优选地,所述第二溶剂含有终浓度为50-2000mM(例如50-200mM,200-500mM,500-800mM,800-1200mM,1200-1400mM,1400-1800mM,1800-2000mM,例如105mM)的乙二醛和终浓度为50-1500mM(例如50-60mM,60-100mM,100-300mM,300-600mM,600-800mM,800-1200mM,1200-1500mM)的丙酮醛;
    优选地,所述第二溶剂含有终浓度为50-2000mM(例如50-200mM,200-500mM,500-800mM,800-1200mM,1200-1400mM,1400-1800mM,1800-2000mM,例如105mM)的乙二醛和终浓度为50-2500mM(例如50-100mM,100-200mM,200-500mM,500-800mM,800-1200mM,1200-1500mM,1500-1800mM,1800-2200mM,2200-2500mM)的2,3-丁二酮;
    优选地,所述第二溶剂含有终浓度为50-2000mM(例如50-200mM,200-500mM,500-800mM,800-1200mM,1200-1400mM,1400-1800mM,1800-2000mM,例如105mM)的乙二醛和终浓度为20-1500mM(例如20-100mM,100-200mM,200-500mM,500-800mM,800-1200mM,1200-1500mM)的茚三酮;
    优选地,所述第二溶剂含有终浓度为50-2000mM(例如50-200mM,200-500mM,500-800mM,800-1200mM,1200-1400mM,1400-1800mM,1800-2000mM,例如105mM)的乙二醛和终浓度为50-1500mM(例如50-100mM,100-200mM,200-400mM,400-500mM,500-800mM,800-1200mM,1200-1500mM)的2-溴丙二醛;
    优选地,所述第二溶剂含有终浓度为50-2000mM(例如50-200mM,200-500mM,500-800mM,800-1200mM,1200-1400mM,1400-1800mM,1800-2000mM,例如105mM)的乙二醛和终浓度为50-1500mM(例如50-100mM,100-400mM,400-500mM,500-800mM,800-1200mM,1200-1500mM)的三氯乙醛;
    优选地,所述第二溶剂含有终浓度为50-2000mM(例如50-200mM,200-500mM,500-800mM,800-1200mM,1200-1400mM,1400-1800mM,1800-2000mM,例如105mM)的乙二醛和终浓度为50-1500mM(例如50-100mM,100-400mM,400-500mM,500-800mM,800-1200mM,1200-1500mM)的苯基乙二醛;
    优选地,所述第二溶剂含有终浓度大于或等于50mM,例如大于或等于250mM (例如50-200mM,200-500mM,500-800mM,800-1200mM,1200-1400mM,1400-1800mM,1800-2000mM,例如105mM)的乙二醛;
    优选地,所述硼酸或其盐(例如硼酸钾)的终浓度为5mM~150mM(例如5mM~10mM,10mM~50mM,50mM~80mM,80mM~100mM);
    优选地,所述亚硝酸盐选自亚硝酸钠、亚硝酸钾以及二者的组合;
    优选地,所述亚硝酸盐的终浓度为0.5M~1.5M,例如0.5M~0.8M,0.8~1.25M。
  6. 权利要求5所述的方法,其中,步骤(2)中,在羰基化合物的存在下,使被保护的核酸分子与亚硝酸盐在第二溶剂中,并且在硼酸或其盐(例如硼酸钾)的存在下反应;
    优选地,所述硼酸或其盐(例如硼酸钾)源自:
    (i)步骤(1)中添加的硼酸或其盐(例如硼酸钾);
    (ii)步骤(2)中添加的硼酸或其盐(例如硼酸钾);
    (iii)步骤(1)之后,步骤(2)之前添加的硼酸或其盐(例如硼酸钾);或者,
    (iv)(i)至(iii)的任意组合;
    优选地,所述步骤(2)不包含添加硼酸或其盐(例如硼酸钾)的步骤;或者,步骤(2)包含添加硼酸或其盐(例如硼酸钾)的步骤,且,步骤(2)添加的硼酸或其盐(例如硼酸钾)与步骤(1)添加的硼酸或其盐(例如硼酸钾)相同或者不同。
  7. 权利要求1-6任一项所述的方法,步骤(2)中,使被保护的核酸分子与亚硝酸盐在12-60℃(例如16-60℃、12-24℃、24-40℃、40-60℃、约16℃、约20℃、约25℃、约30℃、约35℃,约40℃、约45℃、约50℃、约55℃)的温度条件下反应;
    优选地,步骤(2)中,使被保护的核酸分子与亚硝酸盐在12-60℃(例如16-60℃、12-24℃、24-40℃、40-60℃、约16℃、约20℃、约25℃、约30℃、约35℃,约40℃、约45℃、约50℃、约55℃)的温度条件下反应10min~24h(例如30min~24h、10min~20min、20min~1h、1h~5h、5h~10h、10h~24h、约20min、约30min、约40min、约50min、约8h);
    优选地,步骤(2)中,使被保护的核酸分子与亚硝酸盐在12-24℃(例如约16℃) 的温度条件下反应5h~10h(例如约8h);
    优选地,步骤(2)中,使被保护的核酸分子与亚硝酸盐在40-60℃(例如约50℃)的温度条件下反应20min~1h(例如约30min)。
  8. 权利要求1-7任一项所述的方法,所述方法还包括:在步骤(4)之前,对前一步骤中所得产物进行纯化、逆转录和/或扩增的步骤;
    优选地,所述方法还包括:对待测核酸分子进行检测的额外步骤;
    优选地,在步骤(4)中,所述检测包括通过测序或杂交、质谱(例如三重串联四极杆质谱)、酶片段化和/或色谱进行核苷酸组成分析;
    优选地,所述检测包括通过测序进行核苷酸组成分析;
    优选地,所述方法还包括对比步骤(4)的检测结果与所述额外步骤的检测结果(例如,对比步骤(4)中所述产物和所述额外步骤中所述待测核酸分子的序列或核苷酸组成,确定待测核酸分子中N6-甲基腺嘌呤的含量和/或位置信息)。
  9. 权利要求1-8任一项所述的方法,其中,所述待测核酸分子为RNA、DNA或DNA/RNA杂合体;
    优选地,步骤(3)中,在碱性条件下对步骤(2)中所得产物进行脱保护;或者,在含有磷酸盐缓冲液中对步骤(2)中所得产物进行脱保护;
    优选地,步骤(3)中,在碱性条件下,通过热处理对步骤(2)中所得产物进行脱保护;
    优选地,在pH为8~9的含有甲酰胺的醋酸三乙胺(TEAA)缓冲液中或者在pH为7.1~8的含有二甲基亚砜(DMSO)的磷酸盐缓冲液中,通过热处理对步骤(2)中所得产物进行脱保护;
    优选地,在pH为8~9的含有45~50v/v%甲酰胺的醋酸三乙胺(TEAA)缓冲液中或者在pH为7.1~8的含有40~60v/v%二甲基亚砜(DMSO)的磷酸盐缓冲液中,通过热处理对步骤(2)中所得产物进行脱保护;
    优选地,所述热处理是:
    a)将步骤(2)中所得产物在80~95℃的温度条件下处理5~10min,或者,
    b)将步骤(2)中所得产物在60~70℃的温度条件下处理1.5~2.5h(例如约2h);
    优选地,在pH为约8.6的含有约47.5v/v%甲酰胺的醋酸三乙胺(TEAA)缓冲液中,通过a)中所述热处理对步骤(2)中所得产物进行脱保护,或者,
    在pH为约7.4的含有约50v/v%二甲基亚砜(DMSO)的磷酸盐缓冲液中,通过b)中所述热处理对步骤(2)中所得产物进行脱保护。
  10. 权利要求9所述的方法,其中,所述待测核酸分子为RNA或DNA/RNA杂合体;
    优选地,步骤(3)中,在碱性条件下对步骤(2)中所得产物进行脱保护;
    优选地,步骤(3)中,在碱性条件下对步骤(2)中所得产物进行一次脱保护,在磷酸盐缓冲液中对一次脱保护所得产物进行二次脱保护;
    优选地,步骤(3)中,在碱性条件下,通过热处理对步骤(2)中所得产物进行脱保护;
    优选地,步骤(3)中,在pH为8~9的含有甲酰胺的醋酸三乙胺(TEAA)缓冲液中,通过热处理对步骤(2)中所得产物进行一次脱保护;
    优选地,步骤(3)中,在pH为8~9的含有甲酰胺的醋酸三乙胺(TEAA)缓冲液中,通过热处理对步骤(2)中所得产物进行一次脱保护,在pH为7.1~8的含有二甲基亚砜(DMSO)的磷酸盐缓冲液中,通过热处理对一次脱保护所得产物进行二次脱保护;
    优选地,步骤(3)中,在pH为8~9的含有45~50v/v%甲酰胺的醋酸三乙胺(TEAA)缓冲液中,通过热处理对步骤(2)中所得产物进行一次脱保护;
    优选地,步骤(3)中,在pH为8~9的含有45~50v/v%甲酰胺的醋酸三乙胺(TEAA)缓冲液中,通过热处理对步骤(2)中所得产物进行一次脱保护,在pH为7.1~8的含有40~60v/v%二甲基亚砜(DMSO)的磷酸盐缓冲液中,通过热处理对一次脱保护所得产物进行二次脱保护;
    优选地,所述热处理是:
    a)将步骤(2)中所得产物在80~95℃的温度条件下处理5~10min,或者,
    b)将步骤(2)中所得产物在60~70℃的温度条件下处理1.5~2.5h(例如约2h);
    优选地,步骤(3)中,在pH为约8.6的含有约47.5v/v%甲酰胺的醋酸三乙胺(TEAA)缓冲液中,通过a)中所述热处理对步骤(2)中所得产物进行一次脱保护;
    优选地,步骤(3)中,在pH为约8.6的含有约47.5v/v%甲酰胺的醋酸三乙胺(TEAA)缓冲液中,通过a)中所述热处理对步骤(2)中所得产物进行一次脱保护,在pH为约7.4的含有约50v/v%二甲基亚砜(DMSO)的磷酸盐缓冲液中,通过b)中所述热处理对一次脱保护所得产物进行二次脱保护。
  11. 权利要求9所述的方法,其中,所述待测核酸分子为DNA或DNA/RNA杂合体;
    优选地,步骤(3)中,在碱性条件下对步骤(2)中所得产物进行脱保护;
    优选地,步骤(3)中,在碱性条件下,通过热处理对步骤(2)中所得产物进行脱保护;
    优选地,在pH为8~9的含有甲酰胺的醋酸三乙胺(TEAA)缓冲液中,通过热处理对步骤(2)中所得产物进行脱保护;
    优选地,在pH为8~9的含有45~50v/v%甲酰胺的醋酸三乙胺(TEAA)缓冲液中,通过热处理对步骤(2)中所得产物进行脱保护;
    优选地,所述热处理是:将步骤(2)中所得产物在80~95℃的温度条件下处理5~10min;
    优选地,在pH为约8.6的含有约47.5v/v%甲酰胺的醋酸三乙胺(TEAA)缓冲液中,通过a)中所述热处理对步骤(2)中所得产物进行脱保护。
  12. 试剂盒,其包含第一羰基化合物和亚硝酸盐;优选地,所述第一羰基化合物选自式I所示化合物及其任意组合,
    Figure PCTCN2021130282-appb-100002
    其中R 1为醛基、C 1-3烷基(例如甲基、乙基或正丙基)或卤素取代的C 1-3烷基(例如三氯甲基、三氟甲基),R 2为H、C 1-3烷基(例如甲基、乙基或正丙基)、苯基或 HC(=O)-CH(Br)-;或者,
    R 1和R 2与它们所连接的羰基形成环状结构,例如茚三酮;
    优选地,所述第一羰基化合物选自:乙二醛、2,3-丁二酮、茚三酮、2-溴丙二醛、丙酮醛、三氯乙醛、苯基乙二醛及其任意组合;
    优选地,所述第一羰基化合物选自:乙二醛、2,3-丁二酮、茚三酮、2-溴丙二醛、丙酮醛、三氯乙醛、苯基乙二醛及其任意组合;
    优选地,所述第一羰基化合物为选自乙二醛、2-溴丙二醛、2,3-丁二酮、苯基乙二醛、三氯乙醛、茚三酮及其任意组合;
    优选地,所述试剂盒还包含第二羰基化合物;优选地,所述第二羰基化合物选自式I所示化合物及其任意组合,
    Figure PCTCN2021130282-appb-100003
    其中R 1为醛基、C 1-3烷基(例如甲基、乙基或正丙基)或卤素取代的C 1-3烷基(例如三氯甲基、三氟甲基),R 2为H、C 1-3烷基(例如甲基、乙基或正丙基)、苯基或HC(=O)-CH(Br)-;或者,
    R 1和R 2与它们所连接的羰基形成环状结构,例如茚三酮;
    优选地,所述第一羰基化合物与第二羰基化合物相同或不同;更优选地,所述第二羰基化合物与第一羰基化合物相同;
    优选地,所述第二羰基化合物选自:乙二醛、茚三酮及其任意组合;
    优选地,所述亚硝酸盐选自亚硝酸钠、亚硝酸钾以及二者的组合;
    优选地,所述试剂盒还包含硼酸或其盐(例如硼酸钾);
    优选地,所述试剂盒还包含二甲基亚砜(DMSO)、N,N-二甲基甲酰胺(DMF)或其组合;
    优选地,所述试剂盒还包含选自:对甲苯磺酸、磷酸、2-(N-吗啉)乙磺酸(MES)、醋酸钠、3-(N-吗啡啉)丙磺酸(MOPS)、哌嗪-1,4-二乙磺酸(PIPES)、4-羟乙基哌嗪乙磺酸(HEPPS)、三羟甲基氨基甲烷(TRIS)中的至少一种物质;
    优选地,所述试剂盒还包含选自:2-(N-吗啉)乙磺酸(MES)、醋酸钠、3-(N-吗 啡啉)丙磺酸(MOPS)、哌嗪-1,4-二乙磺酸(PIPES)、4-羟乙基哌嗪乙磺酸(HEPPS)、三羟甲基氨基甲烷(TRIS)中的至少一种物质;
    优选地,所述试剂盒还包含选自下列的物质:配制2-(N-吗啉)乙磺酸(MES)缓冲液的物质、配制醋酸钠缓冲液的物质、配制3-(N-吗啡啉)丙磺酸(MOPS)缓冲液的物质、配制哌嗪-1,4-二乙磺酸(PIPES)缓冲液的物质、配制4-羟乙基哌嗪乙磺酸(HEPPS)缓冲液的物质、配制三羟甲基氨基甲烷(TRIS)缓冲液的物质及其任意组合;
    优选地,所述试剂盒还包含2-(N-吗啉)乙磺酸(MES)缓冲液、醋酸钠缓冲液、3-(N-吗啡啉)丙磺酸(MOPS)缓冲液、哌嗪-1,4-二乙磺酸(PIPES)缓冲液、4-羟乙基哌嗪乙磺酸(HEPPS)缓冲液或三羟甲基氨基甲烷(TRIS)缓冲液或其任意组合;
    优选地,所述试剂盒还包含选自下列的物质:配制磷酸盐缓冲液的物质、配制醋酸三乙胺(TEAA)缓冲液的物质及二者的组合;
    优选地,所述试剂盒还包含磷酸盐缓冲液、醋酸三乙胺(TEAA)缓冲液或其组合;
    优选地,所述醋酸三乙胺(TEAA)缓冲液中含有45~50v/v%甲酰胺,pH为8~9;
    优选地,所述磷酸盐缓冲液中含有40~60v/v%二甲基亚砜(DMSO),pH为7.1~8;
    优选地,所述试剂盒还包含质谱检测和/或DNA测序所用试剂;
    优选地,所述试剂盒还包含说明书;优选地,所述说明书记载权利要求1-11任一项所述的方法。
  13. 权利要求12所述的试剂盒在检测核酸分子中N6-甲基腺嘌呤中的用途。
PCT/CN2021/130282 2021-06-11 2021-11-12 检测核酸分子中n6-甲基腺嘌呤的方法及试剂盒 WO2022257354A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP21944432.0A EP4353832A1 (en) 2021-06-11 2021-11-12 Method and kit for detecting n6-methyladenosine in nucleic acid molecules

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110655085.7 2021-06-11
CN202110655085 2021-06-11

Publications (1)

Publication Number Publication Date
WO2022257354A1 true WO2022257354A1 (zh) 2022-12-15

Family

ID=81045694

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/130282 WO2022257354A1 (zh) 2021-06-11 2021-11-12 检测核酸分子中n6-甲基腺嘌呤的方法及试剂盒

Country Status (3)

Country Link
EP (1) EP4353832A1 (zh)
CN (1) CN114317704B (zh)
WO (1) WO2022257354A1 (zh)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117165664B (zh) * 2023-10-24 2024-03-15 河北大学 一种在单碱基水平检测drach基序中腺苷酸第六位氮原子甲基化的方法

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180245128A1 (en) * 2015-08-31 2018-08-30 The University Of Chicago Composition and methods for detecting adenosine modifications
WO2020229831A1 (en) * 2019-05-14 2020-11-19 Nuclera Nucleics Ltd Nucleic acid polymer with amine-masked bases

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2071927A2 (en) * 2006-09-28 2009-06-24 Illumina, Inc. Compositions and methods for nucleotide sequencing
EP3790544A4 (en) * 2018-05-08 2022-04-27 The University of Chicago COMPOSITIONS AND METHODS RELATED TO KETHOXAL DERIVATIVES
CN108822174A (zh) * 2018-08-29 2018-11-16 上海兆维科技发展有限公司 新型核苷修饰物2’-eoe-鸟嘌呤核苷及其制备方法
CN111154837B (zh) * 2019-09-02 2021-05-18 浙江大学 一种全转录组范围单碱基分辨率检测rna n6-甲基腺嘌呤修饰的方法

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180245128A1 (en) * 2015-08-31 2018-08-30 The University Of Chicago Composition and methods for detecting adenosine modifications
WO2020229831A1 (en) * 2019-05-14 2020-11-19 Nuclera Nucleics Ltd Nucleic acid polymer with amine-masked bases

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
CAPITANCHIK CHARLOTTE, TOOLAN-KERR PATRICK, LUSCOMBE NICHOLAS M., ULE JERNEJ: "How Do You Identify m6 A Methylation in Transcriptomes at High Resolution? A Comparison of Recent Datasets", FRONTIERS IN GENETICS, vol. 11, XP093014317, DOI: 10.3389/fgene.2020.00398 *
CHEM. SCI., vol. 12, 2021, pages 606
MEYER KATE D.: "DART-seq: an antibody-free method for global mA detection", NATURE METHODS, NATURE PUBLISHING GROUP US, NEW YORK, vol. 16, no. 12, 23 September 2019 (2019-09-23), New York, pages 1275 - 1280, XP036929814, ISSN: 1548-7091, DOI: 10.1038/s41592-019-0570-0 *
no. 7647-01-0
NUCLEIC ACIDS RES., vol. 49, no. 4, 26 February 2021 (2021-02-26), pages e23
WERNER STEPHAN, GALLIOT AURELLIA, PICHOT FLORIAN, KEMMER THOMAS, MARCHAND VIRGINIE, SEDNEV MAKSIM V, LENCE TINA, ROIGNANT JEAN-YVE: "NOseq: amplicon sequencing evaluation method for RNA m6A sites after chemical deamination", NUCLEIC ACIDS RESEARCH, OXFORD UNIVERSITY PRESS, GB, vol. 49, no. 4, 26 February 2021 (2021-02-26), GB , pages e23 - e23, XP093014315, ISSN: 0305-1048, DOI: 10.1093/nar/gkaa1173 *

Also Published As

Publication number Publication date
EP4353832A1 (en) 2024-04-17
CN114317704A (zh) 2022-04-12
CN114317704B (zh) 2022-09-02

Similar Documents

Publication Publication Date Title
Vaisvila et al. Enzymatic methyl sequencing detects DNA methylation at single-base resolution from picograms of DNA
JP7206284B2 (ja) Dna、特にセルフリーdnaのエピジェネティック解析の方法
CN110699426B (zh) 基因目标区域富集方法及试剂盒
US8329398B2 (en) Universal amplification of fragmented RNA
US20230056763A1 (en) Methods of targeted sequencing
JP2011500092A (ja) 非ランダムプライマーを用いたcDNA合成の方法
CN112251821A (zh) 一种快速高效的构建二代测序文库的试剂盒
EP3198063A1 (en) Rna stitch sequencing: an assay for direct mapping of rna : rna interactions in cells
WO2015196120A1 (en) Methods and compositions for detecting polynucleotides and fragments thereof
WO2022257354A1 (zh) 检测核酸分子中n6-甲基腺嘌呤的方法及试剂盒
WO2021051665A1 (zh) 基因目标区域的富集方法及体系
CN114807300A (zh) 单引物多重扩增技术在检测片段化稀有特征核酸分子中的应用及试剂盒
CN114787385A (zh) 用于检测核酸修饰的方法和系统
CN114807324A (zh) 单引物扩增建库技术在检测片段化稀有dna分子突变中的应用及试剂盒
CN113025689A (zh) 一种携带修饰的小rna的建库方法及其应用
Ma et al. An enzyme-mediated bioorthogonal labeling method for genome-wide mapping of 5-hydroxymethyluracil
WO2024120262A1 (zh) 一种可剪切环状引物及其试剂盒与扩增方法
CN112301118B (zh) 一种全转录组范围同时获取rna丰度及活性rna聚合酶位点的方法及试剂盒
US20240158833A1 (en) Compositions and Methods for Labeling Modified Nucleotides in Nucleic Acids
Zhang Transcriptomic RNA Structure Mapping Unveils the Structure Dynamics Induced by DHX36 Binding
CN116574732A (zh) 一种基于修饰crRNA的CRISPR-Cas检测体系
CN117626451A (zh) 一种高效rna多重靶向建库方法及试剂盒
CN115896100A (zh) 一种基因检测分子及应用

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21944432

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2021944432

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2021944432

Country of ref document: EP

Effective date: 20240111