WO2013075629A1 - 一种检测核酸羟甲基化修饰的方法及其应用 - Google Patents
一种检测核酸羟甲基化修饰的方法及其应用 Download PDFInfo
- Publication number
- WO2013075629A1 WO2013075629A1 PCT/CN2012/084964 CN2012084964W WO2013075629A1 WO 2013075629 A1 WO2013075629 A1 WO 2013075629A1 CN 2012084964 W CN2012084964 W CN 2012084964W WO 2013075629 A1 WO2013075629 A1 WO 2013075629A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- linker
- nucleic acid
- seq
- sequencing
- control
- Prior art date
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
- C12Q1/6874—Methods for sequencing involving nucleic acid arrays, e.g. sequencing by hybridisation
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
- C12Q1/6827—Hybridisation assays for detection of mutation or polymorphism
- C12Q1/683—Hybridisation assays for detection of mutation or polymorphism involving restriction enzymes, e.g. restriction fragment length polymorphism [RFLP]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/154—Methylation markers
Definitions
- the invention belongs to the technical field of genetic engineering, and in particular relates to a method for detecting methylolation modification of a nucleic acid and an application thereof. Background technique
- 5-hydroxymethylcytosine was first discovered in cytosine in phage in 1952 and has recently been found in mammalian genomes such as mouse neurons and embryonic stem cells. A large number of studies are currently focused on revealing the possible role of 5hmC in genomic organization and stem cell differentiation, and demonstrated that the TET protease family can convert 5mC to 5hmC by oxidation.
- Another object of the invention is to provide an application of the method.
- a method of detecting a methylolation modification of a nucleic acid comprising the steps of: (1) performing a glycosylation treatment on the nucleic acid to obtain a conversion of a methylolated base to a glycosylated nucleic acid of a glycosylmethylation base;
- the first control nucleic acid fragment, the sample nucleic acid fragment and the second control nucleic acid fragment obtained in the step (2) are respectively connected to the biotin-labeled linker to obtain a first control ligation product having a biotin linker, a sample ligation product, and a second control ligation product;
- the first control final digestion product obtained in the step (6), the final digestion product of the sample, and the final digestion product of the second control, are ligated to the sequencing adaptor, and the sequencing linker ligation product is amplified to obtain the first control sequencing library.
- the nucleic acid of step (1) is genomic DNA.
- the nucleic acid of step (1) is derived from an animal, a plant, a bacterium, a fungus, a virus, or a combination thereof.
- the glycosylation treatment according to the step (1) is: the nucleic acid is transferred to the 5-hydroxyl of the nucleic acid by using the uracil diphosphate glucose as a substrate under the action of the T4-BGT enzyme. On methylcytosine (5-hmC), ⁇ -glucosyl-5-hydroxymethylcytosine (5 gmC) was formed.
- the first restriction enzyme described in the step (2) is ⁇ /.
- the second restriction enzyme described in the step (2) is H/3 ⁇ 4J/.
- sequence of the biotin-labeled linker of step (3) is as shown in SEQ ID NO: 1 and SEQ ID NO: 2.
- the step (4) further comprises: obtaining the first control Malll digestion product, the sample Malll digestion product, and the second control by using a streptavidin magnetic bead to capture a fragment produced by Malll digestion.
- the product was digested with Malll to obtain a nucleic acid fragment having a biotin-labeled linker at one end and a sticky end at one end.
- the second linker of step (5) is formed by pairing two oligonucleotide chains, the two The nucleotide strand sequences are SEQ ID NO: 3 and SEQ ID NO: 4, respectively; or the two oligonucleotide strands are SEQ ID NO: 5 and SEQ ID NO: 6, respectively; or the two oligonucleotides
- the glucoside chains are SEQ ID NO: 7 and SEQ ID NO: 8, respectively.
- the specific restriction enzyme described in the step (6) is Mmel or Ecop5I.
- the step (6) is digested with Mmel to obtain a 20 bp-length fragment having a second linker at one end and a sticky end at the other end.
- the step (6) is digested with Ecopl5I to obtain a fragment of 25 bp in length having a second linker at one end and a sticky end at the other end.
- the sequencing linker of step (7) is paired by two oligonucleotide strands, and the two oligonucleotide strand sequences are SEQ ID NO: 9 and SEQ ID NO, respectively. : 10.
- step (8) is selected from any of the following sequencing platforms:
- Illumina Solaxa Roche 454 ABI SOLID Helicos TRUE single molecule sequencing, PacBio single molecule real time sequencing, Oxford Nanopore nanopore single molecule sequencing.
- the analyzing the sequence information described in step (8) comprises the steps of:
- step (iii) calculating the methylation and hydroxylation levels of each CCGG site based on the normalized data obtained in step (ii);
- the restriction endonuclease comprises Mspl, HpaII, Mmel and Main;
- the restriction endonuclease comprises Mspl, HpaII, Ecopl5I and Main;
- the biotin-labeled linker preferably consisting of two oligonucleotide strand pairs, for example, the two oligonucleotide strand sequences are respectively SEQ ID NO : 1 and SEQ ID NO: 2;
- the second linker being composed of two oligonucleotide strand pairs, preferably, the two oligonucleotide strand sequences are SEQ ID NO: 3 And SEQ ID NO: 4; or the two oligonucleotide strand sequences are SEQ ID NO: 5 and SEQ ID NO: 6; or the two oligonucleotide strand sequences are SEQ ID NO: 7 and SEQ ID NO: 8;
- the sequencing linker preferably consisting of a pair of two oligonucleotide strands,
- the two oligonucleotide strand sequences are SEQ ID NO: 9 and SEQ ID NO: 10.
- the kit further comprises: a reagent for performing magnetic bead capture, an reagent for nucleic acid purification, or a combination thereof.
- Figure 1 shows a method for detecting a methylolation modification in a preferred embodiment of the present invention.
- Figure 2 shows that different libraries were tested, and the three different libraries that were finally ligated with linker N and P7 were amplified by PCR, and the fragment size was 96 bp, which was consistent with the theoretical size.
- Figure 2 (a) shows the stem cell h9 genome via T4. -BGT glycosylation modification, Mspl digestion fragment fragment distribution range;
- Figure 2 (b) is the stem cell h9 genome directly through the Mspl digestion library fragment distribution;
- Figure 2 (c) is the stem cell h9 genome directly through the Hpall enzyme library Fragment distribution range.
- Figure 3 shows the overall distribution trend of methylation and methylolation levels of CCGG sites in the sample.
- the abscissa is the modification level, and the ordinate is the corresponding modification level.
- the number of CCGG sites at the modification level is at the total number of sites.
- Figure 4 shows the level of methylation and methylolation modification of the detected CCGG site on each chromatin.
- Figure 5 shows the results of a comparison of methylation and methylolation modification data with bisulfite analysis data.
- a detection method for methylation modification and methylolation modification in nucleic acids specifically, including the steps of: glycosylation modification of nucleic acid and Mspl digestion
- the digested fragments were ligated to the biotin-labeled linker at the two ends and subjected to Malll digestion; and then captured by streptavidin magnetic beads, all of the captured one ends were linked to the biotin linker, and the other end was highlighted with four bases of CATG.
- the sequence can indicate the modification status information of the adjacent CCGG site; by ligating a linker containing the Mmel or Ecopl5I cleavage site at the cohesive end of the CATG and cleaving with the corresponding enzyme, the short sequence fragment generated by Bj can be represented
- the modification information of the CCGG site is obtained; after sequence comparison, the methylation modification and the methylolation modification information can be obtained.
- the term “containing” includes “comprise”, “consisting essentially of” and “consisting of.” As used herein, the terms “above” and “below” include the number, for example “80% or more” means ⁇ 80%, and “2% or less” means ⁇ 2%.
- 5-Hydroxymethylpyrimidine is a modified exact group, which is produced by the oxidation of 5-methylcytosine (5-ffiC) by the TET family of enzymes. The absorption and chromatogram of its purple buds are expressed with cytosines. 5-methylcytosine can be present at low levels in a variety of cell types in succulent animals.
- 5hmC is also present in different genomes or in different cells or tissues.
- Immunoassay found that the percentage of 5hmC in brain, liver, kidney and colorectal tissues was relatively high, 0.40-0.65%; while in lung tissue, the content was relatively low, 0.18%; in the heart, breast and placenta The content is extremely low, only 0.05-0.06%, which is 0.46-0.57% relative to normal colorectal tissue, and the content of cancerous colorectal tissue is only 0.02-0.06%.
- 5hmC is mainly concentrated in the vicinity of exons and transcription initiation sites, especially in the promoter containing histone H3 lysine 27 trimethylation (H3K27me3) and histone H3 lysine 4 trimethylation (H3K4me3).
- H3K27me3 histone H3 lysine 27 trimethylation
- H3K4me3 histone H3 lysine 4 trimethylation
- T4 phage ⁇ -glucosyltransferase T4-BGT
- T4-BGT ⁇ 4 phage ⁇ -glucosyltransferase
- T4-BGT can efficiently transfer the glucose unit of UDP-Glucose to the 5-hydroxymethylcytosine residue of double-stranded DNA to form ⁇ - Glucosyl-5-hydroxymethylcytosine (5 gmC), while 5 gmC cannot be cleaved by Mspl.
- the methylolation modification of a specific single CCGG site can be quantitatively detected by PCR semi-quantitative or Q-PCR.
- the term "primer” refers to a generic term for an oligonucleotide that is complementary to a template and which synthesizes a DNA strand complementary to a template in the action of a DNA polymerase.
- the primers may be natural RNA, DNA, and may contain any form of natural nucleotides, and the primers may even contain non-natural nucleotides such as LNA or ZNA.
- the primer is “substantially” (or “substantially") complementary to a particular sequence on a strand on the template. The primer must be sufficiently complementary to a strand on the template to initiate extension, but the sequence of the primer does not have to be fully complementary to the sequence of the template.
- primers that are not complementary to the template are added to the 5' end of the primer complementary to the template at a 3' end, such primers are still substantially complementary to the template.
- primers that are not fully complementary can also form a primer-template complex with the template for amplification.
- the "re-sequencing" of the genome enables humans to detect abnormal changes in disease-associated genes as early as possible, and to conduct in-depth research on the diagnosis and treatment of individual diseases.
- Those skilled in the art can generally perform high-throughput sequencing using a variety of second-generation sequencing platforms: 454 FLX (Roche), Solexa Genome Analyzer (Illumina), and SOLID from Applied Biosystems.
- the common feature of these platforms is the extremely high sequencing throughput. Compared to the 96 sequencing capillary sequencing of traditional sequencing, high-throughput sequencing can read 400,000 to 4 million sequences in one experiment. According to the platform, the read length is from 25bp.
- Solexa high-throughput sequencing includes two steps: DNA cluster formation and on-machine sequencing: a mixture of PCR amplification products is hybridized with a sequencing probe immobilized on a solid phase carrier, and subjected to solid phase bridge PCR amplification to form a sequencing. cluster; The sequencing cluster is sequenced by "edge synthesis-edge sequencing" to obtain the nucleotide sequence of the disease-associated nucleic acid molecule in the sample.
- the DNA cluster is formed by using a flow cell with a single-stranded primer attached to the surface, and the DNA fragment of the single-stranded state is fixed by the principle of complementary pairing with the linker on the surface of the chip.
- the fixed single-stranded DNA becomes double-stranded DNA by amplification reaction, and the double strand is denatured into a single strand, one end of which is anchored on the sequencing chip, and the other end is randomly complementary to another primer in the vicinity to be anchored.
- the DNA clusters were sequenced on the Solexa sequencer while sequencing.
- the four bases were labeled with different fluorescence, and each base was blocked by a protected base. Only one base could be added to a single reaction. After reading the color of the reaction, the protection group is removed, and the next reaction can be continued. Thus, the exact sequence of the base is obtained.
- the IndexC tag or barcode is used to distinguish the samples, and after the conventional sequencing is completed, an additional 7 cycles of sequencing for the Index portion can be performed. Up to 1 sequence can be sequenced by Index identification. Twelve different samples were distinguished in the ramp. Detection method
- the present invention provides a method for accurately detecting a methylol modification site.
- the method comprises the following steps (see Figure 1):
- T4-BGT T4 ⁇ -glucosyltransferase
- the DNA of the glycosylation-modified group is transferred to the 5-hydroxymethylcytosine residue of double-stranded DNA by the action of T4-BGT enzyme using UDP-Glucose as a substrate. , ⁇ -glucosyl-5-hydroxymethylcytosine (5 gmC) was formed.
- the reaction is independent of the DNA sequence, so all 5-hmCs can be glycosylated, while unmodified cytosine residues and methylated 5-cC residues are not glycosylated; The group did not add T4-BGT and did not undergo glycosylation modification.
- the genomic DNA may be derived from genomic DNA extracted from animal tissues, cellular genomic DNA, and the like, and may be detected by the technique as long as the CCGG site in the genomic sequence has a C h CGG methylolation modification.
- Mspl and Hpall have different sensitivity to methylation: Hpall can only recognize and cleave unmodified CCGG sites; Mspl can recognize and cleave various modified CCGG sites (CCGG, C m CGG and PC h CGG; In the present application In the DNA sequence, the superscript m indicates methylation, the superscript h indicates methylolation, but the C g CGG site cannot be cleaved.
- the end of each set of DNA fragments contains different modification information:
- the end of the DNA fragment modified by glycosylation and Mspl contains the information of CCGG and C m CGG in the genome, and the DNA fragment directly digested by Hpall The end contains only the information of CCGG in the genome, and the DNA fragment of the control group directly digested by Mspl contains all the information of CCGG, C m CGG and C h CGG in the genome.
- Biotin-linker (biotin-labeled linker): Biotin-linker is attached to the biotin-labeled linker at both ends of the differently processed and digested DNA fragments by DNA ligase.
- Streptavidin magnetic bead capture using M-280 chain affinitymycin-conjugated magnetic beads to capture a DNA fragment that is ligated to the biotin-labeled linker at one end and a sticky base at the other end of the 4 base (CATG) wash DNA sequences with sticky ends at both ends were removed, and discarding these sequences had no effect on subsequent analysis.
- M-280 chain affinitymycin-conjugated magnetic beads to capture a DNA fragment that is ligated to the biotin-labeled linker at one end and a sticky base at the other end of the 4 base (CATG) wash DNA sequences with sticky ends at both ends were removed, and discarding these sequences had no effect on subsequent analysis.
- Linker N Under the action of DNA ligase, the DNA fragment captured on the streptavidin-coupled magnetic beads and the linker containing the Mmel restriction endonuclease recognition site (Linker N) Ligation, the resulting DNA fragment is ligated to the magnetic beads by the affinity of biotin and streptavidin, and the other end is linked to a Link N containing a Mmel cleavage site.
- the recognition site for Mmel is 5' TCCRAC3', where R is base or 0. In another preferred embodiment, it is equally feasible to deform the Mmel cleavage site in the Linker N to the Ecop5I cleavage site.
- Mmel or Ecop5I digestion The restriction enzyme Mmel containing the restriction site contained in Linker N was digested to generate a 20 bp insert with Linker N attached to one end and the other end. The sticky ends of two arbitrary bases are highlighted, and a corresponding fragment bound to the magnetic beads is generated, and each of the fragments linked to the Linker N is generated to represent the modification information on the adjacent CCGG sites.
- the restriction endonuclease Ecop5I containing the restriction site contained in LinkerN is digested to generate a 25 bp (Ecopl5I) insert having the Linker N at its end, and the other One end is a sticky end protruding from two arbitrary bases, and a corresponding fragment bound to the magnetic beads is generated, and each of the fragments linked to the LinkerN is generated to represent the modification information on the adjacent CCGG sites.
- PCR amplification and purification PCR amplification of LinkerN and P7 linker sequences for universal amplification, amplification products
- the 6% non-denaturing PAGE gel was recovered and purified, and the recovered product was subjected to Agilent 2100 fragment size detection and Q-PCR quantitative analysis, and sequence analysis was performed on a Hiseq2000 sequencer.
- Sequencing and data analysis on the machine After the library is tested, the sequence analysis will be performed on the Hiseq2000 sequencer according to the read length of 50 bases at the single end. After normalizing the sequencing data, compare 20 bp corresponding to each CCGG site in different libraries. The number of short sequences sequenced to obtain information on the methylation and methylolation levels of each site.
- the analytical comparison of the sequence information is obtained by a method comprising the steps of: (i) filtering the original reads of each library obtained after sequencing to obtain high quality fragment information; The simulation is performed to obtain a virtual library consisting of theoretically digested fragments; (ii) the high-quality fragment information obtained by step ⁇ is compared with the virtual library, and the statistical data is normalized to obtain the sequencing depth of the three libraries. Data; methylation and hydroxylation levels of each CCGG site according to the normalized data obtained in step (ii); statistical samples of methylation and hydroxylation levels of each CCGG site obtained according to step (iii) The level of overall methylation and hydroxylation and the level of methylation and hydroxylation modification on chromatin.
- the filter conditions include: original library sequence information minus linker sequence information; original library sequence information minus 10% of the number of bases exceeds 10% of the total number of bases; original library sequence information minus base quality value is low Sequence information at 20 bases exceeding 10% of the total number of bases.
- Normalization includes the steps of: sorting each library according to the depth of the CCGG site, obtaining a ranking value in each library for each CCGG site; obtaining a ranking value for each CCGG site in each column, calculating each The variance of the three sorted values of one locus, the n loops remove the points with larger variance, and the last m loci as the normalized reference line, m, n are positive integers; in another preferred example, m The value range is 5000-15000, n>4 ; the library is normalized according to the proportional relationship between the total sequencing depth of the m sorted stable points in the library. Kit
- the invention also provides a kit for accurately detecting genomic methylolation modification, the kit comprising:
- the restriction endonuclease comprises Mspl, HpalK Mmel and P Main; or comprises Mspl, Hpall, Ecopl5I and Malll;
- the biotin-labeled linker preferably consisting of two oligonucleotide strand pairs, for example, the two oligonucleotide strand sequences are respectively SEQ ID NO : 1 and SEQ ID NO: 2;
- the second linker preferably consisting of two oligonucleotide strand pairs, for example, the two oligonucleotide strand sequences are SEQ ID NO: 3 and SEQ ID NO: 4; or the two oligonucleotide strand sequences are SEQ ID NO: 5 and SEQ ID NO: 6; or the two oligonucleotide strand sequences are SEQ ID NO: 7 and SEQ ID NO: 8;
- the sequencing linker preferably consisting of two oligonucleotide strand pairs, for example, the two oligonucleotide strand sequences are SEQ ID NO: 9 and SEQ ID NO: 10;
- the kit further comprises: a reagent for performing magnetic bead capture, an reagent for nucleic acid purification, or a combination thereof.
- the main advantages of the invention include: (1) The method of the present invention is a method for detecting hydroxymethyl modification by single-base resolution in combination with a high-throughput sequencing method in a genome-wide range, and can simultaneously detect a certain CCGG site at a single base resolution. Basic modification state
- the technique adopted by the present invention indirectly reflects the modification state of each site by the sequence tag, and only one end of sequencing, the amount of sequence data is greatly reduced, and the cost is greatly reduced.
- the invention is further illustrated below in conjunction with specific embodiments. It is to be understood that the examples are merely illustrative of the invention and are not intended to limit the scope of the invention.
- the experimental methods in the following examples which do not specify the specific conditions are usually carried out according to the conditions described in conventional conditions such as Sambrook et al., Molecular Cloning: Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989), or according to the manufacturer. The suggested conditions. Main instruments and reagents
- NanoDrop 1000 DNA concentration detector
- Thermomixer (heating and mixing instrument) Thermomixer comfort Eppendorf
- Table 2 The main primer sequences and names used in the examples are listed in Table 3.
- SEQ ID NO: 3 and P SEQ ID NO: 4 is a Linker N sequence with a Mmel recognition site
- SEQ ID NO 5 and SEQ ID NO: 6, and SEQ ID NO: 7 and SEQ ID NO: 8 are Linker N sequences with an Ecopl5I recognition site.
- Example 1 Genomic DNA glycosylation modification
- the genomic DNA of the h9 cell line of 1 ⁇ ⁇ was separately subjected to glycosylation modification and control reaction:
- the reaction system shown in Table 4 was placed in a 1.5 ml centrifuge tube, respectively.
- glycosylation treatment and the control DNA were separately digested with Mspl.
- the system shown in Table 5 was placed in a 1.5 ml centrifuge tube.
- the enzyme-cut DNA was prepared in a 1.5 ml centrifuge tube, and the reaction system is shown in Table 7.
- reaction solution was placed on a Thermomixer (Eppendorf), reacted at 37 ° C for 1 hour and 10 minutes, and resuspended once every 10 minutes.
- Thermomixer Eppendorf
- the DNA recovery product obtained in Example 9 was prepared in accordance with Table 11 to form a linking reaction system.
- the library was amplified by using 5 ⁇ l of the reaction product of Example 9 as a template, and the amplification system is shown in Table 12.
- the gel fragment was selected to be about 86-90 bp in size, and the target strip was placed in a 0.5 ml centrifuge tube with a 2 ml centrifuge tube (there were 6 small holes in the bottom of the 0.5 ml centrifuge tube). This was placed in a centrifuge at 14,000 rpm, centrifuged for 2 min, and the gel was pulverized in a 2 ml centrifuge tube.
- the Agilent 2100 Bioanalyzer was used to detect the h9 genome after T4-BGT glycosylation modification and then Mspl-digested library and h9 genome were directly subjected to Mspl digestion or Hpall digestion.
- the library results were as follows: Figure 2 shows Different libraries were tested, and the three different libraries that were finally ligated with linker N and P7 were amplified by PCR, and the fragment size was 96 bp, which was consistent with the theoretical size.
- Figure 2 (a) shows the stem cell h9 genome via T4-BGT sugar.
- sequence analysis was performed on a Hiseq2000 sequencer according to a single-end 50 base read length. After normalization of the sequence data, information on the methylation and methylolation levels of each site was obtained by comparing the number of sequencing sequences of 20 bp short sequences corresponding to each CCGG site in three different libraries. The specific steps are as follows:
- the original sequence information of the built-in library fragments is obtained, and the sequence of the linker in each original sequence is removed according to the sequence information of the sequenced joints of the database, and the original sequence is subjected to mass filtering, and the low frequency is removed.
- the original sequence of mass, the filter conditions are: the number of N bases in the sequence exceeds 10% of the total number of bases, or the number of bases whose base mass value is less than 20 exceeds 10% of the total number of bases, such a sequence will Filtered 3)
- the sequence of the human genome hgl9 was subjected to computer simulation under the experimental protocol, and the theoretical digestion fragment was obtained to form a virtual library; and the sequencing sequence obtained by the previous step was compared with the virtual library, which was not allowed. Mismatch; after the comparison is over, the results are compared against the results;
- Figure 3 shows the overall distribution trend of methylation and methylolation levels of CCGG sites in the sample.
- the abscissa is the modification level, and the ordinate is the corresponding modification level.
- the number of CCGG sites at the modification level is in the total number of sites. Density;
- Figure 3 shows that the methylation modification detected by the scheme of the present invention has two trends of hypomethylation and hypermethylation modification, and the methylolation modification is only a low level of modification.
- Figure 4 shows the results of methylation and methylolation at each chromatin.
- the level of methylation is between 60% and 80%, mainly around 70%.
- the previously demonstrated methylation modification is approximately 70% identical to the human genomic CG site.
- the inventors detected that in the human embryonic stem cell h9, the methylolation modification was at a low level, less than 20%, consistent with the low level of methylolation modification demonstrated by the current study, indicating the detection of the present invention.
- the technology is very reliable.
- the inventors downloaded published h9 cell genomic bisulfite sequencing data, comparing bisulfite sequencing with the enzymatic methylation and methylolation of the present invention. The difference in sequencing.
- Figure 5 shows the results of the consistency of methylation and methylolation modification data with bisulfite analysis data.
- ⁇ 0.25 difference value 87.9% of the methylation or methylolation modification sites detected by the two methods were consistent and the agreement was high.
- Sites with very small differences may be due to factors such as bisulfite conversion efficiency, cell state differences, and enzymatic cleavage efficiency, but this does not affect the holistic application of the technology. Differences can be accepted.
- the present embodiment provides a kit for accurately detecting nucleic acid methylolation modification in a sample, comprising the components: (1) a first container and an agent for performing 5hmC glycosylation modification in the container;
- NEB For high-throughput detection of genomic methylation of CCGG loci in the whole genome, NEB has designed a strategy with the following specific ideas:
- the whole genome is digested with Mspl, so that all CCGG sites in the genome can be excised (including methylation and methylolation modification sites) based on 100% enzymatic cleavage efficiency;
- the digested fragment is dCTP as a substrate, and a Klenow fragment is formed to form a sticky end of a base C protruding 5';
- a 4% acrylamide gel recovers a Klenow fragment-repaired DNA fragment of 40-300 bp in length
- the fragment is ligated to 5' to highlight the double link of the base G (the linker can mediate subsequent PCR amplification and sequencing);
- the recovered fragment of the adaptor is modified by BGT glycosylation, and if the CCGG site of the original sequence of the genome contains a methylolation modification, 5gmC is formed;
- the glycosylation modified product is further digested with Mspl. If the CCGG site is modified by hydroxymethylation, the shellfish head will not be cut;
- the seventh step 1/3 of the above products are subjected to PCR amplification and sequencing, and only the sequence having a linker at both ends, that is, a sequence having a methylolation modification at both ends can be detected; the remaining 2/3 of the product is divided into two For each part, 1/3 of each is not directly subjected to PCR amplification, and under the action of dCTP substrate, the Klenow fragment is again subjected to end repair to form a sticky end product protruding one base C at one end or both ends, and then Another linker is ligated under the action of a ligase, and one of the ligation products is directly subjected to PCR amplification and sequencing.
- the other was digested with Hpall and then subjected to end-repair and linker ligation, PCR amplification and sequencing.
- the CCGG sites at both ends of the first sequence detected were methylolated, and one end of the sequence detected in the second group was methylolated, and the other end was methylated or not.
- One end of the sequence detected by the panel was a methylolation modification, and the other end was an unmodified CCGG site.
- NEB's technology has made great progress in detecting the distribution of 5hmC in the genome, it also has one.
- the technique can only use dCTP as a substrate for end-repair.
- dCTP a substrate for end-repair.
- the range of fragments to be analyzed by this technique is selected by the gelatin (40-300 bp), so that the modification information of the sites outside the fragments is not obtained. That is, the detection site is incomplete and the amount obtained is small.
- the inventors designed a set of biotin-containing linkers and Mmel-containing cleavage sites by the principle of glycosylation of 5hmC, which could not be cleaved by Mspl restriction endonuclease, and 5mC and 5C could be cleaved by Mspl.
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Immunology (AREA)
- Physics & Mathematics (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- Biophysics (AREA)
- Analytical Chemistry (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Description
Claims
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/360,594 US9567633B2 (en) | 2011-11-24 | 2012-11-21 | Method for detecting hydroxylmethylation modification in nucleic acid and use thereof |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201110376589.1A CN103131754B (zh) | 2011-11-24 | 2011-11-24 | 一种检测核酸羟甲基化修饰的方法及其应用 |
CN201110376589.1 | 2011-11-24 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2013075629A1 true WO2013075629A1 (zh) | 2013-05-30 |
Family
ID=48469116
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2012/084964 WO2013075629A1 (zh) | 2011-11-24 | 2012-11-21 | 一种检测核酸羟甲基化修饰的方法及其应用 |
Country Status (3)
Country | Link |
---|---|
US (1) | US9567633B2 (zh) |
CN (1) | CN103131754B (zh) |
WO (1) | WO2013075629A1 (zh) |
Families Citing this family (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9115386B2 (en) | 2008-09-26 | 2015-08-25 | Children's Medical Center Corporation | Selective oxidation of 5-methylcytosine by TET-family proteins |
ES2872073T3 (es) | 2011-12-13 | 2021-11-02 | Univ Oslo Hf | Procedimientos y kits de detección de estado de metilación |
EP3351644B1 (en) | 2012-11-30 | 2020-01-29 | Cambridge Epigenetix Limited | Oxidising agent for modified nucleotides |
CN103388024B (zh) * | 2013-07-04 | 2015-06-17 | 徐州医学院 | 一种基于桥式pcr检测dna羟甲基化的方法 |
CN103409514B (zh) * | 2013-07-23 | 2016-01-06 | 徐州医学院 | 一种基于芯片的高通量高灵敏检测5-羟甲基化胞嘧啶的方法 |
CN103911439A (zh) * | 2014-03-13 | 2014-07-09 | 眭维国 | 系统性红斑狼疮羟甲基化状态的差异表达基因的分析方法和应用 |
CN104480214B (zh) * | 2014-12-30 | 2018-01-16 | 深圳市易基因科技有限公司 | 羟甲基化暨甲基化长序列标签测序技术 |
EP3355939A4 (en) | 2015-09-30 | 2019-04-17 | Trustees of Boston University | MICROBIAL DEADMAN AND PASS CODE EMERGENCY STOP SWITCHES |
EP3170883B1 (en) * | 2015-11-20 | 2021-08-11 | The Procter & Gamble Company | Cleaning product |
CN105648537B (zh) * | 2016-03-02 | 2018-06-29 | 上海易毕恩基因科技有限公司 | Dna5-甲基胞嘧啶与5-羟甲基胞嘧啶基因图谱测序方法 |
US11162139B2 (en) | 2016-03-02 | 2021-11-02 | Shanghai Epican Genetech Co. Ltd. | Method for genomic profiling of DNA 5-methylcytosine and 5-hydroxymethylcytosine |
AU2017312953A1 (en) * | 2016-08-16 | 2019-01-24 | The Regents Of The University Of California | Method for finding low abundance sequences by hybridization (flash) |
CN107142320B (zh) * | 2017-06-16 | 2021-03-09 | 上海易毕恩基因科技有限公司 | 用于检测肝癌的基因标志物及其用途 |
WO2019051484A1 (en) * | 2017-09-11 | 2019-03-14 | Ludwig Institute For Cancer Research Ltd | SELECTIVE MARKING OF 5-METHYLCYTOSINE IN CIRCULATING ACELLULAR DNA |
CN109097460A (zh) * | 2018-08-30 | 2018-12-28 | 青岛大学 | 一种氧化修饰的含氮碱基的检测方法 |
CN109321647A (zh) * | 2018-10-26 | 2019-02-12 | 苏州森苗生物科技有限公司 | 标记组合物及羟甲基化核酸文库的构建方法 |
CN109811037B (zh) * | 2018-11-15 | 2022-02-11 | 华南师范大学 | 一种dna甲基化过程的连续在线检测方法 |
CN111254144B (zh) * | 2020-01-23 | 2023-04-28 | 南开大学 | 一种分子尺发夹结构及其用于测量单分子磁镊空间尺度准确性的方法 |
CA3187549A1 (en) | 2020-07-30 | 2022-02-03 | Cambridge Epigenetix Limited | Compositions and methods for nucleic acid analysis |
CN115386966B (zh) * | 2022-10-26 | 2023-03-21 | 北京寻因生物科技有限公司 | Dna表观修饰的建库方法、测序方法及其建库试剂盒 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2011025819A1 (en) * | 2009-08-25 | 2011-03-03 | New England Biolabs, Inc. | Detection and quantification of hydroxymethylated nucleotides in a polynucleotide preparation |
WO2011127136A1 (en) * | 2010-04-06 | 2011-10-13 | University Of Chicago | Composition and methods related to modification of 5-hydroxymethylcytosine (5-hmc) |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2289904A1 (en) * | 2009-07-03 | 2011-03-02 | Universita' degli Studi di Milano | Inhibitors of microbial infections |
-
2011
- 2011-11-24 CN CN201110376589.1A patent/CN103131754B/zh active Active
-
2012
- 2012-11-21 WO PCT/CN2012/084964 patent/WO2013075629A1/zh active Application Filing
- 2012-11-21 US US14/360,594 patent/US9567633B2/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2011025819A1 (en) * | 2009-08-25 | 2011-03-03 | New England Biolabs, Inc. | Detection and quantification of hydroxymethylated nucleotides in a polynucleotide preparation |
WO2011127136A1 (en) * | 2010-04-06 | 2011-10-13 | University Of Chicago | Composition and methods related to modification of 5-hydroxymethylcytosine (5-hmc) |
Non-Patent Citations (1)
Title |
---|
WILLIAM A. PASTOR. ET AL.: "Genome-wide mapping of 5-hydroxymethylcytosine in embryonic stem cells", NATURE, vol. 473, no. 7347, 19 May 2011 (2011-05-19), pages 394 - 397 * |
Also Published As
Publication number | Publication date |
---|---|
CN103131754A (zh) | 2013-06-05 |
US20150031552A1 (en) | 2015-01-29 |
CN103131754B (zh) | 2014-07-30 |
US9567633B2 (en) | 2017-02-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2013075629A1 (zh) | 一种检测核酸羟甲基化修饰的方法及其应用 | |
AU2021200391B2 (en) | Differential tagging of RNA for preparation of a cell-free DNA/RNA sequencing library | |
JP6571895B1 (ja) | 核酸プローブ及びゲノム断片検出方法 | |
US20220316010A1 (en) | Methods for copy number determination | |
JP7407227B2 (ja) | 遺伝子アリルを同定するための方法及びプローブ | |
US20230340590A1 (en) | Method for verifying bioassay samples | |
WO2011127136A1 (en) | Composition and methods related to modification of 5-hydroxymethylcytosine (5-hmc) | |
TW201321518A (zh) | 微量核酸樣本的庫製備方法及其應用 | |
CN112105626A (zh) | 用于dna、特别是细胞游离dna的表观遗传学分析的方法 | |
US10465241B2 (en) | High resolution STR analysis using next generation sequencing | |
CA3168144A1 (en) | Methods of targeted sequencing | |
TW202305143A (zh) | 用於準確的平行定量核酸的高靈敏度方法 | |
EP4172357B1 (en) | Methods and compositions for analyzing nucleic acid | |
TW202302861A (zh) | 用於準確的平行定量稀釋或未純化樣品中的核酸的方法 | |
CN105603052B (zh) | 探针及其用途 | |
CN114787385A (zh) | 用于检测核酸修饰的方法和系统 | |
CN114774514B (zh) | 一种适用于高通量靶向基因组甲基化检测的文库构建方法及其试剂盒 | |
US20220307077A1 (en) | Conservative concurrent evaluation of dna modifications | |
EP4215619A1 (en) | Methods for sensitive and accurate parallel quantification of nucleic acids | |
WO2024056008A1 (zh) | 鉴别癌症的甲基化标志物及应用 | |
JP2024035110A (ja) | 変異核酸の正確な並行定量するための高感度方法 | |
CN117915922A (zh) | 与假尿苷和5-羟甲基胞嘧啶的修饰和检测相关的组合物和方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 12851694 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 14360594 Country of ref document: US |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 07/11/2014) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 12851694 Country of ref document: EP Kind code of ref document: A1 |