WO2021120517A1 - FS-Hi-C——一种能够高效捕获冷冻生物样本染色体三维构象的技术 - Google Patents
FS-Hi-C——一种能够高效捕获冷冻生物样本染色体三维构象的技术 Download PDFInfo
- Publication number
- WO2021120517A1 WO2021120517A1 PCT/CN2020/091706 CN2020091706W WO2021120517A1 WO 2021120517 A1 WO2021120517 A1 WO 2021120517A1 CN 2020091706 W CN2020091706 W CN 2020091706W WO 2021120517 A1 WO2021120517 A1 WO 2021120517A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sample
- linking
- cross
- freezing
- dna
- Prior art date
Links
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6806—Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B50/00—Methods of creating libraries, e.g. combinatorial synthesis
- C40B50/06—Biochemical methods, e.g. using enzymes or whole viable microorganisms
Definitions
- the invention relates to the technical field of chromosome conformation capture, in particular to a technology capable of efficiently capturing the three-dimensional conformation of the chromosome of a frozen biological sample.
- a chromosome is a special structure that exists in the cell nucleus. According to the principle of "structure determines function", the three-dimensional (3D) structure of chromosome naturally becomes the basis for understanding its biological functions.
- the 3D structure of the genome plays a role in DNA replication, damage repair, and transcriptional regulation. It plays a vital role.
- unknown target genes of regulatory elements can be identified, and genes regulated by cis-regulatory elements, phenotypes regulated by eQTL, gene expression regulated by TAD recombination and A/Bcompartment conversion can be studied, but are limited For technical reasons, the progress of analyzing the 3D structure of chromosomes has been relatively slow.
- 3C chromosome conformation capture (3C) technology which is a chromosome conformation technology that studies the interaction of chromosomes and proteins. It can analyze the correlation between gene loci that are far away in a straight line. . 3C technology solves the spatial relationship between two DNA fragments that are farther apart in a straight line, that is, the "one-to-one" relationship. This technology has a milestone significance for the research in the field of chromosomal DNA spatial structure, opening a new era in the research of the three-dimensional advanced structure of chromatin, and inspiring the advent of a series of derivative technologies.
- 3C 3C chromosome conformation capture
- Hi-C high-throughput chromosome conformation capture
- Hi-C technology is modified on the basis of 3C technology, adding biotin-labeled nucleotides, so that the small fragments of DNA produced by subsequent shearing can be enriched, and sequencing adapters are added to both ends of these fragments, and then Use the latest sequencing methods to compare and analyze the results.
- This technology has many steps, time-consuming, complicated reagents and consumables, and there is much room for improvement and optimization in the whole process.
- These shortcomings of the existing Hi-C high-throughput sequencing limit the application of this technology in the promotion of functional genomics research, so it is necessary to improve the technology to provide a new high-efficiency, convenient, economical and widely applicable Hi-C high-throughput sequencing library construction method.
- Chromosome interaction plays an important role in genome structure and gene regulation.
- Hi-C is a powerful tool for studying the three-dimensional genome structure of species. However, obtaining natural chromatin conformation requires fresh samples, which hinders the progress of three-dimensional genome research.
- the present invention optimizes and improves the key steps of commonly used Hi-C technology that is cumbersome in sample preparation and not easy to carry out quality control, so as to facilitate standardization and quality control, and can provide more stable and reliable experimental results, and promote Hi-C technology.
- -C technology is further widely used.
- the present invention claims a sample pre-processing method suitable for Hi-C high-throughput sequencing library construction.
- the sample pre-processing method for Hi-C high-throughput sequencing library construction claimed by the present invention may include the following steps: freezing and then cross-linking and fixing the biological sample to be tested.
- the freezing can be -196°C low-temperature treatment.
- the freezing specifically refers to placing the test biological sample in liquid nitrogen for quick freezing, and grinding into powder in the liquid nitrogen.
- the cross-linking and fixing can be cross-linking and fixing using a formaldehyde solution.
- the formaldehyde solution may be a formaldehyde solution with a volume percentage of 37%.
- the final concentration of formaldehyde in the system is 1% by volume.
- the freezing and the cross-linking fixation also includes a process of pre-freezing cross-linking and gradient heating of the test biological sample.
- This method can prevent the three-dimensional structure change during the heating process after the sample is removed from the liquid nitrogen solution. It solves the biological problem of how cryopreserved samples can be used to study the three-dimensional structure of their chromosomes, and can effectively maintain the chromatin conformation in living cells, improving the quality of biological Hi-C data.
- the pre-freezing cross-linking is carried out in a pre-freezing cross-linking liquid
- the pre-freezing cross-linking liquid is an ethanol solution containing 2% water and 0.01% formaldehyde, and% means volume percentage.
- the gradient heating is -90°C for 6 hours, -60°C for 6 hours, -30°C for 6 hours, and 0°C for 6 hours.
- rising from -90°C to -60°C means heating up 5°C per hour
- rising from -60°C to -30°C means heating up 5°C per hour
- rising from -30°C to 0°C is The temperature is increased by 5°C per hour.
- the method may include the steps of sequentially performing the following processing on the test biological sample:
- step (A1) the test biological sample is put into liquid nitrogen for quick freezing, and ground into powder in liquid nitrogen.
- step (A2) the ground powder (A1) is transferred to a pre-freezing cross-linking solution at -90° C.
- the pre-freezing cross-linking solution is an ethanol solution containing 2% water and 0.01% formaldehyde, and% means volume percentage. Sub-content.
- the gradient temperature increase may specifically be -90°C for 6 hours, -60°C for 6 hours, and -30°C for 6 hours, with the purpose of replacing the water in the cells with an ethanol solution.
- rising from -90°C to -60°C is a gradient temperature increase of 5°C per hour
- rising from -60°C to -30°C is a gradient temperature increase of 5°C per hour.
- step (A4) the formula of the NIbuffer is as follows: 20mM Hepes pH 8, 250mM sucrose, 1mM magnesium chloride, 5mM potassium chloride, 40% by volume glycerol, and 0.25% by volume Triton X-100 , 0.1mM PMSF, 0.1% by volume ⁇ -mercaptoethanol, 1/5 volume fraction of cocktail.
- the cocktail is a protease inhibitor.
- the cocktail is a product of MCE Company, the product number is HY-K0010.
- step (A4) after performing the centrifugation (for example, 3000g at 4°C for 30 seconds), the supernatant is removed (the alcohol solution is removed), and the NIbuffer that is pre-cooled (pre-cooled on ice) is added for cleaning. Wash three times, after washing, add the pre-cooled (pre-cooled on ice) NIbuffer in an amount of 20 mL per 1-2g (such as 2g) of the test biological sample (or 2 ⁇ 10 6 Drosophila cell samples). The suspension is then gently shaken for 15 minutes, and then filtered with Miracloth (Millipore, catalog number: 475855) (filter twice), collect the filtrate, and centrifuge (eg, 4°C 3000g for 15 minutes).
- step (A5) a 37% aqueous formaldehyde solution is added to the supernatant obtained by centrifugation in (A4).
- the final concentration of formaldehyde in the system is 1% by volume.
- step (A6) the crosslinking can be terminated by adding a 2.5M glycine solution (the solvent is water).
- the centrifugation may be 1500g centrifugation at 4°C for 5 minutes.
- the present invention claims a Hi-C high-throughput sequencing library construction method.
- the Hi-C high-throughput sequencing library construction method claimed in the present invention may include the following steps: first use the method described above to pre-treat the test biological sample; then subject the processed sample to Hi-C high-throughput Sequencing to build a library.
- the present invention claims a Hi-C high-throughput sequencing method.
- the Hi-C high-throughput sequencing method claimed in the present invention may include the following steps: first pretreating the test biological sample using the method described above; then, subjecting the processed sample to Hi-C high-throughput sequencing Build the library; finally, perform Hi-C high-throughput sequencing.
- the biological sample may be cells or tissues, such as fresh cells or tissues.
- the organism is a plant (such as cotton, soybean or radish) or an animal (Drosophila).
- the biological sample is specifically cotton callus or Drosophila cell line.
- Fig. 1 is a block diagram of the pre-processing flow diagram of the FS-Hi-C high-throughput sequencing library building method suitable for biology of the present invention.
- Figure 2 is a schematic flow chart of the FS-Hi-C method applicable to biology of the present invention.
- Fig. 3 shows the results of agarose gel electrophoresis detection of the genome obtained by the two different pretreatment methods of plant samples in the first batch of experiments (small-scale sequencing) in Example 1.
- 1 and 8 DNA marker
- 2 Genome extracted by traditional method
- 3 Genome extracted by the method of the present invention
- 4 Results of restriction digestion (endonuclease DpnII) by traditional method
- 5 Restriction digestion (endonuclease) of the present invention DpnII) result
- 6 ligation result after restriction digestion with traditional method
- 7 ligation result after restriction digestion with the method of the present invention.
- FIG. 4 is a Hi-C matrix diagram constructed with HiCPro software at a resolution of 1Mb in the first batch of experiments (small-scale sequencing) in Example 1.
- H and MH are traditional methods
- F and CF are methods of the present invention.
- Fig. 5 is a heat map of the chromosome interaction between the frozen replacement Hi-C (FS-Hi-C) and the traditional Hi-C method in the second batch experiment (deep sequencing) of Example 1.
- Figure 6 is a comparison of the distribution patterns of A/Bcompartments in cotton processed by the frozen replacement Hi-C (FS-Hi-C) and the traditional Hi-C method in the second batch of deep sequencing experiments in Example 1.
- Fig. 7 is a comparison of topologically associated domains (TADs) of frozen replacement Hi-C (FS-Hi-C) and traditional Hi-C methods in the second batch of deep sequencing experiments in Example 1.
- Figure 8 shows the changes in the chromatin structure in the nucleus under different treatments in Example 1: the processing methods in the a and bf diagrams; b. The chromatin structure of fresh cotton leaves after high-pressure freezing and freezing replacement treatments (HPF-FS- TEM); c. The chromatin structure of the nucleus after freezing and direct heating after high pressure freezing and freezing replacement; d. The chromatin structure of the nucleus extracted according to the traditional Hi-C method after high pressure freezing and freezing replacement; e Chromatin structure of the nucleus extracted by FS-Hi-C but without gradient heating process after high-pressure freezing and freeze replacement; f. Freeze replacement Hi-C (FS-Hi-C) extracted nucleus under high pressure freezing and freeze replacement After the chromatin structure.
- FS-Hi-C Freeze replacement Hi-C
- Fig. 9 is a heat map of the chromosomal interaction of Drosophila treated with frozen replacement Hi-C (FS-Hi-C) and the traditional Hi-C method in Example 2.
- Fig. 10 is a comparison of the distribution patterns of A/B compartments in Drosophila by freezing replacement Hi-C (FS-Hi-C) and traditional Hi-C methods in Example 2.
- Fig. 11 is a comparison of topologically associated domains (TADs) in Drosophila by freezing replacement Hi-C (FS-Hi-C) and traditional Hi-C methods in Example 2.
- the pre-processing flow chart of the FS-Hi-C high-throughput sequencing library building method suitable for plants of the present invention is shown in Figure 1.
- the test biological samples are sequentially processed as follows: liquid nitrogen quick freezing grinding, pre-freezing cross-linking, gradient Heating, centrifugation and NIBbuffer filtration, 37% formaldehyde cross-linking fixation, centrifugation to recover the nucleus.
- the schematic flow diagram of the FS-Hi-C method of the present invention applicable to biology is shown in FIG. 2.
- Pre-frozen cross-linking liquid an alcohol solution containing 2% water and 0.01% formaldehyde,% means volume percentage.
- NIbuffer 20mM Hepes pH 8; 250mM sucrose; 1mM MgCl 2 ; 5mM KCl; 40% (v/v) glycerol; 0.25% (v/v) Triton X-100; 0.1 mM PMSF; 0.1% (v/v) ⁇ -Mercaptoethanol; 1/5 volume fraction cocktail (protease inhibitor, product of MCE company, catalog number HY-K0010).
- 1.2 ⁇ NEBuffer 2 10 ⁇ NEBuffer 2 is a product of NEB company, the article number: B7002S. Dilute as needed.
- NEB buffer3.1 is a product of NEB company, the article number: B7203.
- 1 ⁇ NEBuffer 2 10 ⁇ NEBuffer 2 is a product of NEB company, the article number: B7002S. Dilute as needed.
- the callus in the process of cotton tissue culture was used as the research object, and two batches of experiments were carried out.
- the first batch performed small-scale sequencing on the Hi-C library of the two methods (10g data volume, each Two small data evaluations) to evaluate the quality of the library, it was found that the Hi-C new method of cotton Hi-C library was far higher than the traditional method in the effective data ratio; in the second batch, we built two methods for cotton
- the libraries (three each) were deeply sequenced, and the genome coverage reached approximately 75X, 50X and 30X (FS-Hi-C); 50X, 50X and 30X (traditional Hi-C). It was found that FS-Hi-C significantly increased the effective data ratio of the Hi-C library, and greatly reduced the ratio of repetitive sequences in the library caused by PCR amplification.
- the sample is fully ground into powder by liquid nitrogen, and transferred to a 50mL centrifuge tube containing 10mL ice-cooled NIbuffer (NIbuffer, 20mM Hepes pH 8, 250mM Sucrose, 1mM MgCl 2 , 5mM KCl, 40% (v/v) ) Glycerol, 0.25% (v/v) Triton X-100, 0.1 mM PMSF, 0.1% (v/v) ⁇ -ME, 1/5 cocktail), shake gently on ice for 15 minutes.
- NIbuffer NIbuffer
- 20mM Hepes pH 8 250mM Sucrose
- 1mM MgCl 2 1mM MgCl 2
- 5mM KCl 40% (v/v) ) Glycerol
- 0.25% (v/v) Triton X-100 0.1 mM PMSF, 0.1% (v/v) ⁇ -ME, 1/5 cocktail
- the suspension was filtered twice through Miracloth (Millipore), the filtrate was collected in a 50 mL centrifuge tube, and centrifuged at 3000 g at 4°C for 15 minutes.
- biotin-labeled DNA 1.5 ⁇ L 10mM dATP, 1.5 ⁇ L 10mM dGTP, 1.5 ⁇ L 10mM dTTP, 37.5 ⁇ L 0.4mM biotin-14-dCTP
- 10 ⁇ L 5U/ ⁇ L Klenow enzyme 10 ⁇ L 5U/ ⁇ L Klenow enzyme
- Figure 3 shows the agarose gel detection results of the genomic samples extracted by the traditional method and the method of the present invention, the samples after the genomic digestion, and the samples after the digestion and ligation. This result shows that the method of the present invention presents a diffuse band after restriction digestion, indicating that the sample pretreatment is qualified and consistent with the treatment result of the traditional method, and subsequent experiments can be performed.
- T4 DNA polymerase uses the exonuclease activity of T4 DNA polymerase to remove uncircularized biotin-labeled DNA. Add 5 ⁇ g of library to a 100 ⁇ L system containing 1 ⁇ L 10mg/mL BSA, 10 ⁇ L 10 ⁇ NEBuffer 2,1 ⁇ L 10mM dATP, 1 ⁇ L 10mM dGTP and 5Units T4 DNA polymerase, and incubate at 16°C for 4 hours.
- step 28 Resuspend the magnetic beads in 300 ⁇ L 2 ⁇ Tween-free buffer (2 ⁇ NTB: 10mM Tris-HCl pH 8.0, 1mM EDTA, 2M NaCl), and add 300 ⁇ L Hi-C DNA obtained in step 28. Incubate for 15 minutes while rotating at room temperature to bind the biotin-labeled Hi-C DNA to the magnetic beads with streptavidin.
- 2 ⁇ Tween-free buffer 2 ⁇ Tween-free buffer
- 2 ⁇ NTB 10mM Tris-HCl pH 8.0, 1mM EDTA, 2M NaCl
- step 28 Use the total amount of DNA calculated in step 28 as the input amount of biotin pull-down, and divide it by 20 to estimate the total amount of Hi-C DNA that is pulled down and can be used for linker ligation and sequencing. Add the linker at a ratio of 6 picomoles of Illumina double-end adapters per microgram of Hi-C DNA that can be used for ligation. Use 1200 units of T4 DNA ligase to ligate the adaptor to the DNA. Incubate for 2 hours at room temperature.
- the purified PCR product is eluted with 50 ⁇ L 1 ⁇ TLE buffer, which is the constructed Hi-C library.
- the high-throughput sequencing data of 4 Hi-C libraries obtained by the traditional method and the method of the present invention were evaluated.
- the quality control results generated by the recognized software HiCPro are shown in Table 1 (small-scale sequencing).
- the results show that, compared with the traditional method, the logarithmic ratio of the high-quality Reads obtained by the present invention is higher than that of the traditional method; the ratio of Reads contaminated by the joint of the present invention is lower than that of the traditional method; both ends of the method of the invention are not compared
- the logarithm ratio of Reads to the genome is lower than that of the traditional method; the method of the present invention removes the Reads pairs of repeated sequences generated in the PCR process, and the ratio of the Reads logarithm of the Reads at both ends of the PE falling on different digested fragments is much greater than that of the traditional method.
- the quality of the Hi-C library of the method of the present invention is better than that of the library constructed by the traditional method.
- the HiCPro software was used to construct a Hi-C matrix at a resolution of 1Mb, as shown in Figure 4. It can be seen that at the genome-wide level, compared with the traditional method, the three-dimensional conformation of the chromatin of the rapid freezing cross-linking method of the present invention shows a high degree of similarity; when refined to the level of each chromosome (A01), the Hi -C heat map correlation shows that the DNA-DNA interaction intensity maintains a high degree of consistency in the two treatment groups (rapid freezing cross-linking group and traditional method treatment group); in addition, the chromatin compartment treated by the two methods The components are also highly similar. This shows that the rapid freezing and cross-linking treatment does not affect the three-dimensional conformation of cotton chromatin.
- the sample is fully ground into powder by liquid nitrogen and transferred to a 50mL centrifuge tube containing 10mL ice-cooled NIbuffer (NIbuffer, 20mM Hepes pH 8, 250mM Sucrose, 1mM MgCl 2 , 5mM KCl, 40% (v/v) ) Glycerol, 0.25% (v/v) Triton X-100, 0.1 mM PMSF, 0.1% (v/v) ⁇ -ME, 1/5 cocktail), shake gently on ice for 15 minutes.
- NIbuffer NIbuffer
- 20mM Hepes pH 8 250mM Sucrose
- 1mM MgCl 2 1mM MgCl 2
- 5mM KCl 40% (v/v) ) Glycerol
- 0.25% (v/v) Triton X-100 0.1 mM PMSF, 0.1% (v/v) ⁇ -ME, 1/5 cocktail
- the suspension was filtered twice through Miracloth (Millipore), the filtrate was collected in a 50 mL centrifuge tube, and centrifuged at 3000 g at 4°C for 15 minutes.
- biotin-labeled DNA 1.5 ⁇ L 10mM dATP, 1.5 ⁇ L 10mM dGTP, 1.5 ⁇ L 10mM dTTP, 37.5 ⁇ L 0.4mM biotin-14-dCTP
- 10 ⁇ L 5U/ ⁇ L Klenow enzyme 10 ⁇ L 5U/ ⁇ L Klenow enzyme
- T4 DNA polymerase uses the exonuclease activity of T4 DNA polymerase to remove uncircularized biotin-labeled DNA. Add 5 ⁇ g of library to a 100 ⁇ L system containing 1 ⁇ L 10mg/mL BSA, 10 ⁇ L 10 ⁇ NEBuffer 2,1 ⁇ L 10mM dATP, 1 ⁇ L 10mM dGTP and 5Units T4 DNA polymerase, and incubate at 16°C for 4 hours.
- step 28 Resuspend the magnetic beads in 300 ⁇ L 2 ⁇ Tween-free buffer (2 ⁇ NTB: 10mM Tris-HCl pH 8.0, 1mM EDTA, 2M NaCl), and add 300 ⁇ L Hi-C DNA obtained in step 28. Incubate for 15 minutes while rotating at room temperature to bind the biotin-labeled Hi-C DNA to the magnetic beads with streptavidin.
- 2 ⁇ Tween-free buffer 2 ⁇ Tween-free buffer
- 2 ⁇ NTB 10mM Tris-HCl pH 8.0, 1mM EDTA, 2M NaCl
- step 28 Use the total amount of DNA calculated in step 28 as the input amount of biotin pull-down, and divide it by 20 to estimate the total amount of Hi-C DNA that is pulled down and can be used for linker ligation and sequencing. Add the linker at a ratio of 6 picomoles of Illumina double-end adapters per microgram of Hi-C DNA that can be used for ligation. Use 1200 units of T4 DNA ligase to ligate the adaptor to the DNA. Incubate for 2 hours at room temperature.
- the purified PCR product is eluted with 50 ⁇ L 1 ⁇ TLE buffer, which is the constructed Hi-C library.
- the high-throughput sequencing data of 6 Hi-C libraries obtained by the traditional method and the method of the present invention were evaluated.
- the high-efficiency Hi-C data preprocessing tool HiC-Pro is used for data quality analysis.
- the quality control results are shown in Table 2.
- the results show that the ratio of uniquely matched paired-end sequencing reads obtained by the present invention is relatively stable, and the ratio of mutually interacting paired-end reads is relatively high.
- the storage capacity increases, the effective data increases, and the ineffectiveness caused by PCR amplification
- the data rate is low, maintaining at 21-25%.
- the ratio of interactive double-ended reads obtained by the traditional method is reduced, the library capacity of the constructed library is smaller, and the ratio of invalid data in PCR amplification is larger (67-73%).
- Table 2 The quality control results generated by the software HiCPro of the high-throughput sequencing data of the Hi-C library of the present invention and the traditional method
- the Hi-C maps of the same chromosome show a high degree of similarity ( Figure 5); the frozen replacement Hi-C (FS-Hi-C) of the present invention has a high degree of similarity ( Figure 5); Compared with the traditional Hi-C method, C (FS-Hi-C) has similar distribution patterns of A and B compartments between the same chromosomes ( Figure 6); the present invention freezes the replacement of Hi-C (FS-Hi-C) and Compared with the traditional Hi-C method, the TAD distribution pattern detected on the same chromosome is similar ( Figure 7); the chromatin structure of the frozen replacement (FS) sample of the present invention and the fresh sample are highly similar ( Figure 8).
- the present invention also uses frozen replacement Hi-C (FS-Hi-C) and traditional methods on tissue samples of soybeans and radishes to perform Hi-C library construction, Hi-C high-throughput sequencing and result analysis, respectively.
- FS-Hi-C frozen replacement Hi-C
- traditional methods on tissue samples of soybeans and radishes to perform Hi-C library construction, Hi-C high-throughput sequencing and result analysis, respectively.
- Table 3 Compared with traditional methods, in soybeans and radishes, the ratio of double-end reads matching the unique site of the genome generated by the method of the present invention increased, and the ratio of repetitive sequences generated by PCR amplification decreased.
- the Hi-C high-throughput sequencing library construction method proposed by the present invention proves that rapid freezing and then cross-linking does not change the chromatin conformation of the species and improves the Hi-C high-throughput sequencing library.
- the application scope of C technology can solve the problem of insufficient cross-linking due to the existence of plant cell wall and vacuoles.
- This embodiment takes Drosophila as the research object.
- S2 cell line Collect Drosophila cell line (S2 cell line) and take 5 ⁇ 10 6 cells;
- the sample is fully ground into powder by liquid nitrogen, and transferred to a 50mL centrifuge tube containing 10mL ice-cooled NIbuffer (NIbuffer, 20mM Hepes pH 8, 250mM Sucrose, 1mM MgCl 2 , 5mM KCl, 40% (v/v) ) Glycerol, 0.25% (v/v) Triton X-100, 0.1 mM PMSF, 0.1% (v/v) ⁇ -ME, 1/5 cocktail), shake gently on ice for 15 minutes.
- NIbuffer NIbuffer
- 20mM Hepes pH 8 250mM Sucrose
- 1mM MgCl 2 1mM MgCl 2
- 5mM KCl 40% (v/v) ) Glycerol
- 0.25% (v/v) Triton X-100 0.1 mM PMSF, 0.1% (v/v) ⁇ -ME, 1/5 cocktail
- the suspension was filtered twice through Miracloth (Millipore), the filtrate was collected in a 50 mL centrifuge tube, and centrifuged at 3000 g at 4°C for 15 minutes.
- biotin-labeled DNA 1.5 ⁇ L 10mM dATP, 1.5 ⁇ L 10mM dGTP, 1.5 ⁇ L 10mM dTTP, 37.5 ⁇ L 0.4mM biotin-14-dCTP
- 10 ⁇ L 5U/ ⁇ L Klenow enzyme 10 ⁇ L 5U/ ⁇ L Klenow enzyme
- T4 DNA polymerase uses the exonuclease activity of T4 DNA polymerase to remove uncircularized biotin-labeled DNA. Add 5 ⁇ g of library to a 100 ⁇ L system containing 1 ⁇ L 10mg/mL BSA, 10 ⁇ L 10 ⁇ NEBuffer 2,1 ⁇ L 10mM dATP, 1 ⁇ L 10mM dGTP and 5Units T4 DNA polymerase, and incubate at 16°C for 4 hours.
- step 28 Resuspend the magnetic beads in 300 ⁇ L 2 ⁇ Tween-free buffer (2 ⁇ NTB: 10mM Tris-HCl pH 8.0, 1mM EDTA, 2M NaCl), and add 300 ⁇ L Hi-C DNA obtained in step 28. Incubate for 15 minutes while rotating at room temperature to bind the biotin-labeled Hi-C DNA to the magnetic beads with streptavidin.
- 2 ⁇ Tween-free buffer 2 ⁇ Tween-free buffer
- 2 ⁇ NTB 10mM Tris-HCl pH 8.0, 1mM EDTA, 2M NaCl
- step 28 Use the total amount of DNA calculated in step 28 as the input amount of biotin pull-down, and divide it by 20 to estimate the total amount of Hi-C DNA that is pulled down and can be used for linker ligation and sequencing. Add the linker at a ratio of 6 picomoles of Illumina double-end adapters per microgram of Hi-C DNA that can be used for ligation. Use 1200 units of T4 DNA ligase to ligate the adaptor to the DNA. Incubate for 2 hours at room temperature.
- the purified PCR product is eluted with 50 ⁇ L 1 ⁇ TLE buffer, which is the constructed Hi-C library.
- the frozen replacement Hi-C (FS-Hi-C) of the present invention shows a high degree of similarity between the Hi-C maps of 2L, 2R, 3L, 3R, 4 and X chromosomes.
- the library of FS-Hi-C is of high quality, has a lot of valid data, and the frequency of detected DNA interactions has increased (Figure 9).
- Figure 10 When the raw reads are similar, compared with the two Hi-C methods, the overall distribution of A and B compartments is similar ( Figure 10).
- the overall distribution of TAD in the same chromosome detected by FS-Hi-C is similar ( Figure 11).
- the present invention proposes a method for building a frozen replacement Hi-C (FS-Hi-C) library suitable for organisms, by sequentially performing liquid nitrogen freezing treatment and pre-delivery on tissue samples Linking gradient heating and crosslinking can effectively maintain the chromatin conformation in living cells and greatly improve the quality of biological Hi-C data.
- This study breaks the restriction that Hi-C processing requires formaldehyde fixation of fresh samples before cross-linking.
- the samples can be stored in liquid nitrogen for later use, which provides an improved way for the application of Hi-C in biology. method.
- the present invention creatively develops a simple frozen Substitute-Hi-C technology (Frozen Substitute-Hi-C), which is not only suitable for fresh samples, but also for frozen samples stored in liquid nitrogen.
- This technology applies the Frozen Substitute technology to the preparation of Hi-C samples of liquid nitrogen (LN2) frozen samples to avoid three-dimensional structural changes during the heating process after the samples are removed from the liquid nitrogen solution.
- This method effectively expands the application range of Hi-C samples from fresh samples to frozen samples, and solves the problem of difficult sample sampling and long-distance transportation. It is worth noting that in plants, before the freeze replacement (FS) step, the present invention grinds the sample in liquid nitrogen to remove plant cell walls that have an adverse effect on plant cross-linking.
- the freezing replacement Hi-C is used to replace the liquid water in the sample cells with ethanol, so that the frozen sample maintains the same chromatin conformation during the heating process.
- the present invention has tested this new method in fruit flies, cotton, soybeans, and radishes. Observed by transmission electron microscope, it is found that the chromatin structure of the frozen replacement (FS) sample and the fresh sample are highly similar. Compared with traditional Hi-C, chromatin interaction, A/B compartment and TAD show a high degree of consistency. However, the library constructed by the FS-Hi-C method has better quality, increased storage capacity, more data for effective interaction, and reduced repetitive data generated by PCR amplification. This technology reduces the requirements on the timeliness of samples and facilitates the collection and storage of samples.
- the present invention breaks through the restriction that the Hi-C test requires fresh samples, improves the data quality of the library, maintains the chromatin conformation, and pave the way for further exploration of gene regulation and three-dimensional genome structure.
- the technical key of the present invention is to first use liquid nitrogen to quick-freeze the ground sample, perform pre-crosslinking and increase the temperature gradient, and then perform crosslinking and fixation.
- the present invention evaluated the quality of the frozen replacement Hi-C (FS-Hi-C) library, and found that in cotton and fruit flies, the three-dimensional conformation of the fresh sample and the frozen sample after the gradient warming were similar; and "freeze replacement" was used.
- the pre-processing of samples can improve the data quality of cotton and Drosophila Hi-C libraries.
- the invention will break the limitation that fresh samples must be used in the Hi-C test, and provide a broader prospect for the application of Hi-C in biological research.
- the present invention proposes a Hi-C experimental process suitable for organisms such as Drosophila, cotton, soybeans, and radishes.
- organisms such as Drosophila, cotton, soybeans, and radishes.
- frozen replacement Hi-C (FS-Hi-C) has no significant effect on the 3D structure of chromosomes.
- the FS-Hi-C technology of the present invention significantly increases the effective data volume of the constructed Hi-C library. Suitable for biological Hi-C library construction.
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biochemistry (AREA)
- Zoology (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Analytical Chemistry (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Immunology (AREA)
- Biotechnology (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Medicinal Chemistry (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
一种适用于Hi-C高通量测序建库的样本前处理方法,包括:将供试生物样本先冷冻再交联固定。该方法所提供的冷冻替换Hi-C技术(FS-Hi-C)不仅适用于新鲜样品,也适用于储存在液氮中的冷冻样品。该方法将冷冻替换(FS)技术应用于液氮(LN2)冷冻样品的Hi-C样品制备中,以防止样品从液氮溶液中移出后在升温过程中产生的三维结构变化,解决了低温冷冻保存样品如何用于研究其染色体三维结构的生物学问题。在果蝇、棉花、大豆,萝卜中评价了该方法,通过高压冷冻透射电镜观察,发现冷冻替换(FS)样品和新鲜样品的染色质三维结构高度相似,并且能够提高棉花和果蝇Hi-C文库的数据质量。与传统Hi-C前处理方法相比,FS-Hi-C在染色质互作、A/B compartment和TAD方面表现出高度的一致性。同时,FS-Hi-C方法构建的文库质量更好,库容增加,有效互作的数据更多,PCR扩增产生的重复数据降低。该方法降低了对样本时效性的要求,方便了样本的采集和保存。该方法突破了Hi-C实验样品的限制,能够高效捕获冷冻生物样本染色体三维构象,提高了生物Hi-C数据质量,为Hi-C在研究中的应用提供了一种改进的方法。
Description
本发明涉及染色体构象捕获的技术领域,特别涉及一种能够高效捕获冷冻生物样本染色体三维构象的技术。
染色体是一种存在于细胞核内的特殊结构,根据“结构决定功能”原则,染色体三维(3D)结构自然成为理解其生物学功能的基础,基因组的3D结构对DNA的复制、损伤修复及转录调控起着至关重要的作用。通过探索染色质间的远程调控作用,可以鉴定得到调控元件的未知靶基因,研究顺式调控元件调控的基因、eQTL调控的表型、TAD重组及A/B compartment转换调控的基因表达,然而限于技术上的原因,对染色体3D结构解析的进展一直较为缓慢。
2002年,Dekker等建立了3C染色体构象捕获(chromosome conformation capture,3C)技术,这是一种研究染色体和蛋白互作的染色体构象技术,可以分析直线距离很远的基因位点之间的关联性。3C技术解决的是直线距离较远的2个DNA片段的空间关系,即“一对一”关系。该项技术对于染色体DNA空间结构领域的研究具有里程碑的意义,开创了染色质三维高级结构研究的新纪元,并启发了一系列衍生技术的问世。2006年出现的4C和5C技术,分别是解决DNA片段“一对多”和“多对多”的关系。尽管这些衍生技术深化了对染色体片段之间相互关系的理解,但是仍然无法从整体上认识染色体3D结构。
近年来,随着日益成熟的高通量测序技术的发展,大规模基因组信息的获得变得更加容易。2009年,Dekker等将染色体构象捕获和日益成熟的高通量测序技术结合,首次建立了最高通量的Hi-C(highest-throughput chromosome conformation capture)技术。Hi-C是一种分析染色体空间构象的高通量测序技术,它能够在全基因组范围内捕捉不同基因座位之间的空间交互,有助于研究者理解染色体三维空间结构、染色体之间相互作用以及基因表达的空间调控机制。Hi-C技术是在3C技术的基础上进行了修改,添加了生物素标记的核苷酸,使得后续剪切产生的小片段DNA能够富集,并对这些片段的两端添加测序接头,进而采用最新的测序手段对结果进行比对分析。该技术步骤多、耗时长,涉及的试剂耗材繁杂,整个流程还有较多可以改进优化的空间。现有Hi-C高通量测序的这些缺陷限制了该技术在促进功能基因组研究中的应用,因此有必要对该技术进行改进,从而提供一种高效、便捷、经济和适用性广的新的Hi-C高通量测序建库方法。染色体互作在基因组结构和基因调控中起着重要作用,Hi-C是研究物种三维基因组结构的有力工具。然而,获取天然染色质构象需要新鲜的样本,这阻碍了三维基因组研究的进展。
水的固态有三种形态,包括两种晶型(六方晶型和立方晶型)和玻璃态。玻璃态是由于样品的快速冻结而来不及形成晶体而产生的。晶体的形成导致水冻 结时膨胀,但玻璃态水在凝固后不会膨胀,这使得玻璃态水成为生物标本唯一理想的冻结形式。基于上述原理,液氮(LN2)被广泛应用于生物样品的冷冻和长期保存。然而,冷冻标本很少用于三维基因组研究。人们普遍认为从液氮溶液中移出后样品在升温过程中会对三维基因组结构产生影响。冷冻替换(FS)技术通常被用来维持单细胞和多细胞动植物的结构,因此它被广泛地应用于制备常规光学显微镜、透射电镜和扫描电镜的样品。
发明公开
本发明针对目前样品制备繁琐、不太容易进行质量控制的常用Hi-C技术的关键步骤进行了优化和改进,以便于进行标准化与质量控制,并能提供更加稳定、可靠的实验结果,促进Hi-C技术进一步的广泛应用。
第一方面,本发明要求保护一种适用于Hi-C高通量测序建库的样本前处理方法。
本发明所要求保护的适用于Hi-C高通量测序建库的样本前处理方法,可包括如下步骤:将供试生物样本先冷冻再交联固定。
其中,所述冷冻可为-196℃低温处理。
在本发明中,所述冷冻具体为将所述供试生物样本置于液氮中进行速冻,并在液氮中研磨成粉末。
其中,所述交联固定可为利用甲醛溶液进行交联固定。其中,所述甲醛溶液可为体积百分含量为37%的甲醛溶液。进行所述交联固定时,甲醛在体系中的终浓度为1%体积百分含量。
进一步地,在所述冷冻和所述交联固定之间还包括将所述供试生物样本进行预冷冻交联和梯度升温的过程。该方法能够防止样品从液氮溶液中移出后在升温过程中产生的三维结构变化。解决了低温冷冻保存样品如何用于研究其染色体三维结构的生物学问题,并可以有效维持活细胞中的染色质构象,提高了生物的Hi-C数据质量。
其中,所述的预冷冻交联是在预冷冻交联液中进行的,所述预冷冻交联液为含有2%水、0.01%甲醛的乙醇溶液,%表示体积百分含量。
其中,所述梯度升温为-90℃6h、-60℃6h、-30℃6h、0℃6h。在所述梯度升温的过程中,从-90℃升至-60℃是每小时升温5℃,从-60℃升至-30℃是每小时升温5℃,从-30℃升至0℃是每小时升温5℃。
更进一步地,所述方法可包括将所述供试生物样本依次进行如下处理的步骤:
(A1)液氮速冻研磨;
(A2)预冷冻交联;
(A3)梯度升温;
(A4)离心,加NIbuffer过滤;
(A5)甲醛交联固定;
(A6)甘氨酸终止交联;
(A7)离心回收细胞核。
在步骤(A1)中,将所述供试生物样本放入液氮进行速冻,并在液氮中研磨成粉末。
在步骤(A2)中,将(A1)研磨的粉末转移到-90℃预冷冻交联液中,所述预冷冻交联液为含有2%水,0.01%甲醛的乙醇溶液,%表示体积百分含量。
在步骤(A3)中,所述梯度升温具体可为-90℃6h、-60℃6h、-30℃6h,目的是将细胞中的水分替换为乙醇溶液。
在所述梯度升温的过程中,从-90℃升至-60℃是每小时梯度升温5℃,从-60℃升至-30℃是每小时梯度升温5℃。
在步骤(A4)中,所述NIbuffer的配方如下:20mM Hepes pH 8,250mM蔗糖,1mM氯化镁,5mM氯化钾,体积百分含量40%的甘油,体积百分含量0.25%的Triton X-100,0.1mM PMSF,体积百分含量0.1%的β-巯基乙醇,1/5体积分数的cocktail。其中,所述cocktail是蛋白酶抑制剂。在本发明的具体实施方式中,所述cocktail为MCE公司产品,货号HY-K0010。
在步骤(A4)中,进行所述离心(如4℃3000g离心30s)后去除上清(去除的是酒精溶液),加预冷(冰上预冷)的所述NIbuffer进行清洗,根据需要可清洗三次,清洗完之后按照每1-2g(如2g)所述供试生物样本(或者2×10
6个果蝇细胞样本)20mL的量加入预冷(冰上预冷)的所述NIbuffer形成悬浮液,然后温和摇晃15min,然后Miracloth(Millipore,货号:475855)过滤(可过滤两次),收集滤液,离心(如4℃3000g离心15min)。
更进一步地,在步骤(A5)中,向(A4)离心所得上清液中加入体积百分含量为37%的甲醛水溶液。其中,甲醛在体系中的终浓度为1%体积百分含量。进行所述交联固定时,交联的温度为室温,时间为8min。
更进一步地,在步骤(A6)中,可通过加入浓度为2.5M的甘氨酸溶液(溶剂是水)终止交联。
更进一步地,在步骤(A7)中,所述离心可为4℃1500g离心5min。第二方面,本发明要求保护一种Hi-C高通量测序建库方法。
本发明所要求保护的Hi-C高通量测序建库方法,可包括如下步骤:先利用前文所述方法对供试生物样本进行预处理;然后将处理后的样本进行Hi-C高通量测序建库。
第三方面,本发明要求保护一种Hi-C高通量测序方法。
本发明所要求保护的Hi-C高通量测序方法,可包括如下步骤:先利用前文所述方法对供试生物样本进行预处理;接着,将处理后的样本进行Hi-C高通量测序建库;最后进行Hi-C高通量测序。
在上述三方面中,所述生物样本可为细胞或组织,如新鲜的细胞或组织。
在上述三方面中,所述生物为植物(如棉花、大豆或萝卜)或动物(如果 蝇)。
在本发明的一个具体实施方式中,所述生物样本具体为棉花的愈伤组织或果蝇细胞系。
第四方面,本发明要求保护如下任一中的应用:
(B1)前文第一方面中所述的方法在Hi-C高通量测序建库中的应用;
(B2)前文第一方面和第二方面中所述的方法在Hi-C高通量测序中的应用。
图1为本发明适用于生物的FS-Hi-C高通量测序建库方法前处理流程框图。
图2为本发明适用于生物的FS-Hi-C方法流程示意图。
图3为实施例1第一批次试验(小规模测序)中对植物样品的两种不同前处理方法所得基因组的琼脂糖凝胶电泳检测结果。1和8:DNA marker;2:传统方法提取的基因组3:本发明方法提取的基因组;4:传统方法的酶切(内切酶DpnII)结果;5:本发明方法的酶切(内切酶DpnII)结果;6:传统方法酶切后连接结果;7:本发明方法酶切后连接结果。
图4为实施例1第一批次试验(小规模测序)中用HiCPro软件构建1Mb分辨率下的Hi-C矩阵图。图中,H和MH是传统方法,F和CF是本发明方法。
图5为实施例1第二批次试验(深度测序)中冷冻替换Hi-C(FS-Hi-C)和传统Hi-C方法处理的棉花染色体互作的热图。
图6为实施例1第二批次试验深度测序中冷冻替换Hi-C(FS-Hi-C)和传统Hi-C方法处理的棉花中A/B compartments的分布模式比较。
图7为实施例1第二批次试验深度测序中冷冻替换Hi-C(FS-Hi-C)和传统Hi-C方法处理的棉花的拓扑学相关结构域(TADs,topological associated domains)比较。
图8为实施例1中不同处理下细胞核中染色质结构的变化:a、b-f图中的处理方式;b、新鲜的棉花叶片在高压冷冻和冷冻替换处理后的染色质结构(HPF-FS-TEM);c、冷冻研磨并直接升温后的细胞核在高压冷冻和冷冻替换处理后的染色质结构;d、按传统Hi-C方法提取的细胞核在高压冷冻和冷冻置换后的染色质结构;e、FS-Hi-C处理但无梯度升温过程提取的细胞核在高压冷冻和冷冻置换后的染色质结构;f、冷冻替换Hi-C(FS-Hi-C)提取的细胞核在高压冷冻和冷冻置换后的染色质结构。
图9为实施例2中冷冻替换Hi-C(FS-Hi-C)和传统Hi-C方法处理的果蝇的染色体互作的热图。
图10为实施例2中冷冻替换Hi-C(FS-Hi-C)和传统Hi-C方法的果蝇中A/B compartments的分布模式的比较。
图11为实施例2中冷冻替换Hi-C(FS-Hi-C)和传统Hi-C方法的果蝇中的拓扑学相关结构域(TADs,topological associated domains)的比较。
实施发明的最佳方式
本发明的适用于植物的FS-Hi-C高通量测序建库方法前处理流程图如图1所示,依次对供试生物样本进行如下处理:液氮速冻研磨、预冷冻交联、梯度升温、离心加NIBbuffer过滤、37%甲醛交联固定、离心回收细胞核。本发明适用于生物的FS-Hi-C方法流程示意图如图2所示。
以下的实施例便于更好地理解本发明,但并不限定本发明。
下述实施例中所使用的实验方法如无特殊说明,均为常规方法。
下述实施例中所用的材料、试剂等,如无特殊说明,均可从商业途径得到。
预冷冻交联液:含有2%水、0.01%甲醛的酒精溶液,%表示体积百分含量。
NIbuffer:20mM Hepes pH 8;250mM蔗糖;1mM MgCl
2;5mM KCl;40%(v/v)甘油;0.25%(v/v)Triton X-100;0.1mM PMSF;0.1%(v/v)β-巯基乙醇;1/5体积分数的cocktail(蛋白酶抑制剂,MCE公司产品,货号HY-K0010)。
1.2×NEBuffer 2:10×NEBuffer 2为NEB公司产品,货号:B7002S。根据需要稀释。
NEB buffer3.1:NEB buffer3.1为NEB公司产品,货号:B7203。
10×NEBuffer 2:NEB公司,货号:B7002S。
1×NEBuffer 2:10×NEBuffer 2为NEB公司产品,货号:B7002S。根据需要稀释。
实施例1、棉花的Hi-C高通量测序
本实施例以棉花组织培养过程中的愈伤组织为研究对象,进行了两个批次的试验,其中第一批次对两种方法的Hi-C文库进行小规模测序(10g数据量,各两个小数据评估)以评估文库的质量,发现Hi-C新方法的棉花Hi-C文库在有效数据比例上远高于传统方法;第二批次中,我们对棉花的两种方法所建文库(各三个)进行了深度测序,基因组覆盖度约达到75X、50X和30X(FS-Hi-C);50X、50X和30X(传统Hi-C)。结果发现,FS-Hi-C显著提高了Hi-C文库的有效数据比例,并大幅度降低了文库中由PCR扩增引起的重复序列比例。
一、第一批次试验
(一)样本预处理
本发明方法:
1、用滤纸吸干新鲜的棉花愈伤组织,并取4g在液氮中速冻并研磨成粉末。
2、将研磨的棉花愈伤组织粉末转移至-90℃的预冷冻交联液中,然后缓慢的梯度升温,-90℃—-60℃—-30℃—0℃,每一个温度梯度持续6h(从-90℃升至-60℃是每小时升温5℃,从-60℃升至-30℃是每小时升温5℃,从-30℃升至0℃是每小时升温5℃)。将细胞中的水分替换成乙醇溶液。
3、4℃3000g离心30s,将乙醇溶液去除,然后用放置在冰上预冷的NIbuffer缓冲液清洗三次,清洗完之后按照每2g棉花愈伤组织20mL的量加入预冷(冰上预冷)的NIbuffer形成悬浮液,并温和摇晃15min。
4、悬浮液用Miracloth(Millipore,货号:475855)过滤两次,收集滤液, 4℃3000g离心15min,向离心所得上清中按照每2g棉花愈伤组织800μL的量加入体积百分含量为37%的甲醛水溶液(使得甲醛在体系中的终浓度为1%体积百分含量),室温交联固定8min,加入浓度为2.5M的甘氨酸(溶剂是水)终止交联。
5、1500g,4℃离心5min回收细胞核。
传统方法:
1、收集4g-10g新鲜的组织样品;
2、每个50mL离心管提前加好45mL PBS(或无糖MS)(通风橱中每管加35μL巯基乙醇)和1.25mL的37%甲醛溶液(sigma)(终浓度1%);
3、每2-3g样品用锋利的刀片切成2mm×3mm×10mm的小块,放置于上述50mL离心管中,室温下,抽真空-60到-80Kpa后,保持10min,然后恢复至大气压强。如此反复三次(标准:样品沉底!);
4、加2.5mL的2.5M甘氨酸(4℃保存)来终止交联反应(终浓度0.125M),摇匀5min,终止反应。
5、滤掉液体,ddH
2O清洗3次,将样品表面的液体用吸水纸吸干,液氮速冻,-80℃保存后,干冰运输。
6、交联后样品经液氮充分研磨成粉末,转移至含10mL冰预冷NIbuffer的50mL离心管中(NIbuffer,20mM Hepes pH 8,250mM Sucrose,1mM MgCl
2,5mM KCl,40%(v/v)glycerol,0.25%(v/v)Triton X-100,0.1mM PMSF,0.1%(v/v)β-ME,1/5cocktail),冰上温和摇晃15min。
7、悬浮液经Miracloth(Millipore)过滤两次,50mL离心管收集滤液,4℃3000g离心15min。
(二)建库流程
1、弃上清,沉淀重悬于2mL NIbuffer,4℃1900g离心5min。
2、重复步骤1一次。
3、弃上清。重悬于1mL 1.2×NEBuffer 2,4℃1900g离心5min。
4、弃上清,重悬于500μL 1.2×NEBuffer 2。
5、加入7.5μL 20%(w/v)SDS。65℃孵育40min。37℃800rpm摇床孵育20min。
6、加入50μL 20%(v/v)Triton X-100,37℃800rpm摇床孵育1h。
7、加入50μL NEB buffer3.1,10μL DpnII,37℃旋转过夜。
8、62℃灭活DpnII 20分钟。
9、加生物素标记的DNA(1.5μL 10mM dATP,1.5μL 10mM dGTP,1.5μL 10mM dTTP,37.5μL 0.4mM biotin-14-dCTP)和10μL 5U/μL Klenow酶补平DNA,小心混匀并在37℃孵育45min。
10、取出放置冰上,加86μL 10%(w/v)SDS终止反应。65℃孵育30min,之后迅速放置冰上。
11、补平DNA,在冰上加7.61mL成环反应体系(745μL 10%(v/v)Triton X-100,745μL 10×ligation buffer(500mM Tris-HCl pH 7.5,100mM MgCl
2,100mM DTT),80μL 10mg/mL BSA,80μL 100mM ATP和5.96mL ddH
2O)至15mL离心管,转移染色质至15mL离心管。
12、加50U T4 DNA ligase,颠倒混匀,16℃反应4h,进行DNA成环。
13、加50μL 10mg/mL蛋白酶K,65℃消化过夜。之后,再加50μL 10mg/mL蛋白酶K,65℃消化2h。
14、冷却至室温,转移到50mL离心管。通过酚提取纯化DNA。加10mL pH 8.0的酚,涡旋震荡2min。1500g离心10min,转移水相至新的50mL离心管。
15、通过酚氯仿(1:1体积)再次抽提纯化,涡旋震荡2min,22000g离心5min,转移水相至新的50mL离心管。
16、加入2.5倍体积纯酒精沉淀DNA,22000g离心5min,弃上清。
17、加入450μL TE(pH 8.0)溶解DNA沉淀,转移至2mL离心管。
18、加入500μL酚氯仿(1:1体积),震荡1min,22000g离心5min,转移水相至新管。纯化两次之后,加0.1倍体积的NaOAc和2倍体积纯酒精,-80℃孵育30min,离心去上清。
19、加入1mL 70%酒精清洗DNA,22000g离心5min。弃上清。
20、加入25μL TE重悬DNA。加1μL 1mg/mL RNase A于37℃消化30min。
21质检:取5μL样品进行1%琼脂糖胶检测。主带明显的样品为不合格,需处理后再行酶切。如呈现弥散型条带,样品继续进行后续实验。
图3中显示了传统方法和本发明方法提取的基因组样品、基因组酶切后的样品,以及酶切后再进行连接的样品的琼脂糖凝胶检测结果。该结果显示本发明方法酶切后呈现弥散条带,说明样品前处理合格,与传统方法处理结果一致,可以进行后续实验。
22、利用T4 DNA聚合酶的外切酶活性,去掉未环化的生物素标记DNA。加5μg的library至含1μL 10mg/mL BSA,10μL 10×NEBuffer 2,1μL 10mM dATP,1μL 10mM dGTP和5Units T4 DNA polymerase的100μL体系中,16℃孵育4小时。
23、加2μL 0.5M EDTA pH 8.0终止反应,并通过酚氯仿(1:1,体积比)/乙醇沉淀纯化DNA。100μL ddH
2O重悬DNA。
24、利用超声波DNA破碎仪(Covaris M220)将DNA打断至200-700bp左右。
25、修复末端,加14μL 10×ligation buffer,14μL 2.5mM dNTP mix,5μL T4 DNA polymerase,5μL T4 polynucleotide kinase,1μL Klenow DNA polymerase和1μL水,20℃孵育30min。
26、通过DNA纯化试剂盒(Qiagen)进行DNA纯化。以15μL低盐TE(TLE:10mM Tris pH 8.0,0.1mM EDTA)洗脱两次,之后对DNA加A,通过5μL 10 ×NEBuffer 2,10μL 1mM dATP,2μL水和3μL Klenow(exo-),37℃孵育20分钟。
27、65℃孵育20min终止反应,并迅速放冰上。真空浓缩至20μL。
28、用1.5%的琼脂糖凝胶80-90v跑3.5小时回收DNA。挑选300-500bp的DNA切胶并通过胶回收试剂盒进行回收。以50μL TLE洗脱。汇总洗脱液至300μL。通过Qubit计算DNA总量。
29、接下来的实验推荐在低DNA吸附管(DNA LoBind tubes)中进行。
30、通过磁珠法进行生物素DNA富集,详细参照试剂盒说明操作(Agencourt AMPure eads,Beckman)。具体来说:
A.用400μL Tween缓冲液(TB:5mM Tris-HCl pH 8.0,0.5mM EDTA,1M NaCl,0.05%Tween)洗涤150μL重悬浮的链霉亲和素磁珠两次来制备生物素下拉磁珠。
B.将磁珠重悬于300μL 2×不含Tween的缓冲液(2×NTB:10mM Tris-HCl pH 8.0,1mM EDTA,2M NaCl)中,并加入步骤28中所得的300μL Hi-C DNA。在室温下边旋转边孵育15分钟,使生物素标记的Hi-C DNA与带有链霉亲和素的磁珠结合。
C.用磁性粒子集中器回收与DNA结合的链霉亲和素磁珠,并弃去上清液。依次用400μL 1×NTB(5mM Tris-HCl pH 8.0,0.5mM EDTA,1M NaCl)和100μL1×ligation buffer洗涤珠子。接着将珠重悬于50μL 1×ligation buffer中,并将混合物转移到新管中。
D.将磁珠用Gibson试剂盒缓冲液重悬,并加入T5外切酶37℃消化10min(参考文献:enzymatic assembly of DNA molecules up to several hundred kilobases)。
E.将在步骤28中计算出的DNA总量作为生物素下拉的输入量,并将其除以20估计被拉下且可用于接头连接及测序的Hi-C DNA总量。按每微克可用于连接的Hi-C DNA添加6皮摩尔的Illumina双末端接头的比例添加连接物。使用1200单位T4 DNA连接酶将接头连接到DNA上。在室温下孵育2小时。
F.回收与Hi-C DNA结合的磁珠并用400μL 1×TB洗涤两次,从而去除多余的双末端接头。
G.依次用200μL 1×NTB、200μL和50μL 1×NEBuffer洗涤磁珠。最后一次洗涤后,将其重悬于50μL 1×NEBuffer 2并转移到新管。
H.为了确定合适的循环次数以产生足够的可进行测序的PCR产物,分别以6、9、12和15个循环进行PCR试验。通过在5%聚丙烯酰胺凝胶上进行PCR反应并用Sybr Green染色确定最佳循环数,确保不存在杂带,并且在400-600个碱基对区段有条带出现,因为这是Hi-C DNA与接头相连后大致的片段长度。
I.以最佳PCR循环次数进行大规模PCR以扩增其余与磁珠结合的Hi-C DNA文库。分别收集PCR产物并回收磁珠。取1%上述PCR产物于凝胶上作为对照,并用1.8倍体积的Ampure珠纯化剩余的PCR产物。
J.用50μL 1×TLE缓冲液洗脱纯化后的PCR产物,即为构建好的Hi-C文库。
通过上述操作,获得两份传统方法获得的Hi-C文库(记为T1和T2),两份本发明方法获得的Hi-C文库(记为F1和F2)。
(三)Hi-C高通量测序
采用Illumina:HiSeq X Ten测序仪进行pair-end测序(双端测序)。
(四)Hi-C文库质量鉴定
对传统方法和本发明方法所得的4份Hi-C文库的高通量测序数据进行评估的。通过公认的软件HiCPro生成质控结果如表1所示(小规模测序)。该结果显示,与传统方法相比,本发明得到的高质量的Reads对数比例要高于传统方法;本发明接头污染的Reads的比例要低于传统方法;本发明方法两端均未比对到基因组的Reads对数比例要低于传统方法;本发明方法去除PCR过程中产生的重复序列的Reads对后PE两端的Reads落在不同酶切片段的Reads对数的比例要远大于传统方法。结论:本发明方法的Hi-C文库质量要优于传统方法建库的文库质量。
表1 本发明和传统方法的Hi-C文库的高通量测序数据通过软件HiCPro生成的质控结果
接着,用HiCPro软件构建1Mb分辨率下的Hi-C矩阵图,如图4所示。可以看出,在全基因组水平,本发明快速冷冻交联法和传统方法相比,两者的染色质三维构象呈现高度的相似性;细化到每条染色体水平(A01),两者的Hi-C热图相关性显示,DNA-DNA的交互作用强度在两个处理组(快速冷冻交联组和传统方法处理组)中保持高度的一致性;此外,两种方法处理后的染色质compartment组分也高度相似。这说明快速冷冻交联处理不影响棉花的染色质三维构象。
二、第二批次试验
(一)样本预处理
本发明方法:
1、用滤纸吸干新鲜的棉花愈伤组织,并取4g在液氮中速冻并研磨成粉末。
2、将研磨的棉花愈伤组织粉末转移至-90℃的预冷冻交联液中,然后缓慢的梯度升温:-90℃,6h;-90℃到-60℃,6h(每小时升温5℃);-60℃,6h;-60℃到-30℃,6h(每小时升温5℃);-30℃,6h;-30℃到0℃,6h(每小时升温5℃);每一个温度梯度持续6h。将细胞中的水分替换成乙醇溶液。
3、4℃3000g离心30s,将酒精乙醇去除,然后用放置在冰上预冷的NIbuffer缓冲液清洗三次,清洗完之后按照每2g棉花愈伤组织20mL的量加入预冷(冰上预冷)的NIbuffer形成悬浮液,并温和摇晃15min。
4、悬浮液用Miracloth(Millipore,货号:475855)过滤两次,收集滤液,4℃3000g离心15min,向离心所得上清中按照每2g棉花愈伤组织800μL的量加入体积百分含量为37%的甲醛水溶液(使得甲醛在体系中的终浓度为1%体积百分含量),室温交联固定8min,加入浓度为2.5M的甘氨酸(溶剂是水)终止交联。
5、1500g,4℃离心5min回收细胞核。
传统方法:
1、收集4g-10g新鲜的组织样品;
2、每个50mL离心管提前加好45mL PBS(或无糖MS)(通风橱中每管加35μL巯基乙醇)和1.25mL的37%甲醛溶液(sigma)(终浓度1%);
3、每2-3g样品用锋利的刀片切成2mm×3mm×10mm的小块,放置于上述50mL离心管中,室温下,抽真空-60到-80Kpa后,保持10min,然后恢复至大气压强。如此反复三次(标准:样品沉底!);
4、加2.5mL的2.5M甘氨酸(4℃保存)来终止交联反应(终浓度0.125M),摇匀5min,终止反应。
5、滤掉液体,ddH
2O清洗3次,将样品表面的液体用吸水纸吸干,液氮速冻,-80℃保存后,干冰运输。
6、交联后样品经液氮充分研磨成粉末,转移至含10mL冰预冷NIbuffer的50mL离心管中(NIbuffer,20mM Hepes pH 8,250mM Sucrose,1mM MgCl
2, 5mM KCl,40%(v/v)glycerol,0.25%(v/v)Triton X-100,0.1mM PMSF,0.1%(v/v)β-ME,1/5cocktail),冰上温和摇晃15min。
7、悬浮液经Miracloth(Millipore)过滤两次,50mL离心管收集滤液,4℃3000g离心15min。
(二)建库流程
1、弃上清,沉淀重悬于2mL NIbuffer,4℃1900g离心5min。
2、重复步骤1一次。
3、弃上清。重悬于1mL 1.2×NEBuffer 2,4℃1900g离心5min。
4、弃上清,重悬于500μL 1.2×NEBuffer 2。
5、加入7.5μL 20%(w/v)SDS。65℃孵育40min。37℃800rpm摇床孵育20min。
6、加入50μL 20%(v/v)Triton X-100,37℃800rpm摇床孵育1h。
7、加入50μL NEB buffer3.1,10μL DpnII,37℃旋转过夜。
8、62℃灭活DpnII 20分钟。
9、加生物素标记的DNA(1.5μL 10mM dATP,1.5μL 10mM dGTP,1.5μL 10mM dTTP,37.5μL 0.4mM biotin-14-dCTP)和10μL 5U/μL Klenow酶补平DNA,小心混匀并在37℃孵育45min。
10、取出放置冰上,加86μL 10%(w/v)SDS终止反应。65℃孵育30min,之后迅速放置冰上。
11、补平DNA,在冰上加7.61mL成环反应体系(745μL 10%(v/v)Triton X-100,745μL 10×ligation buffer(500mM Tris-HCl pH 7.5,100mM MgCl
2,100mM DTT),80μL 10mg/mL BSA,80μL 100mM ATP和5.96mL ddH
2O)至15mL离心管,转移染色质至15mL离心管。
12、加50U T4 DNA ligase,颠倒混匀,16℃反应4h,进行DNA成环。
13、加50μL 10mg/mL蛋白酶K,65℃消化过夜。之后,再加50μL 10mg/mL蛋白酶K,65℃消化2h。
14、冷却至室温,转移到50mL离心管。通过酚提取纯化DNA。加10mL pH 8.0的酚,涡旋震荡2min。1500g离心10min,转移水相至新的50mL离心管。
15、通过酚氯仿(1:1体积)再次抽提纯化,涡旋震荡2min,22000g离心5min,转移水相至新的50mL离心管。
16、加入2.5倍体积纯酒精沉淀DNA,22000g离心5min,弃上清。
17、加入450μL TE(pH 8.0)溶解DNA沉淀,转移至2mL离心管。
18、加入500μL酚氯仿(1:1体积),震荡1min,22000g离心5min,转移水相至新管。纯化两次之后,加0.1倍体积的NaOAc和2倍体积纯酒精,-80℃孵育30min,离心去上清。
19、加入1mL 70%酒精清洗DNA,22000g离心5min。弃上清。
20、加入25μL TE重悬DNA。加1μL 1mg/mL RNase A于37℃消化30min。
21、质检:取5μL样品进行1%琼脂糖胶检测。主带明显的样品为不合格,需处理后再行酶切。如呈现弥散型条带,样品继续进行后续实验。
22、利用T4 DNA聚合酶的外切酶活性,去掉未环化的生物素标记DNA。加5μg的library至含1μL 10mg/mL BSA,10μL 10×NEBuffer 2,1μL 10mM dATP,1μL 10mM dGTP和5Units T4 DNA polymerase的100μL体系中,16℃孵育4小时。
23、加2μL 0.5M EDTA pH 8.0终止反应,并通过酚氯仿(1:1,体积比)/乙醇沉淀纯化DNA。100μL ddH
2O重悬DNA。
24、利用超声波DNA破碎仪(Covaris M220)将DNA打断至200-700bp左右。
25、修复末端,加14μL 10×ligation buffer,14μL 2.5mM dNTP mix,5μL T4 DNA polymerase,5μL T4 polynucleotide kinase,1μL Klenow DNA polymerase和1μL水,20℃孵育30min。
26、通过DNA纯化试剂盒(Qiagen)进行DNA纯化。以15μL低盐TE(TLE:10mM Tris pH 8.0,0.1mM EDTA)洗脱两次,之后对DNA加A,通过5μL 10×NEBuffer 2,10μL 1mM dATP,2μL水和3μL Klenow(exo-),37℃孵育20分钟。
27、65℃孵育20min终止反应,并迅速放冰上。真空浓缩至20μL。
28、用1.5%的琼脂糖凝胶80-90v跑3.5小时回收DNA。挑选300-500bp的DNA切胶并通过胶回收试剂盒进行回收。以50μL TLE洗脱。汇总洗脱液至300μL。通过Qubit计算DNA总量。
29、接下来的实验推荐在低DNA吸附管(DNA LoBind tubes)中进行。
30、通过磁珠法进行生物素DNA富集,详细参照试剂盒说明操作(Agencourt AMPure eads,Beckman)。具体来说:
A.用400μL Tween缓冲液(TB:5mM Tris-HCl pH 8.0,0.5mM EDTA,1M NaCl,0.05%Tween)洗涤150μL重悬浮的链霉亲和素磁珠两次来制备生物素下拉磁珠。
B.将磁珠重悬于300μL 2×不含Tween的缓冲液(2×NTB:10mM Tris-HCl pH 8.0,1mM EDTA,2M NaCl)中,并加入步骤28中所得的300μL Hi-C DNA。在室温下边旋转边孵育15分钟,使生物素标记的Hi-C DNA与带有链霉亲和素的磁珠结合。
C.用磁性粒子集中器回收与DNA结合的链霉亲和素磁珠,并弃去上清液。依次用400μL 1×NTB(5mM Tris-HCl pH 8.0,0.5mM EDTA,1M NaCl)和100μL 1×ligation buffer洗涤珠子。接着将珠重悬于50μL 1×ligation buffer中,并将混合物转移到新管中。
D.将磁珠用Gibson试剂盒缓冲液重悬,并加入T5外切酶37℃消化10min(参考文献:enzymatic assembly of DNA molecules up to several hundred kilobases)。
E.将在步骤28中计算出的DNA总量作为生物素下拉的输入量,并将其除以20估计被拉下且可用于接头连接及测序的Hi-C DNA总量。按每微克可用于连接的Hi-C DNA添加6皮摩尔的Illumina双末端接头的比例添加连接物。使用1200单位T4 DNA连接酶将接头连接到DNA上。在室温下孵育2小时。
F.回收与Hi-C DNA结合的磁珠并用400μL 1×TB洗涤两次,从而去除多余的双末端接头。
G.依次用200μL 1×NTB、200μL和50μL 1×NEBuffer洗涤磁珠。最后一次洗涤后,将其重悬于50μL 1×NEBuffer 2并转移到新管。
H.为了确定合适的循环次数以产生足够的可进行测序的PCR产物,分别以6、9、12和15个循环进行PCR试验。通过在5%聚丙烯酰胺凝胶上进行PCR反应并用Sybr Green染色确定最佳循环数,确保不存在杂带,并且在400-600个碱基对区段有条带出现,因为这是Hi-C DNA与接头相连后大致的片段长度。
I.以最佳PCR循环次数进行大规模PCR以扩增其余与磁珠结合的Hi-C DNA文库。分别收集PCR产物并回收磁珠。取1%上述PCR产物于凝胶上作为对照,并用1.8倍体积的Ampure珠纯化剩余的PCR产物。
J.用50μL 1×TLE缓冲液洗脱纯化后的PCR产物,即为构建好的Hi-C文库。
通过上述操作,获得三份本发明方法FS-Hi-C获得的Hi-C文库(记为CF9、CF10和CF15),获得三份传统方法获得的Hi-C文库(记为MH14、MH16和MH19)。
(三)Hi-C高通量测序
采用Illumina:HiSeq X Ten测序仪进行pair-end测序(双端测序)。
(四)Hi-C文库质量鉴定
对传统方法和本发明方法所得的6份Hi-C文库的高通量测序数据进行评估的。利用高效率的Hi-C数据预处理工具HiC-Pro进行数据质量分析,质控结果如表2所示。该结果显示,本发明得到的唯一匹配的双端测序reads的比率比较稳定,互作的双端reads比例较高,随着测序深度的增加,库容增加,有效数据增加,PCR扩增导致的无效数据的比率较低,维持在21-25%。相比之下,传统方法得到的互作的双端reads比例降低,构建的文库的库容较小,PCR扩增的无效数据比例较大(67-73%)。
结论:本发明方法的Hi-C文库质量要优于传统方法建库的文库质量。
表2 本发明和传统方法的Hi-C文库的高通量测序数据通过软件HiCPro生成的质控结果
本发明冷冻替换Hi-C(FS-Hi-C)和传统Hi-C方法相比,相同染色体之间的Hi-C map均呈现出高度的相似性(图5);本发明冷冻替换Hi-C(FS-Hi-C)和传统Hi-C方法相比,相同染色体之间的A和B compartment的分布模式相似(图6);本发明冷冻替换Hi-C(FS-Hi-C)和传统Hi-C方法相比,相同染色体上检测到的TAD分布模式相似(图7);本发明冷冻替换(FS)样品和新鲜样品的染色质结构高度相似(图8)。这些结果说明,在全基因组水平,冷冻替换Hi-C(FS-Hi-C)和传统Hi-C方法相比,两者的染色质三维构象呈现出高度的相似性,FS-Hi-C显示出更高的文库质量。
另外,本发明还对大豆、萝卜的组织样本采用冷冻替换Hi-C(FS-Hi-C)和传统方法分别进行了Hi-C文库构建、Hi-C高通量测序及结果分析。结果如表3所示,在大豆,萝卜中,与传统方法相比,本发明方法产生的匹配到基因组唯一位点的双端reads的数量比例增加,PCR扩增产生的重复序列的比例降低。
结论:冷冻替换Hi-C(FS-Hi-C)文库质量要优于传统方法建库的文库质量,相同raw reads下,本发明方法检测到更多的有效reads。
表3 本发明快速冷冻交联技术能够有效提高文库有效数据量
综合本发明实施例结果,可见:本发明提出的Hi-C高通量测序的建库方法,通过改良前处理方式,证明快速冷冻然后交联并不改变物种的染色质构象,提高了Hi-C技术的应用范围,能够解决植物细胞细胞壁及液泡的存在使得交联不充分的问题。
实施例2、果蝇的Hi-C高通量测序
本实施例以果蝇为研究对象。
一、样本预处理
本发明方法:
1、将果蝇细胞系(S2细胞系)取5×10
6个细胞,在液氮中速冻并研磨成粉末。
2、将研磨物质转移至-90℃的预冷冻交联液中,然后缓慢的梯度升温:-90℃,6h;-90℃到-60℃,6h(每小时升温5℃);-60℃,6h;-60℃到-30℃,6h(每小时升温5℃);-30℃,6h;-30℃到0℃,6h(每小时升温5℃);每一个温度梯度持续6h。将细胞中的水分替换成乙醇溶液。
3、4℃3000g离心30s,将乙醇溶液去除,然后用放置在冰上预冷的NIbuffer缓冲液清洗三次,清洗完之后按照每2×10
6个细胞20mL的量加入预冷(冰上预冷)的NIbuffer形成悬浮液,并温和摇晃15min。
4、悬浮液用Miracloth(Millipore,货号:475855)过滤两次,收集滤液,4℃3000g离心15min,向离心所得上清中按照每2×10
6个细胞800μL的量加入 体积百分含量为37%的甲醛水溶液(使得甲醛在体系中的终浓度为1%体积百分含量),室温交联固定8min,加入浓度为2.5M的甘氨酸(溶剂是水)终止交联。
5、1500g,4℃离心5min回收细胞核。
传统方法:
1、收集果蝇细胞系(S2细胞系),取5×10
6个细胞;
2、每个50mL离心管提前加好45mL PBS(或无糖MS)(通风橱中每管加35μL巯基乙醇)和1.25mL的37%甲醛溶液(sigma)(终浓度1%);
3、将样品放置于上述50mL离心管中,室温下,抽真空-60到-80Kpa后,保持10min,然后恢复至大气压强。如此反复三次(标准:样品沉底!);
4、加2.5mL的2.5M甘氨酸(4℃保存)来终止交联反应(终浓度0.125M),摇匀5min,终止反应。
5、滤掉液体,ddH
2O清洗3次,将样品表面的液体用吸水纸吸干,液氮速冻,-80℃保存后,干冰运输。
6、交联后样品经液氮充分研磨成粉末,转移至含10mL冰预冷NIbuffer的50mL离心管中(NIbuffer,20mM Hepes pH 8,250mM Sucrose,1mM MgCl
2,5mM KCl,40%(v/v)glycerol,0.25%(v/v)Triton X-100,0.1mM PMSF,0.1%(v/v)β-ME,1/5cocktail),冰上温和摇晃15min。
7、悬浮液经Miracloth(Millipore)过滤两次,50mL离心管收集滤液,4℃3000g离心15min。
二、建库流程
1、弃上清,沉淀重悬于2mL NIbuffer,4℃1900g离心5min。
2、重复步骤1一次。
3、弃上清。重悬于1mL 1.2×NEBuffer 2,4℃1900g离心5min。
4、弃上清,重悬于500μL 1.2×NEBuffer 2。
5、加入7.5μL 20%(w/v)SDS。65℃孵育40min。37℃800rpm摇床孵育20min。
6、加入50μL 20%(v/v)Triton X-100,37℃800rpm摇床孵育1h。
7、加入50μL NEB buffer3.1,10μL DpnII,37℃旋转过夜。
8、62℃灭活DpnII 20分钟。
9、加生物素标记的DNA(1.5μL 10mM dATP,1.5μL 10mM dGTP,1.5μL 10mM dTTP,37.5μL 0.4mM biotin-14-dCTP)和10μL 5U/μL Klenow酶补平DNA,小心混匀并在37℃孵育45min。
10、取出放置冰上,加86μL 10%(w/v)SDS终止反应。65℃孵育30min,之后迅速放置冰上。
11、补平DNA,在冰上加7.61mL成环反应体系(745μL 10%(v/v)Triton X-100,745μL 10×ligation buffer(500mM Tris-HCl pH 7.5,100mM MgCl
2, 100mM DTT),80μL 10mg/mL BSA,80μL 100mM ATP和5.96mL ddH
2O)至15mL离心管,转移染色质至15mL离心管。
12、加50U T4 DNA ligase,颠倒混匀,16℃反应4h,进行DNA成环。
13、加50μL 10mg/mL蛋白酶K,65℃消化过夜。之后,再加50μL10mg/mL蛋白酶K,65℃消化2h。
14、冷却至室温,转移到50mL离心管。通过酚提取纯化DNA。加10mL pH 8.0的酚,涡旋震荡2min。1500g离心10min,转移水相至新的50mL离心管。
15、通过酚氯仿(1:1体积)再次抽提纯化,涡旋震荡2min,22000g离心5min,转移水相至新的50mL离心管。
16、加入2.5倍体积无水乙醇沉淀DNA,22000g离心5min,弃上清。
17、加入450μL TE(pH 8.0)溶解DNA沉淀,转移至2mL离心管。
18、加入500μL酚氯仿(1:1体积),震荡1min,22000g离心5min,转移水相至新管。纯化两次之后,加0.1倍体积的NaOAc和2倍体积无水乙醇,-80℃孵育30min,离心去上清。
19、加入1mL 70%酒精清洗DNA,22000g离心5min。弃上清。
20、加入25μL TE重悬DNA。加1μL 1mg/mL RNase A于37℃消化30min。
21质检:取5μL样品进行1%琼脂糖胶检测。主带明显的样品为不合格,需处理后再行酶切。如呈现弥散型条带,样品继续进行后续实验。
22、利用T4 DNA聚合酶的外切酶活性,去掉未环化的生物素标记DNA。加5μg的library至含1μL 10mg/mL BSA,10μL 10×NEBuffer 2,1μL 10mM dATP,1μL 10mM dGTP和5Units T4 DNA polymerase的100μL体系中,16℃孵育4小时。
23、加2μL 0.5M EDTA pH 8.0终止反应,并通过酚氯仿(1:1,体积比)/乙醇沉淀纯化DNA。100μL ddH
2O重悬DNA。
24、利用超声波DNA破碎仪(Covaris M220)将DNA打断至200-700bp左右。
25、修复末端,加14μL 10×ligation buffer,14μL 2.5mM dNTP mix,5μL T4 DNA polymerase,5μL T4 polynucleotide kinase,1μL Klenow DNA polymerase和1μL水,20℃孵育30min。
26、通过DNA纯化试剂盒(Qiagen)进行DNA纯化。以15μL低盐TE(TLE:10mM Tris pH 8.0,0.1mM EDTA)洗脱两次,之后对DNA加A,通过5μL10×NEBuffer 2,10μL 1mM dATP,2μL水和3μL Klenow(exo-),37℃孵育20分钟。
27、65℃孵育20min终止反应,并迅速放冰上。真空浓缩至20μL。
28、用1.5%的琼脂糖凝胶80-90v跑3.5小时回收DNA。挑选300-500bp的DNA切胶并通过胶回收试剂盒进行回收。以50μL TLE洗脱。汇总洗脱液至300μL。通过Qubit计算DNA总量。
29、接下来的实验推荐在低DNA吸附管(DNA LoBind tubes)中进行。
30、通过磁珠法进行生物素DNA富集,详细参照试剂盒说明操作(Agencourt AMPure eads,Beckman)。具体来说:
A.用400μL Tween缓冲液(TB:5mM Tris-HCl pH 8.0,0.5mM EDTA,1M NaCl,0.05%Tween)洗涤150μL重悬浮的链霉亲和素磁珠两次来制备生物素下拉磁珠。
B.将磁珠重悬于300μL 2×不含Tween的缓冲液(2×NTB:10mM Tris-HCl pH 8.0,1mM EDTA,2M NaCl)中,并加入步骤28中所得的300μL Hi-C DNA。在室温下边旋转边孵育15分钟,使生物素标记的Hi-C DNA与带有链霉亲和素的磁珠结合。
C.用磁性粒子集中器回收与DNA结合的链霉亲和素磁珠,并弃去上清液。依次用400μL 1×NTB(5mM Tris-HCl pH 8.0,0.5mM EDTA,1M NaCl)和100μL1×ligation buffer洗涤珠子。接着将珠重悬于50μL 1×ligation buffer中,并将混合物转移到新管中。
D.将磁珠用Gibson试剂盒缓冲液重悬,并加入T5外切酶37℃消化10min(参考文献:enzymatic assembly of DNA molecules up to several hundred kilobases)。
E.将在步骤28中计算出的DNA总量作为生物素下拉的输入量,并将其除以20估计被拉下且可用于接头连接及测序的Hi-C DNA总量。按每微克可用于连接的Hi-C DNA添加6皮摩尔的Illumina双末端接头的比例添加连接物。使用1200单位T4 DNA连接酶将接头连接到DNA上。在室温下孵育2小时。
F.回收与Hi-C DNA结合的磁珠并用400μL 1×TB洗涤两次,从而去除多余的双末端接头。
G.依次用200μL 1×NTB、200μL和50μL 1×NEBuffer洗涤磁珠。最后一次洗涤后,将其重悬于50μL 1×NEBuffer 2并转移到新管。
H.为了确定合适的循环次数以产生足够的可进行测序的PCR产物,分别以6、9、12和15个循环进行PCR试验。通过在5%聚丙烯酰胺凝胶上进行PCR反应并用Sybr Green染色确定最佳循环数,确保不存在杂带,并且在400-600个碱基对区段有条带出现,因为这是Hi-C DNA与接头相连后大致的片段长度。
I.以最佳PCR循环次数进行大规模PCR以扩增其余与磁珠结合的Hi-C DNA文库。分别收集PCR产物并回收磁珠。取1%上述PCR产物于凝胶上作为对照,并用1.8倍体积的Ampure珠纯化剩余的PCR产物。
J.用50μL 1×TLE缓冲液洗脱纯化后的PCR产物,即为构建好的Hi-C文库。
通过上述操作,获得两份本发明方法获得的Hi-C文库(记为S2-1和S2-5),传统方法获得的Hi-C文库(记为SRR7187330和SRR7187332)。
三、Hi-C高通量测序
采用Illumina:HiSeq X Ten测序仪进行pair-end测序(双端测序)。
四、Hi-C文库质量鉴定
对本发明冷冻替换Hi-C(FS-Hi-C)和传统方法所得的4份Hi-C文库的高通量测序数据进行评估(S2-1和S2-5为本发明方法构建的文库,SRR7187330和SRR7187332为传统方法构建的文库)。利用高效率的Hi-C数据预处理工具HiC-Pro进行数据质量分析,质控结果如表4所示。该结果显示,与传统方法相比,本发明得到的可信的双端测序片段数量、比对到基因组唯一位点的双端测序片段数量及比率、有相互作用的双端测序片段数量、有效相互作用的双末端测序片段的数量均显著提高。
结论:本发明方法的Hi-C文库质量要优于传统方法建库的文库质量。
表4 本发明和传统方法的Hi-C文库的高通量测序数据通过软件HiC-Pro生成的质控结果
本发明冷冻替换Hi-C(FS-Hi-C)和传统Hi-C方法相比,2L、2R、3L、3R、4和X染色体的Hi-C map均呈现出高度的相似性。FS-Hi-C的文库质量高,有效数据多,检测到的DNA互作频率增加(图9)。raw reads相近的情况下,两种Hi-C方法相比,A和B compartment整体分布相似(图10)。与传统Hi-C方法相比,FS-Hi-C检测到的相同染色体中的TAD整体分布相似(图11)。这些结果说明FS-Hi-C方法不影响果蝇的染色质三维构象。
综合本发明两个实施例结果,可见:本发明提出一种适用于生物的冷冻替换Hi-C(FS-Hi-C)的建库方法,通过对组织样品依次进行液氮冷冻处理、预交联梯度升温和交联,可以有效维持活细胞中的染色质构象,极大地提高了生物的Hi-C数据质量。本研究打破了Hi-C处理需要对新鲜样品进行甲醛固定后再交联的限制,可以将样品保存在液氮中以备后期使用,为Hi-C在生物中的应用提供了一种改进的方法。
工业应用
本发明创造性开发了一种简单的冷冻替换Hi-C技术(Frozen Substitute-Hi-C),该方法不仅适用于新鲜样品,也适用于储存在液氮中的冷冻样品。该技术将冷冻替换(Frozen Substitute)技术应用于液氮(LN2)冷冻样品的Hi-C样品制备中,避免样品从液氮溶液中移出后在升温过程中产生三维结构变化。该方法有效地将Hi-C样品的应用范围从新鲜样品扩展到冰冻样品,解决了样品取样困难,无法长距离运输的问题。值得注意的是,在植物中,冷冻替换(FS)步骤之前,本发明在液氮中研磨样品,能够去除对植物交联有不利影响的植物细胞壁。使用冷冻替换Hi-C将样品细胞内的液态水替换为乙醇,使得冷冻样品在升温过程中保持不变的染色质构象。本发明在果蝇、棉花、大豆,萝卜中检验了这种新方法,通过透射电镜观察发现,冷冻替换(FS)样品和新鲜样品的染色质结构高度相似。与传统Hi-C相比,染色质互作、A/B compartment和TAD等方面表现出高度的一致性。但FS-Hi-C方法构建的文库质量更好,库容增加,有效互作的数据更多,PCR扩增产生的重复数据降低。该技术降低了对样本时效性的要求,方便了样本的采集和保存。本发明突破了Hi-C试验需要新鲜样品的限制,提高了文库数据质量,维持了染色质构象,为进一步探索基因调控和三维基因组结构铺平道路。
本发明的技术关键在于先使用液氮速冻研磨样品后进行预交联并梯度升温,然后再进行交联固定。本发明评估了冷冻替换Hi-C(FS-Hi-C)的文库质量,发现在棉花和果蝇中,新鲜样品与冷冻后梯度升温后的样品之间三维构象相似;而且使用“冷冻替换”的方式进行样品的前处理,能够提高棉花和果蝇Hi-C文库的数据质量。本发明将打破Hi-C试验中必须使用新鲜样品的局限性,为Hi-C在生物研究中的应用提供更为广阔的前景。
本发明提出一种适用于生物,如果蝇、棉花、大豆、萝卜的Hi-C实验流程,通过创新优化的前处理方法,解决了生物细胞交联质量不高,生物样品建库差的技术问题,显著提高了所构建Hi-C文库的有效数据量。
与现有技术相比,本发明的优势主要在于:
(1)在生物实验中,冷冻替换Hi-C(FS-Hi-C)不影响染色体的三维构象。
(2)在生物实验中,冷冻替换Hi-C(FS-Hi-C)显著提高了Hi-C的质量。
(3)在生物实验中,冷冻替换Hi-C(FS-Hi-C)有效地将Hi-C样品的适用范围从新鲜样品扩展到冰冻样品,解决了样品取样困难,无法长距离运输的问题。
(4)在棉花和果蝇样本中,冷冻替换Hi-C(FS-Hi-C)对染色体3D结构没有明显影响。
本发明的FS-Hi-C技术显著的提高了所构建Hi-C文库的有效数据量。适用于生物的Hi-C文库构建。
Claims (22)
- 一种适用于Hi-C高通量测序建库的样本前处理方法,包括如下步骤:将供试生物样本先冷冻再交联固定。
- 根据权利要求1所述的方法,其特征在于:所述冷冻为零下196℃低温处理。
- 根据权利要求2所述的方法,其特征在于:所述冷冻为将所述供试生物样本置于液氮中进行速冻。
- 根据权利要求1-3中任一所述的方法,其特征在于:所述交联固定为利用甲醛溶液进行交联固定。
- 根据权利要求1-3中任一所述的方法,其特征在于:在所述冷冻和所述交联固定之间还包括将所述供试生物样本进行预冷冻交联和梯度升温的过程。
- 根据权利要求1-5中任一所述的方法,其特征在于:所述方法包括将所述供试生物样本依次进行如下处理的步骤:(A1)液氮速冻研磨;(A2)预冷冻交联;(A3)梯度升温;(A4)离心,加NIbuffer过滤;(A5)甲醛交联固定;(A6)甘氨酸终止交联;(A7)离心回收细胞核。
- 根据权利要求6所述的方法,其特征在于:在步骤(A1)中,将所述供试生物样本放入液氮进行速冻,并在液氮中研磨成粉末。
- 根据权利要求5-7中任一所述的方法,其特征在于:权利要求5中所述的预冷冻交联是在预冷冻交联液中进行的,所述预冷冻交联液为含有2%水、0.01%甲醛的酒精溶液,%表示体积百分含量;在权利要求6或7的步骤(A2)中,将(A1)研磨的粉末转移到-90℃的所述预冷冻交联液。
- 根据权利要求5-8中任一所述的方法,其特征在于:所述梯度升温为-90℃6h、-60℃6h、-30℃6h、0℃6h。
- 根据权利要求9所述的方法,其特征在于:在所述梯度升温的过程中,从-90℃升至-60℃是每小时升温5℃,从-60℃升至-30℃是每小时升温5℃,从-30℃升至0℃是每小时升温5℃。
- 根据权利要求5-10中任一所述的方法,其特征在于:在步骤(A4)中,所述NIbuffer的配方如下:20mM Hepes pH 8,250mM蔗糖,1mM氯化镁 2,5mM氯化钾,体积百分含量40%的甘油,体积百分含量0.25%的Triton X-100,0.1mM PMSF,体积百分含量0.1%的β-巯基乙醇,1/5体积分数的cocktail。
- 根据权利要求5-11中任一所述的方法,其特征在于:在步骤(A4)中,进行所述离心后去除上清,加预冷的所述NIbuffer悬浮沉淀,用Miracloth过滤,收集滤液,离心。
- 根据权利要求4-12中任一所述的方法,其特征在于:权利要求4中所述甲醛溶液为体积百分含量为37%的甲醛溶液;在权利要求5的步骤(A5)中,向(A4)离心所得上清液中加入体积百分含量为37%的甲醛水溶液进行交联固定,交联的温度为室温,时间为8min。
- 根据权利要求13所述的方法,其特征在于:进行所述交联固定时,所述甲醛在体系中的终浓度为1%体积百分含量。
- 根据权利要求5-14中任一所述的方法,其特征在于:步骤(A6)中,过加入浓度为2.5M的甘氨酸溶液终止交联。
- 一种Hi-C高通量测序建库方法,包括如下步骤:先利用权利要求1-15中任一所述方法对供试生物样本进行预处理;然后将处理后的样本进行Hi-C高通量测序建库。
- 一种Hi-C高通量测序方法,包括如下步骤:先利用权利要求1-15中任一所述方法对供试生物样本进行预处理;接着将处理后的样本进行Hi-C高通量测序建库;最后进行Hi-C高通量测序。
- 根据权利要求1-17中任一所述的方法,其特征在于:所述生物样本为细胞或组织。
- 根据权利要求1-18中任一所述的方法,其特征在于:所述生物为植物或动物。
- 根据权利要求19所述的方法,其特征在于:所述植物为棉花或大豆或萝卜;所述动物为果蝇。
- 根据权利要求20所述的方法,其特征在于:所述生物样本为棉花的愈伤组织或果蝇细胞系。
- 如下任一中的应用:(B1)权利要求1-15中任一所述的方法在Hi-C高通量测序建库中的应用;(B2)权利要求1-16中任一所述的方法在Hi-C高通量测序中的应用。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2019126508 | 2019-12-19 | ||
CNPCT/CN2019/126508 | 2019-12-19 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021120517A1 true WO2021120517A1 (zh) | 2021-06-24 |
Family
ID=76477061
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/091706 WO2021120517A1 (zh) | 2019-12-19 | 2020-05-22 | FS-Hi-C——一种能够高效捕获冷冻生物样本染色体三维构象的技术 |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2021120517A1 (zh) |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100081141A1 (en) * | 2008-08-06 | 2010-04-01 | University Of Southern California | Genome-Wide Chromosome Conformation Capture |
CN105839196A (zh) * | 2016-05-11 | 2016-08-10 | 北京百迈客生物科技有限公司 | 一种真核生物DNA的Hi-C高通量测序建库方法 |
CN106591285A (zh) * | 2015-10-19 | 2017-04-26 | 安诺优达基因科技(北京)有限公司 | 一种构建高可利用数据率的Hi-C文库的方法 |
WO2017068379A1 (en) * | 2015-10-23 | 2017-04-27 | Oxford University Innovation Limited | Method of analysing dna sequences |
CN107475394A (zh) * | 2017-08-31 | 2017-12-15 | 中国农业科学院农业基因组研究所 | 一种基于少量细胞全基因组染色质高分辨率构象技术eHi‑C2.0 |
CN107751186A (zh) * | 2017-10-25 | 2018-03-06 | 北京科兴生物制品有限公司 | 一种快速冻存及复苏细胞的方法 |
WO2018107093A1 (en) * | 2016-12-09 | 2018-06-14 | University Of Southern California | Genome-wide mapping of dna-dna proximities in the nucleus |
CN108265104A (zh) * | 2018-01-02 | 2018-07-10 | 北京诺禾致源科技股份有限公司 | 染色体构型捕捉文库及其构建方法 |
CN109913529A (zh) * | 2019-03-20 | 2019-06-21 | 嘉兴菲沙基因信息有限公司 | 一种适用于狐尾藻的Hi-C高通量测序建库方法 |
-
2020
- 2020-05-22 WO PCT/CN2020/091706 patent/WO2021120517A1/zh active Application Filing
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100081141A1 (en) * | 2008-08-06 | 2010-04-01 | University Of Southern California | Genome-Wide Chromosome Conformation Capture |
CN106591285A (zh) * | 2015-10-19 | 2017-04-26 | 安诺优达基因科技(北京)有限公司 | 一种构建高可利用数据率的Hi-C文库的方法 |
WO2017068379A1 (en) * | 2015-10-23 | 2017-04-27 | Oxford University Innovation Limited | Method of analysing dna sequences |
CN105839196A (zh) * | 2016-05-11 | 2016-08-10 | 北京百迈客生物科技有限公司 | 一种真核生物DNA的Hi-C高通量测序建库方法 |
WO2018107093A1 (en) * | 2016-12-09 | 2018-06-14 | University Of Southern California | Genome-wide mapping of dna-dna proximities in the nucleus |
CN107475394A (zh) * | 2017-08-31 | 2017-12-15 | 中国农业科学院农业基因组研究所 | 一种基于少量细胞全基因组染色质高分辨率构象技术eHi‑C2.0 |
CN107751186A (zh) * | 2017-10-25 | 2018-03-06 | 北京科兴生物制品有限公司 | 一种快速冻存及复苏细胞的方法 |
CN108265104A (zh) * | 2018-01-02 | 2018-07-10 | 北京诺禾致源科技股份有限公司 | 染色体构型捕捉文库及其构建方法 |
CN109913529A (zh) * | 2019-03-20 | 2019-06-21 | 嘉兴菲沙基因信息有限公司 | 一种适用于狐尾藻的Hi-C高通量测序建库方法 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111733214B (zh) | 一种能够高效捕获冷冻生物样本染色体三维构象的高通量测序建库技术 | |
Zylicz et al. | Chromatin dynamics and the role of G9a in gene regulation and enhancer silencing during early mouse development | |
WO2017066907A1 (zh) | 一种构建高可利用数据率的Hi-C文库的方法 | |
US20210332353A1 (en) | Argonaute protein mutant and use thereof | |
CN106591285B (zh) | 一种构建高可利用数据率的Hi-C文库的方法 | |
CN105849264A (zh) | 用于鉴别重复测序读数的组合物和方法 | |
WO2017193833A1 (zh) | 一种富集4000人类致病靶基因的方法及试剂盒 | |
CN105200041B (zh) | 用于构建单细胞转录组测序文库的试剂盒及文库构建方法 | |
WO2019006975A1 (zh) | 极小量细胞原位全基因组染色质构象捕获方法 | |
WO2020223936A1 (zh) | 一种高质量水稻核糖体印记文库的构建方法 | |
CN108265104B (zh) | 染色体构型捕捉文库及其构建方法 | |
WO2023202030A1 (zh) | 一种小分子rna的高通量测序文库构建方法 | |
JP2023506631A (ja) | 共有結合で閉端された核酸分子末端を使用したngsライブラリー調製 | |
CN112280828A (zh) | 一种降低单细胞扩增偏倚性的离体组织细胞核分离方法 | |
Lee et al. | Single-cell toolkits opening a new era for cell engineering | |
CN112941159A (zh) | 一种在全基因组水平鉴定植物基因组dna鸟嘌呤四联体位点的方法 | |
WO2021120517A1 (zh) | FS-Hi-C——一种能够高效捕获冷冻生物样本染色体三维构象的技术 | |
CN113528612A (zh) | 用于检测染色质开放位点间染色质相互作用的NicE-C技术 | |
CN109097443B (zh) | 一种用于高通量测序的捕获基因组靶序列的方法 | |
CN114276417B (zh) | 一种在植物正常生理条件下鉴定全基因组dna鸟嘌呤四联体位点的方法 | |
Grob et al. | Chromatin conformation capture-based analysis of nuclear architecture | |
CN111484976A (zh) | 肺癌循环肿瘤细胞检测试剂盒及检测系统 | |
CN114410813B (zh) | 一种在全基因组水平鉴定植物基因组dna胞嘧啶四联体位点的方法 | |
WO2020135650A1 (zh) | 一种基因测序文库的构建方法 | |
CN116376909B (zh) | 一种具有增强子和启动子活性的甘蔗复合型顺式转录调控元件及筛选方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20900936 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20900936 Country of ref document: EP Kind code of ref document: A1 |