WO2020259303A1 - Method for rapid construction of rna 3'-end gene expression library - Google Patents

Method for rapid construction of rna 3'-end gene expression library Download PDF

Info

Publication number
WO2020259303A1
WO2020259303A1 PCT/CN2020/095600 CN2020095600W WO2020259303A1 WO 2020259303 A1 WO2020259303 A1 WO 2020259303A1 CN 2020095600 W CN2020095600 W CN 2020095600W WO 2020259303 A1 WO2020259303 A1 WO 2020259303A1
Authority
WO
WIPO (PCT)
Prior art keywords
linker
sequence
double
stranded cdna
sequencing
Prior art date
Application number
PCT/CN2020/095600
Other languages
French (fr)
Chinese (zh)
Inventor
耿亮
辛文
Original Assignee
北京全式金生物技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京全式金生物技术有限公司 filed Critical 北京全式金生物技术有限公司
Publication of WO2020259303A1 publication Critical patent/WO2020259303A1/en

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1096Processes for the isolation, preparation or purification of DNA or RNA cDNA Synthesis; Subtracted cDNA library construction, e.g. RT, RT-PCR
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B50/00Methods of creating libraries, e.g. combinatorial synthesis
    • C40B50/06Biochemical methods, e.g. using enzymes or whole viable microorganisms

Definitions

  • the invention relates to the technical field of molecular biology. More specifically, it relates to a method for quickly constructing an RNA 3'gene expression library.
  • the rise of gene chip (also known as microarray) technology has made high-throughput expression detection possible.
  • This technology uses a large number of probe molecules to be immobilized on a specific support (chip) at a high density and hybridizes with the labeled sample molecules, and then obtains the number and sequence information of the sample molecules by detecting the intensity of the hybridization signal of each probe molecule .
  • This technology can detect and analyze a large number of sequences of samples at one time, meeting the needs of high-throughput gene expression detection.
  • this method due to technical constraints in sample preparation, probe synthesis and immobilization, molecular labeling, data reading and analysis, etc., this method also has many shortcomings in current applications, such as high cost, low detection sensitivity, and repetition. Poor sex and so on.
  • RNA sequencing technology can effectively study mRNA on a qualitative and quantitative level, help people understand the cellular response mechanism, and promote a series of life science research, as well as medical and pharmaceutical related research and the development of molecular diagnostic reagents.
  • RNA library construction in the field of gene expression testing can be It can be achieved by only constructing a library of the 3'-end region of the mRNA to obtain a certain length of specific information that can determine the difference between it and other molecules. Based on the above principles, there are many mature technical solutions, such as MARS-seq.
  • transposase complexes include: 1. Randomly introduce the first or second linker at both ends of the DNA fragment. For a single fragment, the sequences at both ends may be the same. It may be different; 2. The molecular end of the DNA template cannot be introduced into the linker by transposase. These two points are not a problem for DNA library construction using genome as a template, but for RNA library construction, directional double-ended linkers cannot be connected to the 3'region of mRNA molecules.
  • the first objective of the present invention is to provide a method for quickly constructing an RNA 3'end gene expression library, which can effectively realize the construction of a transcriptome library for the 3'end of eukaryotic mRNA.
  • the second object of the present invention is to provide a rapid gene expression sequencing method.
  • the present invention adopts the following technical solutions:
  • a method for quickly constructing an RNA 3'gene expression library includes the following steps:
  • transposase complex carrying the second linker to process the double-stranded cDNA, insert the second linker while fragmenting, to obtain a double-stranded cDNA fragment with the first linker and/or the second linker;
  • the method further includes the following steps:
  • the double-stranded cDNA fragments are amplified by PCR primers designed according to the first linker sequence and the second linker sequence to obtain an RNA 3'end gene expression library.
  • PCR primers designed according to the first linker sequence and the second linker sequence also include the Index sequence required for library construction and sequencing.
  • first linker and the second linker carry Index sequences required for library construction and sequencing.
  • the transposase complex includes a transposase and inserted DNA; and the transposase is Tn5 transposase or MuA transposase.
  • the insert DNA carrying the second linker is two identical DNA molecules, or the insert DNA carrying the second linker is a piece with the same second linker at both ends. DNA molecule.
  • the Oligo (dT) reverse transcription primer with a first linker has an Oligo (dT) primer with a first linker sequence at the 5'end, and a T base-containing primer at the 3'end Nucleotides.
  • the Oligo (dT) reverse transcription primer with the first linker has an Oligo (dT) primer with the first linker sequence at the 5'end, and the 3'end with the immediately adjacent mRNA A nucleotide containing anchor bases annealed in the poly(A) region.
  • the present invention provides a method for sequencing gene expression, which includes the following steps:
  • transposase complex with a second linker to process the double-stranded cDNA, insert the second linker while fragmenting, to obtain a double-stranded cDNA fragment with the first linker and/or the second linker;
  • first linker and the second linker carry Index sequences required for library building and sequencing; the sequences of the first linker and the second linker are different.
  • the present invention also provides another gene expression sequencing method, the method comprising the following steps:
  • transposase complex with a second linker to process the double-stranded cDNA, insert the second linker while fragmenting, to obtain a double-stranded cDNA fragment with the first linker and/or the second linker;
  • RNA 3'end gene expression library Amplify the double-stranded cDNA fragment with PCR primers designed according to the first linker sequence and the second linker sequence to obtain an RNA 3'end gene expression library
  • the PCR primer designed according to the sequence of the first linker and the sequence of the second linker also includes an Index sequence required for library construction and sequencing; the sequence of the first linker and the second linker are different.
  • the method provided by the present invention intends to combine the relatively mature transposase complex-based library construction technology in the field of DNA NGS library construction with the requirement for expression detection of low template mRNA, so as to realize simple, fast and efficient RNA 3' End gene expression library construction.
  • the technical route of the present invention is clear.
  • the second-strand cDNA is synthesized, and the double-strand cDNA is processed by the transposase complex with the second linker Therefore, the first linker and the second linker were successfully added through the reverse transcription and transposition steps to generate a double-stranded cDNA with different linkers at both ends of the mRNA 3'-end region, which can be carried out directly or by adding a step of library amplification followed by downstream Sequencing.
  • This method has high efficiency in library construction, and the method is simple and easy to implement. It saves time and effort. It is especially suitable for the construction of low-initiated RNA templates, especially single-cell RNA expression libraries. Research in the fields of omics and RNA sequencing has great and far-reaching significance.
  • Fig. 1 shows a schematic diagram of the principle of the present invention.
  • Figure 2 shows the electrophoresis detection results of the purified transposition products in Examples 1 and 2.
  • M refers to a DNA molecular marker of 100 bp
  • the amplified product of Example 1 is in Lane 1
  • the amplified product of Example 2 is in Lane 2.
  • Fig. 3 shows the electrophoresis diagram of the library after sorting in Example 1 through the Agilent high-sensitivity DNA chip.
  • Example 4 shows the electrophoresis diagram of the library after sorting in Example 2 through the Agilent high-sensitivity DNA chip.
  • the present invention provides a new method for quickly constructing an RNA 3'gene expression library.
  • the method includes the following steps:
  • sequences of the first linker and the second linker are different to ensure that the double-stranded cDNA derived from the 3'end region of the mRNA template has different linker sequences at both ends.
  • the linker sequence refers to the DNA sequence of the non-target nucleic acid component that provides addressing means to the nucleic acid fragment to which it is joined.
  • the mRNA sample of the present invention refers to an mRNA sample containing a polyA sequence at the 3'end of a eukaryotic organism.
  • the mRNA sample can be obtained through conventional RNA extraction, purification and enrichment steps, which are all mature technologies in the prior art, and the present invention does not impose further restrictions on this.
  • the specific technical process of the present invention is as follows: firstly, reverse transcription of the RNA sample is carried out with Oligo (dT) primer with the first linker to obtain the first strand cDNA with the first linker, and then by digesting the mRNA template, synthesizing the second
  • the double-stranded cDNA with the first linker can be obtained from the stranded cDNA; then the double-stranded cDNA is treated with the transposase complex with the second linker sequence, and the second linker is inserted while fragmenting;
  • the double-stranded cDNA in the 3'end region of the mRNA template has different first and second linker sequences at both ends, and the remaining fragments have the same second linker sequence at both ends, so there is only the 3'end region of the mRNA template
  • the downstream sequencing work can be performed, and the remaining areas will not be used.
  • the present invention can effectively realize the construction and sequencing of the transcriptome library for the 3'end of mRNA.
  • the transcriptome library obtained by the method provided by the present invention can be used in Roche, Illumina, ThermoFisher, Pacific Biosciences, BGI, Oxford Nanopore Technologies, Huayinkang, and Hanhai Gene high-throughput sequencing platforms.
  • the double-stranded cDNA fragment with the first linker and/or the second linker obtained in the present invention can be directly subjected to downstream sequencing work, or the library can be amplified by using PCR primers designed according to the first linker and the second linker sequence to realize Enrichment and amplification of the target area, and then downstream sequencing work.
  • the method when the library needs to be amplified, the method further includes the following steps: Amplify the double-stranded cDNA fragment with PCR primers designed according to the first linker sequence and the second linker sequence, Obtain the RNA 3'end gene expression library. When the amount of the original RNA sample is very low, this step of amplification can effectively expand the amount of double-stranded cDNA fragments to achieve the enrichment of the target region and signal amplification. Further preferably, the PCR primers designed according to the first linker sequence and the second linker sequence in this step also include the Index sequence required for library building and sequencing.
  • the PCR primers designed according to the first linker sequence and the second linker sequence may also include any other desired sequences, such as high-throughput single cell
  • the molecular tags and cell tags are introduced by multi-step PCR, and high throughput is achieved by multiplying the numbers of the two. Introduce any other desired sequences through the library amplification PCR step to achieve downstream sequencing work.
  • the first adapter and the second adapter when directly performing downstream sequencing work, directly carry the Index sequence required for library construction and sequencing or any sequence required for other sequencing, so as to achieve Downstream sequencing work.
  • the transposase complex in step (3) comprises a transposase and inserting DNA.
  • the transposase can be Tn5, MuA or other transposable functions.
  • the insert carrying the second linker may be two identical DNA molecules; or the insert carrying the second linker may also be a piece of DNA with the same second linker sequence at both ends.
  • the molecule is formed at this time as a circular transposon complex (see the patent application "A circular transposon complex and its application", application number 201710013203.8).
  • the reverse transcription primer used when synthesizing the first-strand cDNA of the present invention is an Oligo(dT) primer with a first linker sequence at the 5'end; The nucleotides of anchor bases annealed in the poly(A) region.
  • This primer can bind to all the mRNA components in the eukaryotic total RNA sample to initiate reverse transcription.
  • the reverse transcription primer used when synthesizing the first-strand cDNA of the present invention is an Oligo (dT) primer with a first linker sequence at the 5'end; the 3'end contains T-base nucleotides.
  • RNA capture or ribosomal RNA removal usually first require mRNA capture or ribosomal RNA removal, and then reverse transcription, followed by more cumbersome traditional library construction procedures (including fragmentation, end repair and A, linker connection, library Steps such as amplification, as well as multi-step product sorting and purification).
  • This method is complicated in steps, time-consuming and laborious, and has a high probability of operation contamination. It requires a high initial amount of RNA samples. For a small amount of RNA template, especially single-cell RNA library construction, the effect is often not ideal.
  • the present invention uses the technical route of reverse transcription—synthesis of double-stranded cDNA—in vitro transposition.
  • the 3'-end region of the mRNA template is successfully constructed into a library with different linker sequences at both ends.
  • This method has fewer steps, simple and flexible operation, and is particularly suitable for low-initiated RNA library building, such as single-cell RNA library building and sequencing.
  • the present invention also provides two methods for sequencing gene expression.
  • the first sequencing method includes the following steps:
  • transposase complex with a second linker to process the double-stranded cDNA, insert the second linker while fragmenting, to obtain a double-stranded cDNA fragment with the first linker and/or the second linker;
  • first linker and the second linker carry Index sequences required for library building and sequencing; the sequences of the first linker and the second linker are different.
  • the second sequencing method includes the following steps:
  • transposase complex with a second linker to process the double-stranded cDNA, insert the second linker while fragmenting, to obtain a double-stranded cDNA fragment with the first linker and/or the second linker;
  • RNA 3'end gene expression library Amplify the double-stranded cDNA fragment with PCR primers designed according to the first linker sequence and the second linker sequence to obtain an RNA 3'end gene expression library
  • the PCR primer designed according to the sequence of the first linker and the sequence of the second linker also includes an Index sequence required for library construction and sequencing; the sequence of the first linker and the second linker are different.
  • Example 1 Enter the paired-end sequencing adapter by PCR after transposition method
  • the specific steps are conventional mRNA enrichment steps.
  • an mRNA purification kit (Beijing Quanshijin Biology, catalog number: EC801) can be used, and the steps are omitted here.
  • first-strand cDNA transcription synthesis kit (Beijing Quanshijin Biology, catalog number: AT301) for this operation:
  • Oligo(T)-VN1 is an Oligo(dT) reverse transcription primer with a first linker, the sequence is: GACGTGTGCTCTTCCGATCTAAAAAAAAAAAAAAAAAAAAAAVN (shown in SEQ ID NO.1), where V represents the three bases of G/A/C Any one of the four bases, N represents any one of the four bases G/A/C/T.
  • the synthesis of the second strand is directly performed, for example, the first strand cDNA transcription synthesis kit of Quanshijin Company (Beijing Quanshijin Biology, catalog number: AT302) is used for this step.
  • the Tn5 transposition complex Tn5-S containing the same short linker (prepared by annealing Oligo A and Oligo B) is used to directly insert the double-stranded cDNA.
  • the sequence of Oligo A is: TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG (As shown in SEQ ID NO. 2);
  • Oligo B's sequence is: CTGTCTCTTATACACATCT (as shown in SEQ ID NO. 3).
  • the steps to insert are:
  • the transposase Tn5 can react slowly at room temperature, so the reaction mixture should be added on ice, and the incubation reaction should be performed immediately after pipetting and mixing.
  • MagicPureTM Size Selection DNA Beads (Beijing Quanshijin Biotech, catalog number: EC401) is used for purification.
  • the purified product was subjected to electrophoresis detection. The detection result is shown in Figure 2. It can be seen that the purified transposition product is mainly distributed in the range of 300-500bp, and the fragmentation effect is good.
  • the sequence of the upstream primer Primer-F is: CAAGCAGAAGACGGCATACGAGAT[index-i5]GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT; where [index-i5] is inserted into the original sequence of the upstream primer Primer-F CAAGCAGAAGACGGCATACGAGATGTGACTGGAGTTCAGACGTG.TGCTTCCGIDATCAGACGNO. The position is shown in the sequence of the upstream primer Primer-F.
  • the sequence of the downstream primer Primer-R is: AATGATACGGCGACCACCGAGATCTACAC[index-i7]TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG.
  • [index-i7] is inserted in the middle of the original sequence AATGACGGCGACCACCGAGATCTACACTCGTCGGCAGCGTCAGATGTGTATAAGAGACAG (shown in SEQ ID NO. 5) of the downstream primer Primer-R, and the specific insertion position is shown in the sequence of the primer-R of the lower primer.
  • the transposition reaction product is not a complete double-stranded DNA. Incubate at 72°C for 3 minutes to generate a mature PCR template.
  • MagicPureTM Size Selection DNA Beads (Beijing Quanshijin Biotech, catalog number: EC401) is used in this embodiment.
  • the Agilent 2100 high-sensitivity DNA chip was used to detect the sorted products. The test results are shown in Figure 3. It can be seen that the amplified products are mainly distributed in the range of 300-500bp after sorting, and the size distribution and peak shape are normal.
  • the synthesis of the first strand cDNA is performed.
  • the first-strand cDNA transcription synthesis kit (Beijing Quanshijin Biology, catalog number: AT301) was used to perform this operation:
  • Oligo(T)-VN2 is Oligo(dT) reverse transcription primer 2 with a first linker, and its sequence is: CAAGCAGAAGACGGCATACGAGAT[index-i5]GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCTAAAAAAAAAAAAAAAAAAAAAAVN.
  • [index-i5] plays a role in distinguishing the data of each sample.
  • Each sample has a different index.
  • the data of the sample will be distinguished by the difference of the index; [index-i5] is inserted in the with
  • the original sequence of the Oligo (dT) reverse transcription primer of the first linker 2 CAAGCAGAAGACGGCATACGAGATGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCTAAAAAAAAAAAAAAAAAVN (shown in SEQ ID NO. 6) in the middle, the specific insertion position is as in the Oligo (dT) reverse transcription primer 2 with the first linker The sequence is shown.
  • V at the end of the sequence represents any of the three bases of G/A/C
  • N represents any of the four bases of G/A/C/T.
  • the Tn5 transposable complex Tn5-L containing the same long linker (prepared by the annealing of Oligo C and Oligo B) is used to directly insert the double-stranded cDNA;
  • the sequence of Oligo C is: CAAGCAGAAGACGGCATACGAGAT [index-i7] GTCTCGTGGGCTCGGAGATGTGTATA AGAGACAG; [index-i7] is inserted in the original Oligo C sequence CAAGCAGAAGACGGCATACGAGATGTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG (shown in SEQ ID NO. 7), and the specific insertion position is shown in the sequence of Oligo C.
  • the steps to insert are:
  • the transposase Tn5 can react slowly at room temperature, so the reaction mixture should be added on ice, and the incubation reaction should be performed immediately after pipetting and mixing.
  • MagicPureTM Size Selection DNA Beads (Beijing Quanshijin Biotech, catalog number: EC401) is used.
  • the Agilent 2100 high-sensitivity DNA chip was used to detect the sorted products. The test results are shown in Figure 4. It can be seen that the amplified products are mainly distributed in the range of 300-500bp after sorting, and the size distribution and peak shape are normal. .

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Biochemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Microbiology (AREA)
  • General Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Analytical Chemistry (AREA)
  • Biomedical Technology (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Plant Pathology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Immunology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

Disclosed is a method for rapid construction of an RNA 3'-end gene expression library, comprising: using Oligo(dT) reverse transcription primer with a first adapter to perform reverse transcription on an mRNA sample to obtain a first-strand cDNA with the first adapter; using the first-strand cDNA as a template to synthesize a second-strand cDNA to obtain a double-stranded cDNA with the first adapter; using a transposase complex with a second adapter to process the double-stranded cDNA, such that the second adapter is inserted during fragmentation to obtain a double-stranded cDNA fragment with the first adapter and/or the second adapter. The sequences of the first adapter and the second adapter are not completely the same. The method provided by the present invention can effectively achieve the construction of a transcriptome library for the 3'-end of eukaryotic mRNA.

Description

一种快速构建RNA 3’端基因表达文库的方法Method for quickly constructing RNA 3'end gene expression library 技术领域Technical field
本发明涉及分子生物学技术领域。更具体地,涉及一种快速构建RNA 3’端基因表达文库的方法。The invention relates to the technical field of molecular biology. More specifically, it relates to a method for quickly constructing an RNA 3'gene expression library.
背景技术Background technique
随着后基因组时代的来临,基因表达与转录组学研究作为一种强有力的工具,在现今的生命科学研究中占有日益重要的地位。它可以帮助人们深入了解基因结构,揭示特定条件下的基因表达情况,以及生物体应对外界环境变化的调控机制。不同于DNA水平的研究,针对mRNA的表达文库构建与检测可以更加准确、直接地反映生物个体、组织乃至单细胞水平的表达差异;作为蛋白质组学的上游研究范畴,基因表达检测在现今的基础研究与精准医疗、药物研发等领域发挥着日益重要的作用。With the advent of the post-genome era, gene expression and transcriptomics research, as a powerful tool, occupies an increasingly important position in today's life science research. It can help people understand gene structure in depth, reveal gene expression under specific conditions, and the regulatory mechanism of organisms to respond to changes in the external environment. Different from the research at the DNA level, the construction and detection of mRNA expression library can more accurately and directly reflect the expression differences at the level of individual organisms, tissues and even single cells; as the upstream research field of proteomics, gene expression detection is based on the current foundation Research and precision medicine, drug development and other fields are playing an increasingly important role.
早期的基因表达检测,其方法主要是依据在细胞或生物体中观察到的生物化学或特定表型的变化来判断特定基因是否表达;之后随着生物化学技术的不断发展,针对特定表达产物和蛋白分子的检测方法也被应用于定性或半定量地检测基因表达,如Western Blot技术。另一方面,分子生物学的迅猛发展,为表达检测提供了强有力的工具,最具代表性和里程碑式意义的技术当首推RT-PCR与RT-qPCR技术,研究人员可以利用该技术构建表达文库,更加灵敏、准确地定量检测特定基因的表达水平。但这些技术的共同局限,是在同一时间只能获得有限的数据量,面对日益增加的高通量、大样本检测需求则力有未逮。Early gene expression detection methods were mainly based on the biochemical or specific phenotypic changes observed in cells or organisms to determine whether a specific gene was expressed; later, with the continuous development of biochemical technology, specific expression products and Protein molecule detection methods have also been applied to qualitatively or semi-quantitatively detect gene expression, such as Western Blot technology. On the other hand, the rapid development of molecular biology has provided powerful tools for expression detection. The most representative and landmark technologies are RT-PCR and RT-qPCR technologies, which researchers can use to construct Expression library, more sensitive and accurate quantitative detection of specific gene expression level. But the common limitation of these technologies is that they can only obtain a limited amount of data at the same time, and they are unable to meet the increasing demand for high-throughput and large-sample testing.
基因芯片(又称微阵列)技术的兴起,使得高通量表达检测成为可能。该技术通过将大量探针分子以高密度固定于特定支持物(芯片)上后与标记的样品分子进行杂交,再通过检测每个探针分子的杂交信号强度进而获取样品分子的数量和序列信息。该技术可一次性对样品大量序列进行检测和分析,满足高通量基因表达检测需求。但是,受制于在样品制备、探针合成与固定、分子标记、数据读取与分析等各环节的技术制约,该方法在当前应用中也存在诸多不足,如成本高昂、检测灵敏度较低、重复性差等等。The rise of gene chip (also known as microarray) technology has made high-throughput expression detection possible. This technology uses a large number of probe molecules to be immobilized on a specific support (chip) at a high density and hybridizes with the labeled sample molecules, and then obtains the number and sequence information of the sample molecules by detecting the intensity of the hybridization signal of each probe molecule . This technology can detect and analyze a large number of sequences of samples at one time, meeting the needs of high-throughput gene expression detection. However, due to technical constraints in sample preparation, probe synthesis and immobilization, molecular labeling, data reading and analysis, etc., this method also has many shortcomings in current applications, such as high cost, low detection sensitivity, and repetition. Poor sex and so on.
二代测序技术的蓬勃发展,特别是RNA测序技术的进步,迅速推进了转录组学研究的广泛应用。目前RNA测序技术已可以在定性与定量层面上对mRNA进行有效研究,帮助人们了解细胞应答机制,推进了一系列生命科学研究乃至医学、药学相关研究与分子诊断试剂开发。The vigorous development of second-generation sequencing technology, especially the advancement of RNA sequencing technology, has rapidly promoted the widespread application of transcriptomics research. At present, RNA sequencing technology can effectively study mRNA on a qualitative and quantitative level, help people understand the cellular response mechanism, and promote a series of life science research, as well as medical and pharmaceutical related research and the development of molecular diagnostic reagents.
由于基因表达检测往往更多关注的是特定基因的表达水平,或者说是特定mRNA分子的绝对数量与相对数量,而非其全长精确序列,因此RNA建库在基因表达检测领域的应用,可通过仅对mRNA的3’端区域进行建库,获得一定长度的、可确定其与其它分子差异的特异性信息即可实现。基于上述原理,目前已有多种成熟的技术方案,如MARS-seq等。Because gene expression testing often focuses more on the expression level of specific genes, or the absolute number and relative number of specific mRNA molecules, rather than its full-length exact sequence, the application of RNA library construction in the field of gene expression testing can be It can be achieved by only constructing a library of the 3'-end region of the mRNA to obtain a certain length of specific information that can determine the difference between it and other molecules. Based on the above principles, there are many mature technical solutions, such as MARS-seq.
但这些方案的一点共性,都是通过mRNA分子的3’端poly(A)结构对其进行反转录,进而转化为双链cDNA,再进行传统的DNA文库构建。此方法需要经历末端修复与磷酸化、加A、连接接头、文库放大以及多步纯化等一系列繁琐的操作,这些操作对于较大 起始量的RNA样本尚可接受,但对于以单细胞RNA建库为代表的极低模板量mRNA建库,特别是对于某些表达水平极低的mRNA来说,其分子数目很少甚至是单拷贝级别,则十分容易导致信息的丢失。However, one common feature of these schemes is that they are reverse transcribed through the 3'end poly(A) structure of the mRNA molecule, and then converted into double-stranded cDNA, which is then constructed by traditional DNA library. This method requires a series of tedious operations such as end repair and phosphorylation, addition of A, ligation of adapters, library amplification, and multi-step purification. These operations are acceptable for larger amounts of RNA samples, but for single-cell RNA The construction of a library is represented by extremely low template mRNA. Especially for some mRNAs with extremely low expression levels, the number of molecules is small or even at the level of single copy, which can easily lead to the loss of information.
值得注意的是,目前针对DNA的NGS文库构建,已有成熟的基于转座酶复合物的方法,如Epicentre公司的第二代DNA测序试剂盒系列产品(TRANSPOSON END COMPOSITIONS AND METHODS FOR MODIFYING NUCLEIC ACIDS.United States Patent Application Pub.No.US 20110287435 A1.),以及国内学者对以上建库方法做出的改进(中国发明专利:201710013203.8;发明名称:一种环状转座子复合物及其应用)。该方法较之传统的DNA文库构建方案,步骤更加简单,易于操作,大幅缩短了建库操作时间;但此方法在RNA建库方向上尚无成熟应用的先例。It is worth noting that there are mature methods based on transposase complexes for DNA NGS library construction, such as Epicentre's second-generation DNA sequencing kit series products (TRANSPOSONEND COMPOSITIONS AND METHODS FOR MODIFYING NUCLEIC ACIDS. United States Patent Application Pub. No. US 20110287435 A1.), as well as improvements made by domestic scholars to the above library construction method (Chinese invention patent: 201710013203.8; invention title: a circular transposon complex and its application). Compared with the traditional DNA library construction scheme, this method has simpler steps, is easy to operate, and greatly shortens the operation time of library construction; however, this method has no precedent for mature application in the direction of RNA library construction.
作为一种相对新颖的DNA建库技术,转座酶复合物的两个特点包括:1、随机在DNA片段两端引入第一或第二接头,对单个片段而言,两端的序列可能相同也可能不同;2、DNA模板的分子末端无法被转座酶引入接头。这两点对于以基因组为模板的DNA建库而言不是问题,但对于RNA建库而言,无法对mRNA分子3’区域进行有方向性的双端不同接头连接。As a relatively novel DNA library construction technology, the two characteristics of transposase complexes include: 1. Randomly introduce the first or second linker at both ends of the DNA fragment. For a single fragment, the sequences at both ends may be the same. It may be different; 2. The molecular end of the DNA template cannot be introduced into the linker by transposase. These two points are not a problem for DNA library construction using genome as a template, but for RNA library construction, directional double-ended linkers cannot be connected to the 3'region of mRNA molecules.
如能解决此问题,则可将简单、快速、高效的转座酶建库技术与转录组建库技术相结合,开发出强有力的RNA 3’端基因表达文库构建与测序技术,对转录组学、RNA测序领域研究具有重大深远的实际意义。If this problem can be solved, simple, fast, and efficient transposase library construction technology can be combined with transcription construction library technology to develop powerful RNA 3'end gene expression library construction and sequencing technology, which is very useful for transcriptomics. The research in the field of RNA sequencing has great and far-reaching practical significance.
发明内容Summary of the invention
本发明的第一个目的在于提供一种快速构建RNA 3’端基因表达文库的方法,该方法可有效实现针对真核生物mRNA 3’端的转录组文库构建。The first objective of the present invention is to provide a method for quickly constructing an RNA 3'end gene expression library, which can effectively realize the construction of a transcriptome library for the 3'end of eukaryotic mRNA.
本发明的第二个目的在于提供一种可快速的基因表达的测序方法。The second object of the present invention is to provide a rapid gene expression sequencing method.
为达到上述目的,本发明采用下述技术方案:To achieve the above objective, the present invention adopts the following technical solutions:
一种快速构建RNA 3’端基因表达文库的方法,所述方法包括以下步骤:A method for quickly constructing an RNA 3'gene expression library, the method includes the following steps:
使用带有第一接头的Oligo(dT)反转录引物对mRNA样本进行反转录,得到带有第一接头的第一链cDNA;Use Oligo (dT) reverse transcription primer with the first linker to reverse transcription of the mRNA sample to obtain the first strand cDNA with the first linker;
以第一链cDNA为模板合成第二链cDNA,获得带有第一接头的双链cDNA;Synthesize the second-strand cDNA using the first-strand cDNA as a template to obtain the double-stranded cDNA with the first linker;
使用携带第二接头的转座酶复合物处理双链cDNA,在片段化的同时插入第二接头,得到带有第一接头和/或第二接头的双链cDNA片段;Use the transposase complex carrying the second linker to process the double-stranded cDNA, insert the second linker while fragmenting, to obtain a double-stranded cDNA fragment with the first linker and/or the second linker;
其中,所述第一接头与第二接头的序列不相同。Wherein, the sequence of the first linker and the second linker are not the same.
此外,优选的方案是,所述方法还包括以下步骤:In addition, a preferred solution is that the method further includes the following steps:
以根据第一接头序列和第二接头序列设计的PCR引物对双链cDNA片段进行扩增,得到RNA 3’端基因表达文库。The double-stranded cDNA fragments are amplified by PCR primers designed according to the first linker sequence and the second linker sequence to obtain an RNA 3'end gene expression library.
此外,优选的方案是,根据第一接头序列和第二接头序列设计的PCR引物还包括建库和测序所需的Index序列。In addition, a preferred solution is that the PCR primers designed according to the first linker sequence and the second linker sequence also include the Index sequence required for library construction and sequencing.
此外,优选的方案是,所述第一接头和第二接头中带有建库和测序所需的Index序列。In addition, a preferred solution is that the first linker and the second linker carry Index sequences required for library construction and sequencing.
此外,优选的方案是,所述转座酶复合物包含转座酶和插入DNA;所述转座酶为Tn5转座酶或MuA转座酶。In addition, a preferred solution is that the transposase complex includes a transposase and inserted DNA; and the transposase is Tn5 transposase or MuA transposase.
此外,优选的方案是,所述转座酶复合物中,携带第二接头的插入DNA为两条一样的DNA分子,或携带第二接头的插入DNA为一条两端带有相同第二接头的DNA分子。In addition, a preferred solution is that in the transposase complex, the insert DNA carrying the second linker is two identical DNA molecules, or the insert DNA carrying the second linker is a piece with the same second linker at both ends. DNA molecule.
此外,优选的方案是,带有第一接头的Oligo(dT)反转录引物,其5’端为带有第一接头序列的Oligo(dT)引物,其3’末端为含T碱基的核苷酸。In addition, a preferred solution is that the Oligo (dT) reverse transcription primer with a first linker has an Oligo (dT) primer with a first linker sequence at the 5'end, and a T base-containing primer at the 3'end Nucleotides.
此外,优选的方案是,带有第一接头的Oligo(dT)反转录引物,其5’端为带有第一接头序列的Oligo(dT)引物,其3’末端为带有与紧邻mRNA poly(A)区域发生退火的含锚定碱基的核苷酸。In addition, a preferred solution is that the Oligo (dT) reverse transcription primer with the first linker has an Oligo (dT) primer with the first linker sequence at the 5'end, and the 3'end with the immediately adjacent mRNA A nucleotide containing anchor bases annealed in the poly(A) region.
根据本发明的第二个目的,本发明提供一种基因表达的测序方法,所述方法包括以下步骤:According to the second object of the present invention, the present invention provides a method for sequencing gene expression, which includes the following steps:
使用带有第一接头的Oligo(dT)反转录引物对RNA样本进行反转录,得到带有第一接头的第一链cDNA;Use Oligo (dT) reverse transcription primer with the first adapter to reverse transcription of the RNA sample to obtain the first strand cDNA with the first adapter;
以第一链cDNA为模板合成第二链cDNA,获得带有第一接头的双链cDNA;Synthesize the second-strand cDNA using the first-strand cDNA as a template to obtain the double-stranded cDNA with the first linker;
使用带有第二接头的转座酶复合物处理双链cDNA,在片段化的同时插入第二接头,得到带有第一接头和/或第二接头的双链cDNA片段;Use the transposase complex with a second linker to process the double-stranded cDNA, insert the second linker while fragmenting, to obtain a double-stranded cDNA fragment with the first linker and/or the second linker;
利用带有第一接头和/或第二接头的双链cDNA片段进行测序工作;Use the double-stranded cDNA fragment with the first linker and/or the second linker for sequencing;
其中,所述第一接头和第二接头中带有建库和测序所需的Index序列;所述第一接头与第二接头的序列不相同。Wherein, the first linker and the second linker carry Index sequences required for library building and sequencing; the sequences of the first linker and the second linker are different.
根据本发明的第二个目的,本发明还提供另一种基因表达的测序方法,所述方法包括以下步骤:According to the second objective of the present invention, the present invention also provides another gene expression sequencing method, the method comprising the following steps:
使用带有第一接头的Oligo(dT)反转录引物对RNA样本进行反转录,得到带有第一接头的第一链cDNA;Use Oligo (dT) reverse transcription primer with the first adapter to reverse transcription of the RNA sample to obtain the first strand cDNA with the first adapter;
以第一链cDNA为模板合成第二链cDNA,获得带有第一接头的双链cDNA;Synthesize the second-strand cDNA using the first-strand cDNA as a template to obtain the double-stranded cDNA with the first linker;
使用带有第二接头的转座酶复合物处理双链cDNA,在片段化的同时插入第二接头,得到带有第一接头和/或第二接头的双链cDNA片段;Use the transposase complex with a second linker to process the double-stranded cDNA, insert the second linker while fragmenting, to obtain a double-stranded cDNA fragment with the first linker and/or the second linker;
以根据第一接头序列和第二接头序列设计的PCR引物对双链cDNA片段进行扩增,得到RNA 3’端基因表达文库;Amplify the double-stranded cDNA fragment with PCR primers designed according to the first linker sequence and the second linker sequence to obtain an RNA 3'end gene expression library;
将RNA 3’端基因表达文库进行测序工作;Sequencing the RNA 3'gene expression library;
其中,根据第一接头序列和第二接头序列设计的PCR引物还包括建库和测序所需的Index序列;所述第一接头与第二接头的序列不相同。Wherein, the PCR primer designed according to the sequence of the first linker and the sequence of the second linker also includes an Index sequence required for library construction and sequencing; the sequence of the first linker and the second linker are different.
本发明的有益效果如下:The beneficial effects of the present invention are as follows:
传统的转录组文库构建因操作繁琐、步骤过多而容易导致信息丢失,而且针对低模板量mRNA(如单细胞RNA)的表达文库构建困难。本发明提供的该方法拟将DNA NGS文库构建领域已较为成熟的基于转座酶复合物的建库技术与针对低模板量mRNA的表达检测需求相结合,实现简单、快速、高效的RNA 3’端基因表达文库构建。Traditional transcriptome library construction can easily lead to information loss due to cumbersome operations and excessive steps, and it is difficult to construct an expression library for low template mRNA (such as single-cell RNA). The method provided by the present invention intends to combine the relatively mature transposase complex-based library construction technology in the field of DNA NGS library construction with the requirement for expression detection of low template mRNA, so as to realize simple, fast and efficient RNA 3' End gene expression library construction.
本发明技术路线明确,利用带有第一接头的Oligo(dT)引物对RNA样本进行反转录后,合成第二链cDNA,并使用带有第二接头的转座酶复合物处理双链cDNA,因此通过反转录与转座步骤分别成功添加第一接头和第二接头,生成双端带有不同接头的mRNA3’端区域的双链cDNA,可直接进行或增加一步文库扩增后进行下游测序。本方法建库效率高,且方法简单易行,省时省力,尤其适用于低起始量RNA模板,特别是单细胞RNA 的表达文库构建,有效避免因步骤繁多导致的信息丢失问题,对转录组学、RNA测序领域研究具有重大而深远的意义。The technical route of the present invention is clear. After reverse transcription of the RNA sample using Oligo (dT) primer with the first linker, the second-strand cDNA is synthesized, and the double-strand cDNA is processed by the transposase complex with the second linker Therefore, the first linker and the second linker were successfully added through the reverse transcription and transposition steps to generate a double-stranded cDNA with different linkers at both ends of the mRNA 3'-end region, which can be carried out directly or by adding a step of library amplification followed by downstream Sequencing. This method has high efficiency in library construction, and the method is simple and easy to implement. It saves time and effort. It is especially suitable for the construction of low-initiated RNA templates, especially single-cell RNA expression libraries. Research in the fields of omics and RNA sequencing has great and far-reaching significance.
附图说明Description of the drawings
下面结合附图对本发明的具体实施方式作进一步详细的说明。The specific embodiments of the present invention will be described in further detail below in conjunction with the accompanying drawings.
图1示出本发明的原理示意图。Fig. 1 shows a schematic diagram of the principle of the present invention.
图2示出实施例1、2中转座产物纯化后的电泳检测结果图。其中,M是指100bp的DNA分子标记物,1号泳道中为实施例1扩增产物,2号泳道中为实施例2扩增产物。Figure 2 shows the electrophoresis detection results of the purified transposition products in Examples 1 and 2. Wherein, M refers to a DNA molecular marker of 100 bp, the amplified product of Example 1 is in Lane 1 and the amplified product of Example 2 is in Lane 2.
图3示出实施例1中分选后文库经安捷伦高灵敏DNA芯片电泳图。Fig. 3 shows the electrophoresis diagram of the library after sorting in Example 1 through the Agilent high-sensitivity DNA chip.
图4示出实施例2中分选后文库经安捷伦高灵敏DNA芯片电泳图。4 shows the electrophoresis diagram of the library after sorting in Example 2 through the Agilent high-sensitivity DNA chip.
具体实施方式Detailed ways
为了更清楚地说明本发明,下面结合优选实施例和附图对本发明做进一步的说明。附图中相似的部件以相同的附图标记进行表示。本领域技术人员应当理解,下面所具体描述的内容是说明性的而非限制性的,不应以此限制本发明的保护范围。In order to explain the present invention more clearly, the present invention will be further described below in conjunction with preferred embodiments and drawings. Similar components in the drawings are denoted by the same reference numerals. Those skilled in the art should understand that the content described below is illustrative rather than restrictive, and should not limit the protection scope of the present invention.
本发明提供了一种快速构建RNA 3’端基因表达文库的新方法,所述方法包括以下步骤:The present invention provides a new method for quickly constructing an RNA 3'gene expression library. The method includes the following steps:
(1)使用带有第一接头的Oligo(dT)反转录引物对mRNA样本进行反转录,得到带有第一接头的第一链cDNA;(1) Use Oligo (dT) reverse transcription primer with the first linker to perform reverse transcription on the mRNA sample to obtain the first strand cDNA with the first linker;
(2)以第一链cDNA为模板合成第二链cDNA,获得带有第一接头的双链cDNA;(2) Synthesize the second-strand cDNA using the first-strand cDNA as a template to obtain the double-stranded cDNA with the first linker;
(3)使用携带第二接头的转座酶复合物处理双链cDNA,在片段化的同时插入第二接头,得到带有第一接头和/或第二接头的双链cDNA片段;(3) Treat the double-stranded cDNA with the transposase complex carrying the second linker, insert the second linker while fragmenting, to obtain a double-stranded cDNA fragment with the first linker and/or the second linker;
其中,所述第一接头与第二接头的序列不相同,以保证源自mRNA模板3’端区域的双链cDNA两端带有不同的接头序列。Wherein, the sequences of the first linker and the second linker are different to ensure that the double-stranded cDNA derived from the 3'end region of the mRNA template has different linker sequences at both ends.
本发明的术语“第一接头”和“第二接头”中接头序列是指向他所接合的核酸片段提供寻址手段的非靶核酸组分的DNA序列。In the terms "first linker" and "second linker" of the present invention, the linker sequence refers to the DNA sequence of the non-target nucleic acid component that provides addressing means to the nucleic acid fragment to which it is joined.
本发明的mRNA样本是指真核生物的3’端含有polyA序列的mRNA样本。该mRNA样本可通过常规的RNA提取、纯化和富集等步骤获得,这些步骤均属于现有技术中的成熟技术,本发明对此不做进一步限制。The mRNA sample of the present invention refers to an mRNA sample containing a polyA sequence at the 3'end of a eukaryotic organism. The mRNA sample can be obtained through conventional RNA extraction, purification and enrichment steps, which are all mature technologies in the prior art, and the present invention does not impose further restrictions on this.
本发明的具体技术流程为:首先通过带有第一接头的Oligo(dT)引物进行RNA样本的反转录,获得带有第一接头的第一链cDNA,然后通过消化mRNA模板、合成第二链cDNA即可获得带有第一接头的双链cDNA;然后利用带有第二接头序列的转座酶复合物处理双链cDNA,在片段化的同时插入第二接头;此时,除了源自mRNA模板3’端区域的双链cDNA两端带有不同的第一接头与第二接头序列外,其余片段的两端均带有相同的第二接头序列,因此仅有mRNA模板3’端区域可进行下游的测序工作,而其余区域则不会被利用。综上,本发明可有效实现针对mRNA 3’端的转录组文库构建与测序工作。通过本发明提供的方法得到的转录组文库,可用于Roche、Illumina、ThermoFisher、Pacific Biosciences、华大基因、Oxford Nanopore Technologies、华因康、瀚海基因高通量测序平台。The specific technical process of the present invention is as follows: firstly, reverse transcription of the RNA sample is carried out with Oligo (dT) primer with the first linker to obtain the first strand cDNA with the first linker, and then by digesting the mRNA template, synthesizing the second The double-stranded cDNA with the first linker can be obtained from the stranded cDNA; then the double-stranded cDNA is treated with the transposase complex with the second linker sequence, and the second linker is inserted while fragmenting; The double-stranded cDNA in the 3'end region of the mRNA template has different first and second linker sequences at both ends, and the remaining fragments have the same second linker sequence at both ends, so there is only the 3'end region of the mRNA template The downstream sequencing work can be performed, and the remaining areas will not be used. In summary, the present invention can effectively realize the construction and sequencing of the transcriptome library for the 3'end of mRNA. The transcriptome library obtained by the method provided by the present invention can be used in Roche, Illumina, ThermoFisher, Pacific Biosciences, BGI, Oxford Nanopore Technologies, Huayinkang, and Hanhai Gene high-throughput sequencing platforms.
本发明得到的带有第一接头和/或第二接头的双链cDNA片段可直接进行下游测序工作,或者使用根据第一接头和第二接头序列所设计的PCR引物进行文库扩增,实现对目标区域的富集与放大,然后再进行下游的测序工作。The double-stranded cDNA fragment with the first linker and/or the second linker obtained in the present invention can be directly subjected to downstream sequencing work, or the library can be amplified by using PCR primers designed according to the first linker and the second linker sequence to realize Enrichment and amplification of the target area, and then downstream sequencing work.
在本发明优选的实施方式中,当需要对文库进行扩增时,所述方法还包括以下步骤:以根据第一接头序列和第二接头序列设计的PCR引物对双链cDNA片段进行扩增,得到RNA 3’端基因表达文库。当原始的RNA样本量很低时,该步扩增可以有效扩大双链cDNA片段的量,实现对目标区域的富集和信号放大。进一步优选地,该步骤中根据第一接头序列和第二接头序列设计的PCR引物还包括建库和测序所需的Index序列。本领域技术人员可以理解的是,在本发明可以实现的实施方式中,根据第一接头序列和第二接头序列设计的PCR引物还可以包括其它任意所需的序列,例如进行高通量单细胞测序时,通过多步PCR引入分子标签与细胞标签,并通过二者数量的相乘实现高通量。通过文库扩增PCR步骤引入其它任意所需序列,以便实现下游的测序工作。In a preferred embodiment of the present invention, when the library needs to be amplified, the method further includes the following steps: Amplify the double-stranded cDNA fragment with PCR primers designed according to the first linker sequence and the second linker sequence, Obtain the RNA 3'end gene expression library. When the amount of the original RNA sample is very low, this step of amplification can effectively expand the amount of double-stranded cDNA fragments to achieve the enrichment of the target region and signal amplification. Further preferably, the PCR primers designed according to the first linker sequence and the second linker sequence in this step also include the Index sequence required for library building and sequencing. Those skilled in the art can understand that in the achievable embodiments of the present invention, the PCR primers designed according to the first linker sequence and the second linker sequence may also include any other desired sequences, such as high-throughput single cell When sequencing, the molecular tags and cell tags are introduced by multi-step PCR, and high throughput is achieved by multiplying the numbers of the two. Introduce any other desired sequences through the library amplification PCR step to achieve downstream sequencing work.
在本发明优选的实施方式中,当直接进行下游测序工作时,所述第一接头和第二接头中直接带有建库和测序所需的Index序列或其它测序所需的任意序列,以便实现下游的测序工作。In a preferred embodiment of the present invention, when directly performing downstream sequencing work, the first adapter and the second adapter directly carry the Index sequence required for library construction and sequencing or any sequence required for other sequencing, so as to achieve Downstream sequencing work.
在本发明优选的实施方式中,步骤(3)所述转座酶复合物包含转座酶和插入DNA,所述转座酶可以是Tn5,也可以是MuA或是其它可以行使转座功能的转座酶。进一步优选地,转座酶复合物中,携带第二接头的插入片段可以是两条一样的DNA分子;或者携带第二接头的插入片段也可以是一条两端带有相同第二接头序列的DNA分子,此时形成的是环状转座子复合物(参见专利申请《一种环状转座子复合物及其应用》,申请号201710013203.8)。In a preferred embodiment of the present invention, the transposase complex in step (3) comprises a transposase and inserting DNA. The transposase can be Tn5, MuA or other transposable functions. Transposase. Further preferably, in the transposase complex, the insert carrying the second linker may be two identical DNA molecules; or the insert carrying the second linker may also be a piece of DNA with the same second linker sequence at both ends. The molecule is formed at this time as a circular transposon complex (see the patent application "A circular transposon complex and its application", application number 201710013203.8).
在本发明优选的实施方式中,本发明合成第一链cDNA时使用的反转录引物为5’端带有第一接头序列的Oligo(dT)引物;其3’末端为带有与紧邻mRNA poly(A)区域发生退火的锚定碱基的核苷酸,该引物可与真核生物总RNA样本中所有mRNA组分结合启动反转录。In a preferred embodiment of the present invention, the reverse transcription primer used when synthesizing the first-strand cDNA of the present invention is an Oligo(dT) primer with a first linker sequence at the 5'end; The nucleotides of anchor bases annealed in the poly(A) region. This primer can bind to all the mRNA components in the eukaryotic total RNA sample to initiate reverse transcription.
在本发明的另一种优选的实施方式中,本发明合成第一链cDNA时使用的反转录引物为5’端带有第一接头序列的Oligo(dT)引物;其3’末端为含T碱基的核苷酸。In another preferred embodiment of the present invention, the reverse transcription primer used when synthesizing the first-strand cDNA of the present invention is an Oligo (dT) primer with a first linker sequence at the 5'end; the 3'end contains T-base nucleotides.
传统基因表达文库构建方法,通常首先需要进行mRNA的捕获或核糖体RNA去除,然后进行反转录,继而进行较繁琐的传统建库流程(包括片段化、末端修复与加A、接头连接、文库扩增等步骤,以及多步产物分选与纯化)。此方法步骤繁琐,耗时费力,且操作污染几率较大,需要RNA样品起始量较高,对于少量RNA模板,特别是单细胞RNA建库,往往效果并不理想。Traditional gene expression library construction methods usually first require mRNA capture or ribosomal RNA removal, and then reverse transcription, followed by more cumbersome traditional library construction procedures (including fragmentation, end repair and A, linker connection, library Steps such as amplification, as well as multi-step product sorting and purification). This method is complicated in steps, time-consuming and laborious, and has a high probability of operation contamination. It requires a high initial amount of RNA samples. For a small amount of RNA template, especially single-cell RNA library construction, the effect is often not ideal.
本发明使用反转录—合成双链cDNA—体外转座的技术路线,通过反转录与转座两步骤,将mRNA模板3’端区域成功构建为两端带有不同接头序列的文库,可直接进行或增加一步文库扩增后进行下游测序。此方法步骤较少,操作简单灵活,特别适用于低起始量RNA建库,如单细胞RNA建库与测序。The present invention uses the technical route of reverse transcription—synthesis of double-stranded cDNA—in vitro transposition. Through the two steps of reverse transcription and transposition, the 3'-end region of the mRNA template is successfully constructed into a library with different linker sequences at both ends. Directly or add a step of library amplification followed by downstream sequencing. This method has fewer steps, simple and flexible operation, and is particularly suitable for low-initiated RNA library building, such as single-cell RNA library building and sequencing.
基于上述本发明提供的构建RNA 3’端基因表达文库的方法,本发明还提供了两种基因表达的测序方法。Based on the method for constructing an RNA 3'gene expression library provided by the present invention, the present invention also provides two methods for sequencing gene expression.
其中,第一种测序方法包括以下步骤:Among them, the first sequencing method includes the following steps:
使用带有第一接头的Oligo(dT)反转录引物对RNA样本进行反转录,得到带有第一接头的第一链cDNA;Use Oligo (dT) reverse transcription primer with the first adapter to reverse transcription of the RNA sample to obtain the first strand cDNA with the first adapter;
以第一链cDNA为模板合成第二链cDNA,获得带有第一接头的双链cDNA;Synthesize the second-strand cDNA using the first-strand cDNA as a template to obtain the double-stranded cDNA with the first linker;
使用带有第二接头的转座酶复合物处理双链cDNA,在片段化的同时插入第二接头,得到带有第一接头和/或第二接头的双链cDNA片段;Use the transposase complex with a second linker to process the double-stranded cDNA, insert the second linker while fragmenting, to obtain a double-stranded cDNA fragment with the first linker and/or the second linker;
利用带有第一接头和/或第二接头的双链cDNA片段进行测序工作;Use the double-stranded cDNA fragment with the first linker and/or the second linker for sequencing;
其中,所述第一接头和第二接头中带有建库和测序所需的Index序列;所述第一接头与第二接头的序列不相同。Wherein, the first linker and the second linker carry Index sequences required for library building and sequencing; the sequences of the first linker and the second linker are different.
第二种测序方法包括以下步骤:The second sequencing method includes the following steps:
使用带有第一接头的Oligo(dT)反转录引物对RNA样本进行反转录,得到带有第一接头的第一链cDNA;Use Oligo (dT) reverse transcription primer with the first adapter to reverse transcription of the RNA sample to obtain the first strand cDNA with the first adapter;
以第一链cDNA为模板合成第二链cDNA,获得带有第一接头的双链cDNA;Synthesize the second-strand cDNA using the first-strand cDNA as a template to obtain the double-stranded cDNA with the first linker;
使用带有第二接头的转座酶复合物处理双链cDNA,在片段化的同时插入第二接头,得到带有第一接头和/或第二接头的双链cDNA片段;Use the transposase complex with a second linker to process the double-stranded cDNA, insert the second linker while fragmenting, to obtain a double-stranded cDNA fragment with the first linker and/or the second linker;
以根据第一接头序列和第二接头序列设计的PCR引物对双链cDNA片段进行扩增,得到RNA 3’端基因表达文库;Amplify the double-stranded cDNA fragment with PCR primers designed according to the first linker sequence and the second linker sequence to obtain an RNA 3'end gene expression library;
将RNA 3’端基因表达文库进行测序工作;Sequencing the RNA 3'gene expression library;
其中,根据第一接头序列和第二接头序列设计的PCR引物还包括建库和测序所需的Index序列;所述第一接头与第二接头的序列不相同。Wherein, the PCR primer designed according to the sequence of the first linker and the sequence of the second linker also includes an Index sequence required for library construction and sequencing; the sequence of the first linker and the second linker are different.
实施例1转座法后通过PCR的方式进入双端测序接头Example 1 Enter the paired-end sequencing adapter by PCR after transposition method
1.mRNA的富集1. Enrichment of mRNA
对真核生物的mRNA进行富集,具体步骤为常规mRNA富集步骤,例如可以使用mRNA纯化试剂盒(北京全式金生物,目录号:EC801),此处步骤略。For the enrichment of eukaryotic mRNA, the specific steps are conventional mRNA enrichment steps. For example, an mRNA purification kit (Beijing Quanshijin Biology, catalog number: EC801) can be used, and the steps are omitted here.
2.第一链cDNA的合成2. Synthesis of the first strand cDNA
第一链cDNA的合成,例如使用第一链cDNA转录合成试剂盒(北京全式金生物,目录号:AT301)进行该操作:For the synthesis of first-strand cDNA, for example, use the first-strand cDNA transcription synthesis kit (Beijing Quanshijin Biology, catalog number: AT301) for this operation:
1>依照以下体系加入各组分:1>Add each component according to the following system:
Figure PCTCN2020095600-appb-000001
Figure PCTCN2020095600-appb-000001
Oligo(T)-VN1为带有第一接头的Oligo(dT)反转录引物,序列为:GACGTGTGCTCTTCCGATCTAAAAAAAAAAAAAAAAAAAAAAAAVN(如SEQ ID NO.1所示),其中V代表G/A/C三种碱基中的任意一种,N代表G/A/C/T四种碱基中的任意一种。Oligo(T)-VN1 is an Oligo(dT) reverse transcription primer with a first linker, the sequence is: GACGTGTGCTCTTCCGATCTAAAAAAAAAAAAAAAAAAAAAAAAVN (shown in SEQ ID NO.1), where V represents the three bases of G/A/C Any one of the four bases, N represents any one of the four bases G/A/C/T.
2>轻轻混匀;42℃孵育30分钟;2>Mix gently; incubate at 42°C for 30 minutes;
3>85℃加热5秒失活RT/RI转录酶。3> Heat at 85°C for 5 seconds to inactivate RT/RI transcriptase.
3.mRNA的消化3. Digestion of mRNA
1>在上述体系中加入1μl RNA酶混合物,对残留的mRNA于37℃消化15分钟,得到带有第一接头的第一链cDNA溶液;1>Add 1μl RNase mixture to the above system, digest the residual mRNA at 37°C for 15 minutes to obtain the first strand cDNA solution with the first linker;
4.第二链合成4. Second strand synthesis
在上述体系中,直接进行第二链的合成,例如使用全式金公司的第一链cDNA转录合成试剂盒(北京全式金生物,目录号:AT302)进行该步操作。In the above system, the synthesis of the second strand is directly performed, for example, the first strand cDNA transcription synthesis kit of Quanshijin Company (Beijing Quanshijin Biology, catalog number: AT302) is used for this step.
1>依照以下体系加入各组分:1>Add each component according to the following system:
Figure PCTCN2020095600-appb-000002
Figure PCTCN2020095600-appb-000002
2>轻轻混匀;16℃孵育60分钟,得到带有第一接头的双链cDNA溶液;2> Mix gently; incubate at 16°C for 60 minutes to obtain a double-stranded cDNA solution with the first linker;
5.转座法第二接头的插入5. Insert the second joint of the transposition method
在上述体系中,使用含有相同短接头(使用Oligo A与Oligo B退火制备而成)的Tn5转座复合物Tn5-S直接对双链cDNA进行接头的插入,其中,Oligo A的序列为:TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG(如SEQ ID NO.2所示);Oligo B的序列为:CTGTCTCTTATACACATCT(如SEQ ID NO.3所示)。插入的步骤为:In the above system, the Tn5 transposition complex Tn5-S containing the same short linker (prepared by annealing Oligo A and Oligo B) is used to directly insert the double-stranded cDNA. The sequence of Oligo A is: TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG (As shown in SEQ ID NO. 2); Oligo B's sequence is: CTGTCTCTTATACACATCT (as shown in SEQ ID NO. 3). The steps to insert are:
1>依照以下体系加入各组分:1>Add each component according to the following system:
Figure PCTCN2020095600-appb-000003
Figure PCTCN2020095600-appb-000003
注意:转座酶Tn5在室温下也可缓慢反应,因此反应混合液需在冰上加入,并于吹打混匀后立即进行孵育反应。Note: The transposase Tn5 can react slowly at room temperature, so the reaction mixture should be added on ice, and the incubation reaction should be performed immediately after pipetting and mixing.
2>移液枪吹吸混匀,管壁上如有液体可点甩离心,并立即进行下步反应。2>Blow and mix with pipette. If there is liquid on the tube wall, it can be centrifuged, and the next step of reaction will be carried out immediately.
3>将样品置于PCR仪中,55℃孵育5分钟(盖子温度70℃)。3> Put the sample in a PCR machine and incubate at 55°C for 5 minutes (cover temperature 70°C).
4>立即置于冰中,并立即向反应管中加入20μl 4×Tn5 Digestion Mix*,移液枪吹吸混匀,PCR仪中55℃孵育5分钟(盖子温度70℃),立即置于冰中。4>Place on ice immediately, and immediately add 20μl 4×Tn5 Digestion Mix* to the reaction tube, pipette to mix evenly, incubate at 55°C for 5 minutes in the PCR machine (cover temperature 70°C), and place on ice immediately in.
6.转座产物纯化6. Purification of transposition products
进行产物纯化,具体操作略。例如本实施例中使用MagicPureTM Size Selection DNA Beads(北京全式金生物,目录号:EC401)进行纯化。纯化结束后对纯化产物进行电泳检测,检测结果如图2,可见,转座产物纯化后主要分布于300~500bp范围内,片段化效果良好。For product purification, the specific operation is omitted. For example, in this embodiment, MagicPure™ Size Selection DNA Beads (Beijing Quanshijin Biotech, catalog number: EC401) is used for purification. After purification, the purified product was subjected to electrophoresis detection. The detection result is shown in Figure 2. It can be seen that the purified transposition product is mainly distributed in the range of 300-500bp, and the fragmentation effect is good.
7. 3’末端产物的扩增7. Amplification of the 3'end product
1>依照以下体系加入各组分:1>Add each component according to the following system:
Figure PCTCN2020095600-appb-000004
Figure PCTCN2020095600-appb-000004
Figure PCTCN2020095600-appb-000005
Figure PCTCN2020095600-appb-000005
其中,上游引物Primer-F的序列为:CAAGCAGAAGACGGCATACGAGAT[index-i5]GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT;其中,[index-i5]插入在上游引物Primer-F原始序列CAAGCAGAAGACGGCATACGAGATGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT(如SEQ ID NO.4所示)中间,具体插入位置如上游引物Primer-F的序列所示。Among them, the sequence of the upstream primer Primer-F is: CAAGCAGAAGACGGCATACGAGAT[index-i5]GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT; where [index-i5] is inserted into the original sequence of the upstream primer Primer-F CAAGCAGAAGACGGCATACGAGATGTGACTGGAGTTCAGACGTG.TGCTTCCGIDATCAGACGNO. The position is shown in the sequence of the upstream primer Primer-F.
下游引物Primer-R的序列为:AATGATACGGCGACCACCGAGATCTACAC[index-i7]TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG。其中,[index-i7]插入在下游引物Primer-R原始序列AATGATACGGCGACCACCGAGATCTACACTCGTCGGCAGCGTCAGATGTGTATAAGAGACAG(如SEQ ID NO.5所示)中间,具体插入位置如下游引物Primer-R的序列所示。The sequence of the downstream primer Primer-R is: AATGATACGGCGACCACCGAGATCTACAC[index-i7]TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG. Among them, [index-i7] is inserted in the middle of the original sequence AATGACGGCGACCACCGAGATCTACACTCGTCGGCAGCGTCAGATGTGTATAAGAGACAG (shown in SEQ ID NO. 5) of the downstream primer Primer-R, and the specific insertion position is shown in the sequence of the primer-R of the lower primer.
2>移液枪吹吸混匀,管壁上如有液体可点甩离心。2>Blow and mix with pipette, if there is liquid on the tube wall, it can be centrifuged.
3>PCR仪中进行以下扩增程序3>Perform the following amplification procedures in the PCR machine
Figure PCTCN2020095600-appb-000006
Figure PCTCN2020095600-appb-000006
*此步不可省略。转座反应产物并非完整的双链DNA,72℃孵育3分钟用于生成成熟的PCR模板。*This step cannot be omitted. The transposition reaction product is not a complete double-stranded DNA. Incubate at 72°C for 3 minutes to generate a mature PCR template.
8.扩增产物的分选:8. Sorting of amplified products:
进行产物片段分选,例如,本实施例中使用MagicPureTM Size Selection DNA Beads(北京全式金生物,目录号:EC401)。为防止引物及大片段残留,建议进行两次片段分选,分选具体操作略。分选结束后,对分选产物使用安捷伦2100高灵敏DNA芯片检测,检测结果如图3,可见,扩增产物分选后主要分布于300~500bp范围内,大小分布与峰型均正常。For product fragment sorting, for example, MagicPure™ Size Selection DNA Beads (Beijing Quanshijin Biotech, catalog number: EC401) is used in this embodiment. In order to prevent the residue of primers and large fragments, it is recommended to perform two fragment sorting. The specific operation of sorting is omitted. After sorting, the Agilent 2100 high-sensitivity DNA chip was used to detect the sorted products. The test results are shown in Figure 3. It can be seen that the amplified products are mainly distributed in the range of 300-500bp after sorting, and the size distribution and peak shape are normal.
实施例2转座法后直接测序Example 2 Direct sequencing after transposition method
1.mRNA的富集1. Enrichment of mRNA
同实施例1中步骤1Same as step 1 in example 1
2.第一链cDNA的合成2. Synthesis of the first strand cDNA
进行第一链cDNA的合成。例如,本实施例中使用第一链cDNA转录合成试剂盒(北京全式金生物,目录号:AT301)进行该操作:The synthesis of the first strand cDNA is performed. For example, in this example, the first-strand cDNA transcription synthesis kit (Beijing Quanshijin Biology, catalog number: AT301) was used to perform this operation:
1>依照以下体系加入各组分:1>Add each component according to the following system:
Figure PCTCN2020095600-appb-000007
Figure PCTCN2020095600-appb-000007
Figure PCTCN2020095600-appb-000008
Figure PCTCN2020095600-appb-000008
Oligo(T)-VN2为带有第一接头的Oligo(dT)反转录引物2,其序列为:CAAGCAGAAGACGGCATACGAGAT[index-i5]GTGACTGGAGTTCAGACGTGTGCTCT TCCGATCTAAAAAAAAAAAAAAAAAAAAAAAAVN。Oligo(T)-VN2 is Oligo(dT) reverse transcription primer 2 with a first linker, and its sequence is: CAAGCAGAAGACGGCATACGAGAT[index-i5]GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCTAAAAAAAAAAAAAAAAAAAAAAAAVN.
其中,[index-i5]起区分各样本数据的作用,每个样本都有不同的index,在测序的时候会通过index的不同将样本的数据区分开;[index-i5]插接在带有第一接头的Oligo(dT)反转录引物原始序列2:CAAGCAGAAGACGGCATACGAGATGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCTAAAAAAAAAAAAAAAAAAAAAAAAVN(如SEQ ID NO.6所示)的中间,具体插入位置如带有第一接头的Oligo(dT)反转录引物2的序列所示。Among them, [index-i5] plays a role in distinguishing the data of each sample. Each sample has a different index. When sequencing, the data of the sample will be distinguished by the difference of the index; [index-i5] is inserted in the with The original sequence of the Oligo (dT) reverse transcription primer of the first linker 2: CAAGCAGAAGACGGCATACGAGATGTGACTGGAGTTCAGACGTGTGCTCTTCCGATCTAAAAAAAAAAAAAAAAAAAAAVN (shown in SEQ ID NO. 6) in the middle, the specific insertion position is as in the Oligo (dT) reverse transcription primer 2 with the first linker The sequence is shown.
其中,序列末端的V代表G/A/C三种碱基中的任意一种,N代表G/A/C/T四种碱基中的任意一种。Among them, the V at the end of the sequence represents any of the three bases of G/A/C, and N represents any of the four bases of G/A/C/T.
2>轻轻混匀;42℃孵育30分钟;2>Mix gently; incubate at 42°C for 30 minutes;
3>85℃加热5秒失活RT/RI转录酶。3> Heat at 85°C for 5 seconds to inactivate RT/RI transcriptase.
3.mRNA的消化3. Digestion of mRNA
同实施例1中步骤3Same as step 3 in example 1
4.第二链合成4. Second strand synthesis
同实施例1中步骤4Same as step 4 in example 1
5.转座法第二接头的插入5. Insert the second joint of the transposition method
在上述体系中,使用含有相同长接头(使用Oligo C与Oligo B退火制备而成)的Tn5转座复合物Tn5-L直接对双链cDNA进行接头的插入;其中,Oligo C的序列为:CAAGCAGAAGACGGCATACGAGAT[index-i7]GTCTCGTGGGCTCGGAGATGTGTATA AGAGACAG;[index-i7]插入在Oligo C原始序列CAAGCAGAAGACGGCATACGAGATGTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG(如SEQ ID NO.7所示)中,具体插入位置如Oligo C的序列所示。In the above system, the Tn5 transposable complex Tn5-L containing the same long linker (prepared by the annealing of Oligo C and Oligo B) is used to directly insert the double-stranded cDNA; the sequence of Oligo C is: CAAGCAGAAGACGGCATACGAGAT [index-i7] GTCTCGTGGGCTCGGAGATGTGTATA AGAGACAG; [index-i7] is inserted in the original Oligo C sequence CAAGCAGAAGACGGCATACGAGATGTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG (shown in SEQ ID NO. 7), and the specific insertion position is shown in the sequence of Oligo C.
插入的步骤为:The steps to insert are:
1>依照以下体系加入各组分:1>Add each component according to the following system:
Figure PCTCN2020095600-appb-000009
Figure PCTCN2020095600-appb-000009
注意:转座酶Tn5在室温下也可缓慢反应,因此反应混合液需在冰上加入,并于吹打混匀后立即进行孵育反应。Note: The transposase Tn5 can react slowly at room temperature, so the reaction mixture should be added on ice, and the incubation reaction should be performed immediately after pipetting and mixing.
2>移液枪吹吸混匀,管壁上如有液体可点甩离心,并立即进行下步反应。2>Blow and mix with pipette. If there is liquid on the tube wall, it can be centrifuged, and the next step of reaction will be carried out immediately.
3>将样品置于PCR仪中,55℃孵育5分钟(盖子温度70℃)。3> Put the sample in a PCR machine and incubate at 55°C for 5 minutes (cover temperature 70°C).
4>立即置于冰中,并立即向反应管中加入15μl 5×Tn5 Repair Mix,移液枪吹吸混 匀,PCR仪中65℃孵育5分钟(盖子温度99℃),立即置于冰中。4>Place on ice immediately, and immediately add 15μl 5×Tn5 Repair Mix to the reaction tube, pipette to mix evenly, incubate at 65°C for 5 minutes in the PCR machine (cover temperature 99°C), and place in ice immediately .
6.转座产物纯化6. Purification of transposition products
同实施例1中步骤6。对纯化产物进行电泳检测,如图2所示。Same as step 6 in Example 1. The purified product was detected by electrophoresis, as shown in Figure 2.
7.转座产物的分选:7. Sorting of transposition products:
进行产物片段分选。例如本实施例中使用的是MagicPureTM Size Selection DNA Beads(北京全式金生物,目录号:EC401)。为防止引物及大片段残留,建议进行两次片段分选,分选具体操作略。分选结束后,对分选产物使用安捷伦2100高灵敏DNA芯片检测,检测结果如图4所示,可见,扩增产物分选后主要分布于300~500bp范围内,大小分布与峰型均正常。Carry out product fragment sorting. For example, in this embodiment, MagicPure™ Size Selection DNA Beads (Beijing Quanshijin Biotech, catalog number: EC401) is used. In order to prevent the residue of primers and large fragments, it is recommended to perform two fragment sorting. The specific operation of sorting is omitted. After sorting, the Agilent 2100 high-sensitivity DNA chip was used to detect the sorted products. The test results are shown in Figure 4. It can be seen that the amplified products are mainly distributed in the range of 300-500bp after sorting, and the size distribution and peak shape are normal. .
显然,本发明的上述实施例仅仅是为清楚地说明本发明所作的举例,而并非是对本发明的实施方式的限定,对于所属领域的普通技术人员来说,在上述说明的基础上还可以做出其它不同形式的变化或变动,这里无法对所有的实施方式予以穷举,凡是属于本发明的技术方案所引伸出的显而易见的变化或变动仍处于本发明的保护范围之列。Obviously, the above-mentioned embodiments of the present invention are merely examples to clearly illustrate the present invention, and are not meant to limit the implementation of the present invention. For those of ordinary skill in the art, they can also do on the basis of the above description. In addition to other different forms of changes or changes, it is not possible to list all the implementations here. Any obvious changes or changes derived from the technical solutions of the present invention are still within the protection scope of the present invention.
Figure PCTCN2020095600-appb-000010
Figure PCTCN2020095600-appb-000010
Figure PCTCN2020095600-appb-000011
Figure PCTCN2020095600-appb-000011

Claims (10)

  1. 一种快速构建RNA 3’端基因表达文库的方法,其特征在于,所述方法包括以下步骤:A method for quickly constructing an RNA 3'gene expression library, characterized in that the method includes the following steps:
    使用带有第一接头的Oligo dT反转录引物对mRNA样本进行反转录,得到带有第一接头的第一链cDNA;Use Oligo dT reverse transcription primer with the first linker to perform reverse transcription on the mRNA sample to obtain the first-strand cDNA with the first linker;
    以第一链cDNA为模板合成第二链cDNA,获得带有第一接头的双链cDNA;Synthesize the second-strand cDNA using the first-strand cDNA as a template to obtain the double-stranded cDNA with the first linker;
    使用携带第二接头的转座酶复合物处理双链cDNA,在片段化的同时插入第二接头,得到带有第一接头和/或第二接头的双链cDNA片段;Use the transposase complex carrying the second linker to process the double-stranded cDNA, insert the second linker while fragmenting, to obtain a double-stranded cDNA fragment with the first linker and/or the second linker;
    其中,所述第一接头与第二接头的序列不相同。Wherein, the sequence of the first linker and the second linker are not the same.
  2. 根据权利要求1所述的方法,其特征在于,所述方法还包括以下步骤:The method according to claim 1, wherein the method further comprises the following steps:
    以根据第一接头序列和第二接头序列设计的PCR引物对双链cDNA片段进行扩增,得到RNA 3’端基因表达文库。The double-stranded cDNA fragments are amplified by PCR primers designed according to the first linker sequence and the second linker sequence to obtain an RNA 3'end gene expression library.
  3. 根据权利要求2所述的方法,其特征在于,根据第一接头序列和第二接头序列设计的PCR引物还包括建库和测序所需的Index序列。The method according to claim 2, wherein the PCR primers designed according to the first linker sequence and the second linker sequence further include an Index sequence required for library construction and sequencing.
  4. 根据权利要求1所述的方法,其特征在于,所述第一接头和第二接头中带有建库和测序所需的Index序列。The method according to claim 1, wherein the first linker and the second linker carry Index sequences required for library construction and sequencing.
  5. 根据权利要求1所述的方法,其特征在于,所述转座酶复合物包含转座酶和插入DNA;所述转座酶为Tn5转座酶或MuA转座酶。The method of claim 1, wherein the transposase complex comprises a transposase and inserting DNA; and the transposase is Tn5 transposase or MuA transposase.
  6. 根据权利要求1所述的方法,其特征在于,所述转座酶复合物中,携带第二接头的插入DNA为两条一样的DNA分子,或携带第二接头的插入DNA为一条两端带有相同第二接头的DNA分子。The method according to claim 1, wherein in the transposase complex, the insert DNA carrying the second linker is two identical DNA molecules, or the insert DNA carrying the second linker is a band at both ends DNA molecules with the same second linker.
  7. 根据权利要求1所述的方法,其特征在于,带有第一接头的Oligo dT反转录引物,其5’端为带有第一接头序列的Oligo dT引物,其3’末端为含T碱基的核苷酸。The method according to claim 1, wherein the Oligo dT reverse transcription primer with a first linker has an Oligo dT primer with a first linker sequence at its 5'end, and a T base at its 3'end Base nucleotides.
  8. 根据权利要求1所述的方法,其特征在于,带有第一接头的Oligo dT反转录引物,其5’端为带有第一接头序列的Oligo dT引物,其3’末端为带有与紧邻mRNA polyA区域发生退火的含锚定碱基的核苷酸。The method according to claim 1, wherein the Oligo dT reverse transcription primer with a first linker has an Oligo dT primer with a first linker sequence at its 5'end, and an Oligo dT primer with a first linker sequence at its 3'end. Nucleotides containing anchor bases that anneal to the polyA region of the mRNA.
  9. 一种基因表达的测序方法,其特征在于,所述方法包括以下步骤:A method for sequencing gene expression, characterized in that the method comprises the following steps:
    使用带有第一接头的Oligo dT反转录引物对RNA样本进行反转录,得到带有第一接头的第一链cDNA;Use Oligo dT reverse transcription primer with the first linker to perform reverse transcription on the RNA sample to obtain the first strand cDNA with the first linker;
    以第一链cDNA为模板合成第二链cDNA,获得带有第一接头的双链cDNA;Synthesize the second-strand cDNA using the first-strand cDNA as a template to obtain the double-stranded cDNA with the first linker;
    使用带有第二接头的转座酶复合物处理双链cDNA,在片段化的同时插入第二接头,得到带有第一接头和/或第二接头的双链cDNA片段;Use the transposase complex with a second linker to process the double-stranded cDNA, insert the second linker while fragmenting, to obtain a double-stranded cDNA fragment with the first linker and/or the second linker;
    利用带有第一接头和/或第二接头的双链cDNA片段进行测序工作;Use the double-stranded cDNA fragment with the first linker and/or the second linker for sequencing;
    其中,所述第一接头和第二接头中带有建库和测序所需的Index序列;所述第一接头与第二接头的序列不相同。Wherein, the first linker and the second linker carry Index sequences required for library building and sequencing; the sequences of the first linker and the second linker are different.
  10. 一种基因表达的测序方法,其特征在于,所述方法包括以下步骤:A method for sequencing gene expression, characterized in that the method comprises the following steps:
    使用带有第一接头的Oligo dT反转录引物对RNA样本进行反转录,得到带有第一接 头的第一链cDNA;Use Oligo dT reverse transcription primer with the first linker to perform reverse transcription on the RNA sample to obtain the first-strand cDNA with the first linker;
    以第一链cDNA为模板合成第二链cDNA,获得带有第一接头的双链cDNA;Synthesize the second-strand cDNA using the first-strand cDNA as a template to obtain the double-stranded cDNA with the first linker;
    使用带有第二接头的转座酶复合物处理双链cDNA,在片段化的同时插入第二接头,得到带有第一接头和/或第二接头的双链cDNA片段;Use the transposase complex with a second linker to process the double-stranded cDNA, insert the second linker while fragmenting, to obtain a double-stranded cDNA fragment with the first linker and/or the second linker;
    以根据第一接头序列和第二接头序列设计的PCR引物对双链cDNA片段进行扩增,得到RNA 3’端基因表达文库;Amplify the double-stranded cDNA fragment with PCR primers designed according to the first linker sequence and the second linker sequence to obtain an RNA 3'end gene expression library;
    将RNA 3’端基因表达文库进行测序工作;Sequencing the RNA 3'gene expression library;
    其中,根据第一接头序列和第二接头序列设计的PCR引物还包括建库和测序所需的Index序列;所述第一接头与第二接头的序列不相同。Wherein, the PCR primer designed according to the sequence of the first linker and the sequence of the second linker also includes an Index sequence required for library construction and sequencing; the sequence of the first linker and the second linker are different.
PCT/CN2020/095600 2019-06-25 2020-06-11 Method for rapid construction of rna 3'-end gene expression library WO2020259303A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910552575.7 2019-06-25
CN201910552575.7A CN110257479A (en) 2019-06-25 2019-06-25 A kind of method that rapid build RNA 3 ' holds gene expression library

Publications (1)

Publication Number Publication Date
WO2020259303A1 true WO2020259303A1 (en) 2020-12-30

Family

ID=67921163

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/095600 WO2020259303A1 (en) 2019-06-25 2020-06-11 Method for rapid construction of rna 3'-end gene expression library

Country Status (2)

Country Link
CN (1) CN110257479A (en)
WO (1) WO2020259303A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110257479A (en) * 2019-06-25 2019-09-20 北京全式金生物技术有限公司 A kind of method that rapid build RNA 3 ' holds gene expression library

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102264914A (en) * 2008-10-24 2011-11-30 阿霹震中科技公司 Transposon end compositions and methods for modifying nucleic acids
CN106459967A (en) * 2014-04-29 2017-02-22 Illumina公司 Multiplexed single cell gene expression analysis using template switch and tagmentation
WO2017048993A1 (en) * 2015-09-15 2017-03-23 Takara Bio Usa, Inc. Methods for preparing a next generation sequencing (ngs) library from a ribonucleic acid (rna) sample and compositions for practicing the same
CN106754904A (en) * 2016-12-21 2017-05-31 南京诺唯赞生物科技有限公司 A kind of specific molecular label of cDNA and its application
CN109811045A (en) * 2017-11-22 2019-05-28 深圳华大智造科技有限公司 The construction method of high-throughput unicellular overall length transcript profile sequencing library and its application
CN110257479A (en) * 2019-06-25 2019-09-20 北京全式金生物技术有限公司 A kind of method that rapid build RNA 3 ' holds gene expression library

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107586835B (en) * 2017-10-19 2020-11-03 东南大学 Single-chain-linker-based construction method and application of next-generation sequencing library
CN108103055B (en) * 2018-01-09 2021-05-28 序康医疗科技(苏州)有限公司 Method for reverse transcription of single-cell RNA and construction of library

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102264914A (en) * 2008-10-24 2011-11-30 阿霹震中科技公司 Transposon end compositions and methods for modifying nucleic acids
CN106459967A (en) * 2014-04-29 2017-02-22 Illumina公司 Multiplexed single cell gene expression analysis using template switch and tagmentation
WO2017048993A1 (en) * 2015-09-15 2017-03-23 Takara Bio Usa, Inc. Methods for preparing a next generation sequencing (ngs) library from a ribonucleic acid (rna) sample and compositions for practicing the same
CN106754904A (en) * 2016-12-21 2017-05-31 南京诺唯赞生物科技有限公司 A kind of specific molecular label of cDNA and its application
CN109811045A (en) * 2017-11-22 2019-05-28 深圳华大智造科技有限公司 The construction method of high-throughput unicellular overall length transcript profile sequencing library and its application
CN110257479A (en) * 2019-06-25 2019-09-20 北京全式金生物技术有限公司 A kind of method that rapid build RNA 3 ' holds gene expression library

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
COLE, C. ET AL.: "Tn5Prime, a Tn5 based 5' capture method for single cell RNA-seq.", NUCLEIC ACIDS RES., vol. 46, no. 10, 1 June 2018 (2018-06-01), XP055637367, DOI: 20200901140258X *

Also Published As

Publication number Publication date
CN110257479A (en) 2019-09-20

Similar Documents

Publication Publication Date Title
US8999677B1 (en) Method for differentiation of polynucleotide strands
JP7232643B2 (en) Deep sequencing profiling of tumors
US11761037B1 (en) Probe and method of enriching target region applicable to high-throughput sequencing using the same
JP2020501554A (en) Method for increasing the throughput of single molecule sequencing by linking short DNA fragments
CN111808854B (en) Balanced joint with molecular bar code and method for quickly constructing transcriptome library
CN111549025B (en) Strand displacement primer and cell transcriptome library construction method
WO2013079649A1 (en) Method and kit for characterizing rna in a composition
CN115109842A (en) High sensitivity method for accurate parallel quantification of nucleic acids
KR101913735B1 (en) Internal control substance searching for inter­sample cross­contamination of next­generation sequencing samples
WO2020259303A1 (en) Method for rapid construction of rna 3'-end gene expression library
WO2020005159A1 (en) Method for detection and quantification of genetic alterations
CN115109846A (en) Method for accurate parallel quantification of nucleic acids in diluted or unpurified samples
US20210040540A1 (en) Parallel liquid-phase hybrid capture method for simultaneously capturing sense and antisense double strands of genomic target region
CN113302301A (en) Method for detecting analytes and compositions thereof
US20110269137A1 (en) Rapid and efficient assay to assess the sequence and size of 3' ends of polynucleotides
CN107034267B (en) Method and device for preparing candidate sequencing probe set and application of candidate sequencing probe set
Xu et al. Multiplexed Primer Extension Sequencing Enables High Precision Detection of Rare Splice Isoforms
US20210180125A1 (en) Method for the detection and quantification of genetic alterations
EP4332238A1 (en) Methods for accurate parallel detection and quantification of nucleic acids
JP2024035110A (en) Highly sensitive methods for accurate parallel quantification of variant nucleic acids
Gallardo et al. Application to Assisted Reproductive of Whole-Genome Treatment Technologies
CN115074422A (en) Detection method of unknown fusion gene
CN117965709A (en) Joint for detecting mutation frequency of gene by single-ended multiple amplification and using method
CN118215744A (en) Target enrichment and quantification using isothermal linear amplification probes
WO2022232795A1 (en) Compositions and methods related to modification and detection of pseudouridine and 5-hydroxymethylcytosine

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20833099

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC

122 Ep: pct application non-entry in european phase

Ref document number: 20833099

Country of ref document: EP

Kind code of ref document: A1