CN110643692A - Analysis method and kit for sequencing single cell transcript isomer - Google Patents
Analysis method and kit for sequencing single cell transcript isomer Download PDFInfo
- Publication number
- CN110643692A CN110643692A CN201910611776.XA CN201910611776A CN110643692A CN 110643692 A CN110643692 A CN 110643692A CN 201910611776 A CN201910611776 A CN 201910611776A CN 110643692 A CN110643692 A CN 110643692A
- Authority
- CN
- China
- Prior art keywords
- cdna
- sequencing
- transcript
- sequence
- reverse transcription
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000012163 sequencing technique Methods 0.000 title claims abstract description 41
- 238000004458 analytical method Methods 0.000 title claims abstract description 16
- 238000000034 method Methods 0.000 claims abstract description 15
- 239000002299 complementary DNA Substances 0.000 claims description 42
- 238000006243 chemical reaction Methods 0.000 claims description 29
- 238000010839 reverse transcription Methods 0.000 claims description 26
- 239000006166 lysate Substances 0.000 claims description 9
- 102100034343 Integrase Human genes 0.000 claims description 8
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 claims description 8
- 239000003153 chemical reaction reagent Substances 0.000 claims description 7
- 239000003161 ribonuclease inhibitor Substances 0.000 claims description 7
- 108010012306 Tn5 transposase Proteins 0.000 claims description 6
- 108700009124 Transcription Initiation Site Proteins 0.000 claims description 6
- 229920004890 Triton X-100 Polymers 0.000 claims description 6
- 230000003321 amplification Effects 0.000 claims description 6
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 6
- 230000005030 transcription termination Effects 0.000 claims description 6
- GUAHPAJOXVYFON-ZETCQYMHSA-N (8S)-8-amino-7-oxononanoic acid zwitterion Chemical compound C[C@H](N)C(=O)CCCCCC(O)=O GUAHPAJOXVYFON-ZETCQYMHSA-N 0.000 claims description 5
- 238000012408 PCR amplification Methods 0.000 claims description 5
- 102000001708 Protein Isoforms Human genes 0.000 claims description 5
- 108010029485 Protein Isoforms Proteins 0.000 claims description 5
- 239000013504 Triton X-100 Substances 0.000 claims description 5
- 238000003556 assay Methods 0.000 claims description 5
- 230000009089 cytolysis Effects 0.000 claims description 5
- KWIUHFFTVRNATP-UHFFFAOYSA-N Betaine Natural products C[N+](C)(C)CC([O-])=O KWIUHFFTVRNATP-UHFFFAOYSA-N 0.000 claims description 4
- KWIUHFFTVRNATP-UHFFFAOYSA-O N,N,N-trimethylglycinium Chemical compound C[N+](C)(C)CC(O)=O KWIUHFFTVRNATP-UHFFFAOYSA-O 0.000 claims description 4
- 229960003237 betaine Drugs 0.000 claims description 4
- 230000006037 cell lysis Effects 0.000 claims description 4
- 238000005336 cracking Methods 0.000 claims description 4
- 108020004707 nucleic acids Proteins 0.000 claims description 4
- 102000039446 nucleic acids Human genes 0.000 claims description 4
- 150000007523 nucleic acids Chemical class 0.000 claims description 4
- 238000000746 purification Methods 0.000 claims description 4
- JLVVSXFLKOJNIY-UHFFFAOYSA-N Magnesium ion Chemical compound [Mg+2] JLVVSXFLKOJNIY-UHFFFAOYSA-N 0.000 claims description 3
- 238000010276 construction Methods 0.000 claims description 3
- 238000013467 fragmentation Methods 0.000 claims description 3
- 238000006062 fragmentation reaction Methods 0.000 claims description 3
- 229910001425 magnesium ion Inorganic materials 0.000 claims description 3
- 238000003776 cleavage reaction Methods 0.000 claims description 2
- 230000007017 scission Effects 0.000 claims description 2
- 230000015572 biosynthetic process Effects 0.000 claims 1
- 238000003786 synthesis reaction Methods 0.000 claims 1
- 238000001514 detection method Methods 0.000 abstract description 10
- 230000035945 sensitivity Effects 0.000 abstract description 8
- 108020004999 messenger RNA Proteins 0.000 abstract description 7
- 238000004422 calculation algorithm Methods 0.000 abstract description 3
- 238000005457 optimization Methods 0.000 abstract description 3
- 238000011160 research Methods 0.000 abstract description 3
- 230000009286 beneficial effect Effects 0.000 abstract description 2
- 238000007405 data analysis Methods 0.000 abstract description 2
- 238000012049 whole transcriptome sequencing Methods 0.000 abstract description 2
- 210000004027 cell Anatomy 0.000 description 21
- 239000011324 bead Substances 0.000 description 12
- 238000002156 mixing Methods 0.000 description 9
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 8
- 239000007788 liquid Substances 0.000 description 6
- 239000000203 mixture Substances 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- 239000006228 supernatant Substances 0.000 description 6
- 238000001914 filtration Methods 0.000 description 5
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- 238000001035 drying Methods 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 239000000047 product Substances 0.000 description 4
- 108090000623 proteins and genes Proteins 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- 238000002955 isolation Methods 0.000 description 3
- 108020004414 DNA Proteins 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- 238000003149 assay kit Methods 0.000 description 2
- 238000010009 beating Methods 0.000 description 2
- 238000007664 blowing Methods 0.000 description 2
- 239000006285 cell suspension Substances 0.000 description 2
- 238000007865 diluting Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 239000004615 ingredient Substances 0.000 description 2
- 230000010355 oscillation Effects 0.000 description 2
- 238000011002 quantification Methods 0.000 description 2
- 239000002096 quantum dot Substances 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- 108091093088 Amplicon Proteins 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 102000002322 Egg Proteins Human genes 0.000 description 1
- 108010000912 Egg Proteins Proteins 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000003766 bioinformatics method Methods 0.000 description 1
- 238000010170 biological method Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 238000000684 flow cytometry Methods 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 210000004681 ovum Anatomy 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 102000004169 proteins and genes Human genes 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000007671 third-generation sequencing Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Immunology (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Analytical Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The invention discloses an analysis method and a kit for sequencing single-cell transcript isomers. The invention adds label sequences to two ends of single mRNA, carries out whole transcriptome sequencing after capture and enrichment, and judges the full-length transcript by accurately judging the head and tail structures of different transcript isomers, and the new research strategy can solve the prior technical problem and has the following beneficial effects: 1. improving the sensitivity of full-length transcript detection: because the head-tail terminal region of the target transcript is detected, signals are mostly concentrated at the terminal edge position of the transcript, so that the detection degree of the full-length transcript is higher. 2. Simple and easy, with low costs: the method is simple and easy to implement, wide in application range and low in cost. 3. Improving the accuracy of transcript detection: the invention can reach higher accuracy by combining the tag sequence of the terminal sequence and the optimization of the data analysis algorithm.
Description
The technical field is as follows:
the invention belongs to the field of biological medicine, and particularly relates to an analysis method and a kit for sequencing single-cell transcript isomers.
Background art:
the variability of gene transcript isoforms results in a diversity of gene functions, and different selection of mRNA at the transcription start site, the transcription termination site can result in differences in mRNA stability, localization, and translation efficiency, and even the production of truncated proteins with different functions. Research proves that the single cells composing human organ tissues have remarkable heterogeneity of functional and molecular characteristics, and the existing single cell sequencing methods can distinguish the gene expression difference between different cells, but the methods are mostly based on the capture sequencing of the 5 'end or the 3' end of mRNA, but not on the detection of full-length transcripts. Two single cell full-length transcript sequencing methods reported at present have the problems of low sensitivity, high cost and the like, for example, smart2-seq can carry out full-length detection on a single cell transcriptome, but most detection signals are distributed in the middle of a gene, and are rarely distributed at the tail end of a transcript, so that the sensitivity and the accuracy of the method for analyzing the full-length transcript are low. Pacbio long-reading third-generation sequencing can perform full-length sequencing on mRNA transcripts, but has the problems of high cost, high error rate and the like.
The invention content is as follows:
the invention aims to provide an analysis method and a kit for sequencing single-cell transcript isomers, which have the advantages of high sensitivity, low cost and good accuracy.
The invention discloses an analysis method for sequencing single-cell transcript isomers, which comprises the following steps:
a. cell lysis: separating single cells, and cracking the single cells by using a lysis solution to obtain a cracking product, wherein the lysis solution contains oligo T with an A label sequence;
b. reverse transcription is carried out on the cleavage products by reverse transcriptase, during the reverse transcription, a plurality of C basic groups are added at the 3' tail end of a cDNA chain formed by reverse transcription by the reverse transcriptase, the C basic groups are matched with a TSO primer which is provided with a B tag sequence and contains locked nucleic acid to achieve template conversion, the tail end position of the transcribed cDNA is marked by the tag sequence at the tail end of the TSO primer and a dry C sequence, oligo T with the tag sequence is used as a primer of the reverse transcription reaction, the cDNA is tagged during the reverse transcription, and therefore a first chain of the cDNA generated by the reverse transcription is obtained;
c. after a first strand of cDNA is generated by reverse transcription, PCR amplification is carried out by primers matched with sequences introduced at two ends of the cDNA to obtain the cDNA of a full-length transcriptome;
d. and (3) utilizing Tn5 transposase to construct a library of the cDNA after full-length amplification, fragmenting the cDNA, adding a sequencing universal joint to construct the library, then sequencing, extracting corresponding sequences according to the tag sequences of 3 'and 5' added during reverse transcription, comparing the extracted sequences with a genome, extracting a unique comparison sequence, using CAGEr to perform downstream analysis on the 5 'end sequence to identify a transcription start site, and using the 3' end sequence to identify a transcription termination site.
The sequence of the A tag and the sequence of the B tag may be the same or different.
Preferably, the lysate of step a contains Triton X-100, RNase inhibitor, oligo T with A tag sequence and dNTPs. Triton X-100 lyses cells, and an RNase inhibitor can ensure the integrity of RNA; taking oligodT with a tag sequence as a primer of reverse transcription reaction, and carrying a tag on cDNA while carrying out reverse transcription; dNTPs are added in this step, and the yield of RT-PCR can be improved.
Preferably, in the step b, additional magnesium ions and betaine are added in the reverse transcription reaction system. Some RNAs form secondary structures (such as hairpin or loop structures) which may be overcome by betaines and magnesium ions due to steric hindrance which may result in termination of reverse transcriptase extension on the RNA strand; in addition, the protective capability of thermal stability of the betaine is realized, after the reaction is carried out for 90min at 42 ℃, the temperature can be increased to 50 ℃ and kept for 2min, which is favorable for opening the secondary structure of RNA, and then the temperature is reduced to 42 ℃ and kept for 2min to continue the reverse transcription reaction, so that the temperature is increased and reduced for 10 cycles, and the yield of cDNA is improved to the maximum extent.
Preferably, the PCR amplification in step c is performed in limited cycles, such as 18 cycles. In order to simplify the steps, improve the detection sensitivity and prevent the loss of cDNA, the first strand is synthesized and directly amplified by KAPA HiFi HotStartStreammix direct PCR without purification.
D, constructing a library by using Tn5 transposase of illumina on the cDNA after full-length amplification, and completing cDNA fragmentation and sequencing universal adaptor addition in the same step by using the method; the library of fragmenting and adding sequencing adaptor primer utilizes I5 and I7index primer matched with the tag sequence introduced by cDNA3 'and 5' to make PCR so as to attain the goal of trapping, enriching, library and building two ends of cDNA, and the library fragments are distributed between 200-400bp, so that it can use less sequencing data quantity to meet the requirements for identifying TSS and TES.
Analyzing and filtering sequencing data, and identifying and analyzing the detected TSS and TES, wherein the method specifically comprises the following steps: filtering low-quality and sequencing connector sequences from a fastq file obtained by sequencing, extracting corresponding sequences according to the added 3 'and 5' tag sequences during reverse transcription, comparing the extracted sequences with a genome, and extracting a unique comparison sequence; extracting data, and performing downstream analysis on the sequence at the 5' end by using CAGER to identify a transcription initiation site; the 3' end sequence identifies the transcription termination site.
The second objective of the present invention is to provide a single cell transcript isoform sequencing kit, which comprises oligo T with A-tag sequence, TSO primer with B-tag sequence containing locked nucleic acid and primer matching with the sequence introduced at both ends of the first strand of cDNA.
Further preferably, the kit further comprises a lysate, a reverse transcriptase reaction reagent, a cDNA PCR reaction reagent, Tn5 transposase and a sequencing universal linker.
The lysate contains Triton X-100, RNase inhibitor, oligo D T with A label sequence and dNTPs.
The cDNA PCR reaction reagent is KAPA HiFi HotStart Readymix.
The invention adds label sequences to two ends of single mRNA, carries out whole transcriptome sequencing after capture and enrichment, and judges the full-length transcript by accurately judging the head and tail structures of different transcript isomers, and the new research strategy can solve the prior technical problem and has the following beneficial effects:
1. improving the sensitivity of full-length transcript detection: because the head-tail terminal region of the target transcript is detected, signals are mostly concentrated at the terminal edge position of the transcript, so that the detection degree of the full-length transcript is higher.
2. Simple and easy, with low costs: the method is simple and easy to implement, wide in application range and low in cost. In addition, because sequencing signals are mostly concentrated at the head and tail of the transcript, the sequencing depth required for detecting the same number of transcripts can be reduced, thereby reducing the sequencing cost.
3. Improving the accuracy of transcript detection: the invention can reach higher accuracy by combining the tag sequence of the terminal sequence and the optimization of the data analysis algorithm.
Description of the drawings:
FIG. 1 is a technical schematic of the analysis method of the present invention;
FIG. 2 is a graph comparing the effect of the analysis method of the present invention and the existing single cell sequencing method;
FIG. 3 is an accuracy analysis chart of the analysis method of the present invention;
FIG. 4 shows TSS and TES identified by the assay of the invention.
The specific implementation mode is as follows:
the following examples are further illustrative of the present invention and are not intended to be limiting thereof.
Example 1:
the technical schematic diagram of the analysis method for single-cell transcript isoform sequencing in this example is shown in fig. 1, and the sequences of the primers used are as follows (table 1):
TABLE 1
Wherein N is A, G, C or T, V is A, G or C.
The specific operation steps are as follows:
1. single cell isolation and lysis
Single cells comprising the complete transcriptome can be isolated using, for example, physical mechanical, chemical, biological methods, such as microfluidics, flow cytometry, aspiration separation, gradient dilution, and the like. The method comprises the steps of obtaining an ovum cell suspension of a mouse strain C57/B6 through primary isolation by the method, adding the cell suspension into PBS night drops on a culture dish, diluting the PBS night drops appropriately, obtaining single cells through mouth pipette isolation under a microscope, placing the obtained single cells into a PCR tube containing 4ul of lysate (controlling the volume of the isolated cells to be as small as possible), wherein the specific formula of the lysate of each cell is shown in the table 2:
TABLE 2
Ingredient (solvent is water) | Volume of |
0.2% Triton-X100 | 1.9ul |
RNase inhibitor(40U/ul) | 0.1ul |
oligo-dT 30VN primer(10uM) | 1ul |
dNTP(10mM) | 1ul |
After the single cell is added into the PCR tube of the lysate, immediately vortex the PCR tube, centrifugally collecting the liquid in the tube wall for a short time, and quickly putting the liquid on ice.
2. Reverse transcription of mRNA, addition of sequence tags 5 'and 3' to cDNA, and amplification of full-length cDNA
2.1, placing the PCR tube after cell lysis in a PCR reaction instrument, reacting at 70 ℃ for 3min, immediately placing the PCR tube on ice after the reaction is finished, then centrifuging for a short time to collect liquid in the tube wall, and then placing the PCR tube back on the ice.
2.2 preparing reverse transcription reaction systems, each cell sample reaction system formulation is shown in Table 3, when a plurality of reaction systems are prepared, one more reaction is added to prevent sample loss
TABLE 3
Composition (I) | Volume of |
Super Script III first-strand buffer | 2.00ul |
Super Script III reverse transcriptase | 0.5ul |
RNAse inhibitor(40Uμl-1) | 0.25ul |
DTT(100mM) | 0.50ul |
Betaine(5M) | 2.00ul |
MgCl2(1M) | 0.06ul |
TSO(100μM) | 0.20ul |
ERCC(1:1000) | 0.5ul |
Total | 6.01 |
Add 6.01ul of the above reverse transcription system into the PCR tube of the previous step, gently blow and mix several times with pipette gun, carefully not introduce air bubbles, centrifuge briefly to collect the liquid on the tube wall
2.3 setting the program of the PCR instrument as shown in Table 4, and when the temperature of the PCR instrument is close to 42 ℃, putting the PCR tube into the PCR instrument for reaction
TABLE 4
step | Temperature of | Time of |
|
1 | 42 | |
|
2 | 50 | 2min | |
3 | 42 | 2min | go to step 2,9cycles |
4 | 70 | |
|
5 | 4 | hold |
2.4 adding the following components (Table 5) into the PCR tube of the previous reaction, shaking and mixing, and centrifuging to collect the liquid on the tube wall
TABLE 5
Composition (I) | Volume of |
KAPA HiFi HotStart ReadyMix(2×) | 12.5ul |
IS PCR primers(10μM) | 0.25ul |
Nuclease-free water | 2.25ul |
Total | 15 |
2.5 setting the program of the PCR instrument as follows (Table 6), placing the PCR tube into the PCR reaction instrument for reaction
TABLE 6
2.6 purification after PCR
2.6.1 placing Ampure XP beads to room temperature for balancing for 15-30min, and mixing evenly by vortex oscillation before use;
2.6.2 adding 25ul Ampure XP beads (1:1) into the product obtained after the PCR in the step 2.5, blowing, beating and mixing evenly, and incubating for 8min at room temperature;
2.6.2 placing the PCR tube in a magnetic frame, and standing for 5min or until the solution is clear;
2.6.3 carefully aspirate the supernatant and retain the beads;
2.6.4 adding 200ul of freshly prepared 80% ethanol, incubating for 30s, and removing the supernatant;
2.6.5 repeating the previous step;
2.6.6 completely sucking up the ethanol remained in the PCR tube, and drying the beads at room temperature for 5min, wherein large cracks on the surfaces of the beads caused by excessive drying are avoided;
2.6.7 removing PCR from the magnetic frame, adding 20ul nuclease-free water, mixing and suspending beads, standing at room temperature for 5 min;
2.6.8 moving the PCR tube into the magnetic frame again, standing for 2min until the solution is clarified, carefully sucking the supernatant to a new PCR tube, and marking;
2.7 inspection of cDNA samples using Agilent Bioanalyzer 2100, good quality cDNA with distinct peaks appearing between 1.5-2 kb;
2.8 Using the QubitTMThe cDNA was quantified using the dsDNAHS Assay Kit, according to the Kit instructionsPerforming operation, diluting to 0.1ng/ul after the quantification is finished, and preparing library construction;
3. tn5 of illumina is used for constructing the head and tail ends of cDNA of a second-generation sequencing library for capturing
3.1 fragmentation of cDNA and linker addition Using Tn 5in the following reaction scheme (Table 7) with a cDNA input of 0.1ng
TABLE 7
Composition (I) | Volume of |
Tagmentation DNA buffer | 4ul |
Amplicon Tagmentation mix | 1.67ul |
Resuspension buffer | 1.33ul |
cDNA(0.1ng) | 1ul |
After all the components are added, mixing evenly and centrifuging for a short time
3.2 setting the PCR temperature to 55 ℃ and reacting for 10 min; after the reaction, 2ul of NT buffer was added immediately and left at room temperature for 5min
3.3 to the reaction system of the previous step, the following ingredients (Table 8) were added:
TABLE 8
Mixing, centrifuging for a short time, and collecting liquid on the tube wall
The sequence of the I5index primer is as follows:
5'-AATGATACGGCGACCACCGAGATCTACACTAGATCGCTCGTCGGCAGCGTCAGATGTGTATAAGAGACAGGTGGTATCAACGCAGAGT-3';
the sequence of the I7index primer is as follows:
5'-CAAGCAGAAGACGGCATACGAGATTCGCCTTAGTCTCGTGGGCTCGG-3';
3.4 set up the PCR program as follows (Table 9), place the system into the PCR apparatus for reaction
TABLE 9
step | Temperature of | Time of |
|
1 | 72 | |
|
2 | 98 | 30s | |
3 | 98 | 10s | |
4 | 55 | |
|
5 | 72 | 60s | go to step 3,10-15cycles |
6 | 72 | 5min | |
7 | 4 | hold |
3.5 post-PCR purification of the library
3.5.1 placing Ampure XP beads to room temperature for balancing for 15-30min, and mixing evenly by vortex oscillation before use;
3.5.2 adding 21ul Ampure XP beads (1:0.8) into the product obtained after the PCR in the step 3.4, blowing, beating and mixing evenly, and incubating for 8min at room temperature;
3.5.2 placing the PCR tube in a magnetic frame, and standing for 5min or until the solution is clear;
3.5.3 carefully aspirate the supernatant and retain the beads;
3.5.4 adding 200ul of freshly prepared 80% ethanol, incubating for 30s, and removing the supernatant;
3.5.5 repeating the previous step;
3.5.6 sucking up the residual ethanol in the PCR tube, drying the beads at room temperature for 5min, wherein excessive drying should be avoided to cause large cracks on the surfaces of the beads;
3.5.7 removing the PCR from the magnetic frame, adding 20ul of nuclease-free water, mixing and suspending the beads uniformly, and standing at room temperature for 5 min;
3.5.8 move the PCR tube into the magnetic frame again, stand for 2min until the solution is clarified, carefully suck the supernatant into a new PCR tube, and mark.
Thus, a library was obtained.
4. Quality control and quantification of libraries
4.1 the library constructed in step 3.5 was quality checked using an Agilent Bioanalyzer 2100, the library fragments were concentrated between 300 and 500 bp.
4.2 Using the QubitTMThe dsDNA HS Assay Kit quantitates the library constructed in step 3.5, operates according to Kit instructions, and is diluted to 10nM for sequencing after quantitation is complete.
5. Bioinformatics analysis
Filtering low-quality and sequencing connector sequences from a fastq file obtained by sequencing, extracting corresponding sequences according to 3 'and 5' tag sequences added during reverse transcription, comparing the extracted sequences with a genome, and extracting a unique comparison sequence;
the extracted data, the 5 'end sequence is analyzed by CAGER to identify the transcription start site, and the 3' end sequence identifies the transcription termination site.
The TSS and TES identified by this method are shown in FIG. 4.
The commercial reagents used in the experimental procedure are shown in the attached Table 10, and all primers were synthesized by Biotech.
Second, effect comparison
The experimental method comprises the following steps: taking mouse egg cells of the same days, and respectively carrying out the experiment of the invention and the Smart-seq2, such as cell separation, cell lysis, full-length transcriptome amplification, library construction and the like; the method of the invention is shown in the above specific experimental procedures, and the method of Smart-seq2 refers to the operation flow in the Smart-seq2 literature (Full-length RNA-seq from single species using Smart-seq2.nat Protoc, VOL.9 NO.1,2014,171 and 181); and sequencing the two libraries by using the same sequencing data quantity, filtering, comparing and quantifying the data obtained by sequencing, and calculating the sensitivity and accuracy of the TSS and TES of the transcripts detected by the two methods respectively.
Specific results are shown in fig. 2 and 3, and it can be seen from fig. 2 and 3 that the analysis method for single-cell transcript isoform sequencing (cap-seq) according to the present invention is significantly superior to the conventional method (fig. 2) in capturing TES and TSS sensitivity in comparison with the conventional single-cell sequencing method (Smart-seq2), regardless of TSS or TES. The method has high accuracy, the accuracy can approach 100% after algorithm optimization and data filtration, and the transcription initiation site and the transcription termination site can be analyzed and identified more accurately (figure 3).
Attached table 10:
Claims (10)
1. an analytical method for sequencing single cell transcript isoforms comprising the steps of:
a. cell lysis: separating single cells, and cracking the single cells by using a lysis solution to obtain a cracking product, wherein the lysis solution contains oligo T with an A label sequence;
b. reverse transcription is carried out on the cleavage products by reverse transcriptase, during the reverse transcription, a plurality of C basic groups are added at the 3' tail end of a cDNA chain formed by reverse transcription by the reverse transcriptase, the C basic groups are matched with a TSO primer which is provided with a B tag sequence and contains locked nucleic acid to achieve template conversion, the tail end position of the transcribed cDNA is marked by the tag sequence at the tail end of the TSO primer and a dry C sequence, oligo T with the tag sequence is used as a primer of the reverse transcription reaction, the cDNA is tagged during the reverse transcription, and therefore a first chain of the cDNA generated by the reverse transcription is obtained;
c. after a first strand of cDNA is generated by reverse transcription, PCR amplification is carried out by primers matched with sequences introduced at two ends of the cDNA to obtain the cDNA of a full-length transcriptome;
d. and (3) utilizing Tn5 transposase to construct a library of the cDNA after full-length amplification, fragmenting the cDNA, adding a sequencing universal joint to construct the library, then sequencing, extracting corresponding sequences according to the tag sequences of 3 'and 5' added during reverse transcription, comparing the extracted sequences with a genome, extracting a unique comparison sequence, using CAGEr to perform downstream analysis on the 5 'end sequence to identify a transcription start site, and using the 3' end sequence to identify a transcription termination site.
2. The assay of claim 1 wherein the a tag sequence and the B tag sequence are the same or different.
3. The assay of claim 1, wherein the lysate of step a comprises Triton X-100, rnase inhibitor, oligo T with a tag sequence, and dNTPs.
4. The method of claim 1, wherein in step b, additional magnesium ions and betaine are added to the reverse transcription reaction system.
5. The assay method of claim 1, wherein the PCR amplification in step c is performed in 18 limited cycles, and directly performed by KAPA HiFi HotStart Readymix direct PCR amplification without purification after the first strand synthesis.
6. The assay method according to claim 1, wherein in step d, the cDNA after full-length amplification is subjected to library construction using Tn5 transposase from illumina, and by this method, cDNA fragmentation and sequencing universal adaptor addition are performed in the same step; fragmenting and adding a library of sequencing adaptor primers, and carrying out PCR by using I5 and I7index primers matched with tag sequences introduced by cDNA3 'and 5', thereby achieving capture enrichment of the head and tail ends of the cDNA, and constructing the library.
7. A kit for sequencing single cell transcript isomers is characterized by comprising oligo T with an A tag sequence, TSO primer with a B tag sequence and containing locked nucleic acid and primer matched with sequences introduced at two ends of a first strand of cDNA.
8. The kit of claim 7, further comprising a lysate, reverse transcriptase reaction reagents, cDNA PCR reaction reagents, Tn5 transposase, and a sequencing universal linker.
9. The kit of claim 7, wherein the lysate comprises Triton X-100, RNase inhibitor, oligo T with A tag sequence and dNTPs.
10. The kit of claim 7, wherein the cDNA PCR reaction reagent is KAPA HiFi HotStart Readymix.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910611776.XA CN110643692B (en) | 2019-07-08 | 2019-07-08 | Analysis method and kit for sequencing single-cell transcript isomer |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910611776.XA CN110643692B (en) | 2019-07-08 | 2019-07-08 | Analysis method and kit for sequencing single-cell transcript isomer |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110643692A true CN110643692A (en) | 2020-01-03 |
CN110643692B CN110643692B (en) | 2024-05-17 |
Family
ID=68989845
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910611776.XA Active CN110643692B (en) | 2019-07-08 | 2019-07-08 | Analysis method and kit for sequencing single-cell transcript isomer |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110643692B (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111549099A (en) * | 2020-04-23 | 2020-08-18 | 广州再生医学与健康广东省实验室 | Third-generation sequencing-based single-cell transcriptome sequencing method |
CN111876408A (en) * | 2020-06-10 | 2020-11-03 | 南京派森诺基因科技有限公司 | Method for constructing low-initial-quantity transcriptome library of eukaryote |
CN114606301A (en) * | 2020-12-07 | 2022-06-10 | 中国科学院脑科学与智能技术卓越创新中心 | Single cell transcriptome sequencing method |
WO2022256227A1 (en) * | 2021-06-01 | 2022-12-08 | The Regents Of The University Of California | Methods for fragmenting complementary dna |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140213485A1 (en) * | 2013-01-28 | 2014-07-31 | Yale University | Methods For Preparing cDNA From Low Quantities of Cells |
US20160122753A1 (en) * | 2013-06-12 | 2016-05-05 | Tarjei Mikkelsen | High-throughput rna-seq |
CN107893100A (en) * | 2017-11-16 | 2018-04-10 | 序康医疗科技(苏州)有限公司 | A kind of unicellular mRNA reverse transcriptions and the method for amplification |
CN108138365A (en) * | 2015-11-17 | 2018-06-08 | 深圳华大生命科学研究院 | A kind of unicellular transcript profile banking process of high throughput |
WO2019084046A1 (en) * | 2017-10-23 | 2019-05-02 | The Broad Institute, Inc. | Single cell cellular component enrichment from barcoded sequencing libraries |
CN109811045A (en) * | 2017-11-22 | 2019-05-28 | 深圳华大智造科技有限公司 | The construction method of high-throughput unicellular overall length transcript profile sequencing library and its application |
CN109971849A (en) * | 2017-12-28 | 2019-07-05 | 安诺优达基因科技(北京)有限公司 | The unicellular multisample mixing banking process of liver cancer related gene |
-
2019
- 2019-07-08 CN CN201910611776.XA patent/CN110643692B/en active Active
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140213485A1 (en) * | 2013-01-28 | 2014-07-31 | Yale University | Methods For Preparing cDNA From Low Quantities of Cells |
US20160122753A1 (en) * | 2013-06-12 | 2016-05-05 | Tarjei Mikkelsen | High-throughput rna-seq |
CN108138365A (en) * | 2015-11-17 | 2018-06-08 | 深圳华大生命科学研究院 | A kind of unicellular transcript profile banking process of high throughput |
WO2019084046A1 (en) * | 2017-10-23 | 2019-05-02 | The Broad Institute, Inc. | Single cell cellular component enrichment from barcoded sequencing libraries |
CN107893100A (en) * | 2017-11-16 | 2018-04-10 | 序康医疗科技(苏州)有限公司 | A kind of unicellular mRNA reverse transcriptions and the method for amplification |
CN109811045A (en) * | 2017-11-22 | 2019-05-28 | 深圳华大智造科技有限公司 | The construction method of high-throughput unicellular overall length transcript profile sequencing library and its application |
CN109971849A (en) * | 2017-12-28 | 2019-07-05 | 安诺优达基因科技(北京)有限公司 | The unicellular multisample mixing banking process of liver cancer related gene |
Non-Patent Citations (6)
Title |
---|
EVAN Z MACOSKO 等: "Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets", 《CELL》 * |
EVAN Z MACOSKO 等: "Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets", 《CELL》, vol. 161, no. 5, 21 May 2015 (2015-05-21), pages 1 * |
WANZE CHEN 等: "Profiling of Single-Cell Transcriptomes", 《CURR PROTOC MOUSE BIOL》 * |
WANZE CHEN 等: "Profiling of Single-Cell Transcriptomes", 《CURR PROTOC MOUSE BIOL》, vol. 7, no. 3, 30 September 2017 (2017-09-30), XP055697713, DOI: 10.1002/cpmo.30 * |
万睿璇 等: "单细胞基因组和转录组的共测定方法研究进展", 《中国现代应用药学》 * |
万睿璇 等: "单细胞基因组和转录组的共测定方法研究进展", 《中国现代应用药学》, vol. 35, no. 9, 30 September 2018 (2018-09-30) * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111549099A (en) * | 2020-04-23 | 2020-08-18 | 广州再生医学与健康广东省实验室 | Third-generation sequencing-based single-cell transcriptome sequencing method |
CN111549099B (en) * | 2020-04-23 | 2021-07-30 | 生物岛实验室 | Third-generation sequencing-based single-cell transcriptome sequencing method |
CN111876408A (en) * | 2020-06-10 | 2020-11-03 | 南京派森诺基因科技有限公司 | Method for constructing low-initial-quantity transcriptome library of eukaryote |
CN114606301A (en) * | 2020-12-07 | 2022-06-10 | 中国科学院脑科学与智能技术卓越创新中心 | Single cell transcriptome sequencing method |
WO2022256227A1 (en) * | 2021-06-01 | 2022-12-08 | The Regents Of The University Of California | Methods for fragmenting complementary dna |
Also Published As
Publication number | Publication date |
---|---|
CN110643692B (en) | 2024-05-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110643692A (en) | Analysis method and kit for sequencing single cell transcript isomer | |
CN108103055B (en) | Method for reverse transcription of single-cell RNA and construction of library | |
Salimullah et al. | NanoCAGE: a high-resolution technique to discover and interrogate cell transcriptomes | |
CN105969873B (en) | Litopenaeus vannamei osmotic pressure regulation related functional gene EST-SSR marker, and specific primer and detection method thereof | |
CN105238859B (en) | A kind of method for obtaining chicken full-length genome high density SNP marker site | |
CN111440896B (en) | Novel beta coronavirus variation detection method, probe and kit | |
US11761037B1 (en) | Probe and method of enriching target region applicable to high-throughput sequencing using the same | |
WO2020233094A1 (en) | Molecular linker for ngs library construction, preparation method therefor and use thereof | |
CN107893260B (en) | Method and kit for constructing transcriptome sequencing library by efficiently removing ribosomal RNA | |
CN109517881B (en) | High-throughput sequencing library construction method for body fluid trace free RNA | |
WO2021103784A1 (en) | Multi-omics method for combined sequencing of single-cell transcriptiome and translatom | |
CN110218781B (en) | Composite amplification system of 21 micro haplotype sites, next generation sequencing and typing kit and typing method | |
CN107893100A (en) | A kind of unicellular mRNA reverse transcriptions and the method for amplification | |
CN110396516B (en) | Absolute quantitative transcriptome library construction method based on unique recognition sequence | |
CN113293196A (en) | Single cell nucleus extraction method suitable for frozen tissue | |
WO2014079002A1 (en) | Tracing method and detection kit for test sample in next-generation sequencing technique of dna | |
WO2011146942A1 (en) | Methods and kits to analyze microrna by nucleic acid sequencing | |
CN109971843B (en) | Sequencing method of single cell transcriptome | |
CN107075559A (en) | The pre-treating method of clinical micro biopsy paraffin-embedded tissue | |
CN111979353A (en) | Library construction method for sequencing novel coronavirus SARS-CoV-2 full-length genome | |
CN107513570A (en) | The method and kit of more target library constructions are uniformed based on high-flux sequence | |
CN115247203A (en) | Kit for constructing space transcriptome sequencing library and construction method of cDNA library | |
Bhattacharya et al. | Experimental toolkit to study RNA level regulation | |
CN111534858B (en) | Library construction method for high-throughput sequencing and high-throughput sequencing method | |
CN110387399B (en) | Method for linearly amplifying double-stranded DNA and application |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |