WO2022021279A1 - Support de co-marquage d'acides multinucleiques, son procédé de préparation et son application - Google Patents

Support de co-marquage d'acides multinucleiques, son procédé de préparation et son application Download PDF

Info

Publication number
WO2022021279A1
WO2022021279A1 PCT/CN2020/106089 CN2020106089W WO2022021279A1 WO 2022021279 A1 WO2022021279 A1 WO 2022021279A1 CN 2020106089 W CN2020106089 W CN 2020106089W WO 2022021279 A1 WO2022021279 A1 WO 2022021279A1
Authority
WO
WIPO (PCT)
Prior art keywords
nucleic acid
cell
sequence
support
labeled
Prior art date
Application number
PCT/CN2020/106089
Other languages
English (en)
Chinese (zh)
Inventor
焦少灼
韩金桓
李研
刘书杰
马兴勇
罗云超
桑国芹
谢莹莹
徐猛
李宗文
Original Assignee
北京寻因生物科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京寻因生物科技有限公司 filed Critical 北京寻因生物科技有限公司
Priority to CN202080005408.1A priority Critical patent/CN114096678A/zh
Priority to PCT/CN2020/106089 priority patent/WO2022021279A1/fr
Publication of WO2022021279A1 publication Critical patent/WO2022021279A1/fr

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B40/00Libraries per se, e.g. arrays, mixtures
    • C40B40/04Libraries containing only organic compounds
    • C40B40/06Libraries containing nucleotides or polynucleotides, or derivatives thereof
    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B50/00Methods of creating libraries, e.g. combinatorial synthesis
    • C40B50/06Biochemical methods, e.g. using enzymes or whole viable microorganisms

Definitions

  • the present invention relates to a kind of multiple nucleic acid co-labeling support and its preparation method and application.
  • nucleic acid molecule reactions including nucleic acid hybridization, extension, amplification and other reactions are carried out in liquid phase.
  • the liquid phase provides a uniform and stable environment for the nucleic acid and enzyme reactions involved in the reaction to maximize the output.
  • attaching nucleic acids or enzymes involved in the reaction to the surface of the solid phase can give the nucleic acid spatial position information to facilitate purification, separation, detection and analysis. Therefore, more and more solid phase nucleic acids have been developed.
  • the reaction is used for nucleic acid sequence analysis and nucleic acid quantification, such as nucleic acid chip technology in which oligonucleotides are immobilized on a substrate in an orderly manner, bridge amplification and microsphere emulsion amplification technology for next-generation gene sequencing, and high-throughput applications Nucleic acid encoding microbeads for single-cell sequencing, etc.
  • SNPs Single nucleotide polymorphisms
  • GWAS genome-wide association studies
  • paternity testing paternity testing
  • population identification Traditional SNP identification technologies can analyze hundreds of loci simultaneously, including TaqMan fluorescence analysis, KASPar identification technology, and direct PCR sequencing technology.
  • the advantage is that the experimental operation is flexible when the number of SNP loci to be detected is small, but the disadvantage is that a single sample is used.
  • the cost of building a library and sequencing a single sample is high, especially when the number of SNPs detected in a single sample is less than 1000, the average detection cost of a single SNP increases rapidly; when the number of SNPs to be detected in a single sample is Using multiplex PCR library construction combined with next-generation sequencing at 20-1000 can effectively reduce the average detection cost of a single SNP.
  • Multiplex PCR library building generally refers to adding multiple pairs of PCR primers to the PCR amplification system to amplify multiple target fragments at the same time, and then through the second step of universal primer amplification to form a nucleic acid library with adapters and sample tags required by the sequencer . Due to the addition of multiple pairs of primers at a higher concentration in the PCR amplification system, multiple primer-dimers are easily formed in multiplex PCR, and primer-dimers will directly affect the quality of the constructed library. Therefore, how to remove primer-dimers becomes a multiplex PCR construct key link in the library. According to public reports, Zuiyi Yang et al. can reduce the primer-dimer of multiplex PCR by optimizing the sequence of primer pairs.
  • Base modification can also effectively reduce primer dimers during the reaction.
  • primer-dimers can be removed by the methods disclosed above and post-PCR purification, multiplex PCR still has problems such as PCR bias caused by different efficiency of amplifying the target region and non-specific amplification caused by cross combination of primer pairs. More importantly, since the PCR amplification target regions of different primer pairs cannot overlap in liquid phase conditions, it is difficult for ordinary multiplex PCR to achieve continuous sequence analysis of long DNA fragments in a single reaction tube in a liquid phase system.
  • single-cell sequencing mainly includes single-cell genome sequencing, single-cell RNA sequencing, single-cell epigenome sequencing and spatial single-cell sequencing; from the perspective of detection throughput, it is mainly divided into low-throughput single-cell sequencing (one-time detection). 1-500 cells) and high-throughput single-cell sequencing (1000-10,000 cells at a time).
  • High-throughput single-cell RNA detection mainly includes three implementations: water-in-oil-based droplet separation technology, microplate-based beads labeling technology, and microfluidics.
  • Water-in-oil-based droplet segmentation technology is represented by 10X Genomics, Drop-Seq platform and inDrop platform. This technology uses microfluidic technology to encapsulate barcode-labeled microbeads and single cells in oil droplets and cleaves to release RNA containing polyA tails; each gel microbead is coupled with oligo dT containing cell tags and molecular tags.
  • mRNA is bound to the oligo dT nucleic acid molecule of cell tag and molecular tag, and then reverse transcription to cDNA from different cell sources to tag different cell tags and use for subsequent mixed library construction and sequencing analysis.
  • Microplate-based beads labeling technologies are represented by BD CytoSeq, SeqWell and microwell-seq.
  • This technology naturally settles cells into a microwell array with more than ten times the number of cells to ensure a single cell entry rate, and then adds cell label-labeled microbeads to the microwells to capture the mRNA after cell lysis; mRNA is bound to the cell label After the oligo dT with molecular tags, the cDNAs derived from different cells are labeled with different cell tags by reverse transcription and used for subsequent mixed library construction and sequencing analysis.
  • the droplet method completely isolates cells and label beads from other cells and beads through water-in-oil, effectively reducing the possibility of cross-contamination; at the same time, in addition to the realization of 3' RNA expression profiling libraries, the droplet protocol can also Tags and molecular tags are coupled with template switching sequences to achieve 5' single-cell RNA expression profiling; however, due to the instability and suspension characteristics of droplets, the single-cell library construction scheme based on droplet method cannot be used in RNA cells. The medium cannot be changed before and after labeling, thereby reducing the possibility of further complex reactions, especially the lack of positional information of the labeling beads.
  • the microplate method avoids the problem of probabilistic collision affecting the capture efficiency in 10X, and has better cell capture efficiency; the label beads have a fixed position after falling into the microwell, and more liquid exchange operations can be performed; however, due to the The microwell is a semi-closed structure with an open top, which can cause cellular RNA to diffuse out of the well, so all current microwell methods can only construct 3' single-cell RNA expression profiling libraries.
  • An object of the present invention is to provide a multi-nucleic acid co-labeled support.
  • Another object of the present invention is to provide a method for preparing a support for co-labeling multiple nucleic acids.
  • Another object of the present invention is to provide the application of multiple nucleic acid co-labeled supports.
  • the present invention modifies two or more nucleic acid molecules on the support, wherein one nucleic acid molecule is used to capture the target compound from the reaction pool and participate in a specific biochemical process together with other types of molecules co-labeled on the surface of the same solid-phase compound, including However, it is not limited to the application directions of multiplex PCR library construction, single-cell RNA expression profiling, single-cell transcriptome sequencing library construction, and single-cell multi-omics sequencing library construction.
  • the present invention provides a multi-nucleic acid co-labeled support, which includes a support body and a variety of nucleic acid labels located on the surface and/or inside of the support body, nucleic acid labeled on a single support At least include: one or more first nucleic acid markers, whose role at least includes capturing specific compounds in the reaction system to the surface of the support; one or more second nucleic acid markers, whose role at least includes participating in the capture on the surface of the support.
  • the specified biochemical reaction process for a specific compound is not limited to be performed by the reaction system.
  • the support body is solid beads and/or semi-solid hydrogel beads.
  • a plurality of nucleic acid co-labeled supports of the present invention are compositions comprising a plurality of supports.
  • the number of the first nucleic acid label and the second nucleic acid label on the same support can be ⁇ 1 and/or ⁇ 10 13 respectively.
  • the sequences of multiple first nucleic acid markers on the same support are the same or different; the sequences of the first nucleic acid markers on different supports are the same or different; the sequences of multiple second nucleic acid markers on the same support are the same or different; or the sequences of the second nucleic acid markers on different supports are the same or different.
  • the present invention also provides a method for making the described multiple nucleic acid co-labeled supports, comprising:
  • nucleic acids are labeled on the support body by grafting and/or grafting to obtain a support with multiple nucleic acids co-labeled.
  • the preparation method of the multiple nucleic acid co-labeled supports of the present invention includes:
  • the support body and the nucleic acid are respectively modified with functional units that can interact, so that the two react to label the nucleic acid on the support body;
  • the nucleic acid is directly synthesized on the support body according to the preset nucleotide sequence; and/or
  • Nucleic acid labeling is performed on the body of the support using a biochemical reaction for nucleic acid extension or ligation protocols.
  • the present invention also provides the 5' single-cell RNA expression profile analysis of the multiple nucleic acid co-labeled supports, the construction of a 5' single-cell VDJ library of a microwell array platform, and the construction of a 3' single-cell RNA library. , Construction of single-cell transcriptome library, single-cell multi-omics research, multiplex PCR and/or construction of multiplex PCR sequencing library applications.
  • the multiple nucleic acid co-labeled supports of the invention are used for 5' single cell RNA expression profiling.
  • template switching sequences containing cell tags and molecular tags and RNA capture sequences are fixed on the support.
  • at least two nucleic acid sequences are marked on the support: a first nucleic acid sequence and a second nucleic acid sequence;
  • the first nucleic acid sequence contains at least a capture sequence, which is used to capture the target nucleic acid molecule and serve as a primer for extension or reverse transcription;
  • the second nucleic acid sequence Sequences include cell tag sequences, which are used to tag molecules derived from all mRNAs in the same cell; different kinds of supports have different cell tags.
  • the support is made to capture the RNA released after single cell lysis in the micropores of the chip, and the RNA derived from the same cell is labeled with the same cell label by template switching during the reverse transcription process, and then the cDNA is realized by amplification Amplified and finally constructed as a 5' single-cell RNA expression profiling library.
  • the multiple nucleic acid co-labeled supports of the invention are 5' single-cell VDJ libraries used to construct microwell array platforms.
  • template switching sequences containing cell tags and molecular tags and RNA capture sequences are fixed on the support.
  • at least two nucleic acid sequences are marked on the support: a first nucleic acid sequence and a second nucleic acid sequence; the first nucleic acid sequence contains at least a capture sequence, which is used to capture the target nucleic acid molecule and serve as a primer for extension or reverse transcription;
  • the second nucleic acid sequence Sequences include cell tag sequences, molecular tag sequences and template switching sequences.
  • Cell tag sequences are used to label molecules derived from all mRNAs in the same cell; molecular tag sequences are used to label each reverse transcribed cDNA molecule from the same support.
  • the cDNA molecules reversely transcribed from different RNAs are marked with different molecular tags; the template switching sequence can be used as a template to extend the 3' end of the reverse transcribed cDNA to mark the molecular tag sequence and cell tag sequence; different species with different cell labels on the supports.
  • the support is made to capture the RNA released after single cell lysis in the micropores of the chip, and in the reverse transcription process, the RNA derived from the same cell is labeled with the same cell label through template switching, and further through TCR and BCR/
  • the constant region primers of the Ig gene realize the enrichment of TCR and BCR/Ig nucleic acid sequences and finally break them into a high-throughput single-cell VDJ sequencing library.
  • the multiple nucleic acid co-labeled supports of the invention are used to construct a 3' single cell RNA library.
  • a random primer containing a cell tag that can be conditionally blocked and an RNA capture sequence are fixed on the support, and specifically, at least two nucleic acid sequences are marked on the support: a first nucleic acid sequence and a second nucleic acid sequence; the first nucleic acid sequence The sequence contains at least a capture sequence, which is used to capture the target nucleic acid molecule and serve as primer extension or reverse transcription; the second nucleic acid sequence includes a conditional blockable random primer containing a cell tag, and the cell tag sequence is used to tag all cells derived from the same cell.
  • Molecules of mRNA different kinds of supports have different cell tags.
  • the support is made to capture the RNA released after single cell lysis in the micropores of the chip and reverse transcribed into cDNA, and the subsequent random primers containing cell tags are synthesized by two strands to achieve the same tag on the cDNA derived from the same cell. Cell labeling, followed by amplification of cDNA to construct a 3' single-cell RNA library.
  • the multiple nucleic acid co-labeled supports of the invention are used to construct single cell transcriptome libraries.
  • random primer sequences containing cell tags and RNA capture sequences are fixed on the support, different types of supports have different cell tags, and can detect any sequence of RNA molecules without being limited to the 3' end or 5' end;
  • the support includes two types of supports, each type of single support has at least two nucleic acid sequences, a combination of a first nucleic acid sequence and a second nucleic acid sequence, or a third nucleic acid sequence and a third nucleic acid sequence.
  • the first nucleic acid sequence contains at least a capture sequence for capturing target nucleic acid molecules
  • the second nucleic acid sequence includes a random primer sequence containing a cell tag, and the cell tag sequence is used to tag molecules derived from all mRNAs in the same cell
  • the third nucleic acid sequence includes a cell tag sequence and a capture sequence.
  • the support is made to capture the RNA released after the lysis of single cells in the micropores of the chip, and in the process of reverse transcription, the RNA derived from the same cell is labeled with the same cell label, and then the cDNA is amplified and amplified by amplification.
  • a single-cell RNA transcriptome library was constructed.
  • the multiple nucleic acid co-labeled supports of the present invention are used in single-cell multi-omics studies.
  • a library for constructing RNA expression levels and/or for detecting protein expression levels by nucleic acid tags of proteins is included.
  • RNA capture sequences containing cell tags and capture sequences for protein-labeled nucleic acid tags are fixed on the support, and different types of supports have different cell tags.
  • the first nucleic acid sequence includes at least a capture sequence for capturing the target nucleic acid molecule and extending as a primer;
  • the second nucleic acid sequence includes a cell tag sequence, a molecular tag sequence and a template switching sequence;
  • the cell tag sequence is used to label cells derived from the same cell All mRNA molecules of the mRNA;
  • the molecular tag sequence is used to label each reverse transcribed cDNA molecule, and the cDNA molecules reverse transcribed from different RNAs on the same support are marked with different molecular tags;
  • the template switching sequence can be used as a The template continues to extend the 3' end of the reverse transcribed cDNA to label the molecular tag sequence and cell tag sequence;
  • the third nucleic acid sequence includes the cell tag sequence, molecular tag sequence and protein nucleic acid tag capture sequence, and the protein nucleic acid tag capture sequence is used to capture And extend the protein nucleic acid marker in the same spatial structure as the single cell to be tested.
  • the support is made to capture RNA and protein nucleic acid tags released after single cell lysis in the micropores of the chip, and in the reverse transcription process, the RNA and protein nucleic acid tags derived from the same cell are labeled with the same cell tag. , and then finally constructed into a single-cell RNA transcriptome library and a protein-labeled nucleic acid library through amplification.
  • the multiple nucleic acid co-labeled supports of the invention are used to construct multiplex PCR sequencing libraries.
  • the primers that can interfere with each other are respectively fixed on different supports.
  • the supports comprise at least two types of supports: one or more primer-labeled supports of a first kind, and one or more primer-labeled supports of a second kind, each of which is labeled with at least one A pair of nucleic acid primers: a first nucleic acid primer pair is labeled on the first type of primer-labeled support, and a second nucleic acid primer pair different from the first nucleic acid primer pair is labeled on the second type of primer-labeled support.
  • Each kind of support can also selectively include more nucleic acid primer pairs, such as other nucleic acid primer pairs, and the target fragments amplified by multiple pairs of primer pairs on the same support do not overlap on the template; Different primer pairs can amplify different regions of interest, and these regions of interest may or may not overlap.
  • all the supports are mixed in proportions and then mixed with the nucleic acid template and the PCR enzyme reaction system, so as to perform a single-tube unbiased multiplex PCR.
  • the present invention also provides a kit, which includes the multiple nucleic acid co-labeled supports of the present invention.
  • the kit is a 5' single-cell VDJ library that can be applied to 5' single-cell RNA expression profiling, the construction of a microwell array platform, the construction of a 3' single-cell RNA library, the construction of a single-cell transcriptome library, a single-cell VDJ library Kits for multi-omics studies, multiplex PCR and/or construction of multiplex PCR sequencing libraries.
  • the kit also includes one or more of the following compositions:
  • Composition 1 a mixture of a support containing a template switching sequence of a cell tag and a molecular tag and an RNA capture sequence, a microwell chip, a cell lysate, a reverse transcription reagent, a nucleic acid amplification reagent, and a nucleic acid interruption library building module; comprising the The kit of composition 1 can be used for 5' single-cell RNA expression profiling;
  • Composition 2 mixture of template switching sequences containing cell tags and molecular tags and supports for RNA capture sequences, microwell chips, cell lysates, reverse transcription reagents, constant region primers, nucleic acid amplification reagents, and nucleic acid interruption library construction Module; a kit comprising the composition 2 can be used to construct a 5' single-cell VDJ library of a microwell array platform;
  • Composition 3 a mixture of random primers containing cell tags and supports for RNA capture sequences, a microwell chip, a cell lysate, a reverse transcription reagent, a double-stranded synthesis module, and a nucleic acid amplification and extension reagent;
  • the kit can be used to construct 3' single-cell RNA library;
  • Composition 4 a support mixture containing a random primer sequence of a cell tag and an RNA capture sequence, a microwell chip, a cell lysate, a reverse transcription reagent, a two-strand synthesis module, and a nucleic acid amplification and extension reagent; a composition comprising the composition 4
  • the kit can be used to construct single-cell transcriptome library;
  • Composition 5 a capture sequence support mixture containing cell-tagged protein-tagged nucleic acids, a microwell chip, a cell lysate, a reverse transcription reagent, and a nucleic acid interrupt library building module; a kit comprising the composition 5 can be used for single-cell multiplexing. omics research;
  • Composition 6 premixed support mixture of coupled primers, multiplex PCR enzyme and buffer; further optionally, index primers adapted to a high-throughput sequencer; the kit comprising the composition 6 can be used for Multiplex PCR and/or construction of multiplex PCR sequencing library (pre-mixed support mixture of coupled primers and multiplex PCR enzymes and buffers, can achieve single-tube unbiased multiplex PCR; further includes adapting to high-throughput sequencers. Indexing primers to construct multiplex PCR libraries that can be used for sequencing analysis by index PCR).
  • the present invention provides a support for co-labeling of multiple nucleic acids and a method for making and application thereof.
  • the technology of the present invention can capture nucleic acid molecules on the solid surface and perform specific biochemical reactions together with other kinds of nucleic acids modified on the solid surface by carrying out a variety of nucleic acid modification schemes on solid-phase (including semi-solid) supports.
  • Specific types of polynucleic acid modified solid supports can be used in the fields of multiplex PCR library construction, single molecule long fragment nucleic acid sequencing library construction, single cell transcriptome sequencing library construction and single cell multi-omics sequencing library construction.
  • 1A-1C are schematic diagrams of the structures of multiple nucleic acid co-labeled supports of the present invention.
  • FIG. 2A and FIG. 2B are schematic diagrams of the application of multiple nucleic acid co-labeled supports of the present invention to multiplex PCR reactions.
  • FIG. 2C is a schematic diagram of the application of various nucleic acid-labeled supports of the present invention to the construction of multiplex PCR sequencing libraries.
  • FIG. 2D and FIG. 2E are schematic diagrams of the design method and structure of multiple nucleic acid co-labeling supports for multiplex PCR of the present invention.
  • Figure 3A is a schematic structural diagram of a multi-nucleic acid co-labeled support for 5' single-cell RNA expression profiling and the construction of a 5' single-cell VDJ library of a microwell array platform of the present invention.
  • Figure 3B is an experimental flow chart of the multi-nucleic acid labeling support shown in Figure 3A applied to 5' single-cell RNA expression profiling and the construction of a 5' single-cell VDJ library on a microwell array platform.
  • Figure 3C is a schematic diagram of the preparation of the nucleic acid co-labeling support for 5' single-cell RNA expression profiling according to the present invention.
  • Figure 4A is a schematic structural diagram of a multi-nucleic acid co-labeled support applied to a 3' single-cell RNA library of the present invention.
  • Figure 4B is a schematic flowchart of the application of the multiple nucleic acid co-labeled supports of the present invention to a 3' single-cell RNA library.
  • Figure 5A is a schematic structural diagram of a plurality of nucleic acid co-labeling supports for constructing a single-cell transcriptome library of the present invention.
  • FIG. 5B is a schematic flowchart of the use of the multiple nucleic acid co-labeled supports of the present invention to construct a single-cell transcriptome library.
  • FIG. 6A is a schematic structural diagram of a plurality of nucleic acid co-labeling supports for constructing a multi-omics single-cell library of the present invention.
  • FIG. 6B is a schematic flow chart of the multi-omics single-cell library construction using the multiple nucleic acid co-labeled supports of the present invention.
  • FIG. 7A shows the agarose electrophoresis results of the membrane protein nucleic acid tag sequencing library constructed by the procedure in Example 1.
  • FIG. 7A shows the agarose electrophoresis results of the membrane protein nucleic acid tag sequencing library constructed by the procedure in Example 1.
  • Figure 7B shows the analysis results of the 5' single-cell expression profile library fragments constructed by the process in Example 1.
  • FIG. 7C shows the analysis results of the T cell VDJ library fragments constructed by the procedure in Example 1.
  • Figure 7D shows the analysis results of the B cell VDJ library fragments constructed by the procedure in Example 1.
  • Figure 8 shows the distribution of reads at the gene level obtained by sequencing analysis of the 3' single-cell RNA library constructed by the process in Example 2 and the single-cell transcriptome library constructed by the process in Example 3.
  • the BD Phapsody 3' single-cell expression profile library is a library analysis structure constructed entirely with BD Rhapsody.
  • Figure 9 shows the results of fragment size analysis of the multiplex amplification PCR sequencing library constructed using the procedure in Example 4.
  • the present invention first provides structures for multiple nucleic acid co-labeled supports.
  • the support body can be a solid plane (Figure 1A), a solid bead (Figure 1B) or a semi-solid hydrogel (Figure 1C); nucleic acid labels can be located on a solid surface (Figure 1A) and Figure 1B) can also be located in the loose interior of the hydrogel ( Figure 1C).
  • the nucleic acid labeled on a single support at least includes: one or more first nucleic acid labels 101, the function of which at least includes capturing a specific compound in the reaction system to the surface of the support (so the first nucleic acid label 101 is also called a capture nucleic acid label);
  • One or more second nucleic acid labels 102 the functions of which include at least being able to participate in a specified biochemical reaction process of a specific compound captured on the surface of the support (hence the second nucleic acid label 102 is also referred to as a reactive nucleic acid label).
  • other types of nucleic acid labels IN may also be included on the support.
  • a plurality of nucleic acid co-labeled supports of the present invention are compositions comprising a plurality of the above-mentioned supports (supports with the structures shown in FIG. 1A , FIG. 1B and/or FIG. 1C ) .
  • sequences of multiple first nucleic acid markers 101 on the same support may be the same. In a specific application, the sequences of multiple first nucleic acid markers 101 on the same support may be different.
  • sequences of the first nucleic acid markers 101 on different supports may be the same. In certain applications, the sequences of the first nucleic acid markers 101 on different supports may be different.
  • sequences of multiple second nucleic acid labels 102 on the same support may be identical. In certain applications, the sequences of the plurality of second nucleic acid markers 102 on the same support may be different.
  • sequences of the second nucleic acid labels 102 on different supports may be identical. In certain applications, the sequences of the second nucleic acid labels 102 on different supports may be different.
  • sequences of multiple other types of nucleic acid markers IN on the same support may be the same or different for a specific application.
  • sequences of the other kinds of nucleic acid labels on different supports may be the same or different under certain applications.
  • the number of the first nucleic acid label, the second nucleic acid label, and other kinds of nucleic acid labels on the same support can be ⁇ 1 and/or ⁇ 10 13 , respectively.
  • the functions of the first nucleic acid label and the second nucleic acid label can be switched, that is, the same nucleic acid label can have the function of "capturing a specific compound in the reaction system to the surface of the support” described in the present invention, or It can have the function of "participating in the specified biochemical reaction process of the specific compound captured on the surface of the support” described in the present invention.
  • a first nucleic acid label and a second nucleic acid label may be two primers in a primer pair.
  • the present invention also provides a method for preparing multiple nucleic acid co-labeled supports for different purposes.
  • Nucleic acid labeling of the support can be carried out in two ways: “graft to” and “graft from”. In certain applications, nucleic acid labeling of the support can be performed using a "graft to” protocol alone. In certain applications, nucleic acid labeling of the support can be performed using the "graft from” protocol alone. Under specific applications, nucleic acid labeling of supports can be mixed using "graft to" and "graft from” protocols.
  • the support and nucleic acid are modified with functional units that can interact with each other, including but not limited to hydroxyl, aldehyde, epoxy, amino, carboxyl and their activated forms, phosphoric acid, alkynyl, One or more of azide, mercapto, alkene, biotin, avidin, isothiocyanate, isocyanate, acyl azide, sulfonyl chloride, tosyl ester, etc.
  • Different types of nucleic acid labels can be modified with the same functional unit or with different functional units.
  • the modified support and the nucleic acid are brought into contact under specific conditions sufficient to allow the functional units capable of interacting to react and connect to each other.
  • the "graft from” scheme When the "graft from” scheme is adopted, it can be directly synthesized on the support according to the preset nucleotide sequence, or the nucleic acid can be labeled with the scheme of nucleic acid extension or ligation by biochemical reaction. Nucleic acid modifications well known in the art can be added to the labeled nucleic acid, including but not limited to amino, phosphate, alkynyl, azide, sulfhydryl, disulfide, alkene, biotin, azobenzene, methyl, spacer, photocleavage groups One or more of , dI, dU, LNA, XNA, ribonucleic acid bases and dideoxyribonucleic acid bases, and the like.
  • the present invention also provides the use of multiple nucleic acid co-labeled supports for multiplex PCR.
  • multiplex PCR applications all templates and primers are mixed in the same reaction system.
  • the type and total concentration of primer pairs will also increase accordingly, which is easy to form primer dimers and thus Reduce the amplification efficiency of the target region.
  • the use of the multiple nucleic acid co-labeled supports provided by the present invention can well reduce the generation of primer dimers in multiplex PCR, and because mutual interference between PCR primers can be avoided, the continuous sequence of long fragments can be analyzed in a single tube. As shown in FIG.
  • the multiple nucleic acid co-labeled supports provided by the present invention comprise at least two types of supports: one or more first type of primer-labeled supports 1, one or more The second kind of primer-labeled support 2, each support is labeled with at least one pair of nucleic acid primers: as shown in the figure, the first kind of primer-labeled support 1 is labeled with the first nucleic acid primer pair (No.
  • the reverse primer 202R, the two types of supports independently can also selectively include more nucleic acid primer pairs such as other nucleic acid primer pairs (other forward primer 2NF and other reverse primer 2NR).
  • PCR primer software such as Primer Primer optimizes primer sequences to reduce mutual interference between multiple primer pairs on the same magnetic bead.
  • the target fragments amplified by multiple primer pairs on the same support do not overlap on the template.
  • Primer pairs labeled on different supports can Different thus can amplify different target regions, and these target regions can partially overlap or not overlap.
  • Each primer marked on the support at least includes the H region that can be combined and extended with the target region: the first forward primer H region 201FH, The first reverse primer H region 201RH, the second forward primer H region 202FH, the second forward primer H region 202RH, the other forward primer H region 2NFH, the other reverse primer H region 2NRH.
  • Each primer marked on the above also includes at least the universal nucleic acid sequence U region: the first forward primer U region 201FU, the first reverse primer U region 201RU, the second forward primer U region 202FU, the second reverse primer U region 202RU, Other forward primer U region 2NFU, other reverse primer U region 2NRU.
  • the forward primer U region FU or reverse primer U region RU sequence of primers on all supports may be inconsistent.
  • the supports labeled with different kinds of nucleic acid primers are combined with multiplex PCR in a preset ratio.
  • the reaction systems are mixed together, and the preset ratio is determined according to the nucleic acid amplification efficiency on different kinds of supports, which can be as low as the average ratio of different kinds of supports (for example, labeling primers P1/P2/P3 on magnetic beads to form the first type of magnetic particles).
  • the primers P4/P5/P6 are labeled on the magnetic beads to form the second type of magnetic beads, the ratio here refers to 0.01 times the quantity ratio of the first type of magnetic beads to the second type of magnetic beads when the magnetic beads are mixed), Can be as high as 100 times the average ratio of different types of supports.
  • Multiplex PCR reaction system to At least include DNA template, DNA polymerase, dNTP, buffer of appropriate concentration, etc.
  • one of the primers marked on the support for example, the first forward primer 201F or the second forward primer 202F
  • each support can be coupled with less than 5 primer pairs, or each support Up to 10 primer pairs can be coupled to the substrate, or up to 100 primer pairs can be coupled to each support.
  • the present invention also provides the use of multiple nucleic acid co-labeled supports for multiplex PCR sequencing library construction.
  • the supports with nucleic acid sequences 207 and 208 complementary to the complementary strands of the DNA template generated in the reaction shown in FIG. 2B are used as templates for primer sequences compatible with the sequencer: the third forward primer 209F and The third reverse primer 209R performs the PCR reaction of the second step, thereby obtaining a nucleic acid sequencing library that can be used for sequencing.
  • the primer sequences compatible with the sequencer include at least the universal binding sequence in the primers shown in Figure 2A (universal nucleic acid sequence U region, namely other forward primer U region 2NFU/other reverse primer U region 2NRU), sample label 2NFi/2NRi
  • Sequencers used for sequencing include but are not limited to MGIseq sequencing platform, illumina sequencing platform, Ion sequencing platform, PacBio sequencing platform, Nanopore sequencing platform, etc.
  • the present invention also provides a method for making multiple nucleic acid co-labeling supports for multiplex PCR.
  • FIG. 2D when the continuous base sequence of the long fragment 210 needs to be sequenced and analyzed, the amplification of a single primer pair can no longer meet the needs. In this case, multiple pairs of primers need to be designed to amplify and then construct the library for sequencing.
  • a primer pair represented by the H region of the primer in the scheme shown in FIG.
  • the first forward primer H region 201FH and the first reverse primer H region 201RH amplifies the first target fragment 201, and uses the second primer pair ( The second forward primer H region 202FH and the second reverse primer H region 202RH) amplify the second target fragment 202, and use more primer pairs (other forward primer H region 2NFH and other reverse primer H region 2NRH) More target fragments 2N are amplified, and finally the sequencing results are spliced into the sequence of the long fragment 210 .
  • the second forward primer and the first reverse primer together will produce small non-target amplification products, and the primers need to be Divide the multiplex PCR reaction into at least two tubes of parallel amplification, and amplify the primer pairs with no overlap of the target fragment: the first forward primer, the first reverse primer, other forward primers and other reverse primers are one tube, and the second The forward primer and the second reverse primer are in one tube.
  • the multiplex PCR library building method using the multi-nucleic acid labeling support in the present invention can be obtained by labeling the first primer pair that amplifies the target fragment without overlapping and other primer pairs on the same magnetic bead, such as the first magnetic bead.
  • the first type of multiple nucleic acid co-labeled supports, and the second primer pair that overlaps with the amplification target fragment of the primer pair on the first magnetic bead is labeled on another magnetic bead such as the second magnetic bead to obtain the first.
  • the present invention pre-synthesizes a first primer pair 201F, 201R with a 5' specific modification 211, a second primer pair 202F, 202R with a 5' specific modification 211, and more with Additional primer pairs 2NF and 2NR for 5' specific modification of 211 (Fig. 2E).
  • 5' specific modifications include, but are not limited to, hydroxyl, aldehyde, epoxy, amino, carboxyl and their activated forms, phosphoric acid, alkynyl, azide, sulfhydryl, alkene, biotin, avidin, isothiocyanate, Isocyanates, acyl azides, sulfonyl chlorides, tosyl esters, etc.
  • the corresponding selected supports include but are not limited to epoxy, amino, carboxyl, alkynyl, azide, alkene, heavy metal, azide, affinity and other functional groups 212.
  • the supports with functional groups 212 are contacted and coupled with nucleic acid primers with 5' specific modifications 211 under appropriate conditions, in particular the primers capable of producing non-specific products are coupled separately on different supports, e.g.
  • a first primer pair 201F, 201R with a 5' specific modification 211, other primer pairs 2NF, 2NR with a 5' specific modification 211 are coupled to the first microbeads to form a first product 213, and a first product 213 will have a 5' specific modification 211
  • the second primer pair 202F and 202R is coupled to the second microbead to form the second product 214 (Fig. 2E), and finally the first product 213 and the second product 214 are mixed together in proportion to form the final product with multiple nucleic acid labels.
  • the supports are used for multiplex PCR library construction.
  • the present invention also provides the use of multiple nucleic acid co-labeled supports for 5' single-cell RNA expression profiling. It is well known that complex living organisms are composed of many cells with specific properties, and the types and quantities of RNAs transcribed and expressed by each cell in a specific state are different, so it is of great significance to detect RNA transcription at the single-cell level. Current technologies for detecting single-cell transcriptomes can be divided into low- and medium-throughput and high-throughput single-cell transcriptome sequencing technologies according to throughput.
  • the medium and low-throughput single-cell transcriptome sequencing is represented by smart-seq, and the single-cell transcriptome is constructed by reverse transcription and amplification of the RNA obtained by direct lysis of a single cell; the library preparation of high-throughput single-cell transcriptome sequencing is based on oil-in-oil Represented by water microfluidics and microwell array platforms, mRNA molecules derived from different cells are reverse transcribed into corresponding mRNA molecules through oligo dT primers or template switch oligo (TSO) containing cell tags and molecular tags. Uniquely tagged cDNA molecules and further sequencing can simultaneously analyze the mRNA expression of thousands of single cells.
  • the water-in-oil technology platform encapsulates a single cell and a single microbead containing a cell tag in a single droplet for lysis and reverse transcription in one step. According to the sequencing read near the 3' end or 5' end of the RNA, it can be divided into 3' single cells RNA expression profiling library and 5' single cell RNA expression profiling library.
  • the microwell array platform is usually an array chip consisting of microwells with a diameter of 20-60 ⁇ M.
  • RNA reverse transcripts derived from the same cell are labeled with the same and unique cellular label by reverse transcription extension.
  • the efficiency of the current microwell array-based single-cell sequencing library preparation platform largely depends on the RNA capture efficiency in the microwells containing oligo dT microbeads, and unlike the water-in-oil platform, the microwell array platform can only construct 3 'Single-cell RNA expression profiling library.
  • the invention can improve the RNA capture efficiency in the micropore by carrying out various nucleic acid labels on the microbeads in the preparation of the single-cell sequencing library, and can realize the preparation of the 5' single-cell RNA expression profile library.
  • the present invention provides the use of a variety of nucleic acid co-labeled supports for 5' single-cell RNA expression profile analysis, and the construction of a 5'-single-cell VDJ library of a microwell array platform.
  • the support here is microbeads (solid microbeads or semi-solid hydrogel microbeads), and at least two nucleic acid sequences are labeled on the support: a first nucleic acid sequence 301 and a second nucleic acid sequence 304 .
  • the first nucleic acid sequence 301 contains at least a capture sequence 303, which is used to capture the target nucleic acid molecule and serve as a primer for extension or reverse transcription, such as a base sequence oligo dT with a length of 15-40, which can be adjusted by adjusting the first nucleic acid sequence 301 on the support. on the quantity and density to control the efficiency of RNA capture.
  • the first nucleic acid sequence 301 also includes a first universal sequence nucleic acid 302 and a conditionally cleavable site X under certain uses.
  • Conditionally cleavable sites include, but are not limited to, one or more of disulfide modifications, dU modifications, RNA base modifications, dI modifications, DSpacer modifications, AP site modifications, photocleavable PC linkers, and restriction endonuclease recognition sequences. variety.
  • the second nucleic acid sequence 304 is composed of one or more of the second universal nucleic acid sequence 305 , the cell tag sequence 306 , the molecular tag sequence 307 and the template switching sequence 308 .
  • the second universal nucleic acid sequence 305 may include an adapter nucleic acid sequence that matches the sequencer, such as Read1 Sequencing Primer or Read2 Sequencing Primer in the illumina sequencer.
  • the cell tag sequence 306 is used to tag molecules derived from all mRNAs in the same cell, with the same cell tag on each support and different cell tags on different kinds of supports.
  • the cell tag sequence 306 can be a random or semi-random nucleic acid sequence, such as a 12bp degenerate base NNNNNNNNNN, or a combination of multiple fixed nucleic acid sequences, such as 96 8-base sequences and 96 8-base sequences and A random combination of 96 8-base sequences, which may or may not include connecting nucleic acid regions between the 8-base sequences.
  • the molecular tag sequence 307 is used to label each reverse transcribed cDNA molecule, and the cDNA molecules reverse transcribed from different RNAs on the same support are marked with different molecular tags.
  • the molecular tag 307 can be a random or semi-random nucleic acid sequence of 8-20 bases in length, such as 9 random degenerate bases NNNNNNN or NNNNNNNV.
  • the template switching sequence 308 can be used as a template to extend the 3' end of the cDNA reverse transcribed from the first nucleic acid sequence 301 to label the molecular tag sequence 307, the cell tag sequence 306 and the second universal nucleic acid sequence 305.
  • Template switching sequence 308 includes two or more RNA bases rG or other modified base G analogs, such as LNA or XNA, at least at the 3' end.
  • FIG 3B shows the experimental flow chart of the multi-nucleic acid-labeled support shown in Figure 3A in the construction of the 5' single-cell library of the microwell array platform.
  • each support is coupled with two kinds of Nucleic acid markers: the first nucleic acid sequence 301 and the second nucleic acid sequence 304 .
  • the RNA 309 containing the complementary sequence of the second nucleic acid sequence 304 is captured by the first nucleic acid sequence 301 on the support and reversed by reverse
  • the transcription reaction system forms a cDNA molecule 310, in which the cDNA is extended to the 5' end of the RNA 309 by the reverse transcriptase with terminal nucleotransferase function adding a continuous base C to the cDNA strand, and then the cDNA strand will be with the same support surface.
  • the adjacent second nucleic acid sequence 304 containing more than two bases rG or its base analogs is complementary and extended to the second universal nucleic acid sequence 305 to form a complete cDNA molecule 310 with cell tags and molecular tags.
  • the cDNA molecule 310 can be detached from the support through the cleavable site X as a template for the next step of amplification, or can be extended through the single primer extension of the second universal nucleic acid sequence 305, which is complementary to the cDNA molecule 310.
  • the strand is used as the template for the next amplification, or the support containing the cDNA molecule 310 after removing the first nucleic acid sequence 301 and the second nucleic acid sequence 304 not involved in the reverse transcription reaction on the support by enzymatic treatment is used as the template for the next amplification.
  • the fragmented or unfragmented cDNA molecule 310 as a template is PCR amplified with a primer pair containing the first universal sequence 302 and the second universal sequence 305 to form a double-stranded nucleic acid 311 product.
  • double-stranded nucleic acid 311 can analyze the type and abundance of single-cell RNA expression through two library construction methods, one of which is to analyze the expression of all RNA molecules with polyA tails without bias.
  • the library scheme is to randomly break the double-stranded nucleic acid 311, end-repair, and add a base A at the 3' end to form a molecular structure 312, which is then connected to a linker 313 containing a protruding T, and is ligated by the first primer 315 containing the first sample Index317 Amplify with the second primer 319 containing the second sample Index 321 to form the first final library 323 .
  • the first primer 315 comprises a first nucleic acid sequence 316 compatible with the sequencer, a first sample index 317 and a sequence 318 complementary to the long-chain partial sequence in the adapter 313 .
  • the second primer 319 includes a second nucleic acid sequence 320 compatible with the sequencer, a second sample index 321 and a sequence 322 that is identical to a partial sequence of the second universal nucleic acid sequence 305 .
  • This library construction method can also be replaced by other random library construction schemes that can achieve the same purpose, including but not limited to the library construction scheme of transposase interrupted library construction or random primer extension.
  • the purpose of another library construction scheme is to analyze the expression of the target gene in a targeted manner, which can be realized by two-step multiplex PCR, such as using the first gene-specific primer 324, the second gene-specific primer 326 and the universal primer 305 respectively.
  • the primer pairs form the first-step multiplex PCR product 325 and the second-step multiplex PCR product 327, and finally use the second-step multiplex PCR product 327 as a template to pass through the first primer 315 containing the first sample Index317 and the second sample Index321
  • the second primer 319 amplifies to form a second final library 328.
  • the library constructed by targeted multiplex PCR can be used for the analysis of the immune repertoire.
  • the multiplex PCR product 327 in the second step can also be used to construct a full-length VDJ immune repertoire library according to the first random interrupt library construction scheme. Analysis of T cell receptor and antibody VDJ sequences. Both the first final library 323 and the second final library 328 are further used for sequencing and information analysis.
  • the present invention also provides a method for making a nucleic acid co-labeling support for 5' single-cell RNA expression profiling analysis.
  • the cell tag sequence 306 consists of a first cell tag 329, a first attachment region 330, a second cell tag 331, a second attachment region 332, a third attachment region 333, and a third cell tag 334 that are sequentially linked. It consists of 6 areas.
  • the present invention pre-synthesizes the first nucleic acid sequence 301 and the third nucleic acid sequence 335 with a 5' specific modification 340.
  • the 5' specific modification 340 includes But not limited to hydroxyl, aldehyde, epoxy, amino, carboxyl and their activated forms, phosphoric acid, alkynyl, azide, sulfhydryl, alkene, biotin, avidin, isothiocyanate, isocyanate, acyl azide , sulfonyl chloride, tosyl ester, etc.
  • the corresponding selected supports include but are not limited to functional groups such as epoxy, amino, carboxyl, alkynyl, azide, alkene, heavy metal, azide, avidin, etc. Mission 339.
  • the support with functional group 339 is contacted and coupled (step 341) with the first nucleic acid sequence 301 and the third nucleic acid sequence 335 with the 5' modification under appropriate conditions to form a first product 342, the first nucleic acid sequence
  • the ratio of 301 to the third nucleic acid sequence 355 on the first product 342 can be adjusted by adding different concentrations; then the first product 342 and the fourth nucleic acid sequence 336 undergo hybridization in an environment containing an appropriate salt ion concentration, dNTP and a polymerase buffer
  • the second product 344 is obtained by hybridization and extension, and the fourth nucleic acid sequence 336 nucleic acid molecule sequentially contains the complementary sequence 332' of the second connecting region 332, the complementary sequence 331' of the second cell tag 331 and the first connecting region 330 from 5'.
  • the complementary sequence 330' of the third nucleic acid sequence 335 on the support hybridizes with the complementary sequence 330' on the fourth nucleic acid sequence 336 through its own connecting region 330 sequence and extends to the sequence
  • the second cell tag 331 and the second connecting region 332 The second product 344 is connected with the fifth nucleic acid sequence 346 under the action of DNA ligase after the complementary strand is removed under denaturing conditions to generate the third product 347;
  • the fifth nucleic acid sequence 346 is composed of the first nucleic acid molecule 337 and
  • the second nucleic acid molecule 338 is the double-stranded DNA obtained by annealing and hybridization in advance, the first nucleic acid molecule 337 contains the third connecting region 333, the third cell tag 334, the molecular tag sequence 307 and the template switching sequence 308 connected in sequence, and the second nucleic acid molecule 338 It contains at least the complementary sequence of the partial sequence of the second connecting region 332 and the third connecting region 333, and also
  • the third connecting region 333 and the second connecting region 332 on the second product 344 are hybridized together and connected to each other by a ligase; in particular, the 5' end of the first nucleic acid molecule 337 sometimes also contains a phosphoric acid modification; finally, under denaturing conditions
  • the third product 347 undergoes an elution step 348 to wash off the complementary nucleic acid sequence, that is, the second nucleic acid molecule 338 to form a support 349 that is finally marked with the first nucleic acid sequence 301 and the second nucleic acid sequence 304, which has the first nucleic acid sequence 301 and the second nucleic acid sequence 304.
  • the supports 349 labeled with the two nucleic acid sequences 304 can be used directly in the library building procedure for 5' single-cell RNA expression profiling.
  • the present invention also provides the use of a plurality of nucleic acid co-labeled supports applied to a 3' single-cell RNA library, wherein the cell label and the reverse transcription primer oligo dT are respectively located at both ends of the cDNA molecule.
  • Figure 4A shows at least two nucleic acid-labeled supports, where the supports are beads (solid beads or semi-solid hydrogel beads), and at least two nucleic acid sequences are labeled on the supports: the first Nucleic acid sequence 401 and second nucleic acid sequence 404.
  • the first nucleic acid sequence 401 contains at least a capture sequence 403, which is used to capture the target nucleic acid molecule and serve as a primer for extension or reverse transcription, such as a base sequence oligo dT with a length of 15-40, which can be adjusted by adjusting the first nucleic acid sequence 401 on the support.
  • the number and density of the above control the efficiency of capturing RNA; the first nucleic acid sequence 401 also includes the first universal sequence nucleic acid 402 under certain uses.
  • the second nucleic acid sequence 404 is composed of one or more of the second universal nucleic acid sequence 405 , the cell tag sequence 406 , the primer sequence 407 and the reversible blocking site 408 .
  • the second universal nucleic acid sequence 405 may include an adaptor nucleic acid sequence that matches the sequencer, such as Read1 Sequencing Primer or Read2 Sequencing Primer in the illumina sequencer, and the optional second universal nucleic acid sequence 405 contains a conditional breaking site X .
  • Conditionally cleavable sites include, but are not limited to, disulfide modifications, dU modifications, RNA base modifications, dI modifications, DSpacer modifications, AP site modifications, photocleavable PC linkers, and restriction endonuclease recognition sequences.
  • the cell tag sequence 406 is used to tag molecules derived from all mRNAs in the same cell, with the same cell tag on each support and different cell tags on different kinds of supports.
  • the cell tag sequence 406 can be a random or semi-random nucleic acid sequence, such as a 12bp degenerate base NNNNNNNNNN, or a combination of multiple fixed nucleic acid sequences, such as 96 8-base sequences and 96 8-base sequences and A random combination of 96 8-base sequences, which may or may not include connecting nucleic acid regions between the 8-base sequences.
  • the primer sequence 407 can be used as a primer to extend the cDNA molecule that is complementary to it, and can be combined and extended with the cDNA product obtained by reverse transcription of the capture sequence 403; the primer sequence 407 can be a random or semi-random nucleic acid sequence with a length of 5-15 bases, For example, 6 random degenerate bases NNNNNN, and gene-specific sequences can also be used to enrich targeted regions.
  • the function of the reversible blocking site 408 is to prevent the non-specific extension of the primer sequence 407 as a primer when the first nucleic acid sequence 401 captures and extends the target nucleic acid, and in a specific situation, the blocking effect is released to allow the primer sequence 407 to act as a primer. extend.
  • the reversible blocking site 408 can be simple 3' phosphate modification, ddNTP modification or C3 spacer modification, or a combination of cleavable modification and extension blocking modification, and the cleavable modification can be DSpacer modification/RNA base Modification/dU modification, etc., extension blocking modification including but not limited to LNA/XNA/3' phosphate/inverted dT/ddNTP/C3 spacer/C6 spacer/various fluorescent dyes and quenching modifications, such as reversible blocking sites 408 can be (rN)NNNN-C3 or (rN)N-C3-C3-ddN, rN represents any ribonucleotide degenerate base, N represents any deoxyribonucleotide degenerate base, C3 is an extension blocking modified C3 spacer, and ddN is a dideoxyribonucleoside; after this sequence forms a double strand with the target DNA, it can be recognized and excised by R
  • Figure 4B shows the flow chart of the experiment of constructing a 3' single-cell RNA library with the double nucleic acid labeling support shown in Figure 4A.
  • a single support labeled with the first nucleic acid sequence 401 and the second nucleic acid sequence 404 is contacted with RNA derived from a single cell, the RNA 409 containing the complementary sequence of the second nucleic acid sequence 404 is captured by the first nucleic acid sequence 401 on the support and reversed by reverse
  • the transcription reaction system forms a cDNA molecule 410, and then the cDNA molecule 410 is denatured by high temperature and then melted with the RNA to be complementary to the region of the primer sequence 407 on the second nucleic acid sequence 404 near the surface of the same support;
  • the cDNA molecule 410 can be combined with More than one second nucleic acid sequence 404 on the surface of the support is complementary; further, the primer sequence 407 that is complementary to the cDNA molecule 410 can be recognized by a related
  • the two nucleic acid molecules 413 are used as templates for further amplification to form a double-stranded nucleic acid product 414, wherein the forward and reverse amplification primers respectively contain all or part of the nucleic acid sequences of the first universal sequence nucleic acid 402 and the second universal sequence nucleic acid 405. Further, the double-stranded nucleic acid product 414 can be used to analyze the type and abundance of single-cell RNA expression through two library construction methods, one of which aims to unbiased analysis of the expression of all RNA molecules with polyA tails.
  • the library construction scheme is to amplify the double-stranded nucleic acid product 414 and amplify it by the first primer 415 containing the first sample Index417 and the second primer 419 containing the second sample Index421 to form the first final library 423, wherein the first primer 415 includes The first nucleic acid sequence 416, the first sample index 417 and the first primer hybridization region 418 compatible with the sequencer, the second primer 419 includes the second nucleic acid sequence 420, the second sample index 421 and the second primer hybridization region compatible with the sequencer 422.
  • This library construction method can also be replaced by other random library construction schemes that can achieve the same purpose, including but not limited to the library construction scheme of transposase interrupted library construction or random primer extension.
  • the purpose of another library construction scheme is to analyze the expression of the target gene in a targeted manner, which can be achieved by two-step multiplex PCR, such as using the first gene-specific primer 424, the second gene-specific primer 426 and the universal primer 305 respectively.
  • the primer pair is the product of two-step multiplex PCR: the product 425 of the first-step multiplex PCR and the product 427 of the second-step multiplex PCR, and finally the product 427 of the second-step multiplex PCR is used as a template to pass with the product containing the first sample Index417.
  • the first primer 415 and the second primer 419 containing the second sample Index 421 are amplified to form a second final library 428 .
  • Libraries constructed by targeted multiplex PCR can be used for immune repertoire analysis, especially for full-length T cell receptor and antibody VDJ sequences. Both the first final library 423 and the second final library 428 are further used for sequencing and information analysis.
  • the present invention further provides the use of a multi-nucleic acid co-labeled support for constructing a single-cell transcriptome library.
  • the cellular label can label any position of the RNA chain to form a cDNA molecule with cellular and molecular labels.
  • the supports here are microbeads or hydrogel beads, and there are at least two nucleic acid sequences on a single support, such as the first nucleic acid sequence 501 and the second nucleic acid sequence.
  • the first nucleic acid sequence 501 at least contains a capture sequence 503 for capturing target nucleic acid molecules, such as a base sequence oligo dT with a length of 15-40, and the capture can be controlled by adjusting the number and density of the first nucleic acid sequence 501 on the support RNA efficiency.
  • the first nucleic acid sequence 501 also includes a polymerase extension blocking site 504, which can prevent the capture sequence 503 from extending the capture nucleic acid molecule as a primer, including but not limited to LNA/XNA/3' phosphate/inverted dT/ddNTP/C3 spacer/C6 spacer/various fluorescent dyes and quenching modifications, etc.
  • the first nucleic acid sequence 501 also includes the first universal nucleic acid sequence 502 in a specific application, and the capture efficiency of the first universal nucleic acid sequence 502 can be adjusted 503 by adjusting the sequence and length of the first universal nucleic acid sequence 502 .
  • the second nucleic acid sequence 505 is composed of one or more of a second universal nucleic acid sequence 506, a cell tag sequence 507, and a primer sequence 508, wherein the second universal nucleic acid sequence 506 may include a linker nucleic acid sequence matching a sequencer, such as illumina sequencing Read1 Sequencing Primer or Read2 Sequencing Primer in the instrument; cell tag sequence 507 is used to tag molecules derived from all mRNAs in the same cell, each support has the same cell tag and different types of supports have different cell tags .
  • a linker nucleic acid sequence matching a sequencer such as illumina sequencing Read1 Sequencing Primer or Read2 Sequencing Primer in the instrument
  • cell tag sequence 507 is used to tag molecules derived from all mRNAs in the same cell, each support has the same cell tag and different types of supports have different cell tags .
  • the cell tag sequence 507 can be a random or semi-random nucleic acid sequence, such as a 12bp degenerate base NNNNNNNNNN, or a combination of multiple fixed nucleic acid sequences, such as 96 kinds of 8-base sequences and 96 kinds of 8-base sequences and A random combination of 96 8-base sequences, which may or may not include connecting nucleic acid regions between the 8-base sequences.
  • the primer sequence 508 can be combined with the RNA template as a primer extension and extended into a cDNA molecule, and can be combined with the RNA captured by the capture sequence 503 and extended.
  • the primer sequence 508 can be a random or semi-random nucleic acid sequence with a length of 5-15 bases, For example, 6 random degenerate bases NNNN, and gene-specific sequences can also be used to enrich targeted regions.
  • the third nucleic acid sequence 509 is composed of one or more of the second universal nucleic acid sequence 506, the cell tag sequence 507, the molecular tag sequence 510 and the capture sequence 503, wherein the second universal nucleic acid sequence 506 may include an adaptor nucleic acid matching the sequencer Sequence, such as Read1 Sequencing Primer or Read2 Sequencing Primer in the illumina sequencer; the cell tag sequence 507 is used to label molecules derived from all mRNAs in the same cell, and each support has the same cell tag and different types of supports.
  • the second universal nucleic acid sequence 506 may include an adaptor nucleic acid matching the sequencer Sequence, such as Read1 Sequencing Primer or Read2 Sequencing Primer in the illumina sequencer
  • the cell tag sequence 507 is used to label molecules derived from all mRNAs in the same cell, and each support has the same cell tag and different types of supports.
  • the cell tag sequence 507 can be a random or semi-random nucleic acid sequence, such as a 12bp degenerate base NNNNNNNNNN, or a combination of multiple fixed nucleic acid sequences, such as 96 8-base sequences and 96 A random combination of 8-base sequences and 96 kinds of 8-base sequences, the 8-base sequences may or may not include connecting nucleic acid regions; the molecular tag sequence 510 is used to label each reverse transcribed cDNA molecule, which is obtained from the same cDNA molecule. The cDNA molecules reversely transcribed from different RNAs on the support are marked with different molecular tags.
  • the molecular tag 510 can be a random or semi-random nucleic acid sequence with a length of 5-20 bases, such as 9 random degenerate bases.
  • Base NNNNNNNNN or NNNNNNNV; capture sequence 503 is used to capture target nucleic acid molecules, such as base sequence oligo dT with a length of 15-40.
  • Figure 5B shows the experimental flow chart of the construction of single-cell transcriptome library using two types of dual nucleic acid labeling supports.
  • the single nucleic acid-labeled support contacts the RNA derived from a single cell
  • the RNA 512 containing the complementary sequence to the capture sequence 503 is captured by the first nucleic acid sequence 501 or the third nucleic acid sequence 509 on the support
  • the RNA 512 captured on the surface of the support is Under suitable conditions, it combines with the primer sequence 508 of the second nucleic acid sequence 505 and forms a cDNA molecule 514 through a reverse transcription reaction system
  • 509 is the cDNA formed by the primer
  • the reverse transcribed support can be further digested to remove the molecules of the first nucleic acid sequence 501 , the third nucleic acid sequence 509 and the second nucleic acid sequence 505 that do not participate in the reaction.
  • the supports containing the cDNA molecules 514 can then be analyzed for the type and abundance of single-cell RNA expression by two methods of library construction.
  • One of the library construction methods aims to unbiased analysis of the expression of all RNA molecules with polyA tails, and this library construction scheme uses a random primer 517 extension amplification scheme.
  • the random primer 517 is composed of a universal primer sequence 515 and a random base sequence 516:
  • the universal primer sequence 515 can include an adaptor nucleic acid sequence matching the sequencer, such as Read2 Sequencing Primer or Read1 Sequencing Primer in the illumina sequencer;
  • the random base sequence 516 It can be a random or semi-random nucleic acid sequence with a length of 5-15 bases, such as 9 consecutive degenerate bases NNNNNNNNN.
  • the random primer 517 hybridizes to the cDNA molecule 514 in an appropriate environment and produces a complementary strand 518 of the cDNA molecule 514 under the action of a DNA polymerase with strand replacement activity;
  • the primer pair of the nucleic acid sequence 506 and the universal primer sequence 515 is amplified to generate a double-stranded product 519;
  • the double-stranded product 519 is amplified by the first primer 520 containing the first sample Index522 and the second primer 524 containing the second sample Index526 to form the first primer.
  • a final library 528 wherein the first primer 520 comprises a sequencer-compatible first nucleic acid sequence 521, a first sample index 522 and a first primer hybridization region 523, and the second primer 524 comprises a sequencer-compatible second nucleic acid sequence 525, the second sample index526 and the partial sequence of the hybridization region 527 with the second primer;
  • this library construction method can also be replaced with other random library construction schemes that can achieve the same purpose, including but not limited to ultrasonic interruption, enzyme interruption Or library construction schemes such as transposase interruption.
  • the purpose of another library construction scheme is to analyze the expression of the target gene in a targeted manner, which can be achieved by two-step multiplex PCR, such as using the first gene-specific primer 529, the second gene-specific primer 531 and the universal primer 506 respectively.
  • the primer pair is the product of two-step multiplex PCR: the product 530 of the first-step multiplex PCR and the product 532 of the second-step multiplex PCR, and finally the product 532 of the second-step multiplex PCR is used as the template to pass the first primer 520 and the second primer 520 524 is amplified to form a second final library 533.
  • Libraries constructed by targeted multiplex PCR can be used for immune repertoire analysis, especially for full-length T cell receptor and antibody VDJ sequences. Both the first final library 528 and the second final library 533 were further used for sequencing and information analysis.
  • the present invention also provides the use of multiple nucleic acid co-labeled supports for single-cell multi-omics research.
  • DNA carrying genetic information transmits the information to RNA through transcription and is translated into protein, which finally performs the main biological function.
  • RNA and protein due to the complexity of the physiological system, the expression levels of RNA and protein are not consistent, and RNA cannot directly reflect the post-translational modification and interaction of proteins. Therefore, it is very important to study the expression of RNA and protein in the same cell at the same time.
  • the present invention discloses a nucleic acid labeling support structure capable of simultaneously analyzing RNA expression level and sequence and protein expression and interaction.
  • the support here is microbeads (including solid microbeads or semi-solid hydrogel microbeads), and at least three nucleic acid sequences are labeled on the support: a first nucleic acid sequence 601, a second nucleic acid sequence 604 and the third nucleic acid sequence 609.
  • the first nucleic acid sequence 601 contains at least a capture sequence 603 for capturing the target nucleic acid molecule and extending as a primer, such as a base sequence oligo dT with a length of 15-40, which can be adjusted by adjusting the number and amount of the first nucleic acid sequence 601 on the support.
  • a capture sequence 603 for capturing the target nucleic acid molecule and extending as a primer, such as a base sequence oligo dT with a length of 15-40, which can be adjusted by adjusting the number and amount of the first nucleic acid sequence 601 on the support.
  • the first nucleic acid sequence 601 also includes the first universal nucleic acid sequence 602 and a conditional cleavage site X, and the conditionally cleavable site includes but is not limited to disulfide modification, dU modification , RNA base modification, dI modification, DSpacer modification, AP site modification, photocleavage PC linker and restriction endonuclease recognition sequence.
  • the second nucleic acid sequence 604 is composed of one or more of the second universal nucleic acid sequence 605, the cell tag sequence 606, the molecular tag sequence 607 and the template switching sequence 608, wherein the second universal nucleic acid sequence 605 may include a sequencer matching sequence Adapter nucleic acid sequence, such as Read1 Sequencing Primer or Read2 Sequencing Primer in illumina sequencer; cell tag sequence 606 is used to label molecules derived from all mRNAs in the same cell, each support has the same cell tag and different types of support There are different cell tags on the object, and the cell tag sequence 606 can be a random or semi-random nucleic acid sequence, such as 12bp degenerate base NNNNNNNN, or it can be a combination of multiple fixed nucleic acid sequences, such as 96 kinds of 8-base sequences With the random combination of 96 kinds of 8-base sequences and 96 kinds of 8-base sequences, the connecting nucleic acid region may or may not be included between the 8-
  • the molecular tag 607 can be a random or semi-random nucleic acid sequence with a length of 8-20 bases, such as 9 random and base NNNNNNNNN or NNNNNNNV; the template switching sequence 608 can be used as a template to extend the 3' end of the cDNA reverse transcribed from the first nucleic acid sequence 601 to label the molecular tag sequence 607, the cell tag sequence 606 and the second universal nucleic acid sequence 605 , the template switching sequence 608 includes two or more RNA bases rG or other modified base G analogs, such as LNA or XNA, at least at the 3' end.
  • the third nucleic acid sequence 609 is composed of one or more of the third universal nucleic acid sequence 610, the cell tag sequence 606, the molecular tag sequence 607 and the protein nucleic acid tag capture sequence 611, wherein the cell tag sequence 606 and the molecular tag sequence 607 are the same as the first
  • the structures of the two nucleic acid sequences 604 are consistent;
  • the third universal nucleic acid sequence 610 is inconsistent with the second universal nucleic acid sequence 605 and contains an adaptor nucleic acid sequence that matches the sequencer, such as Read1 Sequencing Primer or Read2 Sequencing Primer in the illumina sequencer;
  • protein The nucleic acid tag capture sequence 611 is used to capture and extend the protein nucleic acid tag in the same spatial structure as the single cell to be tested.
  • the same spatial structure means that the protein nucleic acid tag can be located inside the cell, on the surface of the cell, or in the cell where the cell is located. chamber or droplet.
  • FIG. 6B shows the experimental flow chart of the construction of a multi-omics single-cell library with the three nucleic acid labeling supports shown in FIG. 6A .
  • the cells to be tested are contacted and bound with a nucleic acid-labeled antibody molecule 612 that recognizes a specific protein in advance, and the non-specifically bound nucleic acid-conjugated antibody is washed away.
  • the structure of the antibody molecule 612 includes complementary binding to the protein nucleic acid tag capture sequence 611
  • the sequence 613, the protein-specific sequence 614, the fourth universal primer sequence 615, and the molecule 616, the fourth universal primer sequence 615 is inconsistent with the second universal nucleic acid sequence 605, the third universal nucleic acid sequence 610 and contains a matching adapter with the sequencer Nucleic acid sequences, such as Read2 Sequencing Primer or Read1 Sequencing Primer in the illumina sequencer
  • molecule 616 refers to specific antibodies in this process, and can also be small molecular compounds, carbohydrates, peptides and other substances that bind to the target detection protein.
  • RNA 617 and nucleic acid-conjugated antibody molecules 612 are separated by the first nucleic acid sequence 601 and the third nucleic acid sequence 609 on the support, respectively Capture and form a cDNA molecule 618 or a DNA molecule 619 through a reverse transcription reaction system, wherein the cDNA is extended to the 5' end of RNA617 by a reverse transcriptase with terminal nucleotransferase function to add a continuous base C to the cDNA strand, and then the The cDNA strand will be complementary to the template switching sequence 608 containing two or more bases rG or its base analogs near the surface of the same support and continue to extend to the second universal nucleic acid sequence 605 region to form a complete cell tag and molecular tag.
  • the cDNA molecule 618 and the DNA molecule 619 can be detached from the support through the cleavable site X as a template for the next step of amplification, or can pass the second universal nucleic acid sequence 605 and the fourth universal nucleic acid sequence 605.
  • the extended chain complementary to the cDNA molecule 618 or the DNA molecule 619 formed after the single primer extension of the nucleic acid sequence 615 is used as a template for the next amplification, or the first nucleic acid sequence 601, the first nucleic acid sequence 601, the first nucleic acid sequence 601, the first nucleic acid sequence 601, the first nucleic acid sequence 601 on the support that does not participate in the reverse transcription reaction are removed by enzymatic treatment.
  • the support containing the cDNA molecules 618 and 619 is used as the template for the next step of amplification; in the subsequent step, the mixture of the fragmented or unfragmented cDNA molecule 618 and the DNA molecule 619 is used as the template
  • a first double-stranded nucleic acid product 621 and a second double-stranded nucleic acid product are formed by PCR amplification with the first universal nucleic acid sequence 602/second universal nucleic acid sequence 605 and the third universal nucleic acid sequence 610/fourth universal nucleic acid sequence 615 double primer pair 620 mixture.
  • the formed double-stranded nucleic acid product can be used for single-cell multi-omics analysis through three library construction methods.
  • the purpose of the first library construction scheme is to construct a nucleic acid library that can analyze the abundance of the protein to be detected, and the first library 630 can be obtained directly by PCR amplification of the first index primer 622 and the second index primer 626; wherein the first index primer 622 includes The sequencer-compatible first nucleic acid sequence 623, the first sample index 624 and the nucleic acid sequence 625 complementary to the fourth universal nucleic acid sequence 615 are sequentially linked, and the second index primer 626 includes the sequencer-compatible first sequencer The second nucleic acid sequence 627, the second sample index 628 and the nucleic acid sequence 629 complementary to the third universal nucleic acid sequence 610 are composed.
  • the purpose of the second library construction scheme is to analyze the expression of all RNA molecules with polyA tails without bias.
  • This library construction scheme is to randomly interrupt the mixture of the first double-stranded nucleic acid product and the second double-stranded nucleic acid product. End repair and add base A at the 3' end to form a molecular structure 631, which is then ligated with a linker 632 containing an overhang T and amplified by the first primer 634 containing the first sample Index624 and the second primer 636 containing the second sample index628 Augmentation forms a second final library 638, wherein the first primer 634 comprises a sequencer-compatible first nucleic acid sequence 623, a first sample index 624, and a nucleic acid sequence 635 complementary to the first universal nucleic acid sequence 602, connected in sequence, and the second The primer 636 includes a sequencer-compatible second nucleic acid sequence 627, a second sample index 628 and a nucleic acid sequence 637 complementary to the second universal nucle
  • Random library construction schemes including but not limited to transposase interrupt library construction or random primer extension library construction schemes.
  • the purpose of the third library construction scheme is to analyze the expression of the target gene in a targeted manner, which can be achieved by two-step multiplex PCR, such as using the first gene-specific primer 639 and the second gene-specific primer 641 and the second universal primer respectively.
  • the primer pair composed of 605 forms a multiplex PCR product: the first step multiplex PCR product 640 and the second step multiplex PCR product 642, and finally the second step multiplex PCR product 642 is used as a template to pass through with the first primer 634 containing the first sample Index624.
  • a third final library 643 is formed by amplifying with the second primer 636 containing the second sample Index 628 .
  • the library constructed by targeted multiplex PCR can be used for the analysis of the immune repertoire. Furthermore, the multiplex PCR product 642 in the second step can also be used to construct a full-length VDJ immune repertoire library according to the first random interrupt library construction scheme. Analysis of T cell receptor and antibody VDJ sequences.
  • the library 630, the second final library 638, and the third final library 643 are all further used for sequencing and information analysis.
  • Embodiment 1 the support of multiple nucleic acid co-labeling is applied to 5' single-cell RNA expression profile library construction and VDJ library construction and multi-omics library construction
  • nucleic acid co-labeled supports were prepared according to the following operation steps and applied to construct a 5' single-cell RNA expression profile library, a VDJ library and a multi-omics library.
  • reagent 200 ⁇ L system Superscript II first-strand buffer(5 ⁇ ) 40 DTT (100mM) 10 Betaine(5M) 40 MgCl2 (1M) 1.2 dNTP 10mM 20 RNAse inhibitor 5 SuperScript II reverse transcriptase 10 RNase-free water 73.8
  • Example 2 Multiple nucleic acid co-labeled supports applied to 3' single-cell RNA library construction
  • nucleic acid co-labeled supports were prepared according to the following operation steps and applied to construct a 3' single-cell RNA library.
  • Random primer NrNx double-strand extension configure the following hybridization reaction system and suspend the magnetic beads obtained in 3.2.5.
  • Example 3 Multiple nucleic acid co-labeled supports for single-cell transcriptome library construction
  • nucleic acid co-labeled supports were prepared according to the following operation steps and applied to construct a single-cell transcriptome library.
  • nnnnnnn is an 8bp Cell barcode sequence, with a total of 384 types.
  • step temperature time Rotating speed 1 normal temperature 30min normal rotation 2 37°C 30min 1200rpm
  • the nucleic acid sequences measured by this method are more evenly distributed over the full length of RNA than BD rhapsody and the library in Example 2.
  • the distribution of reads at the gene level was obtained through sequencing analysis.
  • the BD Phapsody 3' single-cell expression profile library is a library analysis structure constructed entirely with BD Rhapsody.
  • the sequences contained in the 3' single-cell RNA library constructed in Example 2 are mainly distributed at the 3' end of the gene, while the sequences contained in the single-cell transcriptome library constructed in Example 3 are significantly higher than The 3' single-cell RNA library and the BD Phapsody 3' single-cell expression profiling library were more biased towards the middle of the gene.
  • Example 4 Multiple nucleic acid co-labeled supports for multiplex PCR sequencing library construction
  • the purpose of this example is to realize single-tube multiplex PCR detection of the full-length gene sequences of Brca1 and Brca2, and the design of multiplex PCR primers is as follows:

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Genetics & Genomics (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Microbiology (AREA)
  • Analytical Chemistry (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Immunology (AREA)
  • Biomedical Technology (AREA)
  • Plant Pathology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

Variété de supports de co-marquage d'acides nucléiques, un procédé de préparation de ceux-ci et une application de ceux-ci. Les supports comprennent un corps de support, et une variété de marqueurs d'acide nucléique situés sur la surface et/ou à l'intérieur du corps de support. L'acide nucléique marqué sur un support unique comprend au moins les éléments suivants : un ou plusieurs premiers marqueurs d'acide nucléique ayant une fonction consistant au moins à piéger un composé spécifique dans un système de réaction sur la surface du support ; et un ou plusieurs seconds marqueurs d'acide nucléique ayant une fonction consistant au moins à participer à un processus de réaction biochimique spécifié du composé spécifique piégé sur la surface du support. La variété susmentionnée de supports co-marqués avec des acides nucléiques peut être utilisée pour l'analyse du profil d'expression de l'ARN d'une cellule unique à l'extrémité 5', la construction d'une banque VDJ d'une cellule unique à l'extrémité 5' pour une plateforme de microréseau, la construction d'une banque d'ARN d'une cellule unique à l'extrémité 3', la construction d'une banque de transcriptome d'une cellule unique, la recherche multi-omique d'une cellule unique, la PCR multiplex et/ou la construction d'une banque de séquençage PCR multiplex, etc.
PCT/CN2020/106089 2020-07-31 2020-07-31 Support de co-marquage d'acides multinucleiques, son procédé de préparation et son application WO2022021279A1 (fr)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202080005408.1A CN114096678A (zh) 2020-07-31 2020-07-31 多种核酸共标记支持物及其制作方法与应用
PCT/CN2020/106089 WO2022021279A1 (fr) 2020-07-31 2020-07-31 Support de co-marquage d'acides multinucleiques, son procédé de préparation et son application

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2020/106089 WO2022021279A1 (fr) 2020-07-31 2020-07-31 Support de co-marquage d'acides multinucleiques, son procédé de préparation et son application

Publications (1)

Publication Number Publication Date
WO2022021279A1 true WO2022021279A1 (fr) 2022-02-03

Family

ID=80037416

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/106089 WO2022021279A1 (fr) 2020-07-31 2020-07-31 Support de co-marquage d'acides multinucleiques, son procédé de préparation et son application

Country Status (2)

Country Link
CN (1) CN114096678A (fr)
WO (1) WO2022021279A1 (fr)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114574569A (zh) * 2022-03-28 2022-06-03 浙江大学 一种基于末端转移酶的基因组测序试剂盒和测序方法
CN115386622A (zh) * 2022-10-26 2022-11-25 北京寻因生物科技有限公司 一种转录组文库的建库方法及其应用

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115198001B (zh) * 2022-07-21 2023-07-04 北京寻因生物科技有限公司 一种单细胞全序列转录组文库构建方法及应用
CN115198000B (zh) * 2022-07-21 2023-07-21 北京寻因生物科技有限公司 一种单细胞全序列转录组文库构建的方法
CN115747301B (zh) * 2022-08-01 2023-12-22 深圳赛陆医疗科技有限公司 一种测序文库的构建方法、构建测序文库的试剂盒及基因测序方法
CN117089599B (zh) * 2023-10-20 2024-02-13 青岛百创智能制造技术有限公司 一种长编码序列微珠及其制备方法

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104364392A (zh) * 2012-02-27 2015-02-18 赛卢拉研究公司 用于分子计数的组合物和试剂盒
CN106459967A (zh) * 2014-04-29 2017-02-22 Illumina公司 使用模板转换和标签化的多重单细胞基因表达分析
CN106498040A (zh) * 2016-10-12 2017-03-15 浙江大学 一种分子标记微珠及基于该分子标记微珠的高通量单细胞测序方法
US20190367966A1 (en) * 2018-02-12 2019-12-05 10X Genomics, Inc. Methods and systems for analysis of major histocompatability complex
CN110684829A (zh) * 2018-07-05 2020-01-14 深圳华大智造科技有限公司 一种高通量的单细胞转录组测序方法和试剂盒
WO2020123316A2 (fr) * 2018-12-10 2020-06-18 10X Genomics, Inc. Procédés de détermination d'un emplacement d'un analyte biologique dans un échantillon biologique

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104364392A (zh) * 2012-02-27 2015-02-18 赛卢拉研究公司 用于分子计数的组合物和试剂盒
CN106459967A (zh) * 2014-04-29 2017-02-22 Illumina公司 使用模板转换和标签化的多重单细胞基因表达分析
CN106498040A (zh) * 2016-10-12 2017-03-15 浙江大学 一种分子标记微珠及基于该分子标记微珠的高通量单细胞测序方法
US20190367966A1 (en) * 2018-02-12 2019-12-05 10X Genomics, Inc. Methods and systems for analysis of major histocompatability complex
CN110684829A (zh) * 2018-07-05 2020-01-14 深圳华大智造科技有限公司 一种高通量的单细胞转录组测序方法和试剂盒
WO2020123316A2 (fr) * 2018-12-10 2020-06-18 10X Genomics, Inc. Procédés de détermination d'un emplacement d'un analyte biologique dans un échantillon biologique

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
MCEVOY CHRISTOPHER R, SEMPLE TIMOTHY, YELLAPU BHARGAVI, CHOONG DAVID Y, XU HUILING, MIR ARNAU GISELA, FELLOWES ANDREW P, FOX STEPH: "Improved next-generation sequencing pre-capture library yields and sequencing parameters using on-bead PCR", BIOTECHNIQUES, vol. 68, no. 1, 1 January 2020 (2020-01-01), US , pages 48 - 51, XP055891162, ISSN: 0736-6205, DOI: 10.2144/btn-2019-0059 *
YUAN ZHOU: "Single Nucleic Acid Molecule Manipulation and Single Cell Sequencing", DEPARTMENT OF CHEMICAL BIOLOGY, XIAMEN UNIVERSITY, no. 1, August 2017 (2017-08-01), pages 1 - 152, XP055891195 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114574569A (zh) * 2022-03-28 2022-06-03 浙江大学 一种基于末端转移酶的基因组测序试剂盒和测序方法
CN115386622A (zh) * 2022-10-26 2022-11-25 北京寻因生物科技有限公司 一种转录组文库的建库方法及其应用
CN115386622B (zh) * 2022-10-26 2023-10-27 北京寻因生物科技有限公司 一种转录组文库的建库方法及其应用

Also Published As

Publication number Publication date
CN114096678A (zh) 2022-02-25

Similar Documents

Publication Publication Date Title
WO2022021279A1 (fr) Support de co-marquage d'acides multinucleiques, son procédé de préparation et son application
US20220403376A1 (en) Surface-Based Tagmentation
US20210380974A1 (en) Combinatorial sets of nucleic acid barcodes for analysis of nucleic acids associated with single cells
WO2021013244A1 (fr) Procédé de construction d'une banque de capture et kit
KR102531677B1 (ko) 개별 세포 또는 세포 개체군으로부터 핵산을 분석하는 방법
US11306348B2 (en) Complex surface-bound transposome complexes
JP2017532028A (ja) 単離されたオリゴヌクレオチドおよび核酸の配列決定におけるその使用
TW201321518A (zh) 微量核酸樣本的庫製備方法及其應用
US11401543B2 (en) Methods and compositions for improving removal of ribosomal RNA from biological samples
US20230056763A1 (en) Methods of targeted sequencing
CN110886021B (zh) 一种单细胞dna文库的构建方法
WO2021253372A1 (fr) Construction de bibliothèque sans pcr à haute compatibilité et procédé de séquençage
US20100204050A1 (en) Target preparation for parallel sequencing of complex genomes
RU2790295C2 (ru) Сложные комплексы связанной на поверхности транспосомы
WO2023116376A1 (fr) Procédé de marquage et d'analyse d'acides nucléiques unicellulaires
WO2023116373A1 (fr) Procédé de génération d'une population de molécules d'acide nucléique marquées et kit pour le procédé
WO2023115536A1 (fr) Procédé de génération d'une population moléculaire d'acide nucléique marquée et kit associé
WO2022235898A1 (fr) Analyse à haut rendement de biomolécules
CN113490750A (zh) 微量dna甲基化高通量测序方法
CN117089597A (zh) 一种单细胞文库构建测序方法及其应用
CN117305410A (zh) 制备测序文库的方法和试剂盒以及相应测序方法
CN117737217A (zh) 一种用于低质量样本的空间转录组学检测方法及其应用

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20947545

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20947545

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 20947545

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 26/07/2023)

122 Ep: pct application non-entry in european phase

Ref document number: 20947545

Country of ref document: EP

Kind code of ref document: A1