CN117661123A - Single-cell library construction method for multiple sample mixing - Google Patents

Single-cell library construction method for multiple sample mixing Download PDF

Info

Publication number
CN117661123A
CN117661123A CN202311638029.8A CN202311638029A CN117661123A CN 117661123 A CN117661123 A CN 117661123A CN 202311638029 A CN202311638029 A CN 202311638029A CN 117661123 A CN117661123 A CN 117661123A
Authority
CN
China
Prior art keywords
primer
sequence
dna fragment
fusion
dna
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311638029.8A
Other languages
Chinese (zh)
Inventor
吴声鹏
杨文哲
李强
黄海波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yunzhun Pharmaceutical Technology Guangzhou Co ltd
Original Assignee
Yunzhun Pharmaceutical Technology Guangzhou Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yunzhun Pharmaceutical Technology Guangzhou Co ltd filed Critical Yunzhun Pharmaceutical Technology Guangzhou Co ltd
Priority to CN202311638029.8A priority Critical patent/CN117661123A/en
Publication of CN117661123A publication Critical patent/CN117661123A/en
Pending legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B50/00Methods of creating libraries, e.g. combinatorial synthesis
    • C40B50/06Biochemical methods, e.g. using enzymes or whole viable microorganisms
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B40/00Libraries per se, e.g. arrays, mixtures
    • C40B40/02Libraries contained in or displayed by microorganisms, e.g. bacteria or animal cells; Libraries contained in or displayed by vectors, e.g. plasmids; Libraries containing only microorganisms or vectors
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
    • G16B25/20Polymerase chain reaction [PCR]; Primer or probe design; Probe optimisation
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • G16B30/10Sequence alignment; Homology search
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Theoretical Computer Science (AREA)
  • Microbiology (AREA)
  • Medical Informatics (AREA)
  • Analytical Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Evolutionary Biology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Genetics & Genomics (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Immunology (AREA)
  • Databases & Information Systems (AREA)
  • Bioethics (AREA)
  • General Engineering & Computer Science (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The application relates to the technical field of bioinformatics and discloses a multi-sample mixed single-cell library construction method, wherein genomic DNA of each sample is broken, and DNA fragments with bands distributed in 500bp are obtained; adding an adhesive primer combined with an adhesive tail end into the obtained DNA fragment, and adding a tail end leveling reagent for mixing reaction; designing an initial primer according to the DNA fragment, and respectively connecting a plurality of labels with the designed initial primer to obtain a fusion primer; connecting the first fusion primer with the head end of the DNA fragment by adopting an A joint, and connecting the second fusion primer with the tail end of the DNA fragment by adopting a P joint; purifying magnetic beads; performing PCR amplification on DNA fragments with different fusion primers at the head and the tail; and (3) after quantitative and sample mixing, separating and purifying by agarose gel to obtain a single-cell sequencing document. The method and the device are beneficial to reducing cost and have higher reliability.

Description

Single-cell library construction method for multiple sample mixing
Technical Field
The application relates to the technical field of bioinformatics, in particular to a single-cell library construction method for multi-sample mixing.
Background
Single cell sequencing is a very costly method, and therefore, the cost can be effectively reduced by adopting a mixed sample mode for sequencing. However, existing mixed sample protocols mix up to 12 at a time, while being subject to cellular conditions, adding more tags is detrimental to the activity of the sample and is not easily distinguishable, and thus a reliable mixed sample sequencing technique is needed to solve these problems.
Disclosure of Invention
The present application is directed to a single-cell library construction method for multiple sample mixing, so as to solve the technical problems set forth in the background art.
In order to achieve the above purpose, the present application discloses the following technical solutions:
a single-cell library building method for multiple sample mixing sequentially comprises the following steps:
breaking the genome DNA of each sample to obtain DNA fragments with bands distributed in 500 bp;
adding an adhesive primer combined with an adhesive tail end into the obtained DNA fragment, and adding a tail end leveling reagent for mixing reaction;
designing an initial primer according to the DNA fragment, and respectively connecting a plurality of labels with the designed initial primer to obtain a fusion primer;
connecting the first fusion primer with the head end of the DNA fragment by adopting an A joint, and connecting the second fusion primer with the tail end of the DNA fragment by adopting a P joint;
purifying magnetic beads;
performing PCR amplification on DNA fragments with different fusion primers at the head and the tail;
and (3) after quantitative and sample mixing, agarose gel separation and purification are carried out to obtain a single-cell sequencing library.
Preferably, the fusion primer includes an upstream general primer sequence sequentially arranged in a 5 'to 3' direction and a specific upstream primer sequence designed according to a target amplification region in the purified DNA fragment, and a downstream general primer sequence sequentially arranged in a 5 'to 3' direction and a specific downstream primer sequence designed according to a target amplification region in the purified DNA fragment.
Preferably, the upstream universal primer sequence and the downstream universal primer sequence are 16S rRNA sequencing primers, respectively.
Preferably, the magnetic bead purification includes: the DNA fragment was purified using 1.2 times of AMPure XP beads to remove redundant small fragment adaptors and primers from the DNA fragment.
Preferably, the A-and P-linkers are linkers of Ion Torrent, ion PGM, ion Proton or Ion S5/S5XL sequencing platforms.
Preferably, the concentration of the genomic DNA is 0.1 to 150 ng/. Mu.L.
Preferably, the method further comprises: designing a matched biological information analysis method based on a database building structure;
the biological information analysis method sequentially comprises the following steps:
acquiring sequencing data of a DNA sequence to be tested;
inputting the sequencing data into a purification model for purification according to a preset filtering rule to obtain purified sequencing data only retaining biological information;
filtering the purified sequencing data according to a preset tag filtering rule to obtain a characterization sequence corresponding to the purified sequencing data;
comparing the characterization sequence with preset tag sequence data to obtain fusion primer information;
respectively carrying out GO enrichment analysis, KO enrichment analysis and WGS analysis based on the fusion primer information to obtain sequence types, sequence distribution and data base distribution characteristics corresponding to the characterization sequence;
and (5) carrying out statistics on the characteristics of the purified sequencing data, the sequence types, the sequence distribution and the data base distribution, and outputting the statistics as a biological information analysis result.
Preferably, said WGS analysis is effected by employing one or more of Refseq, GEO Database and Expression Atlas.
The beneficial effects are that: according to the single-cell library construction method for the multi-sample mixing, the quantity of the labels is enriched by adopting the form of the fusion primer, and the corresponding labels are connected to the head end or the tail end of the DNA fragment by adopting the connector, so that the analysis is convenient, the cell state is kept, the cells are easy to distinguish, and the influence on the cell activity is reduced. Further, the real condition of the sample is conveniently restored through the later data filtering, and the related biological information is rapidly acquired.
Drawings
In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings that are required in the embodiments or the description of the prior art will be briefly described below, it being obvious that the drawings in the following description are only some embodiments of the present application, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
Fig. 1 is a schematic flow chart of a single-cell library construction method for multi-sample mixing according to an embodiment of the present application.
Detailed Description
The following description of the technical solutions in the embodiments of the present application will be clear and complete, and it is obvious that the described embodiments are only some embodiments of the present application, but not all embodiments. All other embodiments, which can be made by one of ordinary skill in the art without undue burden from the present disclosure, are within the scope of the present disclosure.
In this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising … …" does not exclude the presence of other like elements in a process, method, article or apparatus that comprises the element.
The applicant found that the existing mixed sample scheme mixes at most 12 at a time, and we can mix more while maintaining the sample quality, and the suspension quality simultaneously meets various single cell platform on-machine requirements. The blood sample is especially suitable, and the cost can be greatly reduced.
Based on the above, this embodiment discloses a single cell library construction method for multi-sample mixing as shown in fig. 1, which sequentially includes the following steps:
s101, breaking the genome DNA of each sample to obtain DNA fragments with bands distributed in 500bp in a centralized way; in this example, the concentration of the genomic DNA is 0.1-150 ng/. Mu.L;
s102, adding a cohesive primer combined with a cohesive terminal to the obtained DNA fragment, and adding a terminal filling reagent to carry out a mixing reaction;
s103, designing an initial primer according to the DNA fragment, and respectively connecting a plurality of labels with the designed initial primer to obtain a fusion primer;
s104, connecting the first fusion primer with the head end of the DNA fragment by adopting an A joint, and connecting the second fusion primer with the tail end of the DNA fragment by adopting a P joint;
s105-magnetic bead purification;
s106, carrying out PCR amplification on the DNA fragments with different fusion primers at the head and the tail;
s107-quantifying, mixing, and performing agarose gel separation and purification to obtain a single-cell sequencing library.
In this embodiment, the fusion primer includes an upstream general primer sequence sequentially arranged in the 5 'to 3' direction and a specific upstream primer sequence designed according to the target amplified region in the purified DNA fragment, and a downstream general primer sequence sequentially arranged in the 5 'to 3' direction and a specific downstream primer sequence designed according to the target amplified region in the purified DNA fragment. Wherein the upstream universal primer sequence and the downstream universal primer sequence are 16S rRNA sequencing primers, respectively.
In this embodiment, the magnetic bead purification comprises: the DNA fragment was purified using 1.2 times of AMPure XP beads to remove redundant small fragment adaptors and primers from the DNA fragment.
It is possible that the A-and P-linkers are linkers of Ion Torrent, ion PGM, ion Proton or Ion S5/S5XL sequencing platforms. It should be noted that Ion Torrent, ion PGM, ion proton or Ion S5/S5XL sequencing platforms are common sequencing platforms, and in order to make the library more compatible, the present embodiment uses a linker in the common sequencing platform.
As a preferred implementation of this embodiment, the method further includes: designing a matched biological information analysis method based on a database building structure;
the biological information analysis method sequentially comprises the following steps:
acquiring sequencing data of a DNA sequence to be tested;
inputting the sequencing data into a purification model for purification according to a preset filtering rule to obtain purified sequencing data only retaining biological information;
filtering the purified sequencing data according to a preset tag filtering rule to obtain a characterization sequence corresponding to the purified sequencing data;
comparing the characterization sequence with preset tag sequence data to obtain fusion primer information;
respectively carrying out GO enrichment analysis, KO enrichment analysis and WGS analysis based on the fusion primer information to obtain sequence types, sequence distribution and data base distribution characteristics corresponding to the characterization sequence;
and (3) carrying out statistics on the characteristics of the purified sequencing data, the sequence types, the sequence distribution and the data base distribution, and outputting the statistics as a biological information analysis result, wherein the real condition of the sample is the difference between the purified sequencing data and the characterization sequence.
In this embodiment, the WGS analysis is performed by using one or more of Refseq, GEO Database, and Expression Atlas.
The single-cell library construction method for mixing multiple samples, which is recorded in the embodiment, adopts the form of fusion primers, enriches the number of labels, and the corresponding labels are connected at the head end or the tail end of the DNA fragment by adopting a connector, so that the analysis is convenient, the cell state is kept, the cells are easy to distinguish, and the influence on the cell activity is reduced. Further, the real condition of the sample is conveniently restored through the later data filtering, and the related biological information is rapidly acquired.
Finally, it should be noted that: the foregoing description is only a preferred embodiment of the present application, and although the present application has been described in detail with reference to the foregoing embodiments, it will be apparent to those skilled in the art that modifications may be made to the technical solutions described in the foregoing embodiments, or equivalents may be substituted for some of the technical features thereof, and any modifications, equivalents, improvements or changes that fall within the spirit and principles of the present application are intended to be included in the scope of protection of the present application.

Claims (8)

1. The single-cell library building method for the mixture of multiple samples is characterized by comprising the following steps of:
breaking the genome DNA of each sample to obtain DNA fragments with bands distributed in 500 bp;
adding an adhesive primer combined with an adhesive tail end into the obtained DNA fragment, and adding a tail end leveling reagent for mixing reaction;
designing an initial primer according to the DNA fragment, and respectively connecting a plurality of labels with the designed initial primer to obtain a fusion primer;
connecting the first fusion primer with the head end of the DNA fragment by adopting an A joint, and connecting the second fusion primer with the tail end of the DNA fragment by adopting a P joint;
purifying magnetic beads;
performing PCR amplification on DNA fragments with different fusion primers at the head and the tail;
and (3) after quantitative and sample mixing, agarose gel separation and purification are carried out to obtain a single-cell sequencing library.
2. The multi-sample mixed single cell pooling method according to claim 1, wherein the fusion primer includes an upstream general primer sequence sequentially arranged in a 5 'to 3' direction and a specific upstream primer sequence designed according to a target amplified region in the purified DNA fragment, and a downstream general primer sequence sequentially arranged in a 5 'to 3' direction and a specific downstream primer sequence designed according to a target amplified region in the purified DNA fragment.
3. The multi-sample mixed single cell pooling method of claim 2, wherein the upstream universal primer sequence and the downstream universal primer sequence are each 16S rRNA sequencing primers.
4. The multi-sample mixed single cell banking method according to claim 1, wherein the magnetic bead purification comprises: the DNA fragment was purified using 1.2 times of AMPure XP beads to remove redundant small fragment adaptors and primers from the DNA fragment.
5. The multi-sample mixed single cell pooling method of claim 1, wherein the a-and P-junctions are junctions of Ion Torrent, ion PGM, ion proton or Ion S5/S5XL sequencing platforms.
6. The method for single cell banking of multiple sample mixtures according to claim 1, wherein the concentration of genomic DNA is 0.1 to 150 ng/. Mu.l.
7. The multi-sample mixed single cell banking method according to claim 1, further comprising: designing a matched biological information analysis method based on a database building structure;
the biological information analysis method sequentially comprises the following steps:
acquiring sequencing data of a DNA sequence to be tested;
inputting the sequencing data into a purification model for purification according to a preset filtering rule to obtain purified sequencing data only retaining biological information;
filtering the purified sequencing data according to a preset tag filtering rule to obtain a characterization sequence corresponding to the purified sequencing data;
comparing the characterization sequence with preset tag sequence data to obtain fusion primer information;
respectively carrying out GO enrichment analysis, KO enrichment analysis and WGS analysis based on the fusion primer information to obtain sequence types, sequence distribution and data base distribution characteristics corresponding to the characterization sequence;
and (5) carrying out statistics on the characteristics of the purified sequencing data, the sequence types, the sequence distribution and the data base distribution, and outputting the statistics as a biological information analysis result.
8. The multi-sample mixed single cell pooling method of claim 7 wherein said WGS analysis is achieved by employing one or more of Refseq, GEO Database and Expression Atlas.
CN202311638029.8A 2023-12-02 2023-12-02 Single-cell library construction method for multiple sample mixing Pending CN117661123A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311638029.8A CN117661123A (en) 2023-12-02 2023-12-02 Single-cell library construction method for multiple sample mixing

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311638029.8A CN117661123A (en) 2023-12-02 2023-12-02 Single-cell library construction method for multiple sample mixing

Publications (1)

Publication Number Publication Date
CN117661123A true CN117661123A (en) 2024-03-08

Family

ID=90063481

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311638029.8A Pending CN117661123A (en) 2023-12-02 2023-12-02 Single-cell library construction method for multiple sample mixing

Country Status (1)

Country Link
CN (1) CN117661123A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109797437A (en) * 2019-01-18 2019-05-24 北京爱普益生物科技有限公司 A kind of construction method of sequencing library when detecting multiple samples and its application
US20190352711A1 (en) * 2017-04-05 2019-11-21 Genetron Health (Beijing) Co., Ltd. Method for Rapidly Constructing Amplicon Library Through One-Step Process
CN115247281A (en) * 2021-04-25 2022-10-28 中国科学院宁波材料技术与工程研究所慈溪生物医学工程研究所 Preparation method and application of amplicon library
WO2023284768A1 (en) * 2021-07-13 2023-01-19 北京爱普益生物科技有限公司 Fusion primer direct amplification method-based human mitochondrial whole genome high-throughput sequencing kit
WO2023082070A1 (en) * 2021-11-09 2023-05-19 京东方科技集团股份有限公司 Dna terminal repair, linker reagent, kit and dna library construction method

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190352711A1 (en) * 2017-04-05 2019-11-21 Genetron Health (Beijing) Co., Ltd. Method for Rapidly Constructing Amplicon Library Through One-Step Process
CN109797437A (en) * 2019-01-18 2019-05-24 北京爱普益生物科技有限公司 A kind of construction method of sequencing library when detecting multiple samples and its application
CN115247281A (en) * 2021-04-25 2022-10-28 中国科学院宁波材料技术与工程研究所慈溪生物医学工程研究所 Preparation method and application of amplicon library
WO2023284768A1 (en) * 2021-07-13 2023-01-19 北京爱普益生物科技有限公司 Fusion primer direct amplification method-based human mitochondrial whole genome high-throughput sequencing kit
WO2023082070A1 (en) * 2021-11-09 2023-05-19 京东方科技集团股份有限公司 Dna terminal repair, linker reagent, kit and dna library construction method

Similar Documents

Publication Publication Date Title
JP2020039346A5 (en)
CN101278058A (en) Improved strategies for sequencing complex genomes using high throughput sequencing technologies
CN105734048A (en) PCR-free sequencing library preparation method for genome DNA
CN105506063A (en) Primer composition and uses thereof
CN113308526B (en) Fusion primer direct amplification method human mitochondrial whole genome high-throughput sequencing kit
CN105087560B (en) A kind of multiple PCR primer and method building pig BCR heavy chain library based on high-flux sequence
Cummings et al. Combining target enrichment with barcode multiplexing for high throughput SNP discovery
CN117051481B (en) Preparation method and application of space bar code chip
CN106011230A (en) Primer composition for detecting fragmentized DNA target area and application thereof
CN113621609A (en) Library construction primer group and application thereof in high-throughput detection
CN111808854A (en) Balanced joint with molecular bar code and method for quickly constructing transcriptome library
CN113811618A (en) Sequencing library constructed based on methylated DNA target region, system and application
CN106795650B (en) PF quick database building method and application thereof
Baudement et al. High-salt–recovered sequences are associated with the active chromosomal compartment and with large ribonucleoprotein complexes including nuclear bodies
CN112592981B (en) Primer group, kit and method for DNA archive construction
Profaizer et al. HLA genotyping using the Illumina HLA TruSight next‐generation sequencing kits: A comparison
CN117661123A (en) Single-cell library construction method for multiple sample mixing
CN113337590A (en) Second-generation sequencing method and library construction method
CN109280696A (en) The method of SNP detection technique fractionation mixing sample
CN113512595B (en) Biomarker, method and application for DNA sample tracking detection
CN115386622A (en) Transcriptome library building method and application thereof
CN111363796B (en) Multi-library mixed sample sequencing method using ultra-low total RNA
CN112251491B (en) cDNA library construction method of capillary 96-well plate
Priya et al. Exome sequencing: capture and sequencing of all human coding regions for disease gene discovery
CN115029414A (en) Method for establishing database by Ribosome-seq of absolute quantitative translation group

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination