WO2022141061A1 - 一种环化文库的快速构建方法及成环接头 - Google Patents
一种环化文库的快速构建方法及成环接头 Download PDFInfo
- Publication number
- WO2022141061A1 WO2022141061A1 PCT/CN2020/140877 CN2020140877W WO2022141061A1 WO 2022141061 A1 WO2022141061 A1 WO 2022141061A1 CN 2020140877 W CN2020140877 W CN 2020140877W WO 2022141061 A1 WO2022141061 A1 WO 2022141061A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- linker
- complementary
- double
- ring
- stranded
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 34
- 230000000295 complement effect Effects 0.000 claims abstract description 47
- 239000012634 fragment Substances 0.000 claims abstract description 40
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 7
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 claims description 28
- 238000012986 modification Methods 0.000 claims description 15
- 230000004048 modification Effects 0.000 claims description 15
- 229910000147 aluminium phosphate Inorganic materials 0.000 claims description 14
- 108060002716 Exonuclease Proteins 0.000 claims description 6
- 102000013165 exonuclease Human genes 0.000 claims description 6
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 claims description 2
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 claims description 2
- 238000006392 deoxygenation reaction Methods 0.000 claims 1
- 239000002773 nucleotide Substances 0.000 claims 1
- 125000003729 nucleotide group Chemical group 0.000 claims 1
- 108020004414 DNA Proteins 0.000 abstract description 21
- 102000003960 Ligases Human genes 0.000 abstract description 5
- 108090000364 Ligases Proteins 0.000 abstract description 5
- 230000009471 action Effects 0.000 abstract description 2
- 239000000047 product Substances 0.000 description 21
- 238000010276 construction Methods 0.000 description 15
- 150000007523 nucleic acids Chemical group 0.000 description 15
- 238000006243 chemical reaction Methods 0.000 description 12
- 238000000746 purification Methods 0.000 description 12
- 238000012163 sequencing technique Methods 0.000 description 12
- 239000011324 bead Substances 0.000 description 8
- 230000008569 process Effects 0.000 description 8
- 102000053602 DNA Human genes 0.000 description 6
- 238000002474 experimental method Methods 0.000 description 6
- 238000001914 filtration Methods 0.000 description 6
- 108020004707 nucleic acids Proteins 0.000 description 6
- 102000039446 nucleic acids Human genes 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 5
- 239000000203 mixture Substances 0.000 description 5
- 239000000872 buffer Substances 0.000 description 4
- 238000013461 design Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 238000013467 fragmentation Methods 0.000 description 3
- 238000006062 fragmentation reaction Methods 0.000 description 3
- 238000007481 next generation sequencing Methods 0.000 description 3
- 230000008439 repair process Effects 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 2
- 239000007984 Tris EDTA buffer Substances 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 238000007865 diluting Methods 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 238000012165 high-throughput sequencing Methods 0.000 description 2
- 239000002077 nanosphere Substances 0.000 description 2
- 238000005580 one pot reaction Methods 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 238000005096 rolling process Methods 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 239000005711 Benzoic acid Substances 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 102000004594 DNA Polymerase I Human genes 0.000 description 1
- 108010017826 DNA Polymerase I Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 229920002594 Polyethylene Glycol 8000 Polymers 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 1
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- -1 polytetrafluoroethylene Polymers 0.000 description 1
- 229920001343 polytetrafluoroethylene Polymers 0.000 description 1
- 239000004810 polytetrafluoroethylene Substances 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000012772 sequence design Methods 0.000 description 1
- 238000004904 shortening Methods 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 238000010998 test method Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1093—General methods of preparing gene libraries, not provided for in other subgroups
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6806—Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B80/00—Linkers or spacers specially adapted for combinatorial chemistry or libraries, e.g. traceless linkers or safety-catch linkers
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B50/00—Methods of creating libraries, e.g. combinatorial synthesis
- C40B50/06—Biochemical methods, e.g. using enzymes or whole viable microorganisms
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B70/00—Tags or labels specially adapted for combinatorial chemistry or libraries, e.g. fluorescent tags or bar codes
Definitions
- the invention belongs to the field of biotechnology, and more specifically, the invention provides a rapid construction method of a circularized library and a circular linker.
- the conventional library construction process mainly includes the following steps: break the genomic nucleic acid chain into fragments by physical or enzymatic cleavage; use exonuclease to perform de-end repair on the broken fragment, and fill in both ends of the fragment so that both ends are uniform. To blunt end, and then use polymerase to add A to the 3' end of DNA to generate a single base sticky end; add the same linker to both ends of the fragment with A added to the 3' end.
- the linker is formed by the complementary pairing of a long nucleic acid chain and a short chain.
- the gap is connected by ligase; the fragment connecting the linker is used as the template, and the nucleic acid single-stranded primer that is complementary to the linker chain is added as a primer for polymerization.
- Enzyme chain reaction the two single strands of the template are separated by denaturation, and after the primer is bound to the corresponding single strand, the corresponding single strand is respectively extended into a double-stranded target product with a complete complementary pairing, one end is an A linker, and one end is a B linker ;
- the ligation product is purified and recovered by magnetic bead purification; the purified and recovered DNA double-stranded product is denatured to obtain single-stranded DNA, and the use of circularized auxiliary nucleic acid single-stranded primer and screening of the phosphate group at the 5' end of the DNA single-stranded , circularize the target DNA single-strand; through exonuclease and other methods to remove unnecessary and remaining uncircular
- the adapter used in NGS high-throughput sequencing itself is a specially designed DNA sequence.
- the characteristic sequence information on the adapter is used as the sequence of the starting site for sequencing during sequencing. Determination of subsequent sequence information.
- the adapters are connected to the two ends of the DNA fragments by ligation and other methods. In order to realize this directional connection and avoid the mutual connection between the adapters, the connection method of sticky end adapters is usually adopted. In traditional library construction, it is necessary to ensure that both ends of the DNA double-strand are connected with adapters, and it is necessary to remove excess adapter products by means of purification.
- the present invention provides a rapid construction method of a circularized library and a circular linker.
- the present invention provides a method for constructing a circularized library, the method comprising:
- the fragment protruding from the 3' end is circularized to form a circular library using a loop-forming linker, the loop-forming linker is a double-stranded incomplete pairing and has 5'-end overhangs at both ends, and the 5' end of the loop-forming linker is double-stranded;
- the 'end overhangs are complementary to the 3' end overhangs of the interrupted fragment.
- the DNA sequence is broken into random fragments by sonication or enzyme cleavage.
- the overhang at the 3' end of the interrupted fragment and the overhang at the 5' end of the looping linker is 1-5 nt in length, such as 3 nt or 2 nt, preferably 1 nt.
- the 3' overhang of the interrupted fragment is A and the 5' overhang of the looped linker is T.
- the interrupted fragment is treated with exonuclease, polymerase and T4 polynucleotide kinase to 5' phosphorylate and 3' more A deoxynucleotide sticky ends.
- the incompletely paired duplexes comprise gaps in one strand or non-matching regions between the duplexes.
- the double strands include two non-matching regions, and the two non-matching regions include barcode sequences for distinguishing samples.
- the ring-forming linker comprises the following ring-forming linker (a) or ring-forming linker (b):
- the loop-forming linker (a) includes a long chain and two short chains paired with both ends of the long chain, the 5' end of the long chain has a phosphoric acid modification, and the short chain complementary to the 3' end of the long chain has a pair of short chains.
- the 5' end has a phosphoric acid modification
- the complementary double-stranded linker has a 3' end T sticky end, and includes a single-stranded non-complementary region of 8-12 nt (eg 10 nt), preferably the single-stranded non-complementary region includes a sample for distinguishing samples. barcode sequence;
- the loop-forming linker (b) comprises two partially complementary double strands, the two ends of the double strand are paired to form a double strand structure, and the double strand structure has a 5'-terminal phosphoric acid modification and a 3'-terminal T sticky end, preferably the
- the duplex includes a complementary portion of 8-12 nt (eg, 10 nt) as a barcode sequence that differentiates the samples.
- the ring-forming linker is a ring-forming linker (a)
- the ligated product is digested with exonuclease, and the digested product undergoes one-step purification to obtain a circular library.
- the loop-forming linker is a loop-forming linker (b), and the ligated product is denatured to obtain a circular library.
- the circularized single-stranded circular library enters a subsequent sequencing step, that is, after rolling circle replication, nucleic acid nanospheres (DNB) are formed to read nucleic acid sequence information.
- NDB nucleic acid nanospheres
- the present invention provides a loop-forming linker constructed from a circularization library, the loop-forming linker is a double-stranded incomplete pairing and has 5'-end overhangs at both ends, and the 5'-end of the loop-forming linker The overhang is complementary to the overhang at the 3' end of the fragment to be circularized.
- the overhang at the 3' end of the fragment to be circularized and the overhang at the 5' end of the looping linker is 1-5 nt in length, such as 3 nt or 2 nt, preferably 1 nt.
- the 3' overhang of the fragment to be circularized is A
- the 5' overhang of the looping linker is T.
- the incompletely paired duplexes comprise gaps in one strand or regions of non-matching between the duplexes.
- the duplex includes two non-matching regions, and the two non-matching regions include barcode sequences for distinguishing samples.
- the ring-forming linker comprises the following ring-forming linker (a) or ring-forming linker (b):
- the loop-forming linker (a) includes a long chain and two short chains paired with both ends of the long chain, the 5' end of the long chain has a phosphoric acid modification, and the short chain complementary to the 3' end of the long chain has a pair of short chains.
- the 5' end has a phosphoric acid modification
- the complementary double-stranded linker has a 3' end T sticky end, and includes a single-stranded non-complementary region of 8-12 nt (eg 10 nt), preferably the single-stranded non-complementary region includes a sample for distinguishing samples. barcode sequence;
- the loop-forming linker (b) comprises two partially complementary double strands, the two ends of the double strand are paired to form a double strand structure, and the double strand structure has a 5'-terminal phosphoric acid modification and a 3'-terminal T sticky end, preferably the
- the duplex includes a complementary portion of 8-12 nt (eg, 10 nt) as a barcode sequence that differentiates the samples.
- the present invention achieves one-step circularization of circularized libraries through specially designed linkers.
- the end-repair of the broken DNA fragment and the addition of A are processed to form the A sticky end at the end, which is complementary to the specially designed linker T sticky end to form a circular structure, and the connection at the gap is completed under the action of ligase.
- Fig. 1 shows the schematic diagram of the circular linker (a):
- A is the sequence information of the final circular structure of the library, the upper part is the insert fragment information, the lower part is the linker sequence information, and the underlined sequence information is the barcode information;
- B, C and D is a schematic diagram of the database construction process.
- Fig. 2 shows the schematic diagram of the looping linker (b):
- A is the sequence information of the final looping structure of the library, the upper part is the insert fragment information, the lower part is the linker sequence information, the underlined non-italicized sequence information is barcode information, and the underlined italicized
- the sequence is a specially designed complementary sequence of barcode information, and the underlined sequence information forms a palindrome sequence;
- B, C and D are schematic diagrams of the library construction process.
- Figure 3 shows GC coverage information (A) for experiments performed with the ring-forming linker (a), and GC coverage information (B) for experiments performed with the ring-forming linker (b).
- the invention solves the long library construction time existing in the traditional MGI-based library construction methods (including the steps of genomic DNA interruption, DNB fragment end-repair and end addition of A, joint connection, PCR amplification, single-strand separation and circularization, etc.). , the problem of low operational simplicity and purification loss.
- the inventors improved in principle, changed the method of conventionally adding a linker, amplifying and then performing circularization, and designed a linking linker with a unique sequence structure, which realized linker connection and product formation. Ring fusion is a one-step reaction, the reaction system and purification steps are optimized, and the time for building a circularized library is greatly shortened.
- A shows the sequence information of the final circular structure of the library, the upper part shows the insert fragment, the lower part shows the linker sequence, and the underlined sequence represents the barcode sequence ;
- B, C and D schematically show the process of building a library.
- the circular linker (a) is formed by complementary pairing of one long nucleic acid chain and two short nucleic acid chains, and this structure enables digestion of the chain where the two short nucleic acid chains are located after the circular formation.
- the length of the circular linker is 84 bp, and the distance between the two nucleic acid short sequences is 10 bp.
- the 5' end of the long nucleic acid chain and the 5' of the short chain at the linker end have phosphoric acid modification, and the complementary double-stranded DNA linker structure is a sticky end with 3' extra T deoxynucleotides, preferably 10bp in the middle of the linker for use.
- Ident single-stranded barcode sequences After the cohesive end of the 3' excess T deoxynucleotide of the double-stranded DNA linker structure is complementary to the cohesive end of the 3' excess A deoxynucleotide of the broken DNA double-stranded fragment, the ligase can be used for ligation. After ligation into a circle, the ligation product can be digested to generate a single-stranded library or directly subjected to DNB.
- A is the sequence information of the final circular structure of the library, the upper part is the insert fragment information, the lower part is the linker sequence information, the underlined non-italicized sequence information is the barcode information, and the underlined italicized
- the sequence is the complementary sequence of specially designed barcode information, and the underlined sequence information forms a palindrome sequence;
- B, C and D schematically show the library construction process.
- the loop-forming linker (b) is formed by the complementary pairing of two long nucleic acid chains, the 5' end has a phosphoric acid modification, and the complementary double-stranded DNA linker structure is a sticky end with 3' extra T deoxynucleotides, and there is a non-stick in the middle of the linker. complementary sequences to form an ⁇ structure.
- the length of the circular linker is 93 bp, and there is a non-complementary sequence in the middle to form a structure of ⁇ .
- the positive and negative strands of the formed circular linker can be measured during sequencing, while reducing the number of linker dimers. form.
- the length of the non-complementary sequence is 34bp, and the sticky end of the 3' excess T deoxynucleotide of the double-stranded DNA linker structure can be complementary to the sticky end of the 3' excess A deoxynucleotide that breaks the DNA double-stranded fragment.
- Ligation is performed by ligase. After ligation into a circle, the ligation product can be denatured to generate a single-stranded library for DNB.
- the design innovation of the present invention lies in the design structure of the circular linker, the library construction is completed for the double-stranded DNA, the AT connection is used, and the sequence design is adapted to the MGI sequencing instrument.
- the linker of the present invention can be connected to both ends of the fragment at the same time, so as to realize one-step loop formation by ligation reaction.
- the use of the circularization linker of the present invention skips the PCR and circularization reaction of traditional library construction, and at the same time omits the purification step, thus greatly reducing the operation time of the entire library construction process.
- the reaction of each step is guaranteed to be carried out continuously in the same tube, which avoids the operation of transferring the tube and the loss in the process.
- a circularized library is produced after one-step purification.
- test methods in the following examples are conventional methods unless otherwise specified.
- the test materials used in the following examples, unless otherwise specified, were purchased from conventional chemical reagent stores. It should be noted that the above summary section and the following detailed description are only for the purpose of specifically illustrating the present invention and are not intended to limit the present invention in any way. The scope of the present invention is to be determined by the appended claims without departing from the spirit and spirit of the present invention.
- Short chain 1 5'-AGTCGGATCGTAGCCATGTCGTTCTGTGAGCCAAGGAGTTG-3' (SEQ ID NO. 2)
- Short chain 2 5'-TTGTCTTCCTAAGACCGCTTGGCCTCCGACTT-3' (SEQ ID NO. 3) looping linker (b)
- both chains of the looping linker (b) are long chains b, that is, the matching regions and non-matching regions at both ends of the looping linker (b) are mirror images of each other, and the matching region in the middle is a palindrome sequence.
- Genomic DNA fragmentation There are various methods for genomic DNA fragmentation, whether it is physical ultrasonic method or enzymatic reaction method, there are very mature solutions on the market, and the physical ultrasonic fragmentation method is adopted in this embodiment.
- the interrupt conditions are set as follows:
- the magnetic bead purification method or the gel recovery method can be used, and the magnetic bead purification method is used in this embodiment.
- the disrupted DNA add 80 ⁇ l Ampure XP magnetic beads, mix well and leave for 7-15min; put it on a magnetic stand to collect the supernatant, add 40 ⁇ l Ampure XP magnetic beads to the supernatant, mix well and place for 7-15min; Put it into a magnetic rack to remove the supernatant, and wash the magnetic beads twice with 75% ethanol; add 50 ⁇ l of TE buffer solution or enzyme-free water after drying, mix well and place for 7-15 minutes to dissolve the recovered product.
- End repair plus A Take 100 ng of the recovered product from the previous step, and supplement the volume of TE to 40 ⁇ L to prepare the system according to the following table: Prepare the reaction mixture as shown in the following table:
- Linker ligation After the reaction, immediately add 5 ⁇ L of the ring-forming linker (a) or the ring-forming linker (b), and the linker concentration is 10 ⁇ M. At the same time, prepare the joint connection mixture, as shown in the following table:
- Reagent name volume 10 ⁇ PNK buffer 3 ⁇ L 100mM ATP 0.8 ⁇ L Nuclease-free water 3.6 ⁇ L 50%PEG8000 16 ⁇ L T4 DNA ligase 1.6 ⁇ L total 25 ⁇ L
- linker ligation mixture After adding an appropriate amount of linker to the end repair product, 25 ⁇ L of linker ligation mixture was added and the following reaction was performed.
- the reaction conditions are shown in the following table:
- DNB Preparation and sequencing of DNB: For specific operation steps, please refer to the instructions of the MGI sequencing kit. DNBs were prepared from double-stranded libraries. Preliminary on-machine analysis and sequencing results are as follows. The GC coverage information of the experiment with the looping linker (a) is shown in Figure 3 A, and the GC coverage information of the experiment with the looping linker (b) is shown in Figure 3 B. Show.
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Analytical Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Crystallography & Structural Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Bioinformatics & Computational Biology (AREA)
- Plant Pathology (AREA)
- Immunology (AREA)
- General Chemical & Material Sciences (AREA)
- Medicinal Chemistry (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Description
长链a(100μM) | 10μL |
短链1(100μM) | 10μL |
短链2(100μM) | 10μL |
5×STE buffer | 20μL |
水 | 50μL |
总计 | 100μL |
长链b(100μM) | 20μL |
5×STE buffer | 20μL |
水 | 60μL |
总计 | 100μL |
填充系数 | 20% |
剧烈度 | 5 |
脉冲系数 | 200 |
打断时间 | 35×4次 |
试剂名称 | 体积 |
无核酸酶的水 | 2.1μL |
10×PNK缓冲液 | 5μL |
5:1 dATP:dNTP | 0.6μL |
Klenow片段 | 0.1μL |
rTaq | 0.2μL |
T4 DNA聚合酶 | 2μL |
总量 | 10μL |
处理条件 | 时间 |
37℃ | 30min |
65℃ | 15min |
4℃ | ∞ |
试剂名称 | 体积 |
10×PNK缓冲液 | 3μL |
100mM ATP | 0.8μL |
无核酸酶的水 | 3.6μL |
50%PEG8000 | 16μL |
T4 DNA连接酶 | 1.6μL |
总量 | 25μL |
处理条件 | 时间 |
23℃ | 30min |
4℃ | ∞ |
样本名 | 成环接头a |
过滤后读长数 | 1076236 |
过滤后碱基数(Mb) | 161.44 |
过滤后比例(%) | 52.62 |
比对率(%) | 11.1 |
特异性比率(%) | 98.26 |
重复数据率(%) | 0.83 |
错配比例(%) | 3.5 |
样本名 | 成环接头b |
过滤后读长数 | 16709122 |
过滤后碱基数(Mb) | 1754.46 |
过滤后比例(%) | 38.24 |
比对率(%) | 50.65 |
特异性比率(%) | 98.1 |
重复数据率(%) | 1.9 |
错配比例(%) | 1.86 |
Claims (10)
- 一种环化文库构建方法,所述方法包括:1)将DNA序列打断成片段;2)使所述打断的片段的两端3’端突出;3)利用成环接头将所述3’端突出的片段环化形成环状文库,所述成环接头为不完全配对且两端具有5’端突出的双链,所述成环接头的5’端突出与所述打断的片段的3’端突出互补。
- 根据权利要求1所述的方法,所述打断的片段的3’端突出为A,所述成环接头的5’端突出为T。
- 根据权利要求2所述的方法,在2)中,将所述打断的片段经外切酶、聚合酶和T4多聚核苷酸激酶的处理成5’磷酸化且3’多出A脱氧核苷酸的粘性末端。
- 根据权利要求1-3任一项所述的方法,在3)中,所述不完全配对的双链包括在一条链上有缺口或者所述双链间有非匹配区。
- 根据权利要求4所述的方法,在3)中,所述双链间包括两段非匹配区,所述两段非匹配区之间包括用于区分样本的条形码序列。
- 根据权利要求1-4任一项所述的方法,在3)中,所述成环接头包括如下的成环接头(a)或成环接头(b):成环接头(a)包括一条长链和与所述长链两端配对的两条短链,所述长链5’端具有磷酸修饰,与所述长链3’端互补配对的短链的5’端具有磷酸修饰,互补形成的双链接头具有3’端T粘性末端,并包括8-12nt(例如10nt)的单链非互补区域,优选所述单链非互补区域包括用于区分样本的条形码序列;成环接头(b)包括两条部分互补配对的双链,所述双链两端配对形成双链结构,所述双链结构具有5’端磷酸修饰和3’端T粘性末端,优选所述双链包括8-12nt(例如10nt)的互补部分,作为区分样本的条形码序列。
- 根据权利要求6所述的方法,在3)中,所述成环接头为成环接头(a),所述连接后的产物经过核酸外切酶消化,消化后产物经过一步纯化后得到环状文库;或者,所述成环接头为成环接头(b),所述连接后的产物经过变性得到环状文库。
- 一种构建环化文库的成环接头,所述成环接头为不完全配对且两端具有5’端突出结构的双链,所述成环接头5’端突出与待环化片段的3’端突出互补。
- 根据权利要求8所述的成环接头,所述不完全配对的双链包括在一条链上有缺口或者所述双链间有非匹配区;优选地,所述双链间包括两段非匹配区,所述两段非匹配区之间包括用于区分样本的条形码序列。
- 根据权利要求8或9所述的成环接头,所述成环接头包括如下的成环接头(a)或成环接头(b):成环接头(a)包括一条长链和与所述长链两端配对的两条短链,所述长链5’端具有磷酸修饰,与所述长链3’端互补配对的短链的5’端具有磷酸修饰,互补形成的双链接头具有3’端T粘性末端,并包括8-12nt(例如10nt)的单链非互补区域,优选所述单链非互补区域包括用于区分样本的条形码序列;成环接头(b)包括两条部分互补配对的双链,所述双链两端配对形成双链结构,所述双链结构具有5’端磷酸修饰和3’端T粘性末端,优选所述双链包括8-12nt(例如10nt)的互补部分,作为区分样本的条形码序列。
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202080107392.5A CN116670340A (zh) | 2020-12-29 | 2020-12-29 | 一种环化文库的快速构建方法及成环接头 |
PCT/CN2020/140877 WO2022141061A1 (zh) | 2020-12-29 | 2020-12-29 | 一种环化文库的快速构建方法及成环接头 |
EP20967409.2A EP4273307A1 (en) | 2020-12-29 | 2020-12-29 | Method for rapidly constructing cyclized library and ring-forming linker |
US18/259,978 US20240076657A1 (en) | 2020-12-29 | 2020-12-29 | Method for rapidly constructing cyclized library and cyclization adapter |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2020/140877 WO2022141061A1 (zh) | 2020-12-29 | 2020-12-29 | 一种环化文库的快速构建方法及成环接头 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022141061A1 true WO2022141061A1 (zh) | 2022-07-07 |
Family
ID=82258757
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/140877 WO2022141061A1 (zh) | 2020-12-29 | 2020-12-29 | 一种环化文库的快速构建方法及成环接头 |
Country Status (4)
Country | Link |
---|---|
US (1) | US20240076657A1 (zh) |
EP (1) | EP4273307A1 (zh) |
CN (1) | CN116670340A (zh) |
WO (1) | WO2022141061A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2024098178A1 (zh) * | 2022-11-07 | 2024-05-16 | 深圳华大智造科技股份有限公司 | 制备dna纳米球的反应体系及其应用 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102864498A (zh) * | 2012-09-24 | 2013-01-09 | 天津工业生物技术研究所 | 一种长片段末端文库的构建方法 |
WO2016078095A1 (zh) * | 2014-11-21 | 2016-05-26 | 深圳华大基因科技有限公司 | 鼓泡状接头元件和使用其构建测序文库的方法 |
WO2018081666A1 (en) * | 2016-10-28 | 2018-05-03 | Silgentech Inc. | Methods of single dna/rna molecule counting |
CN112226821A (zh) * | 2020-10-16 | 2021-01-15 | 鲲羽生物科技(江门)有限公司 | 一种基于双链环化的mgi测序平台测序文库的构建方法 |
-
2020
- 2020-12-29 US US18/259,978 patent/US20240076657A1/en active Pending
- 2020-12-29 EP EP20967409.2A patent/EP4273307A1/en active Pending
- 2020-12-29 WO PCT/CN2020/140877 patent/WO2022141061A1/zh active Application Filing
- 2020-12-29 CN CN202080107392.5A patent/CN116670340A/zh active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102864498A (zh) * | 2012-09-24 | 2013-01-09 | 天津工业生物技术研究所 | 一种长片段末端文库的构建方法 |
WO2016078095A1 (zh) * | 2014-11-21 | 2016-05-26 | 深圳华大基因科技有限公司 | 鼓泡状接头元件和使用其构建测序文库的方法 |
WO2018081666A1 (en) * | 2016-10-28 | 2018-05-03 | Silgentech Inc. | Methods of single dna/rna molecule counting |
CN112226821A (zh) * | 2020-10-16 | 2021-01-15 | 鲲羽生物科技(江门)有限公司 | 一种基于双链环化的mgi测序平台测序文库的构建方法 |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2024098178A1 (zh) * | 2022-11-07 | 2024-05-16 | 深圳华大智造科技股份有限公司 | 制备dna纳米球的反应体系及其应用 |
Also Published As
Publication number | Publication date |
---|---|
US20240076657A1 (en) | 2024-03-07 |
EP4273307A1 (en) | 2023-11-08 |
CN116670340A (zh) | 2023-08-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20220064721A1 (en) | Method of preparing libraries of template polynucleotides | |
US10400279B2 (en) | Method for constructing a sequencing library based on a single-stranded DNA molecule and application thereof | |
CN106795514B (zh) | 泡状接头及其在核酸文库构建及测序中的应用 | |
EP3464634B1 (en) | Molecular tagging methods and sequencing libraries | |
US9822395B2 (en) | Methods for producing a paired tag from a nucleic acid sequence and methods of use thereof | |
US5262311A (en) | Methods to clone polyA mRNA | |
WO2016037361A1 (zh) | 试剂盒及其在核酸测序中的用途 | |
EP3607065B1 (en) | Method and kit for constructing nucleic acid library | |
JP2001513639A (ja) | 複数のdna断片をアセンブリーする方法 | |
WO2022141061A1 (zh) | 一种环化文库的快速构建方法及成环接头 | |
CN112795620A (zh) | 双链核酸环化方法、甲基化测序文库构建方法和试剂盒 | |
AU727748B2 (en) | Detection and confirmation of nucleic acid sequences by use of oligonucleotides comprising a subsequence hybridizing exactly to a known terminal sequence and a subsequence hybridizing to an undentified sequence | |
CN113638055B (zh) | 一种制备双链rna测序文库的方法 | |
WO2016058127A1 (zh) | 一种转座酶打断核酸并加接头的方法和试剂 | |
CN114808148A (zh) | 一种dna文库构建试剂盒、文库构建方法和应用 | |
JP2003518953A (ja) | 核酸分析の方法 | |
CN115928222A (zh) | 提升文库转化率的建库方法 | |
CN112795990B (zh) | 一种灵活多变的降低污染及pcr偏倚的多标签二代测序文库接头 | |
WO2018081666A1 (en) | Methods of single dna/rna molecule counting | |
WO2014086037A1 (zh) | 构建核酸测序文库的方法及其应用 | |
CN108265047B (zh) | 用于dna片段的非特异性复制的方法及试剂盒 | |
WO2008075519A1 (ja) | 核酸の増幅方法とこれを用いた核酸の解析方法 | |
US20230026055A1 (en) | Dna construct for sequencing and method for preparing the same | |
US20210395813A1 (en) | Multimer for sequencing and methods for preparing and analyzing the same | |
US6673577B1 (en) | Detection and confirmation of nucleic acid sequences by use of poisoning oligonucleotides |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20967409 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 202080107392.5 Country of ref document: CN |
|
WWE | Wipo information: entry into national phase |
Ref document number: 18259978 Country of ref document: US |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: 2020967409 Country of ref document: EP Effective date: 20230731 |