CN112877403A - Method for constructing sequencing library of target sequence - Google Patents

Method for constructing sequencing library of target sequence Download PDF

Info

Publication number
CN112877403A
CN112877403A CN201911295838.7A CN201911295838A CN112877403A CN 112877403 A CN112877403 A CN 112877403A CN 201911295838 A CN201911295838 A CN 201911295838A CN 112877403 A CN112877403 A CN 112877403A
Authority
CN
China
Prior art keywords
nucleic acid
sequence
acid fragment
amplification
linker
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201911295838.7A
Other languages
Chinese (zh)
Other versions
CN112877403B (en
Inventor
王寅
李林蔚
柳焱
孙福明
王柯
张媛媛
茹兰兰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujian Herui Gene Technology Co ltd
Original Assignee
Fujian Herui Jingchuang Gene Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujian Herui Jingchuang Gene Technology Co ltd filed Critical Fujian Herui Jingchuang Gene Technology Co ltd
Priority to CN201911295838.7A priority Critical patent/CN112877403B/en
Publication of CN112877403A publication Critical patent/CN112877403A/en
Application granted granted Critical
Publication of CN112877403B publication Critical patent/CN112877403B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1093General methods of preparing gene libraries, not provided for in other subgroups
    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B50/00Methods of creating libraries, e.g. combinatorial synthesis
    • C40B50/06Biochemical methods, e.g. using enzymes or whole viable microorganisms

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Biochemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Microbiology (AREA)
  • General Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Analytical Chemistry (AREA)
  • Biomedical Technology (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Plant Pathology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Immunology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The present invention relates to high throughput nucleic acid sequencing, and more particularly, to methods of constructing sequencing libraries of target sequences and corresponding kits.

Description

Method for constructing sequencing library of target sequence
The present application is a divisional application of an invention patent application having an application date of 2019, 29/11, application No. 201911207127X and having the title of "method for constructing sequencing library of target sequence".
Technical Field
The present invention relates to high throughput nucleic acid sequencing, and more particularly, to methods of constructing sequencing libraries of target sequences and corresponding kits.
Background
The sequencing library aiming at the target sequence is constructed by efficiently enriching the target sequence, so that the sequencing cost can be effectively reduced, and the sequencing depth can be improved. For applications that typically require high depth sequencing, such as somatic mutation detection, target sequence enrichment performance (e.g., capture efficiency) is a major factor in determining method sensitivity and specificity.
The current common target sequence sequencing method is mainly a liquid phase hybridization capture method based on a nucleic acid probe. The prior art method may comprise the following main steps: 1) performing end repair on the fragmented DNA, connecting the fragmented DNA with a linker (comprising a sequencer bridging anchor sequence (such as a P5/P7 sequence suitable for Illumina platform), an index (index) sequence, a Unique Molecular Identifier (UMI) sequence (or Molecular Barcode/two-dimensional code, MBC), etc.) to introduce the UMI, index sequence and primer amplification region, performing a first round of amplification with primers corresponding to the primer amplification region, thereby constructing a complete whole genome library; 2) a complete whole genome library is hybridized and captured using a nucleic acid probe, followed by a second round of amplification with the same primers to enrich for the target sequence to be tested, thereby generating a sequencing library for high throughput sequencing of the target sequence.
Alternatively, the prior art method may comprise the following main steps: 1) performing end repair on the fragmented DNA, connecting the fragmented DNA with a truncated adaptor with UMI to introduce UMI, 2) performing a first round of amplification with primers with a sequencer bridging anchor sequence and an index sequence to introduce the sequencer bridging anchor sequence and the index sequence, thereby constructing a complete whole genome library; and 3) hybridizing and capturing the complete whole genome library with a nucleic acid probe, followed by a second round of amplification to enrich for the target sequence to be tested, thereby generating a sequencing library for high throughput sequencing of the target sequence. This process is shown, for example, in fig. 1.
However, such a method has the following drawbacks: the total length of the linkers at both ends of the members of the complete whole genome sequencing library (e.g., the length from P5/P7 to UMI shown in fig. 1) is typically about 60-80 nucleotides, and library members carrying linkers of such a length bind to each other (or "overlap") during capture to form a complex, thereby significantly reducing the proportion of target sequences, resulting in reduced capture efficiency.
To solve the problem of reduced capture efficiency due to linker overlap of library members in conventional target sequence sequencing methods, a linker blocking agent is typically added at the probe hybridization stage during capture, which inhibits linker overlap by using specially designed and modified DNA that can efficiently bind to the linker sequence (e.g., as shown in fig. 2). But this results in a considerable cost of detection since the linker blocking agent requires special design and modification and requires higher concentrations to achieve the blocking effect.
Therefore, there is a need in the art for new library construction methods with high capture efficiency and low detection cost for sequencing target sequences.
Disclosure of Invention
The present invention meets the above-described need by providing a novel method of constructing a sequencing library of target sequences. More specifically, the inventors found that by constructing an intermediate library of a specific architecture from nucleic acid fragments, then capturing intermediate library members carrying the target sequence from the intermediate library, and then complementing and amplifying the intermediate library members with primers to construct a sequencing library useful for high throughput sequencing of the target sequence, the method of the invention successfully constructs a sequencing library with maintained or improved capture efficiency compared to prior art methods without the use of linker blocking agents required by prior art methods, thereby eliminating the need for linker blocking agents by prior art methods, thereby reducing detection costs.
In one aspect, the invention provides a method of constructing a sequencing library of target sequences, comprising:
(a) adding linkers to the 5 'end and the 3' end of the nucleic acid fragment to obtain a linker-added nucleic acid fragment; optionally, the nucleic acid fragments are obtained by fragmenting nucleic acids;
the linker comprises in order in the 5 '-3' direction a common sequence, optionally a single molecule barcode, and optionally a spacer sequence;
(b)
(i) the linker-added nucleic acid fragment extends outwardly at 5 'and 3' ends of the nucleic acid fragment by a nucleotide sequence of 16 to 40 nucleotides, respectively, compared to the nucleic acid fragment, and is directly used for capture without amplification; or
(ii) Amplifying the adaptor-added nucleic acid fragment using a first amplification forward primer and a first amplification reverse primer to obtain a first amplicon that extends outwardly from the nucleic acid fragment by nucleotide sequences that are each independently 16 to 40 nucleotides in length at the 5 'end and 3' end of the nucleic acid fragment as compared to the nucleic acid fragment, a portion or all of the sequence of the first amplification forward primer being sufficiently complementary to a portion or all of the sequence of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment, and a portion or all of the sequence of the first amplification reverse primer being sufficiently complementary to a portion or all of the sequence of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment;
(c) capturing target nucleic acid fragments from the linker-added nucleic acid fragments or the first amplicons without using a linker blocking agent;
(d) amplifying the captured target nucleic acid fragments using a second amplification forward primer and a second amplification downstream primer to obtain a second amplicon as a member of a sequencing library, thereby constructing the sequencing library,
the second amplification upstream primer comprises an upstream sequencer bridging anchor sequence, an optional index sequence and an upstream sequencing sequence in the 5 '-3' direction in sequence,
the second amplification downstream primer comprises a downstream sequencer bridging anchor sequence, an optional index sequence and a downstream sequencing sequence in the 5 '-3' direction in sequence,
the sequence of the common sequence or the complement thereof and the sequence of the first amplification upstream primer are each part or all of the sequence of the upstream sequencing sequence,
the sequence of the common sequence or the complement thereof and the sequence of the first amplification downstream primer are each part or all of the sequence of the downstream sequencing sequence.
In another aspect, the invention provides a kit for constructing a sequencing library of target sequences, comprising:
(a) a linker capable of being added to the 5 'end and the 3' end of the nucleic acid fragment to obtain a linker-added nucleic acid fragment; optionally, the nucleic acid fragments are obtained by fragmenting nucleic acids;
the linker comprises in order in the 5 '-3' direction a common sequence, optionally a single molecule barcode, and optionally a spacer sequence;
(b)
(i) the linker-added nucleic acid fragment extends outwardly at 5 'and 3' ends of the nucleic acid fragment by a nucleotide sequence of 16 to 40 nucleotides, respectively, compared to the nucleic acid fragment, and is directly used for capture without amplification; or
(ii) A first amplification forward primer and a first amplification reverse primer, the first amplification forward primer and first amplification reverse primer capable of being used to amplify the adaptor-added nucleic acid fragment to obtain a first amplicon that extends outwardly from the nucleic acid fragment by a nucleotide sequence of 16-40 nucleotides each, independently, at the 5 'end and 3' end of the nucleic acid fragment as compared to the nucleic acid fragment, a portion or all of the sequence of the first amplification forward primer being sufficiently complementary to a portion or all of the sequence of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment, and a portion or all of the sequence of the first amplification reverse primer being sufficiently complementary to effect amplification of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment;
(c) a capture reagent capable of being used to capture a target nucleic acid fragment from the adaptor-added nucleic acid fragment or the first amplicon without using an adaptor blocker;
(d) a second amplification forward primer and a second amplification reverse primer capable of being used to amplify the captured target nucleic acid fragments to obtain a second amplicon as a member of a sequencing library, thereby constructing the sequencing library,
the second amplification upstream primer comprises an upstream sequencer bridging anchor sequence, an optional index sequence and an upstream sequencing sequence in the 5 '-3' direction in sequence,
the second amplification downstream primer comprises a downstream sequencer bridging anchor sequence, an optional index sequence and a downstream sequencing sequence in the 5 '-3' direction in sequence,
the sequence of the common sequence or the complement thereof and the sequence of the first amplification upstream primer are each part or all of the sequence of the upstream sequencing sequence,
the sequence of the common sequence or its complement and the sequence of the first amplification downstream primer are each part or all of the sequence of the downstream sequencing sequence;
wherein the kit does not comprise a linker blocking agent.
In a further aspect, the invention provides the use of an agent for the preparation of a kit for constructing a sequencing library of target sequences, the agent comprising:
(a) a linker capable of being added to the 5 'end and the 3' end of the nucleic acid fragment to obtain a linker-added nucleic acid fragment; optionally, the nucleic acid fragments are obtained by fragmenting nucleic acids;
the linker comprises in order in the 5 '-3' direction a common sequence, optionally a single molecule barcode, and optionally a spacer sequence;
(b)
(i) the linker-added nucleic acid fragment extends outwardly at 5 'and 3' ends of the nucleic acid fragment by a nucleotide sequence of 16 to 40 nucleotides, respectively, compared to the nucleic acid fragment, and is directly used for capture without amplification; or
(ii) A first amplification forward primer and a first amplification reverse primer, the first amplification forward primer and first amplification reverse primer capable of being used to amplify the adaptor-added nucleic acid fragment to obtain a first amplicon that extends outwardly from the nucleic acid fragment by a nucleotide sequence of 16-40 nucleotides each, independently, at the 5 'end and 3' end of the nucleic acid fragment as compared to the nucleic acid fragment, a portion or all of the sequence of the first amplification forward primer being sufficiently complementary to a portion or all of the sequence of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment, and a portion or all of the sequence of the first amplification reverse primer being sufficiently complementary to effect amplification of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment;
(c) a capture reagent capable of being used to capture a target nucleic acid fragment from the adaptor-added nucleic acid fragment or the first amplicon without using an adaptor blocker;
(d) a second amplification forward primer and a second amplification reverse primer capable of being used to amplify the captured target nucleic acid fragments to obtain a second amplicon as a member of a sequencing library, thereby constructing the sequencing library,
the second amplification upstream primer comprises an upstream sequencer bridging anchor sequence, an optional index sequence and an upstream sequencing sequence in the 5 '-3' direction in sequence,
the second amplification downstream primer comprises a downstream sequencer bridging anchor sequence, an optional index sequence and a downstream sequencing sequence in the 5 '-3' direction in sequence,
the sequence of the common sequence or the complement thereof and the sequence of the first amplification upstream primer are each part or all of the sequence of the upstream sequencing sequence,
the sequence of the common sequence or its complement and the sequence of the first amplification downstream primer are each part or all of the sequence of the downstream sequencing sequence;
wherein the agent does not include a linker blocking agent.
Drawings
FIG. 1 is a schematic representation of a prior art target sequence sequencing process.
FIG. 2 is a schematic representation of the prior art addition of linker blocking agents to inhibit linker overlap during the hybridization stage.
FIG. 3 is a comparison between the construction of libraries of linkers of the invention without capping agents, the construction of libraries of IDT commercial long linker controls with capping agents, and the construction of libraries of IDT commercial long linker controls without capping agents.
FIG. 4 comparison between library construction without blocking agent for linkers of various lengths of the present invention, library construction without blocking agent for long linker control, and library construction with blocking agent for IDT commercial long linker Control (CK).
FIG. 5 is a comparison between the construction of libraries of the invention without linker blocking and the construction of libraries of IDT commercial linkers plus blocking agents, verified in clinical applications.
FIG. 6 is a comparison between library constructions of linkers of the same length and different nucleotide composition without addition of blocking agents according to the invention.
Detailed Description
These and other aspects, features and advantages will become apparent to those of ordinary skill in the art from a reading of the following detailed description and the appended claims. For the avoidance of doubt, any feature of one aspect of the invention may be used in any other aspect of the invention. The word "comprising" is intended to mean "including", but not necessarily "consisting of. In other words, the listed steps or options need not be exhaustive. It should be noted that the examples given in the following description are intended to illustrate the present invention, and are not intended to limit the present invention to these examples per se. Similarly, all percentages are weight/weight percentages unless otherwise indicated. Except in the working and comparative examples, or where otherwise explicitly indicated, all numbers in this description indicating amounts of material or conditions of reaction, physical properties of materials and/or use are to be understood as modified by the word "about". Numerical ranges expressed as "x to y" are understood to include x and y. When multiple preferred ranges are described in the form of "x to y" for a particular feature, it is to be understood that all ranges combining the different endpoints are also contemplated. In other words, any particular upper limit value may be associated with any particular lower limit value when specifying any range of values. Finally, the reference to an element by the indefinite article "a" does not exclude the possibility that more than one of the element is present, unless the context clearly requires that only one of the element is present. Thus, the indefinite article "a" or "an" usually means "at least one".
Where features relating to a particular aspect of the invention (e.g. a method of the invention) are disclosed, such disclosure is also deemed applicable to any other aspect of the invention (e.g. a kit and use of the invention) and mutatis mutandis.
In one aspect, the invention provides a method of constructing a sequencing library of target sequences, comprising:
(a) adding linkers to the 5 'end and the 3' end of the nucleic acid fragment to obtain a linker-added nucleic acid fragment; optionally, the nucleic acid fragments are obtained by fragmenting nucleic acids;
the linker comprises in order in the 5 '-3' direction a common sequence, optionally a single molecule barcode, and optionally a spacer sequence;
(b)
(i) the linker-added nucleic acid fragment extends outwardly at 5 'and 3' ends of the nucleic acid fragment by a nucleotide sequence of 16 to 40 nucleotides, respectively, compared to the nucleic acid fragment, and is directly used for capture without amplification; or
(ii) Amplifying the adaptor-added nucleic acid fragment using a first amplification forward primer and a first amplification reverse primer to obtain a first amplicon that extends outwardly from the nucleic acid fragment by nucleotide sequences that are each independently 16 to 40 nucleotides in length at the 5 'end and 3' end of the nucleic acid fragment as compared to the nucleic acid fragment, a portion or all of the sequence of the first amplification forward primer being sufficiently complementary to a portion or all of the sequence of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment, and a portion or all of the sequence of the first amplification reverse primer being sufficiently complementary to a portion or all of the sequence of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment;
(c) capturing target nucleic acid fragments from the linker-added nucleic acid fragments or the first amplicons without using a linker blocking agent;
(d) amplifying the captured target nucleic acid fragments using a second amplification forward primer and a second amplification downstream primer to obtain a second amplicon as a member of a sequencing library, thereby constructing the sequencing library,
the second amplification upstream primer comprises an upstream sequencer bridging anchor sequence, an optional index sequence and an upstream sequencing sequence in the 5 '-3' direction in sequence,
the second amplification downstream primer comprises a downstream sequencer bridging anchor sequence, an optional index sequence and a downstream sequencing sequence in the 5 '-3' direction in sequence,
the sequence of the common sequence or the complement thereof and the sequence of the first amplification upstream primer are each part or all of the sequence of the upstream sequencing sequence,
the sequence of the common sequence or the complement thereof and the sequence of the first amplification downstream primer are each part or all of the sequence of the downstream sequencing sequence.
More specifically, the invention provides a method of constructing a sequencing library of target sequences. The method comprises the following steps.
First, linkers are added to the 5 'end and 3' end of the nucleic acid fragment to obtain a linker-added nucleic acid fragment.
In some embodiments, the nucleic acid fragments may be obtained by fragmenting nucleic acids. Enzymes useful for fragmentation of nucleic acids are known in the art. The nucleic acid may be free nucleic acid, e.g., from a bodily fluid, such as blood, lymph, joint synovial fluid, cerebrospinal fluid, and the like. The nucleic acid may also be genomic nucleic acid extracted from cells from a tissue, for example by lysing the cells, for example healthy tissue or diseased tissue, such as a tumor. For example, the nucleic acid can be genomic DNA, mitochondrial DNA, long fragment PCR products, long fragment chromatin co-immunoprecipitation DNA, RNA reverse transcription product cDNA, or circulating tumor DNA (ctdna). Enzymes useful for lysing cells are known in the art and may be protease K or other proteases or mixtures thereof. The length of the nucleic acid fragment may be within a range having endpoints selected from the group consisting of: 50. 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, and 600 nucleotides. For example, in some embodiments, the length of the nucleic acid fragment can be in the range of 100-600 nucleotides.
As used herein, "adapter" refers to a single-stranded or double-stranded or partially double-stranded (e.g., Y-type) sequence for addition to a nucleic acid fragment to be sequenced, e.g., by ligation, such as T-a ligation, or by insertion, such as by transposase transposition insertion, for subsequent library construction. Typically, the preparation of a library requires the addition of linkers to the ends of the nucleic acid fragments to enable sequencing of the nucleic acid fragments by a sequencer. When adding linkers to a nucleic acid fragment, the linkers are typically added randomly to the 5 'end and/or 3' end of the nucleic acid fragment.
Adapters may be single stranded nucleic acid molecules with functional ends, which may differ depending on the characteristics of the nucleic acid fragment, including 1) ligating adapters to nucleic acid fragments with pre-mutexisting overhangs, e.g., ligating adapters with a T-overhang at the 3 'end to nucleic acid fragments with an A-overhang at the 3' end by T-A overhang complementation; and 2) ligating the adaptor to the blunt-ended double-stranded nucleic acid fragment or single-stranded nucleic acid fragment by a special functional modification. The single-stranded linker ligated to the nucleic acid fragment can complement its complementary strand by complementation, thereby obtaining a corresponding double-stranded linker.
The linker may also be a double-stranded nucleic acid molecule, for example, a double-stranded nucleic acid molecule having functional ends formed by annealing or the like of two single-stranded nucleic acid molecules, which may be completely complementary or partially complementary to each other. A double-stranded linker may have a single-stranded portion, e.g., a Y-linker formed by annealing two partially complementary single-stranded nucleic acid molecules, having one double-stranded portion and two single-stranded portions.
An adaptor may also be a nucleic acid molecule that is single-stranded in an initial state, wherein a special base, such as uracil, is introduced into the single-strand to form a closed structure, such as a hairpin, and that is double-stranded in a final state by enzymatic degradation of the special base, such as glycosylase, to open the closed structure after both ends of the single-strand have been added to one end of the double-stranded nucleic acid fragment.
In some embodiments, the joint may be a single link joint or a double link joint. In some embodiments, the double-stranded linker may be a double-stranded Y-linker.
In some embodiments, the linker may be 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50 nucleotides in length. As used herein, the "length" of an adaptor refers to the length of a single-stranded adaptor when the adaptor is single-stranded, and to the length of the longer of the double strands when the adaptor is (ultimately) double-stranded. For example, in some embodiments, a linker may be a double-stranded linker, wherein one strand may be 16, 20, 24, 31, 34, 37, or 40 nucleotides in length and the other strand may be 15, 19, 23, 30, 33, 36, or 39 nucleotides in length, then the linker is considered 16, 20, 24, 31, 34, 37, or 40 nucleotides in length.
In some embodiments, the length of the linker may be within a range having endpoints selected from the group consisting of: 10. 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50 nucleotides. For example, in some embodiments, the length of the linker can be in the range of 16-40 nucleotides, 16-37 nucleotides, 16-34 nucleotides, 20-37 nucleotides, 20-34 nucleotides, 20-31 nucleotides, 20-24 nucleotides, or 24-31 nucleotides. For example, in some embodiments, the linker may be 16, 20, 24, 31, 34, 37, or 40 nucleotides in length. In some embodiments, the linker may be a double-stranded linker, and the length of one strand of the linker may differ from the length of the other strand of the linker by 1, 2, 3, 4, or 5 nucleotides, preferably 1 nucleotide. In some embodiments, the linker may be a double-stranded linker, and the length of one strand of the linker and the length of the other strand of the linker may differ by the spacer sequence of one strand of the linker, e.g., the length of T, e.g., 1 nucleotide. For example, in some embodiments, the linker may be a double-stranded linker, wherein one strand may be 16, 20, 24, 31, 34, 37, or 40 nucleotides in length and the other strand may be 15, 19, 23, 30, 33, 36, or 39 nucleotides in length.
In some embodiments, the linker may comprise in order in the 5 '-3' direction a common sequence, an optional single molecule barcode, and an optional spacer sequence; or consist of them. In some preferred embodiments, the linker may be free of an index sequence and a sequencer bridging anchor sequence.
As used herein, "common sequence" refers to a sequence that is amplified (e.g., PCR) by specific binding of a primer used in a subsequent step to an adaptor.
As used herein, "Single Molecular Barcode (SMB)" refers to a unique nucleotide sequence that can be placed on a linker to add to a nucleic acid fragment to be sequenced, thereby uniquely labeling the nucleic acid fragment. A nucleic acid fragment may carry one or two or more single molecule barcodes, which may be the same or different, after the addition of a linker.
In some embodiments, there may be multiple nucleic acid fragments from a single sample or multiple nucleic acid fragments from multiple samples, resulting in multiple adaptor-added nucleic acid fragments. Such multiple adaptor-added nucleic acid fragments can be mixed together for subsequent procedures. The multiple nucleic acid fragments can be distinguished from each other by the addition of a single-molecule barcode.
After obtaining sequencing data, the corresponding nucleic acid fragments can be uniquely identified by identifying the single molecule barcode, or the sample source can be uniquely identified. For example, in processing nucleic acid samples from two patients, the nucleic acid samples are labeled with two sets of single-molecule barcodes, respectively, then mixed to construct a sequencing library, subjected to high-throughput sequencing once, and after obtaining sequencing data, the two nucleic acid samples are distinguished by identifying the two sets of single-molecule barcodes. In some embodiments, the single-molecule barcode sequences can be divided into single-molecule barcode groups according to the base uniformity principle, and any single-molecule barcode in each group is different from any single-molecule barcode in any other group. Single molecule barcodes are sometimes also referred to in the art as Unique Molecular Identifiers (UMIs) or Molecular barcodes/two-dimensional codes (MBCs).
As used herein, a "spacer (spacer) sequence" refers to a sequence between two functional regions of nucleotides. In some embodiments, a spacer sequence may be present. In some embodiments, the spacer sequence may not be present. The spacer sequence may be 1, 2, 3, 4, 5, 6, 7, or 8 nucleotides in length. For example, the spacer sequence may be T. In some embodiments, the spacer sequence is for a linkage between the linker and the nucleic acid fragment, e.g., a T-a linkage.
As used herein, an "index (index) sequence" is a functional sequence known in the art for use in secondary sequencing, and is often used to identify the source of a sample by virtue of its sequence. For example, in processing nucleic acid samples from two patients, the nucleic acid samples are labeled with two index sequences, and then mixed to construct a sequencing library, subjected to high throughput sequencing once, and after obtaining sequencing data, the two nucleic acid samples are distinguished by identifying the two index sequences.
As used herein, a "sequencer bridge anchor sequence" is a functional sequence known in the art for secondary sequencing that is anchored to the surface of a flow cell of a sequencer for bridge amplification. For example, for the commonly used Illumina platform, the sequencer bridge anchor sequence is commonly referred to as the P5/P7 sequence, the specific sequences of which are well known in the art. The method of the invention can be applied to various second-generation high-throughput sequencing platforms, such as: HiSeq/MiSeq/MiniSeq/MySeq/NovaSeq sequencing platform from Illumina, PGM/Proton sequencing platform from Thermo Fisher, and the like.
In some embodiments, the single molecule barcode may be of sufficient length to enable unique identification of nucleic acid fragments. In some embodiments, the length of the single molecule barcode may be within a range having endpoints selected from: 2.3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13 and 14 nucleotides. For example, in some embodiments, the length of a single molecule barcode may be in the range of 2-8 nucleotides, 2-7 nucleotides, 6-8 nucleotides, or 6-7 nucleotides, such as 2, 6, 7, or 8 nucleotides.
In some embodiments, the length of the common sequence may be sufficient to enable subsequent amplification. In some embodiments, the length of the common sequence may be within a range having endpoints selected from: 10. 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 nucleotides. For example, in some embodiments, the length of the common sequence can be in the range of 13-31 nucleotides, 13-28 nucleotides, 13-25 nucleotides, 13-23 nucleotides, 13-17 nucleotides, or 17-23 nucleotides, such as 13, 17, 23, 25, 28, or 31 nucleotides.
The length of each of the single molecule barcode (if any) and the common sequence in a linker of a given length can be adjusted for various purposes, provided that each of the single molecule barcode (if any) and the common sequence is long enough to perform its function. For example, in a given linker of 20 nucleotides in length, it may be that the linker has a single molecule barcode of 6 nucleotides in length, a common sequence of 13 nucleotides in length, and a spacer sequence (e.g., T) of 1 nucleotide in length; it is also possible that, for measuring smaller amounts of nucleic acid fragments to be detected, the linker has a single-molecule barcode of 4 nucleotides in length, a common sequence of 15 nucleotides in length, and a spacer sequence (e.g., T) of 1 nucleotide in length.
Next, after preparing the adaptor-added nucleic acid fragments, the target nucleic acid fragments are ready to be captured. The preparation work is different, since the samples from which the nucleic acid fragments to be sequenced are obtained are different.
In some cases, the concentration of nucleic acid fragments in the sample is high, and thus the adaptor-added nucleic acid fragments can be used directly for capture without amplification. In this case, the linker-added nucleic acid fragment extends outwardly by nucleotide sequences each independently of 16 to 40 nucleotides in length at the 5 'end and 3' end of the nucleic acid fragment, respectively, as compared to the nucleic acid fragment, which is used directly for capture without amplification. If there are multiple nucleic acid fragments, then there are multiple adaptor-added nucleic acid fragments. One or more adaptor-added nucleic acid fragments serve as an intermediate or pre-library. One or more pre-libraries separately prepared from one or more samples may be mixed together for capture or each may be captured separately.
In other cases, the concentration of nucleic acid fragments in the sample is low, and therefore amplification of the adaptor-added nucleic acid fragments is required for efficient subsequent capture. In such a case, the nucleic acid fragment to which the linker is added is amplified using a first amplification upstream primer and a first amplification downstream primer to obtain a first amplicon that extends outwardly by a nucleotide sequence of 16 to 40 nucleotides each independently at the 5 'end and 3' end of the nucleic acid fragment as compared to the nucleic acid fragment, respectively. If there are multiple nucleic acid fragments, then there are multiple first amplicons, respectively. The one or more first amplicons act as an intermediate or pre-library. One or more pre-libraries separately prepared from one or more samples may be mixed together for capture or each may be captured separately.
In some embodiments, the length of the nucleic acid fragment to which the linker is added or the first amplicon extending outward at the 5 'end and 3' end of the nucleic acid fragment, respectively, as compared to the nucleic acid fragment may each independently be 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50 nucleotides. In some embodiments, the length of the nucleic acid fragment to which the linker is added or the nucleotide fragment of the first amplicon that extends outward at the 5 'end and 3' end of the nucleic acid fragment, respectively, as compared to the nucleic acid fragment may each independently be within a range having endpoints selected from: 10. 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50 nucleotides. For example, in some embodiments, the length of the nucleic acid fragment to which the linker is added or the first amplicon that extends outward at the 5 'end and 3' end of the nucleic acid fragment, respectively, as compared to the nucleic acid fragment may each independently be in the range of 16-40 nucleotides, 16-37 nucleotides, 16-34 nucleotides, 20-37 nucleotides, 20-34 nucleotides, 20-31 nucleotides, 20-24 nucleotides, or 24-31 nucleotides. For example, in some embodiments, the nucleic acid fragment to which the linker is added or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment compared to the nucleic acid fragment by a length of 16, 20, 24, 31, 34, 37, or 40 nucleotides, respectively.
Part or all of the common sequence derived from the adaptor or the complementary sequence thereof contained in the adaptor-added nucleic acid fragment may constitute a primer binding portion for amplification of the adaptor-added nucleic acid fragment of the first amplification upstream/downstream primer pair.
In some embodiments, part or all of the sequence of the first amplification upstream primer may be sufficiently complementary to part or all of the sequence of the common sequence or its complement to enable amplification of the adaptor-added nucleic acid fragment. In some embodiments, part or all of the sequence of the first amplification downstream primer may be sufficiently complementary to part or all of the sequence of the common sequence or its complement to enable amplification of the adaptor-added nucleic acid fragment.
In some embodiments, the first amplification forward primer and the first amplification downstream primer can be of sufficient length to bind to the common sequence or its complement and amplify the adaptor-added nucleic acid fragment. In some embodiments, the length of the first amplification forward primer and the first amplification downstream primer may each independently be within a range having endpoints selected from: 10. 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 nucleotides. In some embodiments, the length of the first amplification forward primer and the first amplification downstream primer can each independently be 14-28 nucleotides, 14-26 nucleotides, 16-24 nucleotides, or 16-22 nucleotides, such as 16 nucleotides or 22 nucleotides.
In some embodiments, the first amplification upstream primer may be free of an index sequence and a sequencer bridging anchor sequence. In some embodiments, the first amplification downstream primer may be free of an index sequence and a sequencer bridging anchor sequence.
In some embodiments, the 5 'end and/or the 3' end of the first amplification upstream primer may be flush or not flush with the 5 'end and/or the 3' end of the adaptor. In some embodiments, the 5 'end and/or the 3' end of the first amplification downstream primer may be flush or not flush with the 5 'end and/or the 3' end of the adaptor. By "flush" is meant that the two sequences are aligned at the ends, each without protruding nucleotides. By "not flush" is meant that the two sequences are not aligned at the ends, either of which has protruding nucleotides.
In some embodiments, the 5 'end and/or the 3' end of the first amplification forward primer may differ from the 5 'end and/or the 3' end of the linker by 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 nucleotides. In some embodiments, the 5 'end and/or the 3' end of the first amplification downstream primer may differ from the 5 'end and/or the 3' end of the linker by 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 nucleotides. This means that the two sequences are not aligned at the ends, one of which has 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 nucleotides protruding.
The amplification may be a PCR amplification. PCR amplification is well known in the art. PCR amplification may be performed using one or more thermostable polymerases. The thermostable polymerase may be selected from: LA-Taq, rTaq, Phusion, Deep Vent (exo-), Gold 360, Platinum Taq, KAPA 2G Robust and Q5 polymerase.
Third, the target nucleic acid fragment is captured from the linker-added nucleic acid fragment or the first amplicon without using a linker blocking agent.
As previously mentioned, prior art methods typically first construct a complete whole genome library, i.e., the resulting library members already contain the functional region sequences required for high throughput sequencing by a sequencer (e.g., sequencer bridge anchor sequences, index sequences, unique molecular identifiers, etc.), and then use nucleic acid probes to hybridize and capture the complete whole genome library to construct a capture library for high throughput sequencing by the sequencer. Since the length of the linker fragment connected to the nucleic acid fragment to be sequenced is usually longer because the linker fragment needs to carry a functional region sequence (such as a sequencer bridging anchor sequence, an index sequence, a unique molecular identifier, etc.) required for high-throughput sequencing by a sequencer, the overlap phenomenon of library members occurs, the capture efficiency is reduced, and therefore, expensive linker blocking agents need to be adopted to reduce the overlap of library members and improve the capture efficiency.
In contrast, the method of the invention is to construct an intermediate library from the nucleic acid fragments and then capture the intermediate library members carrying the target sequence (i.e., the target nucleic acid fragments) from the intermediate library. The members of the intermediate library may be free of functional region sequences required for sequencing, e.g., may be free of sequencer bridging anchor sequences and/or index sequences, and thus may be of shorter length. Thus, little or no overlap occurs when capturing the intermediate library members, thereby eliminating the need for joint sealants of the prior art.
Techniques for capturing target nucleic acid sequences are well known in the art and can be performed, for example, using nucleic acid probes.
Finally, the intermediate library members are complemented and amplified using primers to construct a sequencing library for high throughput sequencing on a sequencer. Amplifying the target nucleic acid fragment using the second amplification upstream primer and the second amplification downstream primer to obtain a second amplicon as a member of the sequencing library, thereby constructing the sequencing library. The second amplicon has a functional region sequence required for high throughput sequencing by a sequencer.
In some embodiments, the second amplification upstream primer can comprise, in order in the 5 '-3' direction, an upstream sequencer bridging anchor sequence, an optional index sequence, and an upstream sequencing sequence; or consist of them.
In some embodiments, the second amplification downstream primer can comprise, in order in the 5 '-3' direction, a downstream sequencer bridging anchor sequence, an optional index sequence, and a downstream sequencing sequence; or consist of them.
The method of the invention can be applied to various second-generation high-throughput sequencing platforms, such as: HiSeq/MiSeq/MiniSeq/MySeq/NovaSeq sequencing platform from Illumina, PGM/Proton sequencing platform from Thermo Fisher, and the like. Thus, the anchor sequence can be bridged using a sequencer suitable for use with a variety of platforms. For example, P5/P7 sequences suitable for use in the Illumina platform can be employed.
In some embodiments, the index sequence may or may not be present. This may be determined based on the type and/or number of samples.
In some embodiments, the index sequences may be the same or different. This may be determined based on the type and/or number of samples.
In some embodiments, amplification may be performed using a second amplification forward primer and/or a second amplification reverse primer that contain different index sequences for different intermediate libraries.
In some embodiments, the sequence of the common sequence or its complement and the sequence of the first amplification upstream primer may each be part or all of the sequence of the upstream sequencing sequence. In other words, the sequence of the common sequence or its complement and the sequence of the first amplification upstream primer may each be identical to part or all of the sequence of the upstream sequencing sequence.
In some embodiments, the sequence of the common sequence or its complement and the sequence of the first amplification downstream primer may each be part or all of the sequence of the downstream sequencing sequence. In other words, the sequence of the common sequence or its complement and the sequence of the first amplification downstream primer may each be identical to part or all of the sequence of the downstream sequencing sequence.
The methods of the invention successfully construct sequencing libraries with maintained or improved capture efficiency compared to prior art methods without the use of linker blocking agents required by prior art methods, thereby eliminating the need for linker blocking agents by prior art methods, thereby reducing detection costs.
As used herein, the term "capture efficiency" is intended to encompass a variety of aspects of the evaluation of capture, including, but not limited to, the following parameters:
(a) target hit rate: refers to the ratio of the number of bases targeted for acquisition in the target region to the total number of bases in the original efficient sequencing run-down data. Higher values indicate higher efficiency in capturing the target sequence, and more valid data can be analyzed.
(b) Coverage degree: the ratio of the number of bases with sequencing depth greater than 0X obtained in the target region to the number of bases in the preset target region is referred to. Higher values indicate higher efficiency in capturing coverage in the target area, and more valid data can be analyzed.
(c) Redundancy: the method refers to the ratio of the number of molecules obtained by mirror image replication to the total number of molecules, and the mirror image replication refers to multiple replication of molecules with identical starting points, end points and sequences in data obtained by sequencing. The lower the targeting rate per unit mass of DNA, the more easily the molecules with low proportion of the targeted portion will be reduced or deleted in the sequencing data, i.e., the molecular diversity of the targeted portion will be reduced, and the redundancy will be affected.
(d) Stability: the standard deviation between the repetitions was used to measure the degree of dispersion. Stability is expressed herein using a forward error bar, with shorter forward error bars indicating better stability.
In another aspect, the invention provides a kit for constructing a sequencing library of target sequences, comprising:
(a) a linker capable of being added to the 5 'end and the 3' end of the nucleic acid fragment to obtain a linker-added nucleic acid fragment; optionally, the nucleic acid fragments are obtained by fragmenting nucleic acids;
the linker comprises in order in the 5 '-3' direction a common sequence, optionally a single molecule barcode, and optionally a spacer sequence;
(b)
(i) the linker-added nucleic acid fragment extends outwardly at 5 'and 3' ends of the nucleic acid fragment by a nucleotide sequence of 16 to 40 nucleotides, respectively, compared to the nucleic acid fragment, and is directly used for capture without amplification; or
(ii) A first amplification forward primer and a first amplification reverse primer, the first amplification forward primer and first amplification reverse primer capable of being used to amplify the adaptor-added nucleic acid fragment to obtain a first amplicon that extends outwardly from the nucleic acid fragment by a nucleotide sequence of 16-40 nucleotides each, independently, at the 5 'end and 3' end of the nucleic acid fragment as compared to the nucleic acid fragment, a portion or all of the sequence of the first amplification forward primer being sufficiently complementary to a portion or all of the sequence of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment, and a portion or all of the sequence of the first amplification reverse primer being sufficiently complementary to effect amplification of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment;
(c) a capture reagent capable of being used to capture a target nucleic acid fragment from the adaptor-added nucleic acid fragment or the first amplicon without using an adaptor blocker;
(d) a second amplification forward primer and a second amplification reverse primer capable of being used to amplify the captured target nucleic acid fragments to obtain a second amplicon as a member of a sequencing library, thereby constructing the sequencing library,
the second amplification upstream primer comprises an upstream sequencer bridging anchor sequence, an optional index sequence and an upstream sequencing sequence in the 5 '-3' direction in sequence,
the second amplification downstream primer comprises a downstream sequencer bridging anchor sequence, an optional index sequence and a downstream sequencing sequence in the 5 '-3' direction in sequence,
the sequence of the common sequence or the complement thereof and the sequence of the first amplification upstream primer are each part or all of the sequence of the upstream sequencing sequence,
the sequence of the common sequence or its complement and the sequence of the first amplification downstream primer are each part or all of the sequence of the downstream sequencing sequence;
wherein the kit does not comprise a linker blocking agent.
In a further aspect, the invention provides the use of an agent for the preparation of a kit for constructing a sequencing library of target sequences, the agent comprising:
(a) a linker capable of being added to the 5 'end and the 3' end of the nucleic acid fragment to obtain a linker-added nucleic acid fragment; optionally, the nucleic acid fragments are obtained by fragmenting nucleic acids;
the linker comprises in order in the 5 '-3' direction a common sequence, optionally a single molecule barcode, and optionally a spacer sequence;
(b)
(i) the linker-added nucleic acid fragment extends outwardly at 5 'and 3' ends of the nucleic acid fragment by a nucleotide sequence of 16 to 40 nucleotides, respectively, compared to the nucleic acid fragment, and is directly used for capture without amplification; or
(ii) A first amplification forward primer and a first amplification reverse primer, the first amplification forward primer and first amplification reverse primer capable of being used to amplify the adaptor-added nucleic acid fragment to obtain a first amplicon that extends outwardly from the nucleic acid fragment by a nucleotide sequence of 16-40 nucleotides each, independently, at the 5 'end and 3' end of the nucleic acid fragment as compared to the nucleic acid fragment, a portion or all of the sequence of the first amplification forward primer being sufficiently complementary to a portion or all of the sequence of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment, and a portion or all of the sequence of the first amplification reverse primer being sufficiently complementary to effect amplification of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment;
(c) a capture reagent capable of being used to capture a target nucleic acid fragment from the adaptor-added nucleic acid fragment or the first amplicon without using an adaptor blocker;
(d) a second amplification forward primer and a second amplification reverse primer capable of being used to amplify the captured target nucleic acid fragments to obtain a second amplicon as a member of a sequencing library, thereby constructing the sequencing library,
the second amplification upstream primer comprises an upstream sequencer bridging anchor sequence, an optional index sequence and an upstream sequencing sequence in the 5 '-3' direction in sequence,
the second amplification downstream primer comprises a downstream sequencer bridging anchor sequence, an optional index sequence and a downstream sequencing sequence in the 5 '-3' direction in sequence,
the sequence of the common sequence or the complement thereof and the sequence of the first amplification upstream primer are each part or all of the sequence of the upstream sequencing sequence,
the sequence of the common sequence or its complement and the sequence of the first amplification downstream primer are each part or all of the sequence of the downstream sequencing sequence;
wherein the agent does not include a linker blocking agent.
Preferred aspects in the context of the method according to the invention also apply mutatis mutandis in the context of the kit and the use according to the invention.
The present invention provides the following items:
item 1. a method of constructing a sequencing library of target sequences, comprising:
(a) adding linkers to the 5 'end and the 3' end of the nucleic acid fragment to obtain a linker-added nucleic acid fragment; optionally, the nucleic acid fragments are obtained by fragmenting nucleic acids;
the linker comprises in order in the 5 '-3' direction a common sequence, optionally a single molecule barcode, and optionally a spacer sequence;
(b)
(i) the linker-added nucleic acid fragment extends outwardly at 5 'and 3' ends of the nucleic acid fragment by a nucleotide sequence of 16 to 40 nucleotides, respectively, compared to the nucleic acid fragment, and is directly used for capture without amplification; or
(ii) Amplifying the adaptor-added nucleic acid fragment using a first amplification forward primer and a first amplification reverse primer to obtain a first amplicon that extends outwardly from the nucleic acid fragment by nucleotide sequences that are each independently 16 to 40 nucleotides in length at the 5 'end and 3' end of the nucleic acid fragment as compared to the nucleic acid fragment, a portion or all of the sequence of the first amplification forward primer being sufficiently complementary to a portion or all of the sequence of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment, and a portion or all of the sequence of the first amplification reverse primer being sufficiently complementary to a portion or all of the sequence of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment;
(c) capturing target nucleic acid fragments from the linker-added nucleic acid fragments or the first amplicons without using a linker blocking agent;
(d) amplifying the captured target nucleic acid fragments using a second amplification forward primer and a second amplification downstream primer to obtain a second amplicon as a member of a sequencing library, thereby constructing the sequencing library,
the second amplification upstream primer comprises an upstream sequencer bridging anchor sequence, an optional index sequence and an upstream sequencing sequence in the 5 '-3' direction in sequence,
the second amplification downstream primer comprises a downstream sequencer bridging anchor sequence, an optional index sequence and a downstream sequencing sequence in the 5 '-3' direction in sequence,
the sequence of the common sequence or the complement thereof and the sequence of the first amplification upstream primer are each part or all of the sequence of the upstream sequencing sequence,
the sequence of the common sequence or the complement thereof and the sequence of the first amplification downstream primer are each part or all of the sequence of the downstream sequencing sequence.
Item 2. the method of item 1, wherein the linker-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 16-37 nucleotides each, independently, as compared to the nucleic acid fragment.
Item 3. the method of item 1, wherein the linker-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 16-34 nucleotides each, independently, as compared to the nucleic acid fragment.
Item 4. the method of item 1, wherein the linker-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, as compared to the nucleic acid fragment, a nucleotide sequence of 20-37 nucleotides each independently.
Item 5. the method of item 1, wherein the linker-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 20-34 nucleotides each, independently, as compared to the nucleic acid fragment.
Item 6. the method of item 1, wherein the linker-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment as compared to the nucleic acid fragment by a nucleotide sequence of 20-31 nucleotides each independently.
Item 7. the method of item 1, wherein the linker-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 24 nucleotides each, as compared to the nucleic acid fragment.
Item 8. the method of any one of items 1-7, wherein the single molecule barcode is 2-12 nucleotides in length.
Item 9. the method of any one of items 1 to 7, wherein the common sequence is 13 to 31 nucleotides in length.
Item 10. the method of any of items 1-7, wherein the joint is a single-link joint or a double-link joint; optionally, the double-stranded linker is a double-stranded Y-linker.
Item 11. the method of any one of items 1 to 7, wherein the 5 'end and/or the 3' end of the first amplification upstream primer is flush or not flush with the 5 'end and/or the 3' end of the adaptor; and/or
Wherein the 5 'end and/or the 3' end of the first amplification downstream primer is flush or not flush with the 5 'end and/or the 3' end of the adaptor.
Item 12. the method of any one of items 1-7, wherein:
(1) the nucleic acid fragments are from a single sample; or
(2) The nucleic acid fragments are from a plurality of samples, and the adaptor comprises the single molecule barcode.
Item 13. the method of any one of items 1 to 7, wherein the linker is free of an index sequence and a sequencer bridging anchor sequence.
Item 14. the method of any one of items 1 to 7, wherein the first amplification forward primer and the first amplification downstream primer are each independently free of an index sequence and a sequencer bridging anchor sequence.
Item 15. a kit for constructing a sequencing library of a target sequence, comprising:
(a) a linker capable of being added to the 5 'end and the 3' end of the nucleic acid fragment to obtain a linker-added nucleic acid fragment; optionally, the nucleic acid fragments are obtained by fragmenting nucleic acids;
the linker comprises in order in the 5 '-3' direction a common sequence, optionally a single molecule barcode, and optionally a spacer sequence;
(b)
(i) the linker-added nucleic acid fragment extends outwardly at 5 'and 3' ends of the nucleic acid fragment by a nucleotide sequence of 16 to 40 nucleotides, respectively, compared to the nucleic acid fragment, and is directly used for capture without amplification; or
(ii) A first amplification forward primer and a first amplification reverse primer, the first amplification forward primer and first amplification reverse primer capable of being used to amplify the adaptor-added nucleic acid fragment to obtain a first amplicon that extends outwardly from the nucleic acid fragment by a nucleotide sequence of 16-40 nucleotides each, independently, at the 5 'end and 3' end of the nucleic acid fragment as compared to the nucleic acid fragment, a portion or all of the sequence of the first amplification forward primer being sufficiently complementary to a portion or all of the sequence of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment, and a portion or all of the sequence of the first amplification reverse primer being sufficiently complementary to effect amplification of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment;
(c) a capture reagent capable of being used to capture a target nucleic acid fragment from the adaptor-added nucleic acid fragment or the first amplicon without using an adaptor blocker;
(d) a second amplification forward primer and a second amplification reverse primer capable of being used to amplify the captured target nucleic acid fragments to obtain a second amplicon as a member of a sequencing library, thereby constructing the sequencing library,
the second amplification upstream primer comprises an upstream sequencer bridging anchor sequence, an optional index sequence and an upstream sequencing sequence in the 5 '-3' direction in sequence,
the second amplification downstream primer comprises a downstream sequencer bridging anchor sequence, an optional index sequence and a downstream sequencing sequence in the 5 '-3' direction in sequence,
the sequence of the common sequence or the complement thereof and the sequence of the first amplification upstream primer are each part or all of the sequence of the upstream sequencing sequence,
the sequence of the common sequence or its complement and the sequence of the first amplification downstream primer are each part or all of the sequence of the downstream sequencing sequence;
wherein the kit does not comprise a linker blocking agent.
Item 16. the kit of item 15, wherein the linker-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, as compared to the nucleic acid fragment, a nucleotide sequence of 16-37 nucleotides each independently.
Item 17. the kit of item 15, wherein the linker-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, as compared to the nucleic acid fragment, a nucleotide sequence of 16-34 nucleotides each independently.
Item 18. the kit of item 15, wherein the linker-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, as compared to the nucleic acid fragment, a nucleotide sequence of 20-37 nucleotides each independently.
Item 19. the kit of item 15, wherein the linker-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, as compared to the nucleic acid fragment, a nucleotide sequence of 20-34 nucleotides each independently.
Item 20. the kit of item 15, wherein the linker-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment as compared to the nucleic acid fragment a nucleotide sequence of 20-31 nucleotides each independently.
Item 21. the kit of item 15, wherein the linker-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 24 nucleotides each, as compared to the nucleic acid fragment.
Item 22. the kit of any one of items 15-21, wherein the single molecule barcode is 2-12 nucleotides in length.
Item 23. the kit of any one of items 15-21, wherein the common sequence is 13-31 nucleotides in length.
Item 24. the kit of any one of items 15-21, wherein the linker is a single-or double-linker; optionally, the double-stranded linker is a double-stranded Y-linker.
Item 25. the kit of any one of items 15-21, wherein the 5 'end and/or the 3' end of the first amplification upstream primer is flush or not flush with the 5 'end and/or the 3' end of the adaptor; and/or
Wherein the 5 'end and/or the 3' end of the first amplification downstream primer is flush or not flush with the 5 'end and/or the 3' end of the adaptor.
Item 26. the kit of any one of items 15-21, wherein:
(1) the nucleic acid fragments are from a single sample; or
(2) The nucleic acid fragments are from a plurality of samples, and the adaptor comprises the single molecule barcode.
Item 27. the kit of any one of items 15-21, wherein the linker is free of an index sequence and a sequencer bridging anchor sequence.
Item 28. the kit of any one of items 15-21, wherein the first amplification forward primer and the first amplification downstream primer are each independently free of an index sequence and a sequencer bridging anchor sequence.
Item 29. use of an agent for making a kit for constructing a sequencing library of a target sequence, the agent comprising:
(a) a linker capable of being added to the 5 'end and the 3' end of the nucleic acid fragment to obtain a linker-added nucleic acid fragment; optionally, the nucleic acid fragments are obtained by fragmenting nucleic acids;
the linker comprises in order in the 5 '-3' direction a common sequence, optionally a single molecule barcode, and optionally a spacer sequence;
(b)
(i) the linker-added nucleic acid fragment extends outwardly at 5 'and 3' ends of the nucleic acid fragment by a nucleotide sequence of 16 to 40 nucleotides, respectively, compared to the nucleic acid fragment, and is directly used for capture without amplification; or
(ii) A first amplification forward primer and a first amplification reverse primer, the first amplification forward primer and first amplification reverse primer capable of being used to amplify the adaptor-added nucleic acid fragment to obtain a first amplicon that extends outwardly from the nucleic acid fragment by a nucleotide sequence of 16-40 nucleotides each, independently, at the 5 'end and 3' end of the nucleic acid fragment as compared to the nucleic acid fragment, a portion or all of the sequence of the first amplification forward primer being sufficiently complementary to a portion or all of the sequence of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment, and a portion or all of the sequence of the first amplification reverse primer being sufficiently complementary to effect amplification of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment;
(c) a capture reagent capable of being used to capture a target nucleic acid fragment from the adaptor-added nucleic acid fragment or the first amplicon without using an adaptor blocker;
(d) a second amplification forward primer and a second amplification reverse primer capable of being used to amplify the captured target nucleic acid fragments to obtain a second amplicon as a member of a sequencing library, thereby constructing the sequencing library,
the second amplification upstream primer comprises an upstream sequencer bridging anchor sequence, an optional index sequence and an upstream sequencing sequence in the 5 '-3' direction in sequence,
the second amplification downstream primer comprises a downstream sequencer bridging anchor sequence, an optional index sequence and a downstream sequencing sequence in the 5 '-3' direction in sequence,
the sequence of the common sequence or the complement thereof and the sequence of the first amplification upstream primer are each part or all of the sequence of the upstream sequencing sequence,
the sequence of the common sequence or its complement and the sequence of the first amplification downstream primer are each part or all of the sequence of the downstream sequencing sequence;
wherein the agent does not include a linker blocking agent.
Item 30. the use of item 29, wherein the adaptor-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment as compared to the nucleic acid fragment by a nucleotide sequence of 16 to 37 nucleotides each independently.
Item 31 the use of item 29, wherein the linker-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment as compared to the nucleic acid fragment by a nucleotide sequence of 16-34 nucleotides each independently.
Item 32. the use of item 29, wherein the linker-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment as compared to the nucleic acid fragment by a nucleotide sequence of 20-37 nucleotides each independently.
Item 33. the use of item 29, wherein the linker-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment as compared to the nucleic acid fragment by a nucleotide sequence of 20-34 nucleotides each independently.
Item 34 the use of item 29, wherein the linker-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment as compared to the nucleic acid fragment a nucleotide sequence having a length of each independently 20-31 nucleotides.
Item 35. the use of item 29, wherein the linker-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 24 nucleotides each, as compared to the nucleic acid fragment.
Item 36. the use of any one of items 29 to 35, wherein the single molecule barcode is 2 to 12 nucleotides in length.
Item 37. the use of any one of items 29 to 35, wherein the common sequence is 13 to 31 nucleotides in length.
Item 38. the use of any of items 29-35, wherein the joint is a single-link joint or a double-link joint; optionally, the double-stranded linker is a double-stranded Y-linker.
Item 39. the use of any one of items 29 to 35, wherein the 5 'end and/or the 3' end of the first amplification upstream primer is flush or not flush with the 5 'end and/or the 3' end of the adaptor; and/or
Wherein the 5 'end and/or the 3' end of the first amplification downstream primer is flush or not flush with the 5 'end and/or the 3' end of the adaptor.
Item 40. the use of any one of items 29-35, wherein:
(1) the nucleic acid fragments are from a single sample; or
(2) The nucleic acid fragments are from a plurality of samples, and the adaptor comprises the single molecule barcode.
Item 41. the use of any one of items 29 to 35, wherein the linker is free of an index sequence and a sequencer bridging anchor sequence.
Item 42. the use of any one of items 29 to 35, wherein the first amplification forward primer and the first amplification downstream primer are each independently free of an index sequence and a sequencer bridging anchor sequence.
The invention will now be illustrated by the following non-limiting examples. Unless otherwise indicated, experimental data are the average of duplicate replicates.
Examples
Method
(1) DNA fragmentation and end repair
a) The repair procedure was set up on a PCR instrument (model T100, manufacturer Bio-Rad) according to Table 1, with a hot lid temperature of 75 ℃ and a reaction volume of 25. mu.L, run and pause.
b) Using Qiagen's 5X WGS Fragmentation Mix (cat # Y9410L), 100ng of sample DNA was placed in a PCR tube, buffer EB (Qiagen, cat # 19086) was added to it to 17.5. mu.L depending on the volume of sample DNA, and then the components were added in sequence as exemplified in Table 2. Flick and mix evenly, centrifuge for 5 seconds, flick PCR tube to remove air bubbles, centrifuge for 5 seconds again, put into suspended PCR instrument, run the procedure of Table 1, in order to prepare the sample DNA fragment with repaired end.
TABLE 1
Reaction temperature Reaction time
32℃ 22min(100ng)
65℃ 30min
4℃ Pausing
TABLE 2
Figure BDA0002320502300000291
Figure BDA0002320502300000301
(2) Connecting joint
a) The ligation procedure was set up on a PCR instrument according to Table 3, the reaction volume was 50. mu.L, run and pause.
TABLE 3
Reaction temperature Reaction time
20℃ 15min
4℃ Pausing
b) The components were added in sequence as exemplified in Table 4 using Qiagen's WGS Ligase (cat # L6030-W-L), mixed well, centrifuged for 5 seconds, flicked to remove air bubbles, centrifuged for 5 seconds again, placed in a pause PCR apparatus and the procedure of Table 3 was run to prepare adaptor-ligated DNA fragments.
TABLE 4
Components Sample addition amount
End-repaired sample DNA fragments 25μL
Buffer EB 9μl
5x WGS Ligase buffer 10μL
Joint (10 μ M) 3μL
WGS Ligase 5μL
Final volume of reaction 50μL
c) Note that: the use of linkers with different single molecule barcodes for each sample avoids the repeated use of the same linker for the same batch of operations.
(3) Ligation product purification
a) Purification was performed using purified magnetic beads. Before purification, the beads were removed from the 4 ℃ freezer, mixed by inversion and equilibrated at room temperature for 30 min. mu.L of the ligation product was mixed with 100. mu.L (2X) of magnetic beads in a 1.5ml EP tube, gently mixed with a finger or shaken, centrifuged briefly for 3 seconds, and allowed to stand at room temperature for 5-10 minutes.
b) The EP tube was placed on a magnetic rack, adsorbed for 5 minutes, and the supernatant was removed by pipetting, taking care not to aspirate the magnetic beads.
c) Slowly adding 200 mu L of cleaning solution into the EP tube along one side far away from the magnetic beads on the magnetic frame, slightly shaking the magnetic frame to avoid the magnetic beads from scattering, and sucking away the supernatant by a pipette with attention to not suck the magnetic beads.
d) Repeating the step c).
e) The EP tube was removed from the magnetic stand, centrifuged briefly for 3 seconds, placed on the magnetic stand again, the entire supernatant was aspirated off with a 10. mu.L pipette, the lid of the tube was opened, and allowed to stand at room temperature for 5 minutes.
f) The EP tube was removed from the magnetic stand, 25. mu.L of DNase/RNase-Free deionized water (Tiangen, cat. RT121) was added thereto, and the eluate and magnetic beads were gently mixed by hand and left at room temperature for 10 minutes.
g) The EP tube was placed on a magnetic rack, adsorbed for 1 minute, and 23. mu.L of the supernatant, which was the purified product after ligation, was pipetted into a new PCR tube.
(4) First round amplification
a) The ligated purified product was returned to ice; the PCR program was set up on a PCR instrument according to Table 5, with a hot lid temperature of 105 ℃ and a reaction volume of 50. mu.L, run and pause:
TABLE 5
Figure BDA0002320502300000311
b) The HiFi enzyme ReadyMix (cat # KK2602) was hot-started using KAPA from Roche-KAPA, the components were added in sequence as exemplified in Table 6, mixed well, centrifuged for 5 seconds, flicked to remove air bubbles, centrifuged for 5 seconds again,
put into a paused PCR instrument, the table 5 program was run to prepare a first round amplification product (i.e., a first amplicon).
TABLE 6
Figure BDA0002320502300000312
Figure BDA0002320502300000321
(5) Purification and concentration determination of first round amplification product
a) Purification was performed using purified magnetic beads. Before purification, the beads were removed from the 4 ℃ freezer, mixed by inversion and equilibrated at room temperature for 30 min. 50 μ L of the first round amplification product (first amplicon) was mixed with 75 μ L (1.5X) of magnetic beads in a 1.5ml EP tube, gently mixed with a finger or shaken, centrifuged briefly for 3 seconds, and allowed to stand at room temperature for 5-10 minutes.
b) The EP tube was placed on a magnetic rack, adsorbed for 5 minutes, and the supernatant was removed by pipetting, taking care not to aspirate the magnetic beads.
c) Slowly adding 200 mu L of cleaning solution into the EP tube along one side far away from the magnetic beads on the magnetic frame, slightly shaking the magnetic frame to avoid the magnetic beads from scattering, and sucking away the supernatant by a pipette with attention to not suck the magnetic beads.
d) Repeating the step c).
e) The EP tube was removed from the magnetic stand, centrifuged briefly for 3 seconds, placed on the magnetic stand again, the entire supernatant was aspirated off with a 10. mu.L pipette, the lid of the tube was opened, and allowed to stand at room temperature for 5 minutes.
f) The EP tube was removed from the magnetic stand, 35. mu.L of DNase/RNase-Free deionized water (Tiangen, cat. RT121) was added thereto, and the eluate and magnetic beads were gently mixed by hand and left at room temperature for 10 minutes.
g) The EP tube was placed on a magnetic rack, adsorbed for 1 minute, and 33. mu.L of the supernatant, which was the pre-library purified product, was pipetted into a fresh EP tube.
h) Mu.l of the supernatant was collected and subjected to the Qubit dsDNA HS Assay Kit (Invitrogen, cat # Q32851) and QubitTM4Fluorometer (Invitrogen, cat # Q33226) concentration meter, the first amplicon concentration in the supernatant was determined.
i) And (3) performing Fragment quality inspection by using Fragment quality inspection equipment, such as Agilent 4200 TapesStation Fragment Analyzer, and performing quality inspection on the first amplicon. The length of the first amplicon is the length of the sample fragment + the length of the adaptors at both ends. The first amplicon can be used immediately for further experiments or stored at-20 + -5 deg.C.
(6) Hybridization of biotin-containing probes
a) The procedure of table 7 was set up on the PCR instrument, run and pause, respectively:
TABLE 7
Figure BDA0002320502300000331
b) From the first amplicon concentration of each sample, a volume of 500ng of each sample was calculated, the first round of amplification product was added to a new PCR tube a, one PCR tube could be filled with 1-8 samples with different molecular barcodes, marked with a marker, using xGen Hybridization and Wash Kit from IDT, Box1, Box2 (cat No. 1080584), the components were added in sequence as exemplified in table 8, centrifuged for 5 seconds, flicked to remove air bubbles, centrifuged for 5 seconds again, the PCR tube was placed into a vacuum concentrator with the tube cap open, and the 45 ℃ concentration procedure was run until there was no liquid in dry state.
TABLE 8
Figure BDA0002320502300000332
c) During library concentration, the Hybridization solution (XGen Hybridization and Wash Kit from IDT, Box1, Box2 (cat 1080584)) and the biotin-containing probe as designed were thawed on ice, the components were added to PCR tube B in the order as exemplified in Table 9, vortexed gently, centrifuged for 5 seconds, flicked to remove air bubbles, and centrifuged for 5 seconds.
TABLE 9
Figure BDA0002320502300000333
Figure BDA0002320502300000341
d) Transferring 17 μ l of the mixture in PCR tube B to PCR tube A concentrated to dry state (when the mixture is added, the pipette tip does not touch the bottom of the tube, and can be added at 1/2 of the tube wall), standing at room temperature for 5min, gently swirling or flicking with fingers, mixing, centrifuging for 5 seconds, removing bubbles, centrifuging for 5 seconds, placing in a PCR instrument, running the hybridization program set in Table 7, and performing overnight hybridization reaction to allow the biotin-containing probe to hybridize with the target molecule.
(7) Streptavidin magnetic beads capture of target molecules hybridized with biotin-containing probes
a) Preparing Buffer, 1h before the Hybridization reaction is finished, taking out the captured magnetic beads Dynabeads Streptavidin M270 from IDT for room temperature balance for 30min, unfreezing other components at room temperature, diluting the washing liquid into 1x working liquid according to the table 10 (taking one sample as an example, mix can be prepared according to actual loss when the sample is more), taking 100 mul of the prepared 1x Wash Buffer I to be packaged into 0.2ml PCR tubes, taking 300 mul of the 1x Stringent Wash Buffer to be packaged into 3 0.2ml PCR tubes (100 mul in each tube), then placing the packaged 1x Wash Buffer I, 1x Stringent Wash Buffer I and Wash Buffer into 65 ℃ for balance for 15min, and placing the rest Wash Buffer I, Wash II and Wash III into stand-by for standby at room temperature.
Watch 10
Figure BDA0002320502300000342
b) The PCR program was set up on the PCR machine with a hot lid temperature of 70 ℃ and a reaction volume of 40. mu.L, run and pause:
TABLE 11
Reaction temperature Reaction time
65℃ 45min
65℃ Pausing
c) Vortexed Dynabeads Streptavidine M270 for 15 seconds and then 50. mu.l of each sample was added to the PCR tube; adding 100 μ l of 1x Bead Wash Buffer (from IDT, XGen Hybridization and Wash Kit, Box1, Box2 (Cat. 1080584)) into each PCR tube, mixing by vortex, placing on a magnetic frame for 1min until the solution is clear, removing the supernatant, performing the operation for 3 times, finally discarding the supernatant, centrifuging for 5 s, placing on the magnetic frame for 1min, and removing the residual liquid; mu.l of magnetic bead resuspension buffer (XGen Hybridization and Wash Kit from IDT, Box1, Box2 (cat. No. 1080584)) was added immediately per sample (magnetic bead resuspension buffer was prepared according to Table 9, in which the biotin-containing probe in the composition was replaced with DNase/RNase-Free deionized water (Tiangen, cat. No. RT121)), centrifuged for 5 seconds, gently vortexed, bubbled, re-centrifuged for 5 seconds, placed in a PCR instrument for the procedure of Table 10, to bind streptavidin magnetic beads to the target molecules hybridized with the biotin-containing probes, followed by rapid next-step reaction.
d) Opening a PCR instrument cover, opening a PCR tube cover, but not stopping the operation program, blowing and sucking the mixed solution of the preheated and resuspended magnetic beads at 65 ℃ for 5 times, quickly adding the mixed solution into the PCR tube, blowing and sucking the mixed solution uniformly, quickly centrifuging for a short time, operating the program in the table 10, blowing and sucking the mixed solution for 10 times or slightly whirling the mixed solution by using a low-adsorption gun head of 20 mu l every 15min to ensure that the magnetic beads are in a suspended state, and centrifuging for a short time to remove bubbles after each mixing.
e) Washing at 65 ℃, adding 150 μ l of 1xWash Buffer I (from IDT, xGen Hybridization and Wash Kit, Box1, Box2 (cat 1080584)) preheated at 65 ℃ into a PCR tube after 45min incubation, performing suction with a 70 μ l low-adsorption gun head for 10 times, placing on a magnetic frame for 1min until the solution is clarified, and removing the supernatant; adding 150 μ l of 65 deg.C preheated 1x Stringent Wash Buffer (from IDT xGen Hybridization and Wash Kit, Box1, Box2 (Cat. No. 1080584)), sucking 10 times with 50 μ l low adsorption gun head, placing on 65 deg.C constant temperature metal bath (manufacturer's jin Yin Xing Biotech (Beijing) Co., Ltd., model H203-100C), incubating at 65 deg.C for 5min, placing on magnetic frame for 1min until the solution is clarified, removing supernatant, and repeating the steps once;
f) washing at room temperature, taking the PCR tube off the magnetic frame, adding 150 μ l of room-temperature-placed 1 × Wash Buffer I (from IDT, xGen Hybridization and Wash Kit, Box1, Box2 (Cat. 1080584)), keeping vortex for 30s, stopping 30s, vortexing again for 30s, centrifuging, placing on the magnetic frame for 1min until the solution is clear, and removing the supernatant; taking down the PCR tube from the magnetic frame, adding 150 μ l of 1 × Wash Buffer II placed at room temperature, keeping vortexing for 30s, stopping vortexing for 30s, vortexing again for 30s, centrifuging, placing on the magnetic frame for 1min until the solution is clarified, and removing the supernatant; removing the PCR tube from the magnetic frame, adding 150. mu.l of room temperature-standing 1x Wash Buffer III (XGen Hybridization and Wash Kit from IDT, Box1, Box2 (cat 1080584)), keeping vortexing for 30s, stopping vortexing for 30s, vortexing again for 30s, centrifugating, standing on the magnetic frame for 1min until the solution is clear, removing the supernatant, and centrifugating again and standing on the magnetic frame to completely remove the residual Wash solution; the PCR tube was removed from the magnetic stand, 23. mu.l DNase/RNase-Free deionized water was added, and the mixture was pipetted, mixed and centrifuged for later use. At this time, both the magnetic beads and the suspension are present in the PCR tube, and care is taken not to remove the magnetic beads from the magnetic rack but to carry the magnetic beads to the next reaction.
(8) Second round amplification and purification after capture
a) An amplification program is set on a PCR instrument, the temperature of a hot cover is 105 ℃, the reaction volume is 55 mu L, the operation is carried out and suspended, and the cycle number can be adjusted according to different biotin-containing probes:
TABLE 12
Figure BDA0002320502300000361
b) The components were added in the order as exemplified in Table 12, mixed well, centrifuged for 5 seconds, flicked to remove air bubbles, centrifuged for 5 seconds again, placed in a suspended PCR apparatus, and the procedure of Table 11 was run to prepare a second round of amplification product.
Watch 13
Figure BDA0002320502300000371
c) And (3) purifying an amplification product: the second round of amplification products were purified using AMPure XP purified magnetic beads. Before purification, the beads were removed from the 4 ℃ freezer, mixed by inversion and equilibrated at room temperature for 30 min. Mixing 50 μ L of the amplification product and 75 μ L of magnetic beads in a PCR tube, flicking with fingers or shaking, centrifuging for 3 seconds, and standing at room temperature for 8 minutes. The sample was placed on a magnetic rack, adsorbed for 5 minutes, and the supernatant was removed by pipetting, taking care not to aspirate the beads. And slowly adding 150 mu L of 80% absolute ethyl alcohol along one side far away from the magnetic beads on the magnetic frame, slightly shaking the magnetic frame to avoid the magnetic beads from scattering, and sucking away the supernatant by using a pipettor to pay attention to not suck the magnetic beads. The washing was repeated once more with 150. mu.L of 80% absolute ethanol. The PCR plate was removed from the magnetic stand, centrifuged briefly for 3 seconds, placed on the magnetic stand again, the whole supernatant was aspirated with a 10. mu.L pipette, the lid of the tube was opened, and allowed to stand at room temperature for 5 minutes. The PCR plate was removed from the magnetic stand, 40. mu.L of Buffer EB (Qiagen Cat. No. 19086) was added, vortexed well for 1min, and allowed to stand at room temperature for 8 min after gentle centrifugation. The PCR plate was placed on a magnetic rack, adsorbed for 1 minute, and 58. mu.L of the supernatant was pipetted into a new 1.5ml EP tube to obtain a sequencing library.
(9) And (3) controlling the quality and sequencing of the sequencing library, wherein the sequencing library is subjected to real-time fluorescent quantitative PCR detection, and a sample is more than or equal to 5nM, so that the next on-machine sequencing can be carried out. If the next experiment cannot be carried out immediately, the sequencing library can be stored at-20 +/-5 ℃ and arranged to be sequenced on the computer within one month. The fragment length of the sequencing library is equal to the length of the sample fragment + the length of the full-length sequencing sequence on both ends. Sequencing the sequencing library on a sequencing platform.
(10) Analyzing sequencing data, such as coverage of the target sequence at a certain sequencing depth, target hit rate and the like.
Example 1: library construction without blocking agent for linkers of the invention (C), IDT commercial Long linker control plus blocking Library construction of blocking Agents (K), library construction of IDT commercial linker controls plus blocking Agents (M), IDT commercial Long linker pairs Between library construction without blocking agent (J1) and IDT commercial linker control library construction without blocking agent (J2) Comparison
Sample preparation: commercial tumor mutation standards (cyanine good gene, tumor SNV 5% gDNA standard, cat # GWOGTM1003) diluted to a certain mutation ratio.
Target: 400K, containing 86 tumor-associated genes (IDT synthesized 120nt 5 end modified biotin probe pool).
And (3) jointing: (1) linker of the invention (31 nucleotides) (example C, sequence see sequence listing); (2) commercial long Adapter control (xGen Dual index with UMI Adapter) purchased from IDT (same for comparative examples K and J1, IDT does not disclose sequence); and (3) a commercial linker control (Duplex Seq Adapter (same for comparative examples M and J2, IDT without published sequence) purchased from IDT.
First round amplification: for the adaptors of the present invention, an intermediate library (or pre-library) is prepared. For the commercial long adapter control purchased from IDT, an already complete whole genome library was constructed. The corresponding primer sequences are shown in a sequence table.
Capturing: no blocking agent was added to example C and comparative examples J1 and J2. Comparative examples K and M were prepared with the addition of blocking agent, XGen Universal blocks-TS Mix from IDT, cat # 1075475.
And (3) second round amplification: sequencing libraries (or final libraries) were prepared and the corresponding primer sequences are shown in the sequence listing. Note that for the IDT commercial adaptor comparative examples (M and J2), the second round of amplification was performed before capture as was the first round of amplification to obtain a complete whole genome library before capture.
As a result:
sequencing was performed as described in the methods section above and the results are shown in FIG. 3. Compared with a contrast method (K, 75%) using IDT commercial long linker control and adding a blocking agent, the method of the invention without adding the blocking agent (C, 82%) has the advantages that the target hit rate is improved by 7%; compared with the contrast method (J1, 30%) using IDT commercial long linker control and no blocking agent, the target hit rate is improved by 52%; the hit rate was increased by 8% compared to the contrast method (M, 74%) using IDT commercial linker control and addition of blocking agent; the on-target rate was increased by 52% compared to the control method (J2, 30%) using IDT commercial linker control without capping agent. It can be seen that the library construction method according to the invention eliminates the need for blocking agents of the prior art methods, thereby reducing the cost of detection and even increasing the rate of target hits.
Example 2: comparison between library construction without blocking agent for linkers of various lengths, library construction without blocking agent for long linker control, and library construction with blocking agent for IDT commercial long linker Control (CK) of the present invention.
Sample preparation: same as in example 1.
Target: same as in example 1.
And (3) jointing: (1) linkers of various lengths of the invention (16 nucleotides, 20 nucleotides, 24 nucleotides, 31 nucleotides, 34 nucleotides, 37 nucleotides, 40 nucleotides); (2) the inventors self-designed long linker controls (62 nucleotides) according to the Illumina platform; and (3) commercial long splice control (same as example 1) purchased from IDT (CK). The linker sequence is shown in the sequence listing.
First round amplification: for linkers of various lengths of the invention, intermediate libraries (or pre-libraries) are prepared. For both the self-designed long linker (62 nucleotides) and the commercial long linker purchased from IDT, an already complete whole genome library was constructed. The corresponding primer sequences are shown in a sequence table.
Capturing: no splice closure agent was added to the examples using splices of various lengths according to the invention; the comparative example using the 62 nucleotide long linker control did not have a linker blocking agent added; a linker blocking agent was added to comparative example (CK) using a commercial linker control purchased from IDT.
And (3) second round amplification: sequencing libraries (or final libraries) were prepared and the corresponding primer sequences are shown in the sequence listing.
As a result:
sequencing was performed as described in the methods section above and the results are shown in FIG. 4. The linkers of the present invention achieved comparable targeting rates to the comparative example (CK, 74.6%) using IDT commercial long linkers and linker blocking agents without linker blocking agents in the 16-40 nucleotide length range (65.5-84.9%), and even higher targeting rates in the 20-31 nucleotide length range (81.4-84.9%), consistent with example 1. When the linker length reaches 62 nucleotides, the targeting rate is low (48.9%) if no linker blocking agent is added. These results indicate that the targeting rate of the library construction method according to the present invention is determined by the length of the capture object (e.g., linker-added nucleic acid fragment or first amplicon directly used for capture) extending outward at the 5 'end and 3' end of the nucleic acid fragment compared to the nucleic acid fragment. The linkers of various lengths of the present invention also demonstrate the successful use of single molecule barcodes of various lengths.
Example 3: in clinical applications, a comparison was made between the linker-unblocked library construction of the present invention and the IDT commercial linker-blocked library construction.
Sample preparation: clinical tumor fresh tissue samples from Co-building laboratories (12 cases)
Target: 39M full Exome derived from IDT xGen outer Research Panel v1.0, cat # 1056115.
And (3) jointing: (1) the linker of the invention (31 nucleotides); and (2) commercial long splice control purchased from IDT (same as example 1). The linker sequence is shown in the sequence listing.
First round amplification: for the adaptors of the present invention, an intermediate library (or pre-library) is prepared. For the commercial long adapter control purchased from IDT, an already complete whole genome library was constructed. The corresponding primer sequences are shown in a sequence table.
Capturing: no linker blocking agent was added in the examples using the linker of the invention; the linker blocking agent was added to the comparative example using a commercial linker control purchased from IDT.
And (3) second round amplification: sequencing libraries (or final libraries) were prepared and the corresponding primer sequences are shown in the sequence listing.
As a result:
sequencing was performed as described in the methods section above and the results are shown in FIG. 5. When the library construction method is applied to clinical samples, compared with the prior art, the library construction method eliminates the need of joint sealants, the target hit rate is improved by 3.54%, the redundancy rate is reduced by 2.88%, the coverage is basically the same, and the stability after 12 times of repetition is better than that of the prior art.
Example 4: comparison between library construction of linkers of the invention of the same length and different nucleotide composition without addition of blocking agent.
Sample preparation: same as in example 1.
Target: same as in example 1.
And (3) jointing: two linkers of the invention of the same length (31 nucleotides), different nucleotide composition C (same as in example 1) and D. The linker sequence is shown in the sequence listing.
First round amplification: an intermediate library (or pre-library) is prepared. The corresponding primer sequences are shown in a sequence table.
Capturing: no linker blocking agent was added.
And (3) second round amplification: sequencing libraries (or final libraries) were prepared and the corresponding primer sequences are shown in the sequence listing.
As a result:
sequencing was performed as described in the methods section above and the results are shown in FIG. 6. Linkers of the invention of the same length and different nucleotide composition gave very similar targeting rates (82.3% vs. 82.7%), indicating that the linker nucleotide composition did not substantially affect the targeting rate.
Example 5: economics analysis of detection methods
The economics of the assay were analyzed using the clinical sample library cost in example 3. The 12 samples were hybridized in a way that 1 capture library was hybridized with 4 samples, with a 14.72% reduction in cost. 12 samples were hybridized in a way that 1 capture library was hybridized to 1 sample, reducing the cost by 15.96%.
Table 14: main reagent for establishing library in prior art (IDT method)
Figure BDA0002320502300000411
Figure BDA0002320502300000421
a: singapore Yuan;b: dollars.
Table 15: main reagent for building warehouse
Figure BDA0002320502300000422
a: singapore Yuan;b: dollars.
Table 16: cost comparison
Figure BDA0002320502300000423
Figure BDA0002320502300000431
The foregoing examples and description of the embodiments should be taken as illustrating, rather than as limiting, the present invention as defined by the claims. As will be readily appreciated, numerous variations and combinations of the features set forth above may be utilized without departing from the present invention as set forth in the claims. Such variations are not to be regarded as a departure from the spirit and scope of the invention, and all such modifications are intended to be included within the scope of the following claims.
Sequence listing
Figure BDA0002320502300000441
Figure BDA0002320502300000451
Figure BDA0002320502300000461
N represents a single molecule barcode sequence, and N represents an index sequence.
Figure IDA0002320502350000011
Figure IDA0002320502350000021
Figure IDA0002320502350000031
Figure IDA0002320502350000041
Figure IDA0002320502350000051
Figure IDA0002320502350000061
Figure IDA0002320502350000071
Figure IDA0002320502350000081
Figure IDA0002320502350000091
Figure IDA0002320502350000101
Figure IDA0002320502350000111
Figure IDA0002320502350000121
Figure IDA0002320502350000131
Figure IDA0002320502350000141
Figure IDA0002320502350000151
Figure IDA0002320502350000161
Figure IDA0002320502350000171
Figure IDA0002320502350000181

Claims (42)

1. A method of constructing a sequencing library of target sequences, comprising:
(a) adding linkers to the 5 'end and the 3' end of the nucleic acid fragment to obtain a linker-added nucleic acid fragment; optionally, the nucleic acid fragments are obtained by fragmenting nucleic acids;
the linker comprises in order in the 5 '-3' direction a common sequence, optionally a single molecule barcode, and optionally a spacer sequence;
(b)
(i) the linker-added nucleic acid fragment extends outwardly at 5 'and 3' ends of the nucleic acid fragment by a nucleotide sequence of 16 to 40 nucleotides, respectively, compared to the nucleic acid fragment, and is directly used for capture without amplification; or
(ii) Amplifying the adaptor-added nucleic acid fragment using a first amplification forward primer and a first amplification reverse primer to obtain a first amplicon that extends outwardly from the nucleic acid fragment by nucleotide sequences that are each independently 16 to 40 nucleotides in length at the 5 'end and 3' end of the nucleic acid fragment as compared to the nucleic acid fragment, a portion or all of the sequence of the first amplification forward primer being sufficiently complementary to a portion or all of the sequence of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment, and a portion or all of the sequence of the first amplification reverse primer being sufficiently complementary to a portion or all of the sequence of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment;
(c) capturing target nucleic acid fragments from the linker-added nucleic acid fragments or the first amplicons without using a linker blocking agent;
(d) amplifying the captured target nucleic acid fragments using a second amplification forward primer and a second amplification downstream primer to obtain a second amplicon as a member of a sequencing library, thereby constructing the sequencing library,
the second amplification upstream primer comprises an upstream sequencer bridging anchor sequence, an optional index sequence and an upstream sequencing sequence in the 5 '-3' direction in sequence,
the second amplification downstream primer comprises a downstream sequencer bridging anchor sequence, an optional index sequence and a downstream sequencing sequence in the 5 '-3' direction in sequence,
the sequence of the common sequence or the complement thereof and the sequence of the first amplification upstream primer are each part or all of the sequence of the upstream sequencing sequence,
the sequence of the common sequence or the complement thereof and the sequence of the first amplification downstream primer are each part or all of the sequence of the downstream sequencing sequence.
2. The method of claim 1, wherein the adaptor-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 16-37 nucleotides each, independently, as compared to the nucleic acid fragment.
3. The method of claim 1, wherein the adaptor-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 16-34 nucleotides each, independently, as compared to the nucleic acid fragment.
4. The method of claim 1, wherein the adaptor-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 20-37 nucleotides each, independently, as compared to the nucleic acid fragment.
5. The method of claim 1, wherein the adaptor-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 20-34 nucleotides each, independently, as compared to the nucleic acid fragment.
6. The method of claim 1, wherein the adaptor-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, as compared to the nucleic acid fragment, a nucleotide sequence of 20-31 nucleotides each independently.
7. The method of claim 1, wherein the adaptor-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 24 nucleotides each, independently, as compared to the nucleic acid fragment.
8. The method of any one of claims 1-7, wherein the single molecule barcode is 2-12 nucleotides in length.
9. The method of any one of claims 1-7, wherein the common sequence is 13-31 nucleotides in length.
10. The method of any one of claims 1-7, wherein the linker is a single-link linker or a double-link linker; optionally, the double-stranded linker is a double-stranded Y-linker.
11. The method of any one of claims 1-7, wherein the 5 'end and/or the 3' end of the first amplification upstream primer is flush or not flush with the 5 'end and/or the 3' end of the adaptor; and/or
Wherein the 5 'end and/or the 3' end of the first amplification downstream primer is flush or not flush with the 5 'end and/or the 3' end of the adaptor.
12. The method of any one of claims 1-7, wherein:
(1) the nucleic acid fragments are from a single sample; or
(2) The nucleic acid fragments are from a plurality of samples, and the adaptor comprises the single molecule barcode.
13. The method of any one of claims 1-7, wherein the linker is free of an index sequence and a sequencer bridging anchor sequence.
14. The method of any one of claims 1-7, wherein the first amplification forward primer and the first amplification downstream primer are each independently free of an index sequence and a sequencer bridging anchor sequence.
15. A kit for constructing a sequencing library of target sequences, comprising:
(a) a linker capable of being added to the 5 'end and the 3' end of the nucleic acid fragment to obtain a linker-added nucleic acid fragment; optionally, the nucleic acid fragments are obtained by fragmenting nucleic acids;
the linker comprises in order in the 5 '-3' direction a common sequence, optionally a single molecule barcode, and optionally a spacer sequence;
(b)
(i) the linker-added nucleic acid fragment extends outwardly at 5 'and 3' ends of the nucleic acid fragment by a nucleotide sequence of 16 to 40 nucleotides, respectively, compared to the nucleic acid fragment, and is directly used for capture without amplification; or
(ii) A first amplification forward primer and a first amplification reverse primer, the first amplification forward primer and first amplification reverse primer capable of being used to amplify the adaptor-added nucleic acid fragment to obtain a first amplicon that extends outwardly from the nucleic acid fragment by a nucleotide sequence of 16-40 nucleotides each, independently, at the 5 'end and 3' end of the nucleic acid fragment as compared to the nucleic acid fragment, a portion or all of the sequence of the first amplification forward primer being sufficiently complementary to a portion or all of the sequence of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment, and a portion or all of the sequence of the first amplification reverse primer being sufficiently complementary to effect amplification of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment;
(c) a capture reagent capable of being used to capture a target nucleic acid fragment from the adaptor-added nucleic acid fragment or the first amplicon without using an adaptor blocker;
(d) a second amplification forward primer and a second amplification reverse primer capable of being used to amplify the captured target nucleic acid fragments to obtain a second amplicon as a member of a sequencing library, thereby constructing the sequencing library,
the second amplification upstream primer comprises an upstream sequencer bridging anchor sequence, an optional index sequence and an upstream sequencing sequence in the 5 '-3' direction in sequence,
the second amplification downstream primer comprises a downstream sequencer bridging anchor sequence, an optional index sequence and a downstream sequencing sequence in the 5 '-3' direction in sequence,
the sequence of the common sequence or the complement thereof and the sequence of the first amplification upstream primer are each part or all of the sequence of the upstream sequencing sequence,
the sequence of the common sequence or its complement and the sequence of the first amplification downstream primer are each part or all of the sequence of the downstream sequencing sequence;
wherein the kit does not comprise a linker blocking agent.
16. The kit of claim 15, wherein the adaptor-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 16-37 nucleotides each, independently, as compared to the nucleic acid fragment.
17. The kit of claim 15, wherein the adaptor-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 16-34 nucleotides each, independently, as compared to the nucleic acid fragment.
18. The kit of claim 15, wherein the adaptor-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 20-37 nucleotides each, independently, as compared to the nucleic acid fragment.
19. The kit of claim 15, wherein the adaptor-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 20-34 nucleotides each, independently, as compared to the nucleic acid fragment.
20. The kit of claim 15, wherein the adaptor-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, as compared to the nucleic acid fragment, a nucleotide sequence of 20-31 nucleotides each independently.
21. The kit of claim 15, wherein the adaptor-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 24 nucleotides each, independently, as compared to the nucleic acid fragment.
22. The kit of any one of claims 15-21, wherein the single molecule barcode is 2-12 nucleotides in length.
23. The kit of any one of claims 15-21, wherein the common sequence is 13-31 nucleotides in length.
24. The kit of any one of claims 15-21, wherein the linker is a single-or double-linker; optionally, the double-stranded linker is a double-stranded Y-linker.
25. The kit of any one of claims 15-21, wherein the 5 'end and/or the 3' end of the first amplification upstream primer is flush or not flush with the 5 'end and/or the 3' end of the adaptor; and/or
Wherein the 5 'end and/or the 3' end of the first amplification downstream primer is flush or not flush with the 5 'end and/or the 3' end of the adaptor.
26. The kit of any one of claims 15-21, wherein:
(1) the nucleic acid fragments are from a single sample; or
(2) The nucleic acid fragments are from a plurality of samples, and the adaptor comprises the single molecule barcode.
27. The kit of any one of claims 15-21, wherein the linker is free of an index sequence and a sequencer bridging anchor sequence.
28. The kit of any one of claims 15-21, wherein the first amplification forward primer and the first amplification downstream primer are each independently free of an index sequence and a sequencer bridging anchor sequence.
29. Use of an agent for the preparation of a kit for constructing a sequencing library of a target sequence, the agent comprising:
(a) a linker capable of being added to the 5 'end and the 3' end of the nucleic acid fragment to obtain a linker-added nucleic acid fragment; optionally, the nucleic acid fragments are obtained by fragmenting nucleic acids;
the linker comprises in order in the 5 '-3' direction a common sequence, optionally a single molecule barcode, and optionally a spacer sequence;
(b)
(i) the linker-added nucleic acid fragment extends outwardly at 5 'and 3' ends of the nucleic acid fragment by a nucleotide sequence of 16 to 40 nucleotides, respectively, compared to the nucleic acid fragment, and is directly used for capture without amplification; or
(ii) A first amplification forward primer and a first amplification reverse primer, the first amplification forward primer and first amplification reverse primer capable of being used to amplify the adaptor-added nucleic acid fragment to obtain a first amplicon that extends outwardly from the nucleic acid fragment by a nucleotide sequence of 16-40 nucleotides each, independently, at the 5 'end and 3' end of the nucleic acid fragment as compared to the nucleic acid fragment, a portion or all of the sequence of the first amplification forward primer being sufficiently complementary to a portion or all of the sequence of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment, and a portion or all of the sequence of the first amplification reverse primer being sufficiently complementary to effect amplification of the common sequence or its complement to effect amplification of the adaptor-added nucleic acid fragment;
(c) a capture reagent capable of being used to capture a target nucleic acid fragment from the adaptor-added nucleic acid fragment or the first amplicon without using an adaptor blocker;
(d) a second amplification forward primer and a second amplification reverse primer capable of being used to amplify the captured target nucleic acid fragments to obtain a second amplicon as a member of a sequencing library, thereby constructing the sequencing library,
the second amplification upstream primer comprises an upstream sequencer bridging anchor sequence, an optional index sequence and an upstream sequencing sequence in the 5 '-3' direction in sequence,
the second amplification downstream primer comprises a downstream sequencer bridging anchor sequence, an optional index sequence and a downstream sequencing sequence in the 5 '-3' direction in sequence,
the sequence of the common sequence or the complement thereof and the sequence of the first amplification upstream primer are each part or all of the sequence of the upstream sequencing sequence,
the sequence of the common sequence or its complement and the sequence of the first amplification downstream primer are each part or all of the sequence of the downstream sequencing sequence;
wherein the agent does not include a linker blocking agent.
30. The use of claim 29, wherein the adaptor-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 16-37 nucleotides each, independently, as compared to the nucleic acid fragment.
31. The use of claim 29, wherein the adaptor-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 16-34 nucleotides each, independently, as compared to the nucleic acid fragment.
32. The use of claim 29, wherein the adaptor-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 20-37 nucleotides each, independently, as compared to the nucleic acid fragment.
33. The use of claim 29, wherein the adaptor-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 20-34 nucleotides each, independently, as compared to the nucleic acid fragment.
34. The use of claim 29, wherein the adaptor-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, as compared to the nucleic acid fragment, a nucleotide sequence of 20-31 nucleotides each independently.
35. The use of claim 29, wherein the adaptor-added nucleic acid fragment or the first amplicon extends outwardly from the 5 'end and the 3' end of the nucleic acid fragment, respectively, by a nucleotide sequence of 24 nucleotides each, independently, as compared to the nucleic acid fragment.
36. The use of any one of claims 29-35, wherein the single molecule barcode is 2-12 nucleotides in length.
37. The use of any one of claims 29-35, wherein the common sequence is 13-31 nucleotides in length.
38. The use of any one of claims 29-35, wherein the linker is a single-or double-link linker; optionally, the double-stranded linker is a double-stranded Y-linker.
39. The use of any one of claims 29-35, wherein the 5 'end and/or the 3' end of the first amplification upstream primer is flush or not flush with the 5 'end and/or the 3' end of the adaptor; and/or
Wherein the 5 'end and/or the 3' end of the first amplification downstream primer is flush or not flush with the 5 'end and/or the 3' end of the adaptor.
40. The use of any one of claims 29-35, wherein:
(1) the nucleic acid fragments are from a single sample; or
(2) The nucleic acid fragments are from a plurality of samples, and the adaptor comprises the single molecule barcode.
41. The use of any one of claims 29-35, wherein the linker is free of an index sequence and a sequencer bridging anchor sequence.
42. The use of any one of claims 29-35, wherein the first amplification forward primer and the first amplification downstream primer are each independently free of an index sequence and a sequencer bridging anchor sequence.
CN201911295838.7A 2019-11-29 2019-11-29 Method for constructing sequencing library of target sequence Active CN112877403B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911295838.7A CN112877403B (en) 2019-11-29 2019-11-29 Method for constructing sequencing library of target sequence

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201911295838.7A CN112877403B (en) 2019-11-29 2019-11-29 Method for constructing sequencing library of target sequence
CN201911207127.XA CN110872610B (en) 2019-11-29 2019-11-29 Method for constructing sequencing library of target sequence

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN201911207127.XA Division CN110872610B (en) 2019-11-29 2019-11-29 Method for constructing sequencing library of target sequence

Publications (2)

Publication Number Publication Date
CN112877403A true CN112877403A (en) 2021-06-01
CN112877403B CN112877403B (en) 2023-11-03

Family

ID=69717477

Family Applications (2)

Application Number Title Priority Date Filing Date
CN201911207127.XA Active CN110872610B (en) 2019-11-29 2019-11-29 Method for constructing sequencing library of target sequence
CN201911295838.7A Active CN112877403B (en) 2019-11-29 2019-11-29 Method for constructing sequencing library of target sequence

Family Applications Before (1)

Application Number Title Priority Date Filing Date
CN201911207127.XA Active CN110872610B (en) 2019-11-29 2019-11-29 Method for constructing sequencing library of target sequence

Country Status (1)

Country Link
CN (2) CN110872610B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112226486B (en) * 2020-09-24 2021-11-30 上海英基生物科技有限公司 Library construction method for targeted enrichment of genome methylated DNA and application thereof
CN114517223B (en) * 2020-11-20 2023-09-12 福建和瑞基因科技有限公司 Method for screening SNP loci and application thereof
CN114790453A (en) * 2021-01-25 2022-07-26 上海思路迪生物医学科技有限公司 Rapid hybridization capture kit
CN117230168A (en) * 2023-11-10 2023-12-15 北京贝瑞和康生物技术有限公司 Joint primer and method for reducing high-throughput sequencing redundancy

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103806111A (en) * 2012-11-15 2014-05-21 深圳华大基因科技有限公司 Construction method and application of high-throughout sequencing library
WO2017004083A1 (en) * 2015-06-29 2017-01-05 The Regents Of The University Of California Methods of producing nucleic acid libraries and compositions and kits for practicing same
WO2017054302A1 (en) * 2015-09-30 2017-04-06 中国农业科学院深圳农业基因组研究所 Sequencing library, and preparation and use thereof
CN108060191A (en) * 2017-11-07 2018-05-22 深圳华大基因科技有限公司 A kind of method, library constructing method and the kit of double stranded nucleic acid fragment adjunction head
CN110409001A (en) * 2019-07-25 2019-11-05 北京贝瑞和康生物技术有限公司 The method and kit in a kind of building capture library

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107236729A (en) * 2017-07-04 2017-10-10 上海阅尔基因技术有限公司 The method and kit of a kind of rapid build target nucleic acid sequencing library that enrichment is captured based on probe
CN109234356B (en) * 2018-09-18 2021-10-08 南京迪康金诺生物技术有限公司 Method for constructing hybridization capture sequencing library and application

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103806111A (en) * 2012-11-15 2014-05-21 深圳华大基因科技有限公司 Construction method and application of high-throughout sequencing library
WO2017004083A1 (en) * 2015-06-29 2017-01-05 The Regents Of The University Of California Methods of producing nucleic acid libraries and compositions and kits for practicing same
WO2017054302A1 (en) * 2015-09-30 2017-04-06 中国农业科学院深圳农业基因组研究所 Sequencing library, and preparation and use thereof
CN108060191A (en) * 2017-11-07 2018-05-22 深圳华大基因科技有限公司 A kind of method, library constructing method and the kit of double stranded nucleic acid fragment adjunction head
CN110409001A (en) * 2019-07-25 2019-11-05 北京贝瑞和康生物技术有限公司 The method and kit in a kind of building capture library

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
MATTHIAS MEYER等: "Illumina Sequencing Library Preparation for Highly Multiplexed Target Capture and Sequencing", COLD SPRING HARB PROTOC, vol. 2010, no. 6, pages 1 - 10 *
何雨琦等: "用于Illumina测序平台不同试剂盒制备RNA-seq文库的方法比较", 基因组学与应用生物学, vol. 36, no. 11, pages 4607 - 4615 *
陈丹等: "用于高通量测序的基因组靶序列捕获方法的建立", 遗传, vol. 32, no. 12, pages 1296 - 1303 *

Also Published As

Publication number Publication date
CN110872610A (en) 2020-03-10
CN110872610B (en) 2022-11-29
CN112877403B (en) 2023-11-03

Similar Documents

Publication Publication Date Title
CN110872610B (en) Method for constructing sequencing library of target sequence
US20210254148A1 (en) Measurement of nucleic acid variants using highly-multiplexed error-suppressed deep sequencing
EP3763825B1 (en) High multiplex pcr with molecular barcoding
EP3192900B1 (en) Method for constructing nucleic acid single-stranded cyclic library and reagents thereof
CN106801050B (en) Construction method of circular RNA high-throughput sequencing library and kit thereof
US10590471B2 (en) Target enrichment by single probe primer extension
TW201321518A (en) Method of micro-scale nucleic acid library construction and application thereof
CN107893100A (en) A kind of unicellular mRNA reverse transcriptions and the method for amplification
CN111808854B (en) Balanced joint with molecular bar code and method for quickly constructing transcriptome library
KR20180098412A (en) Profiling of deep-seated sequences of tumors
US20230183797A1 (en) Generation of single-stranded circular dna templates for single molecule sequencing
AU2021209101A1 (en) Methods of targeted sequencing
CN111549380B (en) Kit for constructing double-stranded RNA sequencing library and application thereof
CN111748637A (en) SNP molecular marker combination, multiplex composite amplification primer set, kit and method for genetic relationship analysis and identification
CN112410331A (en) Linker with molecular label and sample label and single-chain library building method thereof
CN113293204A (en) Primer composition, kit and method for detecting microsatellite instability based on next-generation sequencing platform
CN112259165B (en) Method and system for detecting microsatellite instability state
CN108342385A (en) A kind of connector and the method that sequencing library is built by way of high efficiency cyclisation
CN116377099A (en) Primer, kit and method for detecting fever with hemorrhagic disease pathogen
CN111793623A (en) Typing genetic marker composition, kit, identification system and typing method of 62 multi-allelic SNP-NGS
CN114836415B (en) Preparation of MGI platform transposase double-end tag library
CN112708665B (en) Construction method and kit for lung cancer polygenic mutation sequencing library
WO2023201487A1 (en) Adapter, adapter ligation reagent, kit, and library construction method
US20200291453A1 (en) Amplicon generation
US20200208198A1 (en) Methods of assaying nucleic acids of low quantities using a buffer nucleic acid

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20211020

Address after: 352300 R & D building 7, No. 33, Donghu Road, digital Fujian Industrial Park, Changle District, Fuzhou City, Fujian Province

Applicant after: Fujian Herui Gene Technology Co.,Ltd.

Address before: No.28, Fazhan Road, Wenwusha Town, Changle District, Fuzhou City, Fujian Province, 350200

Applicant before: Fujian Herui Jingchuang Gene Technology Co.,Ltd.

TA01 Transfer of patent application right
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 40051271

Country of ref document: HK

GR01 Patent grant
GR01 Patent grant