EP3464634A1 - Molecular tagging methods and sequencing libraries - Google Patents

Molecular tagging methods and sequencing libraries

Info

Publication number
EP3464634A1
Authority
EP
European Patent Office
Prior art keywords
nucleic acid
sequence
acid sequence
sample
primer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP17803528.3A
Other languages
German (de)
French (fr)
Other versions
EP3464634B1 (en)
EP3464634A4 (en)
Inventor
Muhammed MURTAZA
Maria DE LAS NIEVES PERDIGONES BORDERIAS
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Translational Genomics Research Institute TGen
Original Assignee
Translational Genomics Research Institute TGen
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Translational Genomics Research Institute TGen
Priority to EP21151500.2A (EP3910068A1)
Publication of EP3464634A1
Publication of EP3464634A4
Application granted
Publication of EP3464634B1
Legal status: Active
Anticipated expiration

Classifications

    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12P FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P19/00 Preparation of compounds containing saccharide radicals
    • C12P19/26 Preparation of nitrogen-containing carbohydrates
    • C12P19/28 N-glycosides
    • C12P19/30 Nucleotides
    • C12P19/34 Polynucleotides, e.g. nucleic acids, oligoribonucleotides
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00 Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09 Recombinant DNA-technology
    • C12N15/10 Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1096 Processes for the isolation, preparation or purification of DNA or RNA cDNA Synthesis; Subtracted cDNA library construction, e.g. RT, RT-PCR
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00 Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09 Recombinant DNA-technology
    • C12N15/63 Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/66 General methods for inserting a gene into a vector to form a recombinant vector using cleavage and ligation; Use of non-functional linkers or adaptors, e.g. linkers containing the sequence for a restriction endonuclease
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Q MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Q MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806 Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Q MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844 Nucleic acid amplification reactions
    • C12Q1/6853 Nucleic acid amplification reactions using modified primers or templates
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Q MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844 Nucleic acid amplification reactions
    • C12Q1/6853 Nucleic acid amplification reactions using modified primers or templates
    • C12Q1/6855 Ligating adaptors
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Q MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844 Nucleic acid amplification reactions
    • C12Q1/686 Polymerase chain reaction [PCR]
    • C CHEMISTRY; METALLURGY
    • C40 COMBINATORIAL TECHNOLOGY
    • C40B COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B40/00 Libraries per se, e.g. arrays, mixtures
    • C40B40/04 Libraries containing only organic compounds
    • C40B40/06 Libraries containing nucleotides or polynucleotides, or derivatives thereof
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Q MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2525/00 Reactions involving modified oligonucleotides, nucleic acids, or nucleotides
    • C12Q2525/10 Modifications characterised by
    • C12Q2525/191 Modifications characterised by incorporating an adaptor
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Q MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00 Oligonucleotides characterized by their use
    • C12Q2600/16 Primer sets for multiplex assays

Definitions

  • sequence listing is submitted electronically via EFS-Web as an ASCII-formatted sequence listing with a file named "91482_216_SeqList_ST25.txt" created on May 23, 2017, and having a size of 1 kilobyte, and is filed concurrently with the specification.
  • sequence listing contained in this ASCII-formatted document is part of the specification and is herein incorporated by reference in its entirety.
  • the present invention is related to methods of molecular tagging of nucleic acids, for example, for the preparation of sequencing libraries.
  • the invention is also directed to tagged sequencing libraries.
  • Molecular barcoding or tagging involves attaching a unique or degenerate oligonucleotide label to each template molecule in the first or early steps of sequencing library preparation so that low-abundance variant signals may be distinguished from noise introduced during the process.
  • the process has recently been described as a viable approach for detection and quantification of low-abundance variants in complex mixtures of nucleic acids. The overall goal is to improve the sensitivity and accuracy of next-generation sequencing for identifying, detecting, and quantifying nucleic acid molecules of any given type (or carrying a variant).
  • existing methods are still incapable of tagging target sequences that are of low abundance. Most current methods have limited efficiency in capturing molecules and require a high amount of input molecules to achieve adequate sensitivity and accuracy.
  • the invention is directed to methods of adding oligonucleotide tags to a nucleic acid sequence and of producing a sequencing library and directed to a sequencing library comprising nucleic acid sequences tagged with an adapter oligonucleotide at the 3'-end.
  • the methods of adding oligonucleotide tags to a nucleic acid sequence in a sample comprise ligating an adapter oligonucleotide to the 3'-end of the nucleic acid sequence, wherein the adaptor oligonucleotide comprises a stem-loop intramolecular nucleotide base pairing, a hydroxyl group at the 3'-end, a phosphate at the 5'-end, a random region complementary to the nucleic acid sequence, and a random region in the loop comprising the molecular barcode.
  • the methods of producing a sequencing library comprise ligating an adapter oligonucleotide to the 3'-end of the nucleic acid sequence to produce a hybrid sequence comprising the sequence of interest on the nucleic acid sequence and the adapter oligonucleotide, wherein the adaptor oligonucleotide comprises: a stem-loop intramolecular nucleotide base pairing, a hydroxyl group at the 3'-end, a phosphate at the 5'-end, a random region complementary to the nucleic acid sequence, and a random region in the loop comprising the molecular barcode; and amplifying the hybrid sequence with a first set of primers.
  • the first set of primers comprises a forward universal primer and a reverse universal sample barcoding primer, wherein the reverse universal sample barcoding primer comprises a sample barcode.
  • amplifying the hybrid sequence with the first set of primers produces a barcoded sequence.
  • the first set of primers comprises a target specific primer and a reverse universal primer.
  • amplifying the hybrid sequence with the first set of primers produces a target specific sequence.
  • the methods further comprise amplifying the target specific sequence with a second set of primers comprising a forward universal primer and sample barcoding primer to produce a barcoded sequence.
  • the methods further comprise amplifying the target-specific sequence with a nested target specific primer and the universal primer prior to amplification with the second set of primers.
  • the methods of the invention comprise a pre-amplification step to increase the sample number.
  • the methods, prior to the ligation step, comprise annealing a first universal primer to the nucleic acid sequence in the sample, wherein the first universal primer is complementary to a sequence of interest on the nucleic acid sequence, and then linearly amplifying the nucleic acid sequence.
  • the nucleic acid in the sample is fractionated.
  • the methods comprise cleaning up after each amplification step with exonuclease and alkaline phosphatase.
  • the invention relates to a method of adding oligonucleotide tags to a nucleic acid sequence in a sample, the method comprising the steps of: annealing a first universal primer to the nucleic acid sequence in the sample, wherein the first universal primer is complementary to a sequence of interest on the nucleic acid sequence; linearly amplifying the nucleic acid sequence; and ligating an adapter oligonucleotide to the 3'-end of the nucleic acid sequence, wherein the adaptor oligonucleotide comprises: a stem-loop intramolecular nucleotide base pairing; a hydroxyl group at the 3'-end; a phosphate at the 5'-end; a random region complementary to the nucleic acid sequence; and a random region in the loop comprising a molecular barcode.
  • the invention relates to a method of producing a sequencing library, the method comprising the steps of: annealing a first universal primer to the nucleic acid sequence in the sample, wherein the first universal primer is complementary to a sequence of interest on the nucleic acid sequence; linearly amplifying the nucleic acid sequence; ligating an adapter oligonucleotide to the 3'-end of the nucleic acid sequence to produce a hybrid sequence comprising the sequence of interest on the nucleic acid sequence and the adapter oligonucleotide, wherein the adaptor oligonucleotide comprises: a stem-loop intramolecular nucleotide base pairing; a hydroxyl group at the 3'-end; a phosphate at the 5'-end; a random region complementary to the nucleic acid sequence; and a random region in the loop comprising a molecular barcode; and amplifying the hybrid sequence with a first set of primers.
  • the invention is directed to a method of producing a sequencing library, the method comprising the steps of: ligating an adapter oligonucleotide to the 3'-end of a nucleic acid sequence to produce a hybrid sequence comprising a sequence of interest on the nucleic acid sequence and the adapter oligonucleotide, wherein the adaptor oligonucleotide comprises: a stem-loop intramolecular nucleotide base pairing; a hydroxyl group at the 3'-end; a phosphate at the 5'-end; a random region complementary to the nucleic acid sequence; and a random region in the loop comprising a molecular barcode; and amplifying the hybrid sequence with a first set of primers.
  • the first set of primers comprises a forward universal primer and a reverse universal sample barcoding primer and the reverse universal sample barcoding primer comprises a sample barcode; and amplifying the hybrid sequence with the first set of primers produces a barcoded sequence.
  • the first set of primers comprises a target specific primer and a reverse universal primer and amplifying the hybrid sequence with the first set of primers produces a target specific sequence; and the method further comprises amplifying the target specific sequence with a second set of primers comprising a forward universal primer and sample barcoding primer to produce a barcoded sequence.
  • the reverse universal primer comprises the nucleotide sequence of SEQ ID NO: 2.
  • the forward universal primer comprises the nucleotide sequence of SEQ ID NO: 1.
  • the sample barcoding primer comprises a 5' adapter sequence, a 3' region complementary to the reverse universal primer, and a sample index sequence between the 5' adapter sequence and 3' region complementary to the reverse universal primer.
  • the sample barcoding primer comprises the nucleotide sequence of SEQ ID NO: 4.
  • the method further comprises amplifying the target-specific sequence with a nested target specific primer and the universal primer prior to amplification with the second set of primers.
  • the nucleic acid in the sample is fractionated. In one embodiment, the nucleic acid in the sample is fractionated to fragments between about 100 bp and about 500 bp. In another embodiment, the nucleic acid in the sample is fractionated to fragments between about 250 bp and about 350 bp. In yet another embodiment, the nucleic acid in the sample is fractionated to fragments of about 300 bp.
  • ligating the adapter oligonucleotide to the 3'-end of the nucleic acid sequence takes place between -20°C and 40°C. In one aspect, ligating the adapter oligonucleotide to the 3'-end of the nucleic acid sequence takes place between 0°C and 40°C. In another aspect, ligating the adapter oligonucleotide to the 3'-end of the nucleic acid sequence takes place between 10°C and 30°C.
  • the method further comprises cleaning up the amplified sequence with exonuclease and alkaline phosphatase following each amplifying step.
  • the adapter oligonucleotide is ligated to the 3' end of the nucleic acid sequence with a DNA ligase.
  • the adaptor oligonucleotide further comprises a 3' overhang and the 3' overhang comprises the region complementary to the nucleic acid sequence.
  • the region complementary to the nucleic acid sequence is complementary to the 3'-end of the nucleic acid sequence.
  • the stem-loop intramolecular nucleotide base pairing of the adaptor oligonucleotide forms a stem at least 6 nucleotide pairs long. In another embodiment, the stem comprises at least 1 mismatched pair.
  • the stem-loop intramolecular nucleotide base pairing of the adaptor oligonucleotide forms a loop.
  • the loop of the adapter oligonucleotide comprises a primer-binding region for a second universal primer.
  • the invention is directed to a sequencing library comprising a nucleic acid sequence tagged with an adapter oligonucleotide at the 3'-end produced with the methods disclosed herein.
  • the nucleic acid sequences comprise binding regions for a pair of universal primers, and amplification with the pair of universal primers produces an amplicon comprising a sequence of interest for the sequencing library.
  • FIG. 1 depicts the generic structure of an exemplary adapter oligonucleotide.
  • FIGs. 2A-C depict schematics of the methods of molecular tagging of nucleic acids.
  • FIG. 2B depicts a variation that utilizes target specific primers as compared to the schematic of FIG. 2A.
  • FIG. 2C depicts a variation in how the sample barcode is attached as compared to the schematic of FIG. 2A.
  • FIG. 3 depicts the expected results of nucleic acid tagging process shown in FIGs. 2A-C.
  • FIG. 4 is a schematic of the steps described in Example A.
  • FIG. 5 depicts the observed result from the protocol of Example
  • "stem-loop" refers to a structure formed when two regions of the same strand of nucleic acids can base-pair to form a double helix that ends in an unpaired loop. The two regions are usually complementary when read in opposite directions. This structure is also known as a hairpin or a hairpin loop.
  • "complementary" in reference to nucleic acid sequences refers to nucleic acid base sequences that can form a double-stranded structure by matching base pairs.
  • the degree of complementarity between the sequences does not have to be 100%; for example, the degree of complementarity may be at least 95%, at least 90%, or at least 85%.
  • a "nick" in a strand is a break in the phosphodiester bond between two nucleotides in the backbone in one of the strands of a duplex between a sense and an antisense strand.
  • a "gap" in a strand is a break between two nucleotides in the single strand.
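
The definition of complementarity above allows for less than 100% matching (for example at least 85%, 90%, or 95%). As a minimal sketch of how such a degree of complementarity could be computed, the following Python assumes two equal-length DNA strands aligned antiparallel without gaps and counts Watson-Crick pairs only. The function names are hypothetical and are not taken from the specification.

```python
# Illustrative only: helper names and the gap-free, equal-length alignment
# are assumptions made for this sketch.
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def percent_complementarity(strand_a: str, strand_b: str) -> float:
    """Percentage of positions at which two equal-length strands,
    aligned antiparallel without gaps, form Watson-Crick pairs."""
    if not strand_a or len(strand_a) != len(strand_b):
        raise ValueError("strands must be non-empty and of equal length")
    # strand_b pairs perfectly with strand_a when its reverse complement
    # is identical to strand_a, so compare position by position.
    target = reverse_complement(strand_b)
    matches = sum(1 for x, y in zip(strand_a.upper(), target) if x == y)
    return 100.0 * matches / len(strand_a)


if __name__ == "__main__":
    a = "ACGTACGTACGTACGTACGT"
    b = reverse_complement(a)
    print(percent_complementarity(a, b))
```

A value returned by `percent_complementarity` can then be compared against whichever threshold (85%, 90%, 95%) is relevant to the embodiment under consideration.
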
  • the invention relates to methods of molecular tagging of nucleic acids, for example, for the preparation of sequencing libraries, and to the resulting tagged sequencing libraries.
  • the sequencing library and methods are particularly useful for informatics analysis of target sequences that are present at low fractions in complex mixtures of nucleic acid sequences, for example cell-free DNA in biological samples such as plasma or urine. Another example of a complex mixture of nucleic acid sequences is a low-input, degraded forensic sample.
  • identification, quantification, and detection of cancer mutations in plasma can be used for screening and early detection of cancer, monitoring treatment response and progression, molecular stratification and assessment of clonal evolution and treatment resistance.
  • this invention enables detection of any genomic variant in circulation or in tissue. In addition, it will enable identification, detection and quantification of variants in any complex mixtures of human or non-human samples such as pathogens.
  • This invention can be scaled for multiplexing such that sequencing of multiple genomic regions is possible using this approach, allowing for simultaneous identification, detection and quantification of multiple mutations. It is also readily customizable and can be implemented on an ad hoc basis or developed to focus on specific scenarios (for example cancer diagnostics using a panel of genes).
  • Methods of tagging nucleic acid sequences comprise ligation of an adapter oligonucleotide.
  • the nucleic acid sequence in the sample has been fractionated, for example, to fragments between 100 and 500 bp, between 100 and 300 bp, between 250 and 350 bp, or about 300 bp.
  • the methods further comprise a linear amplification step, and in some aspects, the linear amplification step takes place before the ligation step.
  • the methods further comprise PCR-based steps (see FIGs. 2A-C). The methods rely on the linear amplification step and/or the PCR-based steps to increase the efficiency of conversion of template molecules into a sequencing-ready library.
  • the linear amplification step comprises annealing a primer to the nucleic acid sequences in the sample and linearly amplifying the nucleic acid sequence.
  • the linear amplification step comprises at least 5 cycles, at least 6 cycles, at least 7 cycles, at least 8 cycles, at least 9 cycles, at least 10 cycles, at least 11 cycles, at least 12 cycles, at least 13 cycles, at least 14 cycles, or at least 15 cycles.
  • the linear amplification step comprises no more than 15 cycles or no more than 10 cycles.
  • the linear amplification step comprises about 10 cycles of amplification.
  • the primer is complementary to a sequence of interest on the nucleic acid sequence (see PCR1 of FIG. 2A and FIG. 4).
  • sequence of interest on the nucleic acid sequence is a region that is proximal to the region of interest on the nucleic acid sequence.
  • the molecular barcode or tag is introduced to the nucleic acid sequences by ligation with an adapter oligonucleotide.
  • the adapter oligonucleotide is ligated to the 3'-end of the nucleic acid sequence or the 3'-end of the linearly amplified copy of the nucleic acid sequence.
  • the ligation temperature is between -20°C and 40°C, between 10°C and 40°C, or between 10°C and 30°C.
  • the adaptor oligonucleotide comprises a hairpin structure.
  • the adaptor oligonucleotide comprises a constant stem region, a random molecular tag, a sequence for a universal primer to bind, and a random complementary sequence.
  • the random molecular tag may be a random oligonucleotide.
  • the random molecular tag is at least 6 nucleotides long, at least 7 nucleotides long, at least 8 nucleotides long, at least 9 nucleotides long, or at least 10 nucleotides long, while the random complementary sequence is at least 4 nucleotides long, at least 5 nucleotides long, at least 6 nucleotides long, at least 7 nucleotides long, or at least 8 nucleotides long.
  • the random molecular tag is no more than 9 nucleotides long, no more than 10 nucleotides long, no more than 11 nucleotides long, or no more than 12 nucleotides long, while the random complementary sequence is no more than 6 nucleotides long, no more than 7 nucleotides long, or no more than 8 nucleotides long.
  • the random molecular tag is 9 nucleotides long while the random complementary sequence is 6 nucleotides long, as shown in FIG. 1.
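
To put the tag lengths above in perspective, the sketch below computes how many distinct sequences a random region of a given length can take, and the chance that two template molecules receive the same tag. It assumes that every position is drawn independently and uniformly from the four bases, which is an idealization introduced here and not a statement from the specification. The function names are hypothetical.

```python
def tag_diversity(length: int, alphabet_size: int = 4) -> int:
    """Number of distinct random tags of the given length."""
    if length < 0:
        raise ValueError("length must be non-negative")
    return alphabet_size ** length


def collision_probability(n_molecules: int, tag_length: int) -> float:
    """Probability that at least two of n molecules carry the same tag,
    assuming independent, uniformly random tags (birthday problem)."""
    diversity = tag_diversity(tag_length)
    p_all_unique = 1.0
    for i in range(n_molecules):
        # The i-th molecule must avoid the i tags already in use.
        p_all_unique *= max(diversity - i, 0) / diversity
    return 1.0 - p_all_unique


if __name__ == "__main__":
    print(tag_diversity(9))  # 4**9 = 262144 possible 9-nt molecular tags
    print(tag_diversity(6))  # 4**6 = 4096 possible 6-nt complementary regions
    print(collision_probability(100, 9))
```

Under these assumptions a 9-nucleotide tag offers 262,144 possible sequences. The collision estimate is only a rough guide, since it treats every tag as equally likely.
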
  • the adaptor oligonucleotide preferably comprises a phosphate at the 5'-end. In some embodiments, the adapter oligonucleotide comprises a DNA 5'-end and an RNA 3'-end.
  • the 3' end of the adapter oligonucleotide may be blocked using an oligonucleotide modification not extendable or cleavable by polymerase, for example a C3 spacer.
  • the 3' end of the adaptor oligonucleotide comprises an overhang after the random complementary sequence for improved stability.
  • the 3' overhang comprises the region complementary to the nucleic acid sequence.
  • the 3'-overhang region comprises at least 1 nucleotide, at least 2 nucleotides, at least 3 nucleotides, at least 4 nucleotides, at least 5 nucleotides, at least 6 nucleotides, at least 7 nucleotides, at least 8 nucleotides, at least 9 nucleotides, at least 10 nucleotides, at least 11 nucleotides, at least 12 nucleotides, at least 13 nucleotides, at least 14 nucleotides, at least 15 nucleotides, at least 20 nucleotides, at least 25 nucleotides, at least 30 nucleotides, at least 35 nucleotides, or at least 40 nucleotides that are complementary to sequences found in the nucleic acid sequence when the nucleic acid sequence and the adapter oligonucleotide are hybridized to one another. In this manner, the 3' overhang region of the adapter oligonu
  • the 3'-overhang region comprises at least 1 nucleotide, preferably at least 2 nucleotides, preferably at least 3 nucleotides, preferably at least 4 nucleotides, and preferably at least 5 nucleotides that are mismatched with nucleotides found in the nucleic acid sequence when the nucleic acid sequence and adapter oligonucleotide are hybridized to one another.
  • the hybridization between the nucleic acid sequence and the adapter oligonucleotide forms a structure that comprises a nick, wherein the nick can be ligated by either enzymatic or chemical means.
  • the hybridization between the nucleic acid sequence and the adapter oligonucleotide forms a structure that comprises a gap, wherein the gap can be ligated by either enzymatic or chemical means.
  • the hybridization between the nucleic acid sequence and the adapter oligonucleotide forms a stem-loop structure.
  • the stem-loop structure is stable at temperatures as high as 35°C, as high as 40°C, as high as 45°C, as high as 50°C, as high as 55°C, as high as 60°C, as high as 65°C, as high as 70°C, as high as 75°C, as high as 80°C, as high as 85°C, or more. Accordingly, the design of the adaptor oligonucleotide should take care to utilize sequences that ensure the formed stem-loop is thermostable.
  • the adapter oligonucleotide is a single-stranded oligonucleotide having a double-stranded portion formed of two self-complementary segments, optionally having a loop at one end, and a short overhanging single strand at the other.
  • a hairpin is defined as a double-helical region formed by nucleotide base-pairing between adjacent, inverted, at least partially complementary sequences in a single-stranded nucleic acid, preferably within the same single stranded nucleic acid.
  • the stem structure preferably maintains its structure prior to and under conditions suitable for hybridization between the nucleic acid sequence and the adapter oligonucleotide.
  • the nick or gap formed through the hybridization between the nucleic acid sequence and the adapter oligonucleotide can be fixed by way of ligation.
  • the donor molecule is designed to also have the stem structure be retained under conditions where the nick or gap is ligated by either enzymatic or chemical means.
  • a hybrid molecule is created by the ligation between the nucleic acid sequence and the adapter oligonucleotide.
  • the intramolecular stem structure preferably maintains the stem structure under conditions suitable for hybridization between the donor and acceptor molecule.
  • the stem structure is designed to maintain its structure under conditions where the acceptor and donor molecule hybridize.
  • the intramolecular stem structure of the adapter oligonucleotide has reduced stability where the stem structure is unfolded.
  • the stem structure can be designed so that the stem structure can be relieved of its intramolecular base pairing and resemble more of a linear molecule.
  • the adapter oligonucleotide is designed where the relief of the intramolecular stem structure is thermodynamically favored over the intramolecular stem structure.
  • some implementations comprise amplifying the ligated nucleic acid product.
  • the stem-loop structure does not impair the amplification step, because the intramolecular stem structure may be undone by raising the temperature or adding a chemical denaturant.
  • a probe or primer can be used to sequence or amplify at least a portion of the sequence present in the acceptor molecule.
  • the stem can comprise at least 3 nucleotide pairs, at least 4 nucleotide pairs, at least 5 nucleotide pairs, at least 6 nucleotide pairs, at least 7 nucleotide pairs, at least 8 nucleotide pairs, at least 9 nucleotide pairs, at least 10 nucleotide pairs, at least 11 nucleotide pairs, at least 12 nucleotide pairs, at least 13 nucleotide pairs, at least 14 nucleotide pairs, at least 15 nucleotide pairs, at least 20 nucleotide pairs, at least 25 nucleotide pairs, at least 30 nucleotide pairs, at least 35 nucleotide pairs, at least 40 nucleotide pairs, at least 45 nucleotide pairs, at least 50 nucleotide pairs, at least 55 nucleotide pairs, at least 60 nucleotide pairs, at least 65 nucleotide pairs, at least 70 nucleotide pairs, at least 10 nucleotide pairs
  • the stem region comprises at least 1 mismatched pair, at least 2 mismatched pairs, at least 3 mismatched pairs, at least 4 mismatched pairs, at least 5 mismatched pairs, at least 6 mismatched pairs, at least 7 mismatched pairs, at least 8 mismatched pairs, at least 9 mismatched pairs, at least 10 mismatched pairs, at least 11 mismatched pairs, at least 12 mismatched pairs, at least 13 mismatched pairs, at least 14 mismatched pairs, at least 15 mismatched pairs, at least 20 mismatched pairs, at least 25 mismatched pairs, at least 30 mismatched pairs, at least 35 mismatched pairs, at least 40 mismatched pairs, at least 45 mismatched pairs, or at least 50 mismatched pairs.
  • the amount of mismatch pairs in the stem should be sufficient to make the stem structure unstable at a high temperature of at least 60°C, at least 65°C, at least 70°C, at least 75°C, at least 80°C, at least 85°C, at least 90°C, at least 95°C, at least 96°C, at least 97°C, at least 98°C, or at least 99°C.
  • the loop structure of the adapter oligonucleotide can comprise any number of nucleotides.
  • the loop structure comprises at least 1 nucleotide, at least 2 nucleotides, at least 3 nucleotides, at least 4 nucleotides, at least 5 nucleotides, at least 6 nucleotides, at least 7 nucleotides, at least 8 nucleotides, at least 9 nucleotides, at least 10 nucleotides, at least 11 nucleotides, at least 12 nucleotides, at least 13 nucleotides, at least 14 nucleotides, at least 15 nucleotides, at least 20 nucleotides, at least 25 nucleotides, at least 30 nucleotides, at least 35 nucleotides, or at least 40 nucleotides.
  • the loop comprises about 2-30 nucleotides.
  • ligation and hybridization are performed using temperature cycling ligation varying in range from -20°C to 40°C, for example, -20°C to 4°C, -20°C to 0°C, 20°C to 40°C, or 22°C to 37°C.
  • 3. PCR-based steps for generating a sequencing library.
  • Target-specific and/or universal primers are used to amplify and enrich a nucleic acid sequence of interest. Multiple rounds of PCR are used to achieve adequate enrichment and selection of targeted nucleic acid sequence while minimizing off-target non-specific reads and adapter dimers.
  • the PCR round comprises at least 15 cycles, at least 20 cycles, or at least 25 cycles.
  • the linear amplification step comprises no more than 35 cycles or no more than 30 cycles.
  • the linear amplification step comprises between 20 and 30 amplification cycles, such as 30 cycles. Additional rounds of PCR are used to introduce sample-specific indexes to enable optimum utilization of downstream sequencing.
  • a nested PCR strategy may be used to enrich for on-target reads and reduce off-targeted non-specific amplification and adapter dimers.
  • the methods further comprise detecting the sequence of interest in the generated sequencing library.
  • the invention is directed to a sequencing library comprising nucleic acid sequences tagged with an adapter oligonucleotide and regions that are binding sites for a pair of universal primers.
  • [0071] Denature DNA by incubation at 37°C for 10 minutes and then at 95°C for 3 minutes. Immediately after the end of incubation at 95°C, incubate in an ice water bath. While in the ice water bath, add the oligo (0.5 µL from 100 µM stock) and the master mix (11 µL from recipe described in Table 2) to each tube.
  • Magnetic beads are used to clean up the ligation reaction (1.8 ratio of beads to DNA by volume). The total volume of the cleanup reaction per tube is 25 µL. The specific steps for clean up with magnetic beads are as follows:
  • the first PCR (named PCR1 in FIG. 4) is performed using the primers depicted in Table 4.
  • the master mix for the PCR reaction is prepared according to Table 5.
  • Table 6 describes the cycling conditions for the first PCR.
  • ExoSAP clean up is performed using 10 µL of PCR product and 4 µL of ExoSAP-IT for 1 replicate of each of the above. ExoSAP incubation was set up per manufacturer's instructions, which is incubation at 37°C for 30 minutes followed by incubation at 85°C for 15 minutes.
  • [0077] 8. The second PCR reaction (named PCR2 in FIG. 4) is performed using the primers depicted in Table 7. The contents of each PCR reaction are described in Table 8. Table 9 describes the cycling conditions for the first PCR.
  • ExoSAP clean up is performed. This time 25 µL of PCR product and 10 µL of ExoSAP-IT are used per reaction. ExoSAP incubation was set up per manufacturer's instructions, which is incubation at 37°C for 30 minutes followed by incubation at 85°C for 15 minutes. The products of the cleanup may be stored at 4°C until the next step is performed.
  • Magnetic beads are used to clean up the PCR products (1.2 ratio of beads to DNA by volume). The specific process is the same as that described in step 5 but with the amount of beads adjusted. The elution step is adjusted to 16 µL where about 13 µL per reaction.
  • the library is sequenced using MiSeq. All samples are put at 4 nM.
  • the pool contains 4 nM of each sample included for MiSeq with a volume of 10 µL per sample. For the blank sample, 1 µL was placed in the pool.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Genetics & Genomics (AREA)
  • Molecular Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • General Engineering & Computer Science (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Biochemistry (AREA)
  • Microbiology (AREA)
  • General Health & Medical Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • Analytical Chemistry (AREA)
  • Immunology (AREA)
  • Biomedical Technology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Plant Pathology (AREA)
  • Medicinal Chemistry (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention provides a tagged sequencing library and methods for tagging low abundance target sequences and generating a sequencing library for detecting low abundance target sequences.

Description

MOLECULAR TAGGING METHODS AND SEQUENCING LIBRARIES
RELATED APPLICATION DATA
[0001] This application claims priority to and the benefit of U.S. Provisional Application No. 62/340,954 filed May 24, 2016, the contents of which are hereby incorporated by reference in their entirety.
REFERENCE TO SEQUENCE LISTING SUBMITTED ELECTRONICALLY
[0002] The official copy of the sequence listing is submitted electronically via EFS-Web as an ASCII-formatted sequence listing with a file named "91482_216_SeqList_ST25.txt" created on May 23, 2017, and having a size of 1 kilobyte, and is filed concurrently with the specification. The sequence listing contained in this ASCII-formatted document is part of the specification and is herein incorporated by reference in its entirety.
FIELD OF THE INVENTION
[0003] The present invention is related to methods of molecular tagging of nucleic acids, for example, for the preparation of sequencing libraries. The invention is also directed to tagged sequencing libraries.
BACKGROUND OF THE INVENTION
[0004] Molecular barcoding or tagging involves attaching a unique or degenerate oligonucleotide label to each template molecule in the first or early steps of sequencing library preparation so that any low abundance variant signals may be distinguished from noise introduced during the process. The process has recently been described as a viable approach for detection and quantification of low abundance variants in complex mixtures of nucleic acids. The overall goal is to improve sensitivity and accuracy of next generation sequencing for identifying, detecting, and quantifying nucleic acid molecules of any given type (or carrying a variant). However, existing methods are still incapable of tagging target sequences that are of low abundance. Most current methods have limited efficiency in capturing molecules and require a high amount of input molecules to achieve adequate sensitivity and accuracy. These obstacles are particularly problematic in the early detection of cancer or monitoring in patients with early stage cancers. In a normal individual, only 3,000 to 5,000 copies of the genome are found in one milliliter of plasma. Within that, the occurrences of a copy of the genome containing a cancer mutation can be as low as 1 in 10,000 copies of the wild-type genome. In combination with the distortion factors from PCR and sequencing, early detection of a cancer-causing mutation is like trying to find the proverbial needle in a hay farm rather than a single haystack. Accordingly, there is a need for improving identification and detection of low abundance variants in the genome.
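The copy numbers quoted in the background can be combined into a rough estimate of how few mutant molecules a sample may contain. The following is a minimal illustrative sketch, not part of the disclosure; the function name is hypothetical, and it assumes that the expected mutant count is simply the product of genome copies per milliliter, mutant fraction, and plasma volume.

```python
def expected_mutant_copies(genome_copies_per_ml, mutant_fraction, plasma_ml=1.0):
    """Expected number of mutant genome copies in a plasma sample.

    Assumption (illustrative only): mutant copies scale linearly with
    genome copies per mL, mutant fraction, and plasma volume.
    """
    return genome_copies_per_ml * mutant_fraction * plasma_ml


# Range of genome copies per mL of plasma quoted in the background section.
for copies in (3000, 5000):
    n = expected_mutant_copies(copies, 1 / 10000)
    print(f"{copies} copies/mL -> about {n:.1f} expected mutant copies per mL")
```

Under these assumptions a single milliliter of plasma is expected to hold fewer than one mutant copy, which illustrates why efficient conversion of every input molecule matters.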
SUMMARY OF THE INVENTION
[0005] The invention is directed to methods of adding oligonucleotide tags to a nucleic acid sequence and of producing a sequencing library and directed to a sequencing library comprising nucleic acid sequences tagged with an adapter oligonucleotide at the 3'-end.
[0006] The methods of adding oligonucleotide tags to a nucleic acid sequence in a sample comprise ligating an adapter oligonucleotide to the 3'-end of the nucleic acid sequence, wherein the adaptor oligonucleotide comprises a stem-loop intramolecular nucleotide base pairing, a hydroxyl group at the 3'-end, a phosphate at the 5'-end, a random region complementary to the nucleic acid sequence, and a random region in the loop comprising the molecular barcode.
[0007] The methods of producing a sequencing library comprise ligating an adapter oligonucleotide to the 3'-end of the nucleic acid sequence to produce a hybrid sequence comprising the sequence of interest on the nucleic acid sequence and the adapter oligonucleotide, wherein the adaptor oligonucleotide comprises: a stem-loop intramolecular nucleotide base pairing, a hydroxyl group at the 3'-end, a phosphate at the 5'-end, a random region complementary to the nucleic acid sequence, and a random region in the loop comprising the molecular barcode; and amplifying the hybrid sequence with a first set of primers. In some implementations, the first set of primers comprises a forward universal primer and a reverse universal sample barcoding primer, wherein the reverse universal sample barcoding primer comprises a sample barcode. In these implementations, amplifying the hybrid sequence with the first set of primers produces a barcoded sequence. In other implementations, the first set of primers comprises a target specific primer and a reverse universal primer. In these implementations, amplifying the hybrid sequence with the first set of primers produces a target specific sequence, and the methods further comprise amplifying the target specific sequence with a second set of primers comprising a forward universal primer and sample barcoding primer to produce a barcoded sequence. In some aspects, the methods further comprise amplifying the target-specific sequence with a nested target specific primer and the universal primer prior to amplification with the second set of primers.
[0008] In some embodiments, the methods of the invention comprise a pre-amplification step to increase the sample number. Thus, prior to the ligation step, the methods comprise annealing a first universal primer to the nucleic acid sequence in the sample, wherein the first universal primer is complementary to a sequence of interest on the nucleic acid sequence and then linearly amplifying the nucleic acid sequence.
[0009] In some aspects, the nucleic acid in the sample is fractionated. In some implementations, the methods comprise cleaning up after each amplification step with exonuclease and alkaline phosphatase.
[0010] In other aspects, the invention relates to a method of adding oligonucleotide tags to a nucleic acid sequence in a sample, the method comprising the steps of: annealing a first universal primer to the nucleic acid sequence in the sample, wherein the first universal primer is complementary to a sequence of interest on the nucleic acid sequence; linearly amplifying the nucleic acid sequence; and ligating an adapter oligonucleotide to the 3'-end of the nucleic acid sequence, wherein the adaptor oligonucleotide comprises: a stem-loop intramolecular nucleotide base pairing; a hydroxyl group at the 3'-end; a phosphate at the 5'-end; a random region complementary to the nucleic acid sequence; and a random region in the loop comprising a molecular barcode.
[0011] In yet other aspects, the invention relates to a method of producing a sequencing library, the method comprising the steps of: annealing a first universal primer to the nucleic acid sequence in the sample, wherein the first universal primer is complementary to a sequence of interest on the nucleic acid sequence; linearly amplifying the nucleic acid sequence; ligating an adapter oligonucleotide to the 3'-end of the nucleic acid sequence to produce a hybrid sequence comprising the sequence of interest on the nucleic acid sequence and the adapter oligonucleotide, wherein the adaptor oligonucleotide comprises: a stem-loop intramolecular nucleotide base pairing; a hydroxyl group at the 3'-end; a phosphate at the 5'-end; a random region complementary to the nucleic acid sequence; and a random region in the loop comprising a molecular barcode; and amplifying hybrid sequence with a first set of primers.
[0012] In one embodiment, the invention is directed to a method of producing a sequencing library, the method comprising the steps of: ligating an adapter oligonucleotide to the 3'-end of a nucleic acid sequence to produce a hybrid sequence comprising a sequence of interest on the nucleic acid sequence and the adapter oligonucleotide, wherein the adaptor oligonucleotide comprises: a stem-loop intramolecular nucleotide base pairing; a hydroxyl group at the 3'-end; a phosphate at the 5'-end; a random region complementary to the nucleic acid sequence; and a random region in the loop comprising a molecular barcode; and amplifying the hybrid sequence with a first set of primers.
[0013] In some embodiments, the first set of primers comprises a forward universal primer and a reverse universal sample barcoding primer and the reverse universal sample barcoding primer comprises a sample barcode; and amplifying the hybrid sequence with the first set of primers produces a barcoded sequence.
[0014] In other embodiments, the first set of primers comprises a target specific primer and a reverse universal primer and amplifying the hybrid sequence with the first set of primers produces a target specific sequence; and the method further comprises amplifying the target specific sequence with a second set of primers comprising a forward universal primer and sample barcoding primer to produce a barcoded sequence.
[0015] In one aspect, the reverse universal primer comprises the nucleotide sequence of SEQ ID NO: 2.
[0016] In another aspect, the forward universal primer comprises the nucleotide sequence of SEQ ID NO: 1.
[0017] In certain aspects, the sample barcoding primer comprises a 5' adapter sequence, a 3' region complementary to the reverse universal primer, and a sample index sequence between the 5' adapter sequence and 3' region complementary to the reverse universal primer.
[0018] In one embodiment, the sample barcoding primer comprises the nucleotide sequence of SEQ ID NO: 4.
[0019] In yet other embodiments, the method further comprises amplifying the target-specific sequence with a nested target specific primer and the universal primer prior to amplification with the second set of primers.
[0020] In certain embodiments, the nucleic acid in the sample is fractionated. In one embodiment, the nucleic acid in the sample is fractionated to fragments between about 100 bp and about 500 bp. In another embodiment, the nucleic acid in the sample is fractionated to fragments between about 250 bp and about 350 bp. In yet another embodiment, the nucleic acid in the sample is fractionated to fragments of about 300 bp.
[0021] In some aspects, ligating the adapter oligonucleotide to the 3'-end of the nucleic acid sequence takes place between -20°C and 40°C. In one aspect, ligating the adapter oligonucleotide to the 3'-end of the nucleic acid sequence takes place between 0°C and 40°C. In another aspect, ligating the adapter oligonucleotide to the 3'-end of the nucleic acid sequence takes place between 10°C and 30°C.
[0022] In some implementations, the method further comprises cleaning up the amplified sequence with exonuclease and alkaline phosphatase following each amplifying step.
[0023] In other implementations, the adapter oligonucleotide is ligated to the 3' end of the nucleic acid sequence with a DNA ligase.
[0024] In one aspect, the adaptor oligonucleotide further comprises a 3' overhang and the 3' overhang comprises the region complementary to the nucleic acid sequence. In another aspect, the region complementary to the nucleic acid sequence is complementary to the 3'-end of the nucleic acid sequence.
[0025] In one embodiment, the stem-loop intramolecular nucleotide base pairing of the adaptor oligonucleotide forms a stem of at least 6 nucleotide pairs long. In another embodiment, the stem comprises at least 1 mismatched pair.
[0026] In one aspect, the stem-loop intramolecular nucleotide base pairing of the adaptor oligonucleotide forms a loop. In another aspect, the loop of the adapter oligonucleotide comprises a primer-binding region for a second universal primer.
[0027] In some embodiments, the invention is directed to a sequencing library comprising a nucleic acid sequence tagged with an adapter oligonucleotide at the 3'-end produced with the methods disclosed herein.
[0028] In certain aspects, the nucleic acid sequences comprise binding regions for a pair of universal primers and amplification with the pair of universal primers produces an amplicon comprising a sequence of interest for the sequencing library.
[0029] Additional objectives, advantages and novel features will be set forth in the description which follows or will become apparent to those skilled in the art upon examination of the drawings and detailed description which follows.
BRIEF DESCRIPTION OF THE DRAWINGS
[0030] FIG. 1 depicts the generic structure of an exemplary adapter oligonucleotide.
[0031] FIGs. 2A-C depict schematics of the methods of molecular tagging of nucleic acids. FIG. 2B depicts a variation that utilizes target specific primers as compared to the schematic of FIG. 2A. FIG. 2C depicts a variation in how the sample barcode is attached as compared to the schematic of FIG. 2A.
[0032] FIG. 3 depicts the expected results of nucleic acid tagging process shown in FIGs. 2A-C.
[0033] FIG. 4 is a schematic of the steps described in Example A.
[0034] FIG. 5 depicts the observed result from the protocol of Example A.
[0035] The headings used in the figures should not be interpreted to limit the scope of the claims.
DETAILED DESCRIPTION
[0036] In the following description, and for the purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the various aspects of the invention. It will be understood, however, by those skilled in the relevant arts, that the present invention may be practiced without these specific details. It should be noted that there are many different and alternative configurations, devices and technologies to which the disclosed inventions may be applied. The full scope of the inventions is not limited to the examples that are described below.
[0037] The singular forms "a," "an," and "the" include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to "a step" includes reference to one or more of such steps. Unless specifically noted, it is intended that the words and phrases in the specification and the claims be given their plain, ordinary, and accustomed meaning to those of ordinary skill in the applicable arts.
[0038] The term "stem-loop" as used herein in relation to nucleic acid structures refers to a structure formed when two regions of the same strand of nucleic acids can base-pair to form a double helix that ends in an unpaired loop. The two regions are usually complementary when read in opposite directions. This structure is also known as a hairpin or a hairpin loop.
[0039] As used herein, the term "complementary" in reference to nucleic acid sequences refers to nucleic acid base sequences that can form a double-stranded structure by matching base pairs. The degree of complementarity between the sequences does not have to be 100%; for example, the degree of complementarity may be at least 95%, at least 90%, or at least 85%.
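The degree of complementarity defined above can be illustrated with a short calculation. The sketch below is an assumption-laden example rather than a method from the disclosure: the function names are hypothetical, only Watson-Crick DNA pairs (A-T, G-C) are counted, and the two sequences are assumed to be of equal length and aligned antiparallel without gaps.

```python
# Watson-Crick partners for DNA bases (assumption: no wobble or modified bases).
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def percent_complementarity(seq_a, seq_b):
    """Percent of positions at which seq_a base-pairs with seq_b.

    Both sequences are written 5'->3' and paired antiparallel, so seq_a is
    compared position by position with the reverse complement of seq_b.
    """
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    target = reverse_complement(seq_b)
    matches = sum(a == t for a, t in zip(seq_a.upper(), target))
    return 100.0 * matches / len(seq_a)


# Hypothetical 4-nt example: three of four positions can pair.
print(percent_complementarity("AAAA", "TTTA"))
```

A result of 100 corresponds to a perfect duplex, while values such as 95, 90, or 85 correspond to the partially complementary sequences contemplated in the definition.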
[0040] As used herein, a "nick" in a strand is a break in the phosphodiester bond between two nucleotides in the backbone in one of the strands of a duplex between a sense and an antisense strand.
[0041] As used herein, a "gap" in a strand is a break between two nucleotides in the single strand.
[0042] The invention relates to methods of molecular tagging of nucleic acids, for example, for the preparation of sequencing libraries, and to the specially tagged sequencing libraries. The sequencing library and methods are particularly useful for informatics analysis of target sequences that are at low fractions in complex mixtures of nucleic acid sequences, for example cell-free DNA in biological samples, such as plasma or urine. Another complex mixture of nucleic acid sequences may be a low-input degraded forensic sample.
[0043] In the context of cancer applications, identification, quantification, and detection of cancer mutations in plasma (with or without knowledge of patient-specific cancer mutations a priori) can be used for screening and early detection of cancer, monitoring treatment response and progression, molecular stratification and assessment of clonal evolution and treatment resistance.
[0044] Outside of cancer, such as in non-malignant diseases, this invention enables detection of any genomic variant in circulation or in tissue. In addition, it will enable identification, detection and quantification of variants in any complex mixtures of human or non-human samples such as pathogens. This invention can be scaled for multiplexing such that sequencing of multiple genomic regions is possible using this approach, allowing for simultaneous identification, detection and quantification of multiple mutations. It is also readily customizable and can be implemented on an ad hoc basis or developed to focus on specific scenarios (for example cancer diagnostics using a panel of genes).
[0045] Methods of tagging nucleic acid sequences comprise ligation of an adapter oligonucleotide. In some implementations, the nucleic acid sequence in the sample has been fractionated, for example, to fragments between 100 and 500 bp, between 100 and 300 bp, between 250 and 350 bp, or about 300 bp. In some implementations, the methods further comprise a linear amplification step, and in some aspects, the linear amplification step takes place before the ligation step. To prepare a sequencing library, the methods further comprise PCR-based steps (see FIGs. 2A-C). The methods rely on the linear amplification step and/or the PCR-based steps to increase the efficiency of conversion of template molecules into a sequencing-ready library.
1. Linear Amplification.
[0046] The linear amplification step comprises annealing a primer to the nucleic acid sequences in the sample and linearly amplifying the nucleic acid sequence. In some implementations, the linear amplification step comprises at least 5 cycles, at least 6 cycles, at least 7 cycles, at least 8 cycles, at least 9 cycles, at least 10 cycles, at least 11 cycles, at least 12 cycles, at least 13 cycles, at least 14 cycles, or at least 15 cycles. In other implementations, the linear amplification step comprises no more than 15 cycles or no more than 10 cycles. For example, the linear amplification step comprises about 10 cycles of amplification.
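The effect of the cycle number on copy number can be sketched with an idealized model. This example is illustrative only and is not taken from the disclosure: the function names and the per-cycle `efficiency` parameter are assumptions, and the model treats single-primer (linear) amplification as copying only the original templates in each cycle, in contrast to two-primer PCR where products also serve as templates.

```python
def linear_copies(templates, cycles, efficiency=1.0):
    """New strands made by single-primer extension.

    Idealized assumption: only the original templates are copied, so each
    cycle adds at most one new strand per template.
    """
    return templates * cycles * efficiency


def exponential_copies(templates, cycles, efficiency=1.0):
    """Total strands after two-primer PCR, where products are re-copied."""
    return templates * (1 + efficiency) ** cycles


# Compare the cycle numbers mentioned for the linear amplification step.
for n in (5, 10, 15):
    print(n, linear_copies(1, n), exponential_copies(1, n))
```

In this simplified model, about 10 cycles of linear amplification yield roughly 10 copies per starting template, a modest increase that avoids products being re-copied from one another as in exponential PCR.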
[0047] The primer is complementary to a sequence of interest on the nucleic acid sequence (see PCR1 of FIG. 2A and FIG. 4). For example, the sequence of interest on the nucleic acid sequence is a region that is proximal to the region of interest on the nucleic acid sequence.
2. Ligation with adapter oligonucleotide.
[0048] The molecular barcode or tag is introduced to the nucleic acid sequences by ligation with an adapter oligonucleotide. The adapter oligonucleotide is ligated to the 3'-end of the nucleic acid sequence or the 3'-end of the linearly amplified copy of the nucleic acid sequence. In some implementations, the ligation temperature is between -20°C and 40°C, between 10°C and 40°C, or between 10°C and 30°C.
[0049] The adaptor oligonucleotide comprises a hairpin structure. In preferred embodiments, the adaptor oligonucleotide comprises a constant stem region, a random molecular tag, a sequence for a universal primer to bind, and a random complementary sequence. The random molecular tag may be a random oligonucleotide. In some embodiments, the random molecular tag is at least 6 nucleotides long, at least 7 nucleotides long, at least 8 nucleotides long, at least 9 nucleotides long, or at least 10 nucleotides long, while the random complementary sequence is at least 4 nucleotides long, at least 5 nucleotides long, at least 6 nucleotides long, at least 7 nucleotides long, or at least 8 nucleotides long. In other embodiments, the random molecular tag is no more than 9 nucleotides long, no more than 10 nucleotides long, no more than 11 nucleotides long, or no more than 12 nucleotides long, while the random complementary sequence is no more than 6 nucleotides long, no more than 7 nucleotides long, or no more than 8 nucleotides long. In one embodiment, the random molecular tag is 9 nucleotides long while the random complementary sequence is 6 nucleotides long, as shown in FIG. 1.
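The lengths given for the random molecular tag and the random complementary sequence determine how many distinct sequences each region can take. The sketch below is a hedged illustration, not the disclosed design procedure: the function names are hypothetical, and it assumes that each random position is drawn uniformly and independently from A, C, G, and T, which real oligonucleotide synthesis only approximates.

```python
import random


def tag_diversity(length, alphabet_size=4):
    """Number of distinct sequences of a given length over A, C, G, T."""
    return alphabet_size ** length


def random_oligo(length, rng=random):
    """Draw one random sequence (assumes uniform, independent bases)."""
    return "".join(rng.choice("ACGT") for _ in range(length))


def probability_all_unique(n_molecules, n_tags):
    """Birthday-problem probability that every molecule gets a distinct tag."""
    p = 1.0
    for i in range(n_molecules):
        p *= (n_tags - i) / n_tags
    return p


print(tag_diversity(9))   # 9-nt random molecular tag
print(tag_diversity(6))   # 6-nt random complementary sequence
print(random_oligo(9))    # one hypothetical tag
```

A 9-nucleotide tag allows 4^9 = 262,144 sequences and a 6-nucleotide random complementary region allows 4^6 = 4,096; `probability_all_unique` can be used to explore how often two molecules would share a tag by chance for a given input amount.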
[0050] In some embodiments, the adaptor oligonucleotide preferably comprises a phosphate at the 5'-end. In some embodiments, the adapter oligonucleotide comprises a DNA 5'-end and an RNA 3'-end.
[0051] To reduce dimerization of the hairpin adapter with itself during ligation, the 3' end of the adapter oligonucleotide may be blocked using an oligonucleotide modification not extendable or cleavable by polymerase, for example a C3 spacer.
[0052] In some aspects, the 3' end of the adaptor oligonucleotide comprises an overhang after the random complementary sequence for improved stability. In some embodiments, the 3' overhang comprises the region complementary to the nucleic acid sequence. As such, the hybridized adapter oligonucleotide and nucleic acid sequence can be ligated by either enzymatic or chemical means.
[0053] In one embodiment, the 3'-overhang region comprises at least 1 nucleotide, at least 2 nucleotides, at least 3 nucleotides, at least 4 nucleotides, at least 5 nucleotides, at least 6 nucleotides, at least 7 nucleotides, at least 8 nucleotides, at least 9 nucleotides, at least 10 nucleotides, at least 11 nucleotides, at least 12 nucleotides, at least 13 nucleotides, at least 14 nucleotides, at least 15 nucleotides, at least 20 nucleotides, at least 25 nucleotides, at least 30 nucleotides, at least 35 nucleotides, or at least 40 nucleotides that are complementary to sequences found in the nucleic acid sequence when the nucleic acid sequence and the adapter oligonucleotide are hybridized to one another. In this manner, the 3' overhang region of the adapter oligonucleotide is considered as the region of the adapter oligonucleotide that binds to the 3' region of the nucleic acid sequence.
[0054] In various embodiments, the 3'-overhang region comprises at least 1 nucleotide, preferably at least 2 nucleotides, preferably at least 3 nucleotides, preferably at least 4 nucleotides, and preferably at least 5 nucleotides that are mismatched with nucleotides found in the nucleic acid sequence when the nucleic acid sequence and adapter oligonucleotide are hybridized to one another.
[0055] In one embodiment, the hybridization between the nucleic acid sequence and the adapter oligonucleotide forms a structure that comprises a nick, wherein the nick can be ligated by either enzymatic or chemical means. In another embodiment, the hybridization between the nucleic acid sequence and the adapter oligonucleotide forms a structure that comprises a gap, wherein the gap can be ligated by either enzymatic or chemical means.
[0056] In one embodiment, the hybridization between the nucleic acid sequence and the adapter oligonucleotide forms a stem-loop structure. The stem-loop structure is stable at temperatures as high as 35°C, as high as 40°C, as high as 45°C, as high as 50°C, as high as 55°C, as high as 60°C, as high as 65°C, as high as 70°C, as high as 75°C, as high as 80°C, as high as 85°C, or more. Accordingly, the design of the adaptor oligonucleotide should take care to utilize sequences that ensure the formed stem-loop is thermostable.
[0057] The adapter oligonucleotide is a single-stranded oligonucleotide having a double-stranded portion formed of two self-complementary segments, optionally having a loop at one end, and a short overhanging single strand at the other. Thus, for purposes of the present invention, a hairpin is defined as a double-helical region formed by nucleotide base-pairing between adjacent, inverted, at least partially complementary sequences in a single-stranded nucleic acid, preferably within the same single stranded nucleic acid. The stem structure preferably maintains its structure prior to and under conditions suitable for hybridization between the nucleic acid sequence and the adapter oligonucleotide. In this manner, the nick or gap formed through the hybridization between the nucleic acid sequence and the adapter oligonucleotide can be fixed by way of ligation. In some instances, the donor molecule is designed to also have the stem structure be retained under conditions where the nick or gap is ligated by either enzymatic or chemical means. In this situation, a hybrid molecule is created by the ligation between the nucleic acid sequence and the adapter oligonucleotide.
[0058] In one embodiment, the intramolecular stem structure preferably maintains the stem structure under conditions suitable for hybridization between the donor and acceptor molecule. For example, the stem structure is designed to maintain its structure under conditions where the acceptor and donor molecule hybridize.
[0059] In some conditions, the intramolecular stem structure of the adapter oligonucleotide has reduced stability where the stem structure is unfolded. In this manner, the stem structure can be designed so that the stem structure can be relieved of its intramolecular base pairing and resemble more of a linear molecule. In one embodiment, the adapter oligonucleotide is designed where the relief of the intramolecular stem structure is thermodynamically favored over the intramolecular stem structure. For example, following the ligation of the adapter oligonucleotide and the nucleic acid sequence, some implementations comprise amplifying the ligated nucleic acid product. The stem-loop structure does not impair the amplification step, because the intramolecular stem structure may be undone by raising the temperature or adding a chemical denaturant. Once the intramolecular stem structure is undone, a probe or primer can be used to sequence or amplify at least a portion of the sequence present in the acceptor molecule.
[0060] In some embodiments, the stem can comprise at least 3 nucleotide pairs, at least 4 nucleotide pairs, at least 5 nucleotide pairs, at least 6 nucleotide pairs, at least 7 nucleotide pairs, at least 8 nucleotide pairs, at least 9 nucleotide pairs, at least 10 nucleotide pairs, at least 11 nucleotide pairs, at least 12 nucleotide pairs, at least 13 nucleotide pairs, at least 14 nucleotide pairs, at least 15 nucleotide pairs, at least 20 nucleotide pairs, at least 25 nucleotide pairs, at least 30 nucleotide pairs, at least 35 nucleotide pairs, at least 40 nucleotide pairs, at least 45 nucleotide pairs, at least 50 nucleotide pairs, at least 55 nucleotide pairs, at least 60 nucleotide pairs, at least 65 nucleotide pairs, at least 70 nucleotide pairs, or at least 75 nucleotide pairs.
[0061] In some implementations, the stem region comprises at least 1 mismatched pair, at least 2 mismatched pairs, at least 3 mismatched pairs, at least 4 mismatched pairs, at least 5 mismatched pairs, at least 6 mismatched pairs, at least 7 mismatched pairs, at least 8 mismatched pairs, at least 9 mismatched pairs, at least 10 mismatched pairs, at least 11 mismatched pairs, at least 12 mismatched pairs, at least 13 mismatched pairs, at least 14 mismatched pairs, at least 15 mismatched pairs, at least 20 mismatched pairs, at least 25 mismatched pairs, at least 30 mismatched pairs, at least 35 mismatched pairs, at least 40 mismatched pairs, at least 45 mismatched pairs, or at least 50 mismatched pairs.
[0062] In one embodiment, the number of mismatched pairs in the stem should be sufficient to make the stem structure unstable at a high temperature of at least 60°C, at least 65°C, at least 70°C, at least 75°C, at least 80°C, at least 85°C, at least 90°C, at least 95°C, at least 96°C, at least 97°C, at least 98°C, or at least 99°C.
[0063] The loop structure of the adapter oligonucleotide can comprise any number of nucleotides. In one embodiment, the loop structure comprises at least 1 nucleotide, at least 2 nucleotides, at least 3 nucleotides, at least 4 nucleotides, at least 5 nucleotides, at least 6 nucleotides, at least 7 nucleotides, at least 8 nucleotides, at least 9 nucleotides, at least 10 nucleotides, at least 11 nucleotides, at least 12 nucleotides, at least 13 nucleotides, at least 14 nucleotides, at least 15 nucleotides, at least 20 nucleotides, at least 25 nucleotides, at least 30 nucleotides, at least 35 nucleotides, or at least 40 nucleotides. Preferably, the loop comprises about 2-30 nucleotides.
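The stem and loop parameters described above can be illustrated with a minimal Python sketch. It is not the applicants' design tool: the arm sequences, the fixed loop segment, the barcode length, and the function names are hypothetical values chosen only for illustration. The sketch counts mismatched positions between the two stem arms and places a random molecular barcode in the loop.

```python
import random

# Watson-Crick partners for DNA bases.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def stem_mismatches(arm5, arm3):
    """Count mismatched pairs when the 5' arm folds back onto the 3' arm.

    Position i of the 5' arm pairs with position -(i + 1) of the 3' arm,
    because the two arms of a hairpin pair antiparallel.
    """
    if len(arm5) != len(arm3):
        raise ValueError("stem arms must have equal length")
    return sum(1 for a, b in zip(arm5, reversed(arm3)) if COMPLEMENT[a] != b)


def build_adapter(arm5, loop_fixed, barcode_len, arm3, rng):
    """Assemble 5'-arm + loop (fixed part + random barcode) + 3'-arm.

    All inputs are illustrative assumptions, not sequences from the patent.
    """
    barcode = "".join(rng.choice("ACGT") for _ in range(barcode_len))
    return arm5 + loop_fixed + barcode + arm3, barcode


if __name__ == "__main__":
    arm5 = "GCGTAC"                      # hypothetical 6-nt stem arm
    perfect_arm3 = reverse_complement(arm5)
    print(stem_mismatches(arm5, perfect_arm3))
    adapter, barcode = build_adapter(arm5, "TT", 8, perfect_arm3, random.Random(0))
    print(adapter, barcode)
```

A real design would also need a thermodynamic estimate of stem stability at the ligation and denaturation temperatures, which this sketch does not attempt.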
[0064] In one embodiment, ligation and hybridization are performed using temperature cycling ligation varying in range from -20°C to 40°C, for example, -20°C to 4°C, -20°C to 0°C, 20°C to 40°C, or 22°C to 37°C.
3. PCR-based steps for generating a sequencing library.
[0065] Target-specific and/or universal primers are used to amplify and enrich a nucleic acid sequence of interest. Multiple rounds of PCR are used to achieve adequate enrichment and selection of targeted nucleic acid sequence while minimizing off-target non-specific reads and adapter dimers. In some implementations, the PCR round comprises at least 15 cycles, at least 20 cycles, or at least 25 cycles. In other implementations, the linear amplification step comprises no more than 35 cycles or no more than 30 cycles. For example, the linear amplification step comprises between 20 and 30 amplification cycles, such as 30 cycles. Additional rounds of PCR are used to introduce sample-specific indexes to enable optimum utilization of downstream sequencing. A nested PCR strategy may be used to enrich for on-target reads and reduce off-targeted non-specific amplification and adapter dimers.
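The difference between a linear amplification step and an exponential PCR round can be made concrete with a short calculation. The sketch below assumes an idealized model that is not stated in the text: in linear amplification only the original templates are copied in each cycle, while in PCR every product becomes a template in the next cycle. The function names and the `efficiency` parameter are illustrative assumptions.

```python
def linear_products(templates, cycles, efficiency=1.0):
    """New strands made when only the original templates are copied each cycle."""
    return templates * cycles * efficiency


def exponential_products(templates, cycles, efficiency=1.0):
    """Total molecules when products also serve as templates (ideal PCR)."""
    return templates * (1.0 + efficiency) ** cycles


if __name__ == "__main__":
    # 30 cycles falls within the 20 to 30 cycle range mentioned above.
    print(linear_products(1000, 30))
    # 15 cycles is the lower bound given for a PCR round.
    print(exponential_products(1000, 15))
```

Under this model, 30 linear cycles add at most 30 copies per starting template, whereas 15 ideal PCR cycles multiply each template by 2^15. Real reactions fall short of both figures.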
[0066] In methods for screening a sequence of interest in a genome, the methods further comprise detecting the sequence of interest in the generated sequencing library.
[0067] In another aspect, the invention is directed to a sequencing library comprising nucleic acid sequences tagged with an adapter oligonucleotide and regions that are binding sites for a pair of universal primers.
[0068] It is well established in the art that, when performing different types of reactions with nucleic acids, for example a PCR after a ligation reaction, it is sometimes necessary to clean up the sample after each reaction before proceeding to the next. As shown in FIG. 4, the addition of alkaline phosphatase ensures a more efficient ligation reaction. Heat inactivation also ensures the complete end of the ligation reaction. Methods for cleaning up a PCR product are well established in the field, and an example is the use of a combination of exonuclease and alkaline phosphatase.
EXAMPLES
[0069] It should be understood that while particular embodiments have been illustrated and described, various modifications can be made thereto without departing from the spirit and scope of the invention as will be apparent to those skilled in the art. Such changes and modifications are within the scope and teachings of this invention as defined in the claims appended hereto.
Example A: Protocol for generation of molecular barcoded sequencing library
[0070] 1. Sample from sheared DNA D6 (2 replicate samples and 1 blank for EXP266b and the same for EXP266c). D6 at 10 ng/μl (sheared previously using sonication to ~300 bp fragments). The contents of each test tube are reflected in Table 1.
Table 1.
[0071] 2. Denature DNA by incubation at 37°C for 10 minutes and then at 95°C for 3 minutes. Immediately after the end of incubation at 95°C, incubate in an ice water bath. While in the ice water bath, add the oligo (0.5 μl from 100 μM stock) and the master mix (11 μl from the recipe described in Table 2) to each tube.
Table 2.
[0072] 3. Ligate ssDNA by incubation in 40 cycles of 10°C for one minute and 30°C for one minute. The ligation products may be stored at 4°C until the next step.
[0073] 4. Add FastAP and incubate at 37°C for 30 minutes. The amount of AP and buffer added for each tube is depicted in Table 3.
Table 3.
[0074] 5. Magnetic beads are used to clean up the ligation reaction (1.8 ratio of beads to DNA by volume). The total volume of the cleanup reaction per tube is 25 μl. The specific steps for clean up with magnetic beads are as follows:
a. Prepare fresh 85% ethanol: 850 μl of 100% ethanol + 150 μl of water (if needed). A total of 360 μl x 4 reactions = 1440 μl is needed, so prepare ~2 ml, or 2 x 1 ml preps.
b. Add 45 μl of well-resuspended magnetic beads and pipette mix well.
c. Incubate DNA and beads for 5 minutes at room temperature.
d. Place beads + DNA on the magnet and wait until all beads are collected and supernatant is clear.
e. Remove supernatant carefully (do not remove beads).
f. While on the magnet, wash 2x with 85% ethanol by adding 180 μl of 85% ethanol, waiting 30 s, and aspirating.
g. Tap the magnet to collect all ethanol at the bottom of the tubes and remove any leftover ethanol. Let it dry at RT for no longer than 5 minutes.
h. Add 20 μl of water and mix well with pipette.
i. Incubate for 2 minutes at room temperature.
j. Place tubes back on the magnet for a few minutes until all beads are collected, and transfer the cleaned product to a new set of tubes (or a new strip tube). Recover ~17.0 μl per reaction.
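The volumes used in the bead cleanup of step 5 follow from simple arithmetic, which the following Python sketch reproduces. The function names and the default 1 ml prep size are assumptions made for illustration. The numeric inputs (25 μl reaction, 1.8 bead ratio, two 180 μl washes, 4 reactions, 85% ethanol from 100% stock) are taken from the protocol.

```python
import math


def bead_volume_ul(sample_volume_ul, bead_ratio):
    """Bead volume for a given beads-to-sample ratio by volume."""
    return sample_volume_ul * bead_ratio


def ethanol_wash_plan(n_reactions, washes=2, wash_volume_ul=180.0,
                      target_percent=85.0, stock_percent=100.0,
                      prep_volume_ul=1000.0):
    """Work out how much diluted ethanol is needed and how to prepare it.

    Uses C1 * V1 = C2 * V2 for the dilution; prep_volume_ul is the size of
    one batch (1 ml here, an illustrative default).
    """
    needed_ul = n_reactions * washes * wash_volume_ul
    preps = math.ceil(needed_ul / prep_volume_ul)
    stock_ul = prep_volume_ul * target_percent / stock_percent
    return {
        "needed_ul": needed_ul,
        "preps": preps,
        "stock_ethanol_ul_per_prep": stock_ul,
        "water_ul_per_prep": prep_volume_ul - stock_ul,
    }


if __name__ == "__main__":
    print(bead_volume_ul(25.0, 1.8))
    print(ethanol_wash_plan(4))
```

The same `bead_volume_ul` helper applies to a cleanup at a different ratio once the sample volume for that cleanup is known.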
[0075] 6. The first PCR (named PCR1 in FIG. 4) is performed using the primers depicted in Table 4. The master mix for the PCR reaction is prepared according to Table 5. Table 6 describes the cycling conditions for the first PCR.
Table 4
Table 5
Table 6
[0076] 7. ExoSAP clean up is performed using 10 μl of PCR product and 4 μl ExoSAP-IT for 1 replicate of each of the above. ExoSAP incubation was set up per manufacturer's instructions, which is incubation at 37°C for 30 minutes followed by incubation at 85°C for 15 minutes.
[0077] 8. The second PCR reaction (named PCR2 in FIG. 4) is performed using the primers depicted in Table 7. The contents of each PCR reaction are described in Table 8. Table 9 describes the cycling conditions for the second PCR.
Table 7
Table 8
Table 9
[0078] 9. Another ExoSAP clean up is performed. This time 25 μl of PCR product and 10 μl ExoSAP-IT are used per reaction. ExoSAP incubation was set up per manufacturer's instructions, which is incubation at 37°C for 30 minutes followed by incubation at 85°C for 15 minutes. The products of the cleanup may be stored at 4°C until the next step is performed.
[0079] 10. Magnetic beads are used to clean up the PCR products (1.2 ratio of beads to DNA by volume). The specific process is the same as that described in step 5 but with the amount of beads adjusted. The elution volume is adjusted to 16 μl, of which about 13 μl is recovered per reaction.
[0080] 11. Kappa Library quantification using qPCR is performed according to the manufacturer's protocol.
[0081] 12. Quality control of the library is performed using the bioanalyzer.
[0082] 13. The library is sequenced using MiSeq. All samples are put at 4 nM. The pool contains 4 nM of each sample included for MiSeq with a volume of 10 μl per sample. For the blank sample, 1 μl was placed in the pool.
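Normalizing each library to 4 nM before pooling is a dilution calculation, sketched below in Python. The sketch assumes the library concentration is already known in nM (for example from the qPCR quantification of step 11). The mass-to-molar conversion is included only for the case where a mass concentration is what is at hand, and uses the common approximation of 660 g/mol per base pair of double-stranded DNA. The function names and example numbers are illustrative, not values from the protocol.

```python
def library_nM(conc_ng_per_ul, mean_fragment_bp, g_per_mol_per_bp=660.0):
    """Convert a dsDNA mass concentration to nM (approximate)."""
    return conc_ng_per_ul / (g_per_mol_per_bp * mean_fragment_bp) * 1e6


def dilution_to_target(stock_nM, target_nM=4.0, final_volume_ul=10.0):
    """Return (library_ul, diluent_ul) giving final_volume_ul at target_nM."""
    if stock_nM < target_nM:
        raise ValueError("stock is already below the target concentration")
    library_ul = final_volume_ul * target_nM / stock_nM
    return library_ul, final_volume_ul - library_ul


if __name__ == "__main__":
    stock = library_nM(2.64, 400)        # hypothetical library, about 10 nM
    print(stock)
    print(dilution_to_target(10.0))      # volumes for 10 ul at 4 nM
```

Because every sample is brought to the same molarity, equal volumes in the pool contribute equal numbers of molecules; adding a smaller volume of the blank, as in step 13, reduces its share of the sequencing run.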
Example B: Summary Table
Table 10

Claims

CLAIMS What is claimed is:
1. A method of adding oligonucleotide tags to a nucleic acid sequence in a sample, the method comprising the steps of:
annealing a first universal primer to the nucleic acid sequence in the sample, wherein the first universal primer is complementary to a sequence of interest on the nucleic acid sequence;
linearly amplifying the nucleic acid sequence; and
ligating an adapter oligonucleotide to the 3'-end of the nucleic acid sequence, wherein the adaptor oligonucleotide comprises:
a stem-loop intramolecular nucleotide base pairing;
a hydroxyl group at the 3'-end;
a phosphate at the 5'-end;
a random region complementary to the nucleic acid sequence; and
a random region in the loop comprising a molecular barcode.
2. A method of producing a sequencing library, the method comprising the steps of:
annealing a first universal primer to the nucleic acid sequence in the sample, wherein the first universal primer is complementary to a sequence of interest on the nucleic acid sequence;
linearly amplifying the nucleic acid sequence;
ligating an adapter oligonucleotide to the 3'-end of the nucleic acid sequence to produce a hybrid sequence comprising the sequence of interest on the nucleic acid sequence and the adapter oligonucleotide, wherein the adaptor oligonucleotide comprises:
a stem-loop intramolecular nucleotide base pairing;
a hydroxyl group at the 3'-end;
a phosphate at the 5'-end;
a random region complementary to the nucleic acid sequence; and
a random region in the loop comprising a molecular barcode; and
amplifying the hybrid sequence with a first set of primers.
3. A method of producing a sequencing library, the method comprising the steps of:
ligating an adapter oligonucleotide to the 3'-end of a nucleic acid sequence to produce a hybrid sequence comprising a sequence of interest on the nucleic acid sequence and the adapter oligonucleotide, wherein the adaptor oligonucleotide comprises:
a stem-loop intramolecular nucleotide base pairing;
a hydroxyl group at the 3'-end;
a phosphate at the 5'-end;
a random region complementary to the nucleic acid sequence; and
a random region in the loop comprising a molecular barcode; and
amplifying the hybrid sequence with a first set of primers.
4. The method of claim 2 or 3, wherein the first set of primers comprises a forward universal primer and a reverse universal sample barcoding primer and the reverse universal sample barcoding primer comprises a sample barcode; and
wherein amplifying the hybrid sequence with the first set of primers produces a barcoded sequence.
5. The method of claim 2 or 3, wherein the first set of primers comprises a target specific primer and a reverse universal primer and amplifying the hybrid sequence with the first set of primers produces a target specific sequence; and
wherein the method further comprises amplifying the target specific sequence with a second set of primers comprising a forward universal primer and sample barcoding primer to produce a barcoded sequence.
6. The method of claim 5, wherein the reverse universal primer comprises the nucleotide sequence of SEQ ID NO: 2.
7. The method of claim 5, wherein the forward universal primer comprises the nucleotide sequence of SEQ ID NO: 1.
8. The method of claim 5, wherein the sample barcoding primer comprises a 5' adapter sequence, a 3' region complementary to the reverse universal primer, and a sample index sequence between the 5' adapter sequence and 3' region complementary to the reverse universal primer.
9. The method of claim 8, wherein the sample barcoding primer comprises the nucleotide sequence of SEQ ID NO: 4.
10. The method of claim 5, further comprising amplifying the target-specific sequence with a nested target specific primer and the universal primer prior to amplification with the second set of primers.
11. The method of any one of the preceding claims, wherein the nucleic acid in the sample is fractionated.
12. The method of claim 11, wherein the nucleic acid in the sample is fractionated to fragments between about 100 bp and about 500 bp.
13. The method of claim 12, wherein the nucleic acid in the sample is fractionated to fragments between about 250 bp and about 350 bp.
14. The method of claim 13, wherein the nucleic acid in the sample is fractionated to fragments of about 300 bp.
15. The method of any one of the preceding claims, wherein ligating the adapter oligonucleotide to the 3'-end of the nucleic acid sequence takes place between -20°C and 40°C.
16. The method of claim 15, wherein ligating the adapter oligonucleotide to the 3'-end of the nucleic acid sequence takes place between 0°C and 40°C.
17. The method of claim 16, wherein ligating the adapter oligonucleotide to the 3'-end of the nucleic acid sequence takes place between 10°C and 30°C.
18. The method of any one of the preceding claims, further comprising cleaning up the amplified sequence with exonuclease and alkaline phosphatase following each amplifying step.
19. The method of any one of the preceding claims, wherein the adapter oligonucleotide is ligated to the 3' end of the nucleic acid sequence with a DNA ligase.
20. The method of any one of the preceding claims, wherein the adaptor oligonucleotide further comprises a 3' overhang and the 3' overhang comprises the region complementary to the nucleic acid sequence.
21. The method of any one of the preceding claims, wherein the region complementary to the nucleic acid sequence is complementary to the 3'-end of the nucleic acid sequence.
22. The method of any one of the preceding claims, wherein the stem-loop intramolecular nucleotide base pairing of the adaptor oligonucleotide forms a stem at least 6 nucleotide pairs long.
23. The method of claim 22, wherein the stem comprises at least 1 mismatched pair.
24. The method of any one of the preceding claims, wherein the stem-loop intramolecular nucleotide base pairing of the adaptor oligonucleotide forms a loop.
25. The method of claim 24, wherein the loop of the adapter oligonucleotide comprises a primer-binding region for a second universal primer.
26. A sequencing library comprising a nucleic acid sequence tagged with an adapter oligonucleotide at the 3'-end produced with the method of claim 2 or 3.
27. The sequencing library of claim 26, wherein the nucleic acid sequence comprises binding regions for a pair of universal primers and amplification with the pair of universal primers produces an amplicon comprising a sequence of interest for the sequencing library.
EP17803528.3A 2016-05-24 2017-05-24 Molecular tagging methods and sequencing libraries Active EP3464634B1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP21151500.2A EP3910068A1 (en) 2016-05-24 2017-05-24 Molecular tagging methods and sequencing libraries

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201662340954P 2016-05-24 2016-05-24
PCT/US2017/034329 WO2017205540A1 (en) 2016-05-24 2017-05-24 Molecular tagging methods and sequencing libraries

Related Child Applications (2)

Application Number Title Priority Date Filing Date
EP21151500.2A Division EP3910068A1 (en) 2016-05-24 2017-05-24 Molecular tagging methods and sequencing libraries
EP21151500.2A Division-Into EP3910068A1 (en) 2016-05-24 2017-05-24 Molecular tagging methods and sequencing libraries

Publications (3)

Publication Number Publication Date
EP3464634A1 EP3464634A1 (en) 2019-04-10
EP3464634A4 EP3464634A4 (en) 2020-02-12
EP3464634B1 EP3464634B1 (en) 2021-02-17

Family

ID=60411647

Family Applications (2)

Application Number Title Priority Date Filing Date
EP17803528.3A Active EP3464634B1 (en) 2016-05-24 2017-05-24 Molecular tagging methods and sequencing libraries
EP21151500.2A Pending EP3910068A1 (en) 2016-05-24 2017-05-24 Molecular tagging methods and sequencing libraries

Family Applications After (1)

Application Number Title Priority Date Filing Date
EP21151500.2A Pending EP3910068A1 (en) 2016-05-24 2017-05-24 Molecular tagging methods and sequencing libraries

Country Status (4)

Country Link
US (1) US12043856B2 (en)
EP (2) EP3464634B1 (en)
ES (1) ES2870626T3 (en)
WO (1) WO2017205540A1 (en)

Families Citing this family (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11111544B2 (en) 2005-07-29 2021-09-07 Natera, Inc. System and method for cleaning noisy genetic data and determining chromosome copy number
US11111543B2 (en) 2005-07-29 2021-09-07 Natera, Inc. System and method for cleaning noisy genetic data and determining chromosome copy number
US9424392B2 (en) 2005-11-26 2016-08-23 Natera, Inc. System and method for cleaning noisy genetic data from target individuals using genetic data from genetically related individuals
US11332785B2 (en) 2010-05-18 2022-05-17 Natera, Inc. Methods for non-invasive prenatal ploidy calling
EP2854057B1 (en) 2010-05-18 2018-03-07 Natera, Inc. Methods for non-invasive pre-natal ploidy calling
US20190010543A1 (en) 2010-05-18 2019-01-10 Natera, Inc. Methods for simultaneous amplification of target loci
US11939634B2 (en) 2010-05-18 2024-03-26 Natera, Inc. Methods for simultaneous amplification of target loci
US11408031B2 (en) 2010-05-18 2022-08-09 Natera, Inc. Methods for non-invasive prenatal paternity testing
US11339429B2 (en) 2010-05-18 2022-05-24 Natera, Inc. Methods for non-invasive prenatal ploidy calling
US11326208B2 (en) 2010-05-18 2022-05-10 Natera, Inc. Methods for nested PCR amplification of cell-free DNA
US10316362B2 (en) 2010-05-18 2019-06-11 Natera, Inc. Methods for simultaneous amplification of target loci
US11332793B2 (en) 2010-05-18 2022-05-17 Natera, Inc. Methods for simultaneous amplification of target loci
US9677118B2 (en) 2014-04-21 2017-06-13 Natera, Inc. Methods for simultaneous amplification of target loci
US11322224B2 (en) 2010-05-18 2022-05-03 Natera, Inc. Methods for non-invasive prenatal ploidy calling
CA2821906C (en) 2010-12-22 2020-08-25 Natera, Inc. Methods for non-invasive prenatal paternity testing
JP6153874B2 (en) 2011-02-09 2017-06-28 ナテラ, インコーポレイテッド Method for non-invasive prenatal ploidy calls
US20140100126A1 (en) 2012-08-17 2014-04-10 Natera, Inc. Method for Non-Invasive Prenatal Testing Using Parental Mosaicism Data
CN106460070B (en) 2014-04-21 2021-10-08 纳特拉公司 Detection of mutations and ploidy in chromosomal segments
EP4428863A2 (en) 2015-05-11 2024-09-11 Natera, Inc. Methods and compositions for determining ploidy
WO2017210372A1 (en) 2016-05-31 2017-12-07 The Translational Genomics Research Institute Molecular tagging methods and sequencing libraries
US11485996B2 (en) 2016-10-04 2022-11-01 Natera, Inc. Methods for characterizing copy number variation using proximity-litigation sequencing
US10011870B2 (en) 2016-12-07 2018-07-03 Natera, Inc. Compositions and methods for identifying nucleic acid molecules
EP3642363A1 (en) 2017-06-20 2020-04-29 Bio-Rad Laboratories, Inc. Mda using bead oligonucleotide
US12084720B2 (en) 2017-12-14 2024-09-10 Natera, Inc. Assessing graft suitability for transplantation
CA3090426A1 (en) 2018-04-14 2019-10-17 Natera, Inc. Methods for cancer detection and monitoring by means of personalized detection of circulating tumor dna
JP7537748B2 (en) 2018-06-06 2024-08-21 ザ リージェンツ オブ ザ ユニバーシティ オブ カリフォルニア Methods for generating nucleic acid libraries and compositions and kits for carrying out the same - Patents.com
US11525159B2 (en) 2018-07-03 2022-12-13 Natera, Inc. Methods for detection of donor-derived cell-free DNA
EP3853362A1 (en) * 2018-09-21 2021-07-28 F. Hoffmann-La Roche AG System and method for modular and combinatorial nucleic acid sample preparation for sequencing
CN109797437A (en) * 2019-01-18 2019-05-24 北京爱普益生物科技有限公司 A kind of construction method of sequencing library when detecting multiple samples and its application
US20220349013A1 (en) * 2019-06-25 2022-11-03 The Translational Genomics Research Institute Detection and treatment of residual disease using circulating tumor dna analysis
AU2021208238A1 (en) * 2020-01-16 2022-08-04 Creatz Inc. Method, system, and non-transitory computer-readable recording medium for measuring spin of ball
AU2023221441A1 (en) * 2022-02-18 2024-09-19 Agilent Technologies, Inc. Systems and methods for targeted nucleic acid capture and barcoding

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6919189B2 (en) * 2000-12-11 2005-07-19 Alexion Pharmaceuticals, Inc. Nested oligonucleotides containing a hairpin for nucleic acid amplification
US20070020640A1 (en) * 2005-07-21 2007-01-25 Mccloskey Megan L Molecular encoding of nucleic acid templates for PCR and other forms of sequence analysis
ATE510930T1 (en) 2005-08-02 2011-06-15 Rubicon Genomics Inc COMPOSITIONS AND METHODS FOR EDITING AND AMPLIFICATION OF DNA USING MULTIPLE ENZYMES IN A SINGLE REACTION
EP2240606B1 (en) 2008-01-14 2016-10-12 Applied Biosystems, LLC Compositions, methods, and kits for detecting ribonucleic acid
US8728763B2 (en) * 2009-08-11 2014-05-20 Response Genetics Methods, primers, probes and kits useful for the detection of BRAF mutations
WO2012112804A1 (en) * 2011-02-18 2012-08-23 Raindance Technoligies, Inc. Compositions and methods for molecular labeling
US9493825B2 (en) * 2011-03-18 2016-11-15 University Of South Florida Materials and methods for profiling microRNAs
GB201203720D0 (en) * 2012-03-02 2012-04-18 Babraham Inst Method of identifying VDJ recombination products
CN104903466B (en) 2012-11-05 2016-11-23 鲁比康基因组学公司 Bar coding nucleic acid
CN105121641A (en) * 2012-12-17 2015-12-02 哈佛大学校长及研究员协会 RNA-guided human genome engineering
WO2014110272A1 (en) * 2013-01-09 2014-07-17 The Penn State Research Foundation Low sequence bias single-stranded dna ligation
ES2908644T3 (en) * 2014-01-31 2022-05-03 Swift Biosciences Inc Improved procedures for processing DNA substrates
CN107002123A (en) * 2014-08-14 2017-08-01 生命技术公司 multiple transcriptome analysis
JPWO2016052405A1 (en) * 2014-09-29 2017-05-25 富士フイルム株式会社 Non-invasive discrimination method and discrimination system for fetal chromosome aneuploidy
US10557134B2 (en) * 2015-02-24 2020-02-11 Trustees Of Boston University Protection of barcodes during DNA amplification using molecular hairpins
CN107922970B (en) * 2015-08-06 2021-09-28 豪夫迈·罗氏有限公司 Target enrichment by single probe primer extension

Also Published As

Publication number Publication date
ES2870626T3 (en) 2021-10-27
EP3910068A1 (en) 2021-11-17
EP3464634B1 (en) 2021-02-17
US20190292575A1 (en) 2019-09-26
US12043856B2 (en) 2024-07-23
EP3464634A4 (en) 2020-02-12
WO2017205540A1 (en) 2017-11-30

Legal Events

Date Code Title Description
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20181219

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

AX Request for extension of the european patent

Extension state: BA ME

DAV Request for validation of the european patent (deleted)
DAX Request for extension of the european patent (deleted)
A4 Supplementary search report drawn up and despatched

Effective date: 20200113

RIC1 Information provided on ipc code assigned before grant

Ipc: C12Q 1/6855 20180101ALI20200107BHEP

Ipc: C12N 15/10 20060101ALI20200107BHEP

Ipc: C12N 15/66 20060101ALI20200107BHEP

Ipc: C12Q 1/68 20180101AFI20200107BHEP

Ipc: C12P 19/34 20060101ALI20200107BHEP

Ipc: C12Q 1/6853 20180101ALI20200107BHEP

Ipc: C40B 40/06 20060101ALI20200107BHEP

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: GRANT OF PATENT IS INTENDED

INTG Intention to grant announced

Effective date: 20200914

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE PATENT HAS BEEN GRANTED

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: DE

Ref legal event code: R096

Ref document number: 602017032853

Country of ref document: DE

REG Reference to a national code

Ref country code: AT

Ref legal event code: REF

Ref document number: 1361525

Country of ref document: AT

Kind code of ref document: T

Effective date: 20210315

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

REG Reference to a national code

Ref country code: LT

Ref legal event code: MG9D

REG Reference to a national code

Ref country code: NL

Ref legal event code: MP

Effective date: 20210217

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: HR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210518

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

Ref country code: NO

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210517

Ref country code: PT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210617

Ref country code: LT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

Ref country code: BG

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210517

REG Reference to a national code

Ref country code: AT

Ref legal event code: MK05

Ref document number: 1361525

Country of ref document: AT

Kind code of ref document: T

Effective date: 20210217

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

Ref country code: NL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

Ref country code: LV

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

Ref country code: PL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

Ref country code: RS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210617

REG Reference to a national code

Ref country code: ES

Ref legal event code: FG2A

Ref document number: 2870626

Country of ref document: ES

Kind code of ref document: T3

Effective date: 20211027

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: CZ

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

Ref country code: EE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

Ref country code: AT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

Ref country code: SM

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

REG Reference to a national code

Ref country code: DE

Ref legal event code: R097

Ref document number: 602017032853

Country of ref document: DE

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

Ref country code: RO

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

26N No opposition filed

Effective date: 20211118

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MC

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20210531

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20210524

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20210531

Ref country code: AL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

REG Reference to a national code

Ref country code: BE

Ref legal event code: MM

Effective date: 20210531

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210617

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20210531

REG Reference to a national code

Ref country code: DE

Ref legal event code: R082

Ref document number: 602017032853

Country of ref document: DE

Representative's name: FORRESTERS IP LLP, DE

Ref country code: DE

Ref legal event code: R082

Ref document number: 602017032853

Country of ref document: DE

Representative's name: KUEHR, VERA, DIPL.-BIOL., DE

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

P01 Opt-out of the competence of the unified patent court (upc) registered

Effective date: 20230613

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: HU

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT; INVALID AB INITIO

Effective date: 20170524

REG Reference to a national code

Ref country code: DE

Ref legal event code: R082

Ref document number: 602017032853

Country of ref document: DE

Representative's name: FORRESTERS IP LLP, DE

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: TR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: IE

Payment date: 20240527

Year of fee payment: 8

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20240527

Year of fee payment: 8

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20240530

Year of fee payment: 8

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: ES

Payment date: 20240603

Year of fee payment: 8

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20240527

Year of fee payment: 8

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20210217

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: IT

Payment date: 20240521

Year of fee payment: 8