WO2018077847A1 - Construction de bibliothèque circulaire à code-barres pour l'identification de produits chimériques - Google Patents

Construction de bibliothèque circulaire à code-barres pour l'identification de produits chimériques Download PDF

Info

Publication number
WO2018077847A1
WO2018077847A1 PCT/EP2017/077111 EP2017077111W WO2018077847A1 WO 2018077847 A1 WO2018077847 A1 WO 2018077847A1 EP 2017077111 W EP2017077111 W EP 2017077111W WO 2018077847 A1 WO2018077847 A1 WO 2018077847A1
Authority
WO
WIPO (PCT)
Prior art keywords
primer
molecule
adaptor
strand
target
Prior art date
Application number
PCT/EP2017/077111
Other languages
English (en)
Inventor
Toumy Guettouche
Aaron RICHARDSON
Original Assignee
F. Hoffmann-La Roche Ag
Roche Diagnostics Gmbh
Roche Sequencing Solutions, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by F. Hoffmann-La Roche Ag, Roche Diagnostics Gmbh, Roche Sequencing Solutions, Inc. filed Critical F. Hoffmann-La Roche Ag
Priority to EP17787432.8A priority Critical patent/EP3532635B1/fr
Priority to CN201780066771.2A priority patent/CN109844137B/zh
Priority to JP2019522579A priority patent/JP6860662B2/ja
Publication of WO2018077847A1 publication Critical patent/WO2018077847A1/fr
Priority to US16/384,013 priority patent/US20190241953A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing
    • C12Q1/6874Methods for sequencing involving nucleic acid arrays, e.g. sequencing by hybridisation
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1065Preparation or screening of tagged libraries, e.g. tagged microorganisms by STM-mutagenesis, tagged polynucleotides, gene tags
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay

Definitions

  • the invention relates to the field of nucleic acid sequencing. More specifically, the invention relates to the field of creating barcoded template libraries for nucleic acid sequencing. BACKGROUND OF THE INVENTION
  • the current generation of nucleic acid sequencing methods utilizes libraries of target molecules from which each individual molecule is sequenced.
  • Each molecule in the library comprises a target sequence to be analyzed conjugated to artificial sequences necessary for the chosen sequencing method and sequencing instrument.
  • the artificial sequences commonly include barcodes, short sequences of nucleotides used to uniquely mark an individual molecule or a group of molecules.
  • Unique molecular barcodes have multiple uses. Marking and tracing each individual nucleic acid molecule enables detection of extremely rare sequences, e.g., circulating tumor DNA (ctDNA) present in trace amounts in patient's blood and used for non-invasive early detection and precise monitoring of cancer (See Newman, A., et al., (2014) An ultrasensitive method for quantitating circulating tumor DNA with broad patient coverage, Nature Medicine doi:10.1038/nm.3519.) The entire progeny of a single target molecule is marked with the same barcode and forms a barcoded family. Therefore barcodes may be used for error correction. A variation in the sequence not shared by all members of the barcoded family is discarded as an artifact and not a true mutation.
  • ctDNA circulating tumor DNA
  • Barcodes can also be used for positional deduplication and target quantification, as the entire family represents a single molecule in the original sample (See Newman, A., et al., (2016) Integrated digital error suppression for improved detection of circulating tumor DNA, Nature Biotechnology 34:547).
  • the barcode-enabled error correction has greatly enhanced the sensitivity of sequencing assays. Sequencing artifacts such as polymerase errors are no longer a barrier to detecting rare point mutations. At the same time, barcodes have not been as beneficial for detecting translocations (gene fusions), another common type of mutation in human malignancies. See F. Mertens, et al. (2015) The emerging complexity of gene fusions in cancer, Nat. Rev. Cancer 15:371; F. Mitelman, et al. (2007) The impact of translocations and gene fusions on cancer causation, Nat. Rev. Cancer 7:233.
  • barcodes are typically randomly ligated to both ends of a target molecule, it is unknown which 5' barcodes originally are associated with which 3' barcodes. This poses a problem during the amplification step of library preparation, as chimeric molecules are produced via template switching in PCR. An artificially produced chimeric molecule present in the sequencing library is indistinguishable from an authentic gene fusion that may have been present in the original sample. This directly limits the capacity to detect low-frequency gene fusions, which can be important driver mutations in cancer. A barcode-based method is needed to trace and eliminate artificial gene fusions to enable detection of true mutations.
  • the invention is a method of making a library of target nucleic acid molecules from a sample comprising a plurality of target molecules, the method comprising for substantially each target molecule: ligating a single adaptor to a target molecule forming a circular molecule, wherein the adaptor comprises two barcodes, two primer binding sites situated between the two barcodes, wherein the primers annealing to the binding sites are facing away from each other, and at least one modified nucleotide effecting a strand synthesis termination by a nucleic acid polymerase situated between the two primer binding sites; annealing a forward primer complementary to the adaptor to one strand of the target molecule; extending the forward primer up to the modified nucleotide, thereby producing a first strand; annealing a reverse primer complementary to the adaptor to the first strand; extending the first primer, thereby producing the second strand and a double- stranded molecule comprising the first strand sand the second strand wherein
  • At least one of the forward and the reverse primer comprises a 5'-fiap sequence not complementary to the adaptor and comprising an additional primer binding site. Then the method further comprises a step of annealing an additional primer to the sequence complementary to the flap sequence in the forward primer and extending the additional primer thereby producing a double-stranded molecule comprising two additional primer sites and the two barcodes flanking the target sequence.
  • the target molecule and the adaptor are single-stranded.
  • the target molecule and the adaptor are double-stranded and the circular molecule is at least partially denatured primer to annealing of the primer.
  • the barcode is a nucleotide sequence 4-20 bases long.
  • the modified nucleotide effecting a strand synthesis termination by a nucleic acid polymerase may be selected from abasic nucleotides, nucleotides with protein side groups, synthetic nucleotide AraC (cytarabine) or deoxyuracil, isoguanine, 5- methylisocytosine, ethylene glycol spacers, nucleotides with bulky analogues such as fiuorophores, or unnatural base pair (UBP) "d5SICS-dNaM" nucleic acid analogues.
  • the ligation may be selected from overhang ligation, T-A ligation, blunt-end ligation and topoisomerase catalyzed ligation.
  • the adaptor has a photocleavable linker on one end.
  • the linker is ligated on one end and exposed to UV light to enable ligation on the other end.
  • the additional primers are sequencing primers.
  • the invention is a library of target nucleic acid molecules wherein each molecule is a circular molecule comprising a target sequence and an adaptor linking the ends of the target sequence, the adaptor comprising: two barcodes; two primer binding sites situated between the two barcodes, wherein the primers annealing to the binding sites are facing away from each other; at least one modified nucleotide effecting a strand synthesis termination by a nucleic acid polymerase situated between the two primer binding sites.
  • the barcode is a nucleotide sequence 4-20 bases long.
  • the modified nucleotide effecting a strand synthesis termination by a nucleic acid polymerase may be selected from abasic nucleotides, nucleotides with protein side groups, synthetic nucleotide AraC (cytarabine) or deoxyuracil, isoguanine, 5-methylisocytosine, ethylene glycol spacers, nucleotides with bulky analogues such as fiuorophores, or unnatural base pair (UBP) "d5SICS-dNaM" nucleic acid analogues.
  • synthetic nucleotide AraC cytarabine
  • deoxyuracil isoguanine
  • 5-methylisocytosine ethylene glycol spacers
  • nucleotides with bulky analogues such as fiuorophores
  • UBP unnatural base pair
  • the invention is a method of sequencing target nucleic acids in a sample comprising a plurality of target molecules, the method comprising: creating a library of target nucleic acid molecules from the sample by ligating a single double-stranded adaptor to substantially each double-stranded target molecule forming a double stranded circular molecule, wherein the adaptor comprises two barcodes, two primer binding sites situated between the two barcodes, wherein the primers annealing to the binding sites are facing away from each other, and at least one modified nucleotide effecting a strand synthesis termination by a nucleic acid polymerase situated between the two primer binding sites; denaturing at least a portion of the double-stranded circular target molecule; annealing a forward primer complementary to the adaptor to one strand of the target molecule; extending the forward primer up to the modified nucleotide, thereby producing a first strand; annealing a reverse primer complementary to the adaptor to the first strand;
  • At least one of the forward and the reverse primer comprises a 5'-flap sequence not complementary to the adaptor and comprising an additional primer binding site.
  • the method further comprises after extending the first primer, annealing an additional primer to the sequence complementary to the flap sequence in the forward primer and extending the additional primer thereby producing a double-stranded molecule comprising two additional primer sites and the two barcodes flanking the target sequence.
  • amplifying or sequencing may be performed with the additional primers.
  • Figure 1 is a diagram of a single-stranded barcoded library molecule according to the invention.
  • Figure 2 is a diagram of the first strand synthesis initiation with a forward primer.
  • Figure 3 is a diagram of the first strand synthesis and termination.
  • Figure 4 is a diagram of the completed first strand.
  • Figure 5 is a diagram of the second strand synthesis initiation with a reverse primer using the first strand as template.
  • Figure 6 is a diagram of the completed second strand.
  • Figure 7 is a diagram of the next round of the first strand synthesis initiation with a forward primer using the second strand as template.
  • Figure 8 is a diagram of a completed sequencing template molecule.
  • sample refers to any composition containing or presumed to contain target nucleic acid.
  • sample includes a sample of tissue or fluid isolated from an individual for example, skin, plasma, serum, spinal fluid, lymph fluid, synovial fluid, urine, tears, blood cells, organs and tumors, and also to samples of in vitro cultures established from cells taken from an individual patient or from a model organism, including the formalin-fixed paraffin embedded tissues (FFPET) and nucleic acids isolated therefrom.
  • FFPPET formalin-fixed paraffin embedded tissues
  • a sample may also include cell-free material, such as cell-free blood fraction that contains cell-free DNA (cfDNA) or circulating tumor DNA (ctDNA).
  • nucleic acid refers to polymers of nucleotides (e.g., ribonucleotides and deoxyribonucleotides, both natural and non-natural) including DNA, RNA, and their subcategories, such as cDNA, mRNA, etc.
  • a nucleic acid may be single-stranded or double-stranded and will generally contain 5'-3' phosphodiester bonds, although in some cases, nucleotide analogs may have other linkages.
  • Nucleic acids may include naturally occurring bases (adenosine, guanosine, cytosine, uracil and thymidine) as well as non-natural bases.
  • non-natural bases include those described in, e.g., Seela et al, (1999) Helv. Chim. Acta 82:1640.
  • the non-natural bases may have a particular function, e.g., increasing the stability of the nucleic acid duplex, inhibiting nuclease digestion or blocking primer extension or strand polymerization.
  • polynucleotide and “oligonucleotide” are used interchangeably.
  • Polynucleotide is a single-stranded or a double-stranded nucleic acid.
  • Oligonucleotide is a term sometimes used to describe a shorter polynucleotide.
  • An oligonucleotide may be comprised of at least 6 nucleotides or about 15-30 nucleotides. Oligonucleotides are prepared by any suitable method known in the art, for example, by a method involving direct chemical synthesis as described in Narang et al. (1979) Meth. Enzymol. 68:90-99; Brown et al. (1979) Meth. Enzymol. 68:109- 151; Beaucage et al. (1981) Tetrahedron Lett. 22:1859-1862; Matteucci et al. (1981) /. Am. Chem. Soc. 103:3185-3191.
  • primer refers to a single-stranded oligonucleotide which hybridizes with a sequence in a target nucleic acid ("primer binding site") and is capable of acting as a point of initiation of synthesis along a complementary strand of nucleic acid under conditions suitable for such synthesis.
  • the primer binding site can be unique to each target or can be added to all targets ("universal priming site” or “universal primer binding site”).
  • adaptor means a nucleotide sequence that may be added to another sequence so as to import additional properties to that sequence.
  • An adaptor is typically an oligonucleotide that can be single- or double-stranded, or may have both a single-stranded portion and a double-stranded portion.
  • An adaptor may contain sequences such as barcodes and universal primer or probe sites.
  • Ligation refers to a condensation reaction joining two nucleic acid strands wherein a 5'-phosphate group of one molecule reacts with the 3'- hydroxyl group of another molecule.
  • Ligation is typically an enzymatic reaction catalyzed by a ligase or a topoisomerase.
  • Ligation may join two single strands to create one single-stranded molecule.
  • Ligation may also join two strands each belonging to a double-stranded molecule thus joining two double-stranded molecules.
  • Ligation may also join both strands of a double-stranded molecule to both strands of another double-stranded molecule thus joining two double- stranded molecules.
  • Ligation may also join two ends of a strand within a double- stranded molecule thus repairing a nick in the double-stranded molecule.
  • the term "barcode” refers to a nucleic acid sequence that can be detected and identified. Barcodes can be incorporated into various nucleic acids. Barcodes are sufficiently long e.g., 2, 5, 10 nucleotides, so that in a sample, the nucleic acids incorporating the barcodes can be distinguished or grouped according to the barcodes.
  • multiplex identifier and "MID” refer to a barcode that identifies a source of a target nucleic acids (e.g., a sample from which the nucleic acid is derived, which is needed when nucleic acids from multiple samples are combined). All or substantially all the target nucleic acids from the same sample will share the same MID. Target nucleic acids from different sources or samples can be mixed and sequenced simultaneously. Using the MIDs the sequence reads can be assigned to individual samples from which the target nucleic acids originated.
  • UID unique molecular identifier
  • UID unique molecular identifier
  • All or substantially all the target nucleic acids from the same sample will have different UIDs.
  • All or substantially all of the progeny (e.g., amplicons) derived from the same original target nucleic acid will share the same UID.
  • universal primer and "universal priming binding site” or “universal priming site” refer to a primer and primer binding site present in (typically, in vitro added to) different target nucleic acids.
  • the universal priming site may be included in an adaptor ligated to the plurality of target nucleic acids.
  • the universal priming site may also be a part of target-specific (non-universal) primers, for example by being added to the 5'-end of a target- specific primer.
  • the universal primer can bind to and direct primer extension from the universal priming site.
  • target sequence refers to a portion of the nucleic acid sequence in the sample which is to be detected or analyzed.
  • target includes all variants of the target sequence, e.g., one or more mutant variants and the wild type variant.
  • sequencing refers to any method of determining the sequence of nucleotides in the target nucleic acid.
  • nucleic acid sequencing is rapidly expanding into clinical practice.
  • the current sequencing technologies employ single molecule sequencing and allow detection of extremely rare targets.
  • nucleic acid sequencing has been used to detect rare tumor DNA shed into a patient's bloodstream.
  • Detecting individual molecules typically requires molecular barcodes such as described in U.S. Patent Nos. 7,393,665, 8,168,385, 8,481,292, 8,685,678, and 8,722,368.
  • a unique molecular barcode is a short artificial sequence added to each molecule in the patient's sample typically during the earliest steps of in vitro manipulations. The barcode marks the molecule and its progeny.
  • the unique molecular barcode (UID) has multiple uses.
  • Barcodes allow tracking each individual nucleic acid molecule in the sample to assess, e.g., the presence and amount of circulating tumor DNA (ctDNA) molecules in a patient's blood in order to detect and monitor cancer without a biopsy (Newman, A., et al., (2014) An ultrasensitive method for quantitating circulating tumor DNA with broad patient coverage, Nature Medicine doi:10.1038/nm.3519).
  • Unique molecular barcodes can also be used for sequencing error correction. The entire progeny of a single target molecule is marked with the same barcode and forms a barcoded family. A variation in the sequence not shared by all members of the barcoded family is discarded as an artifact and not a true mutation.
  • Barcodes can also be used for positional deduplication and target quantification, as the entire family represents a single molecule in the original sample (Newman, A., et al., (2016) Integrated digital error suppression for improved detection of circulating tumor DNA, Nature Biotechnology 34:547).
  • the barcode-enabled error correction has greatly enhanced the sensitivity of sequencing assays. Sequencing artifacts such as polymerase errors are no longer a barrier to detecting rare point mutations. At the same time, barcodes have not been as beneficial for detecting translocations (gene fusions), another common type of mutation in human malignancies. See F. Mertens, et al. (2015) The emerging complexity of gene fusions in cancer, Nat. Rev. Cancer 15:371; F. Mitelman, et al. (2007) The impact of translocations and gene fusions on cancer causation, Nat. Rev.
  • the invention is a library of barcoded circular molecules for nucleic acid sequencing.
  • the invention is a method of sequencing nucleic acids via creation of a library of circular barcoded nucleic acid molecules. In some embodiments, the invention is a method of error correction in nucleic acid sequencing that utilizes barcodes to authenticate gene fusion molecules present in the original sample. In variations of this embodiment, the invention is a method of error correction in nucleic acid sequencing that utilizes barcodes to eliminate artificial gene fusion molecules not present in the original sample but generated during the steps of nucleic acid sequencing.
  • the present invention comprises detecting a target nucleic acid in a sample by nucleic acid sequencing. Multiple nucleic acids, including all the nucleic acids in a sample may be detected using the method and compositions described herein.
  • the sample is derived from a subject or a patient.
  • the sample may comprise a fragment of a solid tissue or a solid tumor derived from the subject or the patient, e.g., by biopsy.
  • the sample may also comprise body fluids (e.g., urine, sputum, serum, plasma or lymph, saliva, sputum, sweat, tear, cerebrospinal fluid, amniotic fluid, synovial fluid, pericardial fluid, peritoneal fluid, pleural fluid, cystic fluid, bile, gastric fluid, intestinal fluid, or fecal samples).
  • body fluids e.g., urine, sputum, serum, plasma or lymph, saliva, sputum, sweat, tear, cerebrospinal fluid, amniotic fluid, synovial fluid, pericardial fluid, peritoneal fluid, pleural fluid, cystic fluid, bile, gastric fluid, intestinal fluid, or fecal samples.
  • the sample may comprise whole blood or blood fractions where normal or tumor cells may be present.
  • the sample, especially a liquid sample may comprise cell-free material such as cell-free DNA or RNA including cell- free tumor DNA or tumor RNA.
  • the sample is a cell-free sample, e.g., cell-free blood-derived sample where cell-free tumor DNA or tumor RNA are present.
  • the sample is a cultured sample, e.g., a culture or culture supernatant containing or suspected to contain nucleic acids derived from the cells in the culture or from an infectious agent present in the culture.
  • the infectious agent is a bacterium, a protozoan, a virus or a mycoplasma.
  • a target nucleic acid is the nucleic acid of interest that may be present in the sample.
  • the target nucleic acid is a gene or a gene fragment.
  • all the genes, gene fragments and intergenic regions (entire genome) constitute target nucleic acids.
  • only a portion of the genome, e.g., only coding regions of the genome (exome) constitute target nucleic acids.
  • the target nucleic acid contains a locus of a genetic variant, e.g., a polymorphism, including a single nucleotide polymorphism or variant (SNP of SNV), or a genetic rearrangement resulting e.g., in a gene fusion.
  • the target nucleic acid comprises a biomarker, i.e., a gene whose variants are associated with a disease or condition.
  • the target nucleic acid is characteristic of a particular organism and aids in identification of the organism or a characteristic of the pathogenic organism such as drug sensitivity or drug resistance.
  • the target nucleic acid is characteristic of a human subject, e.g., the HLA or KIR sequence defining the subject's unique HLA or KIR genotype.
  • one or a plurality of target nucleic acids is converted into the template configuration of the invention.
  • the target nucleic acid occurs in nature in a single-stranded form (e.g.,
  • RNA including mRNA, microRNA, viral RNA; or single-stranded viral DNA
  • the target nucleic acid occurs in nature in a double-stranded form.
  • a single stranded target nucleic acid can be converted into the structure of the invention as shown on Figure 1.
  • a double stranded target nucleic acid can be converted into a double stranded structure where each strand is as depicted in Figure 1.
  • the single-stranded target nucleic acid can be first converted into double-stranded form prior to following the remaining steps of the method disclosed herein.
  • target nucleic acids may be fragmented although in some applications longer target nucleic acids may be desired to achieve a longer read.
  • the target nucleic acid is naturally fragmented, e.g., circulating cell-free DNA (cfDNA) or chemically degraded DNA such as the one founds in preserved samples.
  • cfDNA circulating cell-free DNA
  • chemically degraded DNA such as the one founds in preserved samples.
  • the present invention comprises the use of one adaptor molecule to be ligated to both ends of one target nucleic acid thus forming a circular molecule.
  • the adaptor is a single strand ligated to a single stranded target nucleic acid molecule.
  • ligating single-stranded nucleic acids is performed using splint oligonucleotides see e.g., U.S. Application Pub. No. 20120003657.
  • ligating single-stranded nucleic acids or partially single-stranded nucleic acids is performed using 5'- and 3'- end single stranded regions (overhangs) see e.g., U.S. Application Pub. No.
  • the adaptor is a double stranded molecule ligated to a double stranded target nucleic acid molecule. Ligation of double stranded molecules is well known in the art (See Green M., and Sambrook, J., Molecular Cloning, 2012 CSHL Press), and improvements on the general method are described herein.
  • the double stranded ligation is a blunt-end ligation.
  • the double stranded ligation is a T-A ligation or other overhang ligation.
  • the double stranded ligation is driven by topoisomerase.
  • the double-stranded adaptor has a photo-cleavable spacer on one of the two ends.
  • only one end could ligate to a library molecule in a ligation reaction.
  • the reaction would be exposed to long wavelength UV (-350 nm), cleaving off the photo-cleavable spacer, and leaving a phosphorylated 5'-end of the adaptor.
  • the ligation reaction may continue to form a circular template.
  • the reaction is diluted to reduce template concentration and facilitate self-ligation into circles.
  • the ligation results in reduced artifact formation (e.g., DNA1 - adaptor - DNA2 or adaptor 1 - DNA - adaptor2) and greater recovery of target nucleic acid molecules (greater GE)
  • the adaptor molecules are in vitro synthesized artificial sequences. In other embodiments, the adaptor molecules are in vitro synthesized naturally-occurring sequences. In yet other embodiments, the adaptor molecules are isolated naturally occurring molecules or isolated non naturally- occurring molecules.
  • the adaptor comprises one or more barcodes.
  • a barcode can be a multiplex sample ID (MID) used to identify the source of the sample where samples are mixed (multiplexed).
  • the barcode may also serve as a unique molecular ID (UID) used to identify each original molecule and its progeny.
  • MID multiplex sample ID
  • UID unique molecular ID
  • the barcode may also be a combination of a UID and an MID. In some embodiments, a single barcode is used as both UID and MID.
  • each barcode comprises a predefined sequence. In other embodiments, the barcode comprises a random sequence. Barcodes can be 1- 20 nucleotides long.
  • the library molecules contain at least two barcodes included in the adaptor that is ligated to the target nucleic acid.
  • the barcodes are between about 4-20 bases long so that between 96 and 384 different adaptors, each with a different pair of identical barcodes are added to a human genomic sample.
  • a person of ordinary skill would recognize that the number of barcodes depends on the complexity of the sample (i.e., expected number of unique target molecules) and would be able to create a suitable number of barcodes for each experiment.
  • the invention comprises a pool of adaptors for creating a library of circular barcoded molecules.
  • the adaptors within the pool have a pair of identical barcodes that are at least 1 or at least 3 edit distance apart from other barcodes in the pool.
  • One of skilled in the art would be able to determine what edit distance is optimal for a particular experiment based on typical error rates of a sequencing technology. Generally, greater edit distance means that fewer barcodes can be used in one pool. However, if the sequencing technology or a manufacturing process has a high error rate, greater edit distance will be required. For example, oligonucleotide manufacturing process used to make adaptors may have a high error rate.
  • a nucleic acid polymerase used in DNA amplification or primer extension in the sequencing-by-synthesis workflow can have a high error rate. These error rates would require increasing edit distance among the barcodes in adaptors of the pool. Conversely, improving the accuracy of each of the methods mentioned above will allow decreasing edit distance among the barcodes in adaptors of the pool.
  • the invention comprises an article of manufacture represented by a single vial containing the entire pool of adaptors.
  • an article of manufacture can comprise a kit where one or more adaptors of the pool are present in separate vials.
  • the adaptor further comprises a primer binding site for at least one universal primer. If two primer binding sites are present, the two primers are facing in opposite directions.
  • a double stranded adaptor sequence will have a primer binding site on one or both strands.
  • the primer binding site is a sequence complementary to the primer to which primer can bind and facilitate strand elongation.
  • a single stranded adaptor sequence will have a primer binding site for the first primer and a sequence identical to the second primer.
  • the adaptor has two primer binding sites facing in the opposite direction so as to enable copying of each strand and subsequent PCR amplification of the two strands. In other embodiments, the adaptor has only one primer binding site to enable coping of only one strand. In some embodiments, more than one round of copying is desired. Several rounds can be performed with the same primer or different primers.
  • the adaptor can have several primer binding sites, e.g., second primer binding site internal to the first primer binding site. Alternatively, one or both of the forward and the reverse primer may comprise a 5'- fiap sequence not complementary to the adaptor and comprising an additional primer binding site.
  • the adaptor comprises a nucleic acid synthesis termination (STOP) site.
  • the site comprises one or more nucleotides or nucleotide analogs that are not bypassable by a nucleic acid polymerase.
  • the STOP site is one or more nucleotides and nucleotide analogues selected from abasic nucleotides, nucleotides with protein side groups, synthetic nucleotide AraC (cytarabine) or deoxyuracil, isoguanine, 5-methylisocytosine, ethylene glycol spacers, nucleotides with bulky analogues such as fiuorophores, or unnatural base pair (UBP) "d5SICS-dNaM" nucleic acid analogues (See Malyshev, D., et al., (2012) Efficient and sequence-independent replication of DNA containing a third base pair establishes a functional six-letter genetic alphabet.
  • a terminator nucleotide may be specific for a particular nucleic acid polymerase while other nucleic acid polymerases may be able to bypass the same terminator.
  • alkylated deoxyguanines N7 and N2 are commonly synthesis terminators for Taq DNA polymerase. See Ponti, M., et al. (1991) Measurement of the sequence specificity of covalent DNA modification by antineoplastic agents using Taq DNA polymerase, Nucl. Acids Res. 19:2929.
  • deoxyuracil in DNA causes stall of some polymerases while others bypass it. Wardle, J., et al.
  • the invention utilizes enzymes.
  • the enzymes may include a DNA polymerase (including sequencing polymerase), a DNA ligase and a terminal transferase.
  • the DNA polymerase is a high-fidelity DNA polymerase that efficiently terminates synthesis at an unusual base, i.e., the STOP site used in the present invention.
  • high fidelity polymerases are archaeal polymerases such as Pfu (from Pyrococcus furiosus).
  • Taq polymerase is used.
  • the polymerase possesses a 3'-5' exonuclease activity.
  • the polymerase does not have a strand displacement activity.
  • the invention also utilizes a DNA ligase.
  • T4 DNA ligase or E. coli DNA ligase is used.
  • the invention also utilizes a template-independent DNA polymerase, e.g., a terminal transferase or a DNA polymerase with the activity of adding one or more nucleotides in a template-independent manner.
  • a template-independent DNA polymerase e.g., a terminal transferase or a DNA polymerase with the activity of adding one or more nucleotides in a template-independent manner.
  • the invention uses a mammalian terminal transferase or a Taq polymerase.
  • the invention comprises an amplification step.
  • This step can involve linear or exponential amplification, e.g., PCR.
  • Amplification may be isothermal or involve thermocycling.
  • the amplification is exponential and involves PCR.
  • Universal primers are used, i.e., a single pair of primers hybridizes to a binding site in the adaptor. All molecules in the library having the same adaptor can be amplified with the same set of primers. Because PCR with universal primers has reduced sequence bias, the number of amplification cycles need not be limited. The number of amplification cycles where universal primers are used can be low but also can be 10, 20 or as high as about 30 or more cycles, depending on the amount of product needed for the subsequent steps.
  • the library of circular barcoded molecules and the linear amplicons generated from the library can be subjected to nucleic acid sequencing. Sequencing can be performed by any method known in the art. Especially advantageous is the high-throughput single molecule sequencing. Examples of such technologies include the Illumina HiSeq platform (Illumina, San Diego, Cal.), Ion Torrent platform (Life Technologies, Grand Island, NY), Pacific Biosciences platform utilizing the SMRT ( Pacific Biosciences, Menlo Park, Cal.) or a platform utilizing nanopore technology such as those manufactured by Oxford Nanopore Technologies (Oxford, UK) or Roche Genia (Santa Clara, Cal.) and any other presently existing or future DNA sequencing technology that does or does not involve sequencing by synthesis.
  • Illumina HiSeq platform Illumina, San Diego, Cal.
  • Ion Torrent platform Life Technologies, Grand Island, NY
  • Pacific Biosciences platform utilizing the SMRT Pacific Biosciences, Menlo Park, Cal.
  • nanopore technology such as those manufactured by Oxford Nanopore Technologies (
  • the sequencing step may utilize platform-specific sequencing primers. Binding sites for these primers may be introduced in 5'-portions of the amplification primers used in the amplification step. If no primer sites are present in the library of barcoded molecules, an additional short amplification step introducing such binding sites may be performed.
  • the sequencing step involves sequence analysis.
  • the analysis includes a step of sequence aligning.
  • aligning is used to determine a consensus sequence from a plurality of sequences, e.g., a plurality having the same barcodes (UID).
  • barcodes (UIDs) are used to determine a consensus from a plurality of sequences all having an identical barcode (UID).
  • barcodes (UIDs) are used to eliminate artifacts, i.e., variations existing in some but not all sequences having an identical barcode (UID). Such artifacts resulting from PCR errors or sequencing errors can be eliminated.
  • the number of each sequence in the sample can be quantified by quantifying relative numbers of sequences with each barcode (UID) in the sample.
  • UID barcode
  • Each UID represents a single molecule in the original sample and counting different UIDs associated with each sequence variant can determine the fraction of each sequence in the original sample.
  • a person skilled in the art will be able to determine the number of sequence reads necessary to determine a consensus sequence.
  • the relevant number is reads per UID ("sequence depth") necessary for an accurate quantitative result.
  • the desired depth is 5-50 reads per UID.
  • barcodes are used to detect gene fusions and eliminate artifacts simulating gene fusion events.
  • sequence analysis involves a step of aligning the read of the target sequence to the known genome sequence.
  • Each read must contain a target sequence mapping to the genome of interest and identical barcodes (UIDs) on both ends.
  • a true gene fusion molecule would have a target sequence mapping to different regions of the target genome but identical barcodes (UIDs) on both ends.
  • a molecule having a target sequence mapping to different regions of the target genome but having different barcodes on both ends is an artifact and not a true gene fusion molecule.
  • the inventors have observed such artifacts occurring at a frequency approaching or exceeding the frequency of rare gene fusion molecules present in vivo. Without being bound by a particular theory, the inventors hypothesized that such artifacts arise during PCR with library molecules.
  • the extension of a universal primer may commence on one library molecule and undergo a template switch to continue on a second library molecule.
  • the resulting fusion molecule will have binding sites for two universal primers and be amplified in subsequent cycles of PCR.
  • Using the barcode matching according to the present invention identifies and eliminates such molecules from the sequencing data. Eliminating the artifacts allows detection of true gene fusion events with much higher sensitivity and specificity.
  • the invention is represented in more detail in Figures 1-8.
  • Figure 1 depicts a single-stranded (denatured) library molecule according to the invention.
  • Ligation of a single double-stranded adaptor to a double-stranded target molecule resulted in a double stranded circular molecule that can be denatured to yield structures depicted in Figure 1.
  • BC are barcodes present in the adaptor.
  • Each adaptor contains two identical barcodes. In some embodiments, different barcodes may be used. In either case, the barcode (or a combination of two barcodes) is unique among the adaptors used in the library preparation.
  • Each library and the progeny thereof can be uniquely identified by two copies the same unique barcode or a unique combination of two barcodes.
  • R and F are binding sites for a reverse and forward sequencing primers respectively.
  • a single strand of nucleic acid (such as depicted in Figure 1) contains a binding site (complementary sequence) for one primer (the F primer in Figure 1), and the identical sequence to the opposite facing primer (R primer in Figure 1), while the complementary strand (not shown in Figure 1) would have identical sequence to the F primer and the binding site (complementary sequence) to the R primer.
  • STOP is the strand synthesis terminator described further herein.
  • the circular templates shown in Figure 1 may be isolated from the sample using paramagnetic beads.
  • a non-extendable capture probe complementary to the adaptor molecule is added to the sample.
  • Two capture probes may be used to capture each strand of the circular molecule.
  • the capture probe is biotinylated at the 3'-end and can be captured with streptavidin-coated paramagnetic beads.
  • the probes may have the following structure: F ' R' Biotin 3 ' and
  • Figure 2 depicts initiation of the first strand synthesis wherein the F primer binds to the primer binding site in the library molecule.
  • the primer has an additional non-complementary sequence at its 5'-end.
  • the additional sequence can contain a functional element, e.g., a sequencing primer binding site (P5).
  • Figure 3 depicts first strand synthesis and termination at the STOP.
  • Figure 4 depicts the duplex circular molecule of Figure 3 and the separated (denatured) newly synthesized first strand the sequencing primer binding site (P5), the sequence of the forward primer (F), the target sequence flanked by two barcodes (BC) and a binding site for the reverse primer (R). STOP is not present in the newly synthesized first strand.
  • Figure 5 depicts initiation of the first strand synthesis wherein the R primer binds to the primer binding site in the first strand.
  • the primer has an additional non-complementary sequence at its 5'-end.
  • the additional sequence can contain a functional element, e.g., a sequencing primer binding site (P7).
  • Figure 6 depicts second strand synthesis that copies all the elements of the first strand including the sequencing primer binding site P5.
  • Figure 7 depicts the next round of initiation of the first strand synthesis wherein the P5 sequencing primer binds to its binding site in the second strand.
  • Figure 8 depicts the final linear double-stranded library molecule ready for further steps, such as amplification and sequencing.
  • the double stranded molecule contains sequencing (or amplification) primer binding sites P5 and P7 and barcodes (BC).
  • the molecule also retains the initial forward and reverse primer binding sites F and R.
  • the double-stranded library molecule is characterized by a unique barcode (or a unique combination of barcodes) that distinguished this molecule and its progeny from all other molecules and their progeny in the sample.
  • Example 1 (prophetic)
  • DNA is isolated from the sample.
  • the isolated DNA is optionally fragmented and size selected for an optimal size of circular molecules.
  • RNA is isolated from the sample and reverse-transcribed into cDNA and treated in subsequent steps as DNA isolated directly from the sample.
  • the DNA is end-repaired and A-tailed with T4 DNA polymerase.
  • the addition of the A-tail allows for a subsequent efficient ligation, avoiding complications from blunt ligation.
  • the double-stranded linker has the structure
  • [T] is the added T that base pairs with A at the 3'-end of the target molecule
  • BC is a barcode
  • STOP is the terminator nucleotide
  • R and F are reverse and forward primer binding sites respectively.
  • the sample is treated with T7 exonuclease to remove non-circularized DNA and excess adaptors (any DNA with free ends remaining in the sample).
  • the circular templates may be isolated from the sample using paramagnetic beads.
  • Two non-extendable capture probes are used to capture each strand of the circular molecule.
  • the capture probe is biotinylated at the 3'-end and is captured with streptavidin- coated paramagnetic beads.
  • the capture process comprises the steps of heat denaturation, binding to beads, removal of bead- captured DNA from solution using a magnet, optional washing the beads, and elution from the bead-bound capture probes by denaturation at elevated temperature.
  • the isolated circular templates are then amplified by PCR.
  • the PCR is performed with primers complementary to the primer-binding sites in the adaptor.
  • Each primer has a 5'-fiap non-complementary to the binding sites in the adaptor and containing sequencing primer binding sites or flow cell binding sequences depending on the choice of the sequencing instrument and technology.
  • the PCR produces linear molecules.
  • the STOP bases in each strand of the circular template molecule prevent the completion of a circle by the polymerase.
  • sequence data derived from the linear templates are analyzed.
  • a molecule having a target sequence mapping to different regions of the target genome but identical (or previously matched) barcodes on both ends is detected as a true gene fusion molecule.
  • a molecule having a target sequence mapping to different regions of the target genome but different (or not previously matched) barcodes on both ends is an artifact that is discarded from the sequencing data.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Genetics & Genomics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Analytical Chemistry (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Immunology (AREA)
  • Biomedical Technology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Plant Pathology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

L'invention concerne un nouveau procédé de construction de bibliothèques pour le séquençage de molécule unique et une composition correspondante. Le procédé utilise des codes à barres qui permettent la détection et le séquençage de molécules cibles chimériques avec une grande sensibilité. Le procédé trouve une application dans la détection de fusions de gènes telles que celles qui sont caractéristiques du cancer.
PCT/EP2017/077111 2016-10-31 2017-10-24 Construction de bibliothèque circulaire à code-barres pour l'identification de produits chimériques WO2018077847A1 (fr)

Priority Applications (4)

Application Number Priority Date Filing Date Title
EP17787432.8A EP3532635B1 (fr) 2016-10-31 2017-10-24 Construction de bibliothèque circulaire à code-barres pour l'identification de produits chimériques
CN201780066771.2A CN109844137B (zh) 2016-10-31 2017-10-24 用于鉴定嵌合产物的条形码化环状文库构建
JP2019522579A JP6860662B2 (ja) 2016-10-31 2017-10-24 キメラ生成物の同定のためのバーコードを付けられた環状ライブラリーの構築
US16/384,013 US20190241953A1 (en) 2016-10-31 2019-04-15 Barcoded circular library construction for identification of chimeric products

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201662415245P 2016-10-31 2016-10-31
US62/415,245 2016-10-31

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US16/384,013 Continuation US20190241953A1 (en) 2016-10-31 2019-04-15 Barcoded circular library construction for identification of chimeric products

Publications (1)

Publication Number Publication Date
WO2018077847A1 true WO2018077847A1 (fr) 2018-05-03

Family

ID=60153328

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2017/077111 WO2018077847A1 (fr) 2016-10-31 2017-10-24 Construction de bibliothèque circulaire à code-barres pour l'identification de produits chimériques

Country Status (5)

Country Link
US (1) US20190241953A1 (fr)
EP (1) EP3532635B1 (fr)
JP (1) JP6860662B2 (fr)
CN (1) CN109844137B (fr)
WO (1) WO2018077847A1 (fr)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020150656A1 (fr) 2017-08-07 2020-07-23 The Johns Hopkins University Méthodes de diagnostic et de traitement du cancer
US11180803B2 (en) 2011-04-15 2021-11-23 The Johns Hopkins University Safe sequencing system
WO2021249825A1 (fr) * 2020-06-08 2021-12-16 F. Hoffmann-La Roche Ag Procédés et compositions de détection de réagencements structuraux dans un génome
US11286531B2 (en) 2015-08-11 2022-03-29 The Johns Hopkins University Assaying ovarian cyst fluid
WO2022133335A1 (fr) * 2020-12-18 2022-06-23 Grail, Inc. Préparation d'échantillons d'acide nucléique pour séquençage
US11525163B2 (en) 2012-10-29 2022-12-13 The Johns Hopkins University Papanicolaou test for ovarian and endometrial cancers

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021180791A1 (fr) * 2020-03-11 2021-09-16 F. Hoffmann-La Roche Ag Nouvelle structure matricielle d'acide nucléique pour séquençage
EP4259820A1 (fr) * 2020-12-11 2023-10-18 The Broad Institute Inc. Procédé de séquençage de duplex
CN115125624A (zh) * 2021-03-25 2022-09-30 南方医科大学 一组条码接头以及中通量多重单细胞代表性dna甲基化建库和测序方法
CN113999893B (zh) * 2021-11-09 2022-11-01 纳昂达(南京)生物科技有限公司 兼容双测序平台的建库元件、试剂盒及建库方法

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007018601A1 (fr) * 2005-08-02 2007-02-15 Rubicon Genomics, Inc. Compositions et methodes de traitement et d'amplification d'adn consistant a utiliser plusieurs enzymes dans une seule reaction
US7393665B2 (en) 2005-02-10 2008-07-01 Population Genetics Technologies Ltd Methods and compositions for tagging and identifying polynucleotides
US20120003657A1 (en) 2010-07-02 2012-01-05 Samuel Myllykangas Targeted sequencing library preparation by genomic dna circularization
US8481292B2 (en) 2010-09-21 2013-07-09 Population Genetics Technologies Litd. Increasing confidence of allele calls with molecular counting
US20140193860A1 (en) 2013-01-09 2014-07-10 The Penn State Research Foundation Low Sequence Bias Single-Stranded DNA Ligation

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB0422551D0 (en) * 2004-10-11 2004-11-10 Univ Liverpool Labelling and sequencing of nucleic acids
US7897344B2 (en) * 2007-11-06 2011-03-01 Complete Genomics, Inc. Methods and oligonucleotide designs for insertion of multiple adaptors into library constructs
CN102203273A (zh) * 2008-09-09 2011-09-28 生命技术公司 生成基因特异性的文库的方法
EP2753712B1 (fr) * 2011-09-06 2017-03-22 Gen-Probe Incorporated Structures fermées d'acide nucléique
US9725765B2 (en) * 2011-09-09 2017-08-08 The Board Of Trustees Of The Leland Stanford Junior University Methods for obtaining a sequence
WO2013188840A1 (fr) * 2012-06-14 2013-12-19 Fred Hutchinson Cancer Research Center Compositions et procédés de détection sensible de mutations dans des molécules d'acide nucléique
CA2889862C (fr) * 2012-11-05 2021-02-16 Rubicon Genomics, Inc. Marquage par code-barre d'acides nucleiques
CN113832216A (zh) * 2013-03-15 2021-12-24 莱尔·J·阿诺德 使用发卡寡核苷酸扩增核酸的方法
EP3495506B1 (fr) * 2013-12-11 2023-07-12 AccuraGen Holdings Limited Procédés pour détecter des variants de séquence rares
CN103668471B (zh) * 2013-12-19 2015-09-30 上海交通大学 一种构建dna高通量测序文库的方法及其配套试剂盒
WO2015188192A2 (fr) * 2014-06-06 2015-12-10 Cornell University Méthode d'identification et d'énumération de changements en matière de séquence d'acide nucléique, expression, copie ou méthylation d'adn en utilisant des réactions associant nucléase, ligase, polymérase et séquençage
US20180016631A1 (en) * 2014-12-24 2018-01-18 Keygene N.V. Backbone mediated mate pair sequencing
JP6966681B2 (ja) * 2015-04-24 2021-11-17 アティラ バイオシステムズ インコーポレイテッドAtila Biosystems Incorporated 限られたヌクレオチド組成を有するプライマーを用いた増幅

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7393665B2 (en) 2005-02-10 2008-07-01 Population Genetics Technologies Ltd Methods and compositions for tagging and identifying polynucleotides
US8168385B2 (en) 2005-02-10 2012-05-01 Population Genetics Technologies Ltd Methods and compositions for tagging and identifying polynucleotides
WO2007018601A1 (fr) * 2005-08-02 2007-02-15 Rubicon Genomics, Inc. Compositions et methodes de traitement et d'amplification d'adn consistant a utiliser plusieurs enzymes dans une seule reaction
US20120003657A1 (en) 2010-07-02 2012-01-05 Samuel Myllykangas Targeted sequencing library preparation by genomic dna circularization
US8481292B2 (en) 2010-09-21 2013-07-09 Population Genetics Technologies Litd. Increasing confidence of allele calls with molecular counting
US8685678B2 (en) 2010-09-21 2014-04-01 Population Genetics Technologies Ltd Increasing confidence of allele calls with molecular counting
US8722368B2 (en) 2010-09-21 2014-05-13 Population Genetics Technologies Ltd. Method for preparing a counter-tagged population of nucleic acid molecules
US20140193860A1 (en) 2013-01-09 2014-07-10 The Penn State Research Foundation Low Sequence Bias Single-Stranded DNA Ligation

Non-Patent Citations (14)

* Cited by examiner, † Cited by third party
Title
BEAUCAGE ET AL., TETRAHEDRON LETT., vol. 22, 1981, pages 1859 - 1862
BROWN ET AL., METH. ENZYMOL., vol. 68, 1979, pages 109 - 151
F. MERTENS ET AL.: "The emerging complexity of gene fusions in cancer", NAT. REV. CANCER, vol. 15, 2015, pages 371
F. MITELMAN ET AL.: "The impact of translocations and gene fusions on cancer causation", NAT. REV. CANCER, vol. 7, 2007, pages 233, XP002458856, DOI: doi:10.1038/nrc2091
GREEN M.; SAMBROOK, J.: "Molecular Cloning", 2012, CSHL PRESS
LA ET AL., HELV. CHIM. ACTA, vol. 82, 1999, pages 1640
MALYSHEV, D. ET AL.: "Efficient and sequence-independent replication of DNA containing a third base pair establishes a functional six-letter genetic alphabet", P.N.A.S., vol. 109, no. 30, 2012, pages 12005, XP055241913, DOI: doi:10.1073/pnas.1205176109
MARK T. GREGORY ET AL: "Targeted single molecule mutation detection with massively parallel sequencing", NUCLEIC ACIDS RESEARCH, 17 September 2015 (2015-09-17), GB, pages gkv915, XP055241840, ISSN: 0305-1048, DOI: 10.1093/nar/gkv915 *
MATTEUCCI ET AL., J. AM. CHEM. SOC., vol. 103, 1981, pages 3185 - 3191
NARANG ET AL., METH. ENZYMOL., vol. 68, 1979, pages 90 - 99
NEWMAN, A. ET AL.: "An ultrasensitive method for quantitating circulating tumor DNA with broad patient coverage", NATURE MEDICINE, 2014
NEWMAN, A. ET AL.: "Integrated digital error suppression for improved detection of circulating tumor DNA", NATURE BIOTECHNOLOGY, vol. 34, 2016, pages 547
PONTI, M. ET AL.: "Measurement of the sequence specificity of covalent DNA modification by antineoplastic agents using Taq DNA polymerase", NUCL. ACIDS RES., vol. 19, 1991, pages 2929
WARDLE, J. ET AL.: "Uracil recognition by replicative DNA polymerases is limited to the archaea, not occurring with bacteria and eukarya", NUCL. ACIDS RES., vol. 36, no. 3, 2008, pages 705 - 711

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11180803B2 (en) 2011-04-15 2021-11-23 The Johns Hopkins University Safe sequencing system
US11453913B2 (en) 2011-04-15 2022-09-27 The Johns Hopkins University Safe sequencing system
US11459611B2 (en) 2011-04-15 2022-10-04 The Johns Hopkins University Safe sequencing system
US11773440B2 (en) 2011-04-15 2023-10-03 The Johns Hopkins University Safe sequencing system
US12006544B2 (en) 2011-04-15 2024-06-11 The Johns Hopkins University Safe sequencing system
US11525163B2 (en) 2012-10-29 2022-12-13 The Johns Hopkins University Papanicolaou test for ovarian and endometrial cancers
US11286531B2 (en) 2015-08-11 2022-03-29 The Johns Hopkins University Assaying ovarian cyst fluid
WO2020150656A1 (fr) 2017-08-07 2020-07-23 The Johns Hopkins University Méthodes de diagnostic et de traitement du cancer
WO2021249825A1 (fr) * 2020-06-08 2021-12-16 F. Hoffmann-La Roche Ag Procédés et compositions de détection de réagencements structuraux dans un génome
WO2022133335A1 (fr) * 2020-12-18 2022-06-23 Grail, Inc. Préparation d'échantillons d'acide nucléique pour séquençage

Also Published As

Publication number Publication date
CN109844137B (zh) 2022-04-26
CN109844137A (zh) 2019-06-04
US20190241953A1 (en) 2019-08-08
JP6860662B2 (ja) 2021-04-21
EP3532635A1 (fr) 2019-09-04
JP2019532090A (ja) 2019-11-07
EP3532635B1 (fr) 2021-06-09

Similar Documents

Publication Publication Date Title
EP3532635B1 (fr) Construction de bibliothèque circulaire à code-barres pour l'identification de produits chimériques
US11155855B2 (en) Single stranded circular DNA libraries for circular consensus sequencing
US11898204B2 (en) Generation of single-stranded circular DNA templates for single molecule sequencing
US20200208208A1 (en) Circularization methods for single molecule sequencing sample preparation
US12110534B2 (en) Generation of single-stranded circular DNA templates for single molecule sequencing
WO2019086531A1 (fr) Séquençage consensus linéaire
US20200308576A1 (en) Novel method for generating circular single-stranded dna libraries
JP7152599B2 (ja) 塩基配列決定のためのモジュール式およびコンビナトリアル核酸試料調製のためのシステムおよび方法
US11976275B2 (en) Generation of double-stranded DNA templates for single molecule sequencing

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17787432

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2019522579

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2017787432

Country of ref document: EP

Effective date: 20190531